Korjus, Kristjan; Hebart, Martin N.; Vicente, Raul
2016-01-01
Supervised machine learning methods typically require splitting data into multiple chunks for training, validating, and finally testing classifiers. For finding the best parameters of a classifier, training and validation are usually carried out with cross-validation. This is followed by application of the classifier with optimized parameters to a separate test set for estimating the classifier’s generalization performance. With limited data, this separation of test data creates a difficult trade-off between having more statistical power in estimating generalization performance versus choosing better parameters and fitting a better model. We propose a novel approach that we term “Cross-validation and cross-testing” improving this trade-off by re-using test data without biasing classifier performance. The novel approach is validated using simulated data and electrophysiological recordings in humans and rodents. The results demonstrate that the approach has a higher probability of discovering significant results than the standard approach of cross-validation and testing, while maintaining the nominal alpha level. In contrast to nested cross-validation, which is maximally efficient in re-using data, the proposed approach additionally maintains the interpretability of individual parameters. Taken together, we suggest an addition to currently used machine learning approaches which may be particularly useful in cases where model weights do not require interpretation, but parameters do. PMID:27564393
Statistical validation of normal tissue complication probability models.
Xu, Cheng-Jian; van der Schaaf, Arjen; Van't Veld, Aart A; Langendijk, Johannes A; Schilstra, Cornelis
2012-09-01
To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. A penalized regression method, LASSO (least absolute shrinkage and selection operator), was used to build NTCP models for xerostomia after radiation therapy treatment of head-and-neck cancer. Model assessment was based on the likelihood function and the area under the receiver operating characteristic curve. Repeated double cross-validation showed the uncertainty and instability of the NTCP models and indicated that the statistical significance of model performance can be obtained by permutation testing. Repeated double cross-validation and permutation tests are recommended to validate NTCP models before clinical use. Copyright © 2012 Elsevier Inc. All rights reserved.
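The permutation-testing idea in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the study used LASSO with repeated double cross-validation, whereas the toy 1-D nearest-class-mean classifier, the leave-one-out score, the function names and the data below are all assumptions made for illustration.

```python
import random

def loo_accuracy(x, y):
    """Leave-one-out accuracy of a toy 1-D nearest-class-mean classifier."""
    correct = 0
    for i in range(len(x)):
        # Class means are computed without the held-out observation i.
        means = {}
        for label in set(y):
            vals = [x[j] for j in range(len(x)) if j != i and y[j] == label]
            if vals:
                means[label] = sum(vals) / len(vals)
        predicted = min(means, key=lambda label: abs(x[i] - means[label]))
        correct += predicted == y[i]
    return correct / len(x)

def permutation_p_value(x, y, score_fn, n_permutations=200, seed=0):
    """Compare the observed score with scores obtained on shuffled labels."""
    rng = random.Random(seed)
    observed = score_fn(x, y)
    at_least_as_good = 0
    shuffled = list(y)
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # breaks any real link between x and y
        if score_fn(x, shuffled) >= observed:
            at_least_as_good += 1
    # The +1 terms keep the estimated p-value strictly above zero.
    return observed, (at_least_as_good + 1) / (n_permutations + 1)

# Hypothetical, well-separated toy data (not from the study).
x = [0.1, 0.4, 0.2, 0.3, 0.5, 2.1, 2.4, 1.9, 2.2, 2.6]
y = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
observed, p_value = permutation_p_value(x, y, loo_accuracy)
print(observed, p_value)
```

The score function is passed in as an argument so that a cross-validated likelihood or AUC, as used in the abstract, could be substituted for the toy accuracy.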
Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan
2004-02-15
Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.
Piette, Elizabeth R; Moore, Jason H
2018-01-01
Machine learning methods and conventions are increasingly employed for the analysis of large, complex biomedical data sets, including genome-wide association studies (GWAS). Reproducibility of machine learning analyses of GWAS can be hampered by biological and statistical factors, particularly so for the investigation of non-additive genetic interactions. Application of traditional cross validation to a GWAS data set may result in poor consistency between the training and testing data set splits due to an imbalance of the interaction genotypes relative to the data as a whole. We propose a new cross validation method, proportional instance cross validation (PICV), that preserves the original distribution of an independent variable when splitting the data set into training and testing partitions. We apply PICV to simulated GWAS data with epistatic interactions of varying minor allele frequencies and prevalences and compare performance to that of a traditional cross validation procedure in which individuals are randomly allocated to training and testing partitions. Sensitivity and positive predictive value are significantly improved across all tested scenarios for PICV compared to traditional cross validation. We also apply PICV to GWAS data from a study of primary open-angle glaucoma to investigate a previously-reported interaction, which fails to significantly replicate; PICV however improves the consistency of testing and training results. Application of traditional machine learning procedures to biomedical data may require modifications to better suit intrinsic characteristics of the data, such as the potential for highly imbalanced genotype distributions in the case of epistasis detection. The reproducibility of genetic interaction findings can be improved by considering this variable imbalance in cross validation implementation, such as with PICV. This approach may be extended to problems in other domains in which imbalanced variable distributions are a concern.
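The core idea of preserving the distribution of one independent variable across partitions can be sketched as a stratified fold assignment. This is only an approximation of the idea under stated assumptions, not the published PICV implementation; `proportional_folds`, the genotype labels and the 10/90 imbalance are hypothetical.

```python
import random
from collections import defaultdict

def proportional_folds(values, n_folds=5, seed=0):
    """Assign sample indices to folds so each fold mirrors the overall
    distribution of `values` (for example, an interaction genotype)."""
    rng = random.Random(seed)
    by_value = defaultdict(list)
    for index, value in enumerate(values):
        by_value[value].append(index)

    folds = [[] for _ in range(n_folds)]
    offset = 0  # carried across strata so leftovers do not pile up in fold 0
    for value in sorted(by_value):
        indices = by_value[value]
        rng.shuffle(indices)
        # Deal the members of this stratum round-robin across the folds.
        for position, index in enumerate(indices):
            folds[(offset + position) % n_folds].append(index)
        offset += len(indices)
    return folds

# Hypothetical imbalanced variable: 10 carriers of a rare genotype among 100.
genotypes = ["AaBb"] * 10 + ["other"] * 90
folds = proportional_folds(genotypes, n_folds=5)
rare_per_fold = [sum(genotypes[i] == "AaBb" for i in fold) for fold in folds]
print(rare_per_fold)
```

With purely random allocation, some folds could receive no rare-genotype individuals at all, which is the inconsistency between training and testing splits that the abstract describes.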
Sisic, Nedim; Jelicic, Mario; Pehar, Miran; Spasic, Miodrag; Sekulic, Damir
2016-01-01
In basketball, anthropometric status is an important factor when identifying and selecting talents, while agility is one of the most vital motor performances. The aim of this investigation was to evaluate the influence of anthropometric variables and power capacities on different preplanned agility performances. The participants were 92 high-level, junior-age basketball players (16-17 years of age; 187.6±8.72 cm in body height, 78.40±12.26 kg in body mass), randomly divided into a validation and a cross-validation subsample. The predictor set consisted of 16 anthropometric variables and three tests of power capacities (Sargent-jump, broad-jump and medicine-ball-throw). The criteria were three tests of agility: a T-Shape-Test, a Zig-Zag-Test, and a test of running with a 180-degree turn (T180). Forward stepwise multiple regressions were calculated for the validation subsample and then cross-validated. Cross-validation included correlations between observed and predicted scores, dependent-samples t-tests between predicted and observed scores, and Bland-Altman plots. Analysis of variance showed that centres were advanced in most of the anthropometric indices and in the medicine-ball-throw (all at P<0.05), with no significant between-position differences for the other studied motor performances. Multiple regression models originally calculated for the validation subsample were then cross-validated, and confirmed for the Zig-Zag-Test (R of 0.71 and 0.72 for the validation and cross-validation subsample, respectively). Anthropometrics were not strongly related to agility performance, but leg length was found to be negatively associated with performance in basketball-specific agility. Power capacities were confirmed to be an important factor in agility. The results highlight the importance of sport-specific tests when studying pre-planned agility performance in basketball. An improvement in power capacities will probably result in an improvement in agility in basketball athletes, while anthropometric indices should be used to identify those athletes who can achieve superior agility performance.
Cross-Cultural Validation of TEMAS, a Minority Projective Test.
ERIC Educational Resources Information Center
Costantino, Giuseppe; And Others
The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…
Reiss, Philip T
2015-08-01
The "ten ironic rules for statistical reviewers" presented by Friston (2012) prompted a rebuttal by Lindquist et al. (2013), which was followed by a rejoinder by Friston (2013). A key issue left unresolved in this discussion is the use of cross-validation to test the significance of predictive analyses. This note discusses the role that cross-validation-based and related hypothesis tests have come to play in modern data analyses, in neuroimaging and other fields. It is shown that such tests need not be suboptimal and can fill otherwise-unmet inferential needs. Copyright © 2015 Elsevier Inc. All rights reserved.
Pernambuco, Leandro; Espelt, Albert; Magalhães, Hipólito Virgílio; Lima, Kenio Costa de
2017-06-08
To present a guide with recommendations for the translation, adaptation, elaboration and validation of tests in Speech and Language Pathology. The recommendations were based on international guidelines with a focus on the elaboration, translation, cross-cultural adaptation and validation of tests. The recommendations were grouped into two charts, one with procedures for translation and transcultural adaptation and the other for obtaining evidence of validity, reliability and measures of accuracy of the tests. A guide with norms for the organization and systematization of the elaboration, translation, cross-cultural adaptation and validation of tests in Speech and Language Pathology was created.
Lam, Simon C
2014-05-01
To perform detailed psychometric testing of the compliance with standard precautions scale (CSPS) in measuring compliance with standard precautions of clinical nurses and to conduct cross-cultural pilot testing and assess the relevance of the CSPS on an international platform. A cross-sectional and correlational design with repeated measures. Nursing students from a local registered nurse training university, nurses from different hospitals in Hong Kong, and experts in an international conference. The psychometric properties of the CSPS were evaluated via internal consistency, 2-week and 3-month test-retest reliability, concurrent validation, and construct validation. The cross-cultural pilot testing and relevance check was examined by experts on infection control from various developed and developing regions. Among 453 participants, 193 were nursing students, 165 were enrolled nurses, and 95 were registered nurses. The results showed that the CSPS had satisfactory reliability (Cronbach α = 0.73; intraclass correlation coefficient, 0.79 for 2-week test-retest and 0.74 for 3-month test-retest) and validity (optimum correlation with criterion measure; r = 0.76, P < .001; satisfactory results on known-group method and hypothesis testing). A total of 19 experts from 16 countries assured that most of the CSPS findings were relevant and globally applicable. The CSPS demonstrated satisfactory results on the basis of the standard international criteria on psychometric testing, which ascertained the reliability and validity of this instrument in measuring the compliance of clinical nurses with standard precautions. The cross-cultural pilot testing further reinforced the instrument's relevance and applicability in most developed and developing regions.
ERIC Educational Resources Information Center
Zeidner, Moshe
1987-01-01
This study examined the cross-cultural validity of the sex bias contention with respect to standardized aptitude testing, used for academic prediction purposes in Israel. Analyses were based on the grade point average and scores of 1778 Jewish and 1017 Arab students who were administered standardized college entrance test batteries. (Author/LMO)
Cross-Validation of easyCBM Reading Cut Scores in Washington: 2009-2010. Technical Report #1109
ERIC Educational Resources Information Center
Irvin, P. Shawn; Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2011-01-01
This technical report presents results from a cross-validation study designed to identify optimal cut scores when using easyCBM[R] reading tests in Washington state. The cross-validation study analyzes data from the 2009-2010 academic year for easyCBM[R] reading measures. A sample of approximately 900 students per grade, randomly split into two…
Sattler, Tine; Sekulic, Damir; Spasic, Miodrag; Osmankac, Nedzad; Vicente João, Paulo; Dervisevic, Edvin; Hadzic, Vedran
2016-01-01
Previous investigations noted the potential importance of isokinetic strength in rapid muscular performances, such as jumping. This study aimed to identify the influence of isokinetic knee strength on specific jumping performance in volleyball. The secondary aim of the study was to evaluate the reliability and validity of two volleyball-specific jumping tests. The sample comprised 67 female (21.96±3.79 years; 68.26±8.52 kg; 174.43±6.85 cm) and 99 male (23.62±5.27 years; 84.83±10.37 kg; 189.01±7.21 cm) high-level volleyball players who competed in the 1st and 2nd National Division. Subjects were randomly divided into validation (N.=55 and 33 for males and females, respectively) and cross-validation subsamples (N.=54 and 34 for males and females, respectively). The set of predictors included isokinetic tests to evaluate the eccentric and concentric strength capacities of the knee extensors and flexors for the dominant and non-dominant leg. The main outcome measure for the isokinetic testing was peak torque (PT), which was later normalized for body mass and expressed as PT/kg. Block-jump and spike-jump performances were measured over three trials and used as criteria. Forward stepwise multiple regressions were calculated for the validation subsamples and then cross-validated. Cross-validation included correlations and t-test differences between observed and predicted scores, and Bland-Altman plots. Jumping tests were found to be reliable (spike-jump: ICC of 0.79 and 0.86; block-jump: ICC of 0.86 and 0.90; for males and females, respectively), and their validity was confirmed by significant t-test differences between 1st and 2nd division players. Isokinetic variables were found to be significant predictors of jumping performance in females, but not among males. In females, the isokinetic knee measures were shown to be stronger and more valid predictors of the block-jump (42% and 64% of the explained variance for the validation and cross-validation subsample, respectively) than of the spike-jump (39% and 34% of the explained variance for the validation and cross-validation subsample, respectively). Differences between prediction models calculated for males and females are mostly explained by gender-specific biomechanics of jumping. The study defined the importance of isokinetic knee strength in volleyball jumping performance in female athletes. Further studies should evaluate the association between isokinetic ankle strength and volleyball-specific jumping performances. The results reinforce the need for cross-validation of prediction models in sport and exercise sciences.
Cross-Validation of easyCBM Reading Cut Scores in Oregon: 2009-2010. Technical Report #1108
ERIC Educational Resources Information Center
Park, Bitnara Jasmine; Irvin, P. Shawn; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2011-01-01
This technical report presents results from a cross-validation study designed to identify optimal cut scores when using easyCBM[R] reading tests in Oregon. The cross-validation study analyzes data from the 2009-2010 academic year for easyCBM[R] reading measures. A sample of approximately 2,000 students per grade, randomly split into two groups of…
How to test validity in orthodontic research: a mixed dentition analysis example.
Donatelli, Richard E; Lee, Shin-Jae
2015-02-01
The data used to test the validity of a prediction method should be different from the data used to generate the prediction model. In this study, we explored whether an independent data set is mandatory for testing the validity of a new prediction method and how validity can be tested without independent new data. Several validation methods were compared in an example using the data from a mixed dentition analysis with a regression model. The validation errors of real mixed dentition analysis data and simulation data were analyzed for increasingly large data sets. The validation results of both the real and the simulation studies demonstrated that the leave-1-out cross-validation method had the smallest errors. The largest errors occurred in the traditional simple validation method. The differences between the validation methods diminished as the sample size increased. The leave-1-out cross-validation method seems to be an optimal validation method for improving the prediction accuracy in a data set with limited sample sizes. Copyright © 2015 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
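Leave-1-out cross-validation for a regression model, as compared in this abstract, can be sketched as follows. This is a minimal illustration assuming a one-predictor least-squares line; the function names and the data are hypothetical and are not the mixed dentition measurements.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def loo_mean_abs_error(xs, ys):
    """Refit the model n times, each time predicting the one left-out case."""
    errors = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        errors.append(abs(ys[i] - (slope * xs[i] + intercept)))
    return sum(errors) / len(errors)

# Hypothetical data: an exact line, then the same line with small noise.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
exact = [3.0, 5.0, 7.0, 9.0, 11.0]
noisy = [3.2, 4.9, 7.3, 8.8, 11.1]
exact_error = loo_mean_abs_error(xs, exact)
noisy_error = loo_mean_abs_error(xs, noisy)
print(exact_error, noisy_error)
```

Every observation is used for testing exactly once and for fitting in all other rounds, so no data are set aside permanently, which matters most at the small sample sizes discussed in the abstract.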
Lee, Shin-Young; Lee, Eunice E
2015-02-01
The purpose of this study was to report the instrument modification and validation processes to make existing health belief model scales culturally appropriate for Korean Americans (KAs) regarding colorectal cancer (CRC) screening utilization. Instrument translation, individual interviews using cognitive interviewing, and expert reviews were conducted during the instrument modification phase, and a pilot test and a cross-sectional survey were conducted during the instrument validation phase. Data analyses of the cross-sectional survey included internal consistency and construct validity using exploratory and confirmatory factor analysis. The main issues identified during the instrument modification phase were (a) cultural and linguistic translation issues and (b) newly developed items reflecting Korean cultural barriers. Cross-sectional survey analyses during the instrument validation phase revealed that all scales demonstrate good internal consistency reliability (Cronbach's alpha=.72~.88). Exploratory factor analysis showed that susceptibility and severity loaded on the same factor, which may indicate a threat variable. Items with low factor loadings in the confirmatory factor analysis may relate to (a) lack of knowledge about fecal occult blood testing and (b) multiple dimensions of the subscales. Methodological, sequential processes of instrument modification and validation, including translation, individual interviews, expert reviews, pilot testing and a cross-sectional survey, were provided in this study. The findings indicate that existing instruments need to be examined for CRC screening research involving KAs.
Shapira Galitz, Yael; Halperin, Doron; Bavnik, Yosef; Warman, Meir
2016-05-01
To perform the translation, cross-cultural adaptation, and validation of the Sino-Nasal Outcome Test-22 (SNOT-22) questionnaire to the Hebrew language. A single-center prospective cross-sectional study. Seventy-three chronic rhinosinusitis (CRS) patients and 73 patients without sinonasal disease filled the Hebrew version of the SNOT-22 questionnaire. Fifty-one CRS patients underwent endoscopic sinus surgery, out of which 28 filled a postoperative questionnaire. Seventy-three healthy volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reproducibility, validity, and responsiveness of the questionnaire were evaluated. Questionnaire reliability was excellent, with a high internal consistency (Cronbach's alpha coefficient, 0.91-0.936) and test-retest reproducibility (Spearman's coefficient, 0.962). Mean scores for the preoperative, postoperative, and control groups were 50.44, 29.64, and 13.15, respectively (P < .0001 for CRS vs controls, P < .001 for preoperative vs postoperative), showing validity and responsiveness of the questionnaire. The Hebrew version of SNOT-22 questionnaire is a valid outcome measure for patients with CRS with or without nasal polyps. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Reliable Digit Span: A Systematic Review and Cross-Validation Study
ERIC Educational Resources Information Center
Schroeder, Ryan W.; Twumasi-Ankrah, Philip; Baade, Lyle E.; Marshall, Paul S.
2012-01-01
Reliable Digit Span (RDS) is a heavily researched symptom validity test with a recent literature review yielding more than 20 studies ranging in dates from 1994 to 2011. Unfortunately, limitations within some of the research minimize clinical generalizability. This systematic review and cross-validation study was conducted to address these…
Validity Tests of the Adolescent Domain Screening Inventory (ADSI) with Older Adolescents
ERIC Educational Resources Information Center
Corrigan, Matthew J.; Forte, James; Bulgaris, Sarah
2017-01-01
The purpose of this replication study is to test the validity of the Adolescent Domain Screening Inventory (ADSI) on an older adolescent population. This cross sectional study used a convenience sample to preliminarily test the validity of the ADSI. Concurrent validity correlations ranged from a high of 0.924 to a low of 0.760. The known…
Bullinger, Monika; Quitmann, Julia; Silva, Neuza; Rohenkohl, Anja; Chaplin, John E; DeBusk, Kendra; Mimoun, Emmanuelle; Feigerlova, Eva; Herdman, Michael; Sanz, Dolores; Wollmann, Hartmut; Pleil, Andreas; Power, Michael
2014-01-01
Testing cross-cultural equivalence of patient-reported outcomes requires sufficiently large samples per country, which is difficult to achieve in rare endocrine paediatric conditions. We describe a novel approach to cross-cultural testing of the Quality of Life in Short Stature Youth (QoLISSY) questionnaire in five countries by sequentially taking one country out (TOCO) from the total sample and iteratively comparing the resulting psychometric performance. Development of the QoLISSY proceeded from focus group discussions through pilot testing to field testing in 268 short-statured patients and their parents. To explore cross-cultural equivalence, the iterative TOCO technique was used to examine and compare the validity, reliability, and convergence of patient and parent responses on QoLISSY in the field test dataset, and to predict QoLISSY scores from clinical, socio-demographic and psychosocial variables. Validity and reliability indicators were satisfactory for each sample after iteratively omitting one country. Comparisons with the total sample revealed cross-cultural equivalence in internal consistency and construct validity for patients and parents, high inter-rater agreement and a substantial proportion of QoLISSY variance explained by predictors. The TOCO technique is a powerful method to overcome problems of country-specific testing of patient-reported outcome instruments. It provides an empirical support to QoLISSY's cross-cultural equivalence and is recommended for future research.
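The iterative take-one-country-out loop can be sketched as below, using Cronbach's alpha as one example of a reliability indicator. This is an illustration under assumptions, not the QoLISSY analysis: the country labels, the three-item responses and the function names are hypothetical, and the study examined several other indicators as well.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Internal consistency; rows is a list of respondents' item scores."""
    n_items = len(rows[0])
    item_variances = [variance([row[i] for row in rows]) for i in range(n_items)]
    total_variance = variance([sum(row) for row in rows])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)

def take_one_country_out(data):
    """For each country, pool all OTHER countries and recompute alpha."""
    results = {}
    for left_out in data:
        pooled = [row
                  for country, rows in data.items() if country != left_out
                  for row in rows]
        results[left_out] = cronbach_alpha(pooled)
    return results

# Hypothetical responses: country -> respondents -> three item scores.
responses = {
    "A": [[1, 2, 1], [3, 3, 4], [5, 4, 5]],
    "B": [[2, 2, 3], [4, 5, 4], [1, 1, 2]],
    "C": [[3, 2, 3], [5, 5, 4], [2, 1, 1]],
}
alphas = take_one_country_out(responses)
full_sample_alpha = cronbach_alpha([r for rows in responses.values() for r in rows])
for country, alpha in alphas.items():
    print(f"without {country}: alpha = {alpha:.3f} (all: {full_sample_alpha:.3f})")
```

If an indicator changes sharply when one particular country is omitted, that country is a candidate source of non-equivalence; similar values across all omissions are consistent with cross-cultural equivalence.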
Cross-cultural adaptation and validation of Persian Achilles tendon Total Rupture Score.
Ansari, Noureddin Nakhostin; Naghdi, Soofia; Hasanvand, Sahar; Fakhari, Zahra; Kordi, Ramin; Nilsson-Helander, Katarina
2016-04-01
To cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) to the Persian language and to preliminarily evaluate the reliability and validity of a Persian ATRS. A cross-sectional and prospective cohort study was conducted to translate and cross-culturally adapt the ATRS to the Persian language (ATRS-Persian) following steps described in guidelines. Thirty patients with total Achilles tendon rupture and 30 healthy subjects participated in this study. Psychometric properties of floor/ceiling effects (responsiveness), internal consistency reliability, test-retest reliability, standard error of measurement (SEM), smallest detectable change (SDC), construct validity, and discriminant validity were tested. Factor analysis was performed to determine the ATRS-Persian structure. There were no floor or ceiling effects, supporting the content and responsiveness of the ATRS-Persian. Internal consistency was high (Cronbach's α 0.95). Item-total correlations exceeded the acceptable standard of 0.3 for all items (0.58-0.95). The test-retest reliability was excellent [(ICC)agreement 0.98]. SEM and SDC were 3.57 and 9.9, respectively. Construct validity was supported by a significant correlation between the ATRS-Persian total score and the Persian Foot and Ankle Outcome Score (PFAOS) total score and PFAOS subscales (r = 0.55-0.83). The ATRS-Persian significantly discriminated between patients and healthy subjects. Explanatory factor analysis revealed 1 component. The ATRS was cross-culturally adapted to Persian and demonstrated to be a reliable and valid instrument to measure functional outcomes in Persian patients with Achilles tendon rupture. Level of evidence: II.
Correcting Evaluation Bias of Relational Classifiers with Network Cross Validation
2010-01-01
classification algorithms: simple random resampling (RRS), equal-instance random resampling (ERS), and network cross-validation (NCV). The first two... NCV procedure that eliminates overlap between test sets altogether. The procedure samples for k disjoint test sets that will be used for evaluation...propLabeled ∗ S) nodes from trainPool; inferenceSet = network − trainSet; F = F ∪ <trainSet, testSet, inferenceSet>; end for; output: F. NCV addresses
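The visible fragment of the NCV pseudocode can be turned into a rough, runnable sketch. Much of the original procedure is elided in this excerpt, so several parts are assumptions and are marked as such: the training pool is taken to be every node outside the current test set, the training-set size is taken to be `prop_labeled` times the number of nodes, and the name `network_cv_folds` is hypothetical.

```python
import random

def network_cv_folds(nodes, k, prop_labeled, seed=0):
    """Return k (train_set, test_set, inference_set) triples with
    pairwise-disjoint test sets."""
    rng = random.Random(seed)
    shuffled = list(nodes)
    rng.shuffle(shuffled)

    # k disjoint test sets: every node lands in exactly one of them.
    test_sets = [shuffled[i::k] for i in range(k)]

    # ASSUMPTION: training size is prop_labeled * number of nodes; the
    # excerpt elides what "S" denotes in the original pseudocode.
    n_train = round(prop_labeled * len(shuffled))

    folds = []
    for test_set in test_sets:
        held_out = set(test_set)
        # ASSUMPTION: the training pool is everything outside this test set.
        train_pool = [node for node in shuffled if node not in held_out]
        train_set = rng.sample(train_pool, min(n_train, len(train_pool)))
        labeled = set(train_set)
        # inferenceSet = network - trainSet, as in the visible fragment.
        inference_set = [node for node in shuffled if node not in labeled]
        folds.append((train_set, test_set, inference_set))
    return folds

folds = network_cv_folds(range(20), k=4, prop_labeled=0.3)
for train_set, test_set, inference_set in folds:
    print(len(train_set), len(test_set), len(inference_set))
```

Because the test sets never overlap, each node is evaluated once across the k folds, which is the property the excerpt highlights when it says NCV eliminates overlap between test sets.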
Cross-Cultural Validation of the Five-Factor Structure of Social Goals: A Filipino Investigation
ERIC Educational Resources Information Center
King, Ronnel B.; Watkins, David A.
2012-01-01
The aim of the present study was to test the cross-cultural validity of the five-factor structure of social goals that Dowson and McInerney proposed. Using both between-network and within-network approaches to construct validation, 1,147 Filipino high school students participated in the study. Confirmatory factor analysis indicated that the…
ERIC Educational Resources Information Center
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2011-01-01
In this technical report, we document the results of a cross-validation study designed to identify optimal cut-scores for the use of the easyCBM[R] mathematics test in the state of Washington. A large sample, randomly split into two groups of roughly equal size, was used for this study. Students' performance classification on the Washington state…
A Cross-Validation of easyCBM[R] Mathematics Cut Scores in Oregon: 2009-2010. Technical Report #1104
ERIC Educational Resources Information Center
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2011-01-01
In this technical report, we document the results of a cross-validation study designed to identify optimal cut-scores for the use of the easyCBM[R] mathematics test in Oregon. A large sample, randomly split into two groups of roughly equal size, was used for this study. Students' performance classification on the Oregon state test was used as the…
Mills, Tamara L; Holm, Margo B; Schmeler, Mark
2007-01-01
The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p < .001, and the content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p < .001, and captured 98.5% of consumers' seating-mobility goals. The cross-validation study identified new categories of seating-mobility goals for inclusion in FEW Version 2.0, and the content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Genome-based prediction of test cross performance in two subsequent breeding cycles.
Hofheinz, Nina; Borchardt, Dietrich; Weissleder, Knuth; Frisch, Matthias
2012-12-01
Genome-based prediction of genetic values is expected to overcome shortcomings that limit the application of QTL mapping and marker-assisted selection in plant breeding. Our goal was to study the genome-based prediction of test cross performance with genetic effects that were estimated using genotypes from the preceding breeding cycle. In particular, our objectives were to employ a ridge regression approach that approximates best linear unbiased prediction of genetic effects, compare cross validation with validation using genetic material of the subsequent breeding cycle, and investigate the prospects of genome-based prediction in sugar beet breeding. We focused on the traits sugar content and standard molasses loss (ML) and used a set of 310 sugar beet lines to estimate genetic effects at 384 SNP markers. In cross validation, correlations >0.8 between observed and predicted test cross performance were observed for both traits. However, in validation with 56 lines from the next breeding cycle, a correlation of 0.8 could only be observed for sugar content; for standard ML, the correlation dropped to 0.4. We found that ridge regression based on preliminary estimates of the heritability provided a very good approximation of best linear unbiased prediction and was not accompanied by a loss in prediction accuracy. We conclude that prediction accuracy assessed with cross validation within one cycle of a breeding program cannot be used as an indicator for the accuracy of predicting lines of the next cycle. Prediction of lines of the next cycle seems promising for traits with high heritabilities.
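For reference, the ridge regression estimator mentioned in this abstract is usually written as below. This is a hedged sketch rather than the authors' derivation: the symbols X (marker matrix), y (observed test cross performance), λ (penalty), m (number of markers) and h² (heritability) are notation assumed here, and the link between λ and h² holds only under the common simplifying assumption that all m markers are standardized and contribute equally to the genetic variance.

```latex
\hat{\beta} = \left(X^{\top}X + \lambda I\right)^{-1} X^{\top} y,
\qquad
\lambda = \frac{\sigma_e^{2}}{\sigma_\beta^{2}}
\approx m\,\frac{1-h^{2}}{h^{2}}
% Hypothetical illustration: m = 384 markers (from the abstract) and an
% assumed heritability h^2 = 0.8 give \lambda \approx 384 \times 0.2 / 0.8 = 96.
```

Under that assumption, a higher heritability leads to a smaller penalty and therefore less shrinkage of the marker effects.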
Zambelli, Roberto; Pinto, Rafael Z; Magalhães, João Murilo Brandão; Lopes, Fernando Araujo Silva; Castilho, Rodrigo Simões; Baumfeld, Daniel; Dos Santos, Thiago Ribeiro Teles; Maffulli, Nicola
2016-01-01
There is a need for a patient-relevant instrument to evaluate outcome after treatment in patients with a total Achilles tendon rupture. The purpose of this study was to undertake a cross-cultural adaptation of the Achilles Tendon Total Rupture Score (ATRS) into Brazilian Portuguese, determining the test-retest reliability and construct validity of the instrument. A five-step approach was used in the cross-cultural adaptation process: initial translation (two bilingual Brazilian translators), synthesis of translation, back-translation (two native English language translators), consensus version and evaluation (expert committee), and testing phase. A total of 46 patients were recruited to evaluate the test-retest reproducibility and construct validity of the Brazilian Portuguese version of the ATRS. Test-retest reproducibility was evaluated by assessing each participant on two separate occasions. The construct validity was determined by the correlation index between the ATRS and the American Orthopaedic Foot and Ankle Society (AOFAS) questionnaires. The final version of the Brazilian Portuguese ATRS had the same number of questions as the original ATRS. For the reliability analysis, an ICC(2,1) of 0.93 (95 % CI: 0.88 to 0.96) with SEM of 1.56 points and MDC of 4.32 was observed, indicating excellent reliability. The construct validity showed excellent correlation with R = 0.76 (95 % CI: 0.52 to 0.89, P < 0.001). The ATRS was successfully cross-culturally validated into Brazilian Portuguese. This version was a reliable and valid measure of function in patients who suffered complete rupture of the Achilles tendon.
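The SEM and MDC reported in this abstract are consistent with the conventional relation MDC95 = 1.96 × √2 × SEM. The abstract does not state which formula the authors used, so the sketch below is an assumption that happens to reproduce the reported value; the function names are hypothetical, and the SEM-from-ICC helper uses the standard relation SEM = SD × √(1 − ICC) with a standard deviation that the abstract does not report.

```python
import math


def mdc95(sem):
    """Minimal detectable change at the 95% level, from the standard error of measurement."""
    return 1.96 * math.sqrt(2) * sem


def sem_from_icc(sd, icc):
    """Standard error of measurement from a sample SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - icc)


# Reported SEM of 1.56 points gives the reported MDC of 4.32 points.
print(round(mdc95(1.56), 2))  # 4.32
```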
Progress toward the determination of correct classification rates in fire debris analysis.
Waddell, Erin E; Song, Emma T; Rinke, Caitlin N; Williams, Mary R; Sigman, Michael E
2013-07-01
Principal components analysis (PCA), linear discriminant analysis (LDA), and quadratic discriminant analysis (QDA) were used to develop a multistep classification procedure for determining the presence of ignitable liquid residue in fire debris and assigning any ignitable liquid residue present into the classes defined under the American Society for Testing and Materials (ASTM) E 1618-10 standard method. A multistep classification procedure was tested by cross-validation based on model data sets comprised of the time-averaged mass spectra (also referred to as total ion spectra) of commercial ignitable liquids and pyrolysis products from common building materials and household furnishings (referred to simply as substrates). Fire debris samples from laboratory-scale and field test burns were also used to test the model. The optimal model's true-positive rate was 81.3% for cross-validation samples and 70.9% for fire debris samples. The false-positive rate was 9.9% for cross-validation samples and 8.9% for fire debris samples. © 2013 American Academy of Forensic Sciences.
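The true-positive and false-positive rates quoted in this abstract are simple ratios over the confusion matrix. A minimal sketch follows, assuming binary labels where 1 means ignitable liquid residue is present; the function name and the toy labels are hypothetical and are not data from the study.

```python
def classification_rates(y_true, y_pred):
    """Return (true_positive_rate, false_positive_rate) for binary 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one positive and one negative sample")
    return tp / (tp + fn), fp / (fp + tn)


# Toy example: 4 positive samples (3 detected), 6 negative samples (1 false alarm).
tpr, fpr = classification_rates([1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
                                [1, 1, 1, 0, 1, 0, 0, 0, 0, 0])
```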
An empirical assessment of validation practices for molecular classifiers
Castaldi, Peter J.; Dahabreh, Issa J.
2011-01-01
Proposed molecular classifiers may be overfit to idiosyncrasies of noisy genomic and proteomic data. Cross-validation methods are often used to obtain estimates of classification accuracy, but both simulations and case studies suggest that, when inappropriate methods are used, bias may ensue. Bias can be bypassed and generalizability can be tested by external (independent) validation. We evaluated 35 studies that have reported on external validation of a molecular classifier. We extracted information on study design and methodological features, and compared the performance of molecular classifiers in internal cross-validation versus external validation for 28 studies where both had been performed. We demonstrate that the majority of studies pursued cross-validation practices that are likely to overestimate classifier performance. Most studies were markedly underpowered to detect a 20% decrease in sensitivity or specificity between internal cross-validation and external validation [median power was 36% (IQR, 21–61%) and 29% (IQR, 15–65%), respectively]. The median reported classification performance for sensitivity and specificity was 94% and 98%, respectively, in cross-validation and 88% and 81% for independent validation. The relative diagnostic odds ratio was 3.26 (95% CI 2.04–5.21) for cross-validation versus independent validation. Finally, we reviewed all studies (n = 758) which cited those in our study sample, and identified only one instance of additional subsequent independent validation of these classifiers. In conclusion, these results document that many cross-validation practices employed in the literature are potentially biased and genuine progress in this field will require adoption of routine external validation of molecular classifiers, preferably in much larger studies than in current practice. PMID:21300697
Cross-Validation of the Computerized Adaptive Screening Test (CAST).
ERIC Educational Resources Information Center
Pliske, Rebecca M.; And Others
The Computerized Adaptive Screening Test (CAST) was developed to provide an estimate at recruiting stations of prospects' Armed Forces Qualification Test (AFQT) scores. The CAST was designed to replace the paper-and-pencil Enlistment Screening Test (EST). The initial validation study of CAST indicated that CAST predicts AFQT at least as accurately…
Tilov, Boris; Dimitrova, Donka; Stoykova, Maria; Tornjova, Bianka; Foreva, Gergana; Stoyanov, Drozdstoj
2012-12-01
Health-care professions have long been considered prone to work-related stress, yet recent research in Bulgaria indicates alarmingly high levels of burnout. Cloninger's inventory is used to analyse and evaluate correlation between personality characteristics and degree of burnout syndrome manifestation among the risk categories of health-care professionals. The primary goal of this study was to test the conceptual validity and cross-cultural applicability of the revised TCI (TCI-R), developed in the United States, in a culturally, socially and economically diverse setting. Linguistic validation, test-retest studies, statistical and expert analyses were performed to assess cross-cultural applicability of the revised Cloninger's temperament and character inventory in Bulgarian, its reliability and internal consistency and construct validity. The overall internal consistency of TCI-R and its scales as well as the interscale and test-retest correlations prove that the translated version of the questionnaire is acceptable and cross-culturally applicable for the purposes of studying organizational stress and burnout risk in health-care professionals. In general the cross-cultural adaptation process, even if carried out in a rigorous way, does not always lead to the best target version and suggests it would be useful to develop new scales specific to each culture and, at the same time, to think about the trans-cultural adaptation. © 2012 Blackwell Publishing Ltd.
Cross-Validation of Survival Bump Hunting by Recursive Peeling Methods.
Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael; Rao, J Sunil
2014-08-01
We introduce a survival/risk bump hunting framework to build a bump hunting model with a possibly censored time-to-event type of response and to validate model estimates. First, we describe the use of adequate survival peeling criteria to build a survival/risk bump hunting model based on recursive peeling methods. Our method called "Patient Recursive Survival Peeling" is a rule-induction method that makes use of specific peeling criteria such as hazard ratio or log-rank statistics. Second, to validate our model estimates and improve survival prediction accuracy, we describe a resampling-based validation technique specifically designed for the joint task of decision rule making by recursive peeling (i.e. decision-box) and survival estimation. This alternative technique, called "combined" cross-validation is done by combining test samples over the cross-validation loops, a design allowing for bump hunting by recursive peeling in a survival setting. We provide empirical results showing the importance of cross-validation and replication.
Cross-Validation of Survival Bump Hunting by Recursive Peeling Methods
Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael; Rao, J. Sunil
2015-01-01
We introduce a survival/risk bump hunting framework to build a bump hunting model with a possibly censored time-to-event type of response and to validate model estimates. First, we describe the use of adequate survival peeling criteria to build a survival/risk bump hunting model based on recursive peeling methods. Our method called “Patient Recursive Survival Peeling” is a rule-induction method that makes use of specific peeling criteria such as hazard ratio or log-rank statistics. Second, to validate our model estimates and improve survival prediction accuracy, we describe a resampling-based validation technique specifically designed for the joint task of decision rule making by recursive peeling (i.e. decision-box) and survival estimation. This alternative technique, called “combined” cross-validation is done by combining test samples over the cross-validation loops, a design allowing for bump hunting by recursive peeling in a survival setting. We provide empirical results showing the importance of cross-validation and replication. PMID:26997922
Kumar, Y Kiran; Mehta, Shashi Bhushan; Ramachandra, Manjunath
2017-01-01
The purpose of this work is to provide validation methods for evaluating the hemodynamic assessment of Cerebral Arteriovenous Malformation (CAVM). This article emphasizes the importance of validating noninvasive measurements for CAVM patients, which are designed using lumped models for complex vessel structure. The validation of the hemodynamic assessment is based on invasive clinical measurements and on cross-validation with the Philips proprietary validated software packages Qflow and 2D Perfusion. The modeling results are validated for 30 CAVM patients at 150 vessel locations. Mean flow, diameter, and pressure were compared between the modeling results and the clinical/cross-validation measurements using an independent two-tailed Student t test. Exponential regression analysis was used to assess the relationship between blood flow, vessel diameter, and pressure. Univariate analysis, performed with linear or exponential regression, was used to assess the relationship between vessel diameter, vessel cross-sectional area, AVM volume, AVM pressure, and AVM flow. Modeling results were compared with clinical measurements from vessel locations in cerebral regions. The model was also cross-validated with the Philips proprietary validated software packages Qflow and 2D Perfusion. Our results show that the modeling results and the clinical results nearly match, with a small deviation. In this article, we have validated our modeling results against clinical measurements. A new approach to cross-validation is proposed by demonstrating the accuracy of our results against a validated product in a clinical environment.
Cross-Validation of a Short Form of the Marlowe-Crowne Social Desirability Scale.
ERIC Educational Resources Information Center
Zook, Avery, II; Sipps, Gary J.
1985-01-01
Presents a cross-validation of Reynolds' short form of the Marlowe-Crowne Social Desirability Scale (N=233). Researchers administered 13 items as a separate entity, calculated Cronbach's Alpha for each sex, and computed test-retest correlation for one group. Concluded that the short form is a viable alternative. (Author/NRB)
ERIC Educational Resources Information Center
Gold, Bernadette; Holodynski, Manfred
2015-01-01
The current study describes the development and construct validation of a situational judgment test for assessing the strategic knowledge of classroom management in elementary schools. Classroom scenarios and accompanying courses of action were constructed, of which 17 experts confirmed the content validity. A pilot study and a cross-validation…
Reliability and validity of functional performance tests in dancers with hip dysfunction.
Kivlan, Benjamin R; Carcia, Christopher R; Clemente, F Richard; Phelps, Amy L; Martin, Robroy L
2013-08-01
Quasi-experimental, repeated measures. Functional performance tests that identify hip joint impairments and assess the effect of intervention have not been adequately described for dancers. The purpose of this study was to examine the reliability and validity of hop and balance tests among a group of dancers with musculoskeletal pain in the hip region. Nineteen female dancers (age: 18.90±1.11 years; height: 164.85±6.95 cm; weight: 60.37±8.29 kg) with unilateral hip pain were assessed utilizing the cross-over reach, medial triple hop, lateral triple hop, and cross-over hop tests on two occasions, 2 days apart. Test-retest reliability and comparisons between the involved and uninvolved side for each respective test were determined. Intra-class correlation coefficients for the functional performance tests ranged from 0.89-0.96. The cross-over reach test had a SEM of 2.79 cm and a MDC of 7.73 cm. The medial and lateral triple hop tests had SEM values of 7.51 cm and 8.17 cm, and MDC values of 20.81 cm and 22.62 cm, respectively. The SEM was 0.15 seconds and the MDC was 0.42 seconds for the cross-over hop test. Performance on the medial triple hop test was significantly less on the involved side (370.21±38.26 cm) compared to the uninvolved side (388.05±41.49 cm); t(18) = -4.33, p<0.01. The side-to-side comparisons of the cross-over reach test (involved mean=61.68±10.9 cm; uninvolved mean=61.69±8.63 cm); t(18) = -0.004, p=0.99, lateral triple hop test (involved mean=306.92±35.79 cm; uninvolved mean=310.68±24.49 cm); t(18) = -0.55, p=0.59, and cross-over hop test (involved mean=2.49±0.34 seconds; uninvolved mean= 2.61±0.42 seconds; t(18) = -1.84, p=0.08) were not statistically different between sides. The functional performance tests used in this study can be reliably performed on dancers with unilateral hip pain. The medial triple hop test was the only functional performance test with evidence of validity in side-to-side comparisons.
These results suggest that the medial triple hop test may be a reliable and valid functional performance test to assess impairments related to hip pain among dancers. 3b. Non-consecutive cohort study.
RELIABILITY AND VALIDITY OF FUNCTIONAL PERFORMANCE TESTS IN DANCERS WITH HIP DYSFUNCTION
Carcia, Christopher R.; Clemente, F. Richard; Phelps, Amy L.; Martin, RobRoy L.
2013-01-01
Study Design: Quasi-experimental, repeated measures. Purpose/Background: Functional performance tests that identify hip joint impairments and assess the effect of intervention have not been adequately described for dancers. The purpose of this study was to examine the reliability and validity of hop and balance tests among a group of dancers with musculoskeletal pain in the hip region. Methods: Nineteen female dancers (age: 18.90±1.11 years; height: 164.85±6.95 cm; weight: 60.37±8.29 kg) with unilateral hip pain were assessed utilizing the cross-over reach, medial triple hop, lateral triple hop, and cross-over hop tests on two occasions, 2 days apart. Test-retest reliability and comparisons between the involved and uninvolved side for each respective test were determined. Results: Intra-class correlation coefficients for the functional performance tests ranged from 0.89-0.96. The cross-over reach test had a SEM of 2.79 cm and a MDC of 7.73 cm. The medial and lateral triple hop tests had SEM values of 7.51 cm and 8.17 cm, and MDC values of 20.81 cm and 22.62 cm, respectively. The SEM was 0.15 seconds and the MDC was 0.42 seconds for the cross-over hop test. Performance on the medial triple hop test was significantly less on the involved side (370.21±38.26 cm) compared to the uninvolved side (388.05±41.49 cm); t(18) = −4.33, p<0.01. The side-to-side comparisons of the cross-over reach test (involved mean=61.68±10.9 cm; uninvolved mean=61.69±8.63 cm); t(18) = −0.004, p=0.99, lateral triple hop test (involved mean=306.92±35.79 cm; uninvolved mean=310.68±24.49 cm); t(18) = −0.55, p=0.59, and cross-over hop test (involved mean=2.49±0.34 seconds; uninvolved mean= 2.61±0.42 seconds; t(18) = −1.84, p=0.08) were not statistically different between sides. Conclusion: The functional performance tests used in this study can be reliably performed on dancers with unilateral hip pain. 
The medial triple hop test was the only functional performance test with evidence of validity in side-to-side comparisons. These results suggest that the medial triple hop test may be a reliable and valid functional performance test to assess impairments related to hip pain among dancers. Level of Evidence: 3b. Non-consecutive cohort study PMID:24175123
Zhang, Yin-Ping; Wei, Huan-Huan; Wang, Wen; Xia, Ru-Yi; Zhou, Xiao-Ling; Porr, Caroline; Lammi, Mikko
2016-04-01
The Osteoporosis Assessment Questionnaire Short Version (OPAQ-SV) was cross-culturally adapted to measure health-related quality of life in Chinese osteoporotic fracture females and then validated in China for its psychometric properties. Cross-cultural adaptation, including translation of the original OPAQ-SV into Mandarin Chinese language, was performed according to published guidelines. Validation of the newly cross-culturally adapted OPAQ-SV was conducted by sampling 234 Chinese osteoporotic fracture females and also a control group of 235 Chinese osteoporotic females without fractures, producing robust content, construct, and discriminant validation results. Major categories of reliability were also met: the Cronbach alpha coefficient was 0.975, indicating good internal consistency; the test-retest reliability was 0.80; and principal component analysis resulted in a 6-factor structure explaining 75.847 % of the total variance. Further, the Comparative Fit Index result was 0.922 following the modified model confirmatory factor analysis, and the chi-squared test was 1.98. The root mean squared error of approximation was 0.078. Moreover, significant differences were revealed between females with fractures and those without fractures across all domains (p < 0.001). Overall, the newly cross-culturally adapted OPAQ-SV appears to possess adequate validity and reliability and may be utilized in clinical trials to assess the health-related quality of life in Chinese osteoporotic fracture females.
Cross-cultural adaption and validation of the Persian version of the SWAL-QOL.
Tarameshlu, Maryam; Azimi, Amir Reza; Jalaie, Shohreh; Ghelichi, Leila; Ansari, Noureddin Nakhostin
2017-06-01
The aim of this study was to translate and cross-culturally adapt the swallowing quality-of-life questionnaire (SWAL-QOL) into Persian and to determine the validity and reliability of the Persian version (PSWAL-QOL) in patients with oropharyngeal dysphagia. A cross-sectional survey was designed to translate and cross-culturally adapt the SWAL-QOL into Persian following the steps recommended in the guideline. A total of 142 patients with dysphagia (mean age = 56.7 ± 12.22 years) were selected by a non-probability consecutive sampling method to evaluate construct validity and internal consistency. Thirty patients with dysphagia completed the PSWAL-QOL again 2 weeks later to assess test-retest reliability. The PSWAL-QOL was favorably accepted, with no missing items. The floor effect ranged from 0% to 21% and the ceiling effect from 0% to 16%. Construct validity was established via exploratory factor analysis. Internal consistency was confirmed with Cronbach α >0.7 for all scales except eating duration (α = 0.68). Test-retest reliability was excellent, with an intraclass correlation coefficient (ICC) ≥0.75 for all scales. The SWAL-QOL was cross-culturally adapted to Persian and shown to be a valid and reliable self-report questionnaire for measuring the impact of dysphagia on quality of life in Persian patients with oropharyngeal dysphagia.
Comprehensive Assessment of Emotional Disturbance: A Cross-Validation Approach
ERIC Educational Resources Information Center
Fisher, Emily S.; Doyon, Katie E.; Saldana, Enrique; Allen, Megan Redding
2007-01-01
Assessing a student for emotional disturbance is a serious and complex task given the stigma of the label and the ambiguities of the federal definition. One way that school psychologists can be more confident in their assessment results is to cross validate data from different sources using the RIOT approach (Review, Interview, Observe, Test).…
Cross-Cultural Validation of Stages of Exercise Change Scale among Chinese College Students
ERIC Educational Resources Information Center
Keating, Xiaofen D.; Guan, Jianmin; Huang, Yong; Deng, Mingying; Wu, Yifeng; Qu, Shuhua
2005-01-01
The purpose of the study was to test the cross-cultural concurrent validity of the stages of exercise change scale (SECS) in Chinese college students. The original SECS was translated into Chinese (C-SECS). Students from four Chinese universities (N = 1843) participated in the study. The leisure-time exercise (LTE) questionnaire was used to…
Scale for positive aspects of caregiving experience: development, reliability, and factor structure.
Kate, N; Grover, S; Kulhara, P; Nehra, R
2012-06-01
OBJECTIVE. To develop an instrument (Scale for Positive Aspects of Caregiving Experience [SPACE]) that evaluates positive caregiving experience and assess its psychometric properties. METHODS. Available scales which assess some aspects of positive caregiving experience were reviewed and a 50-item questionnaire with a 5-point rating was constructed. In all, 203 primary caregivers of patients with severe mental disorders were asked to complete the questionnaire. Internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity were evaluated. Principal component factor analysis was run to assess the factorial validity of the scale. RESULTS. The scale developed as part of the study was found to have good internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity. Principal component factor analysis yielded a 4-factor structure, which also had good test-retest reliability and cross-language reliability. There was a strong correlation between the 4 factors obtained. CONCLUSION. The SPACE developed as part of this study has good psychometric properties.
A Cross-Modal Assessment of Reading Achievement in Children.
ERIC Educational Resources Information Center
Webb, Kathryn; And Others
1982-01-01
This study examined the ability of the Listen and Look (LL) test of cross-modal perception and the Metropolitan Readiness Test (MRT) to predict reading achievement. Data from 79 first-grade pupils were analyzed. Both the LL and MRT demonstrated predictive validity. (Author/BW)
ERIC Educational Resources Information Center
Knapp, Deirdre J.; Pliske, Rebecca M.
A study was conducted to validate the Army's Computerized Adaptive Screening Test (CAST), using data from 2,240 applicants from 60 army recruiting stations across the nation. CAST is a computer-assisted adaptive test used to predict performance on the Armed Forces Qualification Test (AFQT). AFQT scores are computed by adding four subtest scores of…
Cross-cultural adaptation and validation of the neonatal/infant Braden Q risk assessment scale.
de Lima, Edson Luiz; de Brito, Maria José Azevedo; de Souza, Diba Maria Sebba Tosta; Salomé, Geraldo Magela; Ferreira, Lydia Masako
2016-02-01
To translate into Brazilian Portuguese and cross-culturally adapt the Neonatal/Infant Braden Q Risk Assessment Scale (Neonatal/Infant Braden Q Scale), and test the psychometric properties, reproducibility and validity of the instrument. There is a lack of studies on the development of pressure ulcers in children, especially in neonates. Thirty professionals participated in the cross-cultural adaptation of the Brazilian-Portuguese version of the scale. Fifty neonates of both sexes were assessed between July 2013 and June 2014. Reliability and reproducibility were tested in 20 neonates and construct validity was measured by correlating the Neonatal/Infant Braden Q Scale with the Braden Q Risk Assessment Scale (Braden Q Scale). Discriminant validity was assessed by comparing the scores of neonates with and without ulcers. The scale showed inter-rater reliability (ICC = 0.98; P < 0.001) and intra-rater reliability (ICC = 0.79; P < 0.001). A strong correlation was found between the Neonatal/Infant Braden Q Scale and Braden Q Scale (r = 0.96; P < 0.001). The cross-culturally adapted Brazilian version of the Neonatal/Infant Braden Q Scale is a reliable instrument, showing face, content and construct validity. Copyright © 2015 Tissue Viability Society. Published by Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Shahnazari-Dorcheh, Mohammadtaghi; Roshan, Saeed
2012-01-01
Due to the lack of span test for the use in language-specific and cross-language studies, this study provides L1 and L2 researchers with a reliable language-independent span test (math span test) for the measurement of working memory capacity. It also describes the development, validation, and scoring method of this test. This test included 70…
The cross-cultural equivalence of participation instruments: a systematic review.
Stevelink, S A M; van Brakel, W H
2013-07-01
Concepts such as health-related quality of life, disability and participation may differ across cultures. Consequently, when assessing such a concept using a measure developed elsewhere, it is important to test its cultural equivalence. Previous research suggested a lack of cultural equivalence testing in several areas of measurement. This paper reviews the process of cross-cultural equivalence testing of instruments to measure participation in society. An existing cultural equivalence framework was adapted and used to assess participation instruments on five categories of equivalence: conceptual, item, semantic, measurement and operational equivalence. For each category, several aspects were rated, resulting in an overall category rating of 'minimal/none', 'partial' or 'extensive'. The best possible overall study rating was five 'extensive' ratings. Articles were included if the instruments focussed explicitly on measuring 'participation' and were theoretically grounded in the ICIDH(-2) or ICF. Cross-validation articles were only included if it concerned an adaptation of an instrument developed in a high or middle-income country to a low-income country or vice versa. Eight cross-cultural validation studies were included in which five participation instruments were tested (Impact on Participation and Autonomy, London Handicap Scale, Perceived Impact and Problem Profile, Craig Handicap Assessment Reporting Technique, Participation Scale). Of these eight studies, only three received at least two 'extensive' ratings for the different categories of equivalence. The majority of the cultural equivalence ratings given were 'partial' and 'minimal/none'. The majority of the 'none/minimal' ratings were given for item and measurement equivalence. The cross-cultural equivalence testing of the participation instruments included leaves much to be desired. A detailed checklist is proposed for designing a cross-validation study. 
Once a study has been conducted, the checklist can be used to ensure comprehensive reporting of the validation (equivalence) testing process and its results. • Participation instruments are often used in a different cultural setting than the one they were initially developed for. • The conceptualization of participation may vary across cultures. Therefore, cultural equivalence – the extent to which an instrument is equally suitable for use in two or more cultures – is an important concept to address. • This review showed that the process of cultural equivalence testing of the included participation instruments was often addressed insufficiently. • Clinicians should be aware that applying participation instruments in a culture other than the one they were initially developed for requires prior testing of cultural validity in the new context.
Translation, cross-cultural adaptation and validation of the Diabetes Empowerment Scale – Short Form
Chaves, Fernanda Figueredo; Reis, Ilka Afonso; Pagano, Adriana Silvina; Torres, Heloísa de Carvalho
2017-01-01
ABSTRACT OBJECTIVE To translate, cross-culturally adapt and validate the Diabetes Empowerment Scale – Short Form for assessment of psychosocial self-efficacy in diabetes care within the Brazilian cultural context. METHODS Assessment of the instrument’s conceptual equivalence, as well as its translation and cross-cultural adaptation, were performed following international standards. The Expert Committee’s assessment of the translated version was conducted through a web questionnaire developed and applied via the web tool e-Surv. The cross-culturally adapted version was used for the pre-test, which was carried out via phone call in a group of eleven health care service users diagnosed with type 2 diabetes mellitus. The pre-test results were examined by a group of experts, composed of health care consultants, applied linguists and statisticians, aiming at an adequate version of the instrument, which was subsequently used for test and retest via phone call in a sample of 100 users diagnosed with type 2 diabetes mellitus, their answers being recorded by the web tool e-Surv. Internal consistency and reproducibility analyses were carried out within the statistical programming environment R. RESULTS Face and content validity were attained and the Brazilian Portuguese version, entitled Escala de Autoeficácia em Diabetes – Versão Curta, was established. The scale had acceptable internal consistency with Cronbach’s alpha of 0.634 (95%CI 0.494–0.737), while the correlation of the total score in the two periods was considered moderate (0.47). The intraclass correlation coefficient was 0.50. CONCLUSIONS The translated and cross-culturally adapted version of the instrument to spoken Brazilian Portuguese was considered valid and reliable to be used for assessment within the Brazilian population diagnosed with type 2 diabetes mellitus.
The use of a web tool (e-Surv) for recording the Expert Committee responses as well as the responses in the validation tests proved to be a reliable, safe and innovative method. PMID:28355337
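The internal-consistency figure in this abstract is a Cronbach's alpha, which depends only on the item variances and the variance of the total score. The authors worked in R; the Python sketch below is an illustration of the standard formula alpha = k/(k − 1) × (1 − Σ item variances / total-score variance), with a hypothetical function name and toy data rather than the study's responses.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Toy data: three respondents answering three perfectly consistent items.
alpha = cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]])
```

Perfectly consistent items give an alpha of 1; items that share less variance pull the value toward 0.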
Imamura, Masaaki; Usui, Tomoko; Johnin, Kazuyoshi; Yoshimura, Koji; Farhat, Walid; Kanematsu, Akihiro; Ogawa, Osamu
2014-07-01
A validated questionnaire for the evaluation of pediatric lower urinary tract symptoms (LUTS) is greatly needed. We performed a cross-cultural validated adaptation of the Dysfunctional Voiding Symptom Score (DVSS) into Japanese and assessed, using a cognitive linguistic approach, whether children understand and respond to the questionnaire correctly. We translated the DVSS into two Japanese versions according to a standard validation methodology: translation, synthesis, back-translation, expert review, and pre-testing. One version was written in adult language for parents, and the other was written in child language for children. Pre-testing was done with 5- to 15-year-old patients of normal intelligence visiting us. A specialist in cognitive linguistics acted as interviewer and observed the responses of children and parents to the DVSS. When a child could not understand a question unless the parents added to or paraphrased it, this was defined as 'misidentification'. We performed pre-testing with 2 trial versions of the DVSS before arriving at the final version. The pre-testing of the first trial version was done with 32 patients (male to female ratio 19 : 13). The pre-testing of the second trial version was done with 11 patients (male to female ratio 8 : 3). In the child-language DVSS, misidentification was consistently observed for expressions of time or frequency. We completed the formal validated translation by amending the problems raised in the pre-testing. The cross-cultural validated adaptation of the DVSS to child and adult Japanese was completed. Since temporal perception is not fully developed in children, caution should be taken when using terms related to time or frequency in questionnaires for children.
Accelerating cross-validation with total variation and its application to super-resolution imaging
NASA Astrophysics Data System (ADS)
Obuchi, Tomoyuki; Ikeda, Shiro; Akiyama, Kazunori; Kabashima, Yoshiyuki
2017-12-01
We develop an approximation formula for the cross-validation error (CVE) of a sparse linear regression penalized by ℓ_1-norm and total variation terms, which is based on a perturbative expansion utilizing the largeness of both the data dimensionality and the model. The developed formula allows us to reduce the necessary computational cost of the CVE evaluation significantly. The practicality of the formula is tested through application to simulated black-hole image reconstruction on the event-horizon scale with super resolution. The results demonstrate that our approximation reproduces the CVE values obtained via literally conducted cross-validation with reasonably good precision.
Reimers, Anne K; Jekauc, Darko; Mess, Filip; Mewes, Nadine; Woll, Alexander
2012-08-29
Background: The purpose of this study was to examine the internal consistency, test-retest reliability, construct validity and predictive validity of a new German self-report instrument to assess the influence of social support and the physical environment on physical activity in adolescents. Methods: Based on theoretical considerations, the short scales on social support and physical environment were developed and cross-validated in two independent study samples of 9- to 17-year-old girls and boys. The longitudinal sample of Study I (n = 196) was recruited from a German comprehensive school, and subjects in this study completed the questionnaire twice with a between-test interval of seven days. Cronbach’s alphas were computed to determine the internal consistency of the factors. Test-retest reliability of the latent factors was assessed using intra-class coefficients. Factorial validity of the scales was assessed using principal components analysis. Construct validity was determined using a cross-validation technique by performing confirmatory factor analysis with the independent nationwide cross-sectional sample of Study II (n = 430). Correlations between factors and three measures of physical activity (objectively measured moderate-to-vigorous physical activity (MVPA), self-reported habitual MVPA and self-reported recent MVPA) were calculated to determine the predictive validity of the instrument. Results: Construct validity of the social support scale (two factors: parental support and peer support) and the physical environment scale (four factors: convenience, public recreation facilities, safety and private sport providers) was shown. Both scales had moderate test-retest reliability. The factors of the social support scale also had good internal consistency and predictive validity. Internal consistency and predictive validity of the physical environment scale were low to acceptable.
Conclusions: The results of this study indicate moderate to good reliability and construct validity of the social support scale and physical environment scale. Predictive validity was only confirmed for the social support scale but not for the physical environment scale. Hence, it remains unclear if a person’s physical environment has a direct or an indirect effect on physical activity behavior or a moderation function. PMID:22928865
Stevanovic, Dejan; Jafari, Peyman; Knez, Rajna; Franic, Tomislav; Atilola, Olayinka; Davidovic, Nikolina; Bagheri, Zahra; Lakic, Aneta
2017-02-01
In this systematic review, we assessed available evidence for cross-cultural measurement invariance of assessment scales for child and adolescent psychopathology as an indicator of cross-cultural validity. A literature search was conducted using the Medline, PsycINFO, Scopus, Web of Science, and Google Scholar databases. Cross-cultural measurement invariance data were available for 26 scales. Based on the aggregation of the evidence from the studies under review, none of the evaluated scales have strong evidence for cross-cultural validity and suitability for cross-cultural comparison. A few of the studies showed a moderate level of measurement invariance for some scales (such as the Fear Survey Schedule for Children-Revised, Multidimensional Anxiety Scale for Children, Revised Child Anxiety and Depression Scale, Revised Children's Manifest Anxiety Scale, Mood and Feelings Questionnaire, and Disruptive Behavior Rating Scale), which may make them suitable for cross-cultural comparative studies. The remainder of the scales showed either weak measurement invariance or an outright lack of it. This review showed only limited testing for measurement invariance across cultural groups of scales for pediatric psychopathology, with evidence of cross-cultural validity for only a few scales. This study also revealed a need to improve practices of statistical analysis reporting in testing measurement invariance. Implications for future research are discussed.
Koritar, Priscila; Philippi, Sonia Tucunduva; Alvarenga, Marle dos Santos; Santos, Bernardo dos
2014-08-01
The aim of this study was to present the cross-cultural adaptation and validation of the Health and Taste Attitude Scale in Portuguese. The methodology included translation of the scale; evaluation of conceptual, operational and item-based equivalence by 14 experts and 51 female undergraduates; semantic equivalence and measurement assessment with 12 bilingual women using the paired t-test, the Pearson correlation coefficient and the intraclass correlation coefficient; internal consistency and test-retest reliability by Cronbach's alpha and intraclass correlation coefficient, respectively, after application to 216 female undergraduates; assessment of discriminant and concurrent validity via the t-test and Spearman's correlation coefficient, respectively, in addition to confirmatory and exploratory factor analysis. The scale was considered adequate and easily understood by the experts and university students and presented good internal consistency and reliability (α 0.86, ICC 0.84). The results show that the scale is valid and can be used in studies with women to better understand attitudes related to taste.
Finn, James E.; Burger, Carl V.; Holland-Bartels, Leslie E.
1997-01-01
We used otolith banding patterns formed during incubation to discriminate among hatchery- and wild-incubated fry of sockeye salmon Oncorhynchus nerka from Tustumena Lake, Alaska. Fourier analysis of otolith luminance profiles was used to describe banding patterns: the amplitudes of individual Fourier harmonics were discriminant variables. Correct classification of otoliths to either hatchery or wild origin was 83.1% (cross-validation) and 72.7% (test data) with the use of quadratic discriminant function analysis on 10 Fourier amplitudes. Overall classification rates among the six test groups (one hatchery and five wild groups) were 46.5% (cross-validation) and 39.3% (test data) with the use of linear discriminant function analysis on 16 Fourier amplitudes. Although classification rates for wild-incubated fry from any one site never exceeded 67% (cross-validation) or 60% (test data), location-specific information was evident for all groups because the probability of classifying an individual to its true incubation location was significantly greater than chance. Results indicate phenotypic differences in otolith microstructure among incubation sites separated by less than 10 km. Analysis of otolith luminance profiles is a potentially useful technique for discriminating among and between various populations of hatchery and wild fish.
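To illustrate how Fourier harmonic amplitudes can serve as discriminant variables, the following is a minimal sketch, not the authors' pipeline. It assumes a luminance profile sampled at evenly spaced points along the otolith; the function name `harmonic_amplitudes` and the synthetic profile are hypothetical, and a plain discrete Fourier transform is used for clarity rather than speed.

```python
import cmath
import math


def harmonic_amplitudes(profile, n_harmonics):
    """Return the amplitudes of Fourier harmonics k = 1..n_harmonics.

    `profile` is a list of evenly spaced luminance samples (hypothetical input).
    The mean is removed first so that overall brightness does not matter.
    Valid for harmonics below the Nyquist limit (k < len(profile) / 2).
    """
    n = len(profile)
    mean = sum(profile) / n
    centered = [value - mean for value in profile]
    amplitudes = []
    for k in range(1, n_harmonics + 1):
        # Discrete Fourier coefficient of harmonic k
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        # For a real signal, the amplitude of harmonic k is 2|X_k| / n
        amplitudes.append(2 * abs(coeff) / n)
    return amplitudes


# Synthetic profile (illustrative only): 3 strong bands and 7 weaker bands
profile = [
    100
    + 5 * math.sin(2 * math.pi * 3 * t / 64)
    + 2 * math.cos(2 * math.pi * 7 * t / 64)
    for t in range(64)
]
amps = harmonic_amplitudes(profile, 10)
# amps[2] (harmonic 3) is about 5 and amps[6] (harmonic 7) is about 2
```

Each otolith then contributes one row of amplitudes, which could be passed to a linear or quadratic discriminant classifier of the reader's choice.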
2D-QSAR and 3D-QSAR Analyses for EGFR Inhibitors
Zhao, Manman; Zheng, Linfeng; Qiu, Chun
2017-01-01
Epidermal growth factor receptor (EGFR) is an important target for cancer therapy. In this study, EGFR inhibitors were investigated to build a two-dimensional quantitative structure-activity relationship (2D-QSAR) model and a three-dimensional quantitative structure-activity relationship (3D-QSAR) model. In the 2D-QSAR model, the support vector machine (SVM) classifier combined with the feature selection method was applied to predict whether a compound was an EGFR inhibitor. As a result, the prediction accuracy of the 2D-QSAR model was 98.99% by using tenfold cross-validation test and 97.67% by using independent set test. Then, in the 3D-QSAR model, the model with q2 = 0.565 (cross-validated correlation coefficient) and r2 = 0.888 (non-cross-validated correlation coefficient) was built to predict the activity of EGFR inhibitors. The mean absolute error (MAE) of the training set and test set was 0.308 log units and 0.526 log units, respectively. In addition, molecular docking was also employed to investigate the interaction between EGFR inhibitors and EGFR. PMID:28630865
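The difference between the cross-validated q2 and the non-cross-validated r2 can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: 3D-QSAR models are usually fitted by partial least squares, whereas this sketch swaps in a one-descriptor least-squares line, uses leave-one-out cross-validation, and takes q2 = 1 - PRESS / SS_tot with SS_tot computed around the mean of all observations (other conventions exist). The data and function names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def total_sum_of_squares(ys):
    mean_y = sum(ys) / len(ys)
    return sum((y - mean_y) ** 2 for y in ys)


def r_squared(xs, ys):
    """Non-cross-validated r2: the model is fitted and scored on the same data."""
    slope, intercept = fit_line(xs, ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return 1 - ss_res / total_sum_of_squares(ys)


def q_squared_loo(xs, ys):
    """Leave-one-out q2: each point is predicted by a model fitted without it."""
    press = 0.0
    for i in range(len(xs)):
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    return 1 - press / total_sum_of_squares(ys)


# Hypothetical descriptor values and activities (illustrative only)
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [5.1, 5.9, 7.2, 7.8, 9.1, 9.7]
r2 = r_squared(xs, ys)
q2 = q_squared_loo(xs, ys)
```

With the same SS_tot in both formulas, q2 cannot exceed r2 for a least-squares fit, which is consistent with the gap between the two values reported in the abstract.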
ERIC Educational Resources Information Center
Gorzycki, Meg; Howard, Pamela; Allen, Diane; Desa, Geoffrey; Rosegard, Erik
2016-01-01
Academic reading proficiency is characterized by the ability to perform cognitive tasks associated with interpreting text. Researchers developed an externally validated Informal Academic Reading Proficiency Test to gauge undergraduates' academic reading proficiency. A cross-sectional study of 23 classes completed the reading test in 2014. This…
Niemeijer, Anuschka S; van Waelvelde, Hilde; Smits-Engelsman, Bouwien C M
2015-02-01
The Movement Assessment Battery for Children has been revised as the Movement ABC-2 (Henderson, Sugden, & Barnett, 2007). In Europe, the 15th percentile score on this test is recommended for one of the DSM-IV diagnostic criteria for Developmental Coordination Disorder (DCD). A representative sample of Dutch and Flemish children was tested to cross-validate the UK standard scores, including the 15th percentile score. First, the mean, SD and percentile scores of Dutch children were compared to those of UK normative samples. Item standard scores of Dutch-speaking children deviated from the UK reference values, suggesting necessary adjustments. Except for very young children, the Dutch-speaking samples performed better. Second, based on the mean and SD and clinically relevant cut-off scores (5th and 15th percentile), norms were adjusted for the Dutch population. For diagnostic use, researchers and clinicians should use the reference norms that are valid for the group of children they are testing. The results indicate that there possibly is an effect of testing procedure in other countries that validated the UK norms and/or cultural influence on the age norms of the Movement ABC-2. It is suggested to formulate criterion-based norms for age groups in addition to statistical norms. Copyright © 2014 Elsevier B.V. All rights reserved.
Validation and cross cultural adaptation of the Italian version of the Harris Hip Score.
Dettoni, Federico; Pellegrino, Pietro; La Russa, Massimo R; Bonasia, Davide E; Blonna, Davide; Bruzzone, Matteo; Castoldi, Filippo; Rossi, Roberto
2015-01-01
The Harris Hip Score (HHS) is one of the most widely used health related quality of life (HRQOL) measures for the assessment of hip pathology; despite this, neither a validation study nor an official Italian version has yet been provided. The aim of this study was to create a valid and reliable Italian version of the HHS. The score was translated into Italian and adapted; then 103 patients with different hip pathologies were evaluated using this HHS version and also with the WOMAC and the SF-12 questionnaires. Content, construct and criterion validities were tested, as well as interobserver reliability, test-retest reliability and internal consistency. Cross-cultural adaptation was easy, and only minor adaptation was required in the translation process. Construct and criterion validity of the HHS Italian Version were confirmed by satisfactory values of Spearman's Rho for correlation between specific domains of HHS and WOMAC and SF-12 scores. Interobserver and test-retest reliabilities obtained values of 0.996 and 0.975 respectively; Cronbach's alpha for internal consistency was 0.816. Statistical and clinical analysis showed that HHS is highly valid and reliable in this new Italian version.
[Selection of risk and diagnosis in diabetic polyneuropathy. Validation of method of new systems].
Jurado, Jerónimo; Caula, Jacinto; Pou i Torelló, Josep Maria
2006-06-30
In a previous study we developed a specific algorithm, the polyneuropathy selection method (PSM) with 4 parameters (age, HDL-C, HbA1c, and retinopathy), to select patients at risk of diabetic polyneuropathy (DPN). We also developed a simplified method for DPN diagnosis: outpatient polyneuropathy diagnosis (OPD), with 4 variables (symptoms and 3 objective tests). To confirm the validity of conventional tests for DPN diagnosis; to validate the discriminatory power of the PSM and the diagnostic value of OPD by evaluating their relationship to electrodiagnosis studies and objective clinical neurological assessment; and to evaluate the correlation of DPN and pro-inflammatory status. Cross-sectional, crossed association for PSM validation. Paired samples for OPD validation. Primary care in 3 counties. Random sample of 75 subjects from the type-2 diabetes census for PSM evaluation. Thirty DPN patients and 30 non-DPN patients (from 2 DM2 sub-groups in our earlier study) for OPD evaluation. The gold standard for DPN diagnosis will be studied by means of a clinical neurological study (symptoms, physical examination, and sensitivity tests) and electrodiagnosis studies (sensitivity and motor EMG). Risks of neuropathy, macroangiopathy and pro-inflammatory status (PCR, TNF soluble fraction and total TGF-beta1) will be studied in every subject. Electrodiagnosis studies should confirm the validity of conventional tests for DPN diagnosis. PSM and OPD will be valid methods for selecting patients at risk and diagnosing DPN. There will be a significant relationship between DPN and pro-inflammatory tests.
Absorption in Sport: A Cross-Validation Study
Koehn, Stefan; Stavrou, Nektarios A. M.; Cogley, Jeremy; Morris, Tony; Mosek, Erez; Watt, Anthony P.
2017-01-01
Absorption has been identified as readiness for experiences of deep involvement in the task. Conceptually, absorption is a key psychological construct, incorporating experiential, cognitive, and motivational components. However, no operationalization of the construct has been provided to facilitate research in this area. The purpose of this research was the development and examination of the psychometric properties of a sport-specific measure of absorption that evolved from the use of the modified Tellegen Absorption Scale (MODTAS; Jamieson, 2005) in mainstream psychology. The study aimed to provide evidence of the psychometric properties, reliability, and validity of the Measure of Absorption in Sport Contexts (MASC). The psychometric examination included a calibration sample from Scotland and a cross-validation sample from Australia using a cross-sectional design. The item pool was developed based on existing items from the modified Tellegen Absorption Scale (Jamieson, 2005). The MODTAS items were reworded and translated into a sport context. The Scottish sample consisted of 292 participants and the Australian sample of 314 participants. Congeneric model testing and confirmatory factor analysis for both samples and multi-group invariance testing across samples were used. In the cross-validation sample the MASC subscales showed acceptable internal consistency and construct reliability (≥0.70). Excellent fit indices were found for the final 18-item, six-factor measure in the cross-validation sample, χ²(120) = 197.486, p < 0.001; CFI = 0.957; TLI = 0.945; RMSEA = 0.045; SRMR = 0.044. Multi-group invariance testing revealed no differences in item meaning, except for two items. The MASC and the Dispositional Flow Scale-2 showed moderate-to-strong positive correlations in both samples, r = 0.38, p < 0.001 and r = 0.42, p < 0.001, supporting the external validity of the MASC. 
This article provides initial evidence in support of the psychometric properties, reliability, and validity of the sport-specific measure of absorption. The MASC provides rich research opportunities in sport psychology that can enhance the theoretical understanding between absorption and related constructs and facilitate future intervention studies. PMID:28883802
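As a consistency check on the reported fit indices, the RMSEA can be recomputed from the chi-square statistic, its degrees of freedom, and the sample size. The sketch below assumes the common single-group formula RMSEA = sqrt(max(χ² - df, 0) / (df · (N - 1))) and assumes N = 314 (the Australian cross-validation sample); the abstract does not state which software or formula variant the authors used, and some programs divide by N instead of N - 1.

```python
import math


def rmsea(chi_square, df, n):
    """Root mean square error of approximation (single-group form, N - 1 variant)."""
    return math.sqrt(max(chi_square - df, 0.0) / (df * (n - 1)))


# Values reported in the abstract for the cross-validation sample
value = rmsea(chi_square=197.486, df=120, n=314)
print(round(value, 3))  # 0.045, matching the reported RMSEA
```

Under these assumptions the recomputed value agrees with the reported RMSEA of 0.045 to three decimal places.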
Cross-Cultural Validation of the Patient Perception of Integrated Care Survey.
Tietschert, Maike V; Angeli, Federica; van Raak, Arno J A; Ruwaard, Dirk; Singer, Sara J
2017-07-20
To test the cross-cultural validity of the U.S. Patient Perception of Integrated Care (PPIC) Survey in a Dutch sample using a standardized procedure. Primary data collected from patients of five primary care centers in the south of the Netherlands, through survey research from 2014 to 2015. Cross-sectional data collected from patients who saw multiple health care providers during 6 months preceding data collection. The PPIC survey includes 59 questions that measure patient perceived care integration across providers, settings, and time. Data analysis followed a standardized procedure guiding data preparation, psychometric analysis, and included invariance testing with the U.S. dataset. Latent scale structures of the Dutch and U.S. survey were highly comparable. Factor "Integration with specialist" had lower reliability scores and noninvariance. For the remaining factors, internal consistency and invariance estimates were strong. The standardized cross-cultural validation procedure produced strong support for comparable psychometric characteristics of the Dutch and U.S. surveys. Future research should examine the usability of the proposed procedure for contexts with greater cultural differences. © Health Research and Educational Trust.
Korakakis, Vasileios; Patsiaouras, Asterios; Malliaropoulos, Nikos
2014-12-01
To cross-culturally adapt the VISA-P questionnaire for Greek-speaking patients and evaluate its psychometric properties. The VISA-P was developed in the English language to evaluate patients with patellar tendinopathy. The validity and use of self-administered questionnaires in different language and cultural populations require a specific procedure in order to maintain their content validity. The VISA-P questionnaire was translated and cross-culturally adapted according to specific guidelines. The validity and reliability were tested in 61 healthy recreational athletes, 64 athletes at risk from different sports, 32 patellar tendinopathy patients and 30 patients with other knee injuries. Participants completed the questionnaire at baseline and after 15-17 days. The questionnaire's face and content validity were judged as good by the expert committee, and the participants. Concurrent validity was almost perfect (ρ=-0.839, p<0.001). Also, factorial validity testing revealed a two-factor solution, which explained 85.6% of the total variance. A one-factor solution explained 80.8% of the variance when the other knee injury group was excluded. Known group validity was demonstrated by significant differences between patients compared with the asymptomatic groups (p<0.001). The VISA-P-GR exhibited very good test-retest reliability (ICC=0.818, p<0.001; 95% CI 0.758 to 0.864) and internal consistency since Cronbach's α analysis ranged from α=0.785 to 0.784 following a 15-17 days interval. The translated VISA-P-GR is a valid and reliable questionnaire and its psychometric properties are comparable with the original and adapted versions. Published by the BMJ Publishing Group Limited.
Validation of Cross Sections for Monte Carlo Simulation of the Photoelectric Effect
NASA Astrophysics Data System (ADS)
Han, Min Cheol; Kim, Han Sung; Pia, Maria Grazia; Basaglia, Tullio; Batič, Matej; Hoff, Gabriela; Kim, Chan Hyeong; Saracco, Paolo
2016-04-01
Several total and partial photoionization cross section calculations, based on both theoretical and empirical approaches, are quantitatively evaluated with statistical analyses using a large collection of experimental data retrieved from the literature to identify the state of the art for modeling the photoelectric effect in Monte Carlo particle transport. Some of the examined cross section models are available in general purpose Monte Carlo systems, while others have been implemented and subjected to validation tests for the first time to estimate whether they could improve the accuracy of particle transport codes. The validation process identifies Scofield's 1973 non-relativistic calculations, tabulated in the Evaluated Photon Data Library (EPDL), as the one best reproducing experimental measurements of total cross sections. Specialized total cross section models, some of which derive from more recent calculations, do not provide significant improvements. Scofield's non-relativistic calculations are not surpassed regarding the compatibility with experiment of K and L shell photoionization cross sections either, although in a few test cases Ebel's parameterization produces more accurate results close to absorption edges. Modifications to Biggs and Lighthill's parameterization implemented in Geant4 significantly reduce the accuracy of total cross sections at low energies with respect to its original formulation. The scarcity of suitable experimental data hinders a similar extensive analysis for the simulation of the photoelectron angular distribution, which is limited to a qualitative appraisal.
Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha
2018-01-01
Background: An appropriate scale to assess dental anxiety in the Hindi-speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of the Hindi version of one of the oldest dental anxiety scales, Corah’s Dental Anxiety Scale (CDAS), in Hindi-speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of the CDAS with a visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach’s alpha and intraclass correlation; 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100%. The Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items. The single factor model explained 61% variance. The Pearson correlation coefficient between CDAS and VAS was 0.494. Test-retest showed a Cronbach’s alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: The Hindi version of the CDAS is a valid and reliable scale to assess dental anxiety in the Hindi-speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study. PMID:29744307
Ko, Jupil; Rosen, Adam B; Brown, Cathleen N
2015-12-01
The Cumberland Ankle Instability Tool (CAIT) is a valid and reliable patient reported outcome used to assess the presence and severity of chronic ankle instability (CAI). The CAIT has been cross-culturally adapted into other languages for use in non-English speaking populations. However, there are no valid questionnaires to assess CAI in individuals who speak Korean. The purpose of this study was to translate, cross-culturally adapt, and validate the CAIT, for use in a Korean-speaking population with CAI. Cross-cultural reliability study. The CAIT was cross-culturally adapted into Korean according to accepted guidelines and renamed the Cumberland Ankle Instability Tool-Korean (CAIT-K). Twenty-three participants (12 males, 11 females) who were bilingual in English and Korean were recruited and completed the original and adapted versions to assess agreement between versions. An additional 168 national level Korean athletes (106 male, 62 females; age = 20.3 ± 1.1 yrs), who participated in ≥ 90 minutes of physical activity per week, completed the final version of the CAIT-K twice within 14 days. Their completed questionnaires were assessed for internal consistency, test-retest reliability, criterion validity, and construct validity. For bilingual participants, intra-class correlation coefficients (ICC2,1) between the CAIT and the CAIT-K for test-retest reliability were 0.95 (SEM=1.83) and 0.96 (SEM=1.50) in right and left limbs, respectively. The Cronbach's alpha coefficients were 0.92 and 0.90 for the CAIT-K in right and left limbs, respectively. For native Korean speakers, the CAIT-K had high internal consistency (Cronbach's α=0.89) and intra-class correlation coefficient (ICC2,1 = 0.94, SEM=1.72), correlation with the physical component score (rho=0.70, p = 0.001) of the Short-Form Health Survey (SF-36), and the Kaiser-Meyer-Olkin score was 0.87. The original CAIT was translated, cross-culturally adapted, and validated from English to Korean. 
The CAIT-K appears to be valid and reliable and could be useful in assessing the Korean speaking population with CAI.
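Several abstracts in this listing report Cronbach's alpha as the measure of internal consistency. A minimal sketch of the usual formula, α = k / (k - 1) · (1 - Σ item variances / variance of total scores), is given below; the function name and the response matrix are hypothetical and do not reproduce any study's data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a response matrix.

    `scores` is a list of rows, one per respondent, each row holding that
    respondent's score on every item. Sample variances (n - 1) are used.
    """
    k = len(scores[0])  # number of items
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 5 respondents, 4 items scored 0-4 (illustrative only)
responses = [
    [4, 3, 4, 3],
    [2, 2, 3, 2],
    [0, 1, 1, 0],
    [3, 3, 2, 3],
    [1, 1, 2, 1],
]
alpha = cronbach_alpha(responses)
```

When every item ranks respondents identically, alpha reaches 1; values fall as items become less correlated with one another.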
Teacher Immediacy Scales: Testing for Validity across Cultures
ERIC Educational Resources Information Center
Zhang, Qin; Oetzel, John G.; Gao, Xiaofang; Wilcox, Richard G.; Takai, Jiro
2007-01-01
Cross-cultural validity of teacher immediacy scales is a constant concern in instructional communication research. The present study examines the validity of two existing teacher immediacy scales: the Revised Nonverbal Immediacy Measure (RNIM) and the Chinese Teacher Immediacy Scale (CTIS) in U.S., Chinese, German, and Japanese cultures. Results…
Validation of Yoon's Critical Thinking Disposition Instrument.
Shin, Hyunsook; Park, Chang Gi; Kim, Hyojin
2015-12-01
The lack of reliable and valid evaluation tools targeting Korean nursing students' critical thinking (CT) abilities has been reported as one of the barriers to instructing and evaluating students in undergraduate programs. Yoon's Critical Thinking Disposition (YCTD) instrument was developed for Korean nursing students, but few studies have assessed its validity. This study aimed to validate the YCTD. Specifically, the YCTD was assessed to identify its cross-sectional and longitudinal measurement invariance. This was a validation study in which a cross-sectional and longitudinal (prenursing and postnursing practicum) survey was used to validate the YCTD using 345 nursing students at three universities in Seoul, Korea. The participants' CT abilities were assessed using the YCTD before and after completing an established pediatric nursing practicum. The validity of the YCTD was estimated and then group invariance test using multigroup confirmatory factor analysis was performed to confirm the measurement compatibility of multigroups. A test of the seven-factor model showed that the YCTD demonstrated good construct validity. Multigroup confirmatory factor analysis findings for the measurement invariance suggested that this model structure demonstrated strong invariance between groups (i.e., configural, factor loading, and intercept combined) but weak invariance within a group (i.e., configural and factor loading combined). In general, traditional methods for assessing instrument validity have been less than thorough. In this study, multigroup confirmatory factor analysis using cross-sectional and longitudinal measurement data allowed validation of the YCTD. This study concluded that the YCTD can be used for evaluating Korean nursing students' CT abilities. Copyright © 2015. Published by Elsevier B.V.
Angers, Magalie; Svotelis, Amy; Balg, Frederic; Allard, Jean-Pascal
2016-04-01
The Ankle Osteoarthritis Scale (AOS) is a self-administered score specific for ankle osteoarthritis (OA) with excellent reliability and strong construct and criterion validity. Many recent randomized multicentre trials have used the AOS, and the involvement of the French-speaking population is limited by the absence of a French version. Our goal was to develop a French version and validate the psychometric properties to assure equivalence to the original English version. Translation was performed according to American Association of Orthopaedic Surgeons (AAOS) 2000 guidelines for cross-cultural adaptation. Similar to the validation process of the English AOS, we evaluated the psychometric properties of the French version (AOS-Fr): criterion validity (AOS-Fr v. Western Ontario and McMaster Universities Arthritis Index [WOMAC] and SF-36 scores), construct validity (AOS-Fr correlation to single heel-lift test), and reliability (AOS-Fr test-retest). Sixty healthy individuals tested a prefinal version of the AOS-Fr for comprehension, leading to modifications and a final version that was approved by C. Saltzman, author of the AOS. We then recruited patients with ankle OA for evaluation of the AOS-Fr psychometric properties. Twenty-eight patients with ankle OA participated in the evaluation. The AOS-Fr showed strong criterion validity (AOS:WOMAC r = 0.709 and AOS:SF-36 r = -0.654) and construct validity (r = 0.664) and proved to be reliable (test-retest intraclass correlation coefficient = 0.922). The AOS-Fr is a reliable and valid score equivalent to the English version in terms of psychometric properties, thus is available for use in multicentre trials.
Oyeyemi, Adewale L; Oyeyemi, Adetoyeje Y; Adegoke, Babatunde O; Oyetoke, Fatima O; Aliyu, Habeeb N; Aliyu, Salamatu U; Rufai, Adamu A
2011-11-22
Accurate assessment of physical activity is important in determining the risk for chronic diseases such as cardiovascular disease, stroke, type 2 diabetes, cancer and obesity. The absence of culturally relevant measures in indigenous languages could pose challenges to epidemiological studies on physical activity in developing countries. The purpose of this study was to translate and cross-culturally adapt the Short International Physical Activity Questionnaire (IPAQ-SF) to the Hausa language, and to evaluate the validity and reliability of the Hausa version of IPAQ-SF in Nigeria. The English IPAQ-SF was translated into the Hausa language, synthesized, back translated, and subsequently subjected to expert committee review and pre-testing. The final product (Hausa IPAQ-SF) was tested in a cross-sectional study for concurrent (correlation with the English version) and construct validity, and test-retest reliability in a sample of 102 apparently healthy adults. The Hausa IPAQ-SF has good concurrent validity with Spearman correlation coefficients (ρ) ranging from 0.78 for vigorous activity (min·week⁻¹) to 0.92 for total physical activity (Metabolic Equivalent of Task [MET]-min·week⁻¹), but poor construct validity, with cardiorespiratory fitness (ρ = 0.21, p = 0.01) and body mass index (ρ = 0.22, p = 0.04) significantly correlated with only moderate activity and sitting time (min·week⁻¹), respectively. Reliability was good for vigorous (ICC = 0.73, 95% CI = 0.55-0.84) and total physical activity (ICC = 0.61, 95% CI = 0.47-0.72), but fair for moderate activity (ICC = 0.33, 95% CI = 0.12-0.51), and few meaningful differences were found in the gender and socioeconomic status specific analyses. The Hausa IPAQ-SF has acceptable concurrent validity and test-retest reliability for vigorous-intensity activity, walking, sitting and total physical activity, but demonstrated only fair construct validity for moderate and sitting activities.
The Hausa IPAQ-SF can be used for physical activity measurements in Nigeria, but further construct validity testing with objective measures such as an accelerometer is needed.
Konzelmann, M; Burrus, C; Hilfiker, R; Rivier, G; Deriaz, O; Luthi, F
2015-03-01
Functional evaluation of the upper limb is not only based on clinical findings but also requires self-administered questionnaires to address the patient's perspective. The Hand Function Sort (HFS©) had only been validated in English. The aim of this study was the French cross-cultural adaptation and validation of the HFS© (HFS-F). 150 patients with various upper limb impairments were recruited in a rehabilitation center. Translation and cross-cultural adaptation were made according to international guidelines. Construct validity was estimated through correlations with the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire, SF-36 mental component summary (MCS), SF-36 physical component summary (PCS) and pain intensity. Internal consistency was assessed by Cronbach's α and test-retest reliability by intraclass correlation. Cronbach's α was 0.98, and test-retest reliability was excellent at 0.921 (95% CI 0.871-0.971), the same as for the original HFS©. Correlations were -0.779 with DASH (95% CI -0.847 to -0.685); 0.452 with SF-36 PCS (95% CI 0.276-0.599); -0.247 with pain (95% CI -0.429 to -0.041); and 0.242 with SF-36 MCS (95% CI 0.042-0.422). There were no floor or ceiling effects. The HFS-F has the same good psychometric properties as the original HFS© (internal consistency, test-retest reliability, convergent validity with DASH, divergent validity with SF-36 MCS, and no floor or ceiling effects). The convergent validity with SF-36 PCS was poor; we found no correlation with pain. The HFS-F could be used with confidence in a population of working patients. Other studies are necessary to study its psychometric properties in other populations.
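Several of the abstracts in this listing report internal consistency as Cronbach's α. As a minimal sketch of how that coefficient is usually computed (the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores)), the following uses only the Python standard library. The function name `cronbach_alpha` and the toy scores are hypothetical and are not taken from any of the studies.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of items.

    item_scores: list of per-item lists, each holding one score per
    respondent (all items must cover the same respondents, same order).
    """
    k = len(item_scores)
    if k < 2:
        raise ValueError("need at least two items")
    # Total score per respondent: sum across items.
    totals = [sum(resp) for resp in zip(*item_scores)]
    # Sum of the sample variances of the individual items.
    sum_item_var = sum(variance(item) for item in item_scores)
    return k / (k - 1) * (1 - sum_item_var / variance(totals))


# Hypothetical toy data: 3 items answered by 4 respondents.
items = [
    [1, 2, 3, 4],
    [2, 3, 4, 5],
    [1, 3, 3, 5],
]
alpha = cronbach_alpha(items)
print(f"alpha = {alpha:.3f}")
```

Sample variances (n - 1 denominator) are used for both the items and the totals; using population variances throughout gives the same ratio.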
He, S L; Wang, J H; Ji, P
2018-03-01
To validate the Pain Resilience Scale (PRS) for use in Chinese patients with temporomandibular disorders (TMD) pain. According to international guidelines, the original PRS was first translated and cross-culturally adapted to formulate the Chinese version of PRS (PRS-C). A total of 152 patients with TMD pain were recruited to complete a series of questionnaires. Reliability of the PRS-C was investigated using internal consistency and test-retest reliability. Validity of the PRS-C was assessed using cross-cultural validity and convergent validity. Cross-cultural validity was evaluated with confirmatory factor analysis (CFA). Convergent validity was examined by correlating the PRS-C scores with scores of 2 commonly used pain-related measures (the Connor-Davidson Resilience Scale [CD-RISC] and the Tampa Scale for Kinesiophobia for Temporomandibular Disorders [TSK-TMD]). The PRS-C had a high internal consistency (Cronbach's alpha = 0.92) and good test-retest reliability (intra-class correlation coefficient [ICC] = 0.81). The CFA supported a 2-factor model for the PRS-C with acceptable fit to the data. The fit indices were chi-square/DF = 2.21, GFI = 0.91, TLI = 0.97, CFI = 0.98 and RMSEA = 0.08. As regards convergent validity, the PRS-C evidenced moderate-to-good relationships with the CD-RISC and the TSK-TMD. The PRS-C shows good psychometric properties and could be considered as a reliable and valid measure to evaluate pain-related resilience in patients with TMD pain. © 2017 John Wiley & Sons Ltd.
Watt, Torquil; Barbesino, Giuseppe; Bjorner, Jakob Bue; Bonnema, Steen Joop; Bukvic, Branka; Drummond, Russell; Groenvold, Mogens; Hegedüs, Laszlo; Kantzer, Valeska; Lasch, Kathryn E; Marcocci, Claudio; Mishra, Anjali; Netea-Maier, Romana; Ekker, Merel; Paunovic, Ivan; Quinn, Terence J; Rasmussen, Åse Krogh; Russell, Audrey; Sabaretnam, Mayilvaganan; Smit, Johannes; Törring, Ove; Zivaljevic, Vladan; Feldt-Rasmussen, Ulla
2015-03-01
Thyroid diseases are common and often affect quality of life (QoL). No cross-culturally validated patient-reported outcome measuring thyroid-related QoL is available. The purpose of the present study was to test the cross-cultural validity of the newly developed thyroid-related patient-reported outcome ThyPRO, using tests for differential item functioning (DIF) according to language version. The ThyPRO consists of 85 items summarized in 13 multi-item scales and one single item. Scales cover physical and mental symptoms, well-being and function as well as social and daily function and cosmetic concerns. Translation applied standard forward-backward methodology with subsequent cognitive interviews and reviews. Responses (N = 1,810) to the ThyPRO were collected in seven countries: UK (n = 166), The Netherlands (n = 147), Serbia (n = 150), Italy (n = 110), India (n = 148), Denmark (n = 902) and Sweden (n = 187). Translated versions were compared pairwise to the English version by examining uniform and nonuniform DIF, i.e., whether patients from different countries respond differently to a particular item, although they have identical level of the concept measured by the item. Analyses were controlled for thyroid diagnosis. DIF was investigated by ordinal logistic regression, testing for both statistical significance and magnitude (ΔR² > 0.02). Scale level was estimated by the sum score, after purification. For twelve of the 84 tested items, DIF was identified in more than one language. Eight of these were small, but four were indicative of possible low translatability. Twenty-one instances of DIF in single languages were identified, indicating potential problems with the particular translation. However, only seven were of a magnitude which could affect scale scores, most of which could be explained by sample differences not controlled for.
The ThyPRO has good cross-cultural validity with only minor cross-cultural invariance and is recommended for use in international multicenter studies.
Sucupira, Eduardo; Sabino, Miguel; Lima, Edson Luiz de; Dini, Gal Moreira; Brito, Maria José Azevedo de; Ferreira, Lydia Masako
2017-01-01
Patient-reported outcome measurements assessing the emotional state of children and adolescents who seek plastic surgery are important for determining whether the intervention is indicated or not. The aim of this study was to cross-culturally adapt and validate the Short Mood and Feelings Questionnaire (child/adolescent and parent versions) for Brazilian Portuguese, test its psychometric properties and assess the emotional state of children and adolescents who seek plastic surgery. This was a cross-cultural validation study conducted in a plastic surgery outpatient clinic at a public university hospital. A total of 124 consecutive patients of both sexes were selected between September 2013 and February 2014. Forty-seven patients participated in the cultural adaptation of the questionnaire. The final version was tested for reliability on 20 patients. Construct validity was tested on 57 patients by correlating the Short Mood and Feelings Questionnaire (child/adolescent and parent versions) with the Strengths and Difficulties Questionnaire and the Rosenberg Self-Esteem scale. The child/adolescent and parent versions of the Short Mood and Feelings Questionnaire showed Cronbach's alpha of 0.768 and 0.874, respectively, and had good inter-rater reliability (intraclass correlation coefficient, ICC = 0.757 and ICC = 0.853, respectively) and intra-rater reliability (ICC = 0.738 and ICC = 0.796, respectively). The Brazilian-Portuguese version of the Short Mood and Feelings Questionnaire is a reproducible instrument with face, content and construct validity. The mood state and feelings among children and adolescents seeking cosmetic surgery were healthy.
ERIC Educational Resources Information Center
Walsh, Jennifer R.; Hebert, Angel; Byrd-Bredbenner, Carol; Carey, Gale; Colby, Sarah; Brown-Esters, Onikia N.; Greene, Geoffrey; Hoerr, Sharon; Horacek, Tanya; Kattelmann, Kendra; Kidd, Tandalayo; Koenings, Mallory; Phillips, Beatrice; Shelnutt, Karla P.; White, Adrienne A.
2012-01-01
Objective: To develop and test the validity of the Behavior, Environment, and Changeability Survey (BECS) for identifying the importance and changeability of nutrition, exercise, and stress management behavior and related aspects of the environment. Design: A cross-sectional, online survey of the BECS and selected validated instruments. Setting:…
Bucci, Rosaria; Rongo, Roberto; Zito, Eugenio; Galeotti, Angela; Valletta, Rosa; D'Antò, Vincenzo
2015-03-01
To validate and cross-culturally adapt the Italian version of the Psychological Impact of Dental Aesthetics Questionnaire (PIDAQ) among Italian young adults. After translation, back translation, and cross-cultural adaptation of the English PIDAQ, a first version of the Italian questionnaire was pretested. The final Italian PIDAQ was administered to 598 subjects aged 18-30 years, along with two other instruments: the aesthetic component of the index of orthodontic treatment need (IOTN-AC) and the perception of occlusion scale (POS), which identified the self-reporting grade of malocclusion. Structural validity was assessed by means of factorial analysis, internal consistency was measured with Cronbach's alpha coefficient (α), convergent validity was assessed by means of Spearman correlation, and test-retest reliability was calculated with intra-class correlation coefficient (ICC) and standard measurement error. Criterion validity was evaluated by multivariate and univariate analysis of variance with Bonferroni post hoc tests. The α of the Italian PIDAQ domains ranged between 0.79 and 0.92. The ICC was between 0.81 and 0.90. The mean scores of each PIDAQ domain showed a statistically significant difference when analysed according to the IOTN-AC and POS scores. The satisfactory psychometric properties make PIDAQ a usable tool for future studies on oral health-related quality of life among Italian young adults.
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.
Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott
2015-12-01
To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
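The SEM/SDC pairs reported above (5.5/15.3 and 6.8/18.8) are consistent with the commonly used relation SDC = 1.96 · √2 · SEM. The abstract does not state which formula the authors applied, so the sketch below is an assumption; the function name `smallest_detectable_change` is hypothetical.

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level from the standard error of measurement.

    Assumes the common convention SDC = z * sqrt(2) * SEM, where sqrt(2)
    accounts for measurement error in both of the two compared scores.
    """
    return z * math.sqrt(2) * sem


# SEM values as reported in the abstract (rounded to one decimal).
sdc_psst = smallest_detectable_change(5.5)
sdc_poss = smallest_detectable_change(6.8)
print(f"PSST: {sdc_psst:.2f}, POSS: {sdc_poss:.2f}")
```

The small gap between the recomputed PSST value and the reported 15.3 is what one would expect if the published SEM was rounded before reporting.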
Parametric vs. non-parametric statistics of low resolution electromagnetic tomography (LORETA).
Thatcher, R W; North, D; Biver, C
2005-01-01
This study compared the relative statistical sensitivity of non-parametric and parametric statistics of 3-dimensional current sources as estimated by the EEG inverse solution Low Resolution Electromagnetic Tomography (LORETA). One would expect approximately 5% false positives (classification of a normal as abnormal) at the P < .025 level of probability (two tailed test) and approximately 1% false positives at the P < .005 level. EEG digital samples (2 second intervals sampled 128 Hz, 1 to 2 minutes eyes closed) from 43 normal adult subjects were imported into the Key Institute's LORETA program. We then used the Key Institute's cross-spectrum and the Key Institute's LORETA output files (*.lor) as the 2,394 gray matter pixel representation of 3-dimensional currents at different frequencies. The mean and standard deviation *.lor files were computed for each of the 2,394 gray matter pixels for each of the 43 subjects. Tests of Gaussianity and different transforms were computed in order to best approximate a normal distribution for each frequency and gray matter pixel. The relative sensitivity of parametric vs. non-parametric statistics were compared using a "leave-one-out" cross validation method in which individual normal subjects were withdrawn and then statistically classified as being either normal or abnormal based on the remaining subjects. Log10 transforms approximated Gaussian distribution in the range of 95% to 99% accuracy. Parametric Z score tests at P < .05 cross-validation demonstrated an average misclassification rate of approximately 4.25%, and range over the 2,394 gray matter pixels was 27.66% to 0.11%. At P < .01 parametric Z score cross-validation false positives were 0.26% and ranged from 6.65% to 0% false positives. The non-parametric Key Institute's t-max statistic at P < .05 had an average misclassification error rate of 7.64% and ranged from 43.37% to 0.04% false positives. 
The nonparametric t-max at P < .01 had an average misclassification rate of 6.67% and ranged from 41.34% to 0% false positives of the 2,394 gray matter pixels for any cross-validated normal subject. In conclusion, adequate approximation to Gaussian distribution and high cross-validation can be achieved by the Key Institute's LORETA programs by using a log10 transform and parametric statistics, and parametric normative comparisons had lower false positive rates than the non-parametric tests.
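The leave-one-out procedure described above (withdraw one normal subject, then classify that subject against the remaining ones) can be sketched for a single measurement as follows. This is an illustration only: the values are simulated Gaussian numbers rather than LORETA output, the function name `loo_false_positive_rate` is hypothetical, and a plain two-tailed Z cutoff stands in for the full per-pixel, per-frequency analysis.

```python
import random
from statistics import mean, stdev


def loo_false_positive_rate(values, z_crit=1.96):
    """Fraction of subjects flagged as abnormal under leave-one-out.

    Each value is held out in turn and converted to a Z score using the
    mean and standard deviation of the remaining values; |Z| > z_crit
    counts as a false positive when all subjects are in fact normal.
    """
    flagged = 0
    for i, held_out in enumerate(values):
        rest = values[:i] + values[i + 1:]
        z = (held_out - mean(rest)) / stdev(rest)
        if abs(z) > z_crit:
            flagged += 1
    return flagged / len(values)


# Simulated stand-in for one log-transformed measure in 43 normal subjects.
rng = random.Random(0)
simulated = [rng.gauss(0.0, 1.0) for _ in range(43)]
rate = loo_false_positive_rate(simulated)
print(f"leave-one-out false positive rate: {rate:.3f}")
```

With only 43 subjects the observed rate for a single measure is coarse (steps of 1/43), which is one reason the study averages over all 2,394 gray matter pixels.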
Santo, Ruth Miyuki; Ribeiro-Ferreira, Felipe; Alves, Milton Ruiz; Epstein, Jonathan; Novaes, Priscila
2015-04-01
To provide a reliable, validated, and culturally adapted instrument that may be used in monitoring dry eye in Brazilian patients and to discuss the strategies for the enhancement of the cross-cultural adaptation and validation process of a self-report measure for dry eye. The cross-cultural adaptation process (CCAP) of the original Ocular Surface Disease Index (OSDI) into Brazilian-Portuguese was conducted using a 9-step guideline. The synthesis of translations was tested twice, for face and content validity, by different subjects (focus groups and cognitive interviews). The expert committee contributed on several steps, and back translations were based on the final rather than the prefinal version. For validation, the adapted version was applied in a prospective longitudinal study to 101 patients from the Dry Eye Clinic at the General Hospital of the University of São Paulo, Brazil. Simultaneously to the OSDI, patients answered the short form-36 health survey (SF-36) and the 25-item visual function questionnaire (VFQ-25) and underwent clinical evaluation. Internal consistency, test-retest reliability, and measure validity were assessed. Cronbach's alpha value of the cross-culturally adapted Brazilian-Portuguese version of the OSDI was 0.905, and the intraclass correlation coefficient was 0.801. There was a statistically significant difference between OSDI scores in patients with dry eye (41.15 ± 27.40) and without dry eye (17.88 ± 17.09). There was a negative association between OSDI and VFQ-25 total score (P < 0.01) and between the OSDI and five SF-36 domains. OSDI scores correlated positively with lissamine green and fluorescein staining scores (P < 0.001) and negatively with Schirmer test I and tear break-up time values (P < 0.001). 
Although most of the reviewed guidelines on CCAP involve well-defined steps (translation, synthesis/reconciliation, back translation, expert committee review, pretesting), the proposed methodological steps have not been applied in a uniform way. The translation and adaptation process requires skill, knowledge, experience, and a considerable investment of time to maximize the attainment of semantic, idiomatic, experiential, and conceptual equivalence between the source and target questionnaires. A well-established guideline resulted in a culturally adapted Brazilian-Portuguese version of the OSDI, tested and validated on a sample of Brazilian population, and proved to be a valid and reliable instrument for assessing patients with dry eye syndrome in Brazil. Copyright © 2015 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Li, Cheng-Hsien
2012-01-01
Of the several measures of optimism presently available in the literature, the Life Orientation Test (LOT; Scheier & Carver, 1985) has been the most widely used in empirical research. This article explores, confirms, and cross-validates the factor structure of the Chinese version of the LOT with ordinal data by using robust weighted least…
ERIC Educational Resources Information Center
Flight, Ingrid H.; Wilson, Carlene J.; McGillivray, Jane; Myers, Ronald E.
2010-01-01
We investigated whether the five-factor structure of the Preventive Health Model for colorectal cancer screening, developed in the United States, has validity in Australia. We also tested extending the model with the addition of the factor Self-Efficacy to Screen using Fecal Occult Blood Test (SESFOBT). Randomly selected men and women aged between…
Development of 1-Mile Walk Tests to Estimate Aerobic Fitness in Children
ERIC Educational Resources Information Center
Sung, Hoyong; Collier, David N.; DuBose, Katrina D.; Kemble, C. David; Mahar, Matthew T.
2018-01-01
To examine the reliability and validity of 1-mile walk tests for estimation of aerobic fitness (VO[subscript 2max]) in 10- to 13-year-old children and to cross-validate previously published equations. Participants (n = 61) walked 1-mile on two different days. Self-reported physical activity, demographic variables, and aerobic fitness were used in…
Yapali, Gökmen; Günel, Mintaze Kerem; Karahan, Sevilay
2012-05-15
The study design was a cross-cultural adaptation and investigation of the reliability and validity of the Copenhagen Neck Functional Disability Scale (CNFDS). The aim of this study was to translate the CNFDS into Turkish and assess its reliability and validity among patients with neck pain in the Turkish population. The CNFDS is a reliable and valid evaluation instrument for disability, but there is no published Turkish version of the CNFDS. One hundred one subjects who had chronic neck pain were included in this study. The CNFDS, Neck Pain and Disability Scale, and visual analogue scale were administered to all subjects. Test-retest reliability was investigated by correlating CNFDS scores obtained at a 1-week interval; the intraclass correlation coefficient was 0.86 (95% confidence interval = 0.679-0.935). There was no difference between test-retest scores (P < 0.001). For investigating concurrent validity, correlation between total score of the CNFDS and the mean visual analogue scale was r = 0.73 (P < 0.001). Concurrent validity of the CNFDS was very good. For investigating construct validity, correlation between total score of the CNFDS and the Neck Pain and Disability Scale was r = 0.78 (P < 0.001). Construct validity of the CNFDS was also very good. Our results suggest that the Turkish version of the CNFDS is a reliable and valid instrument for Turkish people.
1989-04-20
International Business Machines Corporation, IBM Development System for the Ada Language, CMS/MVS Ada Cross Compiler, Version 2.1.1, Wright-Patterson AFB, IBM...VALIDATION SUMMARY REPORT: Certificate Number: 890420W1.10075 International Business Machines Corporation IBM Development System for the Ada Language CMS...command scripts provided by International Business Machines Corporation and reviewed by the validation team. The compiler was tested using all default
Estimation of Sensory Analysis Cupping Test Arabica Coffee Using NIR Spectroscopy
NASA Astrophysics Data System (ADS)
Safrizal; Sutrisno; Lilik, P. E. N.; Ahmad, U.; Samsudin
2018-05-01
Flavor has become the most important coffee quality parameter nowadays, and many coffee-consuming countries require certain taste scores for the coffee they order. The cupping method currently used for appraisal is the one designed by the Specialty Coffee Association of America (SCAA). Several previous studies found that Near-Infrared Spectroscopy (NIRS) can be used to detect the chemical composition of certain materials, including components associated with flavor, so it may also be applicable to coffee powder. The aim of this research is to obtain the correlation between the NIRS spectrum and the cupping scores given by testers, and then to examine the possibility of testing coffee sensory taste using the NIRS spectrum. The coffee samples were taken from various places, altitudes and postharvest handling methods and were prepared following the SCAA protocol. Sensory analysis was done in two ways: by an expert tester and by the NIRS test. Calibration between the two using PLS gave an RMSE of cross-validation of 6.14 without pretreatment and 5.43 with Multiplicative Scatter Correction of the spectra; the best RMSE of cross-validation, 1.73, was achieved with de-trending correction. NIRS can be used to predict the cupping score.
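The RMSE of cross-validation quoted above pools the squared prediction errors of every held-out sample across folds and takes the square root of their mean. The sketch below shows that calculation with a deliberately simple stand-in model: a one-predictor least-squares line replaces the PLS regression on full spectra used in the study, so that the example needs only the standard library. The names `fit_line` and `rmsecv` and the synthetic data are hypothetical.

```python
import math


def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def rmsecv(x, y, k=5):
    """Root mean squared error of k-fold cross-validation.

    Folds are interleaved (sample i goes to fold i mod k); every sample
    is predicted exactly once by a model that never saw it.
    """
    n = len(x)
    sq_err = 0.0
    for fold in range(k):
        test_idx = set(range(fold, n, k))
        x_tr = [x[i] for i in range(n) if i not in test_idx]
        y_tr = [y[i] for i in range(n) if i not in test_idx]
        slope, intercept = fit_line(x_tr, y_tr)
        sq_err += sum((y[i] - (slope * x[i] + intercept)) ** 2 for i in test_idx)
    return math.sqrt(sq_err / n)


# Synthetic example: a noiseless line, then the same line with +/-1 noise.
x = [float(i) for i in range(20)]
y_clean = [2.0 * xi + 1.0 for xi in x]
y_noisy = [yi + (1.0 if i % 2 else -1.0) for i, yi in enumerate(y_clean)]
print(rmsecv(x, y_clean), rmsecv(x, y_noisy))
```

Comparing pretreatments as in the abstract would mean running the same cross-validation loop on differently preprocessed spectra and comparing the resulting RMSE values.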
Benchmarking protein classification algorithms via supervised cross-validation.
Kertész-Farkas, Attila; Dhir, Somdutta; Sonego, Paolo; Pacurar, Mircea; Netoteia, Sergiu; Nijveen, Harm; Kuzniar, Arnold; Leunissen, Jack A M; Kocsor, András; Pongor, Sándor
2008-04-24
Development and testing of protein classification algorithms are hampered by the fact that the protein universe is characterized by groups vastly different in the number of members, in average protein size, similarity within group, etc. Datasets based on traditional cross-validation (k-fold, leave-one-out, etc.) may not give reliable estimates on how an algorithm will generalize to novel, distantly related subtypes of the known protein classes. Supervised cross-validation, i.e., selection of test and train sets according to the known subtypes within a database has been successfully used earlier in conjunction with the SCOP database. Our goal was to extend this principle to other databases and to design standardized benchmark datasets for protein classification. Hierarchical classification trees of protein categories provide a simple and general framework for designing supervised cross-validation strategies for protein classification. Benchmark datasets can be designed at various levels of the concept hierarchy using a simple graph-theoretic distance. A combination of supervised and random sampling was selected to construct reduced size model datasets, suitable for algorithm comparison. Over 3000 new classification tasks were added to our recently established protein classification benchmark collection that currently includes protein sequence (including protein domains and entire proteins), protein structure and reading frame DNA sequence data. We carried out an extensive evaluation based on various machine-learning algorithms such as nearest neighbor, support vector machines, artificial neural networks, random forests and logistic regression, used in conjunction with comparison algorithms, BLAST, Smith-Waterman, Needleman-Wunsch, as well as 3D comparison methods DALI and PRIDE. The resulting datasets provide lower, and in our opinion more realistic estimates of the classifier performance than do random cross-validation schemes. 
A combination of supervised and random sampling was used to construct model datasets, suitable for algorithm comparison.
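One way to read the supervised cross-validation idea above is as a leave-one-subtype-out split: all members of one known subtype of a class are held out for testing, and the classifier is trained on the remaining subtypes. The sketch below builds such splits for the positive examples only; how negatives are divided is left out, and the record layout, the function name `supervised_cv_splits`, and the toy labels are assumptions for illustration, not the benchmark's actual format.

```python
from collections import defaultdict


def supervised_cv_splits(records, target_class):
    """Yield leave-one-subtype-out splits for one class.

    records: iterable of (item_id, class_label, subtype_label).
    Yields (held_out_subtype, train_positive_ids, test_positive_ids),
    so the test positives always come from a subtype unseen in training.
    """
    by_subtype = defaultdict(list)
    for item_id, cls, subtype in records:
        if cls == target_class:
            by_subtype[subtype].append(item_id)
    for held_out in sorted(by_subtype):
        train_pos = [
            item_id
            for subtype in sorted(by_subtype)
            if subtype != held_out
            for item_id in by_subtype[subtype]
        ]
        yield held_out, train_pos, list(by_subtype[held_out])


# Hypothetical toy records: one class "A" with three subtypes.
toy = [
    ("p1", "A", "A.1"), ("p2", "A", "A.1"),
    ("p3", "A", "A.2"),
    ("p4", "A", "A.3"), ("p5", "A", "A.3"),
    ("p6", "B", "B.1"),
]
splits = list(supervised_cv_splits(toy, "A"))
for held_out, train_pos, test_pos in splits:
    print(held_out, train_pos, test_pos)
```

Compared with random k-fold splitting, this forces every test item to be only distantly related to the training items, which is why such schemes tend to give lower performance estimates.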
Igwesi-Chidobe, Chinonso N; Obiekwe, Chinwe; Sorinola, Isaac O; Godfrey, Emma L
2017-12-14
Cross-culturally adapt and validate the Igbo Roland Morris Disability Questionnaire. Cross-cultural adaptation, test-retest, and cross-sectional psychometric testing. Roland Morris Disability Questionnaire was forward and back translated by clinical/non-clinical translators. An expert committee appraised the translations. Twelve participants with chronic low back pain pre-tested the measure in a rural Nigerian community. Internal consistency using Cronbach's alpha; test-retest reliability using intra-class correlation coefficient and Bland-Altman plot; and minimal detectable change were investigated in a convenient sample of 50 people with chronic low back pain in rural and urban Nigeria. Pearson's correlation analyses using the eleven-point box scale and back performance scale, and exploratory factor analysis were used to examine construct validity in a random sample of 200 adults with chronic low back pain in rural Nigeria. Ceiling and floor effects were investigated in the two samples. Modifications gave the option of interviewer-administration and reflected Nigerian social context. The measure had excellent internal consistency (α = 0.91) and intraclass correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity, and a predominant uni-dimensional structure, with no ceiling or floor effects. Igbo Roland Morris Disability Questionnaire is a valid and reliable measure of pain-related disability. Implications for rehabilitation Low back pain is the leading cause of years lived with disability worldwide, and is particularly prevalent in rural Nigeria, but there are no self-report measures to assess its impact due to low literacy rates. This study describes the cross-cultural adaptation and validation of a core self-report back pain specific disability measure in a low-literate Nigerian population. 
The Igbo Roland Morris Disability Questionnaire is a reliable and valid measure of self-reported disability in Igbo populations as indicated by excellent internal consistency (α = 0.91) and intra-class correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity that supports a pain-related disability construct, a predominant one factor structure with no ceiling or floor effects. The measure will be useful for researchers and clinicians examining the factors associated with low back pain disability or the effects of interventions on low back pain disability in this culture. This measure will support global health initiatives concurrently involving people from several cultures or countries, and may inform cross-cultural disability research in other populations.
ERIC Educational Resources Information Center
Schmidt, Silke; Power, Mick
2006-01-01
Recent projects on international instrument development have produced a wide array of health indicators that may be used for cross-cultural field-testing, however more information on their cross-cultural performance in relation to health determinants is necessary. The current study approaches one step for international conceptual validation by…
Kerlinger's Criterial Referents Theory Revisited.
ERIC Educational Resources Information Center
Zak, Itai; Birenbaum, Menucha
1980-01-01
Kerlinger's criterial referents theory of attitudes was tested cross-culturally by administering an education attitude referents summated-rating scale to 713 individuals in Israel. The response pattern to criterial and noncriterial referents was examined. Results indicated empirical cross-cultural validity of theory, but questioned measuring…
Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran
2016-08-01
The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate the CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study, 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. Face and content validity were used to assess the questionnaire's validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. The test-retest data demonstrated that the PCCQ has excellent reliability (ICCs for all 3 domains were higher than 0.9). Internal consistency, measured by Cronbach's alpha, was 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores, respectively. In addition, the correlation between the components of the PCCQ and PSGRQ showed satisfactory construct validity. Analysis of the data from healthy subjects and patients showed that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing health-related quality of life status of Iranian COPD patients.
Glassmire, David M; Toofanian Ross, Parnian; Kinney, Dominique I; Nitch, Stephen R
2016-06-01
Two studies were conducted to identify and cross-validate cutoff scores on the Wechsler Adult Intelligence Scale-Fourth Edition Digit Span-based embedded performance validity (PV) measures for individuals with schizophrenia spectrum disorders. In Study 1, normative scores were identified on Digit Span-embedded PV measures among a sample of patients (n = 84) with schizophrenia spectrum diagnoses who had no known incentive to perform poorly and who put forth valid effort on external PV tests. Previously identified cutoff scores resulted in unacceptable false positive rates and lower cutoff scores were adopted to maintain specificity levels ≥90%. In Study 2, the revised cutoff scores were cross-validated within a sample of schizophrenia spectrum patients (n = 96) committed as incompetent to stand trial. Performance on Digit Span PV measures was significantly related to Full Scale IQ in both studies, indicating the need to consider the intellectual functioning of examinees with psychotic spectrum disorders when interpreting scores on Digit Span PV measures. © The Author(s) 2015.
Lohrer, Heinz; Nauck, Tanja
2009-10-30
Achilles tendinopathy is the predominant overuse injury in runners. To further investigate this overload injury in transverse and longitudinal studies a valid, responsive and reliable outcome measure is demanded. Most questionnaires have been developed for English-speaking populations. This is also true for the VISA-A score, so far representing the only valid, reliable, and disease specific questionnaire for Achilles tendinopathy. To internationally compare research results, to perform multinational studies or to exclude bias originating from subpopulations speaking different languages within one country an equivalent instrument is demanded in different languages. The aim of this study was therefore to cross-culturally adapt and validate the VISA-A questionnaire for German-speaking Achilles tendinopathy patients. According to the "guidelines for the process of cross-cultural adaptation of self-report measures" the VISA-A score was cross-culturally adapted into German (VISA-A-G) using six steps: Translation, synthesis, back translation, expert committee review, pretesting (n = 77), and appraisal of the adaptation process by an advisory committee determining the adequacy of the cross-cultural adaptation. The resulting VISA-A-G was then subjected to an analysis of reliability, validity, and internal consistency in 30 Achilles tendinopathy patients and 79 asymptomatic people. Concurrent validity was tested against a generic tendon grading system (Percy and Conochie) and against a classification system for the effect of pain on athletic performance (Curwin and Stanish). The "advisory committee" judged the translation of the VISA-A-G questionnaire to be "acceptable". The VISA-A-G questionnaire showed moderate to excellent test-retest reliability (ICC = 0.60 to 0.97). Concurrent validity showed good coherence when correlated with the grading system of Curwin and Stanish (rho = -0.95) and for the Percy and Conochie grade of severity (rho = 0.95).
Internal consistency (Cronbach's alpha) for the total VISA-A-G scores of the patients was calculated to be 0.737. The VISA-A questionnaire was successfully cross-culturally adapted and validated for use in German-speaking populations. The psychometric properties of the VISA-A-G questionnaire are similar to those of the original English version. It therefore can be recommended as a sufficiently robust tool for future measurement of the clinical severity of Achilles tendinopathy in German-speaking patients.
Lohrer, Heinz; Nauck, Tanja
2009-01-01
Background: Achilles tendinopathy is the predominant overuse injury in runners. To further investigate this overload injury in transverse and longitudinal studies a valid, responsive and reliable outcome measure is demanded. Most questionnaires have been developed for English-speaking populations. This is also true for the VISA-A score, so far representing the only valid, reliable, and disease specific questionnaire for Achilles tendinopathy. To internationally compare research results, to perform multinational studies or to exclude bias originating from subpopulations speaking different languages within one country an equivalent instrument is demanded in different languages. The aim of this study was therefore to cross-culturally adapt and validate the VISA-A questionnaire for German-speaking Achilles tendinopathy patients. Methods: According to the "guidelines for the process of cross-cultural adaptation of self-report measures" the VISA-A score was cross-culturally adapted into German (VISA-A-G) using six steps: Translation, synthesis, back translation, expert committee review, pretesting (n = 77), and appraisal of the adaptation process by an advisory committee determining the adequacy of the cross-cultural adaptation. The resulting VISA-A-G was then subjected to an analysis of reliability, validity, and internal consistency in 30 Achilles tendinopathy patients and 79 asymptomatic people. Concurrent validity was tested against a generic tendon grading system (Percy and Conochie) and against a classification system for the effect of pain on athletic performance (Curwin and Stanish). Results: The "advisory committee" judged the translation of the VISA-A-G questionnaire to be "acceptable". The VISA-A-G questionnaire showed moderate to excellent test-retest reliability (ICC = 0.60 to 0.97). Concurrent validity showed good coherence when correlated with the grading system of Curwin and Stanish (rho = -0.95) and for the Percy and Conochie grade of severity (rho = 0.95).
Internal consistency (Cronbach's alpha) for the total VISA-A-G scores of the patients was calculated to be 0.737. Conclusion: The VISA-A questionnaire was successfully cross-culturally adapted and validated for use in German-speaking populations. The psychometric properties of the VISA-A-G questionnaire are similar to those of the original English version. It therefore can be recommended as a sufficiently robust tool for future measurement of the clinical severity of Achilles tendinopathy in German-speaking patients. PMID:19878572
de Klerk, Susan; Buchanan, Helen; Jerosch-Herold, Christina
Systematic review. The Disabilities of the Arm Shoulder and Hand Questionnaire has multiple language versions from many countries around the world. In addition there is extensive research evidence of its psychometric properties. The purpose of this study was to systematically review the evidence available on the validity and clinical utility of the Disabilities of the Arm Shoulder and Hand as a measure of activity and participation in patients with musculoskeletal hand injuries in developing country contexts. We registered the review with international prospective register of systematic reviews prior to conducting a comprehensive literature search and extracting descriptive data. Two reviewers independently assessed methodological quality with the Consensus-Based Standards for the Selection of Health Measurement Instruments critical appraisal tool, the checklist to operationalize measurement characteristics of patient-rated outcome measures and the multidimensional model of clinical utility. Fourteen studies reporting 12 language versions met the eligibility criteria. Two language versions (Persian and Turkish) had an overall rating of good, and one (Thai) had an overall rating of excellent for cross-cultural validity. The remaining 9 language versions had an overall poor rating for cross-cultural validity. Content and construct validity and clinical utility yielded similar results. Poor quality ratings for validity and clinical utility were due to insufficient documentation of results and inadequate psychometric testing. With the increase in migration and globalization, hand therapists are likely to require a range of culturally adapted and translated versions of the Disabilities of the Arm Shoulder and Hand. Recommendations include rigorous application and reporting of cross-cultural adaptation, appropriate psychometric testing, and testing of clinical utility in routine clinical practice. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. 
All rights reserved.
Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier
Translation and validation of algo-functional questionnaire. The lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked twice to fill in the PRTEE-F, and once the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36). Internal consistency (using Cronbach's alpha), test-retest reliability (using intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using the Spearman's correlation coefficients respectively with the DASH and with some subscales of the SF-36) were assessed. The PRTEE was translated into French without any problems. PRTEE-F showed a good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and a high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French. 
The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Validation of Hindi translation of DSM-5 Level 1 Cross-Cutting Symptom Measure.
Goel, Ankit; Kataria, Dinesh
2018-04-01
The DSM-5 Level 1 Cross-Cutting Symptom Measure is a self- or informant-rated measure that assesses mental health domains which are important across psychiatric diagnoses. The absence of this self- or informant-administered instrument in Hindi, which is a major language in India, is an important limitation in using this scale. To translate the English version of the DSM-5 Level 1 Cross-Cutting Symptom Measure to Hindi and evaluate its psychometric properties. The study was conducted at a tertiary care hospital in Delhi. The DSM-5 Level 1 Cross-Cutting Symptom Measure was translated into Hindi using the World Health Organization's translation methodology. Mean and standard deviation were evaluated for continuous variables while for categorical variables frequency and percentages were calculated. The translated version was evaluated for cross-language equivalence, test-retest reliability, internal consistency, and split half reliability. Hindi version was found to have good cross-language equivalence and test-retest reliability at the level of items and domains. Twenty two of the 23 items and all the 23 items had a significant correlation (ρ < 0.001) in cross language concordance and test-retest reliability data, respectively. The Cronbach's alpha was 0.95, and the Spearman-Brown Sphericity value was 0.79 for the Hindi version. The present study shows that cross-language concordance, internal consistency, split-half reliability, and test-retest reliability of the Hindi version of the measure are excellent. Thus, the Hindi version of DSM-5 Level 1 Cross-Cutting Symptom Measure as translated in this study is a valid instrument. Copyright © 2018 Elsevier B.V. All rights reserved.
Boer, Annemarie; Dutmer, Alisa L; Schiphorst Preuper, Henrica R; van der Woude, Lucas H V; Stewart, Roy E; Deyo, Richard A; Reneman, Michiel F; Soer, Remko
2017-10-01
Validation study with cross-sectional and longitudinal measurements. To translate the US National Institutes of Health (NIH)-minimal dataset for clinical research on chronic low back pain into the Dutch language and to test its validity and reliability among people with chronic low back pain. The NIH developed a minimal dataset to encourage more complete and consistent reporting of clinical research and to be able to compare studies across countries in patients with low back pain. In the Netherlands, the NIH-minimal dataset has not been translated before and measurement properties are unknown. Cross-cultural validity was tested by a formal forward-backward translation. Structural validity was tested with exploratory factor analyses (comparative fit index, Tucker-Lewis index, and root mean square error of approximation). Hypothesis testing was performed to compare subscales of the NIH dataset with the Pain Disability Index and the EuroQol-5D (Pearson correlation coefficients). Internal consistency was tested with Cronbach α and test-retest reliability at 2 weeks was calculated in a subsample of patients with Intraclass Correlation Coefficients and weighted Kappa (κω). In total, 452 patients were included, of which 52 were included for the test-retest study. Factor analysis for structural validity pointed in the direction of a seven-factor model (Cronbach α = 0.78). Factors and total score of the NIH-minimal dataset showed fair to good correlations with the Pain Disability Index (r = 0.43-0.70) and EuroQol-5D (r = -0.41 to -0.64). Reliability: test-retest reliability per item showed substantial agreement (κω = 0.65). Test-retest reliability per factor was moderate to good (Intraclass Correlation Coefficient = 0.71). The measurement properties of the Dutch language version of the NIH-minimal dataset were satisfactory.
Iversen, J V; Bartels, E M; Jørgensen, J E; Nielsen, T G; Ginnerup, C; Lind, M C; Langberg, H
2016-12-01
The VISA-A questionnaire has proven to be a valid and reliable tool for assessing severity of Achilles tendinopathy (AT). The aim was to translate and cross-culturally adapt the VISA-A questionnaire for a Danish-speaking AT population, and subsequently perform validity and reliability tests. Translation and subsequent cross-cultural adaptation were performed as translation, synthesis, reverse translation, expert review, and pretesting. The final Danish version (VISA-A-DK) was tested for reliability on healthy controls (n = 75) and patients (n = 36). Tests for internal consistency, validity, and structure were performed on 71 patients. VISA-A-DK showed good reliability for patients (r = 0.80, ICC = 0.79) and healthy individuals (r = 0.98, ICC = 0.97). Internal consistency was 0.73 (Cronbach's alpha). The mean VISA-A-DK score in AT patients was 51 (47-55). This was significantly lower than that of healthy controls, who had a score of 93 (90-95). Criterion validity was considered good when comparing the scores of the Danish version with the original version in both healthy individuals and patients. VISA-A-DK is a valid and reliable instrument and has been shown to be comparable to the original version in the assessment of AT patients. VISA-A-DK is a useful tool in the assessment of AT, both in research and in a clinical setting. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Coster, Wendy J; Haley, Stephen M; Ni, Pengsheng; Dumas, Helene M; Fragala-Pinkham, Maria A
2008-04-01
To examine score agreement, validity, precision, and response burden of a prototype computer adaptive testing (CAT) version of the self-care and social function scales of the Pediatric Evaluation of Disability Inventory compared with the full-length version of these scales. Computer simulation analysis of cross-sectional and longitudinal retrospective data; cross-sectional prospective study. Pediatric rehabilitation hospital, including inpatient acute rehabilitation, day school program, outpatient clinics; community-based day care, preschool, and children's homes. Children with disabilities (n=469) and 412 children with no disabilities (analytic sample); 38 children with disabilities and 35 children without disabilities (cross-validation sample). Not applicable. Summary scores from prototype CAT applications of each scale using 15-, 10-, and 5-item stopping rules; scores from the full-length self-care and social function scales; time (in seconds) to complete assessments and respondent ratings of burden. Scores from both computer simulations and field administration of the prototype CATs were highly consistent with scores from full-length administration (r range, .94-.99). Using computer simulation of retrospective data, discriminant validity, and sensitivity to change of the CATs closely approximated that of the full-length scales, especially when the 15- and 10-item stopping rules were applied. In the cross-validation study the time to administer both CATs was 4 minutes, compared with over 16 minutes to complete the full-length scales. Self-care and social function score estimates from CAT administration are highly comparable with those obtained from full-length scale administration, with small losses in validity and precision and substantial decreases in administration time.
NASA Astrophysics Data System (ADS)
Mundava, C.; Helmholz, P.; Schut, A. G. T.; Corner, R.; McAtee, B.; Lamb, D. W.
2014-09-01
The objective of this paper is to test the relationships between Above Ground Biomass (AGB) and remotely sensed vegetation indices for AGB assessments in the Kimberley area in Western Australia. For 19 different sites, vegetation indices were derived from eight Landsat ETM+ scenes over a period of two years (2011-2013). The sites were divided into three groups (Open plains, Bunch grasses and Spinifex) based on similarities in dominant vegetation types. Dry and green biomass fractions were measured at these sites. Single and multiple regression relationships between vegetation indices and green and total AGB were calibrated and validated using a "leave site out" cross validation. Four tests were compared: (1) relationships between AGB and vegetation indices combining all sites; (2) separate relationships per site group; (3) multiple regressions including selected vegetation indices per site group; and (4) as in 3 but including rainfall and elevation data. Results indicate that relationships based on single vegetation indices are moderately accurate for green biomass in wide open plains covered with annual grasses. The cross-validation results for green AGB improved for a combination of indices for the Open plains and Bunch grasses sites, but not for Spinifex sites. When rainfall and elevation data are included, cross validation improved slightly with a Q2 of 0.49-0.72 for Open plains and Bunch grasses sites respectively. Cross validation results for total AGB were moderately accurate (Q2 of 0.41) for Open plains but weak or absent for other site groups despite good calibration results, indicating strong influence of site-specific factors.
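The "leave site out" cross validation described above can be sketched as follows: each site is held out in turn, a regression is fitted on the remaining sites, and the squared prediction errors on the held-out site are accumulated into Q2 = 1 - PRESS/TSS. This is a minimal sketch with a single predictor; the function names, the choice of the overall mean for TSS, and the data are assumptions made for illustration, not the authors' implementation.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def leave_site_out_q2(records):
    """records: list of (site, vegetation_index, biomass) tuples."""
    sites = sorted({site for site, _, _ in records})
    press = 0.0  # predictive residual sum of squares
    for held_out in sites:
        train = [(x, y) for site, x, y in records if site != held_out]
        test = [(x, y) for site, x, y in records if site == held_out]
        slope, intercept = fit_line(*zip(*train))
        press += sum((y - (slope * x + intercept)) ** 2 for x, y in test)
    mean_y = sum(y for _, _, y in records) / len(records)
    tss = sum((y - mean_y) ** 2 for _, _, y in records)
    return 1.0 - press / tss


# Hypothetical (site, index, biomass) measurements, illustrative only.
demo = [
    ("site1", 0.20, 410.0), ("site1", 0.35, 690.0),
    ("site2", 0.25, 560.0), ("site2", 0.40, 820.0),
    ("site3", 0.30, 540.0), ("site3", 0.45, 950.0),
]
print(round(leave_site_out_q2(demo), 3))
```

Holding out whole sites rather than single samples keeps site-specific effects out of the training data, which is why calibration can look good while Q2 stays low.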
Rational selection of training and test sets for the development of validated QSAR models
NASA Astrophysics Data System (ADS)
Golbraikh, Alexander; Shen, Min; Xiao, Zhiyan; Xiao, Yun-De; Lee, Kuo-Hsiung; Tropsha, Alexander
2003-02-01
Quantitative Structure-Activity Relationship (QSAR) models are used increasingly to screen chemical databases and/or virtual chemical libraries for potentially bioactive molecules. These developments emphasize the importance of rigorous model validation to ensure that the models have acceptable predictive power. Using k nearest neighbors (kNN) variable selection QSAR method for the analysis of several datasets, we have demonstrated recently that the widely accepted leave-one-out (LOO) cross-validated R2 (q2) is an inadequate characteristic to assess the predictive ability of the models [Golbraikh, A., Tropsha, A. Beware of q2! J. Mol. Graphics Mod. 20, 269-276, (2002)]. Herein, we provide additional evidence that there exists no correlation between the values of q2 for the training set and accuracy of prediction (R2) for the test set and argue that this observation is a general property of any QSAR model developed with LOO cross-validation. We suggest that external validation using rationally selected training and test sets provides a means to establish a reliable QSAR model. We propose several approaches to the division of experimental datasets into training and test sets and apply them in QSAR studies of 48 functionalized amino acid anticonvulsants and a series of 157 epipodophyllotoxin derivatives with antitumor activity. We formulate a set of general criteria for the evaluation of predictive power of QSAR models.
Testing and Validating Machine Learning Classifiers by Metamorphic Testing
Xie, Xiaoyuan; Ho, Joshua W. K.; Murphy, Christian; Kaiser, Gail; Xu, Baowen; Chen, Tsong Yueh
2011-01-01
Machine Learning algorithms have provided core functionality to many application domains, such as bioinformatics and computational linguistics. However, it is difficult to detect faults in such applications because often there is no “test oracle” to verify the correctness of the computed outputs. To help address software quality, in this paper we present a technique for testing the implementations of machine learning classification algorithms which support such applications. Our approach is based on the technique of “metamorphic testing”, which has been shown to be effective in alleviating the oracle problem. We also present a case study on a real-world machine learning application framework, and a discussion of how programmers implementing machine learning algorithms can avoid the common pitfalls discovered in our study. We also conduct mutation analysis and cross-validation, which reveal that our method has high effectiveness in killing mutants, and that observing the expected cross-validation result alone is not sufficiently effective to detect faults in a supervised classification program. The effectiveness of metamorphic testing is further confirmed by the detection of real faults in a popular open-source classification program. PMID:21532969
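To make the idea of a metamorphic relation concrete, the sketch below checks one plausible relation for a k-nearest-neighbours classifier: reordering the attributes consistently in the training data and in the queries should leave every prediction unchanged. The classifier, the relation chosen, and all names here are assumptions for illustration; the abstract does not list the specific relations the authors used.

```python
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Toy kNN classifier using squared Euclidean distance on integer features."""
    # Sort by (distance, original index) so ties are broken deterministically.
    neighbours = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, query)), index)
        for index, row in enumerate(train_x)
    )
    votes = Counter(train_y[index] for _, index in neighbours[:k])
    # Most votes wins; equal vote counts fall back to the smaller label.
    return min(votes, key=lambda label: (-votes[label], label))


def permute(row, order):
    """Reorder the attributes of one sample."""
    return [row[j] for j in order]


def attribute_permutation_holds(train_x, train_y, queries, order, k=3):
    """Metamorphic relation: source and follow-up predictions must agree."""
    source = [knn_predict(train_x, train_y, q, k) for q in queries]
    permuted_train = [permute(row, order) for row in train_x]
    follow_up = [
        knn_predict(permuted_train, train_y, permute(q, order), k) for q in queries
    ]
    return source == follow_up


# Hypothetical data set, illustrative only.
train_x = [[0, 0, 1], [1, 0, 0], [5, 5, 4], [6, 5, 5], [0, 1, 0]]
train_y = ["a", "a", "b", "b", "a"]
queries = [[1, 1, 1], [5, 6, 5]]
print(attribute_permutation_holds(train_x, train_y, queries, order=[2, 0, 1]))
```

No oracle for the "correct" label is needed: a violation of the relation signals a fault even when the expected output of any single run is unknown.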
Optimal Combinations of Diagnostic Tests Based on AUC.
Huang, Xin; Qin, Gengsheng; Fang, Yixin
2011-06-01
When several diagnostic tests are available, one can combine them to achieve better diagnostic accuracy. This article considers the optimal linear combination that maximizes the area under the receiver operating characteristic curve (AUC); the estimates of the combination's coefficients can be obtained via a nonparametric procedure. However, for estimating the AUC associated with the estimated coefficients, the apparent estimation by re-substitution is too optimistic. To adjust for the upward bias, several methods are proposed. Among them the cross-validation approach is especially advocated, and an approximated cross-validation is developed to reduce the computational cost. Furthermore, these proposed methods can be applied for variable selection to select important diagnostic tests. The proposed methods are examined through simulation studies and applications to three real examples. © 2010, The International Biometric Society.
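For orientation, the quantity being maximized above can be sketched as the empirical AUC (the Mann-Whitney statistic) of a linear combination of test results, evaluated for a fixed coefficient vector. The sketch does not implement the authors' nonparametric estimation or their cross-validation correction; the function names, weights, and data are hypothetical.

```python
def empirical_auc(diseased_scores, healthy_scores):
    """Fraction of (diseased, healthy) pairs ranked correctly; ties count 1/2."""
    wins = 0.0
    for d in diseased_scores:
        for h in healthy_scores:
            if d > h:
                wins += 1.0
            elif d == h:
                wins += 0.5
    return wins / (len(diseased_scores) * len(healthy_scores))


def combine(test_results, weights):
    """Linear combination score for each subject's vector of test results."""
    return [sum(w * t for w, t in zip(weights, row)) for row in test_results]


# Hypothetical results of two diagnostic tests per subject (illustrative only).
diseased = [[2.1, 0.9], [1.4, 1.8], [2.6, 1.1], [1.0, 2.2]]
healthy = [[1.2, 0.7], [0.8, 1.0], [1.9, 0.4], [0.6, 1.5]]

for weights in ([1.0, 0.0], [0.0, 1.0], [1.0, 1.0]):
    auc = empirical_auc(combine(diseased, weights), combine(healthy, weights))
    print(weights, round(auc, 3))
```

Evaluating the AUC on the same subjects used to choose the weights is the re-substitution estimate that the abstract describes as too optimistic, which is the motivation for estimating it on held-out subjects instead.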
Torabi, Hadi; Khoddami, Seyyedeh Maryam; Ansari, Noureddin Nakhostin; Dabirmoghaddam, Payman
2016-11-01
To cross-culturally adapt the Vocal Tract Discomfort (VTD) scale into Persian (VTDp) and evaluate its validity and reliability in the assessment of patients with muscle tension dysphonia (MTD). A cross-sectional and prospective cohort design was used to psychometrically test the VTDp. The VTD scale was cross-culturally adapted into the Persian language following standard forward-backward translations. The VTDp scale was administered to 100 patients with MTD (54 men and 46 women; mean age: 38.05 ± 10.02 years) and 50 healthy volunteers (26 men and 24 women; mean age: 36.50 ± 12.27 years). Forty-five patients with MTD completed the VTDp 7 days later for test-retest reliability. Patients also completed the Persian Voice Handicap Index (VHIp) to assess construct validity. The results of discriminative validity demonstrated that the VTDp was able to discriminate between patients with MTD and healthy participants. The internal consistency was confirmed with Cronbach α of 0.77 and 0.73 for the VTDp frequency and severity subscales, respectively. The test-retest reliability was excellent with an intraclass correlation coefficient (ICC agreement) of 0.93 for the frequency subscale and 0.91 for the severity subscale. Construct validity of the VTDp was shown with significant correlations between the VTDp frequency and severity subscales and the VHIp total scores (0.36 and 0.37, respectively). The standard error of measurement and smallest detectable change values for VTDp frequency (2.11 and 5.85, respectively) and severity (2.25 and 6.23, respectively) were acceptable. The Bland-Altman analysis for assessing the agreement between test and retest measurements showed no systematic bias. The VTDp is a valid and reliable self-administered scale to measure patients' vocal tract sensations in the Persian-speaking population. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
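The relation between the standard error of measurement (SEM) and the smallest detectable change (SDC) can be sketched with the conventional formulas SEM = SD * sqrt(1 - ICC) and SDC = 1.96 * sqrt(2) * SEM. The abstract does not state which formulas were used, so these are assumptions; with them, the reported SEM of 2.11 gives an SDC of about 5.85, and 2.25 gives about 6.24 (reported as 6.23, a difference consistent with rounding of the SEM).

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the score SD and the ICC."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level: z * sqrt(2) * SEM (95% level by default)."""
    return z * math.sqrt(2.0) * sem


# SEM values as reported in the abstract for the two subscales.
for label, sem in (("frequency", 2.11), ("severity", 2.25)):
    print(label, round(smallest_detectable_change(sem), 2))
```

The factor sqrt(2) reflects that a change score combines the measurement error of two administrations.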
Hall, Brian J.; Puffer, Eve; Murray, Laura K.; Ismael, Abdulkadir; Bass, Judith K.; Sim, Amanda; Bolton, Paul A.
2014-01-01
Assessing mental health problems cross-culturally for children exposed to war and violence presents a number of unique challenges. One of the most important issues is the lack of validated symptom measures to assess these problems. The present study sought to evaluate the psychometric properties of two measures to assess mental health problems: the Achenbach Youth Self-Report and the Child Posttraumatic Stress Disorder Symptom Scale. We conducted a validity study in three refugee camps in Eastern Ethiopia in the outskirts of Jijiga, the capital of the Somali region. A total of 147 child and caregiver pairs were assessed, and scores obtained were submitted to rigorous psychometric evaluation. Excellent internal consistency reliability was obtained for symptom measures for children and their caregivers. Validation of study instruments based on local case definitions was obtained for the caregivers but not consistently for the children. Sensitivity and specificity of study measures were generally low, indicating that these scales would not perform adequately as screening instruments. Combined test-retest and inter-rater reliability was low for all scales. This study illustrates the need for validation and testing of existing measures cross-culturally. Methodological implications for future cross-cultural research studies in low- and middle-income countries are discussed. PMID:24955147
ERIC Educational Resources Information Center
Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A.
2012-01-01
The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
de los Santos, Gonzalo; Reyes, Pablo; del Castillo, Raúl; Fragola, Claudio; Royuela, Ana
2015-11-01
Our objective was to perform translation, cross-cultural adaptation and validation of the sino-nasal outcome test 22 (SNOT-22) into Spanish. SNOT-22 was translated, back translated, and a pretest trial was performed. The study included 119 individuals divided into 60 cases, who met diagnostic criteria for chronic rhinosinusitis according to the European Position Paper on Rhinosinusitis 2012; and 59 controls, who reported no sino-nasal disease. Internal consistency was evaluated with Cronbach's alpha test, reproducibility with Kappa coefficient, reliability with intraclass correlation coefficient (ICC), validity with Mann-Whitney U test and responsiveness with Wilcoxon test. In cases, Cronbach's alpha was 0.91 both before and after treatment; for controls, it was 0.90 at their first test assessment and 0.88 at 3 weeks. Kappa coefficient was calculated for each item, with an average score of 0.69. ICC was also calculated for each item, with a score of 0.87 for the overall score and an average among all items of 0.71. The median score was 47 for cases and 2 for controls, and the difference was highly significant (Mann-Whitney U test, p < 0.001). Clinical changes were observed among treated patients, with a median score of 47 and 13.5 before and after treatment, respectively (Wilcoxon test, p < 0.001). The effect size was 0.14 in treated patients whose status at 3 weeks was unchanged, 1.03 in those who were better and 1.89 for the much better group. All controls were unchanged, with an effect size of 0.05. The Spanish version of the SNOT-22 has the internal consistency, reliability, reproducibility, validity and responsiveness necessary to be a valid instrument to be used in clinical practice.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pražnikar, Jure; Turk, Dušan
2014-12-01
The maximum-likelihood free-kick target, which calculates model error estimates from the work set and a randomly displaced model, proved superior in the accuracy and consistency of refinement of crystal structures compared with the maximum-likelihood cross-validation target, which calculates error estimates from the test set and the unperturbed model. The refinement of a molecular model is a computational procedure by which the atomic model is fitted to the diffraction data. The commonly used target in the refinement of macromolecular structures is the maximum-likelihood (ML) function, which relies on the assessment of model errors. The current ML functions rely on cross-validation. They utilize phase-error estimates that are calculated from a small fraction of diffraction data, called the test set, that are not used to fit the model. An approach has been developed that uses the work set to calculate the phase-error estimates in the ML refinement from simulating the model errors via the random displacement of atomic coordinates. It is called ML free-kick refinement as it uses the ML formulation of the target function and is based on the idea of freeing the model from the model bias imposed by the chemical energy restraints used in refinement. This approach for the calculation of error estimates is superior to the cross-validation approach: it reduces the phase error and increases the accuracy of molecular models, is more robust, provides clearer maps and may use a smaller portion of data for the test set for the calculation of Rfree or may leave it out completely.
Integration and validation testing for PhEDEx, DBS and DAS with the PhEDEx LifeCycle agent
NASA Astrophysics Data System (ADS)
Boeser, C.; Chwalek, T.; Giffels, M.; Kuznetsov, V.; Wildish, T.
2014-06-01
The ever-increasing amount of data handled by the CMS dataflow and workflow management tools poses new challenges for cross-validation among different systems within the CMS experiment at the LHC. To approach this problem we developed an integration test suite based on the LifeCycle agent, a tool originally conceived for stress-testing new releases of PhEDEx, the CMS data-placement tool. The LifeCycle agent provides a framework for customising the test workflow in arbitrary ways, and can scale to levels of activity well beyond those seen in normal running. This means we can run realistic performance tests at scales not likely to be seen by the experiment for some years, or with custom topologies to examine particular situations that may cause concern some time in the future. The LifeCycle agent has recently been enhanced to become a general purpose integration and validation testing tool for major CMS services. It allows cross-system integration tests of all three components to be performed in controlled environments, without interfering with production services. In this paper we discuss the design and implementation of the LifeCycle agent. We describe how it is used for small-scale debugging and validation tests, and how we extend that to large-scale tests of whole groups of sub-systems. We show how the LifeCycle agent can emulate the action of operators, physicists, or software agents external to the system under test, and how it can be scaled to large and complex systems.
Predicting protein-binding regions in RNA using nucleotide profiles and compositions.
Choi, Daesik; Park, Byungkyu; Chae, Hanju; Lee, Wook; Han, Kyungsook
2017-03-14
Motivated by the increased amount of data on protein-RNA interactions and the availability of complete genome sequences of several organisms, many computational methods have been proposed to predict binding sites in protein-RNA interactions. However, most computational methods are limited to finding RNA-binding sites in proteins instead of protein-binding sites in RNAs. Predicting protein-binding sites in RNA is more challenging than predicting RNA-binding sites in proteins. Recent computational methods for finding protein-binding sites in RNAs have several drawbacks for practical use. We developed a new support vector machine (SVM) model for predicting protein-binding regions in mRNA sequences. The model uses sequence profiles constructed from log-odds scores of mono- and di-nucleotides and nucleotide compositions. The model was evaluated by standard 10-fold cross validation, leave-one-protein-out (LOPO) cross validation and independent testing. Since actual mRNA sequences have more non-binding regions than protein-binding regions, we tested the model on several datasets with different ratios of protein-binding regions to non-binding regions. The best performance of the model was obtained in a balanced dataset of positive and negative instances. 10-fold cross validation with a balanced dataset achieved a sensitivity of 91.6%, a specificity of 92.4%, an accuracy of 92.0%, a positive predictive value (PPV) of 91.7%, a negative predictive value (NPV) of 92.3% and a Matthews correlation coefficient (MCC) of 0.840. LOPO cross validation showed a lower performance than the 10-fold cross validation, but the performance remains high (87.6% accuracy and 0.752 MCC). In testing the model on independent datasets, it achieved an accuracy of 82.2% and an MCC of 0.656. Testing of our model and other state-of-the-art methods on the same dataset showed that our model is better than the others.
Sequence profiles of log-odds scores of mono- and di-nucleotides were much more powerful features than nucleotide compositions in finding protein-binding regions in RNA sequences. But, a slight performance gain was obtained when using the sequence profiles along with nucleotide compositions. These are preliminary results of ongoing research, but demonstrate the potential of our approach as a powerful predictor of protein-binding regions in RNA. The program and supporting data are available at http://bclab.inha.ac.kr/RBPbinding .
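As a quick consistency check on the balanced-dataset figures quoted in this abstract, the MCC and accuracy can be recomputed from the reported sensitivity and specificity. The following is a minimal sketch, assuming exactly equal numbers of positive and negative instances (our reading of "balanced"); the function name `mcc_from_rates` and the `pos_fraction` parameter are illustrative and are not taken from the authors' program.

```python
import math

def mcc_from_rates(sensitivity, specificity, pos_fraction=0.5):
    """Matthews correlation coefficient from sensitivity, specificity and class balance.

    pos_fraction is the assumed share of positive instances (0.5 = balanced).
    """
    neg_fraction = 1.0 - pos_fraction
    tp = sensitivity * pos_fraction          # true positives, as a fraction of all instances
    fn = (1.0 - sensitivity) * pos_fraction  # false negatives
    tn = specificity * neg_fraction          # true negatives
    fp = (1.0 - specificity) * neg_fraction  # false positives
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

# Balanced 10-fold cross-validation figures quoted in the abstract.
mcc = mcc_from_rates(0.916, 0.924)
accuracy = 0.5 * 0.916 + 0.5 * 0.924
print(round(mcc, 3), round(accuracy, 3))
```

Under the balanced assumption, the recomputed MCC and accuracy agree with the reported 0.840 and 92.0% to three decimal places.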
A verification library for multibody simulation software
NASA Technical Reports Server (NTRS)
Kim, Sung-Soo; Haug, Edward J.; Frisch, Harold P.
1989-01-01
A multibody dynamics verification library that maintains and manages test and validation data is proposed, based on RRC Robot arm and CASE backhoe validation and a comparative study of DADS, DISCOS, and CONTOPS, which are existing public domain and commercial multibody dynamic simulation programs. Using simple representative problems, simulation results from each program are cross-checked, and the validation results are presented. Functionalities of the verification library are defined in order to automate the validation procedure.
Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise
2017-12-02
Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test-retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ² (df) = 969.46 (546), p < 0.001; Root Mean Square Error of Approximation (RMSEA) = 0.056; Tucker-Lewis Index (TLI) = 0.88; Comparative Fit Index (CFI) = 0.89), which indicates that each set of the items that was supposed to accompany each teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852.
The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
Correcting for Optimistic Prediction in Small Data Sets
Smith, Gordon C. S.; Seaman, Shaun R.; Wood, Angela M.; Royston, Patrick; White, Ian R.
2014-01-01
The C statistic is a commonly reported measure of screening test performance. Optimistic estimation of the C statistic is a frequent problem because of overfitting of statistical models in small data sets, and methods exist to correct for this issue. However, many studies do not use such methods, and those that do correct for optimism use diverse methods, some of which are known to be biased. We used clinical data sets (United Kingdom Down syndrome screening data from Glasgow (1991–2003), Edinburgh (1999–2003), and Cambridge (1990–2006), as well as Scottish national pregnancy discharge data (2004–2007)) to evaluate different approaches to adjustment for optimism. We found that sample splitting, cross-validation without replication, and leave-1-out cross-validation produced optimism-adjusted estimates of the C statistic that were biased and/or associated with greater absolute error than other available methods. Cross-validation with replication, bootstrapping, and a new method (leave-pair-out cross-validation) all generated unbiased optimism-adjusted estimates of the C statistic and had similar absolute errors in the clinical data set. Larger simulation studies confirmed that all 3 methods performed similarly with 10 or more events per variable, or when the C statistic was 0.9 or greater. However, with lower events per variable or lower C statistics, bootstrapping tended to be optimistic but with lower absolute and mean squared errors than both methods of cross-validation. PMID:24966219
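The abstract names leave-pair-out cross-validation without describing it. The general idea, as we understand it, is to hold out one case and one control at a time, refit the model on the remaining data, and record whether the held-out case is ranked above the held-out control; the C statistic is the average over all pairs. The following is a minimal sketch under that assumption. The one-feature "model" (`fit_direction`) is a hypothetical stand-in for whatever screening model is being evaluated, and the data are invented.

```python
from statistics import mean

def fit_direction(xs, ys):
    """Toy 1-D model (hypothetical): +1 if cases have the higher mean, else -1."""
    cases = [x for x, y in zip(xs, ys) if y == 1]
    controls = [x for x, y in zip(xs, ys) if y == 0]
    return 1.0 if mean(cases) >= mean(controls) else -1.0

def leave_pair_out_c(xs, ys):
    """C statistic by leave-pair-out cross-validation.

    Requires at least two cases (y == 1) and two controls (y == 0), so that
    each refit still sees both classes.
    """
    case_idx = [i for i, y in enumerate(ys) if y == 1]
    control_idx = [i for i, y in enumerate(ys) if y == 0]
    concordance = []
    for i in case_idx:
        for j in control_idx:
            # Refit on everything except the held-out case/control pair.
            keep = [k for k in range(len(xs)) if k not in (i, j)]
            direction = fit_direction([xs[k] for k in keep], [ys[k] for k in keep])
            score_case = direction * xs[i]
            score_control = direction * xs[j]
            if score_case > score_control:
                concordance.append(1.0)   # pair ranked correctly
            elif score_case == score_control:
                concordance.append(0.5)   # tie counts as half
            else:
                concordance.append(0.0)   # pair ranked incorrectly
    return mean(concordance)

# Invented, perfectly separable data: three controls, three cases.
xs_demo = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
ys_demo = [0, 0, 0, 1, 1, 1]
c_demo = leave_pair_out_c(xs_demo, ys_demo)
```

Because neither member of a pair is ever used to fit the model that scores it, the estimate avoids the optimism of scoring the training data directly, at the cost of one refit per case/control pair.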
PIV Measurements of the CEV Hot Abort Motor Plume for CFD Validation
NASA Technical Reports Server (NTRS)
Wernet, Mark; Wolter, John D.; Locke, Randy; Wroblewski, Adam; Childs, Robert; Nelson, Andrea
2010-01-01
NASA's next manned launch platform for missions to the moon and Mars consists of the Orion and Ares systems. Many critical aspects of the launch system performance are being verified using computational fluid dynamics (CFD) predictions. The Orion Launch Abort Vehicle (LAV) consists of a tower mounted tractor rocket tasked with carrying the Crew Module (CM) safely away from the launch vehicle in the event of a catastrophic failure during the vehicle's ascent. Some of the predictions involving the launch abort system flow fields produced conflicting results, which required further investigation through ground test experiments. Ground tests were performed to acquire data from a hot supersonic jet in cross-flow for the purpose of validating CFD turbulence modeling relevant to the Orion Launch Abort Vehicle (LAV). Both 2-component axial plane Particle Image Velocimetry (PIV) and 3-component cross-stream Stereo Particle Image Velocimetry (SPIV) measurements were obtained on a model of an Abort Motor (AM). Actual flight conditions could not be simulated on the ground, so the highest temperature and pressure conditions that could be safely used in the test facility (nozzle pressure ratio 28.5 and a nozzle temperature ratio of 3) were used for the validation tests. These conditions are significantly different from those of the flight vehicle, but were sufficiently high to begin addressing turbulence modeling issues that predicated the need for the validation tests.
Mbada, Chidozie Emmanuel; Idowu, Opeyemi Ayodiipo; Ogunjimi, Olawale Richard; Ayanniyi, Olusola; Orimolade, Elkanah Ayodele; Oladiran, Ajibola Babatunde; Johnson, Olubusola Esther; Akinsulore, Adesanmi; Oni, Temitope Olawale
2017-04-01
A translation, cross-cultural adaptation, and psychometric analysis. The aim of this study was to translate, cross-culturally adapt, and validate the Yoruba version of the RMDQ. The Roland-Morris Disability Questionnaire (RMDQ) is a valid outcome tool for low back pain (LBP) in clinical and research settings. There seems to be no valid and reliable version of the RMDQ in the Nigerian languages. Following the Guillemin criteria, the English version of the RMDQ was forward and back translated. Two Yoruba translated versions of the RMDQ were assessed for clarity, common language usage, and conceptual equivalence. Consequently, a harmonized Yoruba version was produced and was pilot-tested among 20 patients with nonspecific long-term LBP (NSLBP) for cognitive debriefing. The final version of the Yoruba RMDQ was tested for its construct validity and test-retest reliability among 120 and 87 patients with NSLBP, respectively. Pearson product moment correlation coefficient (r) of 0.82 was obtained for reliability of the Yoruba version of the RMDQ. The test-retest reliability of the Yoruba RMDQ yielded Cronbach alpha 0.932, while the intraclass correlation (ICC) ranged between 0.896 and 0.956. The analysis of the global scores of both the English and Yoruba versions of the RMDQ yielded ICC value of between 0.995 (95% confidence interval 0.996-0.997), with the item-by-item Kappa agreement ranging between 0.824 and 1.000. The external validity of RMDQ using Quadruple Visual Analogue Scale was r = -0.596 (P = 0.001). The Yoruba version of the RMDQ had no floor/ceiling effects, as no patient achieved either of the maximum or the minimum possible scores. The Yoruba version of the RMDQ has excellent reliability and validity and may be an appropriate outcome tool for clinical and research purposes among Yoruba-speaking patients with LBP.
The Vocal Cord Dysfunction Questionnaire: Validity and Reliability of the Persian Version.
Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan
2017-12-25
The aim of this study was to develop, validate, and assess the reliability of the Persian version of the Vocal Cord Dysfunction Questionnaire (VCDQ_P). The study design was a cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on their essentiality. Ten patients with VFD rated the importance of VCDQ_P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ_P and the CAT total scores (P < 0.05). Discriminative validity was significantly different. The VCDQ scores in patients with VFD before and after treatment were significantly different (P < 0.001). The VCDQ was cross-culturally adapted to Persian and demonstrated to be a valid and reliable self-administered questionnaire in the Persian-speaking population.
Perceived Health Outcomes of Recreation Scale (PHORS): Reliability, Validity and Invariance
ERIC Educational Resources Information Center
Gómez, Edwin; Hill, Eddie; Zhu, Xihe; Freidt, Barbara
2016-01-01
This study examined the psychometric properties of the Perceived Health Outcomes of Recreation Scale (PHORS). Data for PHORS were collected from three different trail sites (Appalachian Trail, Pacific Crest Trail, and First Landing State Park) during three separate time periods, allowing for cross-validation and invariance testing. Exploratory…
Uchoa, Priscila Regina Candido Espinola; Bezerra, Thiago Freire Pinto; Lima, Élcio Duarte; Fornazieri, Marco Aurélio; Pinna, Fabio de Rezende; Sperandio, Fabiana de Araújo; Voegels, Richard Louis
The concept of quality of life is subjective and variably defined, and depends on the individual's perception of their state of health. Quality of life questionnaires are instruments designed to measure quality of life, but most are developed in a language other than Portuguese. Questionnaires can identify the most important symptoms, focus on consultation, and assist in defining the goals of treatment. Some of these have been validated for the Portuguese language, but none in children. To translate, cross-culturally adapt, and validate the Sinus and Nasal Quality of Life Survey (SN-5) in Portuguese. Prospective study of children aged 2-12 years with sinonasal symptoms of over 30 days. The study comprised two stages: (I) translation and cross-cultural adaptation of the SN-5 into Portuguese (SN-5p); and (II) validation of the SN-5p. Statistical analysis was performed to assess internal consistency, test-retest reliability, and sensitivity, as well as construct and discriminant validity and standardization. The SN-5 was translated and adapted into Portuguese (SN-5p) and the author of the original version approved the process. Validation was carried out by administration of the SN-5p to 51 pediatric patients with sinonasal complaints (mean age, 5.8±2.5 years; range, 2-12 years). The questionnaire exhibited adequate construct validity (0.62, p<0.01), internal consistency (Cronbach's alpha=0.73), and discriminant validity (p<0.01), as well as good test-retest reproducibility (Goodman-Kruskal gamma=0.957, p<0.001), good correlation with a visual analog scale (r=0.62, p<0.01), and sensitivity to change. This study reports the successful translation and cross-cultural adaptation of the SN-5 instrument into Brazilian Portuguese. The translated version exhibited adequate psychometric properties for assessment of disease-specific quality of life in pediatric patients with sinonasal complaints.
Huang, Hui-Chuan; Shyu, Meei-Ling; Lin, Mei-Feng; Hu, Chaur-Jong; Chang, Chien-Hung; Lee, Hsin-Chien; Chi, Nai-Fang; Chang, Hsiu-Ju
2017-12-01
The objectives of this study were to develop a cross-cultural Chinese version of the Emotional and Social Dysfunction Questionnaire (ESDQ-C) and test its validity and reliability among Chinese-speaking stroke patients. Various methods were used to develop the ESDQ-C. A cross-sectional study was used to examine the validity and reliability of the developed questionnaire, which consists of 28 items belonging to six factors, anger, helplessness, emotional dyscontrol, indifference, inertia and fatigue, and euphoria. Satisfactory convergence and known-group validities were confirmed by significant correlations of the ESDQ-C with the Profile of Mood States-Short Form ( p < .05) and with the Hospital Anxiety and Depression Scale ( p < .05). The internal consistency was represented by Cronbach's alpha, which was .96 and .79 to .92 for the entire scale and subscales, respectively. Appropriate application of the ESDQ-C will be helpful to identify critical adjustment-related types of distress and patients who experience difficulty coping with such distress.
Rosen, Allyson; Weitlauf, Julie C
2015-01-01
A screening measure of capacity to consent can provide an efficient method of determining the appropriateness of including individuals from vulnerable patient populations in research, particularly in circumstances in which no caregiver is available to provide surrogate consent. Seaman et al. (2015) cross-validate a measure of capacity to consent to research developed by Jeste et al. (2007). They provide data on controls, caregivers, and patients with mild cognitive impairment and dementia. The study demonstrates the importance of validating measures across disorders with different domains of incapacity, as well as the need for timely and appropriate follow-up with potential participants who yield positive screens. Ultimately clinical measures need to adapt to the dimensional diagnostic approaches put forward in DSM 5. Integrative models of constructs, such as capacity to consent, will make this process more efficient by avoiding the need to test measures in each disorder. Until then, cross-validation studies, such as the work by Seaman et al. (2015) are critical.
ERIC Educational Resources Information Center
Ruan, Jiening; Nie, Youyan; Hong, Ji; Monobe, Gumiko; Zheng, Guomin; Kambara, Hitomi; You, Sula
2015-01-01
The purpose of this study is to validate the widely adopted Teachers' Sense of Efficacy Scale (TSES) for the East Asian context. The researchers seek to find out whether TSES holds validity and reliability and is appropriate for use to measure teacher efficacy in China, Korea, and Japan. 489 teachers from the three countries participated in the…
Validation of tungsten cross sections in the neutron energy region up to 100 keV
NASA Astrophysics Data System (ADS)
Pigni, Marco T.; Žerovnik, Gašper; Leal, Luiz C.; Trkov, Andrej
2017-09-01
Following a series of recent cross section evaluations on tungsten isotopes performed at Oak Ridge National Laboratory (ORNL), this paper presents the validation work carried out to test the performance of the evaluated cross sections based on lead-slowing-down (LSD) benchmarks conducted in Grenoble. ORNL completed the resonance parameter evaluation of four tungsten isotopes - 182,183,184,186W - in August 2014 and submitted it as an ENDF-compatible file to be part of the next release of the ENDF/B-VIII.0 nuclear data library. The evaluations were performed with support from the US Nuclear Criticality Safety Program in an effort to provide improved tungsten cross section and covariance data for criticality safety sensitivity analyses. The validation analysis based on the LSD benchmarks showed an improved agreement with the experimental response when the ORNL tungsten evaluations were included in the ENDF/B-VII.1 library. Comparisons with the results obtained with the JEFF-3.2 nuclear data library are also discussed.
Perspectives on Validation of High-Throughput Assays Supporting 21st Century Toxicity Testing
Judson, Richard; Kavlock, Robert; Martin, Matt; Reif, David; Houck, Keith; Knudsen, Thomas; Richard, Ann; Tice, Raymond R.; Whelan, Maurice; Xia, Menghang; Huang, Ruili; Austin, Christopher; Daston, George; Hartung, Thomas; Fowle, John R.; Wooge, William; Tong, Weida; Dix, David
2014-01-01
Summary In vitro, high-throughput screening (HTS) assays are seeing increasing use in toxicity testing. HTS assays can simultaneously test many chemicals, but have seen limited use in the regulatory arena, in part because of the need to undergo rigorous, time-consuming formal validation. Here we discuss streamlining the validation process, specifically for prioritization applications in which HTS assays are used to identify a high-concern subset of a collection of chemicals. The high-concern chemicals could then be tested sooner rather than later in standard guideline bioassays. The streamlined validation process would continue to ensure the reliability and relevance of assays for this application. We discuss the following practical guidelines: (1) follow current validation practice to the extent possible and practical; (2) make increased use of reference compounds to better demonstrate assay reliability and relevance; (3) deemphasize the need for cross-laboratory testing; and (4) implement a web-based, transparent and expedited peer review process. PMID:23338806
Pitchford, Nicola J; Outhwaite, Laura A
2016-01-01
Assessment of cognitive and motor functions is fundamental for developmental and neuropsychological profiling. Assessments are usually conducted on an individual basis, with a trained examiner, using standardized paper and pencil tests, and can take up to an hour or more to complete, depending on the nature of the test. This makes traditional standardized assessments of child development largely unsuitable for use in low-income countries. Touch screen tablets afford the opportunity to assess cognitive functions in groups of participants, with untrained administrators, with precision recording of responses, thus automating the assessment process. In turn, this enables cognitive profiling to be conducted in contexts where access to qualified examiners and standardized assessments are rarely available. As such, touch screen assessments could provide a means of assessing child development in both low- and high-income countries, which would afford cross-cultural comparisons to be made with the same assessment tool. However, before touch screen tablet assessments can be used for cognitive profiling in low-to-high-income countries they need to be shown to provide reliable and valid measures of performance. We report the development of a new touch screen tablet assessment of basic cognitive and motor functions for use with early years primary school children in low- and high-income countries. Measures of spatial intelligence, visual attention, short-term memory, working memory, manual processing speed, and manual coordination are included as well as mathematical knowledge. To investigate if this new touch screen assessment tool can be used for cross-cultural comparisons we administered it to a sample of children ( N = 283) spanning standards 1-3 in a low-income country, Malawi, and a smaller sample of children ( N = 70) from first year of formal schooling from a high-income country, the UK. 
Split-half reliability, test-retest reliability, face validity, convergent construct validity, predictive criterion validity, and concurrent criterion validity were investigated. Results demonstrate "proof of concept" that touch screen tablet technology can provide reliable and valid psychometric measures of performance in the early years, highlighting its potential to be used in cross-cultural comparisons and research.
Park, So Jeong; An, Soo Min; Kim, Se Hyun
2013-03-01
(1) To translate the original English Cancer Therapy Satisfaction Questionnaire (CTSQ) into Korean and perform validation, (2) to compare CTSQ domains of expectations of therapy (ET), feelings about side effects (FSE), and satisfaction with therapy (SWT) by cancer therapy type. Cross-cultural adaptation was performed according to guidelines: translation, back translation, focus-group, and field test. We performed validation with internal consistency by Cronbach's alpha and construct validity by exploratory factor analysis (EFA) with varimax rotation method. We compared each CTSQ domain between traditional Korean Medicine (TKM) and integrative cancer therapy (ICT) of combining western and TKM by two-sample t test. Cross-cultural adaptation produced no major modifications in the items and domains. A total of 102 outpatients participated. Mean age was 51.9 ± 12.4. Most were stage 4 (74.4%) cancer. Mean scores of ET, FSE, and SWT were 81.2 ± 15.7, 79.5 ± 22.9, and 75.7 ± 14.8, respectively. Cronbach's alpha of ET, FSE, and SWT were 0.86, 0.78, and 0.74, respectively. EFA loaded items on the three domains, which is very close to that of the original CTSQ. ET and SWT were similar, but FSE was significantly higher in TKM than ICT (87.5 ± 19.3 vs. 74.9 ± 23.5; p = 0.0054). Cross-cultural adaptation was successful, and the adapted Korean CTSQ demonstrated good internal consistency and construct validity. Similar expectation and satisfaction was shown between the two types of therapy, but patients' reported feelings about side effects was significantly lower in patients receiving TKM than receiving ICT. The Korean version of CTSQ can be used to evaluate Korean cancer patients' experiences receiving various cancer therapy types.
Bias correction for selecting the minimal-error classifier from many machine learning models.
Ding, Ying; Tang, Shaowu; Liao, Serena G; Jia, Jia; Oesterreich, Steffi; Lin, Yan; Tseng, George C
2014-11-15
Supervised machine learning is commonly applied in genomic research to construct a classifier from the training data that is generalizable to predict independent testing data. When test datasets are not available, cross-validation is commonly used to estimate the error rate. Many machine learning methods are available, and it is well known that no universally best method exists in general. It has been a common practice to apply many machine learning methods and report the method that produces the smallest cross-validation error rate. Theoretically, such a procedure produces a selection bias. Consequently, many clinical studies with moderate sample sizes (e.g. n = 30-60) risk reporting a falsely small cross-validation error rate that could not be validated later in independent cohorts. In this article, we illustrated the probabilistic framework of the problem and explored the statistical and asymptotic properties. We proposed a new bias correction method based on learning curve fitting by inverse power law (IPL) and compared it with three existing methods: nested cross-validation, weighted mean correction and Tibshirani-Tibshirani procedure. All methods were compared in simulation datasets, five moderate size real datasets and two large breast cancer datasets. The results showed that IPL outperforms the other methods in bias correction with smaller variance, and it has an additional advantage to extrapolate error estimates for larger sample sizes, a practical feature to recommend whether more samples should be recruited to improve the classifier and accuracy. An R package 'MLbias' and all source files are publicly available at tsenglab.biostat.pitt.edu/software.htm. Contact: ctseng@pitt.edu. Supplementary data are available at Bioinformatics online.
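The abstract does not give the exact functional form or fitting procedure used for the inverse power law learning curve. To make the idea concrete, the following is a minimal sketch that assumes the simplest two-parameter form, error(n) = a * n^(-b) with no asymptote term, fitted by ordinary least squares on the log-log scale. The function names and the synthetic data are illustrative and are not taken from the 'MLbias' package.

```python
import math

def fit_inverse_power_law(sample_sizes, error_rates):
    """Fit error(n) = a * n**(-b) by least squares on log(error) vs log(n).

    Returns (a, b). Assumes all error rates are strictly positive.
    """
    xs = [math.log(n) for n in sample_sizes]
    ys = [math.log(e) for e in error_rates]
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals -b
    a = math.exp(y_bar - slope * x_bar)    # intercept on the log scale is log(a)
    return a, -slope

def predict_error(a, b, n):
    """Extrapolate the fitted learning curve to sample size n."""
    return a * n ** (-b)

# Synthetic learning curve generated from made-up values a = 2.0, b = 0.5.
sizes = [10, 20, 40, 80]
errors = [2.0 * n ** -0.5 for n in sizes]
a_hat, b_hat = fit_inverse_power_law(sizes, errors)
extrapolated = predict_error(a_hat, b_hat, 160)
```

The extrapolation step is what the abstract describes as a practical advantage: once the curve is fitted on the available sample sizes, it gives a rough projection of the error rate if more samples were recruited.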
Coster, Wendy J.; Haley, Stephen M.; Ni, Pengsheng; Dumas, Helene M.; Fragala-Pinkham, Maria A.
2009-01-01
Objective To examine score agreement, validity, precision, and response burden of a prototype computer adaptive testing (CAT) version of the Self-Care and Social Function scales of the Pediatric Evaluation of Disability Inventory (PEDI) compared to the full-length version of these scales. Design Computer simulation analysis of cross-sectional and longitudinal retrospective data; cross-sectional prospective study. Settings Pediatric rehabilitation hospital, including inpatient acute rehabilitation, day school program, outpatient clinics; community-based day care, preschool, and children’s homes. Participants Four hundred sixty-nine children with disabilities and 412 children with no disabilities (analytic sample); 38 children with disabilities and 35 children without disabilities (cross-validation sample). Interventions Not applicable. Main Outcome Measures Summary scores from prototype CAT applications of each scale using 15-, 10-, and 5-item stopping rules; scores from the full-length Self-Care and Social Function scales; time (in seconds) to complete assessments and respondent ratings of burden. Results Scores from both computer simulations and field administration of the prototype CATs were highly consistent with scores from full-length administration (all r’s between .94 and .99). Using computer simulation of retrospective data, discriminant validity and sensitivity to change of the CATs closely approximated that of the full-length scales, especially when the 15- and 10-item stopping rules were applied. In the cross-validation study the time to administer both CATs was 4 minutes, compared to over 16 minutes to complete the full-length scales. Conclusions Self-care and Social Function score estimates from CAT administration are highly comparable to those obtained from full-length scale administration, with small losses in validity and precision and substantial decreases in administration time. PMID:18373991
Bayesian cross-entropy methodology for optimal design of validation experiments
NASA Astrophysics Data System (ADS)
Jiang, X.; Mahadevan, S.
2006-07-01
An important concern in the design of validation experiments is how to incorporate the mathematical model in the design in order to allow conclusive comparisons of model prediction with experimental output in model assessment. The classical experimental design methods are more suitable for phenomena discovery and may result in a subjective, expensive, time-consuming and ineffective design that may adversely impact these comparisons. In this paper, an integrated Bayesian cross-entropy methodology is proposed to perform the optimal design of validation experiments incorporating the computational model. The expected cross entropy, an information-theoretic distance between the distributions of model prediction and experimental observation, is defined as a utility function to measure the similarity of two distributions. A simulated annealing algorithm is used to find optimal values of input variables through minimizing or maximizing the expected cross entropy. The measured data after testing with the optimum input values are used to update the distribution of the experimental output using Bayes theorem. The procedure is repeated to adaptively design the required number of experiments for model assessment, each time ensuring that the experiment provides effective comparison for validation. The methodology is illustrated for the optimal design of validation experiments for a three-leg bolted joint structure and a composite helicopter rotor hub component.
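The abstract does not state how the expected cross entropy is defined or estimated. As background only, the following minimal sketch computes the plain cross entropy H(p, q) = -sum_i p_i log q_i between two discrete distributions on the same bins. The distributions are invented, and treating `p` as the model prediction and `q` as the experimental observation is our assumption, not the authors' formulation.

```python
import math

def cross_entropy(p, q):
    """H(p, q) = -sum_i p_i * log(q_i) for discrete distributions on shared bins."""
    if len(p) != len(q):
        raise ValueError("distributions must share the same bins")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i > 0.0:
            if q_i <= 0.0:
                return math.inf  # p puts mass where q has none
            total -= p_i * math.log(q_i)
    return total

# Invented example: one model prediction compared with two possible observations.
model_pred = [0.5, 0.5]
observed_close = [0.5, 0.5]
observed_far = [0.9, 0.1]
h_close = cross_entropy(model_pred, observed_close)
h_far = cross_entropy(model_pred, observed_far)
```

For a fixed `p`, the cross entropy is smallest when `q` equals `p`, which is why it can serve as a measure of how closely two distributions agree.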
[Cross-cultural adaptation and validation of the Dizziness Handicap Inventory: Argentine version].
Caldara, Betina; Asenzo, Adriana I; Brusotti Paglia, Gabriela; Ferreri, Eliana; Gomez, Ramiro S; Laiz, Mariela M; Luques, María L; Mangoni, Ana P; Marazzi, Carla; Matesa, María A; Peker, Guillermo; Pratto, Romina A; Quiroga, Cecilia E; Rapela, Laura; Ruiz, Vanesa R; Sanchez, Noelia; Taglioretti, Célide L; Tana, Andrés M; Zandstra, Ingrid V
2012-01-01
The Dizziness Handicap Inventory is a useful tool for quantifying self-perceived handicap in patients with vertigo, dizziness or unsteadiness and its impact on daily living activities. The Dizziness Handicap Inventory identifies functional, physical and emotional disorders related to balance disturbance. Our objective was to cross-culturally adapt the Peninsular Spanish version of the Dizziness Handicap Inventory for use in Argentina and validate the adapted Argentinian version. We included both healthy subjects and patients with vertigo, dizziness or unsteadiness, aged 18 to 85 years, native Spanish-speaking Argentinians. We introduced linguistic and cultural modifications to the Peninsular Spanish version to obtain the Argentinian one. This version was given twice to 108 patients, 24 to 72 h apart. Internal consistency, test-retest reliability and construct validity were assessed using a visual analogue scale, the Romberg test, the tandem Romberg test and the tandem gait test. We found high internal consistency (α=0.87) and very high test-retest reliability for the total Dizziness Handicap Inventory score (intraclass correlation coefficient: 0.98) and its subscales. The total Dizziness Handicap Inventory and the functional subscale were found to correlate significantly with the Romberg and tandem Romberg tests. The emotional subscale showed a significant correlation with the Romberg test and the eyes-open tandem Romberg test (P<.05). The Argentinian version of the Dizziness Handicap Inventory proved to be a reliable and valid tool to quantify self-perceived handicap resulting from vertigo, dizziness or unsteadiness.
Classification based upon gene expression data: bias and precision of error rates.
Wood, Ian A; Visscher, Peter M; Mengersen, Kerrie L
2007-06-01
Gene expression data offer a large number of potentially useful predictors for the classification of tissue samples into classes, such as diseased and non-diseased. The predictive error rate of classifiers can be estimated using methods such as cross-validation. We have investigated issues of interpretation and potential bias in the reporting of error rate estimates. The issues considered here are optimization and selection biases, sampling effects, measures of misclassification rate, baseline error rates, two-level external cross-validation and a novel proposal for detection of bias using the permutation mean. Reporting an optimal estimated error rate incurs an optimization bias. Downward bias of 3-5% was found in an existing study of classification based on gene expression data and may be endemic in similar studies. Using a simulated non-informative dataset and two example datasets from existing studies, we show how bias can be detected through the use of label permutations and avoided using two-level external cross-validation. Some studies avoid optimization bias by using single-level cross-validation and a test set, but error rates can be more accurately estimated via two-level cross-validation. In addition to estimating the simple overall error rate, we recommend reporting class error rates plus where possible the conditional risk incorporating prior class probabilities and a misclassification cost matrix. We also describe baseline error rates derived from three trivial classifiers which ignore the predictors. R code which implements two-level external cross-validation with the PAMR package, experiment code, dataset details and additional figures are freely available for non-commercial use from http://www.maths.qut.edu.au/profiles/wood/permr.jsp
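The two-level external cross-validation described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the authors' R/PAMR code: the classifier is a toy k-nearest-neighbour rule, the tuned parameter is k, the data are simulated, and all function names are hypothetical. The inner loop chooses k using training data only, so the outer-loop error is not affected by optimization bias.

```python
import random


def knn_predict(train, query, k):
    """Majority vote among the k nearest training points (squared Euclidean distance)."""
    nearest = sorted(
        train, key=lambda row: sum((a - b) ** 2 for a, b in zip(row[0], query))
    )[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def error_rate(train, test, k):
    """Fraction of test points misclassified by k-NN fitted on train."""
    return sum(knn_predict(train, x, k) != y for x, y in test) / len(test)


def folds(data, n_folds):
    """Yield (train, held_out) pairs; fold f holds out every n_folds-th item."""
    for f in range(n_folds):
        held_out = data[f::n_folds]
        train = [row for i, row in enumerate(data) if i % n_folds != f]
        yield train, held_out


def cv_error(data, k, n_folds):
    """Single-level cross-validated error for a fixed k."""
    errs = [error_rate(tr, te, k) for tr, te in folds(data, n_folds)]
    return sum(errs) / len(errs)


def two_level_cv(data, k_grid, outer=5, inner=4):
    """Outer loop estimates error; inner loop picks k from the outer training part only."""
    outer_errs = []
    for train, test in folds(data, outer):
        best_k = min(k_grid, key=lambda k: cv_error(train, k, inner))
        outer_errs.append(error_rate(train, test, best_k))
    return sum(outer_errs) / len(outer_errs)


def make_data(n, shift, seed):
    """Simulated two-class data: feature 0 is shifted for class 1, feature 1 is noise."""
    rng = random.Random(seed)
    data = [([rng.gauss(shift * (i % 2), 1.0), rng.gauss(0.0, 1.0)], i % 2) for i in range(n)]
    rng.shuffle(data)
    return data


informative_error = two_level_cv(make_data(60, 6.0, 0), [1, 3, 5])
```

To probe for bias in the spirit of the permutation proposal, one could shuffle the class labels of the simulated data and rerun `two_level_cv`; on non-informative labels an unbiased procedure should average close to the baseline error rate.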
Arimura, Tatsuyuki; Hosoi, Masako; Tsukiyama, Yoshihiro; Yoshida, Toshiyuki; Fujiwara, Daiki; Tanaka, Masanori; Tamura, Ryuichi; Nakashima, Yasunori; Sudo, Nobuyuki; Kubo, Chiharu
2012-04-01
The present study aimed to develop a Japanese version of the Short-Form McGill Pain Questionnaire (SF-MPQ-J) that focuses on cross-cultural equivalence to the original English version and to test its reliability and validity. Cross-sectional design. In study 1, SF-MPQ was translated and adapted into Japanese. It included construction of response scales equivalent to the original using a variation of the Thurstone method of equal-appearing intervals. A total of 147 undergraduate students and 44 pain patients participated in the development of the Japanese response scales. To measure the equivalence of pain descriptors, 62 pain patients in four diagnostic groups were asked to choose pain descriptors that described their pain. In study 2, chronic pain patients (N=126) completed the SF-MPQ-J, the Long-Form McGill Pain Questionnaire Japanese version (LF-MPQ-J), and the 11-point numerical rating scale of pain intensity. Correlation analysis examined the construct validity of the SF-MPQ-J. The results from study 1 were used to develop SF-MPQ-J, which is linguistically equivalent to the original questionnaire. Response scales from SF-MPQ-J represented the original scale values. All pain descriptors, except one, were used by >33% in at least one of the four diagnostic groups. Study 2 exhibited adequate internal consistency and test-retest reliability, with the construct validity of SF-MPQ-J comparable to the original. These findings suggested that SF-MPQ-J is reliable, valid, and cross-culturally equivalent to the original questionnaire. Researchers might consider using this scale in multicenter, multiethnic trials or cross-cultural studies that include Japanese-speaking patients. Wiley Periodicals, Inc.
Augusto, Fabiana da Silva; Blanes, Leila; Nicodemo, Denise; Ferreira, Lydia Masako
2017-05-01
To translate into Brazilian Portuguese and cross-culturally adapt the Cardiff Wound Impact Schedule, a specific measure of health-related quality of life (HRQoL) for patients with chronic wounds. Chronic wounds have a relevant impact on the HRQoL of patients. However, there are few instruments cross-culturally adapted and validated in Brazil to assess HRQoL in patients with wounds. A descriptive cross-sectional study was conducted following six steps: (1) translation of the original instrument into Brazilian-Portuguese by two independent translators; (2) construction of a consensus version based on both translations; (3) two independent back-translations into English of the consensus version; (4) review by an expert committee and construction of the pre-final version; (5) testing of the pre-final version on patients with chronic wounds; and (6) construction of the final version. The psychometric properties of the instrument were tested on 30 patients with chronic wounds of the lower limb; 76.7% were men, 70.0% had traumatic wounds, and 43.3% had the wound for more than 1 year. Participants were recruited from an outpatient wound care clinic in São Paulo, Brazil. The final version approved by the expert committee was well understood by all patients who participated in the study and had satisfactory face validity, content validity, and internal consistency, with Cronbach's alpha coefficients ranging from 0.681 to 0.920. The cross-culturally adapted Brazilian-Portuguese version of the instrument showed satisfactory face and content validity, good internal consistency, and was named Cardiff Wound Impact Schedule-Federal University of São Paulo School of Medicine or CWIS-UNIFESP/EPM. Copyright © 2016 Tissue Viability Society. Published by Elsevier Ltd. All rights reserved.
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.
Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian
2018-02-02
We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability, measured as the intraclass correlation coefficient, was good at 0.93 (95% confidence interval [CI]: 0.87-0.96) for patients and 0.78 (95% CI: 0.63-0.87) for the control group, which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity, with a strong Spearman correlation of 0.69 between patients' perception of the severity of their voice disorder and the VHI score. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
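Test-retest reliability in these validation studies is summarized by an intraclass correlation coefficient (ICC). The abstracts do not state which ICC form was used, so the sketch below assumes the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1), computed from the usual ANOVA mean squares. The function name and the example scores are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table of n subjects (rows) by k sessions (columns)."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, sessions, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n)


# Hypothetical test and retest scores for three subjects, with a constant +1 shift at retest.
shifted = icc_2_1([[1, 2], [2, 3], [3, 4]])
```

Because this form measures absolute agreement, a systematic shift between sessions lowers the coefficient even when subjects keep exactly the same rank order; a consistency-type ICC would not penalize that shift.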
Kim, Ho-Joong; Ruscheweyh, Ruth; Yeo, Ji-Hyun; Cho, Hyeon-Guk; Yi, Je-Min; Chang, Bong-Soon; Lee, Choon-Ki; Yeom, Jin S
2014-11-01
The purpose of this study was to translate pain sensitivity questionnaires (PSQ) into the Korean language, perform a cross-cultural adaption of the PSQ, and validate the Korean version of PSQ in patients with degenerative spinal disease. The PSQ was translated forward and backward, cross-culturally adapted by 2 independent translators, and approved by an expert committee. The final Korean version of the PSQ was tested on 72 patients with degenerative spinal disease. Test-retest reliability was evaluated for 60 patients (83%) who completed the second assessment in an interval of 4 weeks. The mean PSQ-minor, PSQ-moderate, and PSQ-total (standard deviation [SD]) were 5.40 (2.02), 6.46 (1.98), and 5.93 (1.93), respectively. The PSQ-total, PSQ-minor, and PSQ-moderate of the Korean version showed very good internal consistencies determined by the Cronbach's α of 0.926, 0.869, and 0.877, respectively. For convergent validity, the PSQ scores of the Korean version showed significant correlations with pain catastrophizing scale (PCS) (r = 0.377, P = 0.002; r = 0.365, P = 0.003; r = 0.362, P = 0.003 for PSQ-total, PSQ-minor, and PSQ-moderate of the Korean version, respectively). For test-retest reliability, the intraclass correlation coefficients were 0.782 for PSQ-total, 0.752 for PSQ-minor, and 0.793 for PSQ-moderate. In conclusion, the validated Korean version of PSQ is a transculturally equivalent, reliable, and valid tool to assess individual pain sensitivity. © 2013 World Institute of Pain.
Ferreira, Maria Regina Sardinheiro do Céu Furtado; Martins, José Joaquim Penedos Amendoeira
2014-08-01
Testing the psychometric properties of the Portuguese version of the Practice Environment Scale of the Nursing Work Index. A descriptive, analytical and cross-sectional study, for the cross-cultural adaptation and validation of the psychometric properties of the scale. The study participants were 236 nurses from two hospitals in the regions of Lisbon and Vale do Tejo. A Cronbach's alpha of 0.92 was obtained for overall reliability, with support for a five-dimension structure. The excellent quality of adjustment of analysis confirms the validity of the adapted version to hospital care settings, although there was no total coincidence of items in the five dimensions.
Liou, Shwu-Ru; Tsai, Hsiu-Min; Cheng, Ching-Yu
2013-01-01
To analyze and compare the psychometric properties and cultural attributes of the Organizational Commitment Questionnaire and the Organizational Commitment Scale to determine their appropriateness for measuring commitment of Asian nurses, the largest group of international nurses. The Organizational Commitment Questionnaire was cross-culturally cross-validated when compared with the Organizational Commitment Scale. Neither instrument was tested on Asian nurses. More studies are needed to validate the cultural properties of the Organizational Commitment Scale. Healthcare administrators can use culturally validated instruments, which concern cultural context, including languages and cultural values, to understand Asian nurses' organizational commitment and further lower turnover behavior among them. © 2013 Wiley Periodicals, Inc.
Dutch translation and cross-cultural validation of the Adult Social Care Outcomes Toolkit (ASCOT).
van Leeuwen, Karen M; Bosmans, Judith E; Jansen, Aaltje Pd; Rand, Stacey E; Towers, Ann-Marie; Smith, Nick; Razik, Kamilla; Trukeschitz, Birgit; van Tulder, Maurits W; van der Horst, Henriette E; Ostelo, Raymond W
2015-05-13
The Adult Social Care Outcomes Toolkit was developed to measure outcomes of social care in England. In this study, we translated the four level self-completion version (SCT-4) of the ASCOT for use in the Netherlands and performed a cross-cultural validation. The ASCOT SCT-4 was translated into Dutch following international guidelines, including two forward and back translations. The resulting version was pilot tested among frail older adults using think-aloud interviews. Furthermore, using a subsample of the Dutch ACT-study, we investigated test-retest reliability and construct validity and compared response distributions with data from a comparable English study. The pilot tests showed that translated items were in general understood as intended, that most items were reliable, and that the response distributions of the Dutch translation and associations with other measures were comparable to the original English version. Based on the results of the pilot tests, some small modifications and a revision of the Dignity items were proposed for the final translation, which were approved by the ASCOT development team. The complete original English version and the final Dutch translation can be obtained after registration on the ASCOT website ( http://www.pssru.ac.uk/ascot ). This study provides preliminary evidence that the Dutch translation of the ASCOT is valid, reliable and comparable to the original English version. We recommend further research to confirm the validity of the modified Dutch ASCOT translation.
Test, revision, and cross-validation of the Physical Activity Self-Definition Model.
Kendzierski, Deborah; Morganstein, Mara S
2009-08-01
Structural equation modeling was used to test an extended version of the Kendzierski, Furr, and Schiavoni (1998) Physical Activity Self-Definition Model. A revised model using data from 622 runners fit the data well. Cross-validation indices supported the revised model, and this model also provided a good fit to data from 397 cyclists. Partial invariance was found across activities. In both samples, perceived commitment and perceived ability had direct effects on self-definition, and perceived wanting, perceived trying, and enjoyment had indirect effects. The contribution of perceived ability to self-definition did not differ across activities. Implications concerning the original model, indirect effects, skill salience, and the role of context in self-definition are discussed.
van Ark, Mathijs; Zwerver, Johannes; Diercks, Ronald L; van den Akker-Scheek, Inge
2014-08-11
Lateral Epicondylalgia (LE) is a common injury for which no reliable and valid measure exists to determine severity in the Dutch language. The Patient-Rated Tennis Elbow Evaluation (PRTEE) is the first questionnaire specifically designed for LE but in English. The aim of this study was to translate into Dutch and cross-culturally adapt the PRTEE and determine reliability and validity of the PRTEE-D (Dutch version). The PRTEE was cross-culturally adapted according to international guidelines. Participants (n = 122) were asked to fill out the PRTEE-D twice with a one week interval to assess test-retest reliability. Internal consistency of the PRTEE-D was determined by calculating Cronbach's alphas for the questionnaire and subscales. Intraclass Correlation Coefficients (ICC) were calculated for the overall PRTEE-D score, pain and function subscale and individual questions to determine test-retest reliability. Additionally, the Disabilities of the Arm, Shoulder and Hand questionnaire (DASH) and Visual Analogue Scale (VAS) pain scores were obtained from 30 patients to assess construct validity; Spearman's correlation coefficients were calculated between the PRTEE-D (subscales) and DASH and VAS-pain scores. The PRTEE was successfully cross-culturally adapted into Dutch (PRTEE-D). Cronbach's alpha for the first assessment of the PRTEE-D was 0.98; Cronbach's alpha was 0.93 for the pain subscale and 0.97 for the function subscale. ICC for the PRTEE-D was 0.98; subscales also showed excellent ICC values (pain scale 0.97 and function scale 0.97). A significant moderate correlation exists between PRTEE-D and DASH (0.65) and PRTEE-D and VAS pain (0.68). The PRTEE was successfully cross-culturally adapted and this study showed that the PRTEE-D is reliable and valid to obtain an indication of severity of LE. An easy-to-use instrument for practitioners is now available and this facilitates comparing Dutch and international research data.
Kim, SungHwan; Lin, Chien-Wei; Tseng, George C
2016-07-01
Supervised machine learning is widely applied to transcriptomic data to predict disease diagnosis, prognosis or survival. Robust and interpretable classifiers with high accuracy are usually favored for their clinical and translational potential. The top scoring pair (TSP) algorithm is an example that applies a simple rank-based algorithm to identify rank-altered gene pairs for classifier construction. Although many classification methods perform well in cross-validation of a single expression profile, performance usually drops greatly in cross-study validation (i.e. the prediction model is established in the training study and applied to an independent test study) for all machine learning methods, including TSP. The failure of cross-study validation has largely diminished the potential translational and clinical values of the models. The purpose of this article is to develop a meta-analytic top scoring pair (MetaKTSP) framework that combines multiple transcriptomic studies and generates a robust prediction model applicable to independent test studies. We proposed two frameworks, by averaging TSP scores or by combining P-values from individual studies, to select the top gene pairs for model construction. We applied the proposed methods in simulated data sets and three large-scale real applications in breast cancer, idiopathic pulmonary fibrosis and pan-cancer methylation. The results showed superior cross-study validation accuracy and biomarker selection for the new meta-analytic framework. In conclusion, combining multiple omics data sets in the public domain increases robustness and accuracy of the classification model that will ultimately improve disease understanding and clinical treatment decisions to benefit patients. An R package MetaKTSP is available online (http://tsenglab.biostat.pitt.edu/software.htm). Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
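The first of the two frameworks mentioned in the MetaKTSP abstract, averaging TSP scores across studies, can be illustrated with a short sketch. It uses the usual TSP score for a gene pair (i, j), namely |P(X_i < X_j | class 0) - P(X_i < X_j | class 1)|, and takes an unweighted mean over studies. The unweighted mean, the function names and the toy data are assumptions made for illustration; the MetaKTSP R package may weight or combine scores differently.

```python
from itertools import combinations


def tsp_score(samples, labels, i, j):
    """|P(x_i < x_j | class 0) - P(x_i < x_j | class 1)| within one study."""
    def frac(cls):
        rows = [x for x, lab in zip(samples, labels) if lab == cls]
        return sum(x[i] < x[j] for x in rows) / len(rows)
    return abs(frac(0) - frac(1))


def mean_tsp_score(studies, i, j):
    """Unweighted mean of the per-study TSP scores (an illustrative choice)."""
    return sum(tsp_score(X, y, i, j) for X, y in studies) / len(studies)


def top_pairs(studies, n_genes, top=1):
    """Gene pairs ranked by mean TSP score, best first."""
    pairs = combinations(range(n_genes), 2)
    return sorted(pairs, key=lambda p: mean_tsp_score(studies, *p), reverse=True)[:top]


# Hypothetical toy data: two studies, three genes, class labels 0/1.
study_a = ([[1, 2, 0], [1, 3, 0], [5, 2, 0], [4, 1, 0]], [0, 0, 1, 1])
study_b = ([[1, 2, 0], [3, 1, 0], [5, 2, 0], [4, 1, 0]], [0, 0, 1, 1])
best = top_pairs([study_a, study_b], n_genes=3)
```

Because the score depends only on the within-sample ordering of two genes, it is unaffected by monotone normalization differences between studies, which is one reason rank-based pairs are attractive for cross-study use.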
Martinez-Vega, Ingrid Patricia; Doubova, Svetlana V; Aguirre-Hernandez, Rebeca; Infante-Castañeda, Claudia
2016-03-02
The aim of this study was to adapt and validate the Distress Scale for Mexican patients with type 2 diabetes and hypertension (DSDH17M). Two family medicine clinics affiliated with the Mexican Institute of Social Security. 722 patients with type 2 diabetes and/or hypertension (235 patients with diabetes, 233 patients with hypertension and 254 patients with both diseases). A cross-sectional survey. The validation procedures included: (1) content validity using a group of experts, (2) construct validity from exploratory factor analysis, (3) internal consistency using Cronbach's α, (4) convergent validity between DSDH17M and anxiety and depression using the Spearman correlation coefficient, (5) discriminative validity through the Wilcoxon rank-sum test and (6) test-retest reliability using intraclass correlation coefficient. The DSDH17M has 17 items and three factors explaining 67% of the total variance. Cronbach α ranged from 0.83 to 0.91 among factors. The first factor of 'Regime-related Distress and Emotional Burden' moderately correlated with anxiety and depression scores. Discriminative validity revealed that patients with obesity, those with stressful events and those who did not adhere to pharmacological treatment had significantly higher distress scores in all DSDH17M domains. Test-retest intraclass correlation coefficient for DSDH17M ranged from 0.92 to 0.97 among factors. DSDH17M is a valid and reliable tool to identify distress of patients with type 2 diabetes and hypertension. Published by the BMJ Publishing Group Limited.
Crins, Martine H. P.; Roorda, Leo D.; Smits, Niels; de Vet, Henrica C. W.; Westhovens, Rene; Cella, David; Cook, Karon F.; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B.
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach’s alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach’s alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed. PMID:26214178
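The graded response model (GRM) used for calibration above assigns each item a discrimination parameter and a set of ordered thresholds. As a minimal sketch of the standard logistic form of the model, the function below turns a trait level θ into response-category probabilities: the probability of responding in category k or higher is a logistic function of a·(θ - b_k), and each category probability is the difference between adjacent cumulative probabilities. The parameter values shown are hypothetical and are not the calibrated PROMIS parameters.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the logistic graded response model.

    theta: latent trait level; a: discrimination; thresholds: increasing b_1..b_{K-1}.
    Returns K probabilities, for the lowest to the highest response category.
    """
    # Cumulative probabilities P(X >= k), padded with 1 (k = 0) and 0 (k = K).
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Hypothetical item with four response options, evaluated at an average trait level.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.0])
```

Thresholds spanning a wide range, such as the -3.04 to 3.44 reported in the abstract, mean that some items remain informative for respondents at both very low and very high trait levels.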
Alanazi, Fahad; Gleeson, Peggy; Olson, Sharon; Roddey, Toni
2017-04-01
Prospective cohort study of a cross-cultural low back pain (LBP) questionnaire. The objectives of the present study were to translate and cross-culturally adapt the Fear-Avoidance Beliefs Questionnaire (FABQ) to create a version in Arabic and to test its psychometric properties. The FABQ measures the effects that fear and avoidance beliefs have on work and on physical activity. An FABQ cross-culturally adapted for Arabic readers and speakers was created by forward translation, translation synthesis, and backward translation. Forty patients in Riyadh, Saudi Arabia, with LBP evaluated use of the questionnaire, and 70 patients from the same hospital participated in reliability, validity, and sensitivity studies. To determine test-retest reliability of the Arabic FABQ, patients completed it twice within 48 hours without receiving any active treatment between these two sessions. Patients completed the Arabic FABQ (and three other scales) at baseline and 14 days later to determine its validity and sensitivity. Test-retest reliability was good (FABQ-work: intraclass correlation coefficient [ICC] = 0.74; FABQ-physical activity: ICC = 0.90; FABQ overall: ICC = 0.76). Correlations between the FABQ and three other instruments for measuring pain and disability were weak. The strongest correlation was found at the follow-up session with the Arabic Oswestry Questionnaire (r = 0.283; P ≤ 0.05). Sensitivity to change was low. The translation and adaptation of the Arabic version of the FABQ was successful. Overall, the Arabic FABQ had good test-retest reliability, acceptable construct validity, and low sensitivity to change. The Arabic version of the FABQ shows promise in the assessment of fear-avoidance beliefs among patients with LBP who speak and read Arabic.
Abanto, Jenny; Albites, Ursula; Bönecker, Marcelo; Paiva, Saul M; Castillo, Jorge L; Aguilar-Gálvez, Denisse
2015-12-01
The lack of a Family Impact Scale (FIS) in Spanish language limits its use as an indicator in Spanish-speaking countries and precludes comparisons with data from other cultural and ethnic groups. The purpose of this study was therefore to adapt the FIS cross-culturally to the Peruvian Spanish language and assess its reliability and validity. In order to translate and adapt the FIS cross-culturally, it was answered by 60 parents in two pilot tests, after which it was tested on 200 parents of children aged 11 to 14 years who were clinically examined for dental caries experience and malocclusions. Internal consistency was assessed by Cronbach's alpha coefficient while repeat administration of the FIS on the same 200 parents enabled the test-retest reliability to be assessed via intraclass correlation coefficient (ICC). Construct and discriminant validity were based on associations of the FIS with global ratings of oral health and clinical groups, respectively. Mean (standard deviation) FIS total score was 5.20 (5.86). Internal consistency was confirmed by Cronbach's alpha 0.84. Test-retest reliability revealed excellent reproducibility (ICC = 0.96). Construct validity was good, demonstrating statistically significant associations between total FIS score and global ratings of oral health (p=0.007) and overall wellbeing (p=0.002), as well as for the subscale scores (p<0.05) with exception of the financial burden subscale. The FIS was also able to discriminate between children with and without dental caries experience and malocclusions (p<0.05). Satisfactory psychometric results for the Peruvian Spanish FIS confirm it as a reliable, valid instrument for assessing the impact on the family caused by children's oral conditions. Sociedad Argentina de Investigación Odontológica.
NASA Astrophysics Data System (ADS)
Samala, Ravi K.; Chan, Heang-Ping; Hadjiiski, Lubomir; Helvie, Mark A.; Richter, Caleb; Cha, Kenny
2018-02-01
We propose a cross-domain, multi-task transfer learning framework to transfer knowledge learned from non-medical images by a deep convolutional neural network (DCNN) to a medical image recognition task while improving the generalization by multi-task learning of auxiliary tasks. A first stage cross-domain transfer learning was initiated from ImageNet trained DCNN to mammography trained DCNN. 19,632 regions-of-interest (ROI) from 2,454 mass lesions were collected from two imaging modalities: digitized screen-film mammography (SFM) and full-field digital mammography (DM), and split into training and test sets. In the multi-task transfer learning, the DCNN learned the mass classification task simultaneously from the training set of SFM and DM. The best transfer network for mammography was selected from three transfer networks with different numbers of convolutional layers frozen. The performance of single-task and multi-task transfer learning on an independent SFM test set in terms of the area under the receiver operating characteristic curve (AUC) was 0.78±0.02 and 0.82±0.02, respectively. In the second stage cross-domain transfer learning, a set of 12,680 ROIs from 317 mass lesions on digital breast tomosynthesis (DBT) were split into validation and independent test sets. We first studied the data requirements for the first stage mammography trained DCNN by varying the mammography training data from 1% to 100% and evaluated its learning on the DBT validation set in inference mode. We found that the entire available mammography set provided the best generalization. The DBT validation set was then used to train only the last four fully connected layers, resulting in an AUC of 0.90±0.04 on the independent DBT test set.
Prediction of functional aerobic capacity without exercise testing
NASA Technical Reports Server (NTRS)
Jackson, A. S.; Blair, S. N.; Mahar, M. T.; Wier, L. T.; Ross, R. M.; Stuteville, J. E.
1990-01-01
The purpose of this study was to develop functional aerobic capacity prediction models without using exercise tests (N-Ex) and to compare the accuracy with Astrand single-stage submaximal prediction methods. The data of 2,009 subjects (9.7% female) were randomly divided into validation (N = 1,543) and cross-validation (N = 466) samples. The validation sample was used to develop two N-Ex models to estimate VO2peak. Gender, age, body composition, and self-report activity were used to develop two N-Ex prediction models. One model estimated percent fat from skinfolds (N-Ex %fat) and the other used body mass index (N-Ex BMI) to represent body composition. The multiple correlations for the developed models were R = 0.81 (SE = 5.3 ml·kg⁻¹·min⁻¹) and R = 0.78 (SE = 5.6 ml·kg⁻¹·min⁻¹). This accuracy was confirmed when applied to the cross-validation sample. The N-Ex models were more accurate than what was obtained from VO2peak estimated from the Astrand prediction models. The SEs of the Astrand models ranged from 5.5 to 9.7 ml·kg⁻¹·min⁻¹. The N-Ex models were cross-validated on 59 men on hypertensive medication and 71 men who were found to have a positive exercise ECG. The SEs of the N-Ex models ranged from 4.6 to 5.4 ml·kg⁻¹·min⁻¹ with these subjects. (ABSTRACT TRUNCATED AT 250 WORDS).
Calibration of the Dutch-Flemish PROMIS Pain Behavior item bank in patients with chronic pain.
Crins, M H P; Roorda, L D; Smits, N; de Vet, H C W; Westhovens, R; Cella, D; Cook, K F; Revicki, D; van Leeuwen, J; Boers, M; Dekker, J; Terwee, C B
2016-02-01
The aims of the current study were to calibrate the item parameters of the Dutch-Flemish PROMIS Pain Behavior item bank using a sample of Dutch patients with chronic pain and to evaluate cross-cultural validity between the Dutch-Flemish and the US PROMIS Pain Behavior item banks. Furthermore, reliability and construct validity of the Dutch-Flemish PROMIS Pain Behavior item bank were evaluated. The 39 items in the bank were completed by 1042 Dutch patients with chronic pain. To evaluate unidimensionality, a one-factor confirmatory factor analysis (CFA) was performed. A graded response model (GRM) was used to calibrate the items. To evaluate cross-cultural validity, Differential item functioning (DIF) for language (Dutch vs. English) was evaluated. Reliability of the item bank was also examined and construct validity was studied using several legacy instruments, e.g. the Roland Morris Disability Questionnaire. CFA supported the unidimensionality of the Dutch-Flemish PROMIS Pain Behavior item bank (CFI = 0.960, TLI = 0.958), the data also fit the GRM, and demonstrated good coverage across the pain behavior construct (threshold parameters range: -3.42 to 3.54). Analysis showed good cross-cultural validity (only six DIF items), reliability (Cronbach's α = 0.95) and construct validity (all correlations ≥0.53). The Dutch-Flemish PROMIS Pain Behavior item bank was found to have good cross-cultural validity, reliability and construct validity. The development of the Dutch-Flemish PROMIS Pain Behavior item bank will serve as the basis for Dutch-Flemish PROMIS short forms and computer adaptive testing (CAT). © 2015 European Pain Federation - EFIC®
Zometa, Carlos S; Dedrick, Robert; Knox, Michael D; Westhoff, Wayne; Siri, Rodrigo Simán; Debaldo, Ann
2007-06-01
An instrument developed in the United States by the Centers for Disease Control and Prevention to assess HIV/AIDS knowledge and four attitudinal dimensions (Peer Pressure, Abstinence, Drug Use, and Threat of HIV Infection) and an instrument developed by Basen-Engquist et al. (1999) to measure abstinence and condom use were translated, cross-culturally adapted, and validated for use with Spanish-speaking high school students in El Salvador. A back-translation of the English version was cross-culturally adapted using two different review panels and pilot-tested with Salvadorian students. An expert panel established content validity, and confirmatory factor analysis provided support for construct validity. Results indicated that the methodology was successful in cross-culturally adapting the instrument developed by the Centers for Disease Control and Prevention and the instrument developed by Basen-Engquist et al. The psychometric properties of the knowledge section were acceptable and there was partial support for the four-factor attitudinal model underlying the CDC instrument and the two-factor model underlying the Basen-Engquist et al. instrument. Additional studies with Spanish-speaking populations (either in the United States or Latin America) are needed to evaluate the generalizability of the present results.
Stapelfeldt, Christina Malmose; Momsen, Anne-Mette Hedeager; Lund, Thomas; Grønborg, Therese Koops; Hogg-Johnson, Sheilah; Jensen, Chris; Skakon, Janne; Labriola, Merete
2018-06-06
The objective of the present study was to translate and validate the Canadian Readiness for Return To Work instrument (RRTW-CA) into a Danish version (RRTW-DK) by testing its test-retest and internal consistency reliability and its structural and construct validity. Cross-cultural adaptation of the six-staged RRTW-CA instrument was performed in a standardised, systematic five-step procedure: forward translation, panel synthesis of the translation, back translation, consolidation and revision by researchers, and finally pre-testing. This RRTW-DK beta-version was tested for its psychometric properties by intra-class correlation coefficient and standard error of measurement (n = 114), Cronbach's alpha (n = 471), confirmatory factor analyses (n = 373), and Spearman's rank correlation coefficient (n = 436) in sickness beneficiaries from a municipal employment agency and hospital wards. The original RRTW-CA stage structure could not be confirmed in the RRTW-DK. The psychometric properties were thus inconclusive. The RRTW-DK cannot be recommended for use in the current version as the RRTW construct is questionable. The RRTW construct needs further exploration, preferably in a population that is homogeneous with regard to cause of sickness, disability duration and age.
Chiarotto, Alessandro; Vanti, Carla; Ostelo, Raymond W; Ferrari, Silvano; Tedesco, Giuseppe; Rocca, Barbara; Pillastrini, Paolo; Monticone, Marco
2015-11-01
The Pain Self-Efficacy Questionnaire (PSEQ) is a patient self-reported measurement instrument that evaluates pain self-efficacy beliefs in patients with chronic pain. The measurement properties of the PSEQ have been tested in its original and translated versions, showing satisfactory results for validity and reliability. The aims of this study were twofold: (1) to translate the PSEQ into Italian through a process of cross-cultural adaptation, and (2) to test the measurement properties of the Italian PSEQ (PSEQ-I). The cross-cultural adaptation was completed in 5 months without omitting any item of the original PSEQ. Measurement properties were tested in 165 patients with chronic low back pain (CLBP) (65% women, mean age 49.9 years). Factor analysis confirmed the one-factor structure of the questionnaire. Internal consistency (Cronbach's α = 0.94) and test-retest reliability (ICCagreement = 0.82) of the PSEQ-I showed good results. The smallest detectable change was equal to 15.69 scale points. The PSEQ-I displayed a high construct validity by meeting more than 75% of a priori hypotheses on correlations with measurement instruments assessing pain intensity, disability, anxiety, depression, pain catastrophizing, fear of movement, and coping strategies. Additionally, the PSEQ-I differentiated patients taking pain medication or not. The results of this study suggest that the PSEQ-I can be used as a valid and reliable tool in Italian patients with CLBP. © 2014 World Institute of Pain.
Cross-cultural adaptation of the German version of the spinal stenosis measure.
Wertli, Maria M; Steurer, Johann; Wildi, Lukas M; Held, Ulrike
2014-06-01
To validate the German version of the spinal stenosis measure (SSM), a disease-specific questionnaire assessing symptom severity, physical function, and satisfaction with treatment in patients with lumbar spinal stenosis. After translation, cross-cultural adaptation, and pilot testing, we assessed internal consistency, test-retest reliability, construct validity, and responsiveness of the SSM subscales. Data from a large Swiss multi-center prospective cohort study were used. Reference scales for the assessment of construct validity and responsiveness were the numeric rating scale, pain thermometer, and the Roland Morris Disability Questionnaire. One hundred and eight consecutive patients were included in this validation study, recruited from five different centers. Cronbach's alpha was above 0.8 for all three subscales of the SSM. The objectivity of the SSM was assessed using a partial credit approach. The model showed a good global fit to the data. Of the 108 patients 78 participated in the test-retest procedure. The ICC values were above 0.8 for all three subscales of the SSM. Correlations with reference scales were above 0.7 for the symptom and function subscales. For satisfaction subscale, it was 0.66 or above. Clinically meaningful changes of the reference scales over time were associated with significantly more improvement in all three SSM subscales (p < 0.001). Conclusion: The proposed version of the SSM showed very good measurement properties and can be considered validated for use in the German language.
Kim, Jin Goo; Lee, Joong Yub; Seo, Seung Suk; Choi, Choong Hyeok; Lee, Myung Chul
2013-01-01
Purpose: To perform a cross-cultural adaptation and to test the measurement properties of the Korean version of the International Knee Documentation Committee (K-IKDC) Subjective Knee Form. Materials and Methods: According to the guidelines for cross-cultural adaptation, translation and backward translation of the English version of the IKDC Subjective Knee Form were performed. After translation into the Korean version, 150 patients who had knee-related problems were asked to complete the K-IKDC, Lysholm score, and Short Form-36 (SF-36). Of these patients, 126 were retested 2 weeks later to evaluate test-retest reliability, and 104 were recruited 3 months later to evaluate responsiveness. Construct validity was analyzed by investigating the correlation with Lysholm score and SF-36; content validity was also evaluated. Standardized mean response was calculated for evaluating responsiveness. Results: The test-retest reliability proved excellent with a high value for the intraclass correlation coefficient (r=0.94). The internal consistency was strong (Cronbach's α=0.91). Good content validity with absence of floor or ceiling effects and good convergent and divergent validity were observed. Moderate responsiveness was shown (standardized mean response=0.689). Conclusions: The K-IKDC demonstrated good measurement properties. We suggest that this instrument is an excellent evaluation instrument that can be used for Korean patients with knee-related injuries. PMID:24032098
Cross-Cultural Research on the Creativity of Elementary School Students in Korea and Australia
ERIC Educational Resources Information Center
Kyunghwa, Lee; Hyejin, Yang
2016-01-01
The purpose of this study was to understand cultural differences and similarities in children's creative characteristics in Korea and Australia. In this cross-cultural research, the Integrative Creativity Test (K-ICT, [13]) with identified validity and reliability for measuring elementary school students' creative ability and creative personality,…
Ribeiro, João Carlos; Simões, João; Silva, Filipe; Silva, Eduardo D.; Hummel, Cornelia; Hummel, Thomas; Paiva, António
2016-01-01
The cross-cultural adaptation and validation of the Sniffin' Sticks test for the Portuguese population is described. Over 270 people participated in four experiments. In Experiment 1, 67 participants rated the familiarity of presented odors and seven descriptors of the original test were adapted to a Portuguese context. In Experiment 2, the Portuguese version of Sniffin' Sticks test was administered to 203 healthy participants. Older age, male gender and active smoking status were confirmed as confounding factors. The third experiment showed the validity of the Portuguese version of Sniffin' Sticks test in discriminating healthy controls from patients with olfactory dysfunction. In Experiment 4, the test-retest reliability for both the composite score (r71 = 0.86) and the identification test (r71 = 0.62) was established (p<0.001). Normative data for the Portuguese version of Sniffin' Sticks test is provided, showing good validity and reliability and effectively distinguishing patients from healthy controls with high sensitivity and specificity. The Portuguese version of Sniffin' Sticks test identification test is a clinically suitable screening tool in routine outpatient Portuguese settings. PMID:26863023
Cabrera, Esther; Zabalegui, Adelaida; Blanco, Ignacio
2011-01-15
Worry about falling ill has been described as a key element in changing preventive attitudes. Poorly calibrated levels of cancer worry have been associated with inadequate adherence to preventive strategies. There is no validated Spanish scale to evaluate the degree of cancer worry in our population. The aim of the present study was to perform the cross-cultural adaptation and validation of the Cancer Worry Scale described by Lerman. The Cancer Worry Scale was translated into Spanish and back-translated. The Spanish scale was validated by principal components factor analysis with varimax rotation in a sample of 200 healthy women with a family history of breast cancer. The Escala de Preocupación por el Cáncer (EPC) is the Spanish version of the Cancer Worry Scale; it contains 6 items with a total score ranging from 6 (minimal worry) to 24 (maximum worry). The analysis of content validity demonstrated that the EPC is conceptually equivalent to the original scale. The factor analysis showed a single factor that explains 53.07% of the variance, confirming unidimensionality. The EPC presented good test-retest reliability, with an intraclass correlation coefficient of 0.777. Cronbach's alpha was 0.835 for the complete scale. The EPC is a validated Spanish scale for measuring cancer worry in healthy individuals, and it shows adequate content validity and reliability. Copyright © 2010 Elsevier España, S.L. All rights reserved.
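Several abstracts in this listing report Cronbach's alpha as the measure of internal consistency. As a minimal sketch, the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores) can be computed as follows. The response data are invented for illustration (six items scored 1 to 4, which is only an assumption consistent with the 6 to 24 total range mentioned above), and `cronbach_alpha` is a hypothetical helper name.

```python
from statistics import variance

def cronbach_alpha(scores):
    """scores: one list of item scores per respondent, all of equal length."""
    k = len(scores[0])  # number of items
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])  # variance of total scores
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Invented responses from five respondents to a six-item scale (not study data).
responses = [
    [1, 2, 1, 2, 1, 1],
    [2, 2, 3, 2, 2, 3],
    [3, 3, 3, 4, 3, 3],
    [4, 4, 3, 4, 4, 4],
    [2, 3, 2, 2, 3, 2],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Values near 1 indicate that the items vary together; the sample variance (n − 1 denominator) is used for both the items and the totals.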
21 CFR 1270.31 - Written procedures.
Code of Federal Regulations, 2011 CFR
2011-04-01
... procedures prepared and followed for all significant steps in the infectious disease testing process under... procedures prepared, validated, and followed for prevention of infectious disease contamination or cross...
21 CFR 1270.31 - Written procedures.
Code of Federal Regulations, 2012 CFR
2012-04-01
... procedures prepared and followed for all significant steps in the infectious disease testing process under... procedures prepared, validated, and followed for prevention of infectious disease contamination or cross...
21 CFR 1270.31 - Written procedures.
Code of Federal Regulations, 2014 CFR
2014-04-01
... procedures prepared and followed for all significant steps in the infectious disease testing process under... procedures prepared, validated, and followed for prevention of infectious disease contamination or cross...
21 CFR 1270.31 - Written procedures.
Code of Federal Regulations, 2013 CFR
2013-04-01
... procedures prepared and followed for all significant steps in the infectious disease testing process under... procedures prepared, validated, and followed for prevention of infectious disease contamination or cross...
Comparative assessment of three standardized robotic surgery training methods.
Hung, Andrew J; Jayaratna, Isuru S; Teruya, Kara; Desai, Mihir M; Gill, Inderbir S; Goh, Alvin C
2013-10-01
To evaluate three standardized robotic surgery training methods, inanimate, virtual reality and in vivo, for their construct validity. To explore the concept of cross-method validity, where the relative performance of each method is compared. Robotic surgical skills were prospectively assessed in 49 participating surgeons who were classified as follows: 'novice/trainee': urology residents, previous experience <30 cases (n = 38) and 'experts': faculty surgeons, previous experience ≥30 cases (n = 11). Three standardized, validated training methods were used: (i) structured inanimate tasks; (ii) virtual reality exercises on the da Vinci Skills Simulator (Intuitive Surgical, Sunnyvale, CA, USA); and (iii) a standardized robotic surgical task in a live porcine model with performance graded by the Global Evaluative Assessment of Robotic Skills (GEARS) tool. A Kruskal-Wallis test was used to evaluate performance differences between novices and experts (construct validity). Spearman's correlation coefficient (ρ) was used to measure the association of performance across inanimate, simulation and in vivo methods (cross-method validity). Novice and expert surgeons had previously performed a median (range) of 0 (0-20) and 300 (30-2000) robotic cases, respectively (P < 0.001). Construct validity: experts consistently outperformed residents with all three methods (P < 0.001). Cross-method validity: overall performance of inanimate tasks significantly correlated with virtual reality robotic performance (ρ = -0.7, P < 0.001) and in vivo robotic performance based on GEARS (ρ = -0.8, P < 0.0001). Virtual reality performance and in vivo tissue performance were also found to be strongly correlated (ρ = 0.6, P < 0.001). We propose the novel concept of cross-method validity, which may provide a method of evaluating the relative value of various forms of skills education and assessment. We externally confirmed the construct validity of each featured training tool. 
© 2013 BJU International.
Simulation of SEU Cross-sections using MRED under Conditions of Limited Device Information
NASA Technical Reports Server (NTRS)
Lauenstein, J. M.; Reed, R. A.; Weller, R. A.; Mendenhall, M. H.; Warren, K. M.; Pellish, J. A.; Schrimpf, R. D.; Sierawski, B. D.; Massengill, L. W.; Dodd, P. E.;
2007-01-01
This viewgraph presentation reviews the simulation of Single Event Upset (SEU) cross sections with the Monte Carlo Radiative Energy Deposition (MRED) tool, using "best guess" assumptions about the process and geometry together with direct-ionization, low-energy beam test results. This work will also simulate SEU cross-sections including angular and high energy responses and compare the simulated results with beam test data for the validation of the model. Using MRED, we produced a reasonably accurate upset response model of a low-critical charge SRAM without detailed information about the circuit, device geometry, or fabrication process.
Stanifer, John W.; Karia, Francis; Voils, Corrine I.; Turner, Elizabeth L.; Maro, Venance; Shimbi, Dionis; Kilawe, Humphrey; Lazaro, Matayo; Patel, Uptal D.
2015-01-01
Introduction Non-communicable diseases are a growing global burden, and structured surveys can identify critical gaps to address this epidemic. In sub-Saharan Africa, there are very few well-tested survey instruments measuring population attributes related to non-communicable diseases. To meet this need, we have developed and validated the first instrument evaluating knowledge, attitudes and practices pertaining to chronic kidney disease in a Swahili-speaking population. Methods and Results Between December 2013 and June 2014, we conducted a four-stage, mixed-methods study among adults from the general population of northern Tanzania. In stage 1, the survey instrument was constructed in English by a group of cross-cultural experts from multiple disciplines and through content analysis of focus group discussions to ensure local significance. Following translation, in stage 2, we piloted the survey through cognitive and structured interviews, and in stage 3, in order to obtain initial evidence of reliability and construct validity, we recruited and then administered the instrument to a random sample of 606 adults. In stage 4, we conducted analyses to establish test-retest reliability and known-groups validity which was informed by thematic analysis of the qualitative data in stages 1 and 2. The final version consisted of 25 items divided into three conceptual domains: knowledge, attitudes and practices. Each item demonstrated excellent test-retest reliability with established content and construct validity. Conclusions We have developed a reliable and valid cross-cultural survey instrument designed to measure knowledge, attitudes and practices of chronic kidney disease in a Swahili-speaking population of Northern Tanzania. 
This instrument may be valuable for addressing gaps in non-communicable diseases care by understanding preferences regarding healthcare, formulating educational initiatives, and directing development of chronic disease management programs that incorporate chronic kidney disease across sub-Saharan Africa. PMID:25811781
Odunaiya, Nse A; Louw, Quinette A; Grimmer, Karen
2017-06-01
Assessment of lifestyle risk factors must be culturally and contextually relevant and available in local languages. This paper reports on a study which aimed to cross-culturally adapt a composite lifestyle cardiovascular disease (CVD) risk factors questionnaire into an African language (Yoruba) and to test some of its psychometric properties, such as content validity and test-retest reliability, in comparison to the original English version. This study utilized a cross-sectional design. Translation of the English version of the questionnaire into Yoruba was undertaken using the guideline by Beaton et al. The translated instrument was presented to a convenience sample of 21 rural adolescents to assess comprehensibility and clarity. Test-retest reliability was assessed among 150 rural adolescents recruited by purposive sampling. Data were analyzed using intraclass correlation (ICC) model 3, Cohen's kappa statistics and prevalence rates. ICC ranged between 0.4 and 0.8. The Yoruba version was completed in 15-20 minutes and was reported to be culturally appropriate and acceptable for rural Nigerian adolescents. The Yoruba translation of the Nigerian composite lifestyle risk factors questionnaire performs at least as well as the original English version in terms of content validity and reliability. It took a shorter time to complete and therefore may be more relevant to rural adolescents.
Mbada, Chidozie Emmanuel; Adeogun, Gafar Atanda; Ogunlana, Michael Opeoluwa; Adedoyin, Rufus Adesoji; Akinsulore, Adesanmi; Awotidebe, Taofeek Oluwole; Idowu, Opeyemi Ayodiipo; Olaoye, Olumide Ayoola
2015-09-14
The Short-Form Health Survey (SF-36) is a valid quality of life tool often employed to determine the impact of medical intervention and the outcome of health care services. However, the SF-36 is culturally sensitive, which necessitates its adaptation and translation into different languages. This study was conducted to cross-culturally adapt the SF-36 into the Yoruba language and determine its reliability and validity. Based on the International Quality of Life Assessment project guidelines, a sequence of translation, test of item-scale correlation, and validation was implemented for the translation of the Yoruba version of the SF-36. Following pilot testing, the English and the Yoruba versions of the SF-36 were administered to a random sample of 1087 apparently healthy individuals to test validity, and 249 respondents completed the Yoruba SF-36 again after two weeks to test reliability. Data were analyzed using Pearson's product moment correlation analysis, independent t-test, one-way analysis of variance, multi-trait scaling analysis and Intra-Class Correlation (ICC) at p < 0.05. The concurrent validity scores for scales and domains ranged between 0.749 and 0.902, with the highest and lowest scores in the General Health (0.902) and Bodily Pain (0.749) scales. Scale-level descriptive results showed that all scale and domain scores had negative skewness ranging from -2.08 to -0.98. The mean scores for each scale ranged between 83.2 and 88.8. The domain scores for Physical Health Component and Mental Health Component were 85.6 ± 13.7 and 85.9 ± 15.4 respectively. The convergent validity was satisfactory, ranging from 0.421 to 0.907. Discriminant validity was also satisfactory except for item '1'. The ICC for the test-retest reliability of the Yoruba SF-36 ranged between 0.636 and 0.843 for scales, and between 0.783 and 0.851 for domains.
The data quality, concurrent and discriminant validity, reliability and internal consistency of the Yoruba version of the SF-36 are adequate and it is recommended for measuring health-related quality of life among Yoruba population.
Validity of one-repetition maximum predictive equations in men with spinal cord injury.
Ribeiro Neto, F; Guanais, P; Dornelas, E; Coutinho, A C B; Costa, R R G
2017-10-01
Cross-sectional study. The study aimed (a) to test the cross-validation of current one-repetition maximum (1RM) predictive equations in men with spinal cord injury (SCI); (b) to compare the current 1RM predictive equations to a newly developed equation based on the 4- to 12-repetition maximum test (4-12RM). SARAH Rehabilitation Hospital Network, Brasilia, Brazil. Forty-five men aged 28.0 years with SCI between C6 and L2 causing complete motor impairment were enrolled in the study. Volunteers were tested, in a random order, in the 1RM test or the 4-12RM test, with an interval of 2-3 days between tests. Multiple regression analysis was used to generate an equation for predicting 1RM. There were no significant differences between the 1RM test and the current predictive equations. ICC values were significant and were classified as excellent for all current predictive equations. The predictive equation of Lombardi presented the best Bland-Altman results (0.5 kg and 12.8 kg for mean difference and interval range around the differences, respectively). The two created equation models for 1RM demonstrated the same high adjusted R² (0.971, P<0.01), but different SEE of measured 1RM (2.88 kg or 5.4% and 2.90 kg or 5.5%). All 1RM predictive equations are accurate to assess individuals with SCI at the bench press exercise. However, the predictive equation of Lombardi presented the best associated cross-validity results. A specific 1RM prediction equation was also elaborated for individuals with SCI. The created equation should be tested in order to verify whether it presents better accuracy than the current ones.
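The Bland-Altman comparison mentioned in this abstract summarizes agreement between a measured and a predicted value by the mean difference (bias) and an interval around the differences. The sketch below uses the common 95% limits of agreement, bias ± 1.96 × SD of the differences; whether the study defined its "interval range" exactly this way is an assumption, and the 1RM values are invented.

```python
from statistics import mean, stdev

def bland_altman(measured, predicted):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [m - p for m, p in zip(measured, predicted)]
    bias = mean(diffs)                  # mean difference between methods
    half_width = 1.96 * stdev(diffs)    # 95% limits of agreement, sample SD
    return bias, bias - half_width, bias + half_width

# Invented measured vs. predicted 1RM values in kg (not study data).
measured_1rm = [50.0, 60.0, 70.0, 80.0]
predicted_1rm = [49.0, 61.0, 69.0, 79.0]
bias, lower, upper = bland_altman(measured_1rm, predicted_1rm)
print(bias, lower, upper)
```

A bias close to zero with narrow limits suggests the predictive equation can stand in for the direct test; wide limits indicate large individual errors even when the mean error is small.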
Muscle synergies during bench press are reliable across days.
Kristiansen, Mathias; Samani, Afshin; Madeleine, Pascal; Hansen, Ernst Albin
2016-10-01
Muscle synergies have been investigated during different types of human movement using nonnegative matrix factorization. However, there are not any reports available on the reliability of the method. To evaluate between-day reliability, 21 subjects performed bench press in two test sessions separated by approximately 7 days. The movement consisted of 3 sets of 8 repetitions at 60% of the three repetition maximum in bench press. Muscle synergies were extracted from electromyography data of 13 muscles, using nonnegative matrix factorization. To evaluate between-day reliability, we performed a cross-correlation analysis and a cross-validation analysis, in which the synergy components extracted in the first test session were recomputed, using the fixed synergy components from the second test session. Two muscle synergies accounted for >90% of the total variance, and reflected the concentric and eccentric phase, respectively. The cross-correlation values were strong to very strong (r-values between 0.58 and 0.89), while the cross-validation values ranged from substantial to almost perfect (ICC(3,1) values between 0.70 and 0.95). The present findings revealed that the same general structure of the muscle synergies was present across days and the extraction of muscle synergies is thus deemed reliable. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dzhambov, Angel M; Dimitrova, Donka D
2014-01-01
The Noise Sensitivity Scale Short Form (NSS-SF), developed in English as a more practical form of the classical Weinstein NSS, has not to date been validated in other cultures, and its validity and reliability have not yet been confirmed. This study aimed to validate NSS-SF in Bulgarian and to demonstrate its applicability. The study comprised test-retest (n = 115) and a field-testing (n = 71) of the newly validated scale. Its construct validity was examined with confirmatory factor analysis, and very good model-fit was observed. Temporal stability was assessed in a test-retest (r = 0.990), convergent validity was examined with single-item susceptibility to the noise scale (r = 0.906) and discriminant validity was confirmed with single-item noise annoyance scale (r = 0.718). The lowest observed McDonald's omega across the studies was 0.923. The cross-cultural validation of NSS-SF was successful but it proved to be somewhat problematic with respect to its annoyance-based items.
Zhang, Hui; Ren, Ji-Xia; Kang, Yan-Li; Bo, Peng; Liang, Jun-Yu; Ding, Lan; Kong, Wei-Bao; Zhang, Ji
2017-08-01
Toxicological testing associated with developmental toxicity endpoints is very expensive, time-consuming and labor-intensive. Thus, developing alternative approaches for developmental toxicity testing is an important and urgent task in the drug development field. In this investigation, the naïve Bayes classifier was applied to develop a novel prediction model for developmental toxicity. The established prediction model was evaluated by internal 5-fold cross-validation and an external test set. The overall prediction results for the internal 5-fold cross-validation of the training set and the external test set were 96.6% and 82.8%, respectively. In addition, four simple descriptors and some representative substructures of developmental toxicants were identified. Thus, we hope the established in silico prediction model can be used as an alternative method for toxicological assessment. The molecular information obtained could afford a deeper understanding of developmental toxicants and provide guidance for medicinal chemists working in drug discovery and lead optimization. Copyright © 2017 Elsevier Inc. All rights reserved.
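The internal 5-fold cross-validation named in this abstract partitions the training set into five folds, each of which serves once as the evaluation fold while the model is fitted on the other four. The following is a minimal sketch of the index bookkeeping only, with a hypothetical helper `k_fold_indices`; it does not reproduce the authors' classifier or descriptors, and in practice the samples would usually be shuffled first.

```python
def k_fold_indices(n_samples, k=5):
    """Yield (train, test) index lists for k contiguous, near-equal folds."""
    # The first n_samples % k folds receive one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size

# Illustration with 12 samples: every sample is held out exactly once.
folds = list(k_fold_indices(12, k=5))
print([len(test) for _, test in folds])  # [3, 3, 2, 2, 2]
```

The cross-validated accuracy is then the average (or pooled) accuracy over the five held-out folds, which is a different quantity from the accuracy on a separate external test set.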
Richardson, Michelle; Katsakou, Christina; Torres-González, Francisco; Onchev, George; Kallert, Thomas; Priebe, Stefan
2011-06-30
Patients' views of inpatient care need to be assessed for research and routine evaluation. For this a valid instrument is required. The Client Assessment of Treatment Scale (CAT) has been used in large scale international studies, but its psychometric properties have not been well established. The structural validity of the CAT was tested among involuntary inpatients with psychosis. Data from locations in three separate European countries (England, Spain and Bulgaria) were collected. The factorial validity was initially tested using single sample confirmatory factor analyses in each country. Subsequent multi-sample analyses were used to test for invariance of the factor loadings, and factor variances across the countries. Results provide good initial support for the factorial validity and invariance of the CAT scores. Future research is needed to cross-validate these findings and to generalise them to other countries, treatment settings, and patient populations. Copyright © 2011 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Lievens, Filip; Chasteen, Christopher S.; Day, Eric Anthony; Christiansen, Neil D.
2006-01-01
This study used trait activation theory as a theoretical framework to conduct a large-scale test of the interactionist explanation of the convergent and discriminant validity findings obtained in assessment centers. Trait activation theory specifies the conditions in which cross-situationally consistent and inconsistent candidate performances are…
Validating Behavioural Change: Teachers' Perception and Use of ICT in England and Korea.
ERIC Educational Resources Information Center
Carter, D. S. G.; Leeh, D. J. K.
This study focused on the test and cross-cultural validation of an organizational and behavioral model of planned change. The aim of the research was to ascertain the nature and direction of different cultural aspects influencing the change process when Information and Communication Technology (ICT) was being implemented in schools. The…
ERIC Educational Resources Information Center
Chan, Kathy; Penner, Kailee; Mah, Janet W. T.; Johnston, Charlotte
2010-01-01
The use of parenting measures that are developed for use with Western families without testing their validity among families from non-Western cultural backgrounds may not be appropriate. Similar parenting behaviors may affect child outcomes in different ways across different cultures. This study examined the cross-cultural validity of an…
Relative Reliability and Validity of the Block Kids Questionnaire among Youth Aged 10 to 17 Years
USDA-ARS?s Scientific Manuscript database
This cross-sectional study tested the reliability and validity of the Block Kids Questionnaire to assess diet during the past 7 days. Within a 7-day period, 10- to 17-year-old children and adolescents completed two 24-hour dietary recalls by telephone, followed by the Block Kids Questionnaire at the...
Validity and Reliability of Internalized Stigma of Mental Illness (Cantonese)
ERIC Educational Resources Information Center
Young, Daniel Kim-Wan; Ng, Petrus Y. N.; Pan, Jia-Yan; Cheng, Daphne
2017-01-01
Purpose: This study aims to translate and test the reliability and validity of the Internalized Stigma of Mental Illness-Cantonese (ISMI-C). Methods: The original English version of ISMI is translated into the ISMI-C by going through forward and backward translation procedure. A cross-sectional research design is adopted that involved 295…
Cross-Validation of a PACER Prediction Equation for Assessing Aerobic Capacity in Hungarian Youth
ERIC Educational Resources Information Center
Saint-Maurice, Pedro F.; Welk, Gregory J.; Finn, Kevin J.; Kaj, Mónika
2015-01-01
Purpose: The purpose of this article was to evaluate the validity of the Progressive Aerobic Cardiovascular and Endurance Run (PACER) test in a sample of Hungarian youth. Method: Approximately 500 participants (aged 10-18 years old) were randomly selected across Hungary to complete both laboratory (maximal treadmill protocol) and field assessments…
Simons, M; Kee, E Gee; Kimble, R; Tyack, Z
2017-08-01
The aim of this study was to investigate the reproducibility and validity of measuring scar height in children using ultrasound and 3D camera. Using a cross-sectional design, children with discrete burn scars were included. Reproducibility was tested using Intraclass Correlation Coefficient (ICC) for reliability, and percentage agreement within 1mm between test and re-test, standard error of measurement (SEM), smallest detectable change (SDC) and Bland Altman limits of agreement for agreement. Concurrent validity was tested using Spearman's rho for support of pre-specified hypotheses. Forty-nine participants (55 scars) were included. For ultrasound, test-retest and inter-rater reproducibility of scar thickness was acceptable for scarred skin (ICC=0.95, SDC=0.06cm and ICC=0.82, SDC=0.14cm). The ultrasound picked up changes of <1mm. Inter-rater reproducibility of maximal scar height using the 3D camera was acceptable (ICC=0.73, SDC=0.55cm). Construct validity of the ultrasound was supported with a strong correlation between the measure of scar thickness and observer ratings of thickness using the POSAS (ρ=0.61). Construct validity of the 3D camera was also supported with a moderate correlation (ρ=0.37) with the same measure using maximal scar height. The ultrasound is capable of detecting smaller changes or differences in scar thickness than the 3D camera, in children with burn scars. However agreement as part of reproducibility was lower than expected between raters for the ultrasound. Improving the accuracy of scar relocation may go some way to address agreement. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
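The standard error of measurement (SEM) and smallest detectable change (SDC) reported here are often derived from the ICC and the between-subject standard deviation. The sketch below uses one common formulation, SEM = SD × √(1 − ICC) and SDC = 1.96 × √2 × SEM; the abstract does not state which formulas the authors used, so this is an assumption, and the SD value is invented.

```python
import math

def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)

def smallest_detectable_change(sem):
    """95% smallest detectable change for a difference between two measurements."""
    return 1.96 * math.sqrt(2.0) * sem

# Invented example: SD of 1.0 cm and ICC of 0.75 (not study data).
sem = sem_from_icc(1.0, 0.75)
sdc = smallest_detectable_change(sem)
print(sem, round(sdc, 3))
```

A change in an individual patient smaller than the SDC cannot be distinguished from measurement error with 95% confidence, which is why a lower SDC indicates a more sensitive instrument.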
Saub, R; Locker, D; Allison, P; Disman, M
2007-09-01
The aim of this project was to develop an oral health-related quality of life measure for the Malaysian adult population aged 18 and above by the cross-cultural adaptation of the Oral Health Impact Profile (OHIP). The adaptation of the OHIP was based on the framework proposed by Herdman et al (1998). The OHIP was translated into the Malay language using a forward-backward translation technique. Thirty-six patients were interviewed to assess the conceptual equivalence and relevancy of each item. Based on the translation process and interview results, a Malaysian version of the OHIP questionnaire was produced that contained 45 items. It was designated as the OHIP(M). This questionnaire was pre-tested on 20 patients to assess its face validity. A short 14-item version of the questionnaire was completed by 171 patients to assess the suitability of the Likert-type response format. Field-testing was conducted in order to assess the suitability of two modes of administration (mail and interview) and to establish the psychometric properties of the adapted measure. The pre-testing revealed that the OHIP(M) has good face validity. It was found that the five-point frequency Likert scale could be used for the Malaysian population. The OHIP(M) was reliable: the scale Cronbach's alpha was 0.95 and the ICC value for test-retest reliability was 0.79. Three out of four construct validity hypotheses tested were confirmed. The OHIP(M) works as well as the English version. The OHIP(M) was found to be reliable and valid regardless of the mode of administration. However, this study only provides initial evidence for the reliability and validity of the measure. Further study is recommended to collect more evidence to support these results.
Translation and validation of a Spanish version of the xerostomia inventory.
Serrano, Carlos; Fariña, María P; Pérez, Cristhian; Fernández, Marcos; Forman, Katherine; Carrasco, Mauricio
2016-12-01
The aim of this study was to validate a Spanish cross-cultural adaptation of the xerostomia inventory (XI). The original English version of XI was translated into Spanish, cross-culturally adapted and field tested. The Spanish version of XI (XI-Sp) was tested with a sample of 41 patients with xerostomia. The reliability of the XI-Sp was determined through internal consistency and test-retest methods. The construct validity of XI-Sp was determined by means of correlation between XI-Sp scores and salivary flow measurements. Overall XI-Sp scores were 40.8 (SD = 10) for the first application and 40.2 (SD = 9.5) for the second. Cronbach's alpha value for the XI-Sp was 0.89 and 0.87, respectively, while interitem correlation averages were r = 0.44 and r = 0.39 for each application. Interitem correlation and corrected total was r_c ≥ 0.30. The test-retest intraclass correlation coefficient value for the XI-Sp score was 0.59 and 0.91. Convergent validity for construct validity correlation with salivary flow showed a medium effect size (r² = 0.10) for the first application but did not make a statistically significant prediction for the second (r² = 0.7). This study provides evidence concerning the reliability of the XI-Sp, showing that it may be a useful tool for Spanish-speaking xerostomia patients for both clinical and epidemiologic research. © 2015 John Wiley & Sons A/S and The Gerodontology Association. Published by John Wiley & Sons Ltd.
Assessing cross-cultural differences through use of multiple-group invariance analyses.
Stein, Judith A; Lee, Jerry W; Jones, Patricia S
2006-12-01
The use of structural equation modeling in cross-cultural personality research has become a popular method for testing measurement invariance. In this report, we present an example of testing measurement invariance using the Sense of Coherence Scale of Antonovsky (1993) in 3 ethnic groups: Chinese, Japanese, and Whites. In a series of increasingly restrictive constraints on the measurement models of the 3 groups, we demonstrate how to assess differences among the groups. We also provide an example of construct validation.
Olivera, André Rodrigues; Roesler, Valter; Iochpe, Cirano; Schmidt, Maria Inês; Vigo, Álvaro; Barreto, Sandhi Maria; Duncan, Bruce Bartholow
2017-01-01
Type 2 diabetes is a chronic disease associated with a wide range of serious health complications that have a major impact on overall health. The aims here were to develop and validate predictive models for detecting undiagnosed diabetes using data from the Longitudinal Study of Adult Health (ELSA-Brasil) and to compare the performance of different machine-learning algorithms in this task. Comparison of machine-learning algorithms to develop predictive models using data from ELSA-Brasil. After selecting a subset of 27 candidate variables from the literature, models were built and validated in four sequential steps: (i) parameter tuning with tenfold cross-validation, repeated three times; (ii) automatic variable selection using forward selection, a wrapper strategy with four different machine-learning algorithms and tenfold cross-validation (repeated three times), to evaluate each subset of variables; (iii) error estimation of model parameters with tenfold cross-validation, repeated ten times; and (iv) generalization testing on an independent dataset. The models were created with the following machine-learning algorithms: logistic regression, artificial neural network, naïve Bayes, K-nearest neighbor and random forest. The best models were created using artificial neural networks and logistic regression. These achieved mean areas under the curve of, respectively, 75.24% and 74.98% in the error estimation step and 74.17% and 74.41% in the generalization testing step. Most of the predictive models produced similar results, and demonstrated the feasibility of identifying individuals with the highest probability of having undiagnosed diabetes through easily obtained clinical data.
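Steps (i) to (iii) of the abstract above rely on repeated tenfold cross-validation. The sketch below shows one way such splits can be generated, using only the Python standard library. It is an illustrative assumption, not the authors' pipeline; the function name `repeated_kfold` and the sample size of 25 are hypothetical. The independent dataset of step (iv) would be held out before any of these splits are drawn.

```python
import random
from typing import Iterator, List, Tuple


def repeated_kfold(
    n: int, k: int = 10, repeats: int = 3, seed: int = 0
) -> Iterator[Tuple[int, List[int], List[int]]]:
    """Yield (repeat, train_idx, val_idx) for repeated k-fold cross-validation."""
    rng = random.Random(seed)
    for r in range(repeats):
        idx = list(range(n))
        rng.shuffle(idx)  # fresh shuffle for every repeat
        folds = [idx[i::k] for i in range(k)]  # k folds of near-equal size
        for i, val in enumerate(folds):
            # Training indices are all folds except the current validation fold.
            train = [j for f_i, f in enumerate(folds) if f_i != i for j in f]
            yield r, train, val


# Tenfold cross-validation repeated three times on 25 hypothetical samples.
splits = list(repeated_kfold(n=25, k=10, repeats=3))
print(len(splits), "train/validation splits")
```

Within each repeat every sample appears in exactly one validation fold, so averaging a metric over the folds uses each sample once for validation.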
Student Accounts of the Ontario Secondary School Literacy Test: A Case for Validation
ERIC Educational Resources Information Center
Cheng, Liying; Fox, Janna; Zheng, Ying
2007-01-01
The Ontario Secondary School Literacy Test (OSSLT) is a cross-curricular literacy test issued to all secondary school students in the province of Ontario. The test consists of a reading and a writing component, both of which must be successfully completed for secondary school graduation in Ontario. This study elicited 16 first language and second…
Cross-cultural equivalence in translations of the oral health impact profile.
MacEntee, Michael I; Brondani, Mario
2016-04-01
The Oral Health Impact Profile (OHIP) has been translated for comparisons across cultural boundaries. This report on a systematic search of literature published between 1994 and 2014 aims to identify an acceptable method of translating psychometric instruments for cross-cultural equivalence, and how they were used to translate the OHIP. An electronic search used the keywords 'cultural adaptation', 'validation', 'Oral Health Impact Profile' and 'OHIP' in MEDLINE and EMBASE databases supplemented by reference links and grey literature. It included papers on methods of cross-cultural translation and translations of the OHIP for dentulous adults and adolescents, and excluded papers without translational details or limited to specific disorders. The search identified eight steps to cross-cultural equivalence, and 36 (plus three supplemental) translations of the OHIP. The steps involve assessment of (i) forward/backward translation by committee, (ii) constructs, (iii) item interpretations, (iv) interval scales, (v) convergent validity, (vi) discriminant validity, (vii) responsiveness to clinical change and (viii) pilot tests. Most (>60%) of the translations involved forward/backward translation by committee, item interpretations, interval scales, convergence, discrimination and pilot tests, but fewer assessed the underlying theory (47%) or responsiveness to clinical change (28%). An acceptable method for translating quality of life-related psychometric instruments for cross-cultural equivalence has eight procedural steps, and most of the 36 OHIP translations involved at least five of the steps. Only translations to Saudi Arabian Arabic, Chinese Mandarin, German and Japanese used all eight steps to claim cultural equivalence with the original OHIP. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Tuğay, Baki Umut; Tuğay, Nazan; Güney, Hande; Kınıklı, Gizem İrem; Yüksel, İnci; Atilla, Bülent
2016-01-01
The Oxford Knee Score (OKS) is a valid, short, self-administered, and site-specific outcome measure specifically developed for patients with knee arthroplasty. This study aimed to cross-culturally adapt and validate the OKS to be used in Turkish-speaking patients with osteoarthritis of the knee. The OKS was translated and culturally adapted according to the guidelines in the literature. Ninety-one patients (mean age: 55.89±7.85 years) with knee osteoarthritis participated in the study. Patients completed the Turkish version of the Oxford Knee Score (OKS-TR), Short-Form 36 Health Survey (SF-36), and Western Ontario and McMaster Universities Index (WOMAC) questionnaires. Internal consistency was tested using Cronbach's α coefficient. Patients completed the OKS-TR questionnaire twice in 7 days to determine the reproducibility. Correlation between the total results of both tests was determined by Spearman's correlation coefficient and intraclass correlation coefficients (ICC). Validity was assessed by calculating Spearman's correlation coefficient between the OKS, WOMAC, and SF-36 scores. Floor and ceiling effects were analyzed. Internal consistency was high (Cronbach's α: 0.90). The reproducibility tested by 2 different methods showed no significant difference (p>0.05). The construct validity analyses showed a significant correlation between the OKS and the other scores (p<0.05). There was no floor or ceiling effect in total OKS score. The OKS-TR is a reliable and valid measure for the self-assessment of pain and function in Turkish-speaking patients with osteoarthritis of the knee.
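Internal consistency in the abstract above is summarized with Cronbach's α. As a minimal sketch, the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total scores) can be computed as below. The function name and the toy response matrix are hypothetical and are not data from the study.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(scores: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha; scores[r][i] is respondent r's answer to item i."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data: 4 respondents, 3 items that move in lockstep.
toy = [
    [1, 2, 1],
    [2, 3, 2],
    [3, 4, 3],
    [4, 5, 4],
]
alpha = cronbach_alpha(toy)
print(f"alpha = {alpha:.3f}")
```

Because the three toy items are perfectly correlated with equal variance, this example reaches the upper bound of α; real questionnaires fall below it.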
Sebastião, Emerson; Sandroff, Brian M; Learmonth, Yvonne C; Motl, Robert W
2016-07-01
To examine the validity of the timed Up and Go (TUG) test as a measure of functional mobility in persons with multiple sclerosis (MS) by using a comprehensive framework based on construct validity (ie, convergent and divergent validity). Cross-sectional study. Hospital setting. Community-residing persons with MS (N=47). Not applicable. Main outcome measures included the TUG test, timed 25-foot walk test, 6-minute walk test, Multiple Sclerosis Walking Scale-12, Late-Life Function and Disability Instrument, posturography evaluation, Activities-specific Balance Confidence scale, Symbol Digits Modalities Test, Expanded Disability Status Scale, and the number of steps taken per day. The TUG test was strongly associated with other valid outcome measures of ambulatory mobility (Spearman rank correlation, rs=.71-.90) and disability status (rs=.80), moderately to strongly associated with balance confidence (rs=.66), and weakly associated with postural control (ie, balance) (rs=.31). The TUG test was moderately associated with cognitive processing speed (rs=.59), but not associated with other nonambulatory measures (ie, Late-Life Function and Disability Instrument-upper extremity function). Our findings support the validity of the TUG test as a measure of functional mobility. This warrants its inclusion in patients' assessment alongside other valid measures of functional mobility in both clinical and research practice in persons with MS. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Kashkouli, Mohsen Bahmani; Karimi, Nasser; Aghamirsalim, Mohamadreza; Abtahi, Mohammad Bagher; Nojomi, Marzieh; Shahrad-Bejestani, Hadi; Salehi, Masoud
2017-02-01
To determine the measurement properties of the Persian language version of the Graves orbitopathy quality of life questionnaire (GO-QOL). Following a systematic translation and cultural adaptation process, 141 consecutive unselected thyroid eye disease (TED) patients answered the Persian GO-QOL and underwent complete ophthalmic examination. The questionnaire was again completed by 60 patients on the second visit, 2-4 weeks later. Construct validity (cross-cultural validity, structural validity and hypotheses testing), reliability (internal consistency and test-retest reliability), and floor and ceiling effects of the Persian version of the GO-QOL were evaluated. Furthermore, Rasch analysis was used to assess its psychometric properties. Cross-cultural validity was established by back-translation techniques, committee review and pretesting techniques. Bi-dimensionality of the questionnaire was confirmed by factor analysis. Construct validity was also supported through confirmation of 6 out of 8 predefined hypotheses. Cronbach's α and intraclass correlation coefficient (ICC) were 0.650 and 0.859 for visual functioning and 0.875 and 0.896 for appearance subscale, respectively. Mean quality of life (QOL) scores for visual functioning and appearance were 78.18 (standard deviation, SD, 21.57) and 56.25 (SD 26.87), respectively. Person reliabilities from the Rasch rating scale model for both visual functioning and appearance revealed an acceptable internal consistency for the Persian GO-QOL. The Persian GO-QOL questionnaire is a valid and reliable tool with good psychometric properties in evaluation of Persian-speaking patients with TED. Applying Rasch analysis to future versions of the GO-QOL is recommended in order to perform tests for linearity between the estimated item measures in different versions.
Walsh, Jennifer R; Hebert, Angel; Byrd-Bredbenner, Carol; Carey, Gale; Colby, Sarah; Brown-Esters, Onikia N; Greene, Geoffrey; Hoerr, Sharon; Horacek, Tanya; Kattelmann, Kendra; Kidd, Tandalayo; Koenings, Mallory; Phillips, Beatrice; Shelnutt, Karla P; White, Adrienne A
2012-01-01
To develop and test the validity of the Behavior, Environment, and Changeability Survey (BECS) for identifying the importance and changeability of nutrition, exercise, and stress management behavior and related aspects of the environment. A cross-sectional, online survey of the BECS and selected validated instruments. Ten state universities. A convenience sample of college students (n = 1,283), ages 18-24 years. Principal component analysis was used to confirm a 6-component structure of the BECS in 2 independent samples for the purpose of cross-validation. Internal consistency was measured and construct and criterion-related analyses were conducted to test the reliability and validity of the BECS subscales. Six components representing 34 BECS items were revealed from the original 69 items and explained 64% of the total variance. Six scales were retained, and internal consistency of each ranged from α = .82 to .93. BECS Nutrition Behavior and Nutrition Changeability scale scores were highest for participants in action/maintenance Stages of Change for fruit and vegetable intake. There is strong support for the use of the BECS when planning health programs to gain insight into behavior that young adults are willing to improve, specifically related to nutrition, exercise, and sleep. Copyright © 2012 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Kim, Kyoung-Eun; Lim, Jae-Young
2011-01-01
The Roland-Morris Disability Questionnaire (RMDQ) is a reliable tool for evaluating disability in patients with back pain, but no Korean version has been published and validated. We developed a cross-culturally adapted Korean version of the RMDQ (RMDQ-K) and validated its use for assessing disability in Korean patients with low back pain. Two hundred thirty-one patients with low back pain were assessed using the RMDQ-K, visual analog scale (VAS) during rest and activity, and the Oswestry Disability Index (ODI). The results of 40 patients were used to evaluate the test-retest reliability. The correlations of the RMDQ-K with the VAS and ODI were used to assess validity. The reliability of the RMDQ-K estimated using the internal consistency reached a Cronbach's alpha of 0.893. Test-retest trials showed a high intraclass correlation coefficient of 0.837 (95% CI 0.833-0.953). The RMDQ-K was significantly correlated with the ODI (r=0.738) and VAS during rest (r=0.450) and activity (r=0.412). This study demonstrates that the RMDQ-K is a reliable, valid instrument for measuring disability in Korean patients with low back pain.
Kosugi, Eduardo Macoto; Chen, Vitor Guo; Fonseca, Viviane Maria Guerreiro da; Cursino, Milena Martins Pellogia; Mendes Neto, José Arruda; Gregório, Luís Carlos
2011-01-01
Quality of life questionnaires have been increasingly used in clinical trials to help establish the impact of medical intervention or to assess the outcome of health care services. Among disease-specific outcome measures, SNOT-22 was considered the most suitable tool for assessing chronic rhinosinusitis and patients with nasal polyps. To perform translation, cross-cultural adaptation and validation of the SNOT-22 to Brazilian Portuguese. Prospective study involving eighty-nine patients with chronic rhinosinusitis or nasal polyps submitted to functional endoscopic sinus surgery, who answered the questionnaire before and after surgery. Furthermore, 113 volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reliability, measure validity, responsiveness and clinical interpretability were assessed. Mean preoperative, postoperative and no sinonasal disease scores were 62.39, 23.09 and 11.42, respectively (p<0.0001); showing validity and responsiveness. Internal consistency was high (Cronbach's alpha = 0.9276). Reliability was sufficiently good, considering inter-interviewers (r=0.81) and intra-interviewers within a 10 to 14 day-interval (r=0.72). Surgery effect size was 1.55. Minimally important difference was 14 points; and scores up to 10 points were considered normal. The Brazilian Portuguese SNOT-22 version is a valid instrument to assess patients with chronic rhinosinusitis and nasal polyps.
Sebire, Simon J; Jago, Russell; Fox, Kenneth R; Edwards, Mark J; Thompson, Janice L
2013-09-26
Understanding children's physical activity motivation, its antecedents and associations with behavior is important and can be advanced by using self-determination theory. However, research among youth is largely restricted to adolescents and studies of motivation within certain contexts (e.g., physical education). There are no measures of self-determination theory constructs (physical activity motivation or psychological need satisfaction) for use among children and no previous studies have tested a self-determination theory-based model of children's physical activity motivation. The purpose of this study was to test the reliability and validity of scores derived from scales adapted to measure self-determination theory constructs among children and test a motivational model predicting accelerometer-derived physical activity. Cross-sectional data from 462 children aged 7 to 11 years from 20 primary schools in Bristol, UK were analysed. Confirmatory factor analysis was used to examine the construct validity of adapted behavioral regulation and psychological need satisfaction scales. Structural equation modelling was used to test cross-sectional associations between psychological need satisfaction, motivation types and physical activity assessed by accelerometer. The construct validity and reliability of the motivation and psychological need satisfaction measures were supported. Structural equation modelling provided evidence for a motivational model in which psychological need satisfaction was positively associated with intrinsic and identified motivation types and intrinsic motivation was positively associated with children's minutes in moderate-to-vigorous physical activity. The study provides evidence for the psychometric properties of measures of motivation aligned with self-determination theory among children. 
Children's motivation that is based on enjoyment and inherent satisfaction of physical activity is associated with their objectively-assessed physical activity and such motivation is positively associated with perceptions of psychological need satisfaction. These psychological factors represent potential malleable targets for interventions to increase children's physical activity.
Mota-Anaya, Evelin; Yumpo-Cárdenas, Daniel; Alva-Bravo, Edmundo; Wright-Nunes, Julie; Mayta-Tristán, Percy
2016-08-08
Chronic kidney disease (CKD) affects 50 million people globally. Several studies show the importance of implementing interventions that enhance patients' knowledge about their disease. In 2011 the Kidney Disease Knowledge Survey (KiKS) was developed: a questionnaire that assesses the specific knowledge about chronic kidney disease in pre-dialysis patients. To translate to Spanish, culturally adapt and validate the Kidney Disease Knowledge Survey questionnaire in a population of patients with pre-dialysis chronic kidney disease. We carried out a Spanish translation and cross-cultural adaptation of the Kidney Disease Knowledge Survey questionnaire. Subsequently, we determined its validity and reliability. We determined the validity through construct validity, and reliability by evaluating its internal consistency and its intra-observer reliability (test-retest). We found a good internal consistency (Kuder-Richardson = 0.85). The intra-observer reliability was measured by the intra-class correlation coefficient, which yielded a value of 0.78 (95% CI: 0.5-1.0). This value indicated a good reproducibility; the mean test-retest difference of -1.1 (SD 6.0; p = 0.369) also supports this finding. The translated Spanish version of the Kidney Disease Knowledge Survey is acceptable and equivalent to the original version; it also has a good reliability, validity and reproducibility. Therefore, it can be used in a population of patients with pre-dialysis chronic kidney disease.
The Spanish version of the Alberta Infant Motor Scale: Validity and reliability analysis.
Morales-Monforte, Erica; Bagur-Calafat, Caridad; Suc-Lerin, Neus; Fornaguera-Martí, Montserrat; Cazorla-Sánchez, Engracia; Girabent-Farrés, Montserrat
2017-02-01
Validity and reliability of the cross-cultural adaptive translation of the Alberta Infant Motor Scale (AIMS), to monitor gross motor development in infants from 0 to 18 months of age, were evaluated. A cross-cultural translation was used to generate a Spanish version of the AIMS. Fifty infants at risk or with diagnosis of motor delay, 0-18 months of age, participated in this study. Two independent physical therapists scored infants on the AIMS. Concurrent validity was tested using the AIMS and the Bayley Scales of Infant and Toddler Development - III (Bayley - III). Reliability and the internal consistency were high (ICCs ranged from 0.94 to 1.00 and KR-20 ranged from 0.90 to 0.98, respectively). AIMS and Bayley - III scores correlated strongly (r = 0.97). The Spanish version of the AIMS presented excellent validity and reliability. Further studies are suggested in order to assess the AIMS in preterm babies.
Cross-validating a bidimensional mathematics anxiety scale.
Haiyan Bai
2011-03-01
The psychometric properties of a 14-item bidimensional Mathematics Anxiety Scale-Revised (MAS-R) were empirically cross-validated with two independent samples consisting of 647 secondary school students. An exploratory factor analysis on the scale yielded strong construct validity with a clear two-factor structure. The results from a confirmatory factor analysis indicated an excellent model-fit (χ² = 98.32, df = 62; normed fit index = .92, comparative fit index = .97; root mean square error of approximation = .04). The internal consistency (.85), test-retest reliability (.71), interfactor correlation (.26, p < .001), and positive discrimination power indicated that MAS-R is a psychometrically reliable and valid instrument for measuring mathematics anxiety. Math anxiety, as measured by MAS-R, correlated negatively with student achievement scores (r = -.38), suggesting that MAS-R may be a useful tool for classroom teachers and other educational personnel tasked with identifying students at risk of reduced math achievement because of anxiety.
Godefroy, Olivier; Martinaud, Olivier; Verny, Marc; Mosca, Chrystèle; Lenoir, Hermine; Bretault, Eric; Devendeville, Agnès; Diouf, Momar; Pere, Jean-Jacques; Bakchine, Serge; Delabrousse-Mayoux, Jean-Philippe; Roussel, Martine
2016-01-01
The frequency of executive disorders in mild-to-moderate Alzheimer disease (AD) has been demonstrated by the application of a comprehensive battery. The present study analyzed data from 2 recent multicenter studies based on the same executive battery. The objective was to derive a shortened battery by using the GREFEX population as a training dataset and by cross-validating the results in the REFLEX population. A total of 102 AD patients of the GREFEX study (MMSE=23.2±2.9) and 72 patients of the REFLEX study (MMSE=20.8±3.5) were included. Tests were selected and receiver operating characteristic curves were generated relative to the performance of 780 controls from the GREFEX study. Stepwise logistic regression identified 3 cognitive tests (Six Elements Task, categorical fluency and Trail Making Test B error) and behavioral disorders globally referred to as global hypoactivity (P=0.0001, all). This shortened battery was as accurate as the entire GREFEX battery in diagnosing dysexecutive disorders in both the training group and the validation group. A bootstrap procedure confirmed the stability of the AUC. A shortened battery based on 3 cognitive tests and 3 behavioral domains provides high diagnostic accuracy for executive disorders in mild-to-moderate AD.
Shafeei, Asrin; Mokhtarinia, Hamid Reza; Maleki-Ghahfarokhi, Azam; Piri, Leila
2017-08-01
Observational study. To cross-culturally translate the Orebro Musculoskeletal Pain Screening Questionnaire (OMPQ) into Persian and then evaluate its psychometric properties (reliability, validity, and ceiling and floor effects). To the authors' knowledge, prior to this study there has been no validated instrument to screen the risk of chronicity in Persian-speaking patients with low back pain (LBP) in Iran. The OMPQ was specifically developed as a self-administered screening tool for assessing the risk of LBP chronicity. The forward-backward translation method was used for the translation and cross-cultural adaptation of the original questionnaire. In total, 202 patients with subacute LBP completed the OMPQ and the pain disability questionnaire (PDQ), which was used to assess convergent validity. Sixty-two patients completed the OMPQ a week later as a retest. Slight changes were made to the OMPQ during the translation/cultural adaptation process; face validity of the Persian version was obtained. The Persian OMPQ showed excellent test-retest reliability (intraclass correlation coefficient=0.89). Its internal consistency was 0.71, and its convergent validity was confirmed by a good correlation between the OMPQ and PDQ total scores (r=0.72, p<0.05). No ceiling or floor effects were observed. The Persian version of the OMPQ is acceptable for the target society in terms of face validity, construct validity, reliability, and consistency. It is therefore considered a useful instrument for screening Iranian patients with LBP.
Bellis, Teri James; Ross, Jody
2011-09-01
It has been suggested that, in order to validate a diagnosis of (C)APD (central auditory processing disorder), testing using direct cross-modal analogs should be performed to demonstrate that deficits exist solely or primarily in the auditory modality (McFarland and Cacace, 1995; Cacace and McFarland, 2005). This modality-specific viewpoint is controversial and not universally accepted (American Speech-Language-Hearing Association [ASHA], 2005; Musiek et al, 2005). Further, no such analogs have been developed to date, and neither the feasibility of such testing in normally functioning individuals nor the concurrent validity of cross-modal analogs has been established. The purpose of this study was to investigate the feasibility of cross-modal testing by examining the performance of normal adults and children on four tests of central auditory function and their corresponding visual analogs. In addition, this study investigated the degree to which concurrent validity of auditory and visual versions of these tests could be demonstrated. An experimental repeated measures design was employed. Participants consisted of two groups (adults, n=10; children, n=10) with normal and symmetrical hearing sensitivity, normal or corrected-to-normal visual acuity, and no family or personal history of auditory/otologic, language, learning, neurologic, or related disorders. Visual analogs of four tests in common clinical use for the diagnosis of (C)APD were developed (Dichotic Digits [Musiek, 1983]; Frequency Patterns [Pinheiro and Ptacek, 1971]; Duration Patterns [Pinheiro and Musiek, 1985]; and the Random Gap Detection Test [RGDT; Keith, 2000]). Participants underwent two 1 hr test sessions separated by at least 1 wk. Order of sessions (auditory, visual) and tests within each session were counterbalanced across participants. 
ANOVAs (analyses of variance) were used to examine effects of group, modality, and laterality (for the Dichotic/Dichoptic Digits tests) or response condition (for the auditory and visual Frequency Patterns and Duration Patterns tests). Pearson product-moment correlations were used to investigate relationships between auditory and visual performance. Adults performed significantly better than children on the Dichotic/Dichoptic Digits tests. Results also revealed a significant effect of modality, with auditory better than visual, and a significant modality×laterality interaction, with a right-ear advantage seen for the auditory task and a left-visual-field advantage seen for the visual task. For the Frequency Patterns test and its visual analog, results revealed a significant modality×response condition interaction, with humming better than labeling for the auditory version but the reversed effect for the visual version. For Duration Patterns testing, visual performance was significantly poorer than auditory performance. Due to poor test-retest reliability and ceiling effects for the auditory and visual gap-detection tasks, analyses could not be performed. No cross-modal correlations were observed for any test. Results demonstrated that cross-modal testing is at least feasible using easily accessible computer hardware and software. The lack of any cross-modal correlations suggests independent processing mechanisms for auditory and visual versions of each task. Examination of performance in individuals with central auditory and pan-sensory disorders is needed to determine the utility of cross-modal analogs in the differential diagnosis of (C)APD. American Academy of Audiology.
Vincent, Joshua Israel; Macdermid, Joy Christine; Grewal, Ruby; Sekar, Vincent Prabhakaran; Balachandran, Dinesh
2014-01-01
Prospective longitudinal validation study. To translate and cross-culturally adapt the Oswestry Disability Index (ODI) to the Tamil language (ODI-T), and to evaluate its reliability and construct validity. ODI is widely used as a disease-specific questionnaire in back pain patients to evaluate pain and disability. A thorough literature search revealed that the Tamil version of the ODI has not been previously published. The ODI was translated and cross-culturally adapted to the Tamil language according to established guidelines. Thirty subjects (16 women and 14 men) with a mean age of 42.7 years (S.D. 13.6; Range 22 - 69) with low back pain were recruited to assess the psychometric properties of the ODI-T Questionnaire. Patients completed the ODI-T, Roland-Morris disability questionnaire (RMDQ), VAS-pain and VAS-disability at baseline and 24-72 hours from the baseline visit. The ODI-T displayed a high degree of internal consistency, with a Cronbach's alpha of 0.92. The test-retest reliability was high (n=30), with an ICC of 0.92 (95% CI, 0.84 to 0.96) and scores a mean of 2.6 points lower on re-test. The ODI-T scores exhibited a strong correlation with the RMDQ scores (r = 0.82, p<0.01), VAS-P (r = 0.78, p<0.01) and VAS-D (r = 0.81, p<0.01). Moderate to low correlations were observed between the ODI-T and lumbar ROM (r = -0.27 to -0.53). All the hypotheses that were constructed a priori were supported. The Tamil version of the ODI Questionnaire is a valid and reliable tool that can be used to measure subjective outcomes of pain and disability in Tamil-speaking patients with low back pain.
Hravnak, Marilyn; Chen, Lujie; Dubrawski, Artur; Bose, Eliezer; Clermont, Gilles; Pinsky, Michael R
2016-12-01
Huge hospital information system databases can be mined for knowledge discovery and decision support, but artifact in stored non-invasive vital sign (VS) high-frequency data streams limits their use. We used machine-learning (ML) algorithms trained on expert-labeled VS data streams to automatically classify VS alerts as real or artifact, thereby "cleaning" such data for future modeling. 634 admissions to a step-down unit had recorded continuous noninvasive VS monitoring data [heart rate (HR), respiratory rate (RR), peripheral arterial oxygen saturation (SpO2) at 1/20 Hz, and noninvasive oscillometric blood pressure (BP)]. Periods in which data crossed stability thresholds defined VS event epochs. Data were divided into Block 1 as the ML training/cross-validation set and Block 2 as the test set. Expert clinicians annotated Block 1 events as perceived real or artifact. After feature extraction, ML algorithms were trained to create and validate models automatically classifying events as real or artifact. The models were then tested on Block 2. Block 1 yielded 812 VS events, with 214 (26%) judged by experts as artifact (RR 43%, SpO2 40%, BP 15%, HR 2%). ML algorithms applied to the Block 1 training/cross-validation set (tenfold cross-validation) gave area under the curve (AUC) scores of 0.97 RR, 0.91 BP and 0.76 SpO2. Performance when applied to Block 2 test data was AUC 0.94 RR, 0.84 BP and 0.72 SpO2. ML-defined algorithms applied to archived multi-signal continuous VS monitoring data allowed accurate automated classification of VS alerts as real or artifact, and could support data mining for future model building.
Hravnak, Marilyn; Chen, Lujie; Dubrawski, Artur; Bose, Eliezer; Clermont, Gilles; Pinsky, Michael R.
2015-01-01
PURPOSE Huge hospital information system databases can be mined for knowledge discovery and decision support, but artifact in stored non-invasive vital sign (VS) high-frequency data streams limits their use. We used machine-learning (ML) algorithms trained on expert-labeled VS data streams to automatically classify VS alerts as real or artifact, thereby “cleaning” such data for future modeling. METHODS 634 admissions to a step-down unit had recorded continuous noninvasive VS monitoring data (heart rate [HR], respiratory rate [RR], peripheral arterial oxygen saturation [SpO2] at 1/20 Hz, and noninvasive oscillometric blood pressure [BP]). Periods in which data crossed stability thresholds defined VS event epochs. Data were divided into Block 1 as the ML training/cross-validation set and Block 2 as the test set. Expert clinicians annotated Block 1 events as perceived real or artifact. After feature extraction, ML algorithms were trained to create and validate models automatically classifying events as real or artifact. The models were then tested on Block 2. RESULTS Block 1 yielded 812 VS events, with 214 (26%) judged by experts as artifact (RR 43%, SpO2 40%, BP 15%, HR 2%). ML algorithms applied to the Block 1 training/cross-validation set (10-fold cross-validation) gave area under the curve (AUC) scores of 0.97 RR, 0.91 BP and 0.76 SpO2. Performance when applied to Block 2 test data was AUC 0.94 RR, 0.84 BP and 0.72 SpO2. CONCLUSIONS ML-defined algorithms applied to archived multi-signal continuous VS monitoring data allowed accurate automated classification of VS alerts as real or artifact, and could support data mining for future model building. PMID:26438655
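The AUC scores quoted above summarize how well a classifier's scores rank real events above artifacts. As a rough illustration, the sketch below computes AUC from its pairwise-ranking definition (the probability that a randomly chosen positive scores higher than a randomly chosen negative, ties counting one half). The function name `roc_auc` and the labels and scores are made up for illustration and are unrelated to the study's models or data.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives.

    labels: 1 for the positive class (e.g. "real" alert), 0 for the negative class.
    scores: classifier outputs, higher meaning "more likely positive".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0      # positive correctly ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical held-out block: three positives, three negatives
test_labels = [1, 1, 1, 0, 0, 0]
test_scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
auc = roc_auc(test_labels, test_scores)
print(f"AUC on held-out block = {auc:.3f}")
```

In a design like the one described, a function of this kind would be applied once to cross-validated predictions on the training block and once to predictions on the untouched test block, so that the two numbers can be compared.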
ERIC Educational Resources Information Center
Douglas, Kevin S.; Guy, Laura S.; Edens, John F.; Boer, Douglas P.; Hamilton, Jennine
2007-01-01
The Personality Assessment Inventory's (PAI's) ability to predict psychopathic personality features, as assessed by the Psychopathy Checklist-Revised (PCL-R), was examined. To investigate whether the PAI Antisocial Features (ANT) Scale and subscales possessed incremental validity beyond other theoretically relevant PAI scales, optimized regression…
The early maximum likelihood estimation model of audiovisual integration in speech perception.
Andersen, Tobias S
2015-05-01
Speech perception is facilitated by seeing the articulatory mouth movements of the talker. This is due to perceptual audiovisual integration, which also causes the McGurk-MacDonald illusion, and for which a comprehensive computational account is still lacking. Decades of research have largely focused on the fuzzy logical model of perception (FLMP), which provides excellent fits to experimental observations but also has been criticized for being too flexible, post hoc and difficult to interpret. The current study introduces the early maximum likelihood estimation (MLE) model of audiovisual integration to speech perception along with three model variations. In early MLE, integration is based on a continuous internal representation before categorization, which can make the model more parsimonious by imposing constraints that reflect experimental designs. The study also shows that cross-validation can evaluate models of audiovisual integration based on typical data sets taking both goodness-of-fit and model flexibility into account. All models were tested on a published data set previously used for testing the FLMP. Cross-validation favored the early MLE while more conventional error measures favored more complex models. This difference between conventional error measures and cross-validation was found to be indicative of over-fitting in more complex models such as the FLMP.
Remelhe, Mafalda; Teixeira, Pedro M; Lopes, Irene; Silva, Luís; Correia de Sousa, Jaime
2017-01-12
Enabling patients with asthma to obtain the knowledge, confidence and skills they need in order to assume a major role in the management of their disease is cost effective. It should be an integral part of any plan for long-term control of asthma. The modified Patient Enablement Instrument (mPEI) is an easily administered questionnaire that was adapted in the United Kingdom to measure patient enablement in asthma, but its applicability in Portugal is not known. Validity and reliability of questionnaires should be tested before use in settings different from those of the original version. The purpose of this study was to test the applicability of the mPEI to Portuguese asthma patients after translation and cross-cultural adaptation, and to verify the structural validity, internal consistency and reproducibility of the instrument. The mPEI was translated to Portuguese and back translated to English. Its content validity was assessed by a debriefing interview with 10 asthma patients. The translated instrument was then administered to a random sample of 142 patients with persistent asthma. Structural validity and internal consistency were assessed. For reproducibility analysis, 86 patients completed the instrument again 7 days later. Item-scale correlations and exploratory factor analysis were used to assess structural validity. Cronbach's alpha was used to test internal consistency, and the intra-class correlation coefficient was used for the analysis of reproducibility. All items of the Portuguese version of the mPEI were found to be equivalent to the original English version. There were strong item-scale correlations that confirmed construct validity, with a one component structure and good internal consistency (Cronbach's alpha >0.8) as well as high test-retest reliability (ICC=0.85). 
The mPEI showed sound psychometric properties for the evaluation of enablement in patients with asthma making it a reliable instrument for use in research and clinical practice in Portugal. Further studies are needed to confirm its responsiveness.
Translation and evaluation of the Cultural Awareness Scale for Korean nursing students.
Oh, Hyunjin; Lee, Jung-ah; Schepp, Karen G
2015-02-20
To evaluate the effectiveness of a curriculum for achieving high levels of cultural competence, we need to be able to assess education intended to enhance cultural competency skills. We therefore translated the Cultural Awareness Scale (CAS) into Korean (CAS-K). The purpose of this study was to evaluate the cross-cultural applicability and psychometric properties of the CAS-K, specifically its reliability and validity. A cross-sectional descriptive design was used to conduct the evaluation. A convenience sample of 495 nursing students was recruited from four levels of nursing education within four universities in the city of Daejeon, South Korea. This study provided beginning evidence of the validity and reliability of the CAS-K and the cross-cultural applicability of the concepts underlying this instrument. Cronbach's alpha ranged between 0.59 and 0.86 (overall 0.89) in the tests of internal consistency. Cultural competency score prediction of the experience of travel abroad (r=0.084) and the perceived need for cultural education (r=0.223) suggested reasonable criterion validity. Five factors with eigenvalues >1.0 were extracted, accounting for 55.58% of the variance; two retained the same items previously identified for the CAS. The CAS-K demonstrated satisfactory validity and reliability in measuring cultural awareness in this sample of Korean nursing students. The revised CAS-K should be tested for its usability in curriculum evaluation and its applicability as a guide for teaching cultural awareness among groups of Korean nursing students.
Waples, Robin S
2010-07-01
Recognition of the importance of cross-validation ('any technique or instance of assessing how the results of a statistical analysis will generalize to an independent dataset'; Wiktionary, en.wiktionary.org) is one reason that the U.S. Securities and Exchange Commission requires all investment products to carry some variation of the disclaimer, 'Past performance is no guarantee of future results.' Even a cursory examination of financial behaviour, however, demonstrates that this warning is regularly ignored, even by those who understand what an independent dataset is. In the natural sciences, an analogue to predicting future returns for an investment strategy is predicting power of a particular algorithm to perform with new data. Once again, the key to developing an unbiased assessment of future performance is through testing with independent data--that is, data that were in no way involved in developing the method in the first place. A 'gold-standard' approach to cross-validation is to divide the data into two parts, one used to develop the algorithm, the other used to test its performance. Because this approach substantially reduces the sample size that can be used in constructing the algorithm, researchers often try other variations of cross-validation to accomplish the same ends. As illustrated by Anderson in this issue of Molecular Ecology Resources, however, not all attempts at cross-validation produce the desired result. Anderson used simulated data to evaluate performance of several software programs designed to identify subsets of loci that can be effective for assigning individuals to population of origin based on multilocus genetic data. Such programs are likely to become increasingly popular as researchers seek ways to streamline routine analyses by focusing on small sets of loci that contain most of the desired signal. 
Anderson found that although some of the programs made an attempt at cross-validation, all failed to meet the 'gold standard' of using truly independent data and therefore produced overly optimistic assessments of power of the selected set of loci--a phenomenon known as 'high grading bias.'
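The bias described above can be reproduced with a toy simulation: when the data used to select the "best" marker are reused to score it, even pure noise looks informative. The sketch below is an illustrative assumption-laden example (random binary features standing in for loci, random labels standing in for populations) and is not a reimplementation of Anderson's simulations or of any of the programs evaluated.

```python
import random


def accuracy(feature, labels):
    """Fraction of samples where a binary feature matches the binary label."""
    return sum(f == y for f, y in zip(feature, labels)) / len(labels)


def one_trial(rng, n_samples=40, n_features=200):
    """Select the best of many pure-noise features on one half of the data,
    then score it on the same half (reused) and on the other half (held out)."""
    labels = [rng.randint(0, 1) for _ in range(n_samples)]
    features = [[rng.randint(0, 1) for _ in range(n_samples)]
                for _ in range(n_features)]
    half = n_samples // 2
    best = max(features, key=lambda f: accuracy(f[:half], labels[:half]))
    reused = accuracy(best[:half], labels[:half])      # selection data reused
    held_out = accuracy(best[half:], labels[half:])    # truly independent data
    return reused, held_out


rng = random.Random(0)
trials = [one_trial(rng) for _ in range(50)]
mean_reused = sum(t[0] for t in trials) / len(trials)
mean_held_out = sum(t[1] for t in trials) / len(trials)
print(f"mean accuracy on selection data: {mean_reused:.2f}")
print(f"mean accuracy on held-out data:  {mean_held_out:.2f}")
```

Because every feature is noise, the held-out accuracy hovers around chance, while the accuracy measured on the selection data is inflated simply by picking the maximum of many noisy estimates.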
Wang, Xiao-Lan; Zhan, Ting-Ting; Zhan, Xian-Cheng; Tan, Xiao-Ying; Qu, Xiao-You; Wang, Xin-Yue; Li, Cheng-Rong
2014-01-01
The osmotic pressure of ammonium sulfate solutions has been measured by well-established freezing point osmometry in dilute solutions and by our recently reported air humidity osmometry over a much wider range of concentrations. Air humidity osmometry cross-validated the theoretical calculations of osmotic pressure based on the Pitzer model at high concentrations by two one-sided tests (TOST) of equivalence with multiple testing corrections, where no other experimental method could serve as a reference for comparison. Although stricter equivalence criteria were established between the measurements of freezing point osmometry and the calculations based on the Pitzer model at low concentrations, air humidity osmometry is the only currently available osmometry applicable to high concentrations and serves as an economical addition to standard osmometry.
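The TOST procedure mentioned above declares two quantities equivalent when the mean difference is shown to lie both above a lower margin and below an upper margin. The sketch below is a simplified illustration for paired differences using a normal (z) approximation, which is an assumption made to stay within the Python standard library; a t distribution would normally be preferred for small samples, and the function name, margin and numbers are all hypothetical rather than taken from the osmometry study.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def tost_paired_z(diffs, margin):
    """Two one-sided tests for equivalence of paired differences within +/- margin.

    Returns the larger of the two one-sided p-values; equivalence is claimed
    when this value is below the chosen alpha. Normal approximation (assumption).
    """
    se = stdev(diffs) / sqrt(len(diffs))
    m = mean(diffs)
    # Test 1: H0 mean <= -margin  vs  H1 mean > -margin
    p_lower = 1.0 - NormalDist().cdf((m + margin) / se)
    # Test 2: H0 mean >= +margin  vs  H1 mean < +margin
    p_upper = NormalDist().cdf((m - margin) / se)
    return max(p_lower, p_upper)


# Hypothetical differences between measured and calculated values
small_diffs = [0.01, -0.02, 0.015, -0.005, 0.0, 0.01, -0.01, 0.005]
p_equiv = tost_paired_z(small_diffs, margin=0.1)

# The same differences shifted well outside the margin
shifted_diffs = [d + 0.2 for d in small_diffs]
p_not_equiv = tost_paired_z(shifted_diffs, margin=0.1)

print(f"p (differences near zero)  = {p_equiv:.3g}")
print(f"p (differences near 0.2)   = {p_not_equiv:.3g}")
```

When several concentrations are tested, as in the abstract, each p-value would additionally be compared against a corrected threshold (for example alpha divided by the number of tests under a Bonferroni correction).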
Talip, Whadi-ah; Steyn, Nelia P; Visser, Marianne; Charlton, Karen E; Temple, Norman
2003-09-01
We wanted to develop and validate a test that assesses the knowledge and practices of health professionals (HPs) with regard to the role of nutrition, physical activity, and smoking cessation (lifestyle modification) in chronic diseases of lifestyle. A descriptive cross-sectional validation study was carried out. The validation design consisted of two phases, namely 1) test planning and development and 2) test evaluation. The study sample consisted of five groups of HPs: dietitians, dietetic interns, general practitioners, medical students, and nurses. The overall response rate was 58%, resulting in a sample size of 186 participants. A test was designed to evaluate the knowledge and practices of HPs. The test was first evaluated by an expert group to ensure content, construct, and face validity. Thereafter, the questionnaire was tested on five groups of HPs to test for criterion validity. Internal consistency was evaluated by Cronbach's alpha. An expert panel ensured content, construct, and face validity of the test. Groups with the most training and exposure to nutrition (dietitians and dietetic interns) had the highest group mean score, ranging from 61% to 88%, whereas those with limited nutrition training (general practitioners, medical students, and nurses) had significantly lower scores, ranging from 26% to 80%. This result demonstrated criterion validity. Internal consistency of the overall test demonstrated a Cronbach's alpha of 0.99. Most HPs identified the mass media as their main source of information on lifestyle modification. These HPs also identified lack of time, lack of patient compliance, and lack of knowledge as barriers that prevent them from providing counseling on lifestyle modification. The results of this study showed that this test instrument identifies groups of health professionals with adequate training (knowledge) in lifestyle modification and those who require further training (knowledge).
Treder, Maximilian; Lauermann, Jost Lennart; Eter, Nicole
2018-02-01
Our purpose was to use deep learning for the automated detection of age-related macular degeneration (AMD) in spectral domain optical coherence tomography (SD-OCT). A total of 1112 cross-section SD-OCT images of patients with exudative AMD and a healthy control group were used for this study. In the first step, an open-source multi-layer deep convolutional neural network (DCNN), which was pretrained with 1.2 million images from ImageNet, was trained and validated with 1012 cross-section SD-OCT scans (AMD: 701; healthy: 311). During this procedure training accuracy, validation accuracy and cross-entropy were computed. The open-source deep learning framework TensorFlow™ (Google Inc., Mountain View, CA, USA) was used to accelerate the deep learning process. In the last step, a created DCNN classifier, using the information of the above mentioned deep learning process, was tested in detecting 100 untrained cross-section SD-OCT images (AMD: 50; healthy: 50). Therefore, an AMD testing score was computed: 0.98 or higher was presumed for AMD. After an iteration of 500 training steps, the training accuracy and validation accuracies were 100%, and the cross-entropy was 0.005. The average AMD scores were 0.997 ± 0.003 in the AMD testing group and 0.9203 ± 0.085 in the healthy comparison group. The difference between the two groups was highly significant (p < 0.001). With a deep learning-based approach using TensorFlow™, it is possible to detect AMD in SD-OCT with high sensitivity and specificity. With more image data, an expansion of this classifier for other macular diseases or further details in AMD is possible, suggesting an application for this model as a support in clinical decisions. Another possible future application would involve the individual prediction of the progress and success of therapy for different diseases by automatically detecting hidden image information.
HA, Mei; QIAN, Xiaoling; YANG, Hong; HUANG, Jichun; LIU, Changjiang
2016-01-01
Background: The public’s cognition of stroke and responses to stroke symptoms are important to prevent complications and decrease mortality when stroke occurs. The aim of this study was to develop and validate the Chinese version of the Stroke Action Test (C-STAT) in a Chinese population. Methods: This study rigorously followed the published guideline for the translation, adaptation and validation of instruments for cross-cultural use in health care research. A cross-sectional study was performed among 328 stroke patients and family members in the Department of Neurology in the Second Hospital of Lanzhou University, Gansu province, China in 2014. Results: The Chinese version of the instrument showed favorable content equivalence with the source version. Values of Cronbach’s alpha and test-retest reliability of the C-STAT were 0.88 and 0.86, respectively. Principal component analysis supported a four-factor solution for the C-STAT. Criterion-related validity showed that the C-STAT was a significant predictor of the 7-item stroke symptom scores (R = 0.77; t = 21.74, P < 0.001). Conclusion: The C-STAT is an intelligible and brief psychometric tool to assess individuals’ knowledge of the appropriate responses to stroke symptoms in Chinese populations. It could also be used by health care providers to assess educational programs on stroke prevention. PMID:28053925
Cross-Cultural Detection of Depression from Nonverbal Behaviour.
Alghowinem, Sharifa; Goecke, Roland; Cohn, Jeffrey F; Wagner, Michael; Parker, Gordon; Breakspear, Michael
2015-05-01
Millions of people worldwide suffer from depression. Do commonalities exist in their nonverbal behavior that would enable cross-culturally viable screening and assessment of severity? We investigated the generalisability of an approach to detect depression severity cross-culturally using video-recorded clinical interviews from Australia, the USA and Germany. The material varied in type of interview, subtypes of depression and inclusion of healthy control subjects, cultural background, and recording environment. The analysis focussed on temporal features of participants' eye gaze and head pose. Several approaches to training and testing within and between datasets were evaluated. The strongest results were found for training across all datasets and testing across datasets using leave-one-subject-out cross-validation. In contrast, generalisability was attenuated when training on only one or two of the three datasets and testing on subjects from the dataset(s) not used in training. These findings highlight the importance of using training data exhibiting the expected range of variability.
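Leave-one-subject-out cross-validation, as used above, holds out every recording from one subject at a time so that no subject contributes to both training and testing. The following sketch only generates the index splits; the function name and the subject identifiers are hypothetical, and the classifier itself is deliberately left out.

```python
def leave_one_subject_out(subject_ids):
    """Yield (held_out_subject, train_indices, test_indices) for each subject.

    subject_ids: one identifier per sample, in sample order.
    """
    for subject in sorted(set(subject_ids)):
        test_idx = [i for i, s in enumerate(subject_ids) if s == subject]
        train_idx = [i for i, s in enumerate(subject_ids) if s != subject]
        yield subject, train_idx, test_idx


# Hypothetical sample-to-subject mapping pooled from three datasets
subject_ids = ["au01", "au01", "us01", "us01", "us01", "de01"]
splits = list(leave_one_subject_out(subject_ids))

for subject, train_idx, test_idx in splits:
    print(subject, "train:", train_idx, "test:", test_idx)
```

Holding out a whole dataset instead of a single subject follows the same pattern with dataset labels in place of subject identifiers, which corresponds to the cross-dataset setting in which the abstract reports attenuated generalisability.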
Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I
2018-04-01
Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Development of the Persian version of the Vertigo Symptom Scale: Validity and reliability
Kamalvand, Atefeh; Ghahraman, Mansoureh Adel; Jalaie, Shohreh
2017-01-01
Background: The Vertigo Symptom Scale (VSS) is a suitable instrument for assessing patient status, clarifying the symptoms, and examining the relative impact of vertigo and anxiety on reported handicap. Our aim was to translate and cross-culturally adapt the VSS into the Persian language (VSS-P) and to investigate its validity and reliability in patients with peripheral vestibular disorders. Materials and Methods: The VSS was translated into Persian. Cross-cultural adaptation was carried out on 101 patients with peripheral vestibular disorders and 34 participants with no history of vertigo. They completed the Persian versions of the VSS, dizziness handicap inventory (DHI), and Beck anxiety inventory (BAI). Internal, discriminant, and convergent validities, internal consistency, and test-retest reliability were determined. Results: The VSS-P showed good face validity. Internal validity was confirmed and demonstrated the presence of two subscales, vertigo (VSS-VER) and autonomic-anxiety (VSS-AA). For discriminant validity, a significant difference was found between the median scores of the patient and healthy groups (P <0.001). Convergent validity revealed a high correlation of both the BAI and the DHI with the VSS-P. Test-retest reliability was high, with intraclass correlation coefficients of 0.89, 0.86, and 0.91 for VSS-AA, VER, and VSS-P, respectively. The internal consistency was good, with Cronbach's alpha of 0.90 for the VER subscale, 0.86 for the VSS-AA subscale, and 0.92 for the overall VSS-P. Conclusion: The Persian version of the VSS could be used clinically as a valid and reliable tool. Thus, it is a key instrument to focus on the symptoms associated with dizziness. PMID:28616045
A Swedish cross-cultural adaptation and validation of the Tinnitus Functional Index.
Hoff, Maria; Kähäri, Kim
2017-04-01
The Tinnitus Functional Index (TFI) is a recent self-report instrument for tinnitus with potential advantages over other existing instruments, including a demonstrated high responsiveness. The objectives of this study were to translate and cross-culturally adapt the TFI into Swedish and to investigate its validity and reliability. The development of the Swedish version (TFI-SE) followed published guidelines on cross-cultural adaptation of health questionnaires. Validity and reliability were investigated by correlating responses on the TFI-SE with other tinnitus measures [Tinnitus Handicap Inventory (THI) and visual analogue scale (VAS)] and a scale measuring anxiety and depression (HADS). Consecutively recruited tinnitus patients (n = 100) from four Swedish clinics completed the questionnaires. The mean age of the sample was 51 years (SD = 17). The internal consistency of the TFI-SE was good (α = 0.95) and the test-retest reliability was high (ICC = 0.93). Our results supported the eight-factor structure proposed for the original TFI, and a high correlation between the TFI-SE and the THI (r = 0.8; p < 0.01) and lower correlations between the TFI-SE and the HADS-D (r = 0.60; p < 0.01) and HADS-A (r = 0.59; p < 0.01) confirmed satisfactory convergent and discriminant validity. We found that the Swedish translation and cross-cultural adaptation of the TFI is valid and reliable for use with adult tinnitus patients.
Cross-validation of an employee safety climate model in Malaysia.
Bahari, Siti Fatimah; Clarke, Sharon
2013-06-01
Whilst substantial research has investigated the nature of safety climate, and its importance as a leading indicator of organisational safety, much of this research has been conducted with Western industrial samples. The current study focuses on the cross-validation of a safety climate model in the non-Western industrial context of Malaysian manufacturing. The first-order factorial validity of Cheyne et al.'s (1998) [Cheyne, A., Cox, S., Oliver, A., Tomas, J.M., 1998. Modelling safety climate in the prediction of levels of safety activity. Work and Stress, 12(3), 255-271] model was tested, using confirmatory factor analysis, in a Malaysian sample. Results showed that the model fit indices were below accepted levels, indicating that the original Cheyne et al. (1998) safety climate model was not supported. An alternative three-factor model was developed using exploratory factor analysis. Although these findings are not consistent with previously reported cross-validation studies, we argue that previous studies have focused on validation across Western samples, and that the current study demonstrates the need to take account of cultural factors in the development of safety climate models intended for use in non-Western contexts. The results have important implications for the transferability of existing safety climate models across cultures (for example, in global organisations) and highlight the need for future research to examine cross-cultural issues in relation to safety climate. Copyright © 2013 National Safety Council and Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Kitterød, Nils-Otto
2017-08-01
Unconsolidated sediment cover thickness (D) above bedrock was estimated by using a publicly available well database from Norway, GRANADA. General challenges associated with such databases typically involve clustering and bias. However, if information about the horizontal distance to the nearest bedrock outcrop (L) is included, does the spatial estimation of D improve? This idea was tested by comparing two cross-validation results: ordinary kriging (OK) where L was disregarded; and co-kriging (CK) where cross-covariance between D and L was included. The analysis showed only minor differences between OK and CK with respect to differences between estimation and true values. However, the CK results gave in general less estimation variance compared to the OK results. All observations were declustered and transformed to standard normal probability density functions before estimation and back-transformed for the cross-validation analysis. The semivariogram analysis gave correlation lengths for D and L of approx. 10 and 6 km. These correlations reduce the estimation variance in the cross-validation analysis because more than 50 % of the data material had two or more observations within a radius of 5 km. The small-scale variance of D, however, was about 50 % of the total variance, which gave an accuracy of less than 60 % for most of the cross-validation cases. Despite the noisy character of the observations, the analysis demonstrated that L can be used as secondary information to reduce the estimation variance of D.
A statistical method (cross-validation) for bone loss region detection after spaceflight
Zhao, Qian; Li, Wenjun; Li, Caixia; Chu, Philip W.; Kornak, John; Lang, Thomas F.
2010-01-01
Astronauts experience bone loss after long spaceflight missions. Identifying specific regions that undergo the greatest losses (e.g. the proximal femur) could reveal information about the processes of bone loss in disuse and disease. Detecting such regions, however, remains an open problem. This paper focuses on statistical methods to detect such regions. We perform statistical parametric mapping to get t-maps of changes in images, and propose a new cross-validation method to select an optimum suprathreshold for forming clusters of pixels. Once these candidate clusters are formed, we use permutation testing of longitudinal labels to derive significant changes. PMID:20632144
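Permutation testing of longitudinal labels, mentioned above, asks how often a change as large as the observed one would arise if the pre-flight and post-flight labels were interchangeable within each subject. The sketch below is a generic paired permutation test that enumerates every within-subject label swap exactly; it is a simplified stand-in under stated assumptions, not the cluster-level procedure of the paper, and the function name and density values are invented for illustration.

```python
from itertools import product


def paired_permutation_p(pre, post):
    """Exact two-sided paired permutation test on the summed pre/post change.

    Swapping the pre and post labels of a subject flips the sign of that
    subject's difference, so all 2**n sign patterns are enumerated.
    """
    diffs = [b - a for a, b in zip(pre, post)]
    observed = abs(sum(diffs))
    extreme = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        stat = abs(sum(s * d for s, d in zip(signs, diffs)))
        if stat >= observed - 1e-12:   # small tolerance for float rounding
            extreme += 1
        total += 1
    return extreme / total


# Hypothetical mean values in one candidate cluster, 8 subjects
pre_flight = [1.00, 0.95, 1.10, 1.05, 0.98, 1.02, 0.97, 1.08]
post_flight = [0.93, 0.90, 1.01, 0.99, 0.90, 0.97, 0.91, 1.00]
p_value = paired_permutation_p(pre_flight, post_flight)
print(f"permutation p-value = {p_value:.4f}")
```

Exact enumeration is only practical for small samples; with many subjects, a random subset of label swaps is usually drawn instead.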
Alqarni, Ayidh M; Vennu, Vishal; Alshammari, Sulaiman A; Bindawas, Saad M
2018-01-01
Older adults are the fastest growing population group worldwide. Regular physical activity (PA) is reported to reduce the risk of health conditions and improve personal well-being. Few validated instruments can be used to measure the PA levels among older adults in Saudi Arabia. The Physical Activity Scale for the Elderly (PASE) is used worldwide for evaluating the PA levels of the elderly in epidemiological studies. However, this scale has not been translated into Arabic. This study aimed to cross-culturally adapt the PASE into the Arabic language and evaluate its reliability and validity among community-dwelling older adults in Saudi Arabia. This cross-sectional study followed the Beaton guidelines to translate and culturally adapt the PASE, and tested the reliability and validity of the Arabic version (PASE-A). Elderly people (N=74) of both genders, living in community dwellings in Riyadh city, were selected from several primary health care centers. Cronbach's alpha coefficient was used to assess internal consistency reliability, the intraclass correlation coefficient (ICC2,1) was used for test-retest reliability, and Spearman's rank correlation coefficient (r) was used to evaluate the correlation between PASE-A and grip strength, Timed Up and Go test, body mass index, and fat percentage. Out of 74 older adults, 59 (79.7%) completed the PASE-A questionnaire twice. The internal consistency of the PASE-A components was good (Cronbach's alpha 0.70-0.75), and the reliability of the components was excellent (ICC2,1 0.90-0.98). A higher PASE-A score was associated with higher grip strength (r=0.28, p=0.05) and with shorter Timed Up and Go test times (r=-0.45, p=0.01). The PASE-A was easy to use, understandable, and culturally relevant for Saudi older adults. This scale was a reliable and valid tool for evaluating and assessing the PA level among community-dwelling older adults in Saudi Arabia.
Dragomir-Daescu, Dan; Buijs, Jorn Op Den; McEligot, Sean; Dai, Yifei; Entwistle, Rachel C.; Salas, Christina; Melton, L. Joseph; Bennet, Kevin E.; Khosla, Sundeep; Amin, Shreyasee
2013-01-01
Clinical implementation of quantitative computed tomography-based finite element analysis (QCT/FEA) of proximal femur stiffness and strength to assess the likelihood of proximal femur (hip) fractures requires a unified modeling procedure, consistency in predicting bone mechanical properties, and validation with realistic test data that represent typical hip fractures, specifically, a sideways fall on the hip. We, therefore, used two sets (n = 9, each) of cadaveric femora with bone densities varying from normal to osteoporotic to build, refine, and validate a new class of QCT/FEA models for hip fracture under loading conditions that simulate a sideways fall on the hip. Convergence requirements of finite element models of the first set of femora led to the creation of a new meshing strategy and a robust process to model proximal femur geometry and material properties from QCT images. We used a second set of femora to cross-validate the model parameters derived from the first set. Refined models were validated experimentally by fracturing femora using specially designed fixtures, load cells, and high speed video capture. CT image reconstructions of fractured femora were created to classify the fractures. The predicted stiffness (cross-validation R2 = 0.87), fracture load (cross-validation R2 = 0.85), and fracture patterns (83% agreement) correlated well with experimental data. PMID:21052839
Marchese, C; Cristalli, G; Pichi, B; Manciocco, V; Mercante, G; Pellini, R; Marchesi, P; Sperduti, I; Ruscito, P; Spriano, G
2012-02-01
Shoulder syndrome after neck dissection is a well-known entity, but its incidence and the prognostic factors influencing recovery have not been clearly assessed because of the heterogeneity of possible evaluations. The University of California - Los Angeles (UCLA) Shoulder Scale, the Shoulder Pain and Disability Index (SPADI) and the Simple Shoulder Test (SST) are three English-language questionnaires commonly used to test shoulder impairment. Italian versions of these scales are not available. The aim of the present study was to translate, culturally adapt and validate Italian versions of the UCLA Shoulder Scale, SPADI and SST. Translation and cross-cultural adaptation of the SPADI, the UCLA Shoulder Scale and the SST were performed according to international guidelines. Sixty-six patients treated with neck dissection for head and neck cancer were asked to complete these scales. Forty patients completed the same questionnaires a second time one week after the first to test the reproducibility of the Italian versions. All the English-speaking Italian patients (n = 11) were asked to complete both the English and the Italian versions of the three questionnaires to validate the scales. No major problems regarding content or language were found during the translation of the 3 questionnaires. For all three scales, Cronbach's α was > 0.89. The Pearson correlation coefficient was r > 0.91. With respect to validity, there was a significant correlation between the Italian and the English versions of all three scales. This study shows that the Italian versions of the UCLA Shoulder Scale, SPADI and SST are valid instruments for the evaluation of shoulder dysfunction after neck dissection in Italian patients.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mbah, Chamberlain, E-mail: chamberlain.mbah@ugent.be; Department of Mathematical Modeling, Statistics, and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Ghent; Thierens, Hubert
Purpose: To identify the main causes underlying the failure of prediction models for radiation therapy toxicity to replicate. Methods and Materials: Data were used from two German cohorts, Individual Radiation Sensitivity (ISE) (n=418) and Mammary Carcinoma Risk Factor Investigation (MARIE) (n=409), of breast cancer patients with similar characteristics and radiation therapy treatments. The toxicity endpoint chosen was telangiectasia. The LASSO (least absolute shrinkage and selection operator) logistic regression method was used to build a predictive model for a dichotomized endpoint (Radiation Therapy Oncology Group/European Organization for the Research and Treatment of Cancer score 0, 1, or ≥2). Internal areas under the receiver operating characteristic curve (inAUCs) were calculated by a naïve approach whereby the training data (ISE) were also used for calculating the AUC. Cross-validation was also applied to calculate the AUC within the same cohort, a second type of inAUC. Internal AUCs from cross-validation were calculated within ISE and MARIE separately. Models trained on one dataset (ISE) were applied to a test dataset (MARIE) and AUCs calculated (exAUCs). Results: Internal AUCs from the naïve approach were generally larger than inAUCs from cross-validation owing to overfitting the training data. Internal AUCs from cross-validation were also generally larger than the exAUCs, reflecting heterogeneity in the predictors between cohorts. The best models with largest inAUCs from cross-validation within both cohorts had a number of common predictors: hypertension, normalized total boost, and presence of estrogen receptors. Surprisingly, the effect (coefficient in the prediction model) of hypertension on telangiectasia incidence was positive in ISE and negative in MARIE. Other predictors were also not common between the 2 cohorts, illustrating that overcoming overfitting does not solve the problem of replication failure of prediction models completely. 
Conclusions: Overfitting and cohort heterogeneity are the 2 main causes of replication failure of prediction models across cohorts. Cross-validation and similar techniques (eg, bootstrapping) cope with overfitting, but the development of validated predictive models for radiation therapy toxicity requires strategies that deal with cohort heterogeneity.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda.
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-12-02
The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluate the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-01-01
Background: The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. Methods: A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluate the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. Results: The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. Conclusion: This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda. PMID:19055716
Arnold, R; Ponnusamy, V; Zhang, C-Q; Gucciardi, D F
2017-08-01
Organizational stressors are a universal phenomenon which can be particularly prevalent and problematic for sport performers. In view of their global existence, it is surprising that no studies have examined cross-cultural differences in organizational stressors. One explanation for this is that the Organizational Stressor Indicator for Sport Performers (OSI-SP; Arnold, Fletcher, & Daniels, 2013), which can comprehensively measure the organizational pressures that sport performers have encountered, has not yet been translated from English into any other languages nor scrutinized cross-culturally. The first purpose of this study, therefore, was to examine the cross-cultural validity of the OSI-SP. In addition, the study aimed to test the equivalence of the OSI-SP's factor structure across cultures. British (n = 379), Chinese (n = 335), and Malaysian (n = 444) sport performers completed the OSI-SP. Confirmatory factor analyses confirmed the cross-cultural validity of the factorial model for the British and Malaysian samples; however, the overall model fit for the Chinese data did not meet all guideline values. Support was provided for the equality of factor loadings, variances, and covariances on the OSI-SP across the British and Malaysian cultures. These findings advance knowledge and understanding on the cross-cultural existence, conceptualization, and operationalization of organizational stressors. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Can we predict 4-year graduation in podiatric medical school using admission data?
Sesodia, Sanjay; Molnar, David; Shaw, Graham P
2012-01-01
This study examined the predictive ability of educational background and demographic variables, available at the admission stage, to identify applicants who will graduate in 4 years from podiatric medical school. A logistic regression model was used to identify two predictors of 4-year graduation: age at matriculation and total Medical College Admission Test score. The model was cross-validated using a second independent sample from the same population. Cross-validation gives greater confidence that the results could be more generally applied. Total Medical College Admission Test score was the strongest predictor of 4-year graduation, with age at matriculation being a statistically significant but weaker predictor. Despite the model's capacity to predict 4-year graduation better than random assignment, a sufficient amount of error in prediction remained, suggesting that important predictors are missing from the model. Furthermore, the high rate of false-positives makes it inappropriate to use age and Medical College Admission Test score as admission screens in an attempt to eliminate attrition by not accepting at-risk students.
WISESight : a multispectral smart video-track intrusion monitor.
DOT National Transportation Integrated Search
2015-05-01
International Electronic Machines Corporation (IEM) developed, tested, and validated a unique smart video-based intrusion monitoring system for use at highway-rail grade crossings. The system used both thermal infrared (IR) and visible/ne...
Stability and Change in Interests: A Longitudinal Study of Adolescents from Grades 8 through 12
ERIC Educational Resources Information Center
Tracey, Terence J. G.; Robbins, Steven B.; Hofsess, Christy D.
2005-01-01
The patterns of RIASEC interests and academic skills were assessed longitudinally from a large-scale national database at three time points: eighth grade, 10th grade, and 12th grade. Validation and cross-validation samples of 1000 males and 1000 females in each set were used to test the pattern of these scores over time relative to mean changes,…
Bouzubar, Fawzi F; Aljadi, Sameera H; Alotaibi, Naser M; Irrgang, James J
2018-07-01
The purpose of this study was to cross-culturally adapt the Knee Outcome Survey-Activities of Daily Living Scale into Arabic and to assess its psychometric properties (internal consistency, reliability, validity, and responsiveness) in patients with knee disorders. The cross-cultural adaptation process for the Knee Outcome Survey-Activities of Daily Living Scale into Arabic was performed in accordance with the published guidelines. The psychometric properties of this Arabic version were then evaluated. Participants completed this version three times: at baseline, 2-4 days later, and 4 weeks later. Correlations between the Arabic version of Knee Outcome Survey-Activities of Daily Living Scale and the Arabic version of the Short Form-36 Health Survey, Get Up and Go, and Ascending/Descending stairs tests were evaluated. Linguistic and cultural issues were addressed. The Arabic version of the Knee Outcome Survey-Activities of Daily Living Scale demonstrated excellent internal consistency (Cronbach's alpha = 0.97) and excellent test-retest reliability (intraclass correlation coefficient = 0.97). Construct validity of the Arabic version of the Knee Outcome Survey-Activities of Daily Living Scale with the Arabic version of Short Form-36 Health Survey subscales ranged from r = 0.28 to 0.53, p < 0.001. Criterion validity with the Get Up and Go and Ascending/Descending stairs tests ranged from r = -0.47 to -0.60, p < 0.01. This Arabic version was able to detect changes 4 weeks later (effect size = 1.12 and minimum clinically important difference = 14 points). The Arabic version of the Knee Outcome Survey-Activities of Daily Living Scale is a reliable, valid and responsive measure for assessing knee-related symptoms and functional limitations. Implications for rehabilitation: The Knee Outcome Survey-Activities of Daily Living Scale-Arabic is a reliable, valid and responsive measure for assessing knee-related functional limitations. 
This Arabic version can be used in clinical practice and for research purposes to assess symptoms and functional limitations in Arabic-speaking patients with knee disorders. This scale is responsive to track therapeutic outcome of Arabic-speaking patients with knee disorders.
Evidence of Validity for the Japanese Version of the Foot and Ankle Ability Measure
Uematsu, Daisuke; Suzuki, Hidetomo; Sasaki, Shogo; Nagano, Yasuharu; Shinozuka, Nobuyuki; Sunagawa, Norihiko; Fukubayashi, Toru
2015-01-01
Context: The Foot and Ankle Ability Measure (FAAM) is a valid, reliable, and self-reported outcome instrument for the foot and ankle region. Objective: To provide evidence for translation, cross-cultural adaptation, validity, and reliability of the Japanese version of the FAAM (FAAM-J). Design: Cross-sectional study. Setting: Collegiate athletic training/sports medicine clinical setting. Patients or Other Participants: Eighty-three collegiate athletes. Main Outcome Measure(s): All participants completed the Activities of Daily Living and Sports subscales of the FAAM-J and the Physical Functioning and Mental Health subscales of the Japanese version of the Short Form-36v2 (SF-36). Also, 19 participants (23%) whose conditions were expected to be stable completed another FAAM-J 2 to 6 days later for test-retest reliability. We analyzed the scores of those subscales for convergent and divergent validity, internal consistency, and test-retest reliability. Results: The Activities of Daily Living and Sports subscales of the FAAM-J had correlation coefficients of 0.86 and 0.75, respectively, with the Physical Functioning section of the SF-36 for convergent validity. For divergent validity, the correlation coefficients with Mental Health of the SF-36 were 0.29 and 0.27 for each subscale, respectively. Cronbach α for internal consistency was 0.99 for the Activities of Daily Living and 0.98 for the Sports subscale. A 95% confidence interval with a single measure was ±8.1 and ±14.0 points for each subscale. The test-retest reliability measures revealed intraclass correlation coefficient values of 0.87 for the Activities of Daily Living and 0.91 for the Sports subscales with minimal detectable changes of ±6.8 and ±13.7 for the respective subscales. Conclusions: The FAAM was successfully translated for a Japanese version, and the FAAM-J was adapted cross-culturally. 
Thus, the FAAM-J can be used as a self-reported outcome measure for Japanese-speaking individuals; however, the scores must be interpreted with caution, especially when applied to different populations and other types of injury than those included in this study. PMID:25310247
Amuzu-Aweh, E N; Bijma, P; Kinghorn, B P; Vereijken, A; Visscher, J; van Arendonk, J Am; Bovenhuis, H
2013-12-01
Prediction of heterosis has a long history with mixed success, partly due to low numbers of genetic markers and/or small data sets. We investigated the prediction of heterosis for egg number, egg weight and survival days in domestic white Leghorns, using ∼400 000 individuals from 47 crosses and allele frequencies on ∼53 000 genome-wide single nucleotide polymorphisms (SNPs). When heterosis is due to dominance, and dominance effects are independent of allele frequencies, heterosis is proportional to the squared difference in allele frequency (SDAF) between parental pure lines (not necessarily homozygous). Under these assumptions, a linear model including regression on SDAF partitions crossbred phenotypes into pure-line values and heterosis, even without pure-line phenotypes. We therefore used models where phenotypes of crossbreds were regressed on the SDAF between parental lines. Accuracy of prediction was determined using leave-one-out cross-validation. SDAF predicted heterosis for egg number and weight with an accuracy of ∼0.5, but did not predict heterosis for survival days. Heterosis predictions allowed preselection of pure lines before field-testing, saving ∼50% of field-testing cost with only 4% loss in heterosis. Accuracies from cross-validation were lower than from the model-fit, suggesting that accuracies previously reported in literature are overestimated. Cross-validation also indicated that dominance cannot fully explain heterosis. Nevertheless, the dominance model had considerable accuracy, clearly greater than that of a general/specific combining ability model. This work also showed that heterosis can be modelled even when pure-line phenotypes are unavailable. We concluded that SDAF is a useful predictor of heterosis in commercial layer breeding.
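The two computational steps named in this abstract, the squared difference in allele frequency (SDAF) and leave-one-out cross-validation of a regression on SDAF, can be sketched as follows. This is only an illustration under stated assumptions: SDAF is averaged over SNPs, the model is a one-predictor least-squares line, accuracy is taken as the correlation between held-out predictions and observations, and all numbers are invented. The authors' actual model is not given in the abstract.

```python
import math
from statistics import mean


def sdaf(freq_line_a, freq_line_b):
    """Mean squared difference in allele frequency over SNPs (averaging is an assumption)."""
    return mean((p - q) ** 2 for p, q in zip(freq_line_a, freq_line_b))


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def loo_predictions(x, y):
    """Predict each observation from a model fitted to all the others."""
    preds = []
    for i in range(len(x)):
        intercept, slope = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        preds.append(intercept + slope * x[i])
    return preds


def pearson(a, b):
    ma, mb = mean(a), mean(b)
    sab = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    saa = sum((ai - ma) ** 2 for ai in a)
    sbb = sum((bi - mb) ** 2 for bi in b)
    return sab / math.sqrt(saa * sbb)


# Hypothetical SDAF values for six crosses and their mean crossbred phenotypes
sdaf_per_cross = [0.02, 0.05, 0.03, 0.08, 0.06, 0.04]
phenotype = [301.0, 309.5, 303.2, 315.9, 311.0, 307.1]

held_out = loo_predictions(sdaf_per_cross, phenotype)
print("leave-one-out accuracy:", round(pearson(held_out, phenotype), 3))
```

Because each prediction comes from a model that never saw the held-out cross, the resulting accuracy is typically lower than the in-sample fit, which is the pattern the abstract reports.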
ERIC Educational Resources Information Center
Schneider, W. Joel; Roman, Zachary
2018-01-01
We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
Gender Fairness within the Force Concept Inventory
ERIC Educational Resources Information Center
Traxler, Adrienne; Henderson, Rachel; Stewart, John; Stewart, Gay; Papak, Alexis; Lindell, Rebecca
2018-01-01
Research on the test structure of the Force Concept Inventory (FCI) has largely ignored gender, and research on FCI gender effects (often reported as "gender gaps") has seldom interrogated the structure of the test. These rarely crossed streams of research leave open the possibility that the FCI may not be structurally valid across…
Burkey, Matthew D.; Ghimire, Lajina; Adhikari, Ramesh P.; Kohrt, Brandon A.; Jordans, Mark J. D.; Haroz, Emily; Wissow, Lawrence
2017-01-01
Systematic processes are needed to develop valid measurement instruments for disruptive behavior disorders (DBDs) in cross-cultural settings. We employed a four-step process in Nepal to identify and select items for a culturally valid assessment instrument: 1) We extracted items from validated scales and local free-list interviews. 2) Parents, teachers, and peers (n=30) rated the perceived relevance and importance of behavior problems. 3) Highly rated items were piloted with children (n=60) in Nepal. 4) We evaluated internal consistency of the final scale. We identified 49 symptoms from 11 scales, and 39 behavior problems from free-list interviews (n=72). After dropping items for low ratings of relevance and severity and for poor item-test correlation, low frequency, and/or poor acceptability in pilot testing, 16 items remained for the Disruptive Behavior International Scale—Nepali version (DBIS-N). The final scale had good internal consistency (α=0.86). A 4-step systematic approach to scale development including local participation yielded an internally consistent scale that included culturally relevant behavior problems. PMID:28093575
Rubio, Joaquín Salmerón; García-Delgado, Pilar; Ferreira, Paula Iglésias; Santos, Henrique Mateus; Martínez-Martínez, Fernando
2014-04-01
This study aimed to validate a questionnaire cross-culturally adapted into Portuguese, in five community pharmacies in Portugal. The discriminatory power of the items, content and construct validity, a factor analysis of the main components, and reliability and stability were determined. A high degree of semantic equivalence between the original questionnaire and the questionnaire cross-culturally adapted into Portuguese was observed. A Kaiser-Meyer-Olkin index of 0.550 was obtained, and the Bartlett sphericity test confirmed the adequacy of the data for the application of factor analysis (p < 0.0001). Three factors, which accounted for 52.6% of the total variability, were considered. With respect to reliability, the following results were obtained: 0.519 for Cronbach's alpha; 0.89 for Cohen's kappa coefficient; and 0.756 (CI = 0.598-0.963) for the intraclass correlation coefficient (ICC). This work produced the first adaptation to the Portuguese culture of a specific questionnaire for measuring the degree of knowledge patients have about their medication.
Bittante, G; Ferragina, A; Cipolat-Gotet, C; Cecchinato, A
2014-10-01
Cheese yield is an important technological trait in the dairy industry. The aim of this study was to infer the genetic parameters of some cheese yield-related traits predicted using Fourier-transform infrared (FTIR) spectral analysis and compare the results with those obtained using an individual model cheese-producing procedure. A total of 1,264 model cheeses were produced using 1,500-mL milk samples collected from individual Brown Swiss cows, and individual measurements were taken for 10 traits: 3 cheese yield traits (fresh curd, curd total solids, and curd water as a percent of the weight of the processed milk), 4 milk nutrient recovery traits (fat, protein, total solids, and energy of the curd as a percent of the same nutrient in the processed milk), and 3 daily cheese production traits per cow (fresh curd, total solids, and water weight of the curd). Each unprocessed milk sample was analyzed using a MilkoScan FT6000 (Foss, Hillerød, Denmark) over the spectral range from 5,000 to 900 cm⁻¹ (wavenumber). The FTIR spectrum-based prediction models for the previously mentioned traits were developed using modified partial least-square regression. Cross-validation of the whole data set yielded coefficients of determination between the predicted and measured values in cross-validation of 0.65 to 0.95 for all traits, except for the recovery of fat (0.41). A 3-fold external validation was also used, in which the available data were partitioned into 2 subsets: a training set (one-third of the herds) and a testing set (two-thirds). The training set was used to develop calibration equations, whereas the testing subsets were used for external validation of the calibration equations and to estimate the heritabilities and genetic correlations of the measured and FTIR-predicted phenotypes. 
The coefficients of determination between the predicted and measured values in cross-validation results obtained from the training sets were very similar to those obtained from the whole data set, but the coefficient of determination of validation values for the external validation sets were much lower for all traits (0.30 to 0.73), and particularly for fat recovery (0.05 to 0.18), for the training sets compared with the full data set. For each testing subset, the (co)variance components for the measured and FTIR-predicted phenotypes were estimated using bivariate Bayesian analyses and linear models. The intraherd heritabilities for the predicted traits obtained from our internal cross-validation using the whole data set ranged from 0.085 for daily yield of curd solids to 0.576 for protein recovery, and were similar to those obtained from the measured traits (0.079 to 0.586, respectively). The heritabilities estimated from the testing data set used for external validation were more variable but similar (on average) to the corresponding values obtained from the whole data set. Moreover, the genetic correlations between the predicted and measured traits were high in general (0.791 to 0.996), and they were always higher than the corresponding phenotypic correlations (0.383 to 0.995), especially for the external validation subset. In conclusion, we herein report that application of the cross-validation technique to the whole data set tended to overestimate the predictive ability of FTIR spectra, give more precise phenotypic predictions than the calibrations obtained using smaller data sets, and yield genetic correlations similar to those obtained from the measured traits. Collectively, our findings indicate that FTIR predictions have the potential to be used as indicator traits for the rapid and inexpensive selection of dairy populations for improvement of cheese yield, milk nutrient recovery in curd, and daily cheese production per cow. 
Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Development of estrogen receptor beta binding prediction model using large sets of chemicals.
Sakkiah, Sugunadevi; Selvaraj, Chandrabose; Gong, Ping; Zhang, Chaoyang; Tong, Weida; Hong, Huixiao
2017-11-03
We developed an ER β binding prediction model that, together with our previously developed ER α binding model, facilitates identification of chemicals that specifically bind ER β or ER α. Decision Forest was used to train the ER β binding prediction model on a large set of compounds obtained from EADB. Model performance was estimated through 1000 iterations of 5-fold cross-validation. Prediction confidence was analyzed using predictions from the cross-validations. Informative chemical features for ER β binding were identified by analyzing how frequently chemical descriptors were used in the models from the 5-fold cross-validations. 1000 permutations were conducted to assess chance correlation. The average accuracy of the 5-fold cross-validations was 93.14% with a standard deviation of 0.64%. Prediction confidence analysis indicated that the higher the prediction confidence, the more accurate the predictions. Permutation testing revealed that the prediction model is unlikely to have been generated by chance. Eighteen informative descriptors were identified as important to ER β binding prediction. Application of the prediction model to data from the ToxCast project yielded a very high sensitivity of 90-92%. Our results demonstrate that ER β binding of chemicals can be accurately predicted using the developed model. Coupled with our previously developed ER α prediction model, this model is expected to facilitate drug development through identification of chemicals that specifically bind ER β or ER α.
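The evaluation scheme described here, k-fold cross-validation followed by a label-permutation test for chance correlation, can be sketched in a few lines. This is a simplified illustration, not the authors' pipeline: a nearest-centroid rule on a single feature stands in for Decision Forest, the data are synthetic, and every function name is hypothetical.

```python
import random
from statistics import mean


def kfold_indices(n, k, rng):
    """Shuffle 0..n-1 and deal the indices into k folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def centroid_cv_accuracy(x, y, k, rng):
    """k-fold CV accuracy of a 1-D nearest-centroid classifier.

    Assumes every training split contains both classes, which holds for
    the balanced demo data below.
    """
    correct = 0
    for test_idx in kfold_indices(len(x), k, rng):
        held_out = set(test_idx)
        train = [i for i in range(len(x)) if i not in held_out]
        # Centroids are estimated from the training folds only
        c0 = mean(x[i] for i in train if y[i] == 0)
        c1 = mean(x[i] for i in train if y[i] == 1)
        for i in test_idx:
            pred = 1 if abs(x[i] - c1) < abs(x[i] - c0) else 0
            correct += int(pred == y[i])
    return correct / len(x)


def permutation_p_value(x, y, k=5, n_perm=200, seed=0):
    """Compare the observed CV accuracy with accuracies under shuffled labels."""
    rng = random.Random(seed)
    observed = centroid_cv_accuracy(x, y, k, rng)
    at_least_as_good = 0
    for _ in range(n_perm):
        shuffled = y[:]
        rng.shuffle(shuffled)  # breaks any real link between x and y
        if centroid_cv_accuracy(x, shuffled, k, rng) >= observed:
            at_least_as_good += 1
    # The +1 terms count the observed labelling as one of the permutations
    return observed, (at_least_as_good + 1) / (n_perm + 1)


# Synthetic, well-separated data: 20 samples per class (illustration only)
x_demo = [0.1 * i for i in range(20)] + [3.0 + 0.1 * i for i in range(20)]
y_demo = [0] * 20 + [1] * 20

observed, p_value = permutation_p_value(x_demo, y_demo)
print(f"CV accuracy = {observed:.2f}, permutation p = {p_value:.4f}")
```

A small p-value means that shuffled labels rarely reach the observed cross-validated accuracy, which is the sense in which a model is judged unlikely to have arisen by chance.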
Stoyanova, Rumyana; Dimova, Rositsa; Tarnovska, Miglena; Boeva, Tatyana
2018-05-20
Patient safety (PS) is one of the essential elements of health care quality and a priority of healthcare systems in most countries. Thus the creation of validated instruments and the implementation of systems that measure patient safety are considered to be of great importance worldwide. The present paper aims to illustrate the process of linguistic validation, cross-cultural verification and adaptation of the Bulgarian version of the Hospital Survey on Patient Safety Culture (B-HSOPSC) and its test-retest reliability. The study design is cross-sectional. The HSOPSC questionnaire consists of 42 questions, grouped in 12 different subscales that measure patient safety culture. Internal consistency was assessed using Cronbach's alpha. The Wilcoxon signed-rank test and the split-half method were used; the Spearman-Brown coefficient was calculated. The overall Cronbach's alpha for B-HSOPSC is 0.918. Subscales 7 (Staffing) and 12 (Overall perceptions of safety) had the lowest coefficients. The high reliability of the instrument was confirmed by the split-half method (0.97) and the ICC coefficient (0.95). The lowest values of the Spearman-Brown coefficient were found in items A13 and A14. The study offers an analysis of the results of the linguistic validation of the B-HSOPSC and its test-retest reliability. The psychometric characteristics of the questions revealed good validity and reliability, except for two questions. In the future, the instrument will be administered to the target population in the main study so that the psychometric properties of the instrument can be verified.
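The split-half method with the Spearman-Brown correction mentioned above can be illustrated with a short sketch. It assumes an odd/even item split and the standard correction r_sb = 2r / (1 + r); how the study split its 42 items is not stated in the abstract, and the demo responses are invented.

```python
import math
from statistics import mean


def pearson(a, b):
    ma, mb = mean(a), mean(b)
    sab = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    saa = sum((ai - ma) ** 2 for ai in a)
    sbb = sum((bi - mb) ** 2 for bi in b)
    return sab / math.sqrt(saa * sbb)


def spearman_brown(r_half):
    """Step a half-test correlation up to the reliability of the full-length test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(scores):
    """Odd/even split-half reliability (the split rule is an assumption)."""
    first_half = [sum(row[0::2]) for row in scores]   # items 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson(first_half, second_half))


# Hypothetical responses: 5 respondents x 4 items (illustration only)
demo_scores = [
    [4, 5, 4, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
]
print("split-half reliability:", round(split_half_reliability(demo_scores), 3))
```

The correction is needed because each half contains only half of the items, so the raw correlation between halves understates the reliability of the full questionnaire.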
Riley, Richard D; Ahmed, Ikhlaaq; Debray, Thomas P A; Willis, Brian H; Noordzij, J Pieter; Higgins, Julian P T; Deeks, Jonathan J
2015-06-15
Following a meta-analysis of test accuracy studies, the translation of summary results into clinical practice is potentially problematic. The sensitivity, specificity and positive (PPV) and negative (NPV) predictive values of a test may differ substantially from the average meta-analysis findings, because of heterogeneity. Clinicians thus need more guidance: given the meta-analysis, is a test likely to be useful in new populations, and if so, how should test results inform the probability of existing disease (for a diagnostic test) or future adverse outcome (for a prognostic test)? We propose ways to address this. Firstly, following a meta-analysis, we suggest deriving prediction intervals and probability statements about the potential accuracy of a test in a new population. Secondly, we suggest strategies on how clinicians should derive post-test probabilities (PPV and NPV) in a new population based on existing meta-analysis results and propose a cross-validation approach for examining and comparing their calibration performance. Application is made to two clinical examples. In the first example, the joint probability that both sensitivity and specificity will be >80% in a new population is just 0.19, because of a low sensitivity. However, the summary PPV of 0.97 is high and calibrates well in new populations, with a probability of 0.78 that the true PPV will be at least 0.95. In the second example, post-test probabilities calibrate better when tailored to the prevalence in the new population, with cross-validation revealing a probability of 0.97 that the observed NPV will be within 10% of the predicted NPV. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
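The point that post-test probabilities depend on the prevalence in the new population can be illustrated with the standard Bayes calculation of PPV and NPV from sensitivity, specificity, and prevalence. This is a minimal sketch of that textbook relationship only; it is not the authors' meta-analytic prediction-interval method, and the input values are hypothetical.

```python
def post_test_probabilities(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given disease prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Hypothetical test characteristics, evaluated at two prevalences
for prevalence in (0.5, 0.1):
    ppv, npv = post_test_probabilities(0.9, 0.8, prevalence)
    print(f"prevalence={prevalence:.1f}  PPV={ppv:.3f}  NPV={npv:.3f}")
```

With the same sensitivity and specificity, lowering the prevalence reduces the PPV and raises the NPV, which is why summary values from a meta-analysis may calibrate poorly in a population with a different prevalence.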
Naghdi, Soofia; Nakhostin Ansari, Noureddin; Farhadi, Yasaman; Ebadi, Safoora; Entezary, Ebrahim; Glazer, Douglas
2016-10-01
The aim of the present study was to develop and provide validation statistics for the Persian Injury-Psychological Readiness to Return to Sport scale (I-PRRS) following a cross-sectional and prospective cohort study design. The I-PRRS was forward/back-translated and culturally adapted into Persian language. The Persian I-PRRS was administered to 100 injured athletes (93 male; age 26.0 ± 5.6 years; time since injury 4.84 ± 6.4 months) and 50 healthy athletes (36 male; mean age 25.7 ± 6.0 years). The Persian I-PRRS was re-administered to 50 injured athletes at 1 week to examine test-retest reliability. There were no floor or ceiling effects confirming the content validity of Persian I-PRRS. The internal consistency reliability was good. Excellent test-retest reliability and agreement were demonstrated. The statistically significant difference in Persian I-PRRS total scores between the injured athletes and healthy athletes provides an evidence of discriminative validity. The Persian I-PRRS total scores were positively correlated with the Farsi Mood Scale (FARMS) total scores, showing construct validity. The principal component analysis indicated a two-factor solution consisting of "Confidence to play" and "Confidence in the injured body part and skill level". The Persian I-PRRS showed excellent reliability and validity and can be used to assess injured athletes' psychological readiness to return to sport among Persian-speaking populations.
Reproducibility and validity of a semi-quantitative FFQ for trace elements.
Lee, Yujin; Park, Kyong
2016-09-01
The aim of this study was to test the reproducibility and validity of a self-administered FFQ for the Trace Element Study of Korean Adults in the Yeungnam area (SELEN). Study subjects were recruited from the SELEN cohort selected from rural and urban areas in Yeungnam, Korea. A semi-quantitative FFQ with 146 items was developed considering the dietary characteristics of cohorts in the study area. In a validation study, seventeen men and forty-eight women aged 38-62 years completed 3-d dietary records (DR) and two FFQ over a 3-month period. The validity was examined with the FFQ and DR, and the reproducibility was estimated using partial correlation coefficients, the Bland-Altman method and cross-classification. There were no significant differences between the mean intakes of selected nutrients as estimated from FFQ1, FFQ2 and DR. The median correlation coefficients for all nutrients were 0·47 and 0·56 in the reproducibility and validity tests, respectively. Bland-Altman's index and cross-classification showed acceptable agreement between FFQ1 and FFQ2 and between FFQ2 and DR. Ultimately, 78 % of the subjects were classified into the same and adjacent quartiles for most nutrients. In addition, the weighted κ value indicated that the two methods agreed fairly. In conclusion, this newly developed FFQ was a suitable dietary assessment method for the SELEN cohort study.
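As an illustration of the agreement analysis mentioned here, the following sketch computes the usual Bland-Altman bias and 95% limits of agreement between two paired measurement series (for example FFQ1 and FFQ2 intakes of one nutrient). This assumes the common limits-of-agreement formulation; the abstract does not state how its "Bland-Altman's index" was defined, and the function name is hypothetical.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd
```

Agreement is usually judged by whether the interval between the two limits is narrow enough for the intended use, not by a significance test.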
Validity and reliability of the Tibetan version of s-EMBU for measuring parenting styles
Yangzong, Ciren; Lerkiatbundit, Sanguan; Luobu, Ouzhu; Cui, Chaoying; Liabsuetrakul, Tippawan; Kangzhuo, Baima; Quzong, Deji; Zhandui, Luobu; Zhen, Pu; Chongsuvivatwong, Virasakdi
2017-01-01
Parenting style experienced during childhood has profound effects on children’s futures. Scales developed in other countries have never been validated in the Tibetan context. The present study aimed to examine the construct validity and reliability of a Tibetan translation of the 23-item short form of the Egna Minnen Beträffande Uppfostran [One’s Memories of Upbringing] (s-EMBU) and to test the correlation between the parenting styles of fathers and mothers. A cross-sectional study was conducted in a sample of 847 students aged 12–21 years from Lhasa, Tibet, during September and October 2015 with a participation rate of 97.7%. The Tibetan translation of self-completed s-EMBU was administered. Confirmatory factor analysis was employed to test the scale’s validity on the first half of the sample and was then cross-validated with the second half of the sample. The final model consisted of six factors: three (rejection, emotional warmth, and overprotection) for each parent, equality constrained on factor loadings, factor correlations, and error variance between father and mother. Father–mother correlation coefficients ranged from 0.81 to 0.86, and the level of consistency ranged from 0.62 to 0.82. Thus, the slightly modified s-EMBU is suitable for use in the Tibetan culture where both the father and the mother have consistent parenting styles. PMID:28053560
NASA Astrophysics Data System (ADS)
Yan, Hong; Song, Xiangzhong; Tian, Kuangda; Chen, Yilin; Xiong, Yanmei; Min, Shungeng
2018-02-01
A novel method based on mid-infrared (MIR) spectroscopy, which enables the determination of Chlorantraniliprole in Abamectin within minutes, is proposed. We further evaluate the prediction ability of four wavelength selection methods: bootstrapping soft shrinkage approach (BOSS), Monte Carlo uninformative variable elimination (MCUVE), genetic algorithm partial least squares (GA-PLS) and competitive adaptive reweighted sampling (CARS). The results showed that the BOSS method obtained the lowest root mean squared error of cross-validation (RMSECV) (0.0245) and root mean squared error of prediction (RMSEP) (0.0271), as well as the highest coefficient of determination of cross-validation (Q²cv) (0.9998) and coefficient of determination of the test set (Q²test) (0.9989), which demonstrated that mid-infrared spectroscopy can be used to detect Chlorantraniliprole in Abamectin conveniently. Meanwhile, a suitable wavelength selection method (BOSS) is essential to conducting a component spectral analysis.
Cross-validation pitfalls when selecting and assessing regression and classification models.
Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon
2014-03-29
We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.
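The idea of repeated grid-search V-fold cross-validation can be sketched as follows. This is a toy illustration only, not the authors' algorithm or code: the model is an assumed one-feature ridge regression through the origin, the grid is a list of penalty values, and every function name is hypothetical. Each repeat draws a fresh random split, all grid values are scored on that same split, and the errors are averaged over repeats before a parameter is chosen.

```python
import random


def fit_ridge(xs, ys, lam):
    """One-feature ridge regression through the origin (toy model)."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def make_folds(n, v, rng):
    """Shuffle the indices 0..n-1 and deal them into v folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::v] for i in range(v)]


def cv_mse(xs, ys, lam, folds):
    """Mean squared prediction error over all held-out points."""
    sq_err = 0.0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in range(len(xs)) if i not in held_out]
        w = fit_ridge([xs[i] for i in train], [ys[i] for i in train], lam)
        sq_err += sum((ys[i] - w * xs[i]) ** 2 for i in fold)
    return sq_err / len(xs)


def repeated_grid_search(xs, ys, grid, v=5, repeats=20, seed=0):
    """Average V-fold CV error over several random splits, then pick the best."""
    rng = random.Random(seed)
    totals = {lam: 0.0 for lam in grid}
    for _ in range(repeats):
        folds = make_folds(len(xs), v, rng)  # same split for every grid value
        for lam in grid:
            totals[lam] += cv_mse(xs, ys, lam, folds)
    mean_err = {lam: total / repeats for lam, total in totals.items()}
    return min(mean_err, key=mean_err.get), mean_err
```

Averaging over repeats addresses the split-to-split variation that the abstract highlights. The averaged error of the selected parameter is still optimistically biased as an estimate of prediction error, which is the motivation for the separate (repeated) nested cross-validation step.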
Beckers, Laura; Speth, Lucianne; Rameckers, Eugène; Janssen-Potten, Yvonne
2017-07-01
To produce a Dutch translation of the Lifestyle Assessment Questionnaire for children with cerebral palsy (LAQ-CP), adapted for cross-cultural differences. The translation process consisted of 6 stages, following a guideline for cross-cultural adaptations including duplicate forward- and back-translations, expert group review, pilot-testing, and a process audit. Several adaptations to the questionnaire were required due to cross-cultural differences. As a result of the pilot-test, the layout was adapted to the desires of the users. The process auditor stated that the process had been comprehensive and valued the quality of the work. The project resulted in a Dutch translation of the LAQ-CP, adapted for cross-cultural differences. Validation of the translated questionnaire is required before use in clinical practice and research is recommended (Dutch abstract, Supplemental Digital Content 1, available at: http://links.lww.com/PPT/A164).
Tuomikoski, Anna-Maria; Ruotsalainen, Heidi; Mikkonen, Kristina; Miettunen, Jouko; Kääriäinen, Maria
2018-06-05
Mentors require competence at a diverse array of skills to mentor students during clinical practice. According to the latest evidence, competence at mentoring includes: knowledge, skills and attributes of individual students' learning objectives, core elements of nursing, learning processes, a reciprocal and trustful relationship, feedback, evaluation, cooperation with stakeholders, and the mentor's personal qualities. The purpose of the study was to test psychometric properties of a mentor's competence instrument developed to self-evaluate mentors' competence at mentoring nursing students in clinical practice. A cross-sectional, descriptive, explorative study design was used. Data were collected from mentors at five university hospitals in Finland in 2016. A total of 576 mentors participated in this study. The instrument was developed through systematic review, experts' evaluations, and pilot versions of the instrument tested in previous studies. The construct validity and reliability of the instrument were tested using exploratory factor analysis (EFA) with promax rotation and Cronbach's alpha. A 10-factor model showed that the instrument has acceptable construct validity. Cronbach's alpha values for the subscales observed ranged from 0.76 to 0.90. The instrument exhibited acceptable psychometric properties, thereby proving itself a valuable tool for evaluating mentors' competence at mentoring students. Further assessments of its reliability, validity and generality for measuring mentor's competence for mentoring students in different contexts and cultures are recommended. Copyright © 2018 Elsevier Ltd. All rights reserved.
Cross-cultural validity of four quality of life scales in persons with spinal cord injury
2010-01-01
Background: Quality of life (QoL) in persons with spinal cord injury (SCI) has been found to differ across countries. However, comparability of measurement results between countries depends on the cross-cultural validity of the applied instruments. The study examined the metric quality and cross-cultural validity of the Satisfaction with Life Scale (SWLS), the Life Satisfaction Questionnaire (LISAT-9), the Personal Well-Being Index (PWI) and the 5-item World Health Organization Quality of Life Assessment (WHOQoL-5) across six countries in a sample of persons with spinal cord injury (SCI). Methods: A cross-sectional multi-centre study was conducted and the data of 243 out-patients with SCI from study centers in Australia, Brazil, Canada, Israel, South Africa, and the United States were analyzed using Rasch-based methods. Results: The analyses showed high reliability for all 4 instruments (person reliability index .78-.92). Unidimensionality of measurement was supported for the WHOQoL-5 (χ² = 16.43, df = 10, p = .088), partially supported for the PWI (χ² = 15.62, df = 16, p = .480), but rejected for the LISAT-9 (χ² = 50.60, df = 18, p = .000) and the SWLS (χ² = 78.54, df = 10, p = .000) based on overall and item-wise χ² tests, principal components analyses and independent t-tests. The response scales showed the expected ordering for the WHOQoL-5 and the PWI, but not for the other two instruments. Using differential item functioning (DIF) analyses potential cross-country bias was found in two items of the SWLS and the WHOQoL-5, three items of the LISAT-9 and four items of the PWI. However, applying Rasch-based statistical methods, especially subtest analyses, it was possible to identify optimal strategies to enhance the metric properties and the cross-country equivalence of the instruments post-hoc. Following the post-hoc procedures the WHOQoL-5 and the PWI worked in a consistent and expected way in all countries.
Conclusions: QoL assessment using the summary scores of the WHOQoL-5 and the PWI appeared cross-culturally valid in persons with SCI. In contrast, summary scores of the LISAT-9 and the SWLS have to be interpreted with caution. The findings of the current study can be especially helpful to select instruments for international research projects in SCI. PMID:20815864
Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.
Sawers, Andrew; Hafner, Brian
2018-04-11
To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P<.001). The NBWT also had the largest area under the ROC curve (.81) and was the only test to exhibit an area that was statistically significantly >.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. 
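The area under the ROC curve used here to assess discriminant validity equals the probability that a randomly chosen member of the positive group scores higher than a randomly chosen member of the negative group, with ties counted as one half. The sketch below computes it that way. It is an illustration under that assumption, not the authors' analysis, and the function name is hypothetical; for a test such as the NBWT, where lower scores indicate worse balance, the groups or the score sign would need to be swapped.

```python
def roc_auc(positive_scores, negative_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    wins = sum(
        (p > n) + 0.5 * (p == n)
        for p in positive_scores
        for n in negative_scores
    )
    return wins / (len(positive_scores) * len(negative_scores))
```

An area of 0.50 corresponds to chance-level discrimination, which is the reference value the abstract compares each test against.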
Published by Elsevier Inc. All rights reserved.
Baldwin, Carol M.; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F.
2012-01-01
Study Objectives: To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Methods: Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version 18) software. Grade level readability of the sleep measure was evaluated. Results: All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors on both versions, supporting convergent validity. Items loaded on 4 factors accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. Conclusions: The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples.
Citation: Baldwin CM; Choi M; McClain DB; Celaya A; Quan SF. Spanish translation and cross-language validation of a Sleep Habits Questionnaire for use in clinical and research settings. J Clin Sleep Med 2012;8(2):137-146. PMID:22505858
Cross-cultural adaptation and validation of the Behcet's Disease Current Activity Form in Korea.
Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo
2015-09-01
This study was undertaken to perform a cross-cultural adaptation of the Behcet's Disease Current Activity Form (BDCAF, version 2006) questionnaire to the Korean language and to evaluate its reliability and validity in a population of Korean patients with Behcet's disease (BD). A cross-cultural study was conducted among patients with BD who attended our rheumatology clinic between November 2012 and March 2013. There were 11 males and 35 females in the group. The mean age of the participants was 48.5 years and the mean disease duration was 6.4 years. The first BDCAF questionnaire was completed on arrival and the second assessment was performed 20 minutes later by a different physician. The test-retest reliability was analyzed by computing κ statistics. Kappa scores of > 0.6 indicated a good agreement. To assess the validity, we compared the total BDCAF score with the patient's/clinician's perception of disease activity and the Korean version of the Behcet's Disease Quality of Life (BDQOL). For the test-retest reliability, good agreements were achieved on items such as headache, oral/genital ulceration, erythema, skin pustules, arthralgia, nausea/vomiting/abdominal pain, and diarrhea with altered/frank blood per rectum. Moderate agreement was observed for eye and nervous system involvement. We achieved a fair agreement for arthritis and major vessel involvement. Significant correlations were obtained between the total BDCAF score and both the BDQOL and the patient's/clinician's perception of disease activity (p < 0.05). The Korean version of the BDCAF is a reliable and valid instrument for measuring current disease activity in Korean BD patients.
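The κ statistic used for the two assessments can be illustrated with unweighted Cohen's kappa for two sets of categorical ratings of the same patients. This is a minimal sketch assuming the unweighted form; the abstract does not say whether a weighted variant was used, and the function name is hypothetical.

```python
from collections import Counter


def cohen_kappa(ratings_1, ratings_2):
    """Unweighted Cohen's kappa for two equal-length lists of categories."""
    n = len(ratings_1)
    observed = sum(a == b for a, b in zip(ratings_1, ratings_2)) / n
    counts_1, counts_2 = Counter(ratings_1), Counter(ratings_2)
    categories = set(ratings_1) | set(ratings_2)
    # Agreement expected by chance from each rater's marginal frequencies
    expected = sum(counts_1[c] * counts_2[c] for c in categories) / n**2
    return (observed - expected) / (1 - expected)
```

The function is undefined (division by zero) when both raters assign every case to the same single category, so such items need separate handling.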
Cross-validation and Peeling Strategies for Survival Bump Hunting using Recursive Peeling Methods
Dazard, Jean-Eudes; Choe, Michael; LeBlanc, Michael; Rao, J. Sunil
2015-01-01
We introduce a framework to build a survival/risk bump hunting model with a censored time-to-event response. Our Survival Bump Hunting (SBH) method is based on a recursive peeling procedure that uses a specific survival peeling criterion derived from non/semi-parametric statistics such as the hazards-ratio, the log-rank test or the Nelson-Aalen estimator. To optimize the tuning parameter of the model and validate it, we introduce an objective function based on survival or prediction-error statistics, such as the log-rank test and the concordance error rate. We also describe two alternative cross-validation techniques adapted to the joint task of decision-rule making by recursive peeling and survival estimation. Numerical analyses show the importance of replicated cross-validation and the differences between criteria and techniques in both low and high-dimensional settings. Although several non-parametric survival models exist, none addresses the problem of directly identifying local extrema. We show how SBH efficiently estimates extreme survival/risk subgroups unlike other models. This provides an insight into the behavior of commonly used models and suggests alternatives to be adopted in practice. Finally, our SBH framework was applied to a clinical dataset. In it, we identified subsets of patients characterized by clinical and demographic covariates with a distinct extreme survival outcome, for which tailored medical interventions could be made. An R package PRIMsrc (Patient Rule Induction Method in Survival, Regression and Classification settings) is available on CRAN (Comprehensive R Archive Network) and GitHub. PMID:27034730
Mohamad Marzuki, Muhamad Fadhil; Yaacob, Nor Azwany; Yaacob, Najib Majdi
2018-05-14
A mobile app is a programmed system designed to be used by a target user on a mobile device. The usability of such a system refers not only to the extent to which a product can be used to achieve the task that it was designed for, but also its effectiveness and efficiency, as well as user satisfaction. The System Usability Scale is one of the most commonly used questionnaires for assessing the usability of a system. The original 10-item version of the System Usability Scale was developed in English and thus needs to be adapted into local languages to assess the usability of mobile apps developed in other languages. The aim of this study is to translate and validate (with cross-cultural adaptation) the English System Usability Scale questionnaire into Malay, the main language spoken in Malaysia. The development of a translated version will allow the usability of mobile apps to be assessed in Malay. Forward and backward translation of the questionnaire was conducted by groups of Malay native speakers who spoke English as their second language. The final version was obtained after reconciliation and cross-cultural adaptation. The content of the Malay System Usability Scale questionnaire for mobile apps was validated by 10 experts in mobile app development. The efficacy of the questionnaire was further probed by testing the face validity on 10 mobile phone users, followed by reliability testing involving 54 mobile phone users. The content validity index was determined to be 0.91, indicating good relevancy of the 10 items used to assess the usability of a mobile app. Calculation of the face validity index resulted in a value of 0.94, therefore indicating that the questionnaire was easily understood by the users. Reliability testing showed a Cronbach alpha value of .85 (95% CI 0.79-0.91) indicating that the translated System Usability Scale questionnaire is a reliable tool for the assessment of usability of a mobile app.
The Malay System Usability Scale questionnaire is a valid and reliable tool to assess the usability of mobile apps in Malaysia. ©Muhamad Fadhil Mohamad Marzuki, Nor Azwany Yaacob, Najib Majdi Yaacob. Originally published in JMIR Human Factors (http://humanfactors.jmir.org), 14.05.2018.
Testing alternative ground water models using cross-validation and other methods
Foglia, L.; Mehl, S.W.; Hill, M.C.; Perona, P.; Burlando, P.
2007-01-01
Many methods can be used to test alternative ground water models. Of concern in this work are methods able to (1) rank alternative models (also called model discrimination) and (2) identify observations important to parameter estimates and predictions (equivalent to the purpose served by some types of sensitivity analysis). Some of the measures investigated are computationally efficient; others are computationally demanding. The latter are generally needed to account for model nonlinearity. The efficient model discrimination methods investigated include the information criteria: the corrected Akaike information criterion, Bayesian information criterion, and generalized cross-validation. The efficient sensitivity analysis measures used are dimensionless scaled sensitivity (DSS), composite scaled sensitivity, and parameter correlation coefficient (PCC); the other statistics are DFBETAS, Cook's D, and observation-prediction statistic. Acronyms are explained in the introduction. Cross-validation (CV) is a computationally intensive nonlinear method that is used for both model discrimination and sensitivity analysis. The methods are tested using up to five alternative parsimoniously constructed models of the ground water system of the Maggia Valley in southern Switzerland. The alternative models differ in their representation of hydraulic conductivity. A new method for graphically representing CV and sensitivity analysis results for complex models is presented and used to evaluate the utility of the efficient statistics. The results indicate that for model selection, the information criteria produce similar results at much smaller computational cost than CV. For identifying important observations, the only obviously inferior linear measure is DSS; the poor performance was expected because DSS does not include the effects of parameter correlation and PCC reveals large parameter correlations. © 2007 National Ground Water Association.
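To show why the information criteria are cheap compared with cross-validation, the sketch below evaluates the corrected Akaike and Bayesian information criteria from a single calibrated model. It assumes the common least-squares forms, which drop additive constants and take only the number of observations n, the sum of squared residuals and the number of estimated parameters k; the paper may use weighted residuals or a different parameter count, so the formulas and the function name are assumptions.

```python
import math


def information_criteria(sse, n, k):
    """Return (AICc, BIC) for a least-squares fit.

    sse: sum of squared residuals, n: observations, k: estimated parameters.
    Requires n > k + 1 for the small-sample correction term.
    """
    log_term = n * math.log(sse / n)
    aic = log_term + 2 * k
    aicc = aic + 2 * k * (k + 1) / (n - k - 1)
    bic = log_term + k * math.log(n)
    return aicc, bic
```

Each alternative model needs only one calibration run to obtain these values, and the model with the smallest criterion is preferred; cross-validation instead requires recalibrating every model once per omitted observation or fold.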
Cross-cultural adaptation and validation of the Korean version of the Oxford shoulder score.
Roh, Young Hak; Noh, Jung Ho; Kim, Woo; Oh, Joo Han; Gong, Hyun Sik; Baek, Goo Hyun
2012-01-01
The Oxford shoulder score (OSS) is being used increasingly and has been adapted cross-culturally in some Western countries. On the other hand, there are few validated translations of the OSS in Asian countries. This study translated and adapted cross-culturally the original OSS to produce a Korean version, and assessed the validity and reliability of the Korean version of the OSS (Korean OSS). One hundred and five patients with shoulder pain caused by degenerative or inflammatory disorders completed the Korean OSS and Korean disability of arm, shoulder and hand (DASH). In addition, the pain score by a visual analog scale (VAS) during activity and at rest, subjective assessment of activities of daily living (ADL), the active range of motion (ROM), and measurements of the abduction strength (strength) were included in the validation process. There were no major linguistic or cultural problems during the forward and backward translations of the OSS, except for a minor change due to cultural discrepancies in eating such as using a spoon and chopsticks by one dominant hand instead of a knife and fork by two hands. The internal consistency was high (Cronbach's alpha 0.91). The reproducibility test showed no significant difference (intraclass correlation coefficient 0.95). Construct validity, tested with the Pearson correlation coefficient, revealed a strong correlation (r > 0.6) between the Korean OSS and the DASH disability/symptom subscale, DASH work and ADL, as well as a moderate correlation (0.3 < r < 0.6) with DASH sports/music, strength, ROM, pain during activity and pain at rest. The Korean OSS proved to be valid by demonstrating a significant correlation with the patient-based upper extremity questionnaire and clinical assessment. The application and evaluation of the instrument is feasible and understandable among patients in Korea.
Cross-cultural adaptation and validation of the Korean version of the Michigan hand questionnaire.
Roh, Young Hak; Yang, Bo Kyu; Noh, Jung Ho; Baek, Goo Hyun; Song, Cheol Ho; Gong, Hyun Sik
2011-09-01
The Michigan hand questionnaire (MHQ) is increasingly being used and has been adapted cross-culturally in some Western and Asian countries, but the validation process for an Asian translation of MHQ has not been well described. In this study, we translated and adapted the original MHQ cross-culturally to produce a Korean version, and then assessed the validity and reliability of the Korean version of the MHQ. A total of 176 patients with common hand disorders completed the Korean version of the MHQ and the Disabilities of the Arm, Shoulder, and Hand questionnaire. We included the pain score assessed by a visual analog scale during activity, range of motion, measurement of grip strength, and subjective assessment of the functional state by use of Cooney's scale in the validation process. There were no major linguistic or cultural problems during forward and backward translations of the MHQ, except for a minor change owing to cultural discrepancies in eating, such as the dominant hand using a spoon and chopsticks instead of both hands using a knife and fork. All subscales of the MHQ showed satisfactory internal consistency. The reproducibility test showed no significant difference. The construct validity revealed a moderate to strong correlation between every subscale of the Korean MHQ against DASH disabilities and symptoms. The aesthetic and satisfaction domains, unique domains of the MHQ, had little correlation with the objective measure of the pain visual analog scale, grip strength, motion and subjective functional state. The Korean version of MHQ showed satisfactory internal consistency, test-retest reliability, and validity and demonstrated a significant correlation with the patient-based upper extremity questionnaire and clinical assessment. We found the application and evaluation of the instrument to be feasible and understandable among patients in Korea. Copyright © 2011 American Society for Surgery of the Hand. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Scott, Leslie K.; Hall, Lynne M.
2012-01-01
The purpose of this study was to test the reliability and validity of an acanthosis nigricans (AN) screening tool for use in elementary school-age children of different ethnic groups. Cross-sectional data were collected via observation of 288, 5- to 12-year-old school-age children. Three nurse clinicians used a 0-4 grade AN screening tool to rate…
El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M
2016-04-14
Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p < 0.001) and internal consistency (Cronbach's α = 0.88). It showed good criterion validity: children with negative behavior had significantly higher fear scores (t = 13.67, p < 0.001). It also showed moderate construct validity (Spearman's rho correlation, r = 0.53, p < 0.001). Factor analysis identified the following factors: "fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.
Cross-cultural adaptation of the Individual Work Performance Questionnaire.
Koopmans, Linda; Bernaards, Claire M; Hildebrandt, Vincent H; Lerner, Debra; de Vet, Henrica C W; van der Beek, Allard J
2015-01-01
The Individual Work Performance Questionnaire (IWPQ), measuring task performance, contextual performance, and counterproductive work behavior, was developed in The Netherlands. To cross-culturally adapt the IWPQ from the Dutch to the American-English language, and assess the questionnaire's internal consistency and content validity in the American-English context. A five stage translation and adaptation process was used: forward translation, synthesis, back-translation, expert committee review, and pilot-testing. During the pilot-testing, cognitive interviews with 40 American workers were performed, to examine the comprehensibility, applicability, and completeness of the American-English IWPQ. Questionnaire instructions were slightly modified to aid interpretation in the American-English language. Inconsistencies with verb tense were identified, and it was decided to consistently use simple past tense. The wording of five items was modified to better suit the American-English language. In general, participants were positive on the comprehensibility, applicability and completeness of the questionnaire during the pilot-testing phase. Furthermore, the study showed positive results concerning the internal consistency (Cronbach's alphas for the scales between 0.79-0.89) and content validity of the American-English IWPQ. The results indicate that the cross-cultural adaptation of the American-English IWPQ was successful and that the measurement properties of the translated version are promising.
Asmuri, Siti Noraini; Brown, Ted; Broom, Lisa J
2016-07-01
Valid translations of time use scales are needed by occupational therapists for use in different cross-cultural contexts to gather relevant data to inform practice and research. The purpose of this study was to describe the process of translating, adapting, and validating the Time Use Diary from its current English language edition into a Malay language version. Five steps of the cross-cultural adaptation process were completed: (i) translation from English into the Malay language by a qualified translator, (ii) synthesis of the translated Malay version, (iii) backtranslation from Malay to English by three bilingual speakers, (iv) expert committee review and discussion, and (v) pilot testing of the Malay language version with two participant groups. The translated version was found to be a reliable and valid tool identifying changes and potential challenges in the time use of older adults. This provides Malaysian occupational therapists with a useful tool for gathering time use data in practice settings and for research purposes.
A Decision Tree for Nonmetric Sex Assessment from the Skull.
Langley, Natalie R; Dudzik, Beatrix; Cloutier, Alesia
2018-01-01
This study uses five well-documented cranial nonmetric traits (glabella, mastoid process, mental eminence, supraorbital margin, and nuchal crest) and one additional trait (zygomatic extension) to develop a validated decision tree for sex assessment. The decision tree was built and cross-validated on a sample of 293 U.S. White individuals from the William M. Bass Donated Skeletal Collection. Ordinal scores from the six traits were analyzed using the partition modeling option in JMP Pro 12. A holdout sample of 50 skulls was used to test the model. The most accurate decision tree includes three variables: glabella, zygomatic extension, and mastoid process. This decision tree yielded 93.5% accuracy on the training sample, 94% on the cross-validated sample, and 96% on a holdout validation sample. Linear weighted kappa statistics indicate acceptable agreement among observers for these variables. Mental eminence should be avoided, and definitions and figures should be referenced carefully to score nonmetric traits. © 2017 American Academy of Forensic Sciences.
Tsugawa, Yusuke; Ohbu, Sadayoshi; Cruess, Richard; Cruess, Sylvia; Okubo, Tomoya; Takahashi, Osamu; Tokuda, Yasuharu; Heist, Brian S; Bito, Seiji; Itoh, Toshiyuki; Aoki, Akiko; Chiba, Tsutomu; Fukui, Tsuguya
2011-08-01
Despite the growing importance of and interest in medical professionalism, there is no standardized tool for its measurement. The authors sought to verify the validity, reliability, and generalizability of the Professionalism Mini-Evaluation Exercise (P-MEX), a previously developed and tested tool, in the context of Japanese hospitals. A multicenter, cross-sectional evaluation study was performed to investigate the validity, reliability, and generalizability of the P-MEX in seven Japanese hospitals. In 2009-2010, 378 evaluators (attending physicians, nurses, peers, and junior residents) completed 360-degree assessments of 165 residents and fellows using the P-MEX. The content validity and criterion-related validity were examined, and the construct validity of the P-MEX was investigated by performing confirmatory factor analysis through a structural equation model. The reliability was tested using generalizability analysis. The contents of the P-MEX achieved good acceptance in a preliminary working group, and the poststudy survey revealed that 302 (79.9%) evaluators rated the P-MEX items as appropriate, indicating good content validity. The correlation coefficient between P-MEX scores and external criteria was 0.78 (P < .001), demonstrating good criterion-related validity. Confirmatory factor analysis verified high path coefficient (0.60-0.99) and adequate goodness of fit of the model. The generalizability analysis yielded a high dependability coefficient, suggesting good reliability, except when evaluators were peers or junior residents. Findings show evidence of adequate validity, reliability, and generalizability of the P-MEX in Japanese hospital settings. The P-MEX is the only evaluation tool for medical professionalism verified in both a Western and East Asian cultural context.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bishop, G. R.; Bernheim, M.
1963-06-01
For the (γ,n) reaction in Li⁶, a model in which Li⁶ splits into a deuteron and an alpha particle that separately absorb the photon energy was recently proposed. The model was tested by studying the inelastic scattering of 101.4-MeV electrons from Li⁶. Expressions for the cross sections were obtained, and values calculated for a form factor in the cross sections confirm the validity of the model.
ERIC Educational Resources Information Center
Wombacher, Jorg; Tagg, Stephen K.; Burgi, Thomas; MacBryde, Jillian
2010-01-01
In this article, the authors present a German Sense of Community (SOC) Scale for use in military settings. The scale is based on the translation and field-testing of an existing U.S.-based measure of neighborhood SOC (Peterson, Speer, & McMillan, 2008). The methodological intricacies underlying cross-cultural scale development are highlighted, as…
Baldwin, Carol M; Choi, Myunghan; McClain, Darya Bonds; Celaya, Alma; Quan, Stuart F
2012-04-15
To translate, back-translate and cross-language validate (English/Spanish) the Sleep Heart Health Study Sleep Habits Questionnaire for use with Spanish-speakers in clinical and research settings. Following rigorous translation and back-translation, this cross-sectional cross-language validation study recruited bilingual participants from academic, clinic, and community-based settings (N = 50; 52% women; mean age 38.8 ± 12 years; 90% of Mexican heritage). Participants completed English and Spanish versions of the Sleep Habits Questionnaire, the Epworth Sleepiness Scale, and the Acculturation Rating Scale for Mexican Americans II one week apart in randomized order. Psychometric properties were assessed, including internal consistency, convergent validity, scale equivalence, language version intercorrelations, and exploratory factor analysis using PASW (Version 18) software. Grade level readability of the sleep measure was evaluated. All sleep categories (duration, snoring, apnea, insomnia symptoms, other sleep symptoms, sleep disruptors, restless legs syndrome) showed Cronbach α, Spearman-Brown coefficients and intercorrelations ≥ 0.700, suggesting robust internal consistency, correlation, and agreement between language versions. The Epworth correlated significantly with snoring, apnea, sleep symptoms, restless legs, and sleep disruptors on both versions, supporting convergent validity. Items loaded on 4 factors that accounted for 68% and 67% of the variance on the English and Spanish versions, respectively. The Spanish-language Sleep Habits Questionnaire demonstrates conceptual and content equivalency. It has appropriate measurement properties and should be useful for assessing sleep health in community-based clinics and intervention studies among Spanish-speaking Mexican Americans. Both language versions showed readability at the fifth grade level. Further testing is needed with larger samples.
Sharma, Saurab; Palanchoke, Joshna; Reed, Darren; Haxby Abbott, J
2017-12-04
Pain intensity and patients' impression of global improvement are widely used patient-reported outcome measures (PROMs) in clinical practice and research. They are commonly assessed using the Numerical Pain Rating Scale (NPRS) and Global Rating of Change (GROC) questionnaires. The GROC is essential as an anchor for evaluating the psychometric properties of PROMs. Both of these PROMs have been translated into many languages and have shown excellent psychometric properties. Their availability in Nepali would facilitate pain research and cross-cultural comparison of research findings. Therefore, the objectives of this study were to translate and cross-culturally adapt the NPRS and GROC into Nepali and to assess the psychometric properties of the Nepali version of the NPRS (NPRS-NP). After translating and cross-culturally adapting the NPRS and GROC into Nepali using recommended guidelines, the NPRS-NP was administered twice to 104 individuals with musculoskeletal pain. The Nepali version of the GROC (GROC-NP) was administered at the follow-up for anchor-based assessment. (1) Test-retest reliability and minimum detectable change (MDC) among the stable group, (2) construct validity (by single sample t-test within the improved group and independent sample t-test between groups), and (3) concurrent validity were assessed. Receiver operating characteristic (ROC) curves were plotted to determine the responsiveness of the NPRS-NP using the area under the curve (AUC), and minimum important changes (MIC) for small, medium and large improvements. Significant cultural adaptations were required to obtain relevant Nepali versions of both the NPRS and GROC. The NPRS-NP showed excellent test-retest reliability and an MDC of 1.13 points. The NPRS-NP demonstrated good construct validity, with a significant within-group difference in mean NPRS score (t(63) = 7.57, P < 0.001) and a statistically significant difference in mean score between the stable and improved groups (t(98) = -4.24, P < 0.001). It demonstrated moderate concurrent correlation with the GROC-NP (r = 0.43, P < 0.01). Responsiveness of the NPRS-NP was shown at three levels with AUC = 0.68-0.82, and MIC = 1.17-1.33. The NPRS and GROC were successfully translated and culturally adapted into Nepali. The NPRS-NP demonstrated good reliability, validity and responsiveness in assessing musculoskeletal pain intensity in a Nepali population.
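The record above reports a minimum detectable change (MDC) derived from test-retest data in a stable group. The abstract does not state which formula was used, so the sketch below assumes the commonly used pair SEM = SD · sqrt(1 - ICC) and MDC95 = 1.96 · sqrt(2) · SEM. The function names and the input values are hypothetical and do not reproduce the study's data.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and a reliability coefficient.

    Assumed formula: SEM = SD * sqrt(1 - ICC).
    """
    if not 0.0 <= icc <= 1.0:
        raise ValueError("ICC must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimum detectable change at the 95% level (assumed: 1.96 * sqrt(2) * SEM)."""
    return 1.96 * math.sqrt(2.0) * sem


# Hypothetical inputs: SD of 1.5 scale points and ICC of 0.90.
sem = sem_from_icc(sd=1.5, icc=0.90)
print(f"SEM = {sem:.3f}, MDC95 = {mdc95(sem):.3f}")
```

Under these assumptions, a change between two administrations that is smaller than the MDC cannot be distinguished from measurement error.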
Heo, K H; Squires, J; Yovanoff, P
2008-03-01
Accurate and efficient developmental screening measures are critical for early identification of developmental problems; however, few reliable and valid tests are available in Korea as well as other countries outside the USA. The Ages and Stages Questionnaires (ASQ) was chosen for study with young children in Korea. The ASQ was translated into Korean and necessary cross-cultural adaptations were made. The translated version was then distributed and completed by 3220 parents of young children between the ages of 4 months and 5 years. Reliability was studied including domain correlations, internal consistency, and performance of identification cut-off scores for the Korean population. Rasch analyses including tests of Differential Item Functioning, contrasting Korean and US samples were also performed. In general, internal consistency of the Korean ASQ was high, with overall correlations 0.75 for communication, 0.85 for gross motor, 0.74 for fine motor, 0.72 for problem solving, and 0.65 for personal-social. Validity, including concurrent validity, also had strong evidence. Mean scores of children on the Korean translation of the ASQ and the US normative sample were generally similar. Rasch analyses indicated the majority of items functioned similarly across the Korean sample. In general, the ASQ was translated with cultural appropriateness in mind and functioned as a valid and reliable parent-completed screening test to assist in early identification of young children with developmental delays. Further research is needed to confirm these results with a larger and more diverse Korean sample.
Kuo, Shu-Fen; Chang, Wen-Yin; Chang, Lu-I; Chou, Yu-Hua; Chen, Ching-Min
2013-01-01
This is a report of the development and psychometric testing of the East Asian Acculturation Measure-Chinese version (EAAM-C) scale. An instrument validation design with a cross-sectional survey was used. The process was carried out in two phases. In Phase 1, Barry's East Asian Acculturation Measure was translated and back-translated to evaluate its content validity, face validity, and feasibility. In Phase 2, the 16-item EAAM-C was pilot-tested among 485 female immigrants for test-retest reliability, internal consistency, theoretically supported construct validity and concurrent validity. The pilot work and the survey results indicated that the tool possessed adequate content and face validity. Cronbach's alpha was 0.72 for the EAAM-C and 0.76-0.79 for its subscales, and the test-retest correlation (at 3 weeks) was 0.75. After dropping one item, four theoretically supported factors, which explained 61.82% of the variance, were extracted using exploratory factor analysis: assimilation, integration, separation, and marginalization. Based on the underlying four-factor theoretical structure of the EAAM, the EAAM-C was further examined with confirmatory factor analysis. The analysis revealed that the four-factor model was an acceptable fit for the data, supporting the construct validity of the scale. These factors were inter-correlated, and showed statistically significant correlation with the Chinese Health Questionnaire, indicating adequate concurrent validity. The scale shows acceptable validity and consistency, and suggests that immigrant acculturation is a complex construct. This quick evaluation instrument can be applied to assess clients' acculturation and to further develop interventions to improve their health.
Sotardi, Valerie A
2018-05-01
Educational measures of anxiety focus heavily on students' experiences with tests yet overlook other assessment contexts. In this research, two brief multiscale questionnaires were developed and validated to measure trait evaluation anxiety (MTEA-12) and state evaluation anxiety (MSEA-12) for use in various assessment contexts in non-clinical, educational settings. The research included a cross-sectional analysis of self-report data using authentic assessment settings in which evaluation anxiety was measured. Instruments were tested using a validation sample of 241 first-year university students in New Zealand. Scale development included component structures for state and trait scales based on existing theoretical frameworks. Analyses using confirmatory factor analysis and descriptive statistics indicate that the scales are reliable and structurally valid. Multivariate general linear modeling using subscales from the MTEA-12, MSEA-12, and student grades suggests adequate criterion-related validity. Initial predictive validity was supported: one relevant MTEA-12 factor explained between 21% and 54% of the variance in three MSEA-12 factors. Results document MTEA-12 and MSEA-12 as reliable measures of trait and state dimensions of evaluation anxiety for test and writing contexts. Initial estimates suggest that the scales have promising validity, and recommendations for further validation are outlined.
International Harmonization and Cooperation in the Validation of Alternative Methods.
Barroso, João; Ahn, Il Young; Caldeira, Cristiane; Carmichael, Paul L; Casey, Warren; Coecke, Sandra; Curren, Rodger; Desprez, Bertrand; Eskes, Chantra; Griesinger, Claudius; Guo, Jiabin; Hill, Erin; Roi, Annett Janusch; Kojima, Hajime; Li, Jin; Lim, Chae Hyung; Moura, Wlamir; Nishikawa, Akiyoshi; Park, HyeKyung; Peng, Shuangqing; Presgrave, Octavio; Singer, Tim; Sohn, Soo Jung; Westmoreland, Carl; Whelan, Maurice; Yang, Xingfen; Yang, Ying; Zuang, Valérie
The development and validation of scientific alternatives to animal testing is important not only from an ethical perspective (implementation of 3Rs), but also to improve safety assessment decision making with the use of mechanistic information of higher relevance to humans. To be effective in these efforts, it is however imperative that validation centres, industry, regulatory bodies, academia and other interested parties ensure strong international cooperation, cross-sector collaboration and intense communication in the design, execution, and peer review of validation studies. Such an approach is critical to achieve harmonized and more transparent approaches to method validation, peer-review and recommendation, which will ultimately expedite the international acceptance of valid alternative methods or strategies by regulatory authorities and their implementation and use by stakeholders. It also allows greater efficiency and effectiveness by avoiding duplication of effort and leveraging limited resources. With a view to achieving these goals, the International Cooperation on Alternative Test Methods (ICATM) was established in 2009 by validation centres from Europe, USA, Canada and Japan. ICATM was joined by Korea in 2011 and currently also counts Brazil and China as observers. This chapter describes the existing differences across world regions and major efforts carried out for achieving consistent international cooperation and harmonization in the validation and adoption of alternative approaches to animal testing.
2014-01-01
Background: Health impairments can result in disability and changed work productivity, imposing considerable costs for the employee, employer and society as a whole. A large number of instruments exist to measure health-related productivity changes; however, their methodological quality remains unclear. This systematic review critically appraised the measurement properties of generic self-reported instruments that measure health-related productivity changes to recommend appropriate instruments for use in occupational and economic health practice. Methods: PubMed, PsycINFO, Econlit and Embase were systematically searched for studies in which: (i) instruments measured health-related productivity changes; (ii) the aim was to evaluate instrument measurement properties; (iii) instruments were generic; (iv) ratings were self-reported; (v) full-texts were available. Next, methodological quality appraisal was based on COSMIN elements: (i) internal consistency; (ii) reliability; (iii) measurement error; (iv) content validity; (v) structural validity; (vi) hypotheses testing; (vii) cross-cultural validity; (viii) criterion validity; and (ix) responsiveness. Recommendations are based on evidence syntheses. Results: This review included 25 articles assessing the reliability, validity and responsiveness of 15 different generic self-reported instruments measuring health-related productivity changes. Most studies evaluated criterion validity, none evaluated cross-cultural validity and information on measurement error is lacking. The Work Limitation Questionnaire (WLQ) was most frequently evaluated, with moderate and strong positive evidence for content and structural validity, respectively, and negative evidence for reliability, hypothesis testing and responsiveness. Less frequently evaluated, the Stanford Presenteeism Scale (SPS) showed strong positive evidence for internal consistency and structural validity, and moderate positive evidence for hypotheses testing and criterion validity. The Productivity and Disease Questionnaire (PRODISQ) yielded strong positive evidence for content validity; evidence for other properties is lacking. The other instruments resulted in mostly fair-to-poor quality ratings with limited evidence. Conclusions: Decisions based on the content of the instrument, usage purpose, target country and population, and available evidence are recommended. Until high-quality studies are in place to accurately assess the measurement properties of the currently available instruments, the WLQ and, in a Dutch context, the PRODISQ are cautiously preferred based on its strong positive evidence for content validity. Based on its strong positive evidence for internal consistency and structural validity, the SPS is cautiously recommended. PMID:24495301
The cross-cultural adaptation of the DASH questionnaire in Thai (DASH-TH).
Tongprasert, Siam; Rapipong, Jeeranan; Buntragulpoontawee, Montana
2014-01-01
Clinical measurement. Currently there are no self-report questionnaires in Thai to evaluate disability levels in patients suffering from upper extremity musculoskeletal disorders. To translate and cross-culturally adapt the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire into a Thai version and to evaluate the content validity, construct validity and internal consistency of the questionnaire. The DASH-TH was produced by following cross-cultural adaptation guidelines stated by the Institute for Work and Health (IWH). Forty Thai patients with arm, shoulder or hand problems participated in field testing of the questionnaire. Content validity was determined by obtaining the item-objective congruence (IOC) value for each questionnaire item. Correlation between the DASH-TH score and a numeric rating scale was used to assess construct validity. Internal consistency of the DASH-TH was measured using Cronbach's alpha coefficient. Forty patients (14 males, 26 females) with arm, shoulder or hand problems enrolled in the present study. The average age of patients was 44.8 years. The index of item-objective congruence (IOC) of each item ranged from 0.7 to 1.0. The Cronbach's alpha coefficient of the questionnaire was 0.938. There was no correlation between the DASH-TH score and the numeric rating scale. The DASH-TH has high content validity and internal consistency. Level of evidence: N/A. Copyright © 2014 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Gunaydin, Gurkan; Citaker, Seyit; Meray, Jale; Cobanoglu, Gamze; Gunaydin, Ozge Ece; Hazar Kanik, Zeynep
2016-11-01
Validation of a self-report questionnaire. The purpose of this study was to investigate the adaptation, validity, and reliability of the Turkish version of the Bournemouth Questionnaire. Low back pain is one of the most frequent disorders leading to activity limitation. It affects most people at some point in their lives. Precise administration of assessment questionnaires is essential for evaluating patients' functional abilities and choosing a successful therapy. One hundred ten patients with chronic low back pain were included in the present study. To assess reliability, test-retest and internal consistency analyses were applied. The results of the test-retest analysis were assessed using the intraclass correlation coefficient (95% confidence interval). For internal consistency, Cronbach's alpha was calculated. Validity of the questionnaire was assessed in terms of construct validity. For construct validity, factor analysis and convergent validity were tested. For convergent validity, total scores on the Bournemouth Questionnaire were compared with total scores on the Quebec Back Pain Disability Scale and the Roland Morris Disability Questionnaire using Pearson correlation coefficients. Cronbach's alpha was 0.914, showing that the questionnaire has high internal consistency. Test-retest coefficients ranged from 0.851 to 0.927, showing that test-retest results are highly correlated. Factor analysis indicated that the questionnaire has one factor. The Pearson correlation coefficient of the Bournemouth Questionnaire was 0.703 with the Roland Morris Disability Questionnaire and 0.659 with the Quebec Back Pain Disability Scale. These results showed that the Bournemouth Questionnaire is very well correlated with the Roland Morris Disability Questionnaire and the Quebec Back Pain Disability Scale. The Turkish version of the Bournemouth Questionnaire is valid and reliable. Level of evidence: 3.
Kitada, Masako; Musashi, Manabu; Kano, Masato
2011-08-01
To examine the reliability and validity of the Kano Test for Social Nicotine Dependence (KTSND), a scale assessing the psychosocial acceptability of smoking, and to develop a new version if the validity or reliability of the KTSND was not acceptable. We carried out a self-administered cross-sectional survey of undergraduate university students. The participants completed the KTSND and three supplementary questions on attitudes toward tobacco control policies and smoking status. Among daily smokers, we examined the relationship between the KTSND and the Fagerström Test for Nicotine Dependence (FTND). In each study, we examined test-retest reliability and construct validity, discriminant and convergent validity, and factor validity. Although the KTSND had high internal consistency (Cronbach's α 0.82) and high test-retest reliability (r=0.72), the results of factor analysis were unacceptable; we expected three factors to be extracted, but only two factors, "Overestimate of smoking usefulness" and "Allege smoking as a taste and/or culture," were extracted. Using the Kano's Test for Assessing Acceptability of Smoking (KTAAS), a new version of the KTSND in which one question was replaced with another, a third factor, "Neglect of harm of tobacco smoking," was extracted in addition to the above-mentioned two. The KTAAS also had both high internal consistency (Cronbach's alpha 0.82) and test-retest reliability (r=0.66). Overall, the KTSND and KTAAS scores differed according to smoking status, and the nonsmokers' scores were the lowest. The KTSND is a popular questionnaire in Japan; however, its validity assessed using factor analysis was not acceptable, while the KTAAS had sufficient reliability and validity and might assess the cognition and attitude affirming or accepting tobacco smoking among university students.
A Metric-Based Validation Process to Assess the Realism of Synthetic Power Grids
DOE Office of Scientific and Technical Information (OSTI.GOV)
Birchfield, Adam; Schweitzer, Eran; Athari, Mir
Public power system test cases that are of high quality benefit the power systems research community with expanded resources for testing, demonstrating, and cross-validating new innovations. Building synthetic grid models for this purpose is a relatively new problem, for which a challenge is to show that created cases are sufficiently realistic. This paper puts forth a validation process based on a set of metrics observed from actual power system cases. These metrics follow the structure, proportions, and parameters of key power system elements, which can be used in assessing and validating the quality of synthetic power grids. Though wide diversity exists in the characteristics of power systems, the paper focuses on an initial set of common quantitative metrics to capture the distribution of typical values from real power systems. The process is applied to two new public test cases, which are shown to meet the criteria specified in the metrics of this paper.
A Metric-Based Validation Process to Assess the Realism of Synthetic Power Grids
Birchfield, Adam; Schweitzer, Eran; Athari, Mir; ...
2017-08-19
Public power system test cases that are of high quality benefit the power systems research community with expanded resources for testing, demonstrating, and cross-validating new innovations. Building synthetic grid models for this purpose is a relatively new problem, for which a challenge is to show that created cases are sufficiently realistic. This paper puts forth a validation process based on a set of metrics observed from actual power system cases. These metrics follow the structure, proportions, and parameters of key power system elements, which can be used in assessing and validating the quality of synthetic power grids. Though wide diversity exists in the characteristics of power systems, the paper focuses on an initial set of common quantitative metrics to capture the distribution of typical values from real power systems. The process is applied to two new public test cases, which are shown to meet the criteria specified in the metrics of this paper.
Rodrigues, Marcelo F; Michel-Crosato, Edgard; Cardoso, Jefferson R; Traebert, Jefferson
2009-06-01
Cross-cultural translation and psychometric testing. To translate and cross-culturally adapt the Quebec Back Pain Disability Scale (QDS) to Brazilian Portuguese and to examine its validity and reliability. Current literature shows the need to adopt reliable and internationally standardized methods for the analysis of low back pain. To our knowledge, this specific questionnaire has not been translated and validated for Portuguese-speaking patients. The translation and cross-cultural adaptation of the QDS were developed in agreement with internationally recommended methodology, and the resulting product was evaluated in this study with 54 consecutive patients. Internal consistency was obtained through Cronbach's alpha; reliability was estimated through the intraclass correlation coefficient and the Bland and Altman agreement (d = mean difference). Validity was determined by correlating the scores of the Brazil-QDS with the Brazilian version of the Roland-Morris Questionnaire and Visual Analogue Pain Scale by means of the Spearman rank correlation coefficient. The internal consistency obtained was excellent (Cronbach's alpha = 0.97). Intraobserver and interobserver reliability were considered strong (ICC = 0.93, d = 0.68 and ICC = 0.96, d = 0.57, respectively). The correlation with the Brazilian Roland-Morris Questionnaire and with the Visual Analogue Scale was high (r = 0.857; r = 0.758, respectively). The data showed that the process of translation and cross-cultural adaptation was successful and that the adapted instrument demonstrated excellent psychometric properties.
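The Bland and Altman agreement mentioned in the record above summarizes paired measurements by their mean difference d and limits of agreement. A minimal sketch follows, assuming the usual limits d ± 1.96 · SD of the differences; the function name `bland_altman` and the paired scores are hypothetical and are not the study's data.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (mean difference, lower limit, upper limit) for paired scores.

    Limits of agreement are assumed to be d +/- 1.96 * SD of the differences.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long lists with at least 2 pairs")
    diffs = [a - b for a, b in zip(first, second)]
    d = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the paired differences
    return d, d - 1.96 * sd, d + 1.96 * sd


# Made-up test and retest totals for four patients (illustrative only).
test_scores = [10, 12, 14, 16]
retest_scores = [9, 12, 13, 17]
d, lower, upper = bland_altman(test_scores, retest_scores)
print(f"d = {d:.2f}, limits of agreement = [{lower:.2f}, {upper:.2f}]")
```

A mean difference near zero with narrow limits suggests that two administrations (or two raters) give interchangeable scores, which complements a correlation-type index such as the ICC.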
Tomaschewski-Barlem, Jamila Geri; Lunardi, Valéria Lerch; Barlem, Edison Luiz Devos; da Silveira, Rosemary Silva; Dalmolin, Graziele de Lima; Ramos, Aline Marcelino
2015-01-01
Objective: to adapt culturally and validate the Protective Nursing Advocacy Scale for Brazilian nurses. Method: methodological study carried out with 153 nurses from two hospitals in the South region of Brazil, one public and the other philanthropic. The cross-cultural adaptation of the Protective Nursing Advocacy Scale was performed according to international standards, and its validation was carried out for use in the Brazilian context, by means of factor analysis and Cronbach's alpha as measure of internal consistency. Results: by means of evaluation by a committee of experts and application of pre-test, face validity and content validity of the instrument were considered satisfactory. From the factor analysis, five constructs were identified: negative implications of the advocacy practice, advocacy actions, facilitators of the advocacy practice, perceptions that favor practice advocacy and barriers to advocacy practice. The instrument showed satisfactory internal consistency, with Cronbach's alpha values ranging from 0.70 to 0.87. Conclusion: it was concluded that the Protective Nursing Advocacy Scale - Brazilian version, is a valid and reliable instrument for use in the evaluation of beliefs and actions of health advocacy, performed by Brazilian nurses in their professional practice environment. PMID:26444169
Kyrölä, Kati; Järvenpää, Salme; Ylinen, Jari; Mecklin, Jukka-Pekka; Repo, Jussi Petteri; Häkkinen, Arja
2017-06-15
A prospective clinical study to test and adapt a Finnish version of the Scoliosis Research Society 30 (SRS-30) questionnaire. The aim of this study was to perform cross-cultural adaptation and evaluate the validity of the adapted Finnish version of the SRS-30 questionnaire. The SRS-30 questionnaire has proved to be a valid instrument in evaluating health-related quality of life (HRQoL) in adolescent and adult population with spine deformities in the United States. Multinational availability requires cross-cultural and linguistic adaptation and validation of the instrument. The SRS-30 was translated into Finnish using accepted methods for translation of quality-of-life questionnaires. A total of 274 adult patients with degenerative radiographic sagittal spinal disorder answered the questionnaire with sociodemographic data, RAND 36-item health survey questionnaire (RAND Corp. Health, Santa Monica, CA, US), Oswestry disability index, DEPS depression scale, and Visual Analog Scale (VAS) back and leg pain scales within 2 weeks' interval. The cohort included patients with and without previous spine surgery. Internal consistency and validity were tested with Cronbach α, intraclass correlation (ICC), standard error of measurement, and Spearman correlation coefficient with 95% confidence intervals (CIs). The internal consistency of SRS-30 was good in both surgery and nonsurgery groups, with Cronbach α 0.853 (95% CI, 0.670 to 0.960) and 0.885 (95% CI, 0.854 to 0.911), respectively. The test-retest reproducibility ICC of the SRS-30 total and subscore domains of patients with stable symptoms was 0.905 (95% CI, 0.870-0.930) and 0.904 (95% CI, 0.871-0.929), respectively. The questionnaire had discriminative validity in the pain, self-image, and satisfaction with management domains compared with other questionnaires. The SRS-30 questionnaire proved to be valid and applicable in evaluating HRQoL in Finnish adult spinal deformity patients. 
It has two domains related to deformity that are not covered by other generally used questionnaires. Level of Evidence: 3.
Automated smartphone audiometry: Validation of a word recognition test app.
Dewyer, Nicholas A; Jiradejvong, Patpong; Henderson Sabes, Jennifer; Limb, Charles J
2018-03-01
Develop and validate an automated smartphone word recognition test. Cross-sectional case-control diagnostic test comparison. An automated word recognition test was developed as an app for a smartphone with earphones. English-speaking adults with recent audiograms and various levels of hearing loss were recruited from an audiology clinic and were administered the smartphone word recognition test. Word recognition scores determined by the smartphone app and the gold standard speech audiometry test performed by an audiologist were compared. Test scores for 37 ears were analyzed. Word recognition scores determined by the smartphone app and audiologist testing were in agreement, with 86% of the data points within a clinically acceptable margin of error and a linear correlation value between test scores of 0.89. The WordRec automated smartphone app accurately determines word recognition scores. Level of Evidence: 3b. Laryngoscope, 128:707-712, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
NASA Technical Reports Server (NTRS)
Tuhela-Reuning, S. R.; Walton, E. K.
1991-01-01
The design, construction, and testing of a low cost, planar scanning system to be used in a compact range environment for bistatic radar cross-section (bistatic RCS) measurement data are discussed. This scanning system is similar to structures used for measuring near-field antenna patterns. A synthetic aperture technique is used for plane wave reception. System testing entailed comparison of measured and theoretical bistatic RCS of a sphere and a right circular cylinder. Bistatic scattering analysis of the ogival target support, target and pedestal interactions, and compact range room was necessary to determine measurement validity.
2013-01-01
Background Understanding children’s physical activity motivation, its antecedents and associations with behavior is important and can be advanced by using self-determination theory. However, research among youth is largely restricted to adolescents and studies of motivation within certain contexts (e.g., physical education). There are no measures of self-determination theory constructs (physical activity motivation or psychological need satisfaction) for use among children and no previous studies have tested a self-determination theory-based model of children’s physical activity motivation. The purpose of this study was to test the reliability and validity of scores derived from scales adapted to measure self-determination theory constructs among children and test a motivational model predicting accelerometer-derived physical activity. Methods Cross-sectional data from 462 children aged 7 to 11 years from 20 primary schools in Bristol, UK were analysed. Confirmatory factor analysis was used to examine the construct validity of adapted behavioral regulation and psychological need satisfaction scales. Structural equation modelling was used to test cross-sectional associations between psychological need satisfaction, motivation types and physical activity assessed by accelerometer. Results The construct validity and reliability of the motivation and psychological need satisfaction measures were supported. Structural equation modelling provided evidence for a motivational model in which psychological need satisfaction was positively associated with intrinsic and identified motivation types and intrinsic motivation was positively associated with children’s minutes in moderate-to-vigorous physical activity. Conclusions The study provides evidence for the psychometric properties of measures of motivation aligned with self-determination theory among children. 
Children’s motivation that is based on enjoyment and inherent satisfaction of physical activity is associated with their objectively-assessed physical activity and such motivation is positively associated with perceptions of psychological need satisfaction. These psychological factors represent potential malleable targets for interventions to increase children’s physical activity. PMID:24067078
Support vector machines and generalisation in HEP
NASA Astrophysics Data System (ADS)
Bevan, Adrian; Gamboa Goñi, Rodrigo; Hays, Jon; Stevenson, Tom
2017-10-01
We review the concept of Support Vector Machines (SVMs) and discuss examples of their use in a number of scenarios. Several SVM implementations have been used in HEP and we exemplify this algorithm using the Toolkit for Multivariate Analysis (TMVA) implementation. We discuss examples relevant to HEP including background suppression for H → τ⁺τ⁻ at the LHC with several different kernel functions. Performance benchmarking leads to the issue of generalisation of hyper-parameter selection. The avoidance of fine tuning (overtraining or overfitting) in MVA hyper-parameter optimisation, i.e. the ability to ensure generalised performance of an MVA that is independent of the training, validation and test samples, is of utmost importance. We discuss this issue and compare and contrast performance of hold-out and k-fold cross-validation. We have extended the SVM functionality and introduced tools to facilitate cross-validation in TMVA and present results based on these improvements.
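The difference between hold-out and k-fold cross-validation mentioned in this abstract can be sketched in a few lines. The example below is not the TMVA implementation and does not use an SVM; it substitutes a one-dimensional nearest-centroid classifier and synthetic data so that it runs with the standard library alone. All function names, the data and the parameter values are assumptions made for illustration.

```python
import random


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and deal them into k near-equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def nearest_centroid_accuracy(train, test):
    """Stand-in classifier: assign each test point to the closer class mean.

    train/test are lists of (x, label) with a single feature x and labels 0/1.
    """
    means = {}
    for label in (0, 1):
        xs = [x for x, y in train if y == label]
        means[label] = sum(xs) / len(xs)
    correct = sum(
        1 for x, y in test
        if min(means, key=lambda c: abs(x - means[c])) == y
    )
    return correct / len(test)


def hold_out(data, test_fraction=0.3, seed=0):
    """Single split: one accuracy estimate from one held-out sample."""
    idx = list(range(len(data)))
    random.Random(seed).shuffle(idx)
    n_test = int(len(data) * test_fraction)
    test = [data[j] for j in idx[:n_test]]
    train = [data[j] for j in idx[n_test:]]
    return nearest_centroid_accuracy(train, test)


def cross_validate(data, k, seed=0):
    """k-fold: every point is used for testing exactly once."""
    scores = []
    for test_idx in k_fold_indices(len(data), k, seed):
        held_out = set(test_idx)
        train = [d for j, d in enumerate(data) if j not in held_out]
        test = [data[j] for j in test_idx]
        scores.append(nearest_centroid_accuracy(train, test))
    return scores


# Synthetic, illustrative data: two overlapping Gaussian classes.
rng = random.Random(1)
data = [(rng.gauss(0.0, 1.0), 0) for _ in range(20)]
data += [(rng.gauss(2.0, 1.0), 1) for _ in range(20)]

fold_scores = cross_validate(data, k=5)
print("hold-out accuracy:", hold_out(data))
print("5-fold accuracies:", fold_scores)
print("5-fold mean:", sum(fold_scores) / len(fold_scores))
```

The hold-out estimate depends on a single random split, whereas the k-fold procedure yields k estimates whose spread gives a rough indication of how sensitive the result is to the choice of samples.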
The Model Analyst’s Toolkit: Scientific Model Development, Analysis, and Validation
2015-02-20
being integrated within MAT, including Granger causality. Granger causality tests whether a data series helps when predicting future values of another...relations by econometric models and cross-spectral methods. Econometrica: Journal of the Econometric Society, 424-438. Granger, C. W. (1980). Testing ... testing dataset. This effort is described in Section 3.2. 3.1. Improvements in Granger Causality User Interface Various metrics of causality are
Kuzmanova, Rumyana; Stefanova, Irina; Velcheva, Irena; Stambolieva, Katerina
2014-10-01
Adverse effects (AEs) of antiepileptic drugs (AEDs) affect the quality of life of patients with epilepsy and their outcomes. There are no questionnaires or studies on the reliability and validity of instruments measuring AEs of AEDs in patients with epilepsy in the Bulgarian language. The aim of the present study was the translation, cross-cultural adaptation, and validation of the LAEP in the Bulgarian language, in order to provide a reliable instrument for the clinical monitoring of patients with epilepsy in the Bulgarian-speaking population. One hundred thirty-one patients (57 men and 74 women, mean age: 40.13±13.37 years) took part in the investigation. The internal consistency and test-retest reliability were tested by Cronbach's α and ICC estimations. The convergent construct validity was tested by estimating the correlation of the LAEP-BG with the QOLIE-89 and the discriminant validity by evaluating the difference between LAEP-BG scores and clinical parameters such as the type of epilepsy using Kruskal-Wallis ANOVA. The LAEP-BG showed high internal consistency and reliability. The Cronbach's α of the total scale was 0.86. No significant differences between the Cronbach's α coefficients of the total LAEP-BG and original English, Chinese, Spanish, Korean, and Portuguese-Brazilian versions of the questionnaire were observed. The ICCs, which evaluate the test-retest reliability, were higher than the recommended value of 0.75 and indicated strong positive correlations between the first and second examinations. The two subscales that we propose for the LAEP-BG, "Neurological and psychiatric side effects" and "Non neurological side effects", showed good internal consistency (Cronbach's α of 0.85 and 0.71, respectively). 
The LAEP-BG scores significantly correlated with other questionnaires such as the Quality of Life in Epilepsy Inventory-89 (QOLIE-89) and showed a good discriminative validity between groups with different levels of self-assessed AEs of AEDs. The Bulgarian version of the Liverpool Adverse Event Profile (LAEP) is a reliable and valid tool in assessing the patient-reported AEs of AEDs and their impact on the patient's outcome. Copyright © 2014 Elsevier Inc. All rights reserved.
Numerical and experimental analysis of an in-scale masonry cross-vault prototype up to failure
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rossi, Michela; Calderini, Chiara; Lagomarsino, Sergio
2015-12-31
A heterogeneous full 3D non-linear FE approach is validated against experimental results obtained on an in-scale masonry cross vault assembled with dry joints, and subjected to various loading conditions consisting of imposed displacement combinations to the abutments. The FE model relies on a discretization of the blocks by means of few rigid-infinitely resistant parallelepiped elements interacting by means of planar four-noded interfaces, where all the deformation (elastic and inelastic) occurs. The investigated response mechanisms of the vault are the shear in-plane distortion and the longitudinal opening and closing mechanism at the abutments. After the validation of the approach on the experimentally tested cross-vault, a sensitivity analysis is conducted on the same geometry, but in real scale, varying the mechanical properties of the mortar joints, in order to furnish useful hints for safety assessment, especially in presence of seismic action.
NASA Astrophysics Data System (ADS)
Bak, S.; Smith, J. M.; Hesser, T.; Bryant, M. A.
2016-12-01
Near-coast wave models are generally validated with relatively small data sets that focus on analytical solutions, specialized experiments, or intense storms. Prior studies have compiled testbeds that include a few dozen experiments or storms to validate models (e.g., Ris et al. 2002), but few examples exist that allow for continued model evaluation in the nearshore and surf-zone in near-realtime. The limited nature of these validation sets is driven by a lack of high spatial and temporal resolution in-situ wave measurements and the difficulty in maintaining these instruments on the active profile over long periods of time. The US Army Corps of Engineers Field Research Facility (FRF) has initiated a Coastal Model Test-Bed (CMTB), which is an automated system that continually validates wave models (with morphological and circulation models to follow) utilizing the rich data set of oceanographic and bathymetric measurements collected at the FRF. The FRF's cross-shore wave array provides wave measurements along a cross-shore profile from 26 m of water depth to the shoreline, utilizing various instruments including wave-rider buoys, AWACs, aquadopps, pressure gauges, and a dune-mounted lidar (Brodie et al. 2015). This work uses the CMTB to evaluate the performance of a phase-averaged numerical wave model, STWAVE (Smith 2007, Massey et al. 2011) over the course of a year at the FRF in Duck, NC. Additionally, from the BathyDuck Experiment in October 2015, the CMTB was used to determine the impact of applying the depth boundary condition for the model from monthly acoustic bathymetric surveys in comparison to hourly estimates using a video-based inversion method (e.g., cBathy, Holman et al. 2013). The modeled wave parameters using both bathymetric boundary conditions are evaluated using the FRF's cross-shore wave array and two additional cross-shore arrays of wave measurements in 2 to 4 m water depth from BathyDuck in Fall, 2015.
Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.
Hawi, Nazir S; Samaha, Maya
2017-04-01
In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included Internet Gaming Disorder (IGD) in its Section 3 as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration of gameplay on weekdays and weekends revealed significant positive relationships that supported criterion-related validity. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.
Validation of the Cross-Cultural Alcoholism Screening Test (CCAST).
Gorenc, K D; Peredo, S; Pacurucu, S; Llanos, R; Vincente, B; López, R; Abreu, L F; Paez, E
1999-01-01
When screening instruments are used in the assessment and diagnosis of alcoholism in individuals from different ethnicities, some cultural variables based on norms and societal acceptance of drinking behavior can play an important role in determining the outcome. The accepted diagnostic criteria of current market testing are based on Western standards. In this study, the Munich Alcoholism Test (31 items) was the base instrument applied to subjects from several Hispanic-American countries (Bolivia, Chile, Ecuador, Mexico, and Peru). After the sample was submitted to several statistical procedures, these 31 items were reduced to a culture-free, 31-item test named the Cross-Cultural Alcohol Screening Test (CCAST). The results of this Hispanic-American sample (n = 2,107) empirically demonstrated that CCAST measures alcoholism with an adequate degree of accuracy when compared to other available cross-cultural tests. CCAST is useful in the diagnosis of alcoholism in Spanish-speaking immigrants living in countries where English is spoken. CCAST can be used in general hospitals, psychiatric wards, emergency services and police stations. The test can be useful for other professionals, such as psychological consultants, researchers, and those conducting expertise appraisal.
Naghdi, Soofia; Ansari, Noureddin Nakhostin; Raji, Parvin; Shamili, Aryan; Amini, Malek; Hasson, Scott
2016-01-01
To translate and cross-culturally adapt the Functional Independence Measure (FIM) into the Persian language and to test the reliability and validity of the Persian FIM (PFIM) in patients with stroke. In this cross-sectional study carried out in an outpatient stroke rehabilitation center, 40 patients with stroke (mean age 60 years) participated. A standard forward-backward translation method and expert panel validation were followed to develop the PFIM. Two experienced occupational therapists (OTs) assessed the patients independently in all items of the PFIM in a single session for inter-rater reliability. One of the OTs reassessed the patients after 1 week for intra-rater reliability. There were no floor or ceiling effects for the PFIM. Excellent inter-rater and intra-rater reliability was noted for the PFIM total score, motor and cognitive subscales (ICC(agreement) = 0.88-0.98). According to the Bland-Altman agreement analysis, there was no systematic bias between raters and within raters. The internal consistency of the PFIM, measured by Cronbach's alpha, ranged from 0.70 to 0.96. The principal component analysis with varimax rotation indicated a three-factor structure: (1) self-care and mobility; (2) sphincter control and (3) cognitive that jointly accounted for 74.8% of the total variance. Construct validity was supported by a significant Pearson correlation between the PFIM and the Persian Barthel Index (r = 0.95; p < 0.001). The PFIM is a highly reliable and valid instrument for measuring functional status of Persian patients with stroke. The Functional Independence Measure (FIM) is an outcome measure for disability based on the International Classification of Functioning, Disability and Health (ICF). The FIM was cross-culturally adapted and validated in the Persian language. The Persian version of the FIM (PFIM) is reliable and valid for assessing functional status of patients with stroke. 
The PFIM can be used in Persian speaking countries to assess the limitations in activities of daily living of patients with stroke.
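The Bland-Altman agreement analysis cited in this abstract checks for systematic bias between two sets of ratings. A minimal sketch of the usual calculation follows: the mean difference d and the 95% limits of agreement d ± 1.96 SD of the differences. The abstract does not give its exact procedure, so the function name and the example ratings are hypothetical.

```python
from statistics import mean, stdev


def bland_altman(ratings_a, ratings_b):
    """Return (mean difference, lower limit, upper limit) for paired ratings.

    The limits are the conventional 95% limits of agreement,
    mean difference +/- 1.96 * sample SD of the paired differences.
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("ratings must be paired")
    diffs = [a - b for a, b in zip(ratings_a, ratings_b)]
    d = mean(diffs)
    sd = stdev(diffs)
    return d, d - 1.96 * sd, d + 1.96 * sd


# Illustrative total scores from two hypothetical raters for six patients.
rater_1 = [98, 110, 87, 120, 105, 93]
rater_2 = [97, 112, 88, 119, 104, 95]
bias, lower, upper = bland_altman(rater_1, rater_2)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

A mean difference close to zero, with limits of agreement that are narrow relative to the scale range, is what "no systematic bias" typically refers to.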
Jeong, Yunwha; Law, Mary; Stratford, Paul; DeMatteo, Carol; Kim, Hwan
2016-11-01
To develop the Korean version of the Participation and Environment Measure for Children and Youth (KPEM-CY) and examine its psychometric properties. The PEM-CY was cross-culturally translated into Korean using a specific guideline: pre-review of participation items, forward/backward translation, expert committee review, pre-test of the KPEM-CY and final review. To establish internal consistency, test-retest reliability and construct validity of the KPEM-CY, 80 parents of children with disabilities aged 5-13 years were recruited in South Korea. Across the home, school and community settings, 76% of participation items and 29% of environment items were revised to improve their fit with Korean culture. Internal consistency was moderate to excellent (0.67-0.92) for different summary scores. Test-retest reliability was excellent (>0.75) in the summary scores of participation frequency and extent of involvement across the three settings and moderate to excellent (0.53-0.95) in all summary scores at home. Child's age, type of school and annual income were the factors that significantly influenced specific dimensions of participation and environment across all settings. Results indicated that the KPEM-CY is equivalent to the original PEM-CY and has initial evidence of reliability and validity for use with Korean children with disabilities. Implications for rehabilitation Because 'participation' is a key outcome of the rehabilitation, measuring comprehensive participation of children with disabilities is necessary. The PEM-CY is a parent-report survey measure to assess comprehensive participation of children and youth and environment, which affect their participation, at home, school and in the community. A cross-cultural adaptation process is mandatory to adapt the measurement tool to a new culture or country. The Korean PEM-CY has both reliability and validity and can therefore generate useful clinical data for Korean children with disabilities.
Mulero-Portela, Ana L.; Colón-Santaella, Carmen L.; Cruz-Gomez, Cynthia
2010-01-01
The purpose of this study was to perform a cross-cultural adaptation of the Disability of Arm, Shoulder, and Hand (DASH) questionnaire to Spanish for Puerto Rico. Five steps were followed for the cross-cultural adaptation: forward translations into Spanish for Puerto Rico, synthesis of the translations, back translations into English, revision by an expert committee, and field test of the prefinal version. Psychometric characteristics of reliability and construct validity were evaluated for the final version. Internal consistency of the final version was high (Cronbach's α = 0.97) and item-to-total correlations were moderate (range from 0.44 to 0.85). Construct validity was evaluated by correlating the DASH with the scales of the Functional Assessment of Cancer Therapy - Breast. Fair to moderate correlations found in this study between the DASH and most scales of the Functional Assessment of Cancer Therapy - Breast support the construct validity of the Puerto Rico-Spanish DASH. The final version of the questionnaire was revised and approved by the Institute for Work and Health of Canada. Revisions to the original DASH English version are recommended. This version of the DASH is valid and reliable, and it can be used to evaluate outcomes in both clinical and research settings. PMID:19901616
Rahimi Kian, Fatemeh; Zandi, Afsaneh; Omani Samani, Reza; Maroufizadeh, Saman; Mehran, Abbas
2016-01-01
Background Surrogacy is one of the most challenging infertility treatments engaging ethical, psychological and social issues. Attitude surveys play an important role in disclosing the various aspects of surrogacy, in helping to address legislative gaps and ambiguities, and in converting the controversial dimensions surrounding surrogacy into a normative concept that eliminates stigma. The aim of this study is to develop a comprehensive scale for gestational surrogacy attitudes. Materials and Methods The development of the gestational surrogacy attitudes scale (GSAS) was based on a descriptive cross-sectional study and drew on a rich data pool gathered from literature reviews, a qualitative pilot study on 15 infertile couples (n=30), an expert advisory panel (EAP) consisting of 20 members, and an assessment of content validity through qualitative and quantitative study by means of the content validity ratio (CVR) and content validity index (CVI). Internal consistency using Cronbach’s alpha and test-retest reliability using the intraclass correlation coefficient (ICC) were also evaluated. Application of the GSAS was tested in a cross-sectional study that was conducted on 200 infertile couples (n=400) at Royan Institute, Tehran, Iran, during 2014. Results The final version of the GSAS had 30 items within five subscales including "acceptance of surrogacy", "Surrogacy and public attitudes", "Child born through surrogacy", "Surrogate mother", and "Intentional attitude and surrogacy future attempt". Content validity was represented with values of CVR=0.73 and CVI=0.98. Cronbach’s alpha value was 0.91 for the overall scale, while the ICC value for test-retest responses was 0.89. Conclusion The results indicate an acceptable level of competency and capability of the GSAS; therefore, it seems to be an appropriate tool for the evaluation of gestational surrogacy attitudes in Iranian infertile couples. PMID:27123208
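The abstract reports a content validity ratio (CVR) and content validity index (CVI) without giving the formulas. A minimal sketch is shown below, assuming Lawshe's CVR, (n_e - N/2) / (N/2), and an item-level CVI defined as the share of experts rating an item 3 or 4 on a 4-point relevance scale. Both definitions are assumptions about the method, and the function names and example ratings are hypothetical.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: ranges from -1 (no expert) to +1 (all experts)."""
    if not 0 <= n_essential <= n_experts or n_experts == 0:
        raise ValueError("need 0 <= n_essential <= n_experts and n_experts > 0")
    half = n_experts / 2
    return (n_essential - half) / half


def item_cvi(ratings, relevant_min=3):
    """Item-level CVI: proportion of experts rating the item as relevant.

    ratings: one relevance rating per expert (assumed 1-4 scale);
    ratings >= relevant_min count as 'relevant'.
    """
    return sum(1 for r in ratings if r >= relevant_min) / len(ratings)


# Illustrative panel of 20 experts, 17 of whom judge an item essential.
print(content_validity_ratio(17, 20))
# Illustrative relevance ratings for one item from 10 experts.
print(item_cvi([4, 4, 3, 4, 2, 4, 3, 4, 4, 3]))
```

With a 20-member panel, as in the study, an item needs more than half of the experts to call it essential before its CVR becomes positive.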
Leung, Ying Ying; Lee, Weixian; Lui, Nai Lee; Rouse, Matthew; McKenna, Stephen P; Thumboo, Julian
2017-08-17
To cross-culturally adapt and validate the Singapore Chinese and Singapore English versions of the Ankylosing Spondylitis Quality of Life (ASQoL) scales. Translation of the ASQoL into Singapore Chinese and English was performed by professional and lay translation panels. Field-testing for face and content validity was performed by interviewing ten Chinese speaking and ten English speaking axial spondyloarthritis (AxSpA) patients. AxSpA patients (either Chinese or English speaking) were invited to take part in validation surveys. The Health Assessment Questionnaire (HAQ), Short Form Health Survey (SF-36), Bath Indices, and other measures of disease activity were used as comparator scales for convergent validity. A separate sample of AxSpA patients were invited to participate in a test-retest postal study, with 2 weeks between administrations. The cross-sectional study included 183 patients (77% males, 82% English speaking), with a mean (SD) age of 39.4 (13.7) years. The ASQoL had excellent internal consistency (Cronbach's alpha = 0.88), and correlated moderately with all the comparator scales. The ASQoL was able to distinguish between patients grouped by disease activity and perceived general health. The ASQoL fulfilled the Rasch model analysis for fit, reliability and unidimensionality requirements. No significant differential item functioning was noted for gender, age below or above 50 years, and language of administration. Test-retest reliability was good (r = 0.81). The ASQoL was adapted into Singapore Chinese and English language versions, and shown to be culturally relevant, valid and reliable when used with combined samples of AxSpA patients who speak either Chinese or English.
Flosadottir, Vala; Roos, Ewa M; Ageberg, Eva
2017-09-01
The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Cohort study (diagnosis); Level of evidence, 2. The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury.
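The standard error of measurement (SEM) and smallest detectable change (SDC) reported here are usually derived from the test-retest ICC. The abstract does not state its formulas, so the sketch below assumes the commonly used definitions: SEM = SD × sqrt(1 − ICC), SDC for an individual = 1.96 × sqrt(2) × SEM, and SDC for a group = individual SDC / sqrt(n). The function names and the input values are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the score standard deviation and the test-retest ICC."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc is expected to lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


def sdc_individual(sem):
    """Smallest change in one person's score that exceeds measurement error."""
    return 1.96 * math.sqrt(2.0) * sem


def sdc_group(sem, n):
    """Smallest detectable change in the mean score of a group of size n."""
    return sdc_individual(sem) / math.sqrt(n)


# Illustrative inputs only: SD of 5 points, ICC of 0.92, 100 participants.
sem = standard_error_of_measurement(sd=5.0, icc=0.92)
print(f"SEM={sem:.2f}")
print(f"SDC individual={sdc_individual(sem):.2f}")
print(f"SDC group={sdc_group(sem, 100):.2f}")
```

Because the group value divides by sqrt(n), it is always smaller than the individual value, which matches the pattern in the abstract (below 1 point for the group, below 5 points for an individual).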
Forecasting Advancement Rates to Petty Officer Third Class for U.S. Navy Hospital Corpsmen
2014-06-01
variable. c. Designation of Data Subsets for Cross-Validation In order to maintain the integrity of the analysis and test the fitted models’ predictive...two models, an H-L goodness-of-fit test is conducted on the 1,524 individual Sailors within the designated test data set; the results of which are...the total number of sea months, the proportion of vacancies to test takers, Armed Forces Qualification Test score, and performance mark average
Development and validation of the Child Oral Health Impact Profile - Preschool version.
Ruff, R R; Sischo, L; Chinn, C H; Broder, H L
2017-09-01
The Child Oral Health Impact Profile (COHIP) is a validated instrument created to measure the oral health-related quality of life of school-aged children. The purpose of this study was to develop and validate a preschool version of the COHIP (COHIP-PS) for children aged 2-5. The COHIP-PS was developed and validated using a multi-stage process consisting of item selection, face validity testing, item impact testing, reliability and validity testing, and factor analysis. A cross-sectional convenience sample of caregivers having children 2-5 years old from four groups completed item clarity and impact forms. Groups were recruited from pediatric health clinics or preschools/daycare centers, speech clinics, dental clinics, or cleft/craniofacial centers. Participants had a variety of oral health-related conditions, including caries, congenital orofacial anomalies, and speech/language deficiencies such as articulation and language disorders. COHIP-PS. The COHIP-PS was found to have acceptable internal validity (α = 0.71) and high test-retest reliability (0.87), though internal validity was below the accepted threshold for the community sample. While discriminant validity results indicated significant differences across study groups, the overall magnitude of differences was modest. Results from confirmatory factor analyses support the use of a four-factor model consisting of 11 items across oral health, functional well-being, social-emotional well-being, and self-image domains. Quality of life is an integral factor in understanding and assessing children's well-being. The COHIP-PS is a validated oral health-related quality of life measure for preschool children with cleft or other oral conditions. Copyright © 2017 Dennis Barber Ltd.
Devoogdt, Nele; De Groef, An; Hendrickx, Ad; Damstra, Robert; Christiaansen, Anke; Geraerts, Inge; Vervloesem, Nele; Vergote, Ignace; Van Kampen, Marijke
2014-05-01
Patients may develop primary (congenital) or secondary (acquired) lymphedema, causing significant physical and psychosocial problems. To plan treatment for lymphedema and monitor a patient's progress, swelling, and problems in functioning associated with lymphedema development should be assessed at baseline and follow-up. The purpose of this study was to investigate the reliability (test-retest, internal consistency, and measurement variability) and validity (content and construct) of data obtained with the Lymphoedema Functioning, Disability and Health Questionnaire for Lower Limb Lymphoedema (Lymph-ICF-LL). This was a multicenter, cross-sectional study. The Lymph-ICF-LL is a descriptive, evaluative tool containing 28 questions about impairments in function, activity limitations, and participation restrictions in patients with lower limb lymphedema. The questionnaire has 5 domains: physical function, mental function, general tasks/household activities, mobility activities, and life domains/social life. The reliability and validity of the Lymph-ICF-LL were examined in 30 participants with objective lower limb lymphedema. Intraclass correlation coefficients for test-retest reliability ranged from .69 to .94, and Cronbach alpha coefficients for internal consistency ranged from .82 to .97. Measurement variability was acceptable (standard error of measurement=5.9-12.6). Content validity was good because all questions were understandable for 93% of participants, the scoring system (visual analog scale) was clear, and the questionnaire was comprehensive for 90% of participants. Construct validity was good. All hypotheses for assessing convergent validity and divergent validity were accepted. The known-groups validity and responsiveness of the Dutch Lymph-ICF-LL and the cross-cultural validity of the English version of the Lymph-ICF-LL were not investigated. 
The Lymph-ICF-LL is a Dutch questionnaire with evidence of reliability and validity for assessing impairments in function, activity limitations, and participation restrictions in people with primary or secondary lower limb lymphedema.
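Several abstracts in this listing report Cronbach's alpha as their internal-consistency statistic. As a minimal sketch of how that coefficient is usually computed (the function name and the demo responses below are hypothetical and are not taken from any of the studies), the standard formula compares the sum of the item variances with the variance of the total score:

```python
import numpy as np


def cronbach_alpha(scores):
    """Cronbach's alpha for a (respondents x items) matrix of item scores."""
    scores = np.asarray(scores, dtype=float)
    n_items = scores.shape[1]
    # Sample variance of each item across respondents.
    item_variances = scores.var(axis=0, ddof=1)
    # Sample variance of each respondent's total score.
    total_variance = scores.sum(axis=1).var(ddof=1)
    return n_items / (n_items - 1) * (1.0 - item_variances.sum() / total_variance)


# Hypothetical responses: 5 respondents answering 4 items (illustrative only).
demo = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 5, 4],
]
print(round(cronbach_alpha(demo), 3))
```

Items that move together across respondents push the total-score variance well above the sum of the item variances, which is what drives alpha toward 1.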
The Chinese version of the Outcome Expectations for Exercise scale: validation study.
Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger
2011-06-01
Estimates of the reliability and validity of the English nine-item Outcome Expectations for Exercise (OEE) scale have been tested and found to be valid for use in various settings, particularly among older people, with good internal consistency and validity. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. 
Future testing of the OEE-C scale needs to be carried out to see whether these results are generalisable to older Chinese people living in urban areas. Copyright © 2010 Elsevier Ltd. All rights reserved.
Validity and test-retest reliability of the six-spot step test in persons after stroke.
Arvidsson Lindvall, Mialinn; Anderzén-Carlsson, Agneta; Appelros, Peter; Forsberg, Anette
2018-06-06
After stroke, asymmetric weight distribution is common with decreased balance control in standing and walking. The six-spot step test (SSST) includes a 5-m walk during which one leg shoves wooden blocks out of circles marked on the floor, thus assessing the ability to take load on each leg. The aim of the present study was to investigate the convergent and discriminant validity and test-retest reliability of the SSST in persons with stroke. Eighty-one participants were included. A cross-sectional study was performed, in which the SSST was conducted twice, 3-7 days apart. Validity was investigated using measures of dynamic balance and walking. Reliability was assessed using intraclass correlation coefficient, standard error of the measurement (SEM), and smallest real difference (SRD). The convergent validity was strong to moderate, and the test-retest reliability was good. The SEM% was 14.7%, and the SRD% was 40.8% based on the mean of four walks shoving twice with the paretic and twice with the non-paretic leg. Values on random measurement error were high, affecting the use of the SSST for follow-up evaluations, but the SSST can be a complementary measure of gait and balance.
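The SEM% and SRD% values in the abstract above are linked by a simple relation. The sketch below assumes the commonly used definition SRD = 1.96 × √2 × SEM; the abstract does not state which formula the authors applied, so this is an illustration rather than a reproduction of their analysis. Under that assumption, a SEM% of 14.7 gives an SRD% of about 40.7, close to the reported 40.8 (the small gap is plausibly due to rounding of the SEM).

```python
import math


def smallest_real_difference(sem, z=1.96):
    """SRD from SEM, assuming the common definition SRD = z * sqrt(2) * SEM.

    The sqrt(2) factor reflects that a change score combines the
    measurement error of two separate test occasions.
    """
    return z * math.sqrt(2.0) * sem


# SEM% as reported in the abstract; the result is an SRD%.
print(round(smallest_real_difference(14.7), 1))
```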
ERIC Educational Resources Information Center
Chu, Tsai-Ling; Lin, Wei-Wen
2013-01-01
The primary goal of our study was to investigate the importance of originality in divergent thinking (DT) tests and to determine whether originality is the best reflection of creativity. To accomplish this, we cross-validated the DT test and creative writing task rating by consensual assessment technique (CAT). Thirty-seven elementary school…
Munkácsy, Gyöngyi; Sztupinszki, Zsófia; Herman, Péter; Bán, Bence; Pénzváltó, Zsófia; Szarvas, Nóra; Győrffy, Balázs
2016-09-27
No independent cross-validation of success rate for studies utilizing small interfering RNA (siRNA) for gene silencing has been completed before. To assess the influence of experimental parameters like cell line, transfection technique, validation method, and type of control, we have to validate these in a large set of studies. We utilized gene chip data published for siRNA experiments to assess success rate and to compare methods used in these experiments. We searched NCBI GEO for samples with whole transcriptome analysis before and after gene silencing and evaluated the efficiency for the target and off-target genes using the array-based expression data. Wilcoxon signed-rank test was used to assess silencing efficacy and Kruskal-Wallis tests and Spearman rank correlation were used to evaluate study parameters. Altogether, 1,643 samples representing 429 experiments published in 207 studies were evaluated. The fold change (FC) of down-regulation of the target gene was above 0.7 in 18.5% and was above 0.5 in 38.7% of experiments. Silencing efficiency was lowest in MCF7 and highest in SW480 cells (FC = 0.59 and FC = 0.30, respectively, P = 9.3E-06). Studies utilizing Western blot for validation performed better than those with quantitative polymerase chain reaction (qPCR) or microarray (FC = 0.43, FC = 0.47, and FC = 0.55, respectively, P = 2.8E-04). There was no correlation between type of control, transfection method, publication year, and silencing efficiency. Although gene silencing is a robust feature successfully cross-validated in the majority of experiments, efficiency remained insufficient in a significant proportion of studies. Selection of cell line model and validation method had the highest influence on silencing proficiency.
Validation of the Headache Impact Test (HIT-6) in patients with chronic migraine.
Rendas-Baum, Regina; Yang, Min; Varon, Sepideh F; Bloudek, Lisa M; DeGryse, Ronald E; Kosinski, Mark
2014-08-01
The Headache Impact Test (HIT)-6 was developed and has been validated in patients with various types of headache. The objective of this study was to report the psychometric properties of the HIT-6 among patients with chronic migraine. Data came from two international, multicenter, randomized, double-blind, placebo-controlled clinical trials of chronic migraine patients (N = 1,384) undergoing prophylaxis therapy. Confirmatory factor analysis and differential item functioning (DIF) analysis were used to test the latent structure and cross-cultural comparability of the HIT-6. Reliability, construct validity, and responsiveness were assessed. Two sets of criterion groups were used: (1) 28-day headache frequency: <10, 10-14, and ≥15 days; (2) sample quartiles of the total cumulative hours of headache: <140, 140 to <280, 280 to <420, and ≥420 hours. Two sets of responsiveness categories were defined as reduction of <30%, 30% to <50%, or ≥50% in (1) number of headache days and (2) cumulative hours of headache. Measurement invariance tests supported the stability of the HIT-6 latent structure across studies. DIF analysis supported cross-cultural comparability. Good reliability was observed across studies (Cronbach's α: 0.75-0.92; intraclass correlation coefficient: 0.76-0.80). HIT-6 scores correlated strongly (-0.86 to -0.59) with scores of the Migraine-Specific Quality-of-Life Questionnaire. Analysis of variance indicated that HIT-6 scores discriminated across both types of criterion groups (P<0.001), across studies and time points. HIT-6 change scores were significantly higher in magnitude in groups experiencing greater improvement (P<0.001). All measurement properties were consistently verified across the two studies, supporting the validity of the HIT-6 among chronic migraine patients. NCT00156910 and NCT00168428 on www.ClinicalTrials.gov.
Kahraman, Turhan; Genç, Arzu; Göz, Evrim
2016-10-01
The purpose of this study was to linguistically and culturally adapt the Nordic Musculoskeletal Questionnaire (NMQ) for use in Turkey, and to examine the psychometric properties of this adapted version. The cross-cultural adaptation was achieved by translating the items from the original version, with back-translation performed by independent mother-tongue translators, followed by committee review. Reliability (internal consistency and test-retest) was examined for 198 participants who completed the NMQ twice (with a 1 week interval). Construct validity was examined with data from 126 participants from the same population, who completed further four questionnaires related to the body regions described in the NMQ. The internal consistency was excellent (Cronbach's alpha = 0.896). The test-retest reliability was examined with the prevalence-adjusted bias-adjusted kappa (PABAK) and all items showed moderate to almost perfect reliability (PABAK = 0.57-0.90). Participants with a musculoskeletal problem in a related region had significantly more disability/pain, as assessed by the relevant questionnaires (p < 0.001), indicating that the NMQ had a good construct validity. This study provided considerable evidence that the Turkish version of the NMQ has appropriate psychometric properties, including good test-retest reliability, internal consistency and construct validity. It can be used for screening and epidemiological investigations of musculoskeletal symptoms. Implications for Rehabilitation The Nordic Musculoskeletal Questionnaire (NMQ) can be used for the screening of musculoskeletal problems. The NMQ allows comparison of musculoskeletal problems in different body regions in epidemiological studies with large numbers of participants. The Turkish version of the NMQ can be used for rehabilitation due to its appropriate psychometric properties, including good test-retest reliability, internal consistency and construct validity.
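The test-retest statistic named in the abstract above, the prevalence-adjusted bias-adjusted kappa (PABAK), has a compact closed form for two-category items: PABAK = 2·Po − 1, where Po is the observed proportion of agreement. The sketch below assumes yes/no responses; the function name and example answers are hypothetical and are not data from the study.

```python
def pabak(first, second):
    """PABAK for paired binary ratings from two administrations."""
    if len(first) != len(second) or len(first) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    # Observed proportion of respondents giving the same answer both times.
    agreements = sum(a == b for a, b in zip(first, second))
    observed_agreement = agreements / len(first)
    return 2.0 * observed_agreement - 1.0


# Hypothetical yes/no (1/0) answers to one item, one week apart.
week_1 = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1]
week_2 = [1, 0, 1, 0, 0, 0, 1, 0, 1, 1]
print(pabak(week_1, week_2))
```

Unlike Cohen's kappa, this value depends only on observed agreement, so it is not pulled down when one answer is much more prevalent than the other.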
Shields, B M; McDonald, T J; Ellard, S; Campbell, M J; Hyde, C; Hattersley, A T
2012-05-01
Diagnosing MODY is difficult. To date, selection for molecular genetic testing for MODY has used discrete cut-offs of limited clinical characteristics with varying sensitivity and specificity. We aimed to use multiple, weighted, clinical criteria to determine an individual's probability of having MODY, as a crucial tool for rational genetic testing. We developed prediction models using logistic regression on data from 1,191 patients with MODY (n = 594), type 1 diabetes (n = 278) and type 2 diabetes (n = 319). Model performance was assessed by receiver operating characteristic (ROC) curves, cross-validation and validation in a further 350 patients. The models defined an overall probability of MODY using a weighted combination of the most discriminative characteristics. For MODY, compared with type 1 diabetes, these were: lower HbA(1c), parent with diabetes, female sex and older age at diagnosis. MODY was discriminated from type 2 diabetes by: lower BMI, younger age at diagnosis, female sex, lower HbA(1c), parent with diabetes, and not being treated with oral hypoglycaemic agents or insulin. Both models showed excellent discrimination (c-statistic = 0.95 and 0.98, respectively), low rates of cross-validated misclassification (9.2% and 5.3%), and good performance on the external test dataset (c-statistic = 0.95 and 0.94). Using the optimal cut-offs, the probability models improved the sensitivity (91% vs 72%) and specificity (94% vs 91%) for identifying MODY compared with standard criteria of diagnosis <25 years and an affected parent. The models are now available online at www.diabetesgenes.org . We have developed clinical prediction models that calculate an individual's probability of having MODY. This allows an improved and more rational approach to determine who should have molecular genetic testing.
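The abstract above compares sensitivity and specificity at a probability cut-off. As a hedged illustration of how those two figures follow from predicted probabilities once a cut-off is chosen (the function name, probabilities, labels, and cut-off below are hypothetical; they are not the published model or its data):

```python
def sensitivity_specificity(probabilities, has_condition, cutoff):
    """Sensitivity and specificity when cases at or above `cutoff` test positive."""
    true_pos = false_neg = true_neg = false_pos = 0
    for prob, truth in zip(probabilities, has_condition):
        predicted_positive = prob >= cutoff
        if truth and predicted_positive:
            true_pos += 1
        elif truth:
            false_neg += 1
        elif predicted_positive:
            false_pos += 1
        else:
            true_neg += 1
    if true_pos + false_neg == 0 or true_neg + false_pos == 0:
        raise ValueError("need at least one case with and one without the condition")
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Hypothetical model outputs and confirmed diagnoses (illustrative only).
probs = [0.92, 0.81, 0.40, 0.15, 0.65, 0.05]
truth = [True, True, True, False, False, False]
print(sensitivity_specificity(probs, truth, cutoff=0.5))
```

Raising the cut-off trades sensitivity for specificity, which is why a reported pair of values is only meaningful together with the cut-off used.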
Vyas, Shaleen; Nagarajappa, Sandesh; Dasar, Pralhad L.; Mishra, Prashant
2018-01-01
AIM: To translate OHIP-14 into Hindi and test its psychometric properties among school teacher community. METHODS: The OHIP-14 was translated to OHIP-14-H using WHO recommended translation protocol. During pre-testing, an expert panel assessed content validity of the questionnaire. Face validity was assessed on a sample of 10 individuals. The OHIP-14-H was administered on a random sample of 170 primary school teachers. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and Intra-class correlation coefficient (ICC) respectively, with 2 weeks interval. Predictive validity was tested by comparing OHIP-14-H scores with clinical parameters. The concurrent validity was assessed using self-reported oral health and discriminant validity was ascertained through negative association with sociodemographic variables. RESULTS: The mean OHIP-14-H score was 9.57 (S.D = 4.58). ICC and Cronbach's alpha for OHIP-14-H was 0.96 and 0.92 respectively. Concurrent validity using binomial regression model indicated that good (OR = 0.56, 95% CI = 0.55 – 4.47) and moderate (OR = 0.25, 95% CI = 0.17 – 1.87) OHIP-14-H scores were negative but significant risk indicators of poor self reported oral health (P < 0.009). Significant predictive validity was observed between OHIP-14-H scores and clinical parameters (P < 0.000). CONCLUSION: Translated and culturally adapted OHIP-14-H indicates good reliability and validity among primary school teachers. PMID:29417064
Wang, Yi-Wen; Tsai, Yun-Fang; Lee, Shwu-Hua; Chen, Ying-Jen; Chen, Hsiu-Fang
2016-07-01
To develop and psychometrically test the Protective Reasons against Suicide Inventory among older Chinese-speaking outpatients. Tools currently exist to test reasons for living among individuals of all ages in western countries, but few are available to assess older adults' protective reasons against suicide in Asia. A cross-sectional survey to investigate protective reasons against suicide among older Chinese-speaking outpatients. The Protective Reasons against Suicide Inventory was developed based on individual interviews with 83 older outpatients in Taiwan, the literature and the authors' clinical experiences. The resulting Inventory was examined in 2013 for content validity, face validity, construct validity, criterion-related validity, internal consistency reliability and test-retest reliability. The Inventory had excellent content validity and face validity. Factor analysis yielded a seven-factor solution, accounting for 87·7% of the variance. Scores on the global Inventory and its subscales tended to be higher in outpatients diagnosed without suicidal ideation than in outpatients diagnosed with suicidal ideation, indicating good criterion validity. Inventory reliability and the intraclass correlation coefficient were satisfactory. The Protective Reasons against Suicide Inventory can be completed in 5 minutes and is perceived as easy to complete. Moreover, the Inventory yielded highly acceptable parameters for validity and reliability. The Protective Reasons against Suicide Inventory can be used to assess older Chinese-speaking outpatients for factors that protect them from attempting suicide. © 2016 John Wiley & Sons Ltd.
Cane, James; O'Connor, Denise; Michie, Susan
2012-04-24
An integrative theoretical framework, developed for cross-disciplinary implementation and other behaviour change research, has been applied across a wide range of clinical situations. This study tests the validity of this framework. Validity was investigated by behavioural experts sorting 112 unique theoretical constructs using closed and open sort tasks. The extent of replication was tested by Discriminant Content Validation and Fuzzy Cluster Analysis. There was good support for a refinement of the framework comprising 14 domains of theoretical constructs (average silhouette value 0.29): 'Knowledge', 'Skills', 'Social/Professional Role and Identity', 'Beliefs about Capabilities', 'Optimism', 'Beliefs about Consequences', 'Reinforcement', 'Intentions', 'Goals', 'Memory, Attention and Decision Processes', 'Environmental Context and Resources', 'Social Influences', 'Emotions', and 'Behavioural Regulation'. The refined Theoretical Domains Framework has a strengthened empirical base and provides a method for theoretically assessing implementation problems, as well as professional and other health-related behaviours as a basis for intervention development.
Kim, Daeho; Kim, Kwang-iel; Lee, Haewon; Choi, Joonho; Park, Yong-Chon
2005-04-01
The Illness Intrusiveness Rating Scale (IIRS) measures illness-induced disruptions to 13 domains of lifestyles, activities, and interests. A stable three-factor structure has been well documented; however, the cross-cultural validity of this scale needs to be tested. This study investigated the factor structure of the Korean version of IIRS in 712 outpatients at a university medical center. The predominant diagnosis of the patients was rheumatoid arthritis (47%). The Center for Epidemiological Studies-Depression Scale (CES-D) and Health Assessment Questionnaire (HAQ) were also administered. Exploratory Principal Component Analysis identified a two-factor structure, "Relationships and Personal Development (RPD)" and "Instrumental", accounting for 57% of the variance. Confirmatory analyses extracted an identical factor structure. However, a goodness-of-fit test failed to support the two-factor solution (χ²=138.2, df=43, p<.001). Two factors had high internal consistency (RPD, alpha=.89; Instrumental, alpha=.75) and significantly correlated with scores of HAQ (RPD, r=.53, p<.001; Instrumental, r=.44, p<.001) and CES-D (RPD, r=.55, p<.001; Instrumental, r=.43, p<.001). These findings supported construct validity of the Korean version of IIRS, but did not support cross-cultural equivalence of the factor structure.
NASA Astrophysics Data System (ADS)
Provo, Judy; Lamar, Carlton; Newby, Timothy
2002-01-01
A cross section was used to enhance three-dimensional knowledge of anatomy of the canine head. All veterinary students in two successive classes (n = 124) dissected the head; experimental groups also identified structures on a cross section of the head. A test assessing spatial knowledge of the head generated 10 dependent variables from two administrations. The test had content validity and statistically significant interrater and test-retest reliability. A live-dog examination generated one additional dependent variable. Analysis of covariance controlling for performance on course examinations and quizzes revealed no treatment effect. Including spatial skill as a third covariate revealed a statistically significant effect of spatial skill on three dependent variables. Men initially had greater spatial skill than women, but spatial skills were equal after 8 months. A qualitative analysis showed the positive impact of this experience on participants. Suggestions for improvement and future research are discussed.
Dogan, Eyup; Seker, Fahri
2016-07-01
This empirical study analyzes the impacts of real income, energy consumption, financial development and trade openness on CO2 emissions for the OECD countries in the Environmental Kuznets Curve (EKC) model by using panel econometric approaches that consider issues of heterogeneity and cross-sectional dependence. Results from the Pesaran CD test, the Pesaran-Yamagata's homogeneity test, the CADF and the CIPS unit root tests, the LM bootstrap cointegration test, the DSUR estimator, and the Emirmahmutoglu-Kose Granger causality test indicate that (i) the panel time-series data are heterogeneous and cross-sectionally dependent; (ii) CO2 emissions, real income, the quadratic income, energy consumption, financial development and openness are integrated of order one; (iii) the analyzed data are cointegrated; (iv) the EKC hypothesis is validated for the OECD countries; (v) increases in openness and financial development mitigate the level of emissions whereas energy consumption contributes to carbon emissions; (vi) a variety of Granger causal relationship is detected among the analyzed variables; and (vii) empirical results and policy recommendations are accurate and efficient since panel econometric models used in this study account for heterogeneity and cross-sectional dependence in their estimation procedures.
Michel, Pierre; Auquier, Pascal; Baumstarck, Karine; Pelletier, Jean; Loundou, Anderson; Ghattas, Badih; Boyer, Laurent
2015-09-01
Quality of life (QoL) measurements are considered important outcome measures both for research on multiple sclerosis (MS) and in clinical practice. Computerized adaptive testing (CAT) can improve the precision of measurements made using QoL instruments while reducing the burden of testing on patients. Moreover, a cross-cultural approach is also necessary to guarantee the wide applicability of CAT. The aim of this preliminary study was to develop a calibrated item bank that is available in multiple languages and measures QoL related to mental health by combining one generic (SF-36) and one disease-specific questionnaire (MusiQoL). Patients with MS were enrolled in this international, multicenter, cross-sectional study. The psychometric properties of the item bank were based on classical test and item response theories and approaches, including the evaluation of unidimensionality, item response theory model fitting, and analyses of differential item functioning (DIF). Convergent and discriminant validities of the item bank were examined according to socio-demographic, clinical, and QoL features. A total of 1992 patients with MS and from 15 countries were enrolled in this study to calibrate the 22-item bank developed in this study. The strict monotonicity of the Cronbach's alpha curve, the high eigenvalue ratio estimator (5.50), and the adequate CFA model fit (RMSEA = 0.07 and CFI = 0.95) indicated that a strong assumption of unidimensionality was warranted. The infit mean square statistic ranged from 0.76 to 1.27, indicating a satisfactory item fit. DIF analyses revealed no item biases across geographical areas, confirming the cross-cultural equivalence of the item bank. External validity testing revealed that the item bank scores correlated significantly with QoL scores but also showed discriminant validity for socio-demographic and clinical characteristics. 
This work demonstrated satisfactory psychometric characteristics for a QoL item bank for MS in multiple languages. This work may offer a common measure for the assessment of QoL in different cultural contexts and for international studies conducted on MS.
Muquith, Mohammed A; Islam, Md Nazrul; Haq, Syed A; Ten Klooster, Peter M; Rasker, Johannes J; Yunus, Muhammad B
2012-08-27
Currently, no validated instruments are available to measure the health status of Bangladeshi patients with fibromyalgia (FM). The aims of this study were to cross-culturally adapt the modified Fibromyalgia Impact Questionnaire (FIQ) into Bengali (B-FIQ) and to test its validity and reliability in Bangladeshi patients with FM. The FIQ was translated following cross-cultural adaptation guidelines and pretested in 30 female patients with FM. Next, the adapted B-FIQ was physician-administered to 102 consecutive female FM patients together with the Health Assessment Questionnaire (HAQ), selected subscales of the SF-36, and visual analog scales for current clinical symptoms. A tender point count (TPC) was performed by an experienced rheumatologist. Forty randomly selected patients completed the B-FIQ again after 7 days. Two control groups of 50 healthy people and 50 rheumatoid arthritis (RA) patients also completed the B-FIQ. For the final B-FIQ, five physical function sub-items were replaced with culturally appropriate equivalents. Internal consistency was adequate for both the 11-item physical function subscale (α = 0.73) and the total scale (α = 0.83). With exception of the physical function subscale, expected correlations were generally observed between the B-FIQ items and selected subscales of the SF-36, HAQ, clinical symptoms, and TPC. The B-FIQ was able to discriminate between FM patients and healthy controls and between FM patients and RA patients. Test-retest reliability was adequate for the physical function subscale (r = 0.86) and individual items (r = 0.73-0.86), except anxiety (r = 0.27) and morning tiredness (r = 0.64). This study supports the reliability and validity of the B-FIQ as a measure of functional disability and health status in Bangladeshi women with FM.
Cross-cultural adaptation and validation of the Behcet’s Disease Current Activity Form in Korea
Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo
2015-01-01
Background/Aims: This study was undertaken to perform a cross-cultural adaptation of the Behcet’s Disease Current Activity Form (BDCAF, version 2006) questionnaire to the Korean language and to evaluate its reliability and validity in a population of Korean patients with Behcet’s disease (BD). Methods: A cross-cultural study was conducted among patients with BD who attended our rheumatology clinic between November 2012 and March 2013. There were 11 males and 35 females in the group. The mean age of the participants was 48.5 years and the mean disease duration was 6.4 years. The first BDCAF questionnaire was completed on arrival and the second assessment was performed 20 minutes later by a different physician. The test-retest reliability was analyzed by computing κ statistics. Kappa scores of > 0.6 indicated a good agreement. To assess the validity, we compared the total BDCAF score with the patient’s/clinician’s perception of disease activity and the Korean version of the Behcet’s Disease Quality of Life (BDQOL). Results: For the test-retest reliability, good agreements were achieved on items such as headache, oral/genital ulceration, erythema, skin pustules, arthralgia, nausea/vomiting/abdominal pain, and diarrhea with altered/frank blood per rectum. Moderate agreement was observed for eye and nervous system involvement. We achieved a fair agreement for arthritis and major vessel involvement. Significant correlations were obtained between the total BDCAF score with the BDQOL and the patient’s/clinician’s perception of disease activity (p < 0.05). Conclusions: The Korean version of the BDCAF is a reliable and valid instrument for measuring current disease activity in Korean BD patients. PMID:26354066
Gallasch, Cristiane Helena; Alexandre, Neusa Maria Costa; Amick, Benjamin
2007-12-01
The study objectives were to translate and adapt the Work Role Functioning Questionnaire (WRFQ) into the Brazilian Portuguese language and evaluate its reliability in patients experiencing musculoskeletal disorders. The cross-cultural adaptation was performed according to the internationally recommended methodology, using the following guidelines: translation, back-translation, revision by a committee, and pretest. At first, the questionnaire was independently translated by two bilingual translators, who had Portuguese as their mother language. Subsequently, two other translators whose mother language was English did the back-translation. A committee composed of five specialists revised and compared the translations obtained, developing the final version for pretest application. The pretest was carried out with 30 patients experiencing musculoskeletal disorders. Psychometric properties were evaluated by administering the questionnaire to 105 subjects with musculoskeletal disorders and receiving physical therapy treatment. The reliability was estimated through stability and homogeneity assessment. The construct validity was tested comparing subjects experiencing musculoskeletal disorders to healthy workers. The results indicated good content validity and internal consistency (Cronbach alpha = 0.95). Cronbach alpha for each scale was >0.85, except for the social demand scale. The Intraclass Correlation Coefficient for the test-retest reliability was satisfactory for mental demands (ICC = 0.68) and excellent for the others (0.82-0.91). In relation to the construct validity, the mean score obtained for each scale was lower for physical, work scheduling, and output demands in the subjects with musculoskeletal disorders. There was a significant difference (p < 0.001) between the groups in comparison to work scheduling, physical, and output demands. 
The data showed that the cross-cultural adaptation process was successful and the adapted instrument demonstrated psychometric properties making it reliable to use in Brazilian culture.
Nayana, M Ravi Shashi; Sekhar, Y Nataraja; Nandyala, Haritha; Muttineni, Ravikumar; Bairy, Santosh Kumar; Singh, Kriti; Mahmood, S K
2008-10-01
In the present study, a series of 179 quinoline and quinazoline heterocyclic analogues exhibiting inhibitory activity against Gastric (H+/K+)-ATPase were investigated using the comparative molecular field analysis (CoMFA) and comparative molecular similarity indices (CoMSIA) methods. Both the models exhibited good correlation between the calculated 3D-QSAR fields and the observed biological activity for the respective training set compounds. The most optimal CoMFA and CoMSIA models yielded significant leave-one-out cross-validation coefficient, q², of 0.777 and 0.744 and conventional cross-validation coefficient, r², of 0.927 and 0.914, respectively. The predictive ability of generated models was tested on a set of 52 compounds having broad range of activity. CoMFA and CoMSIA yielded predicted activities for test set compounds with r²(pred) of 0.893 and 0.917 respectively. These validation tests not only revealed the robustness of the models but also demonstrated that for our models r²(pred) based on the mean activity of test set compounds can accurately estimate external predictivity. The factors affecting activity were analyzed carefully according to standard coefficient contour maps of steric, electrostatic, hydrophobic, acceptor and donor fields derived from the CoMFA and CoMSIA. These contour plots identified several key features which explain the wide range of activities. The results obtained from models offer important structural insight into designing novel peptic-ulcer inhibitors prior to their synthesis.
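The leave-one-out cross-validation coefficient reported above is conventionally defined as q² = 1 − PRESS/SS, where PRESS sums the squared errors of predictions made for each compound by a model fitted without it, and SS is the sum of squared deviations of the observed activities from their mean. The sketch below illustrates that definition with ordinary least squares swapped in for the partial least squares regression that CoMFA and CoMSIA normally use; the function name, descriptor values, and activities are hypothetical.

```python
import numpy as np


def loo_q2(descriptors, activity):
    """Leave-one-out q^2 = 1 - PRESS / SS, using ordinary least squares.

    OLS is a stand-in for illustration only; CoMFA/CoMSIA models are
    normally fitted with partial least squares.
    """
    y = np.asarray(activity, dtype=float)
    n = len(y)
    # Design matrix with an intercept column followed by the descriptors.
    design = np.column_stack([np.ones(n), np.asarray(descriptors, dtype=float)])
    press = 0.0
    for i in range(n):
        keep = np.arange(n) != i
        coef, *_ = np.linalg.lstsq(design[keep], y[keep], rcond=None)
        # Squared error when predicting the held-out compound.
        press += (y[i] - design[i] @ coef) ** 2
    total_ss = np.sum((y - y.mean()) ** 2)
    return 1.0 - press / total_ss


# Hypothetical single-descriptor data set (illustrative only).
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [1.1, 2.9, 5.2, 6.8, 9.1, 11.0]
print(round(loo_q2(x, y), 3))
```

Because every prediction comes from a model that never saw the held-out compound, q² is typically lower than the fitted r², as with the 0.777 versus 0.927 pair in the abstract.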
Jalenques, Isabelle; Guiguet-Auclair, Candy; Derost, Philippe; Joubert, Pauline; Foures, Louis; Hartmann, Andreas; Muellner, Julia; Rondepierre, Fabien
2018-03-01
The Motor tic, Obsessions and compulsions, Vocal tic Evaluation Survey (MOVES) is a self-report scale suggested as a severity scale for tics and related sensory phenomena observed in Gilles de la Tourette syndrome (GTS) and recommended as a screening instrument by the Committee on Rating Scale Development of the International Parkinson's Disease and Movement Disorder Society. To cross-culturally adapt a French version of the MOVES and to evaluate its psychometric properties. After the cross-cultural adaptation of the MOVES, we assessed its psychometric properties in 53 patients aged 12-16 years and in 54 patients aged 16 years and above: reliability and construct validity (relationships between items and scales), internal consistency and concurrent validity with the Yale Global Tic Severity Scale (YGTSS) and the Children's Yale-Brown Obsessive-Compulsive Scale (CY-BOCS) or the auto-Yale-Brown scale. The results showed very good acceptability with response rates greater than 92%, good internal consistency (Cronbach's alpha ranging from 0.62 to 0.89) and good test-retest reliability (ICCs ranging from 0.59 to 0.91). Concurrent validity with the YGTSS, CY-BOCS and auto-Yale-Brown scales showed strong expected correlations. The cut-off points tested for diagnostic performance gave satisfactory values of sensitivity, specificity, and positive and negative predictive values. Our study provides evidence of the good psychometric properties of the French version of the MOVES. The cross-cultural adaptation of this specific instrument will allow investigators to include French-speaking persons with GTS aged 12 years and over in national and international collaboration research projects.
Reliability and Concurrent Validity of Dynamic Rotator Stability Test-A Cross Sectional study.
Binoy Mathew, K V; Eapen, Charu; Kumar, P Senthil
2012-01-01
To find intra rater and inter rater reliability of Dynamic Rotator Stability Test (DRST) and to find concurrent validity of Dynamic Rotator Stability Test (DRST) with University of Pennsylvania Shoulder Score (PENN) Scale. Forty subjects of either gender between the ages of 18 and 70 with painful shoulder conditions of musculoskeletal origin were selected through convenience sampling. Tester 1 and tester 2 administered DRST and PENN scale randomly. In a subgroup of 20 subjects DRST was administered by both the testers to find the inter rater reliability. A 180° Standard Universal Goniometer was used to take measurements. For intra-rater reliability, all the test variables showed highly significant correlation (p=.94 - 1). For inter-rater reliability, with tester 2, test variables like position, ROM, force, direction of abnormal translation, pain during the test, and compensatory movement during the test were found to be significant (p=.71-1). Only some variables of DRST showed significant correlation with PENN scale (P=.320-.450). Dynamic Rotator Stability Test has good intra rater and moderate inter rater reliability. Concurrent validity of Dynamic Rotator Stability Test was found to be poor when compared to PENN Shoulder Score.
Mao, Hui-Fen; Chen, Wan-Yin; Yao, Grace; Huang, Sheau-Ling; Lin, Chia-Chi; Huang, Wen-Ni Wennie
2010-05-01
To develop and validate a cross-cultural version of the Quebec User Evaluation of Satisfaction with Assistive Technology (QUEST 2.0) for users of assistive technology devices in Taiwan. A cross-sectional survey. The standard cultural adaptation procedure was used for questionnaire translation and cultural item design. A field test was then conducted for item selection and psychometric properties testing. One hundred and five volunteer assistive device users in the community. A questionnaire comprising 12 items of the QUEST 2.0 and 16 culture-specific items. One culture-specific item, 'Cost', was selected based on eight criteria and added to the QUEST 2.0 (12 items) to formulate the Taiwanese version of QUEST 2.0 (T-QUEST). The T-QUEST consisted of 13 items which were classified into two domains: device (8 items) and service (5 items). The internal consistencies of the device, service and total T-QUEST scores were 0.87, 0.84 and 0.90, respectively. The device, services and total T-QUEST scores achieved good test-retest stability (intraclass correlation coefficient (ICC) 0.90, 0.97, 0.95). Exploratory factor analysis revealed that T-QUEST had a two-factor structure for device and service in the construct of user satisfaction (53.42% of the variance explained). Users of assistive devices in different cultures may have different concerns regarding satisfaction. T-QUEST is the first published version of QUEST with culture-specific items added to the original translated items of QUEST 2.0. T-QUEST was a valid and reliable tool for measuring user satisfaction among Mandarin-speaking individuals using various kinds of assistive devices.
Ridenour, Ty A.; Willis, David; Bogen, Debra L.; Novak, Scott; Scherer, Jennifer; Reynolds, Maureen D.; Zhai, Zu Wei; Tarter, Ralph E.
2015-01-01
Background Youth substance use (SU) is prevalent and costly, affecting mental and physical health. American Academy of Pediatrics and Affordable Care Act call for SU screening and prevention. The Youth Risk Index© (YRI) was tested as a screening tool for having initiated and propensity to initiate SU before high school (which forecasts SU disorder). YRI was hypothesized to have good to excellent psychometrics, feasibility and stakeholder acceptability for use during well-child check-ups. Design A high-risk longitudinal design with two cross-sectional replication samples, ages 9–13 was used. Analyses included receiver operating characteristics and regression analyses. Participants A one-year longitudinal sample (N=640) was used for YRI derivation. Replication samples were a cross-sectional sample (N=345) and well-child check-up patients (N=105) for testing feasibility, validity and acceptability as a screening tool. Results YRI has excellent test-retest reliability and good sensitivity and specificity for concurrent and one-year-later SU (odds ratio=7.44 CI=4.3–13.0) and conduct problems (odds ratios=7.33 CI=3.9–13.7). Results were replicated in both cross-sectional samples. Well-child patients, parents and pediatric staff rated YRI screening as important, acceptable, and a needed service. Conclusions Identifying at-risk youth prior to age 13 could reap years of opportunity to intervene before onset of SU disorder. Most results pertained to YRI’s association with concurrent or recent past risky behaviors; further replication ought to specify its predictive validity, especially adolescent-onset risky behaviors. YRI well identifies youth at risk for SU and conduct problems prior to high school, is feasible and valid for screening during well-child check-ups, and is acceptable to stakeholders. PMID:25765481
Sivan, Sree Kanth; Manga, Vijjulatha
2012-02-01
Multiple receptors conformation docking (MRCD) and clustering of dock poses allows seamless incorporation of receptor binding conformation of the molecules on wide range of ligands with varied structural scaffold. The accuracy of the approach was tested on a set of 120 cyclic urea molecules having HIV-1 protease inhibitory activity using 12 high resolution X-ray crystal structures and one NMR resolved conformation of HIV-1 protease extracted from protein data bank. A cross validation was performed on 25 non-cyclic urea HIV-1 protease inhibitors having varied structures. The comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) models were generated using 60 molecules in the training set by applying the leave-one-out cross validation method; r(loo)(2) values of 0.598 and 0.674 for CoMFA and CoMSIA respectively and non-cross validated regression coefficient r(2) values of 0.983 and 0.985 were obtained for CoMFA and CoMSIA respectively. The predictive ability of these models was determined using a test set of 60 cyclic urea molecules that gave predictive correlation (r(pred)(2)) of 0.684 and 0.64 respectively for CoMFA and CoMSIA indicating good internal predictive ability. Based on this information 25 non-cyclic urea molecules were taken as a test set to check the external predictive ability of these models. This gave a remarkable outcome with r(pred)(2) of 0.61 and 0.53 for CoMFA and CoMSIA respectively. The results invariably show that this method is useful for performing 3D QSAR analysis on molecules having different structural motifs.
Singh, Varun Pratap; Singh, Rajkumar
2014-03-01
The aim of this study was to develop a reliable and valid Nepali version of the Psychosocial Impact of Dental Aesthetic Questionnaire (PIDAQ). Cross-sectional descriptive validation study. B.P. Koirala Institute of Health Sciences, Dharan, Nepal. A rigorous translation process including conceptual and semantic evaluation, translation, back translation and pre-testing was carried out. Two hundred and fifty-two undergraduates, including equal numbers of males and females with an age ranging from 18 to 29 years (mean age: 22·33±2·114 years), participated in this study. Reliability was assessed by Cronbach's alpha coefficient and the coefficient of correlation was used to assess correlation between items and test-retest reliability. The construct validity was tested by factorial analysis. Convergent construct validity was tested by comparison of PIDAQ scores with the aesthetic component of the index of orthodontic treatment needs (IOTN-AC) and perception of occlusion scale (POS), respectively. Discriminant construct validity was assessed by differences in score for those who demand treatment and those who did not. The response rate was 100%. One hundred and twenty-three individuals had a demand for orthodontic treatment. The Nepali PIDAQ had excellent reliability with Cronbach's alpha of 0·945, corrected item correlation between 0·525 and 0·790 and overall test-retest reliability of 0·978. The construct validity was good with formation of a new sub-domain 'Dental self-consciousness'. The scale had good correlation with IOTN-AC and POS fulfilling convergent construct validity. The discriminant construct validity was proved by significant differences in scores for subjects with demand and without demand for treatment. To conclude, Nepali version of PIDAQ has good psychometric properties and can be used effectively in this population group for further research.
UTM Technical Capabilities Level 2 (TCL2) Test at Reno-Stead Airport.
2016-10-06
Test of Unmanned Aircraft Systems Traffic Management (UTM) technical capability Level 2 (TCL2) at Reno-Stead Airport, Nevada. During the test, five drones simultaneously crossed paths, separated by altitude. Two drones flew beyond visual line-of-sight and three flew within line-of-sight of their operators. Engineer Joey Mercer reviews flight paths using the UAS traffic management research platform UTM coordinator app to verify and validate flight paths.
Validation of Metrics as Error Predictors
NASA Astrophysics Data System (ADS)
Mendling, Jan
In this chapter, we test the validity of metrics that were defined in the previous chapter for predicting errors in EPC business process models. In Section 5.1, we provide an overview of how the analysis data is generated. Section 5.2 describes the sample of EPCs from practice that we use for the analysis. Here we discuss a disaggregation by the EPC model group and by error as well as a correlation analysis between metrics and error. Based on this sample, we calculate a logistic regression model for predicting error probability with the metrics as input variables in Section 5.3. In Section 5.4, we then test the regression function for an independent sample of EPC models from textbooks as a cross-validation. Section 5.5 summarizes the findings.
Freitas, N O; Forero, C G; Alonso, J; Caltran, M P; Dantas, R A S; Farina, J A; Rossi, L A
2017-01-01
Burn patients may encounter social barriers and stigmatization. The objectives of this study were to adapt the Social Comfort Questionnaire (SCQ) into Brazilian Portuguese and to assess the psychometric properties of the adapted version. Cross-cultural adaptation of the 8 items of the SCQ followed international guidelines. We interviewed 240 burn patients and verified the SCQ internal consistency, test-retest reliability and construct validity, correlating the scores with depression [Beck Depression Inventory (BDI)], affect/body image and interpersonal relationships [Burns Specific Health Scale-Revised (BSHS-R)] and self-esteem [Rosenberg's Self-Esteem Scale (RSES)]. We also performed a confirmatory factor analysis (CFA). The cross-cultural adaptation resulted in minor semantic modifications to the original SCQ version. After CFA, a reduced 6-item version showed satisfactory fit to the one-factor model (RMSEA = 0.05, CFI = 0.99, TLI = 0.99). Cronbach's alpha was 0.80, and the test-retest intraclass correlation coefficient was 0.86. The final version presented a strong negative correlation with depression (BDI), and strong positive correlations with affect/body image (BSHS-R), interpersonal relationships (BSHS-R) and self-esteem (RSES) (all p < 0.001). The results showed that the SCQ Brazilian Portuguese adapted version complies with the validity and reliability criteria required for an instrument assessing social comfort in Brazilian burn patients. The Brazilian version yields a single score that is easy to interpret and well understood by patients.
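Several of the abstracts in this listing report internal consistency as Cronbach's alpha. As a reading aid, the following minimal sketch computes alpha from a respondents-by-items score table using the textbook formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the `demo_scores` data are hypothetical illustrations, not taken from any of the studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical 5 respondents x 3 items, for illustration only.
demo_scores = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 5, 4]]
demo_alpha = cronbach_alpha(demo_scores)
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why values such as 0.80 are read as acceptable internal consistency.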
Seligman, D A; Pullinger, A G
2006-11-01
To determine whether patients with temporomandibular joint disease or masticatory muscle pain can be usefully differentiated from asymptomatic controls using multifactorial classification tree models of attrition severity and/or rates. Measures of attrition severity and rates in patients diagnosed with disc displacement (n = 52), osteoarthrosis (n = 74), or masticatory muscle pain only (n = 43) were compared against those in asymptomatic controls (n = 132). Cross-validated classification tree models were tested for fit with sensitivity, specificity, accuracy and log likelihood accountability. The model for identifying asymptomatic controls only required the three measures of attrition severity (anterior, mediotrusive and laterotrusive posterior) to be differentiated from the patients with a 74.2 ± 3.8% cross-validation accuracy. This compared with cross-validation accuracies of 69.7 ± 3.7% for differentiating disc displacement using anterior and laterotrusive attrition severity, 68.7 ± 3.9% for differentiating disc displacement using anterior and laterotrusive attrition rates, 70.9 ± 3.3% for differentiating osteoarthrosis using anterior attrition severity and rates, 94.6 ± 2.1% for differentiating myofascial pain using mediotrusive and laterotrusive attrition severity, and 92.0 ± 2.1% for differentiating myofascial pain using mediotrusive and anterior attrition rates. The myofascial pain models exceeded the ≥75% sensitivity and ≥90% specificity thresholds recommended for diagnostic tests, and the asymptomatic control model approached these thresholds. Multifactorial models using attrition severity and rates may differentiate masticatory muscle pain patients from asymptomatic controls, and have some predictive value for differentiating intracapsular temporomandibular disorder patients as well.
Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane
2013-01-01
Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765
Mendonça, Bianca; Sargent, Barbara; Fetters, Linda
2016-12-01
To investigate whether standardized motor development screening and assessment tools that are used to evaluate motor abilities of children aged 0 to 2 years are valid in cultures other than those in which the normative sample was established. This was a systematic review in which six databases were searched. Studies were selected based on inclusion/exclusion criteria and appraised for evidence level and quality. Study variables were extracted. Twenty-three studies representing six motor development screening and assessment tools in 16 cultural contexts met the inclusion criteria: Alberta Infant Motor Scale (n=7), Ages and Stages Questionnaire, 3rd edition (n=2), Bayley Scales of Infant and Toddler Development, 3rd edition (n=8), Denver Developmental Screening Test, 2nd edition (n=4), Harris Infant Neuromotor Test (n=1), and Peabody Developmental Motor Scales, 2nd edition (n=1). Thirteen studies found significant differences between the cultural context and normative sample. Two studies established reliability and/or validity of standardized motor development assessments in high-risk infants from different cultural contexts. Five studies established new population norms. Eight studies described the cross-cultural adaptation of a standardized motor development assessment. Standardized motor development assessments have limited validity in cultures other than that in which the normative sample was established. Their use can result in under- or over-referral for services. © 2016 Mac Keith Press.
Prediction of breast cancer risk with volatile biomarkers in breath.
Phillips, Michael; Cataneo, Renee N; Cruz-Ramos, Jose Alfonso; Huston, Jan; Ornelas, Omar; Pappas, Nadine; Pathak, Sonali
2018-03-23
Human breath contains volatile organic compounds (VOCs) that are biomarkers of breast cancer. We investigated the positive and negative predictive values (PPV and NPV) of breath VOC biomarkers as indicators of breast cancer risk. We employed ultra-clean breath collection balloons to collect breath samples from 54 women with biopsy-proven breast cancer and 124 cancer-free controls. Breath VOCs were analyzed with gas chromatography (GC) combined with either mass spectrometry (GC MS) or surface acoustic wave detection (GC SAW). Chromatograms were randomly assigned to a training set or a validation set. Monte Carlo analysis identified significant breath VOC biomarkers of breast cancer in the training set, and these biomarkers were incorporated into a multivariate algorithm to predict disease in the validation set. In the unsplit dataset, the predictive algorithms generated discriminant function (DF) values that varied with sensitivity, specificity, PPV and NPV. Using GC MS, test accuracy = 90% (area under curve of receiver operating characteristic in unsplit dataset) and cross-validated accuracy = 77%. Using GC SAW, test accuracy = 86% and cross-validated accuracy = 74%. With both assays, a low DF value was associated with a low risk of breast cancer (NPV > 99.9%). A high DF value was associated with a high risk of breast cancer and PPV rising to 100%. Analysis of breath VOC samples collected with ultra-clean balloons detected biomarkers that accurately predicted risk of breast cancer.
Ko, Young-Mi; Park, Won-Beom; Lim, Jae-Young
2010-03-15
Validation of a translated, culturally adapted questionnaire. We developed a Korean version of the Chronic Pain Coping Inventory-42 (CPCI-42) by performing a cross-cultural adaptation, and evaluated its reliability and validity. The CPCI is a widely used and validated instrument for measuring coping strategies in chronic pain. However, no validated and culturally adapted version was available in Asian countries. We assessed 142 patients with chronic low back pain using the CPCI-42 and measures of physical disability, pain, and quality of life. Results for 93 of the 142 patients exhibited test-retest reliability. The interval time of collecting retest data varied from 2 weeks to 1 month. Criterion validity was evaluated using correlations between the CPCI-42 and the Oswestry Disability Index, the Brief Pain Inventory, and the Short Form 36-item Health Survey (version 2.0). Construct validity was computed using exploratory factor analysis. The Korean version of the CPCI-42 had a high internal consistency (Cronbach's alpha >0.70) with the exception of results for task persistence and relaxation. Illness-focused coping (guarding, resting, asking for assistance) and other-focused coping (seeking social support) were most significantly correlated with Oswestry Disability Index, Brief Pain Inventory, and Short Form 36-item Health Survey, respectively. Outcomes for task persistence were contrary to other subscales in wellness-focused coping. Construct validity by factor analysis produced similar results to the original CPCI subscale. However, several factors showed cross-loading in 8 factor solutions. Despite linguistic and cultural differences, the Korean version of the CPCI-42 is overall a meaningful tool, and produces results sufficiently similar to the original CPCI-42.
Cui, Jin; Jia, Zhenyu; Zhi, Xin; Li, Xiaoqun; Zhai, Xiao; Cao, Liehu; Weng, Weizong; Zhang, Jun; Wang, Lin; Chen, Xiao; Su, Jiacan
2017-01-05
The Achilles tendon Total Rupture Score (ATRS), which was originally developed in 2007 in Swedish, is the only patient-reported outcome measure (PROM) for specific outcome assessment of an Achilles tendon rupture. The purpose of this study is to translate and cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) into simplified Chinese, and primarily evaluate its responsiveness, reliability and validity. The internationally recognized guideline designed by Beaton was followed to translate the ATRS from English into a simplified Chinese version (CH-ATRS). A prospective cohort study was carried out for the cross-cultural adaptation. There were 112 participants included in the study. Psychometric properties including floor and ceiling effects, Cronbach's alpha, intraclass correlation coefficient, effect size, standard response mean, and construct validity were tested. The mean score of the CH-ATRS was 57.42 ± 13.70. No sign of a floor or ceiling effect was found for the CH-ATRS. A high level of internal consistency was supported by the value of Cronbach's alpha (0.893). The ICC (0.979, 95%CI: 0.984-0.993) was high, indicating high test-retest reliability. Great responsiveness was shown by the high absolute values of ES and SRM (0.84 and 8.98, respectively). The total CH-ATRS score had very good correlation with the physical function and body pain subscales of SF-36 (r = -0.758 and r = -0.694, respectively, p < 0.001), while it had poor correlation with the vitality and role physical subscales of SF-36 (r = -0.033 and r = -0.025, respectively, p ≥ 0.05), which supported the construct validity of the CH-ATRS. This Chinese version of the Achilles tendon Total Rupture Score (CH-ATRS) can be used as a reliable and valid instrument for Achilles tendon rupture assessment in the Chinese-speaking population. Level of evidence II.
Lohrer, Heinz; Nauck, Tanja
2011-03-01
Clinical measurement study. To cross-culturally adapt and validate the Victorian Institute of Sports Assessment Patellar Tendinopathy Questionnaire (VISA-P) for German-speaking patients. Like most questionnaires, the VISA-P was developed for English-speaking patients. There is a need to adapt the scale for German-speaking patients and thereby add to the total body of psychometric evidence relating to this instrument. The VISA-P questionnaire was translated and cross-culturally adapted into German (VISA-P-G) in 6 steps: translation, synthesis, back translation, expert committee review, pretesting, and advisory committee appraisal. The psychometric properties of the VISA-P-G were determined using 23 patients with patellar tendinopathy and 57 active healthy persons (32 sport students and 25 basketball players). Reliability was evaluated by applying the questionnaire twice within a week to all 80 participants. Known group validity was calculated using a 1-way analysis of variance. Additionally, VISA-P-G results were correlated with the Blazina classification system for patellar tendinopathy, using the Spearman rank correlation coefficient. VISA-P-G ratings from the present study groups were further compared with respective data published in the original English, Dutch, and Swedish versions by a 2-sample t test. Internal consistency for the individual items of the questionnaire was determined within the patient group using a Cronbach alpha. Test-retest revealed excellent reliability for the patient and the asymptomatic control group (ICC = 0.88 and 0.87, respectively). Internal consistency for the patients was 0.88. Concurrent validity was almost perfect (ρ = -0.81; P<.001). The VISA-P-G is a reliable and valid questionnaire for the self-assessment of pain, symptoms, and function in German-speaking patients with patellar tendinopathy. Its psychometric properties are comparable with the original English and international adaptations (Swedish, Dutch, and Italian).
Ko, Jupil; Rosen, Adam B; Brown, Cathleen N
2017-09-12
To cross-culturally adapt the Identification of Functional Ankle Instability (IdFAI) for use with Korean-speaking participants. The English version of the IdFAI was cross-culturally adapted into Korean based on the guidelines. The psychometric properties of the Korean version of the IdFAI were measured for test-retest reliability, internal consistency, criterion-related validity, discriminative validity, and measurement error in 181 native Korean speakers. The intra-class correlation coefficient (ICC(2,1)) between the English and Korean versions of the IdFAI for test-retest reliability was 0.98 (standard error of measurement = 1.41). The Cronbach's alpha coefficient was 0.89 for the Korean version of the IdFAI. The Korean version of the IdFAI had a strong correlation with the SF-36 (r_s = -0.69, p < .001) and the Korean version of the Cumberland Ankle Instability Tool (r_s = -0.65, p < .001). A score of >10 was the optimal cutoff to distinguish between the group memberships. The minimally detectable change of the Korean version of the IdFAI score was 3.91. The Korean version of the IdFAI has been shown to be an excellent, reliable, and valid instrument. The Korean version of the IdFAI can be utilized to assess the presence of Chronic Ankle Instability by researchers and clinicians working among Korean-speaking populations. Implications for rehabilitation The high recurrence rate of sprains may result in Chronic Ankle Instability (CAI). The Identification of Functional Ankle Instability Tool (IdFAI) has been validated and recommended to identify patients with Chronic Ankle Instability (CAI). The Korean version of the Identification of Functional Ankle Instability Tool (IdFAI) may also be recommended to researchers and clinicians for assessing the presence of Chronic Ankle Instability (CAI) in Korean-speaking populations.
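The abstract above reports a standard error of measurement (SEM) of 1.41 and a minimally detectable change of 3.91 without stating the formula that links them. A widely used convention is MDC = z * sqrt(2) * SEM with z taken from the standard normal distribution at the chosen confidence level. The sketch below assumes that convention (the function name `minimal_detectable_change` is hypothetical); under this assumption the reported SEM rounds to the reported value at 95% confidence, though the authors may have computed it differently.

```python
from math import sqrt
from statistics import NormalDist


def minimal_detectable_change(sem, confidence=0.95):
    """MDC = z * sqrt(2) * SEM (assumed convention, not stated in the abstract)."""
    # Two-sided z value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    # sqrt(2) accounts for measurement error in both test and retest scores.
    return z * sqrt(2) * sem


mdc95 = minimal_detectable_change(1.41)  # SEM taken from the abstract
```

A change in an individual's score smaller than this value cannot be distinguished from measurement noise at the chosen confidence level.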
Armed Forces Institute of Pathology Becomes CDC Registered Testing Site for Human Swine Influenza
2010-01-01
naval officer and pathologist during his nearly three decades of military service. Adrien Ravizee, research associate, is pipetting cells used to...grow influenza virus to new flasks and plates, so the cells can be infected with influenza virus as Dr. Sue Cross, virologist, left and Dr...Izadjoo said. “The collection can be used for validating improved diagnostic swine flu assays.” Dr. Sue Cross, a virologist, used cell cultures
Kennedy, Carol A; Beaton, Dorcas E; Smith, Peter; Van Eerd, Dwayne; Tang, Kenneth; Inrig, Taucha; Hogg-Johnson, Sheilah; Linton, Denise; Couban, Rachel
2013-11-01
To identify and synthesize evidence for the measurement properties of the QuickDASH, a shortened version of the 30-item DASH (Disabilities of the Arm, Shoulder and Hand) instrument. This systematic review used a best evidence synthesis approach to critically appraise the measurement properties [using COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN)] of the QuickDASH and cross-cultural adaptations. A standard search strategy was conducted between 2005 (year of first publication of QuickDASH) and March 2011 in MEDLINE, EMBASE and CINAHL. The search identified 14 studies to include in the best evidence synthesis of the QuickDASH. A further 11 studies were identified on eight cross-cultural adaptation versions. Many measurement properties of the QuickDASH have been evaluated in multiple studies and across most of the measurement properties. The best evidence synthesis of the QuickDASH English version suggests that this tool is performing well with strong positive evidence for reliability and validity (hypothesis testing), and moderate positive evidence for structural validity testing. Strong negative evidence was found for responsiveness due to lower correlations with global estimates of change. Information about the measurement properties of the cross-cultural adaptation versions is still lacking, or the available information is of poor overall methodological quality.
Adaptive local linear regression with application to printer color management.
Gupta, Maya R; Garcia, Eric K; Chin, Erika
2008-06-01
Local learning methods, such as local linear regression and nearest neighbor classifiers, base estimates on nearby training samples, neighbors. Usually, the number of neighbors used in estimation is fixed to be a global "optimal" value, chosen by cross validation. This paper proposes adapting the number of neighbors used for estimation to the local geometry of the data, without need for cross validation. The term enclosing neighborhood is introduced to describe a set of neighbors whose convex hull contains the test point when possible. It is proven that enclosing neighborhoods yield bounded estimation variance under some assumptions. Three such enclosing neighborhood definitions are presented: natural neighbors, natural neighbors inclusive, and enclosing k-NN. The effectiveness of these neighborhood definitions with local linear regression is tested for estimating lookup tables for color management. Significant improvements in error metrics are shown, indicating that enclosing neighborhoods may be a promising adaptive neighborhood definition for other local learning tasks as well, depending on the density of training samples.
Lanza, Ian R.; Bhagra, Sumit; Nair, K. Sreekumaran; Port, John D.
2011-01-01
Purpose To cross-validate skeletal muscle oxidative capacity measured by 31P-MRS with in vitro measurements of oxidative capacity in mitochondria isolated from muscle biopsies of the same muscle group in 18 healthy adults. Materials and Methods Oxidative capacity in vivo was determined from PCr recovery kinetics following a 30s maximal isometric knee extension. State 3 respiration was measured in isolated mitochondria using high-resolution respirometry. A second cohort of 10 individuals underwent two 31P-MRS testing sessions to assess the test-retest reproducibility of the method. Results Overall, the in vivo and in vitro methods were well-correlated (r = 0.66–0.72) and showed good agreement by Bland Altman plots. Excellent reproducibility was observed for the PCr recovery rate constant (CV = 4.6%, ICC = 0.85) and calculated oxidative capacity (CV = 3.4%, ICC = 0.83). Conclusion These results indicate that 31P-MRS corresponds well with gold-standard in vitro measurements and is highly reproducible. PMID:22006551
Tóth, Gergely; Bodai, Zsolt; Héberger, Károly
2013-10-01
The coefficient of determination (R²) and its leave-one-out cross-validated analogue (denoted by Q² or R²cv) are the most frequently published values to characterize the predictive performance of models. In this article we use R² and Q² in a reversed aspect to determine uncommon points, i.e. influential points in any data set. The term (1 - Q²)/(1 - R²) corresponds to the ratio of the predictive residual sum of squares and the residual sum of squares. The ratio correlates with the number of influential points in experimental and random data sets. We propose an (approximate) F test on the (1 - Q²)/(1 - R²) term to quickly pre-estimate the presence of influential points in training sets of models. The test is founded upon the routinely calculated Q² and R² values and warns model builders to verify the training set, to perform influence analysis, or even to change to robust modeling.
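The relation between the two statistics can be illustrated with a minimal sketch. It assumes a simple one-predictor least-squares line and the common convention that both statistics share the same total sum of squares (so the ratio reduces to PRESS/RSS); the function names and the toy data are hypothetical, and the approximate F test proposed in the abstract is not reproduced here.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r2_q2_ratio(xs, ys):
    """Return (R^2, leave-one-out Q^2, PRESS/RSS) for a fitted line."""
    n = len(xs)
    mean_y = sum(ys) / n
    tss = sum((y - mean_y) ** 2 for y in ys)

    # Residual sum of squares from the fit on all points.
    slope, intercept = fit_line(xs, ys)
    rss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

    # Predictive residual sum of squares: refit without point i, predict point i.
    press = 0.0
    for i in range(n):
        s, c = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        press += (ys[i] - (c + s * xs[i])) ** 2

    return 1 - rss / tss, 1 - press / tss, press / rss


# Hypothetical data: a near-linear trend with one suspicious last point.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [3.1, 4.9, 7.2, 8.8, 11.1, 12.9, 15.2, 30.0]
r2, q2, ratio = r2_q2_ratio(xs, ys)
print(f"R2={r2:.3f}  Q2={q2:.3f}  (1-Q2)/(1-R2)={ratio:.3f}")
```

Under these assumptions the printed ratio equals (1 - Q²)/(1 - R²) by construction, which is the quantity the abstract proposes to test.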
Kalderstam, Jonas; Edén, Patrik; Ohlsson, Mattias
2015-01-01
We investigate a new method to place patients into risk groups in censored survival data. Properties such as median survival time and end survival rate are implicitly improved by optimizing the area under the survival curve. Artificial neural networks (ANN) are trained to either maximize or minimize this area using a genetic algorithm, and combined into an ensemble to predict one of low, intermediate, or high risk groups. Estimated patient risk can influence treatment choices, and is important for study stratification. A common approach is to sort the patients according to a prognostic index and then group them along the quartile limits. The Cox proportional hazards model (Cox) is one example of this approach. Another method of doing risk grouping is recursive partitioning (Rpart), which constructs a decision tree where each branch point maximizes the statistical separation between the groups. ANN, Cox, and Rpart are compared on five publicly available data sets with varying properties. Cross-validation, as well as separate test sets, are used to validate the models. Results on the test sets show comparable performance, except for the smallest data set where Rpart's predicted risk groups turn out to be inverted, an example of crossing survival curves. Cross-validation shows that all three models exhibit crossing of some survival curves on this small data set but that the ANN model manages the best separation of groups in terms of median survival time before such crossings. The conclusion is that optimizing the area under the survival curve is a viable approach to identify risk groups. Training ANNs to optimize this area combines two key strengths of prognostic indices and Rpart. First, a desired minimum group size can be specified, as for a prognostic index. Second, non-linear effects among the covariates can be utilized, as Rpart is also able to do.
Duruturk, Neslihan; Tonga, Eda; Gabel, Charles Philip; Acar, Manolya; Tekindal, Agah
2015-07-26
This study aims to culturally adapt a Turkish version of the Lower Limb Functional Index (LLFI) and to determine its validity, reliability, internal consistency, measurement sensitivity and factor structure in lower limb problems. The LLFI was translated into Turkish and cross-culturally adapted with a double forward-backward protocol that determined face and content validity. Individuals (n = 120) with lower limb musculoskeletal disorders completed the LLFI and Short Form-36 questionnaires and the Timed Up and Go physical test. The psychometric properties were evaluated for all participants from patient-reported outcome measures made at baseline and repeated at day 3 to determine criterion validity between scores (Pearson's r), internal consistency (Cronbach's α) and test-retest reliability (intraclass correlation coefficient, ICC2.1). Error was determined using the standard error of the measurement (SEM) and minimal detectable change at the 90% level (MDC90), while factor structure was determined using exploratory factor analysis with maximum likelihood extraction and Varimax rotation. The psychometric characteristics showed strong criterion validity (r = 0.74-0.76), high internal consistency (α = 0.82) and high test-retest reliability (ICC2.1 = 0.97). The SEM of 3.2% gave an MDC90 = 5.8%. The factor structure was uni-dimensional. The Turkish version of the LLFI was found to be valid and reliable for the measurement of lower limb function in a Turkish population. Implications for Rehabilitation Lower extremity musculoskeletal disorders are common and greatly impact activities among the affected individuals pertaining to daily living, work, leisure and quality of life. Patient-reported outcome (PRO) measures have advantages as they are practical, cost-effective and clinically convenient for use in patient-centered care. The Lower Limb Functional Index is a recently validated PRO measure shown to have strong clinimetric properties.
Kim, Myoung-Hee; Cho, Young-Shin; Uhm, Wan-Sik; Kim, Sehyun; Bae, Sang-Cheol
2005-06-01
This study aimed to determine the cross-cultural adaptation and validation of the Korean version of the EQ-5D in rheumatic conditions. Translation, back-translation and cognitive debriefing were performed according to the EuroQol group's guidelines. For validity, 508 patients were recruited and administered the EQ-5D, Short-Form 36 and condition-specific measures. Construct validity and sensitivity were evaluated by testing a-priori hypotheses. For reliability, another 57 patients repeated the EQ-5D at a 1-week interval, and intra-class correlations (ICC) and kappa statistics were estimated. For responsiveness, another 60 patients repeated it at a 12-week interval within the context of a clinical trial, and standardized response means (SRM) were calculated. The cross-cultural adaptation produced no major modifications in the scale. The associations of the EQ-5D with the generic and condition-specific measures were observed as expected in the hypotheses: the higher the EQ-5D index and EQ-5D(VAS) scores, the better the health status by generic or condition-specific measures, and the better the functional class. The ICCs were 0.751 and 0.767, respectively, and kappa ranged from 0.455 to 0.772. The SRMs were 0.649 and 0.410, respectively. The Korean EQ-5D exhibits good validity and sensitivity in various rheumatic conditions. Although its reliability and responsiveness were not excellent, it seems acceptable if condition-specific measures are applied together.
The validity and reliability of tinnitus handicap inventory Thai version.
Limviriyakul, Siriporn; Supavanich, Walop
2012-11-01
To demonstrate the reliability and validity of the Tinnitus Handicap Inventory Thai Version (THI-T), a self-report measure of tinnitus. A cross-sectional psychometric validation study was used to determine the internal consistency reliability and validity of the Tinnitus Handicap Inventory Thai Version at the Otoneurology clinic of a tertiary care center. The English version of the Tinnitus Handicap Inventory (Newman et al., 1996) was translated into Thai and cross-culturally adapted following the steps indicated by Guillemin et al. Reliability was assessed using Cronbach's coefficient alpha. Validity was analyzed by the correlation between the Tinnitus Handicap Inventory Thai Version and the 36-item short form health survey and visual analog scale using Spearman and Pearson tests. The results showed good internal consistency reliabilities of the total, functional, emotional, and catastrophic scales (α = 0.902, 0.804, 0.831 and 0.661, respectively) of the Tinnitus Handicap Inventory Thai Version. Spearman correlation showed a significant correlation of the Tinnitus Handicap Inventory with the 36-item short form health survey and visual analog scale. The Tinnitus Handicap Inventory Thai Version will be a robust tool for evaluating tinnitus patients as well as monitoring the progress of their symptoms.
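Cronbach's coefficient alpha, used above for internal consistency, can be sketched in a few lines. This is a minimal illustration of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the toy scores are hypothetical and are not data from the study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a questionnaire.

    items: one list of scores per questionnaire item; every list has one
    entry per respondent, in the same respondent order.
    """
    k = len(items)
    # Total score of each respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    sum_item_var = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - sum_item_var / variance(totals))


# Hypothetical scores: 3 items answered by 5 respondents.
scores = [
    [2, 4, 3, 5, 1],
    [3, 4, 3, 4, 2],
    [2, 5, 4, 5, 1],
]
print(round(cronbach_alpha(scores), 3))
```

Sample variances (n - 1 denominator) are used for both the items and the totals; using population variances throughout would give the same value, since the factor cancels in the ratio.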
[Validation of three screening tests used for early detection of cervical cancer].
Rodriguez-Reyes, Esperanza Rosalba; Cerda-Flores, Ricardo M; Quiñones-Pérez, Juan M; Cortés-Gutiérrez, Elva I
2008-01-01
To evaluate the validity (sensitivity, specificity, and accuracy) of three screening methods used in the early detection of cervical carcinoma against the histopathological diagnosis. A selected sample of 107 women attending the Opportune Detection of Cervicouterine Cancer Program at the Hospital de Zona 46, Instituto Mexicano del Seguro Social, in Durango during 2003 was included. The Papanicolaou test, acetic acid test, molecular detection of human papillomavirus, and histopathological diagnosis were performed in all patients at the time of the gynecological exam. Detection and typing of human papillomavirus were performed by polymerase chain reaction (PCR) and restriction fragment length polymorphism (RFLP) analysis. Histopathological diagnosis was considered the gold standard. The evaluation of validity was carried out by the Bayesian method for diagnostic tests. The positive cases for the acetic acid test, Papanicolaou, and PCR were 47, 22, and 19. The accuracy values were 0.70, 0.80 and 0.99, respectively. Since the molecular method showed greater validity in the early detection of cervical carcinoma, we consider its implementation in suitable programs of opportune detection of cervicouterine cancer in Mexico to be of vital importance. However, in order to validate this conclusion, cross-sectional studies in different regions of the country must be carried out.
Cross-Validation of the YMCA Submaximal Cycle Ergometer Test to Predict V[o.sub.2] Max
ERIC Educational Resources Information Center
Beekley, Matthew D.; Brechue, William F.; deHoyos, Diego V.; Garzarella, Linda; Werber-Zion, Galila; Pollock, Michael L.
2004-01-01
Maximal oxygen uptake (V[O.sub.2]max) is an important indicator of health-risk status, specifically for coronary heart disease (Blair et al., 1989). Direct measurement of V[O.sub.2]max is considered to be the most accurate means of determining cardiovascular fitness level. Typically, this measurement is taken using a progressive exercise test on a…
The Moral Competence Test: An Examination of Validity for Samples in the United States
ERIC Educational Resources Information Center
Biggs, Donald A.; Colesante, Robert J.
2015-01-01
The Moral Competence Test (MCT) was designed over 30 years ago to provide a resource for educators interested in conducting cross-cultural studies of moral development and education. Since its origin, it has been translated into at least 30 languages and used in hundreds of studies. However, few studies provide evidence to support the use of the…
Chen, Chia-Wei; Chu, Hsin; Tsai, Chia-Fen; Yang, Hui-Ling; Tsai, Jui-Chen; Chung, Min-Huey; Liao, Yuan-Mei; Chi, Mei-Ju; Chou, Kuei-Ru
2015-11-01
The purpose of this study was to translate the Rowland Universal Dementia Assessment Scale into Chinese and to evaluate the psychometric properties (reliability and validity) and the diagnostic properties (sensitivity, specificity and predictive values) of the Chinese version of the Rowland Universal Dementia Assessment Scale. The accurate detection of early dementia requires screening tools with favourable cross-cultural linguistic and appropriate sensitivity, specificity, and predictive values, particularly for Chinese-speaking populations. This was a cross-sectional, descriptive study. Overall, 130 participants suspected to have cognitive impairment were enrolled in the study. A test-retest for determining reliability was scheduled four weeks after the initial test. Content validity was determined by five experts, whereas construct validity was established by using contrasted group technique. The participants' clinical diagnoses were used as the standard in calculating the sensitivity, specificity, positive predictive value and negative predictive value. The study revealed that the Chinese version of the Rowland Universal Dementia Assessment Scale exhibited a test-retest reliability of 0.90, an internal consistency reliability of 0.71, an inter-rater reliability (kappa value) of 0.88 and a content validity index of 0.97. Both the patients and healthy contrast group exhibited significant differences in their cognitive ability. The optimal cut-off points for the Chinese version of the Rowland Universal Dementia Assessment Scale in the test for mild cognitive impairment and dementia were 24 and 22, respectively; moreover, for these two conditions, the sensitivities of the scale were 0.79 and 0.76, the specificities were 0.91 and 0.81, the areas under the curve were 0.85 and 0.78, the positive predictive values were 0.99 and 0.83 and the negative predictive values were 0.96 and 0.91 respectively. 
The Chinese version of the Rowland Universal Dementia Assessment Scale exhibited sound reliability, validity, sensitivity, specificity and predictive values. This scale can help clinical staff members to quickly and accurately diagnose cognitive impairment and provide appropriate treatment as early as possible. © 2015 John Wiley & Sons Ltd.
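The diagnostic properties reported above all derive from a 2x2 table of test result against reference diagnosis. The sketch below shows those standard definitions; the function name and the counts are hypothetical illustrations, since the abstract does not report the underlying table.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard diagnostic accuracy measures from a 2x2 confusion table.

    tp/fn: reference-positive cases the test flags / misses.
    fp/tn: reference-negative cases the test flags / clears.
    """
    return {
        "sensitivity": tp / (tp + fn),  # positives correctly detected
        "specificity": tn / (tn + fp),  # negatives correctly cleared
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Hypothetical counts, not taken from the study.
for name, value in diagnostic_metrics(tp=40, fp=5, fn=10, tn=45).items():
    print(f"{name}: {value:.2f}")
```

Sensitivity and specificity are properties of the test at a given cut-off, whereas the predictive values also depend on how common the condition is in the sample, so they should not be carried over to populations with a different prevalence.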
Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies
2010-01-01
Background All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. Results The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. Conclusions This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. 
The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general. PMID:20144194
Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies.
David, Maria Pamela C; Concepcion, Gisela P; Padlan, Eduardo A
2010-02-08
All polypeptide backbones have the potential to form amyloid fibrils, which are associated with a number of degenerative disorders. However, the likelihood that amyloidosis would actually occur under physiological conditions depends largely on the amino acid composition of a protein. We explore using a naive Bayesian classifier and a weighted decision tree for predicting the amyloidogenicity of immunoglobulin sequences. The average accuracy based on leave-one-out (LOO) cross validation of a Bayesian classifier generated from 143 amyloidogenic sequences is 60.84%. This is consistent with the average accuracy of 61.15% for a holdout test set comprised of 103 AM and 28 non-amyloidogenic sequences. The LOO cross validation accuracy increases to 81.08% when the training set is augmented by the holdout test set. In comparison, the average classification accuracy for the holdout test set obtained using a decision tree is 78.64%. Non-amyloidogenic sequences are predicted with average LOO cross validation accuracies between 74.05% and 77.24% using the Bayesian classifier, depending on the training set size. The accuracy for the holdout test set was 89%. For the decision tree, the non-amyloidogenic prediction accuracy is 75.00%. This exploratory study indicates that both classification methods may be promising in providing straightforward predictions on the amyloidogenicity of a sequence. Nevertheless, the number of available sequences that satisfy the premises of this study are limited, and are consequently smaller than the ideal training set size. Increasing the size of the training set clearly increases the accuracy, and the expansion of the training set to include not only more derivatives, but more alignments, would make the method more sound. The accuracy of the classifiers may also be improved when additional factors, such as structural and physico-chemical data, are considered. 
The development of this type of classifier has significant applications in evaluating engineered antibodies, and may be adapted for evaluating engineered proteins in general.
Validation of the Hebrew version of the Burn Specific Health Scale-Brief questionnaire.
Stavrou, Demetris; Haik, Josef; Wiser, Itay; Winkler, Eyal; Liran, Alon; Holloway, Samantha; Boyd, Julie; Zilinsky, Isaac; Weissman, Oren
2015-02-01
The Burn Specific Health Scale-Brief (BSHS-B) questionnaire is a suitable measurement tool for the assessment of general, physical, mental, and social health aspects of the burn survivor. To translate, culturally adapt and validate the BSHS-B to Hebrew (BSHS-H), and to investigate its psychometric properties. Eighty-six Hebrew speaking burn survivors filled out the BSHS-B and SF-36 questionnaires. Ten of them (11.63%) completed a retest. The psychometric properties of the scale were evaluated. Internal consistency, criterion validity, and construct validity were assessed using interclass correlation coefficient, Cronbach's alpha statistic, Spearman rank test, and Mann-Whitney U test respectively. BSHS-H Cronbach's alpha coefficient was 0.97. Test-retest interclass coefficients were between 0.81 and 0.98. BSHS-H was able to discriminate between facial burns, hand burns and burns >10% body surface area (p<0.05). BSHS-H and SF-36 were positively correlated (r² = 0.667, p<0.01). BSHS-H is a reliable and valid instrument for use in the Israeli burn survivor population. The translation and cross-cultural adaptation of this disease specific scale allows future comparative international studies. Copyright © 2014 Elsevier Ltd and ISBI. All rights reserved.
Learning Style Scales: a valid and reliable questionnaire.
Abdollahimohammad, Abdolghani; Ja'afar, Rogayah
2014-01-01
Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF) with oblimin rotation, a scree plot test, and parallel analysis (PA). The reliability of LSS was tested using Cronbach's α, corrected item-total correlation, and test-retest. Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach's α was >0.70 for all subscales in both study populations. The corrected item-total correlations were >0.30 for the items in each component. The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
de Abreu, Pascale M J Engel; Baldassi, Martine; Puglisi, Marina L; Befi-Lopes, Debora M
2013-04-01
In this study, the authors explored the impact of test language and cultural status on vocabulary and working memory performance in multilingual language-minority children. Twenty 7-year-old Portuguese-speaking immigrant children living in Luxembourg completed several assessments of first (L1)- and second-language (L2) vocabulary (comprehension and production), executive-loaded working memory (counting recall and backward digit recall), and verbal short-term memory (digit recall and nonword repetition). Cross-linguistic task performance was compared within individuals. The language-minority children were also compared with multilingual language-majority children from Luxembourg and Portuguese-speaking monolinguals from Brazil without an immigrant background matched on age, sex, socioeconomic status, and nonverbal reasoning. Results showed that (a) verbal working memory measures involving numerical memoranda were relatively independent of test language and cultural status; (b) language status had an impact on the repetition of high- but not on low-wordlike L2 nonwords; (c) large cross-linguistic and cross-cultural effects emerged for productive vocabulary; (d) cross-cultural effects were less pronounced for vocabulary comprehension with no differences between groups if only L1 words relevant to the home context were considered. The study indicates that linguistic and cognitive assessments for language-minority children require careful choice among measures to ensure valid results. Implications for testing culturally and linguistically diverse children are discussed.
European Portuguese adaptation and validation of dilemmas used to assess moral decision-making.
Fernandes, Carina; Gonçalves, Ana Ribeiro; Pasion, Rita; Ferreira-Santos, Fernando; Paiva, Tiago Oliveira; Melo E Castro, Joana; Barbosa, Fernando; Martins, Isabel Pavão; Marques-Teixeira, João
2018-03-01
Objective To adapt and validate a widely used set of moral dilemmas to European Portuguese, which can be applied to assess decision-making. Moreover, the classical formulation of the dilemmas was compared with a more focused moral probe. Finally, a shorter version of the moral scenarios was tested. Methods The Portuguese version of the set of moral dilemmas was tested in 53 individuals from several regions of Portugal. In a second study, an alternative way of questioning on moral dilemmas was tested in 41 participants. Finally, the shorter version of the moral dilemmas was tested in 137 individuals. Results Results evidenced no significant differences between English and Portuguese versions. Also, asking whether actions are "morally acceptable" elicited less utilitarian responses than the original question, although without reaching statistical significance. Finally, all tested versions of moral dilemmas exhibited the same pattern of responses, suggesting that the fundamental elements to the moral decision-making were preserved. Conclusions We found evidence of cross-cultural validity for moral dilemmas. However, the moral focus might affect utilitarian/deontological judgments.
NASA Astrophysics Data System (ADS)
Folkert, Michael R.; Setton, Jeremy; Apte, Aditya P.; Grkovski, Milan; Young, Robert J.; Schöder, Heiko; Thorstad, Wade L.; Lee, Nancy Y.; Deasy, Joseph O.; Oh, Jung Hun
2017-07-01
In this study, we investigate the use of imaging feature-based outcomes research (‘radiomics’) combined with machine learning techniques to develop robust predictive models for the risk of all-cause mortality (ACM), local failure (LF), and distant metastasis (DM) following definitive chemoradiation therapy (CRT). One hundred seventy-four patients with stage III-IV oropharyngeal cancer (OC) treated at our institution with CRT with retrievable pre- and post-treatment 18F-fluorodeoxyglucose positron emission tomography (FDG-PET) scans were identified. From pre-treatment PET scans, 24 representative imaging features of FDG-avid disease regions were extracted. Using machine learning-based feature selection methods, multiparameter logistic regression models were built incorporating clinical factors and imaging features. All model building methods were tested by cross validation to avoid overfitting, and final outcome models were validated on an independent dataset from a collaborating institution. Multiparameter models were statistically significant on 5-fold cross validation with the area under the receiver operating characteristic curve (AUC) = 0.65 (p = 0.004), 0.73 (p = 0.026), and 0.66 (p = 0.015) for ACM, LF, and DM, respectively. The model for LF retained significance on the independent validation cohort with AUC = 0.68 (p = 0.029), whereas the models for ACM and DM did not reach statistical significance but showed predictive power comparable to the 5-fold cross validation, with AUC = 0.60 (p = 0.092) and 0.65 (p = 0.062), respectively. In the largest study of its kind to date, predictive features including increasing metabolic tumor volume, increasing image heterogeneity, and increasing tumor surface irregularity significantly correlated with mortality, LF, and DM on 5-fold cross validation in a relatively uniform single-institution cohort. The LF model also retained significance in an independent population.
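The evaluation scheme described above (k-fold cross validation scored by AUC) can be sketched as follows. This is only an illustration under loud assumptions: the study's multiparameter logistic regression is replaced by a toy one-feature scorer whose direction is learned on the training folds, out-of-fold scores are pooled before computing a single AUC (the abstract does not say whether AUC was pooled or averaged per fold), and all names and data are hypothetical.

```python
import random


def auc(scores, labels):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def kfold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and deal them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_validated_auc(x, y, k=5, seed=0):
    """Pooled out-of-fold AUC for a toy one-feature scorer.

    Assumes every training split contains both classes.
    """
    scores = [0.0] * len(x)
    for test in kfold_indices(len(x), k, seed):
        held_out = set(test)
        train = [i for i in range(len(x)) if i not in held_out]
        # "Training": learn only whether larger x points towards class 1.
        pos = [x[i] for i in train if y[i] == 1]
        neg = [x[i] for i in train if y[i] == 0]
        sign = 1.0 if sum(pos) / len(pos) >= sum(neg) / len(neg) else -1.0
        for i in test:
            scores[i] = sign * x[i]  # score held-out samples only
    return auc(scores, y)


# Hypothetical feature values for 10 negative and 10 positive cases.
x = [0.2, 0.5, 0.1, 0.9, 0.4, 0.3, 0.6, 0.2, 0.7, 0.35,
     0.8, 0.55, 1.2, 0.95, 0.65, 1.1, 0.45, 0.85, 1.3, 0.75]
y = [0] * 10 + [1] * 10
print(round(cross_validated_auc(x, y), 3))
```

The point of the structure is that each sample is scored only by a model that never saw it during fitting, which is what guards the reported AUC against overfitting.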
Efficient strategies for leave-one-out cross validation for genomic best linear unbiased prediction.
Cheng, Hao; Garrick, Dorian J; Fernando, Rohan L
2017-01-01
A random multiple-regression model that simultaneously fits all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Prediction using whole-genome data. Leave-one-out cross validation can be used to quantify the predictive ability of a statistical model. Naive application of leave-one-out cross validation is computationally intensive because the training and validation analyses need to be repeated n times, once for each observation. Efficient leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis. The efficient strategies are 786 times faster than the naive application for a simulated dataset with 1,000 observations and 10,000 markers and 99 times faster with 1,000 observations and 100 markers. These efficiencies relative to the naive approach using the same model will increase with increases in the number of observations.
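Why a single analysis can suffice is easiest to see in a toy case. The sketch below is not the authors' algorithm: it assumes a one-marker ridge regression without intercept and with a fixed, hypothetical shrinkage parameter `lam` (standing in for the variance ratio of a random-effects model), and compares n explicit refits against the well-known identity that the leave-one-out residual equals the full-data residual divided by (1 - h_ii), where h_ii is the leverage of observation i.

```python
def ridge_loo_naive(x, y, lam):
    """Leave-one-out residuals by refitting the model n times."""
    residuals = []
    for i in range(len(x)):
        sxx = sum(x[j] * x[j] for j in range(len(x)) if j != i)
        sxy = sum(x[j] * y[j] for j in range(len(x)) if j != i)
        beta = sxy / (sxx + lam)          # ridge estimate without observation i
        residuals.append(y[i] - x[i] * beta)
    return residuals


def ridge_loo_shortcut(x, y, lam):
    """Same residuals from one fit: e_i / (1 - h_ii)."""
    d = sum(v * v for v in x) + lam
    beta = sum(a * b for a, b in zip(x, y)) / d   # single full-data fit
    return [(yi - xi * beta) / (1 - xi * xi / d) for xi, yi in zip(x, y)]


# Hypothetical centred marker codes and phenotypes.
x = [-1.0, 0.0, 1.0, 1.0, -1.0, 0.0, 1.0, -1.0]
y = [-0.8, 0.1, 1.3, 0.7, -1.1, 0.2, 0.9, -0.4]
naive = ridge_loo_naive(x, y, lam=2.0)
fast = ridge_loo_shortcut(x, y, lam=2.0)
print(max(abs(a - b) for a, b in zip(naive, fast)))
```

In this toy setting the two routes agree to floating-point precision, while the shortcut needs one fit instead of n; the identity holds only when `lam` is kept fixed across the left-out fits.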
Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi
2014-01-01
The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
Detection of Malingered Mental Retardation
ERIC Educational Resources Information Center
Shandera, Anne L.; Berry, David T. R.; Clark, Jessica A.; Schipper, Lindsey J.; Graue, Lili O.; Harp, Jordan P.
2010-01-01
In a cross-validation of results from L. O. Graue et al. (2007), standard psychological assessment instruments, as well as tests of neurocognitive and psychiatric feigning, were administered under standard instructions to 24 participants diagnosed with mild mental retardation (MR) and 10 demographically matched community volunteers (CVH). A 2nd…
Cross-cultural Adaptation of the Oral Anticoagulation Knowledge Test to the Brazilian Portuguese.
Praxedes, Marcus Fernando da Silva; Abreu, Mauro Henrique Nogueira Guimarães; Ribeiro, Daniel Dias; Marcolino, Milena Soriano; Paiva, Saul Martins de; Martins, Maria Auxiliadora Parreiras
2017-05-01
Patients' knowledge about oral anticoagulant therapy may favor the achievement of therapeutic results and the prevention of adverse pharmacotherapy-related events. Brazil lacks validated instruments for assessing the patient's knowledge about treatment with warfarin. This study aimed to perform the cross-cultural adaptation of the Oral Anticoagulation Knowledge (OAK) Test instrument from English into Portuguese. This is a methodological study developed in an anticoagulation clinic of a public university hospital. The study included initial translation, synthesis of translations, back-translation, review by the experts committee and pre-testing with 30 individuals. We obtained semantic equivalence through the analysis of the referential and general meaning of each item. The conceptual equivalence of the items sought to demonstrate the relevance and acceptability of the instrument. The process of cross-cultural adaptation produced the final version of the OAK Test in Brazilian Portuguese entitled "Teste de Conhecimento sobre Anticoagulação Oral". There was a suitable semantic and conceptual equivalence between the adapted version and the original version, as well as an excellent acceptability of this instrument.
Ankrapp, David; Schaus, Benjamin; Clements, Lauren; Klein, Frank; Rice, Jennifer; Rejman, John
2018-05-09
A validation study was conducted for an immunochromatographic method (BetaStar® Advanced for Tetracyclines) for detection of tetracycline antibiotic residues in raw, commingled bovine milk. The assay was demonstrated to detect tetracycline, chlortetracycline, and oxytetracycline at levels below the FDA tolerance levels but above the maximum sensitivity thresholds established by the National Conference on Interstate Milk Shipments. Results of internal and independent laboratory dose-response studies employing spiked samples were in agreement. All three drugs at the approximate 90/95% sensitivity levels were detected in milk collected from cows that had been treated with the specific drug. Selectivity of the assay was 100%, as no false-positive results were obtained in testing 881 control milk samples. Testing at the estimated 90/95% sensitivity level for tetracycline (213 ppb), chlortetracycline (272 ppb), and oxytetracycline (180 ppb) and at 1000 ppb for each antibiotic resulted in 100% positive tests for each tetracycline. Results of ruggedness experiments established the operating parameter tolerances for the test. Results of cross-reactivity testing established that the assay detects certain other tetracycline drugs but does not cross-react with any of 32 drugs belonging to seven different drug classes. Abnormally high bacterial or somatic cell counts (SCC) in raw milk produced no assay interference.
Murumkar, Prashant R; Giridhar, Rajani; Yadav, Mange Ram
2008-04-01
A set of 29 benzothiadiazepine hydroxamates having selective tumor necrosis factor-alpha converting enzyme inhibitory activity were used to compare the quality and predictive power of 3D-quantitative structure-activity relationship, comparative molecular field analysis, and comparative molecular similarity indices models for the atom-based, centroid/atom-based, data-based, and docked conformer-based alignment. Removal of two outliers from the initial training set of molecules improved the predictivity of models. Among the 3D-quantitative structure-activity relationship models developed using the above four alignments, the database alignment provided the optimal predictive comparative molecular field analysis model for the training set with cross-validated r(2) (q(2)) = 0.510, non-cross-validated r(2) = 0.972, standard error of estimates (s) = 0.098, and F = 215.44 and the optimal comparative molecular similarity indices model with cross-validated r(2) (q(2)) = 0.556, non-cross-validated r(2) = 0.946, standard error of estimates (s) = 0.163, and F = 99.785. These models also showed the best test set prediction for six compounds with predictive r(2) values of 0.460 and 0.535, respectively. The contour maps obtained from 3D-quantitative structure-activity relationship studies were appraised for activity trends for the molecules analyzed. The comparative molecular similarity indices models exhibited good external predictivity as compared with that of comparative molecular field analysis models. The data generated from the present study helped us to further design and report some novel and potent tumor necrosis factor-alpha converting enzyme inhibitors.
Merolla, Giovanni; Corona, Katia; Zanoli, Gustavo; Cerciello, Simone; Giannotti, Stefano; Porcellini, Giuseppe
2017-12-01
The Kerlan-Jobe Orthopaedic Clinic (KJOC) Shoulder and Elbow score is a reliable and sensitive tool to measure the performance of overhead athletes. The purpose of this study was to carry out a cross-cultural adaptation and validation of the KJOC questionnaire in Italian and to assess its reliability, validity, and responsiveness. Ninety professional athletes with a painful shoulder were included in this study and were assigned to the "injury group" (n = 32) or the "overuse group" (n = 58); 65 were managed conservatively and 25 were treated by arthroscopic surgery. To assess the reliability of the KJOC score, patients were asked to fill in the questionnaire at baseline and after 2 weeks. To test the construct validity, KJOC scores were compared to those obtained with the Italian version of the Disabilities of the Arm, Shoulder, and Hand (DASH) scale, and with the DASH sports/performing arts module. To test KJOC score responsiveness, the follow-up KJOC scores of the participants treated conservatively were compared to those of the patients treated by arthroscopic surgery. Statistical analysis demonstrated that the KJOC questionnaire is reliable in terms of the single items and the overall score (ICC 0.95-0.99); that it has high construct validity (rs = -0.697; p < 0.01); and that it is responsive to clinical differences in shoulder function (p < 0.0001). The Italian version of the KJOC Shoulder and Elbow score performed in a similar way to the English version and demonstrated good validity, reliability, and responsiveness after conservative and surgical treatment. II.
Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina
The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56
Cross-Cultural Adaptation and Validation of the Italian Version of SWAL-QOL.
Ginocchio, Daniela; Alfonsi, Enrico; Mozzanica, Francesco; Accornero, Anna Rosa; Bergonzoni, Antonella; Chiarello, Giulia; De Luca, Nicoletta; Farneti, Daniele; Marilia, Simonelli; Calcagno, Paola; Turroni, Valentina; Schindler, Antonio
2016-10-01
The aim of the study was to evaluate the reliability and validity of the Italian SWAL-QOL (I-SWAL-QOL). The study consisted of five phases: item generation, reliability analysis, normative data generation, validity analysis, and responsiveness analysis. The item generation phase followed the five-step, cross-cultural, adaptation process of translation and back-translation. A group of 92 dysphagic patients was enrolled for the internal consistency analysis. Seventy-eight patients completed the I-SWAL-QOL twice, 2 weeks apart, for test-retest reliability analysis. A group of 200 asymptomatic subjects completed the I-SWAL-QOL for normative data generation. I-SWAL-QOL scores obtained by both the group of dysphagic subjects and asymptomatic ones were compared for validity analysis. I-SWAL-QOL scores were correlated with SF-36 scores in 67 patients with dysphagia for concurrent validity analysis. Finally, I-SWAL-QOL scores obtained in a group of 30 dysphagic patients before and after successful rehabilitation treatment were compared for responsiveness analysis. All the enrolled patients managed to complete the I-SWAL-QOL without needing any assistance, within 20 min. Internal consistency was acceptable for all I-SWAL-QOL subscales (α > 0.70). Test-retest reliability was also satisfactory for all subscales (ICC > 0.7). A significant difference between the dysphagic group and the control group was found in all I-SWAL-QOL subscales (p < 0.05). Mild to moderate correlations between I-SWAL-QOL and SF-36 subscales were observed. I-SWAL-QOL scores obtained in the pre-treatment condition were significantly lower than those obtained after swallowing rehabilitation. I-SWAL-QOL is reliable, valid, responsive to changes in QOL, and recommended for clinical practice and outcome research.
Abma, Femke I; van der Klink, Jac J L; Bültmann, Ute
2013-03-01
The promotion of a sustainable, healthy and productive working life attracts more and more attention. Recently the Work Role Functioning Questionnaire (WRFQ) has been cross-culturally translated and adapted to Dutch. This questionnaire aims to measure the health-related work functioning of workers with health problems. The aim of this study is to evaluate the reliability, validity (including five new items) and responsiveness of the WRFQ 2.0 in the working population. A longitudinal study was conducted among workers. The reliability (internal consistency, test-retest reliability, measurement error), validity (structural validity by factor analysis, construct validity by means of hypothesis testing) and responsiveness of the WRFQ 2.0 were evaluated. A total of N = 553 workers completed the survey. The final WRFQ 2.0 has four subscales and showed very good internal consistency, moderate test-retest reliability, good construct validity and moderate responsiveness in the working population. The WRFQ was able to distinguish between groups with different levels of mental health, physical health, fatigue and need for recovery. A moderate correlation was found between the WRFQ and the related constructs of work ability and work productivity. A weak relationship was found with general self-rated health, work engagement and work involvement. The WRFQ 2.0 is a reliable and valid instrument to measure health-related work functioning in the working population. Further validation in larger samples is recommended, especially for test-retest reliability, responsiveness and the questionnaire's ability to predict the future course of health-related work functioning.
CrossTalk: The Journal of Defense Software Engineering. Volume 26, Number 6, November/December 2013
2013-12-01
requirements during sprint planning. Automated scanning, which includes automated code-review tools, allows the expert to monitor the system... sprint. This enables the validator to leverage the test results for formal validation and verification, and perform a shortened “hybrid” style of IV&V... per SPRINT (1-4 weeks) 1 week 1 Month Up to four months; Deliverable product to user; Security posture assessed; Accredited to field/operate
Aziz, M M; Galal, M A A; Elzohri, M H; El-Nouby, F; Leong, K P
2018-04-01
Systemic lupus erythematosus (SLE) is a chronic autoimmune disease which affects all aspects of quality of life (QoL) of the patients. Comprehensive patient assessment should include QoL measures in addition to the objective clinical measures of the disease. There is no specific Arabic instrument for assessment of QoL of SLE patients. The objective of this study was to translate and cross culturally adapt the SLEQOL questionnaire into Arabic and test its reliability and validity. The SLEQOL questionnaire was translated into Arabic based on the Guidelines for Translation and Cross-cultural Adaptation into other languages. Reliability was assessed by interviewing patients three times: two interviews on the same day by different interviewers and the third interview 14 days later by one of the first interviewers. Validity was assessed by correlating SLEQOL scores of 91 patients with 36-item Short Form Health Survey (SF-36) scores and clinical parameters of the patients. We found that the Arabic version of SLEQOL has a Cronbach's alpha of 0.936, interobserver and intraobserver correlation coefficients of 0.809 and 0.886 respectively. Strong correlations were also found between SLEQOL scores and SF-36 Physical and Mental Component summaries. In conclusion, the Arabic version of SLEQOL is a reliable and valid instrument for measuring QoL of Egyptian SLE patients.
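The internal-consistency figure quoted above (Cronbach's alpha) can be illustrated with a minimal sketch. It uses the standard textbook formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the toy score matrix are illustrative assumptions and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of respondents, each a list of item scores of equal length.
    (Hypothetical helper for illustration; not the study's own code.)
    """
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1 - sum(item_vars) / total_var)


# Toy data: three respondents answering three perfectly consistent items.
toy_scores = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
print(cronbach_alpha(toy_scores))
```

Values close to 1 indicate that the items vary together; the conventional acceptability threshold cited in several abstracts in this listing is alpha > 0.70.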
[Measurement properties of self-report questionnaires published in Korean nursing journals].
Lee, Eun-Hyun; Kim, Chun-Ja; Kim, Eun Jung; Chae, Hyun-Ju; Cho, Soo-Yeon
2013-02-01
The purpose of this study was to evaluate measurement properties of self-report questionnaires for studies published in Korean nursing journals. Of 424 Korean nursing articles initially identified, 168 articles met the inclusion criteria. The methodological quality of the measurements used in the studies and interpretability were assessed using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. It consists of items on internal consistency, reliability, measurement error, content validity, construct validity including structural validity, hypothesis testing, cross-cultural validity, and criterion validity, and responsiveness. For each item of the COSMIN checklist, measurement properties are rated on a four-point scale: excellent, good, fair, and poor. Each measurement property is scored with worst score counts. All articles used the classical test theory for measurement properties. Internal consistency (72.6%), construct validity (56.5%), and content validity (38.2%) were most frequently reported properties being rated as 'excellent' by COSMIN checklist, whereas other measurement properties were rarely reported. A systematic review of measurement properties including interpretability of most instruments warrants further research and nursing-focused checklists assessing measurement properties should be developed to facilitate intervention outcomes across Korean studies.
Vascellari, Alberto; Schiavetti, Stefano; Rebuzzi, Enrico; Coletti, Nicolò
2015-11-01
The Nottingham Clavicle Score (NCS) is a specific Patient Reported Outcome Measure of injuries to the clavicle, acromio-clavicular joint (ACJ) and sterno-clavicular joint. The purpose of this study was to translate the NCS into Italian and establish its cultural adaptiveness and validity. The original version of the NCS was translated into Italian in accordance with the cross-cultural adaptation guidelines described by Guillemin. Sixty-six patients [average age 45.7 years (SD 11.3)] who had received surgical treatment for injuries of the ACJ and the clavicle were included in the study. The study population completed the NCS twice within 5 days, the Oxford Shoulder Score (OSS), the Disability of the Arm, Shoulder and Hand (DASH) questionnaire and the short-form 36 (SF-36). Statistical tests assessed the construct validity, discriminant validity, internal consistency, reliability and feasibility of the NCS. The translation and adaptation of the NCS for an Italian context required no major cultural adaptation. Internal consistency was high (Cronbach's α, 0.86). Test-retest reproducibility was excellent (ρ = 0.981, p < 0.00001). Administration time was 45 s (range 1 min 32 s-8 min), and all items were answered. The Italian NCS showed strong correlation with the DASH (-0.87), the OSS (-0.84) and those subscales of the SF-36 (physical functioning, role physical and bodily pain) which aim to measure similar constructs. The Italian NCS scale is a reliable, valid, consistent shoulder assessment form that can be used to assess the functional limitations of patients with injuries of clavicle or ACJ. III.
The Reliability and Validity of Measures of Gait Variability in Community-Dwelling Older Adults
Brach, Jennifer S.; Perera, Subashan; Studenski, Stephanie; Newman, Anne B.
2009-01-01
Objective: To examine the test-retest reliability and concurrent validity of variability of gait characteristics. Design: Cross-sectional study. Setting: Research laboratory. Participants: Older adults (N=558) from the Cardiovascular Health Study. Interventions: Not applicable. Main Outcome Measures: Gait characteristics were measured using a 4-m computerized walkway. SDs determined from the steps recorded were used as the measures of variability. Intraclass correlation coefficients (ICC) were calculated to examine test-retest reliability of a 4-m walk and two 4-m walks. To establish concurrent validity, the measures of gait variability were compared across levels of health, functional status, and physical activity using independent t tests and analyses of variance. Results: Gait variability measures from the two 4-m walks demonstrated greater test-retest reliability than those from the single 4-m walk (ICC=.40–.63 and ICC=.22–.48, respectively). Greater step length and stance time variability were associated with poorer health, functional status and physical activity (P<.05). Conclusions: Gait variability calculated from a limited number of steps has fair to good test-retest reliability and concurrent validity. Reliability of gait variability calculated from a greater number of steps should be assessed to determine if the consistency can be improved. PMID:19061741
Statistical Anomalies of Bitflips in SRAMs to Discriminate SBUs From MCUs
NASA Astrophysics Data System (ADS)
Clemente, Juan Antonio; Franco, Francisco J.; Villa, Francesca; Baylac, Maud; Rey, Solenne; Mecha, Hortensia; Agapito, Juan A.; Puchner, Helmut; Hubert, Guillaume; Velazco, Raoul
2016-08-01
Recently, the occurrence of multiple events in static tests has been investigated by checking the statistical distribution of the difference between the addresses of the words containing bitflips. That method has been successfully applied to Field Programmable Gate Arrays (FPGAs) and the original authors indicate that it is also valid for SRAMs. This paper presents a modified methodology that is based on checking the XORed addresses with bitflips, rather than on the difference. Irradiation tests on CMOS 130 & 90 nm SRAMs with 14-MeV neutrons have been performed to validate this methodology. Results in high-altitude environments are also presented and cross-checked with theoretical predictions. In addition, this methodology has also been used to detect modifications in the organization of said memories. Theoretical predictions have been validated with actual data provided by the manufacturer.
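The idea of examining XORed addresses rather than address differences can be sketched roughly as follows. This is only an illustration under stated assumptions: it XORs every pair of word addresses that contain bitflips and reports XOR values that recur, which would be candidates for further inspection. The function names, the `min_count` threshold, and the example addresses are hypothetical; the paper's actual statistical criterion is not given in the abstract.

```python
from collections import Counter
from itertools import combinations


def xor_histogram(addresses):
    """Count how often each XOR value occurs over all pairs of addresses."""
    return Counter(a ^ b for a, b in combinations(addresses, 2))


def repeated_xor_values(addresses, min_count=2):
    """Return XOR values seen at least min_count times (hypothetical threshold)."""
    hist = xor_histogram(addresses)
    return {value: n for value, n in hist.items() if n >= min_count}


# Made-up word addresses containing bitflips after one test round.
flipped = [0x10, 0x11, 0x40, 0x41, 0x80]
print(repeated_xor_values(flipped))
```

In a purely random population of single-bit upsets, any given XOR value should rarely repeat, so values that recur are a natural place to look for physically adjacent cells; how "rarely" is quantified is left open here.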
Au, Raymond Wing Cheong; Tam, Peter Wai Chung; Tam, Gladys Wai Chi; Ungvari, Gabor Sander
2005-01-01
The study validated a culturally sensitive community living skills rating scale for Chinese patients by adapting the St. Louis Inventory of Community Living Skills (SLICLS). The Chinese version (SLICLS-C) was produced by forward and backward translation. An expert panel evaluated its content validity. Its internal consistency, inter-rater reliability, construct and concurrent validity were tested on 80 DSM-IV schizophrenia inpatients in a long-term facility. For predictive validity, the above sample was extended to ensure at least 20 subjects discharged to each of three levels of community care were included in the study sample. The SLICLS-C was psychometrically sound and could be used for predicting level of community care, program evaluation and measuring outcome.
Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza
2015-01-01
We aimed to validate a cross-culturally adapted Persian version of the Michigan Hand Outcomes Questionnaire (MHOQ). We followed Beaton's guideline to translate the questionnaire into Persian. We administered the final version to 223 patients, of whom 79 returned 3 days later to complete the Persian MHOQ a second time. At the first visit, respondents also completed the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire and rated their pain on the Visual Analogue Scale (VAS). Cronbach's alpha for the total MHOQ was 0.79, which showed good internal consistency. The intraclass correlation coefficient (ICC) for the total MHOQ was 0.84, which demonstrated good test-retest reliability. The absolute correlation coefficient between the total MHOQ and the DASH was as high as 0.74. The Persian version of the MHOQ proved to be a reliable and valid instrument for use in the Persian population with hand and wrist disorders.
3D-QSAR and molecular docking studies on HIV protease inhibitors
NASA Astrophysics Data System (ADS)
Tong, Jianbo; Wu, Yingji; Bai, Min; Zhan, Pei
2017-02-01
To better understand the chemical-biological interactions governing their activity toward HIV protease, QSAR models of 34 cyclic-urea derivatives with HIV-inhibitory activity were developed. The quantitative structure-activity relationship (QSAR) model was built using the comparative molecular similarity indices analysis (CoMSIA) technique. The best CoMSIA model has a cross-validated r² (r²cv) of 0.586 and a non-cross-validated r² (r²ncv) of 0.931. The predictive ability of the CoMSIA model was further validated with a test set of 7 compounds, giving a predictive r² (r²pred) of 0.973. Docking studies were used to find the actual conformations of the compounds in the active site of HIV protease, as well as their binding mode in the binding site of the enzyme. The information provided by the 3D-QSAR model and molecular docking may lead to a better understanding of the structural requirements of the 34 cyclic-urea derivatives and help in designing potential anti-HIV protease molecules.
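The distinction between the cross-validated and non-cross-validated r² values reported above can be illustrated with a small sketch. It assumes leave-one-out cross-validation and the common definition q² = 1 - PRESS / SS_total (with SS_total taken about the overall mean), and it swaps in an ordinary one-descriptor least-squares line in place of the PLS model used in CoMSIA. All names and data below are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def total_sum_of_squares(ys):
    mean_y = sum(ys) / len(ys)
    return sum((y - mean_y) ** 2 for y in ys)


def r2_non_cross_validated(xs, ys):
    """Fit on all data and score on the same data."""
    slope, intercept = fit_line(xs, ys)
    sse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return 1 - sse / total_sum_of_squares(ys)


def q2_leave_one_out(xs, ys):
    """Refit with each sample held out, then score the held-out prediction."""
    press = 0.0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        press += (ys[i] - (slope * xs[i] + intercept)) ** 2
    return 1 - press / total_sum_of_squares(ys)


# Made-up descriptor values and activities.
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0]
activity = [1.1, 1.9, 3.2, 3.9, 5.1]
print(r2_non_cross_validated(descriptor, activity), q2_leave_one_out(descriptor, activity))
```

Because every held-out prediction comes from a model that never saw that sample, q² is typically lower than the fitted r², which is why both numbers are reported side by side.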
Testing the Construct Validity of the Gambling Functional Assessment-Revised
ERIC Educational Resources Information Center
Weatherly, Jeffrey N.; Miller, Joseph C.; Terrell, Heather K.
2011-01-01
An attempt was made to modify the Gambling Functional Assessment (GFA), which was proposed to identify four possible contingencies maintaining the respondent's gambling behavior. However, previous research found that it only identified two contingencies (i.e., positive vs. negative reinforcement), with some items cross-loading on both…
Investigation of Truncated Waveguides
NASA Technical Reports Server (NTRS)
Lourie, Nathan P.; Chuss, David T.; Henry, Ross M.; Wollack, Edward J.
2013-01-01
The design, fabrication, and performance of truncated circular and square waveguide cross-sections are presented. An emphasis is placed upon numerical and experimental validation of simple analytical formulae that describe the propagation properties of these structures. A test component, a 90-degree phase shifter, was fabricated and tested at 30 GHz. The concepts explored can be directly applied in the design, synthesis and optimization of components in the microwave to sub-millimeter wavebands.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Strigun, Alexander; Wahrheit, Judith; Beckers, Simone
Along with hepatotoxicity, cardiotoxic side effects remain one of the major reasons for drug withdrawals and boxed warnings. Prediction methods for cardiotoxicity are insufficient. High-content screening comprising not only electrophysiological characterization but also cellular molecular alterations is expected to improve the cardiotoxicity prediction potential. Metabolomic approaches have recently become an important focus of research in pharmacological testing and prediction. In this study, the culture medium supernatants from HL-1 cardiomyocytes after exposure to drugs from different classes (analgesics, antimetabolites, anthracyclines, antihistamines, channel blockers) were analyzed to determine specific metabolic footprints in response to the tested drugs. Since most drugs influence energy metabolism in cardiac cells, the metabolite 'sub-profile' consisting of glucose, lactate, pyruvate and amino acids was considered. These metabolites were quantified using HPLC in samples after exposure of cells to test compounds of the respective drug groups. The studied drug concentrations were selected from concentration response curves for each drug. The metabolite profiles were randomly split into training/validation and test sets, and then analysed using multivariate statistics (principal component analysis and discriminant analysis). Discriminant analysis resulted in clustering of drugs according to their modes of action. After cross validation and cross model validation, the underlying training data were able to predict 50%-80% of conditions to the correct classification group. We show that HPLC based characterisation of known cell culture medium components is sufficient to predict a drug's potential classification according to its mode of action.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese.
Shinga-Ishihara, Chikako; Nakai, Yukie; Milgrom, Peter; Murakami, Kaori; Matsumoto-Nakano, Michiyo
2014-01-02
Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman's correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency was low to acceptable (Cronbach's alpha = 0.67 for the total scale, 0.46-0.61 for each subscale).
Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4 ± 14.1, and 40.6 ± 11.3 for the first and second administrations in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). These results provide the preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire.
Classification of echolocation clicks from odontocetes in the Southern California Bight.
Roch, Marie A; Klinck, Holger; Baumann-Pickering, Simone; Mellinger, David K; Qui, Simon; Soldevilla, Melissa S; Hildebrand, John A
2011-01-01
This study presents a system for classifying echolocation clicks of six species of odontocetes in the Southern California Bight: Visually confirmed bottlenose dolphins, short- and long-beaked common dolphins, Pacific white-sided dolphins, Risso's dolphins, and presumed Cuvier's beaked whales. Echolocation clicks are represented by cepstral feature vectors that are classified by Gaussian mixture models. A randomized cross-validation experiment is designed to provide conditions similar to those found in a field-deployed system. To prevent matched conditions from inappropriately lowering the error rate, echolocation clicks associated with a single sighting are never split across the training and test data. Sightings are randomly permuted before assignment to folds in the experiment. This allows different combinations of the training and test data to be used while keeping data from each sighting entirely in the training or test set. The system achieves a mean error rate of 22% across 100 randomized three-fold cross-validation experiments. Four of the six species had mean error rates lower than the overall mean, with the presumed Cuvier's beaked whale clicks showing the best performance (<2% error rate). Long-beaked common and bottlenose dolphins proved the most difficult to classify, with mean error rates of 53% and 68%, respectively.
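The fold-assignment rule described above, where sightings are randomly permuted and all clicks from one sighting stay on the same side of the train/test split, can be sketched as below. This is a minimal illustration assuming each click carries a sighting identifier; the function name, the round-robin assignment of permuted sightings to folds, and the example labels are assumptions, not the authors' implementation.

```python
import random


def sighting_folds(sighting_ids, n_folds=3, seed=0):
    """Build (train_indices, test_indices) pairs that never split a sighting.

    sighting_ids: one sighting label per click, in click order.
    """
    unique = sorted(set(sighting_ids))
    rng = random.Random(seed)
    rng.shuffle(unique)  # random permutation of sightings
    # Deal the permuted sightings out to folds round-robin.
    fold_of = {sighting: i % n_folds for i, sighting in enumerate(unique)}
    splits = []
    for k in range(n_folds):
        test = [i for i, s in enumerate(sighting_ids) if fold_of[s] == k]
        train = [i for i, s in enumerate(sighting_ids) if fold_of[s] != k]
        splits.append((train, test))
    return splits


# Nine clicks drawn from six made-up sightings.
clicks = ["a", "a", "b", "c", "c", "c", "d", "e", "f"]
for train, test in sighting_folds(clicks, n_folds=3, seed=1):
    print(train, test)
```

Repeating the procedure with different seeds gives different combinations of training and test sightings, in the spirit of the 100 randomized three-fold experiments reported in the abstract.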
Feldt, Taru; Rantanen, Johanna; Hyvönen, Katriina; Mäkikangas, Anne; Huhtala, Mari; Pihlajasaari, Pia; Kinnunen, Ulla
2014-01-01
The present study tested the factorial validity of the 9-item Bergen Burnout Inventory (BBI-9). The BBI-9 is comprised of three core dimensions: (1) exhaustion at work; (2) cynicism toward the meaning of work; and (3) sense of inadequacy at work. The study further investigated whether the three-factor structure of the BBI-9 remains the same across different organizations (group invariance) and measurement time points (time invariance). The factorial group invariance was tested using a cross-sectional design with data pertaining to managers (n=742), and employees working in a bank (n=162), an engineering office (n=236), a public sector organization divided into three service areas: administration (n=102), education and culture (n=581), and social affairs and health (n=1,505). Factorial time invariance was tested using longitudinal data pertaining to managers, with three measurements over a four-year follow-up period. The confirmatory factor analysis revealed that the three-factor structure of the BBI-9 was invariant across cross-sectional samples. The factorial invariance was also supported across measurement times. To conclude, the factorial structure of the BBI-9 was found to remain the same regardless of the sample properties and measurement times.
Development and psychometric properties of the Ethics Environment Questionnaire.
McDaniel, C
1997-09-01
The author reports on the development and the psychometric properties of the Ethics Environment Questionnaire (EEQ), an instrument by which to measure the opinions of health-care providers about ethics in their clinical practice organizations. The EEQ was developed to increase the number of valid and reliable measures pertaining to ethics in health-care delivery. The EEQ is a 20-item self-administered questionnaire using a Likert-type 5-point format, offering ease of administration. It is applicable to a cross-section of health-care practitioners and health-care facilities. The mean administration time is 10 minutes. The EEQ represents testing on 450 respondents in acute care settings among a cross-section of acute care facilities. Internal consistency reliability using Cronbach's alpha coefficient is 0.93, and the test-retest reliability is 0.88. Construct, content, and criterion validity are established. The scale is unidimensional, with factor loadings exceeding the minimum preset criterion. Mean score is 3.1 out of 5.0, with scores of 3.5 and above interpreted as reflective of a positive ethics environment. The EEQ provides a measure of ethics in health-care organizations among multi-practitioners in clinical practice on a valid, reliable, cost effective, and easily administered instrument that requires minimum investment of personnel time.
Jeon, Ki-Yeob
2011-01-01
It is well known that countries with well-structured primary care have better health outcomes, better health equity and reduced healthcare costs. This study aimed to culturally modify and validate the US consumer form of the short Primary Care Assessment Tool (PCAT) in primary care in the Republic of Korea (hereafter referred to as Korea). The Korean consumer form of the short PCAT (KC PCAT) was cross-culturally modified from the original version using a standardised transcultural adaptation method. A pre-test version of the KC PCAT was formulated by replacement of four items and modification of a further four items from the 37 items of the original consumer form of the short PCAT at face value evaluation meetings. Pilot testing was done with a convenience sample of 15 responders at two different sites. Test-retest showed high reliability. To validate the KC PCAT, 606 clients participated in a survey carried out in Korea between February and May 2006. Internal consistency reliability, test-retest reliability and factor analysis were conducted in order to test validity. Psychometric testing was carried out on 37 items of the KC PCAT to make the KS PCAT which has 30 items and has seven principal domains: first contact utilisation, first contact accessibility, ongoing accountable care (ongoing care and coordinated rapport care), integrated care (patient-centred care with integration between primary and specialty care or between different specialties), comprehensive care, community-oriented care and culturally-oriented care. Component factors of the verified KS PCAT explained 58.28% of the total variance in the total item scores of primary care. The verified KS PCAT has been characterised by the seven classic domains of primary care with minor modifications. This may provide clues concerning differences in expectations for primary care in the Korean population as compared with that of the US. 
The KS PCAT is a reliable and valid tool for the evaluation of the quality of primary care in Korea. It will be used to identify any aspects of primary care linked to better or worse health outcomes, and to provide evidence-based evaluations of or recommendations for Korean healthcare policy. Keywords: cross-cultural adaptation, Korean Standard Primary Care Assessment Tool, Primary Care Assessment Tool, quality of primary care.
Kim, Kwang-iel; Lee, Haewon; Choi, Joonho; Park, Yong-Chon
2005-01-01
The Illness Intrusiveness Rating Scale (IIRS) measures illness-induced disruptions to 13 domains of lifestyles, activities, and interests. A stable three-factor structure has been well documented; however, the cross-cultural validity of this scale needs to be tested. This study investigated the factor structure of the Korean version of the IIRS in 712 outpatients at a university medical center. The predominant diagnosis was rheumatoid arthritis (47%). The Center for Epidemiological Studies-Depression Scale (CES-D) and the Health Assessment Questionnaire (HAQ) were also administered. Exploratory Principal Component Analysis identified a two-factor structure, "Relationships and Personal Development (RPD)" and "Instrumental", accounting for 57% of the variance. Confirmatory analyses extracted an identical factor structure. However, a goodness-of-fit test failed to support the two-factor solution (χ²=138.2, df=43, p<.001). The two factors had high internal consistency (RPD, α=.89; Instrumental, α=.75) and correlated significantly with scores on the HAQ (RPD, r=.53, p<.001; Instrumental, r=.44, p<.001) and CES-D (RPD, r=.55, p<.001; Instrumental, r=.43, p<.001). These findings supported the construct validity of the Korean version of the IIRS, but did not support cross-cultural equivalence of the factor structure. PMID:15832005
Barroso, Eliane Marçon; Carvalho, André Lopes; Paiva, Carlos Eduardo; Nunes, João Soares; Paiva, Bianca Sakamoto Ribeiro
2015-01-01
Patients undergoing radiotherapy for the treatment of head and neck cancer have several symptoms, predominantly oral. The Vanderbilt Head and Neck Symptom Survey version 2.0 is an American tool developed to evaluate oral symptoms in head and neck cancer patients undergoing radiotherapy. The aim of the present study was to translate the Vanderbilt Head and Neck Symptom Survey version 2.0 into Brazilian Portuguese and cross-culturally adapt this tool for subsequent validation and application in Brazil. The study followed a method used for the translation and cultural adaptation of tools, which included independent translations, synthesis of the translations, back-translations, expert committee review, and a pre-test. The pre-test was performed with 37 head and neck cancer patients, who were divided into four groups, to assess the relevance and understanding of the assessed items. Data were submitted to descriptive statistical analysis. The overall mean of the content validity index was 0.79 for semantic and idiomatic equivalence, and it was higher than 0.8 for cultural and conceptual equivalence. The cognitive interview showed that patients were able to paraphrase the items, and considered them relevant and easily understood. The tool was translated and cross-culturally adapted to be used in Brazil. The authors believe this translation is suited for validation. Copyright © 2015 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
TENI: A comprehensive battery for cognitive assessment based on games and technology.
Delgado, Marcela Tenorio; Uribe, Paulina Arango; Alonso, Andrés Aparicio; Díaz, Ricardo Rosas
2016-01-01
TENI (Test de Evaluación Neuropsicológica Infantil) is an instrument developed to assess cognitive abilities in children between 3 and 9 years of age. It is based on a model that incorporates games and technology as tools to improve the assessment of children's capacities. The test was standardized with two Chilean samples of 524 and 82 children living in urban zones. Evidence of reliability and validity based on current standards is presented. Data show good levels of reliability for all subtests. Some evidence of validity in terms of content, test structure, and association with other variables is presented. This instrument represents a novel approach and a new frontier in cognitive assessment. Further studies with clinical, rural, and cross-cultural populations are required.
Development of Internet-Based Tasks for the Executive Function Performance Test.
Rand, Debbie; Lee Ben-Haim, Keren; Malka, Rachel; Portnoy, Sigal
The Executive Function Performance Test (EFPT) is a reliable and valid performance-based tool to assess executive functions (EFs). This study's objective was to develop and verify two Internet-based tasks for the EFPT. A cross-sectional study assessed the alternate-form reliability of the Internet-based bill-paying and telephone-use tasks in healthy adults and people with subacute stroke (Study 1). It also sought to establish the tasks' criterion reliability for assessing EF deficits by correlating performance with that on the Trail Making Test in five groups: healthy young adults, healthy older adults, people with subacute stroke, people with chronic stroke, and young adults with attention deficit hyperactivity disorder (Study 2). The alternative-form reliability and initial construct validity for the Internet-based bill-paying task were verified. Criterion validity was established for both tasks. The Internet-based tasks are comparable to the original EFPT tasks and can be used for assessment of EF deficits. Copyright © 2018 by the American Occupational Therapy Association, Inc.
Maki, Dana; Rajab, Ebrahim; Watson, Paul J; Critchley, Duncan J
2014-12-01
Cross-cultural translation, adaptation, and psychometric testing. To cross-culturally translate and adapt the Roland-Morris Disability Questionnaire (RMDQ) into Modern Standard Arabic and examine its validity with Arabic-speaking patients with low back pain (LBP). The English RMDQ is valid, reliable, and commonly used to assess LBP disability in clinical practice and research. There is no valid and reliable version of the RMDQ in Modern Standard Arabic. The RMDQ was forward translated and back translated. An expert committee of musculoskeletal physiotherapists reviewed the translation. Eight patients with LBP evaluated item-by-item comprehensibility. Ten patients piloted the RMDQ for overall comprehensibility and acceptability. Seventeen bilingual patients tested the agreement of the Arabic and English RMDQs. Two-hundred one patients completed the RMDQ and the visual analogue scale. Sixty-four patients were followed-up for test-retest reliability. Translation of most items was uncontroversial. The expert committee found the Arabic RMDQ clinically and culturally appropriate. They reviewed item 11, addressing bending and kneeling, because this has a clinical significance and cultural/religious implication regarding prayer positions. All patients reported that it was easy to understand and complete. The Arabic RMDQ had high overall agreement with the English RMDQ for the global score (intraclass correlation coefficient [ICC] = 0.925; 0.811-0.972). Kappa statistics showed good item-by-item agreement (none ≤0.30). Mean (SD) RMDQ and visual analog scale scores of 201 patients were 10.53 (4.80) and 5.11 (2.28), respectively. The RMDQ had a low correlation against pain intensity (r = 0.259; P < 0.01). A Cronbach α of 0.729 showed high internal consistency. Test-retest reliability of the Arabic RMDQ was good (ICC = 0.900; 95% confidence interval, 0.753-0.951). Kappa statistics were high for 18 items and fair for 6. 
The Arabic version of the RMDQ has good comprehensibility and acceptability, high internal consistency and reliability, low correlation against pain intensity, and good agreement with the English RMDQ. We recommend its use with Arabic-speaking patients with LBP. Level of Evidence: 3.
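The item-by-item agreement reported above rests on the kappa statistic, which corrects raw agreement for the agreement expected by chance. The following is a minimal sketch of Cohen's kappa for one yes/no item; the function name and the response data are hypothetical illustrations, not the study's data or software.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two paired lists of categorical answers."""
    n = len(ratings_a)
    # Proportion of respondents who gave the same answer on both forms.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(ratings_a) | set(ratings_b)
    # Agreement expected by chance from the marginal answer frequencies.
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical answers (1 = yes, 0 = no) of ten bilingual patients to one item,
# once on the Arabic form and once on the English form.
arabic = [1, 1, 0, 1, 0, 0, 1, 1, 0, 1]
english = [1, 1, 0, 1, 0, 1, 1, 1, 0, 1]
print(round(cohen_kappa(arabic, english), 3))
```

Kappa is undefined when chance agreement equals 1 (every respondent gives the same answer on both forms), so a real analysis would need to handle that case separately.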
Essential elements of the nursing practice environment in nursing homes: Psychometric evaluation.
de Brouwer, Brigitte Johanna Maria; Kaljouw, Marian J; Schoonhoven, Lisette; van Achterberg, Theo
2017-06-01
To develop and psychometrically test the Essentials of Magnetism II in nursing homes. Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about declining quality. Nurses' practice environment has been reported to affect quality of care and productivity. The Essentials of Magnetism II © measures processes and relationships of practice environments that contribute to productivity and quality of care and can therefore be useful in identifying processes requiring change to pursue excellent practice environments. However, this instrument has so far not been explicitly evaluated for use in nursing home settings. In a preparatory phase, a cross-sectional survey study focused on face validity of the essentials of magnetism in nursing homes. A second cross-sectional survey design was then used to further test the instrument's validity and reliability. Psychometric testing included evaluation of content and construct validity, and reliability. Nurses (N = 456) working at 44 units of three nursing homes were included. Respondent acceptance, relevance and clarity were adequate. Five of the eight subscales and 54 of the 58 items met preset psychometric criteria. All essentials of magnetism are considered relevant for nursing homes. The subscales Adequacy of Staffing, Clinically Competent Peers, Patient Centered Culture, Autonomy and Nurse Manager Support can be used in nursing homes without problems. The other subscales cannot be directly applied to this setting. The valid subscales of the Essentials of Magnetism II instrument can be used to design excellent nursing practice environments that support nurses' delivery of care. Before using the entire instrument, however, the other subscales have to be improved. © 2016 John Wiley & Sons Ltd.
Lera, Lydia; Ángel, Bárbara; Sánchez, Hugo; Picrin, Yaisy; Hormazabal, María José; Quiero, Andrea; Albala, Cecilia
2014-09-28
To estimate and validate cut-off points of skeletal muscle mass index (SMI) in the Chilean population, for use in an algorithm for the diagnosis of sarcopenia developed by the European Working Group on Sarcopenia in Older People (EWGSOP). Secondary analysis of cross-sectional data from 440 Chilean older subjects to estimate cut-off points of SMI determined by DEXA and predicted by an anthropometric equation. Afterward, a cross-sectional validation in a sample of 164 older people was performed. Anthropometric measures, self-reported health status, physical performance tests and DEXA were carried out. Decreased muscle strength was defined as handgrip strength <15 kg in women and <27 kg in men. Cut-off points of SMI were defined as values under the 20th percentile for DEXA measures and estimated through ROC curves for the anthropometric model. Biological validity of the algorithm was tested by contrasting the diagnosis with physical performance tests and functionality. Cut-off points of SMI obtained by DEXA were 7.19 kg/m² in men and 5.77 kg/m² in women, and 7.45 kg/m² and 5.88 kg/m², respectively, for those predicted by the model. Sensitivity and specificity of estimations vs DEXA measures were 80% and 92% in men and 77% and 89% in women. We obtained cut-off points of SMI for DEXA and for a prediction equation for Chilean older adults, with good sensitivity and specificity relative to the measurement by DEXA. These cut-off points will allow the EWGSOP algorithm to be applied to the early diagnosis of sarcopenia and support the development of programs for the prevention, delay or reversal of this syndrome. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
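To make the percentile cut-off and the sensitivity/specificity comparison concrete, here is a minimal sketch. It assumes a nearest-rank percentile definition and uses made-up SMI values; only the two cut-offs for men (7.19 and 7.45 kg/m²) come from the abstract, and the function names are hypothetical.

```python
import math


def percentile_cutoff(values, pct):
    """Nearest-rank percentile of a list of numbers (pct between 0 and 100)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def sensitivity_specificity(reference, predicted):
    """Paired lists of booleans (True = low muscle mass); reference is DEXA."""
    tp = sum(r and p for r, p in zip(reference, predicted))
    tn = sum((not r) and (not p) for r, p in zip(reference, predicted))
    positives = sum(reference)
    negatives = len(reference) - positives
    return tp / positives, tn / negatives


# Hypothetical SMI values (kg/m^2) for eight men: DEXA vs. anthropometric model.
dexa_smi = [6.8, 7.0, 7.1, 7.6, 7.9, 8.2, 8.4, 9.0]
model_smi = [7.0, 7.3, 7.6, 7.4, 8.0, 8.1, 8.6, 8.8]

# Cut-offs reported in the abstract for men: 7.19 (DEXA) and 7.45 (model).
low_by_dexa = [v < 7.19 for v in dexa_smi]
low_by_model = [v < 7.45 for v in model_smi]
sens, spec = sensitivity_specificity(low_by_dexa, low_by_model)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

In a derivation sample, `percentile_cutoff(dexa_values, 20)` would give the kind of 20th-percentile threshold described above; the study's exact percentile convention is not stated in the abstract.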
Zhang, Yin-Ping; Zhao, Xin-Shuang; Zhang, Bei; Zhang, Lu-Lu; Ni, Chun-Ping; Hao, Nan; Shi, Chang-Bei; Porr, Caroline
2015-07-01
The comprehensive needs assessment tool for cancer caregivers (CNAT-C) is a systematic and comprehensive needs assessment tool for family caregivers. The purpose of this project was twofold: (1) to adapt the CNAT-C to Mainland China's cultural context and (2) to evaluate the psychometric properties of the newly adapted Chinese CNAT-C. Cross-cultural adaptation of the original CNAT-C was performed according to published guidelines. A pilot study was conducted in Mainland China with 30 Chinese family cancer caregivers. A subsequent validation study was conducted with 205 Chinese cancer caregivers from Mainland China. Construct validity was determined through exploratory and confirmatory factor analyses. Reliability was determined using internal consistency and test-retest reliability. The split-half coefficient for the overall Chinese CNAT-C scale was 0.77. Principal component analysis resulted in an eight-factor structure explaining 68.11% of the total variance. The comparative fit index (CFI) was 0.91 from the modified model confirmatory factor analysis. The Chi-square divided by degrees of freedom was 1.98, and the root mean squared error of approximation (RMSEA) was 0.079. In relation to the known-group validation, significant differences were found in the Chinese CNAT-C scale according to various caregiver characteristics. Internal consistency was high for the Chinese CNAT-C, reaching a Cronbach α value of 0.94. Test-retest reliability was 0.85. The newly adapted Chinese CNAT-C scale possesses adequate validity, test-retest reliability, and internal consistency and therefore may be used to ascertain holistic health and support needs of cancer patients' family caregivers in Mainland China.
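Internal consistency in records like this one is summarised by Cronbach's α, which compares the sum of the item variances with the variance of the total score. A minimal sketch follows, assuming complete data and using hypothetical item scores; it is not the authors' analysis code.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """scores: one list of k item scores per respondent (no missing values)."""
    k = len(scores[0])
    items = list(zip(*scores))  # transpose: one tuple of scores per item
    item_variance_sum = sum(pvariance(column) for column in items)
    total_variance = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers of five caregivers to four items on a 0-3 scale.
answers = [
    [3, 2, 3, 3],
    [1, 1, 0, 1],
    [2, 2, 2, 3],
    [0, 1, 1, 0],
    [3, 3, 2, 3],
]
print(round(cronbach_alpha(answers), 3))
```

The ratio is the same whether population or sample variances are used, as long as the same kind is used in the numerator and the denominator.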
Cross-validation of a dementia screening test in a heterogeneous population.
Ritchie, K A; Hallerman, E F
1989-09-01
Recognition of the increasing importance of early dementia screening for both research and clinical purposes has led to the development of numerous screening instruments. The most promising of these are based on neuropsychological measures which are able to focus on very specific cognitive functions. Of these tests the Iowa screening test is of particular interest to researchers and clinicians working with heterogeneous populations or wishing to make cross-cultural comparisons as it is relatively culture-fair and does not assume literacy. A preliminary study of the performance of the Iowa in an Israeli sample of diverse ethnic origins and low education level suggests it to be a very sensitive measure even in such groups. The study also demonstrates the inadvisability of adopting item weights derived by multivariate statistical techniques from another population.
NASA Astrophysics Data System (ADS)
Zhang, Lei; Li, Dong; Liu, Yu; Liu, Jingxiao; Li, Jingsong; Yu, Benli
2017-11-01
We demonstrate the validity of the simultaneous reverse optimization reconstruction (SROR) algorithm in circular subaperture stitching interferometry (CSSI); the algorithm was previously proposed for non-null aspheric annular subaperture stitching interferometry (ASSI). The merits of the modified SROR algorithm in CSSI, such as automatic retrace error correction, no need for overlap and even tolerance of missed coverage, are analyzed in detail in simulations and experiments. Meanwhile, a practical CSSI system is proposed for this demonstration. An optical wedge is employed to deflect the incident beam for subaperture scanning by its rotation and shift, instead of a six-axis motion-control system. The reference path can also provide variable Zernike defocus for each subaperture test, which decreases the fringe density. Experiments validating the SROR algorithm in this CSSI system are implemented, with cross-validation by testing a paraboloidal mirror, a flat mirror and an astigmatism mirror. This work is an indispensable supplement to SROR application in general subaperture stitching interferometry.
Lunt, Heather; Roiz De Sa, Daniel; Roiz De Sa, Julia; Allsopp, Adrian
2013-07-01
To provide an accurate estimate of peak oxygen uptake (VO2 peak) for British Royal Navy Personnel aged between 18 and 39, comparing a gold standard treadmill based maximal exercise test with a submaximal one-mile walk test. Two hundred military personnel consented to perform a treadmill-based VO2 peak test and two one-mile walk tests round an athletics track. The estimated VO2 peak values from three different one-mile walk equations were compared to directly measured VO2 peak values from the treadmill-based test. One hundred participants formed a validation group from which a new equation was derived and the other 100 participants formed the cross-validation group. Existing equations underestimated the VO2 peak values of the fittest personnel and overestimated the VO2 peak of the least aerobically fit by between 2% and 18%. The new equation derived from the validation group has less bias, the highest correlation with the measured values (r = 0.83), and classified the most people correctly according to the Royal Navy's Fitness Test standards, producing the fewest false positives and false negatives combined (9%). The new equation will provide a more accurate estimate of VO2 peak for a British military population aged 18 to 39. Reprint & Copyright © 2013 Association of Military Surgeons of the U.S.
Double Cross-Validation in Multiple Regression: A Method of Estimating the Stability of Results.
ERIC Educational Resources Information Center
Rowell, R. Kevin
In multiple regression analysis, where resulting predictive equation effectiveness is subject to shrinkage, it is especially important to evaluate result replicability. Double cross-validation is an empirical method by which an estimate of invariance or stability can be obtained from research data. A procedure for double cross-validation is…
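As an illustration of the general idea, the sketch below splits a sample into two random halves, fits a regression equation in each half, applies each equation to the other half, and reports how much the multiple correlation shrinks. It is a sketch under stated assumptions, not the procedure from the (truncated) abstract: the helper names, the pure-Python least-squares solver and the simulated data are all hypothetical.

```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [u - factor * v for u, v in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_ols(rows, y):
    """Least-squares coefficients [intercept, b1, b2, ...] via normal equations."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)


def predict(coefs, rows):
    return [coefs[0] + sum(c * v for c, v in zip(coefs[1:], r)) for r in rows]


def pearson_r(a, b):
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def double_cross_validate(rows, y, seed=0):
    """Fit in each random half, apply to the other half, report shrinkage."""
    idx = list(range(len(rows)))
    random.Random(seed).shuffle(idx)
    half = len(idx) // 2
    halves = {"A": idx[:half], "B": idx[half:]}
    report = {}
    for fit_on, apply_to in (("A", "B"), ("B", "A")):
        fit_rows = [rows[i] for i in halves[fit_on]]
        fit_y = [y[i] for i in halves[fit_on]]
        new_rows = [rows[i] for i in halves[apply_to]]
        new_y = [y[i] for i in halves[apply_to]]
        coefs = fit_ols(fit_rows, fit_y)
        r_fit = pearson_r(predict(coefs, fit_rows), fit_y)
        r_cross = pearson_r(predict(coefs, new_rows), new_y)
        report[f"{fit_on}->{apply_to}"] = {
            "r_fit": r_fit, "r_cross": r_cross, "shrinkage": r_fit - r_cross}
    return report


if __name__ == "__main__":
    # Simulated data with two predictors and noise (purely illustrative).
    rng = random.Random(1)
    rows = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(60)]
    y = [2.0 + 1.5 * a - 0.8 * b + rng.gauss(0, 1) for a, b in rows]
    for direction, stats in double_cross_validate(rows, y).items():
        print(direction, {k: round(v, 3) for k, v in stats.items()})
```

Small shrinkage in both directions suggests that the predictive equation is stable across subsamples; large or asymmetric shrinkage suggests the opposite.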
Rikli, Roberta E; Jones, C Jessie
2013-04-01
To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults, the Senior Fitness Test (Rikli, R. E., & Jones, C. J. (2001). Development and validation of a functional fitness test for community-residing older adults. Journal of Aging and Physical Activity, 6, 127-159; Rikli, R. E., & Jones, C. J. (1999a). Senior fitness test manual. Champaign, IL: Human Kinetics.). A criterion measure to assess physical independence was identified. Next, scores from a subset of 2,140 "moderate-functioning" older adults from a larger cross-sectional database, together with findings from longitudinal research on physical capacity and aging, were used as the basis for proposing fitness standards (performance cut points) associated with having the ability to function independently. Validity and reliability analyses were conducted to test the standards for their accuracy and consistency as predictors of physical independence. Performance standards are presented for men and women ages 60-94 indicating the level of fitness associated with remaining physically independent until late in life. Reliability and validity indicators for the standards ranged between .79 and .97. The proposed standards provide easy-to-use, previously unavailable methods for evaluating physical capacity in older adults relative to that associated with physical independence. Most importantly, the standards can be used in planning interventions that target specific areas of weakness, thus reducing risk for premature loss of mobility and independence.
González-Sánchez, Manuel; Ruiz-Muñoz, Maria; Li, Guang Zhi; Cuesta-Vargas, Antonio I
2018-08-01
To perform a cross-cultural adaptation and validation of the Foot Function Index (FFI) questionnaire to develop the Chinese version. Three hundred and six patients with foot and ankle neuromusculoskeletal diseases participated in this observational study. Construct validity, internal consistency and criterion validity were calculated for the FFI Chinese version after the translation and transcultural adaptation process. Internal consistency ranged from 0.996 to 0.998. Test-retest analysis ranged from 0.985 to 0.994; minimal detectable change 90: 2.270; standard error of measurement: 0.973. Load distribution of the three factors had an eigenvalue greater than 1. Chi-square value was 9738.14 (p < 0.001). Correlations with the three factors were significant between Factor 1 and the other two: r = -0.634 (Factor 2) and r = -0.191 (Factor 3). Foot Function Index (Taiwan Version), Short-Form 12 (Version 2) and EuroQol-5D were used for criterion validity. Factors 1 and 2 showed significant correlation with 15/16 and 14/16 scales and subscales, respectively. Foot Function Index Chinese version psychometric characteristics were good to excellent. Chinese researchers and clinicians may use this tool for foot and ankle assessment and monitoring. Implications for rehabilitation: A cross-cultural adaptation of the FFI has been done from the original version to Chinese. Consistent results and satisfactory psychometric properties of the Foot Function Index Chinese version have been reported. For Chinese-speaking researchers and clinicians, the FFI-Ch could be used as a tool to assess patients with foot disease.
Ramos, Tatiana Dalpasquale; Brito, Maria José Azevedo de; Piccolo, Mônica Sarto; Rosella, Maria Fernanda Normanha da Silva Martins; Sabino, Miguel; Ferreira, Lydia Masako
2016-07-21
Rhinoplasty is one of the most sought-after esthetic operations among individuals with body dysmorphic disorder. The aim of this study was to cross-culturally adapt and validate the Body Dysmorphic Symptoms Scale. Cross-cultural validation study conducted in a plastic surgery outpatient clinic of a public university hospital. Between February 2014 and March 2015, 80 consecutive patients of both sexes seeking rhinoplasty were selected. Thirty of them participated in the phase of cultural adaptation of the instrument. Reproducibility was tested on 20 patients and construct validity was assessed on 50 patients, with correlation against the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder. The Brazilian version of the instrument showed Cronbach's alpha of 0.805 and excellent inter-rater reproducibility (intraclass correlation coefficient, ICC = 0.873; P < 0.001) and intra-rater reproducibility (ICC = 0.939; P < 0.001). Significant differences in total scores were found between patients with and without symptoms (P < 0.001). A strong correlation (r = 0.841; P < 0.001) was observed between the Yale-Brown Obsessive Compulsive Scale for Body Dysmorphic Disorder and the Body Dysmorphic Symptoms Scale. The area under the receiver operating characteristic curve was 0.981, thus showing good accuracy for discriminating between presence and absence of symptoms of body dysmorphic disorder. Forty-six percent of the patients had body dysmorphic symptoms and 54% had moderate to severe appearance-related obsessive-compulsive symptoms. The Brazilian version of the Body Dysmorphic Symptoms Scale is a reproducible instrument that presents face, content and construct validity.
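The area under the ROC curve quoted above can be read as the probability that a randomly chosen patient with symptoms scores higher on the scale than a randomly chosen patient without symptoms. A minimal sketch of that pairwise computation, with hypothetical scores and a hypothetical function name:

```python
def roc_auc(scores_with, scores_without):
    """AUC as the share of (with, without) pairs ranked correctly; ties count 0.5."""
    pairs = len(scores_with) * len(scores_without)
    wins = sum((p > n) + 0.5 * (p == n)
               for p in scores_with for n in scores_without)
    return wins / pairs


# Hypothetical total scale scores for patients with and without symptoms.
with_symptoms = [7, 8, 6, 9, 5]
without_symptoms = [2, 3, 5, 1, 4]
print(roc_auc(with_symptoms, without_symptoms))
```

This pairwise form is equivalent to the Mann-Whitney U statistic divided by the number of pairs; it is quadratic in sample size, which is acceptable for samples of this scale.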
Gillespie, Brigid M; Polit, Denise F; Hamlin, Lois; Chaboyer, Wendy
2012-01-01
This paper describes the development and validation of the Revised Perioperative Competence Scale (PPCS-R). There is a lack of psychometrically tested, sound self-assessment tools to measure nurses' perceived competence in the operating room. Content validity was established by a panel of international experts and the original 98-item scale was pilot tested with 345 nurses in Queensland, Australia. Following the removal of several items, a national sample that included all 3209 nurses who were members of the Australian College of Operating Room Nurses was surveyed using the 94-item version. Psychometric testing assessed content validity using exploratory factor analysis, internal consistency using Cronbach's alpha, and construct validity using the "known groups" technique. During item reduction, several preliminary factor analyses were performed on two random halves of the sample (n=550). Usable data for psychometric assessment were obtained from 1122 nurses. The original 94-item scale was reduced to 40 items. The final factor analysis using the entire sample resulted in a 40-item, six-factor solution. Cronbach's alpha for the 40-item scale was .96. Construct validation demonstrated significant differences (p<.0001) in perceived competence scores relative to years of operating room experience and receipt of specialty education. On the basis of these results, the psychometric properties of the PPCS-R were considered encouraging. Further testing of the tool in different samples of operating room nurses is necessary to enable cross-cultural comparisons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Mandillo, Silvia; Tucci, Valter; Hölter, Sabine M.; Meziane, Hamid; Banchaabouchi, Mumna Al; Kallnik, Magdalena; Lad, Heena V.; Nolan, Patrick M.; Ouagazzal, Abdel-Mouttalib; Coghill, Emma L.; Gale, Karin; Golini, Elisabetta; Jacquot, Sylvie; Krezel, Wojtek; Parker, Andy; Riet, Fabrice; Schneider, Ilka; Marazziti, Daniela; Auwerx, Johan; Brown, Steve D. M.; Chambon, Pierre; Rosenthal, Nadia; Tocchini-Valentini, Glauco; Wurst, Wolfgang
2008-01-01
Establishing standard operating procedures (SOPs) as tools for the analysis of behavioral phenotypes is fundamental to mouse functional genomics. It is essential that the tests designed provide reliable measures of the process under investigation but most importantly that these are reproducible across both time and laboratories. For this reason, we devised and tested a set of SOPs to investigate mouse behavior. Five research centers were involved across France, Germany, Italy, and the UK in this study, as part of the EUMORPHIA program. All the procedures underwent a cross-validation experimental study to investigate the robustness of the designed protocols. Four inbred reference strains (C57BL/6J, C3HeB/FeJ, BALB/cByJ, 129S2/SvPas), reflecting their use as common background strains in mutagenesis programs, were analyzed to validate these tests. We demonstrate that the operating procedures employed, which include open field, SHIRPA, grip-strength, rotarod, Y-maze, prepulse inhibition of acoustic startle response, and tail flick tests, generated reproducible results between laboratories for a number of the test output parameters. However, we also identified several uncontrolled variables that constitute confounding factors in behavioral phenotyping. The EUMORPHIA SOPs described here are an important starting point for the ongoing development of increasingly robust phenotyping platforms and their application in large-scale, multicentre mouse phenotyping programs. PMID:18505770
Junkes, Monica C.; Fraiz, Fabian C.; Sardenberg, Fernanda; Lee, Jessica Y.; Paiva, Saul M.; Ferreira, Fernanda M.
2015-01-01
Objective: The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. Methods: After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. Results: The BREALD-30 demonstrated good internal reliability. Cronbach’s alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent’s perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent’s perception regarding his/her child's oral health remained significant in the multivariate analysis. Conclusion: The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil. PMID:26158724
Novel naïve Bayes classification models for predicting the chemical Ames mutagenicity.
Zhang, Hui; Kang, Yan-Li; Zhu, Yuan-Yuan; Zhao, Kai-Xia; Liang, Jun-Yu; Ding, Lan; Zhang, Teng-Guo; Zhang, Ji
2017-06-01
Prediction of drug candidates for mutagenicity is a regulatory requirement since mutagenic compounds could pose a toxic risk to humans. The aim of this investigation was to develop a novel prediction model of mutagenicity by using a naïve Bayes classifier. The established model was validated by internal 5-fold cross validation and external test sets. For comparison, a recursive partitioning classifier prediction model was also established and various other reported prediction models of mutagenicity were collected. Among these methods, the naïve Bayes classifier established here performed well and stably, yielding average overall prediction accuracies of 89.1±0.4% for the internal 5-fold cross validation of the training set and 77.3±1.5% for external test set I. The concordance of the external test set II with 446 marketed drugs was 90.9±0.3%. In addition, four simple molecular descriptors (e.g., Apol, No. of H donors, Num-Rings and Wiener) related to mutagenicity and five representative substructures of mutagens (e.g., aromatic nitro, hydroxyl amine, nitroso, aromatic amine and N-methyl-N-methylenemethanaminum) produced by ECFP_14 fingerprints were identified. We hope the established naïve Bayes prediction model can be applied to risk assessment processes, and that the information obtained on mutagenic chemicals can guide the design of chemical libraries for hit and lead optimization. Copyright © 2017 Elsevier B.V. All rights reserved.
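The internal validation described above (a naïve Bayes classifier scored by 5-fold cross validation) can be sketched roughly as follows. This is not the authors' pipeline: the Bernoulli naïve Bayes with Laplace smoothing, the function names, and the toy binary "fingerprint" data are all assumptions made for illustration.

```python
import math
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and deal them into k near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_bernoulli_nb(rows, labels):
    """Estimate class priors and Laplace-smoothed P(feature = 1 | class)."""
    model = {}
    n_features = len(rows[0])
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        prior = len(members) / len(rows)
        probs = [(sum(r[j] for r in members) + 1) / (len(members) + 2)
                 for j in range(n_features)]
        model[cls] = (prior, probs)
    return model


def predict_bernoulli_nb(model, row):
    """Return the class with the highest log posterior for one binary row."""
    def log_posterior(cls):
        prior, probs = model[cls]
        return math.log(prior) + sum(
            math.log(p if x else 1 - p) for x, p in zip(row, probs))
    return max(model, key=log_posterior)


def cross_validate(rows, labels, fit, predict, k=5, seed=0):
    """Hold each fold out once, train on the rest, return per-fold accuracy."""
    accuracies = []
    for held_out in k_fold_indices(len(labels), k, seed):
        held = set(held_out)
        train = [i for i in range(len(labels)) if i not in held]
        model = fit([rows[i] for i in train], [labels[i] for i in train])
        correct = sum(predict(model, rows[i]) == labels[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return accuracies


# Toy data (hypothetical): feature 0 tracks the label, feature 1 does not.
labels = [i % 2 for i in range(20)]
rows = [[labels[i], (i // 2) % 2] for i in range(20)]
accuracies = cross_validate(rows, labels, fit_bernoulli_nb, predict_bernoulli_nb)
print(sum(accuracies) / len(accuracies))
```

Reporting the mean and spread of the per-fold accuracies over repeated shuffles is one common way to arrive at figures of the form "mean ± deviation".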
Dong, Zuoli; Zhang, Naiqian; Li, Chun; Wang, Haiyun; Fang, Yun; Wang, Jun; Zheng, Xiaoqi
2015-06-30
An enduring challenge in personalized medicine is to select the right drug for individual patients. Testing drugs on patients in large clinical trials is one way to assess their efficacy and toxicity, but it is impractical to test hundreds of drugs currently under development. Preclinical prediction models are therefore highly desirable, as they enable prediction of drug response across hundreds of cell lines in parallel. Recently, two large-scale pharmacogenomic studies screened multiple anticancer drugs on over 1000 cell lines in an effort to elucidate the response mechanism of anticancer drugs. To this end, we used gene expression features and drug sensitivity data in the Cancer Cell Line Encyclopedia (CCLE) to build a predictor based on a Support Vector Machine (SVM) and a recursive feature selection tool. Robustness of our model was validated by cross-validation and an independent dataset, the Cancer Genome Project (CGP). Our model achieved good cross validation performance for most drugs in the Cancer Cell Line Encyclopedia (≥80% accuracy for 10 drugs, ≥75% accuracy for 19 drugs). Independent tests on eleven common drugs between CCLE and CGP achieved satisfactory performance for three of them, i.e., AZD6244, Erlotinib and PD-0325901, using expression levels of only twelve, six and seven genes, respectively. These results suggest that drug response could be effectively predicted from genomic features. Our model could be applied to predict drug response for certain drugs and potentially play a complementary role in personalized medicine.
Lima, Elaine; Teixeira-Salmela, Luci F; Simões, Luan; Guerra, Ana C C; Lemos, Andrea
2016-03-15
While there are several instruments in Brazil that measure motor function in patients after stroke, it is unknown whether the measurement properties of these instruments are appropriate. To identify the motor function instruments available in Brazil for patients after stroke. To assess the methodological quality of the studies and the results related to the measurement properties of these instruments. Two independent reviewers conducted searches on PubMed, LILACS, CINAHL, Web of Science, and Scopus. Studies that aimed to cross-culturally adapt an existing instrument or create a Brazilian instrument and test at least one measurement property related to motor function in patients after stroke were included. The methodological quality of these studies was checked by the COSMIN checklist with 4-point rating scale and the results of the measurement properties were analyzed by the criteria developed by Terwee et al. A total of 11 instruments were considered eligible, none of which were created in Brazil. The process of cross-cultural adaptation was inadequate in 10 out of 11 instruments due to the lack of back-translation or due to inappropriate target population. All of the instruments presented flaws in the measurement properties, especially reliability, internal consistency, and construct validity. The flaws observed in both cross-cultural adaptation process and testing measurement properties make the results inconclusive on the validity of the available instruments. Adequate procedures of cross-cultural adaptation and measurement properties of these instruments are strongly needed.
Lima, Elaine; Teixeira-Salmela, Luci F.; Simões, Luan; Guerra, Ana C. C.; Lemos, Andrea
2016-01-01
Background While there are several instruments in Brazil that measure motor function in patients after stroke, it is unknown whether the measurement properties of these instruments are appropriate. Objective To identify the motor function instruments available in Brazil for patients after stroke. To assess the methodological quality of the studies and the results related to the measurement properties of these instruments. Method Two independent reviewers conducted searches on PubMed, LILACS, CINAHL, Web of Science, and Scopus. Studies that aimed to cross-culturally adapt an existing instrument or create a Brazilian instrument and test at least one measurement property related to motor function in patients after stroke were included. The methodological quality of these studies was checked by the COSMIN checklist with 4-point rating scale and the results of the measurement properties were analyzed by the criteria developed by Terwee et al. Results A total of 11 instruments were considered eligible, none of which were created in Brazil. The process of cross-cultural adaptation was inadequate in 10 out of 11 instruments due to the lack of back-translation or due to inappropriate target population. All of the instruments presented flaws in the measurement properties, especially reliability, internal consistency, and construct validity. Conclusion The flaws observed in both cross-cultural adaptation process and testing measurement properties make the results inconclusive on the validity of the available instruments. Adequate procedures of cross-cultural adaptation and measurement properties of these instruments are strongly needed. PMID:26982452
Fuermaier, Anselm B M; Tucha, Oliver; Koerts, Janneke; Lange, Klaus W; Weisbrod, Matthias; Aschenbrenner, Steffen; Tucha, Lara
2017-12-01
The assessment of performance validity is an essential part of the neuropsychological evaluation of adults with attention-deficit/hyperactivity disorder (ADHD). Most available tools, however, are inaccurate regarding the identification of noncredible performance. This study describes the development of a visuospatial working memory test, including a validity indicator for noncredible cognitive performance of adults with ADHD. Visuospatial working memory of adults with ADHD (n = 48) was first compared to the test performance of healthy individuals (n = 48). Furthermore, a simulation design was performed including 252 individuals who were randomly assigned to either a control group (n = 48) or to 1 of 3 simulation groups who were requested to feign ADHD (n = 204). Additional samples of 27 adults with ADHD and 69 instructed simulators were included to cross-validate findings from the first samples. Adults with ADHD showed impaired visuospatial working memory performance of medium size as compared to healthy individuals. Simulation groups committed significantly more errors and had shorter response times as compared to patients with ADHD. Moreover, binary logistic regression analysis was carried out to derive a validity index that optimally differentiates between true and feigned ADHD. ROC analysis demonstrated high classification rates of the validity index, as shown in excellent specificity (95.8%) and adequate sensitivity (60.3%). The visuospatial working memory test as presented in this study therefore appears sensitive in indicating cognitive impairment of adults with ADHD. Furthermore, the embedded validity index revealed promising results concerning the detection of noncredible cognitive performance of adults with ADHD. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Translation and validation of the Canadian diabetes risk assessment questionnaire in China.
Guo, Jia; Shi, Zhengkun; Chen, Jyu-Lin; Dixon, Jane K; Wiley, James; Parry, Monica
2018-01-01
To adapt the Canadian Diabetes Risk Assessment Questionnaire for the Chinese population and to evaluate its psychometric properties. A cross-sectional study was conducted with a convenience sample of 194 individuals aged 35-74 years from October 2014 to April 2015. The Canadian Diabetes Risk Assessment Questionnaire was adapted and translated for the Chinese population. Test-retest reliability was conducted to measure stability. Criterion and convergent validity of the adapted questionnaire were assessed using 2-hr 75 g oral glucose tolerance tests and the Finnish Diabetes Risk Scores, respectively. Sensitivity and specificity were evaluated to establish its predictive validity. The test-retest reliability was 0.988. Adequate validity of the adapted questionnaire was demonstrated by positive correlations found between the scores and 2-hr 75 g oral glucose tolerance tests (r = .343, p < .001) and with the Finnish Diabetes Risk Scores (r = .738, p < .001). The area under receiver operating characteristic curve was 0.705 (95% CI .632, .778), demonstrating moderate diagnostic value at a cutoff score of 30. The sensitivity was 73%, with a positive predictive value of 57% and negative predictive value of 78%. Our results provided evidence supporting the translation consistency, content validity, convergent validity, criterion validity, sensitivity, and specificity of the translated Canadian Diabetes Risk Assessment Questionnaire with minor modifications. This paper provides clinical, practical, and methodological information on how to adapt a diabetes risk calculator between cultures for public health nurses. © 2017 Wiley Periodicals, Inc.
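The sensitivity, specificity and predictive values quoted above all derive from a 2x2 table at a chosen cutoff. A minimal sketch, assuming that scores at or above the cutoff count as test-positive (the abstract does not state the direction of the inequality) and using made-up scores rather than study data:

```python
def diagnostic_metrics(scores, has_condition, cutoff):
    """Summarise a cutoff rule (score >= cutoff is positive) against a reference."""
    tp = fp = tn = fn = 0
    for score, truth in zip(scores, has_condition):
        positive = score >= cutoff
        if positive and truth:
            tp += 1          # true positive
        elif positive:
            fp += 1          # false positive
        elif truth:
            fn += 1          # false negative
        else:
            tn += 1          # true negative
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Hypothetical risk scores and reference-standard results.
scores = [10, 20, 30, 40, 50, 35, 25, 45]
has_condition = [False, False, True, True, True, False, True, True]
metrics = diagnostic_metrics(scores, has_condition, cutoff=30)
print(metrics)
```

Unlike sensitivity and specificity, the predictive values depend on how common the condition is in the sample, so they do not transfer directly to populations with a different prevalence.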
Validity and cultural equivalence of the standard Greene Climacteric Scale in Hong Kong.
Chen, Run Qiu; Davis, Susan R; Wong, Chit Ming; Lam, Tai Hing
2010-01-01
The aim of this study was to translate the standard Greene Climacteric Scale (GCS) and a urogenital symptom scale into colloquial Chinese (Hong Kong) and test their validity and reliability in Hong Kong Chinese women. The scales were translated with standard techniques, and cross-cultural construct validity, internal consistency, test-retest reliability, and responsiveness were tested on samples of women aged 40 to 60 years recruited from the community. A total of 611 women, with mean (SD) age of 48.9 (5.3) years, provided completed scales for the study. Confirmatory factor analysis demonstrated construct validity of the translated standard GCS. The items were found to have good homogeneity in measuring the scale concepts (Cronbach alpha > 0.7). But the three-item urogenital scale had poor internal consistency (Cronbach alpha = 0.43), and a combination of this scale with the standard GCS resulted in a reduced model fit to the data. Test-retest reliability for the GCS was good on women recruited for a retest (n = 52). The translated GCS was found to be responsive to change over time (effect size, 0.59; n = 19). The Chinese (Hong Kong) version of the standard GCS is a valid and cultural-equivalent instrument. Our data do not support inclusion of the urogenital scale to the standard GCS. Measurement of urogenital symptoms is subject to further study.
Validation of the 'Test of the Adherence to Inhalers' (TAI) for Asthma and COPD Patients.
Plaza, Vicente; Fernández-Rodríguez, Concepción; Melero, Carlos; Cosío, Borja G; Entrenas, Luís Manuel; de Llano, Luis Pérez; Gutiérrez-Pereyra, Fernando; Tarragona, Eduard; Palomino, Rosa; López-Viña, Antolín
2016-04-01
To validate the 'Test of Adherence to Inhalers' (TAI), a 12-item questionnaire designed to assess the adherence to inhalers in patients with COPD or asthma. A total of 1009 patients with asthma or COPD participated in a cross-sectional multicenter study. Patients with electronic adherence ≥80% were defined as adherents. Construct validity, internal validity, and criterion validity were evaluated. Self-reported adherence was compared with the Morisky-Green questionnaire. Factor analysis study demonstrated two factors, factor 1 was coincident with TAI patient domain (items 1 to 10) and factor 2 with TAI health-care professional domain (items 11 and 12). The Cronbach's alpha was 0.860 and the test-retest reliability 0.883. TAI scores correlated with electronic adherence (ρ=0.293, p=0.01). According to the best cut-off for 10 items (score 50, area under the ROC curve 0.7), 569 (62.5%) patients were classified as non-adherents. The non-adherence behavior pattern was: erratic 527 (57.9%), deliberate 375 (41.2%), and unwitting 242 (26.6%) patients. As compared to Morisky-Green test, TAI showed better psychometric properties. The TAI is a reliable and homogeneous questionnaire to identify easily non-adherence and to classify from a clinical perspective the barriers related to the use of inhalers in asthma and COPD.
Validation of the Italian translation of the Inflammatory Bowel Disease Questionnaire.
Ciccocioppo, Rachele; Klersy, Catherine; Russo, Maria Luisa; Valli, Monica; Boccaccio, Vincenzo; Imbesi, Venerina; Ardizzone, Sandro; Porro, Gabriele Bianchi; Corazza, Gino Roberto
2011-07-01
Health-related quality of life is an important measure of treatment outcome; its evaluation requires the use of internationally validated ad hoc questionnaires. The McMaster Inflammatory Bowel Disease Questionnaire (IBDQ) is the most used specific instrument. To assess the validity and reliability of the Italian translation of the IBDQ. The IBDQ underwent forward and backward translation; 13 patients were enrolled for cognitive testing of the Italian version to increase clarity. For field testing, 113 patients (65 with Crohn's disease and 48 with ulcerative colitis) completed both the IBDQ and the generic instrument 36-item Short Form Health Survey scale (SF-36). Data quality was optimal with high completeness and low floor and ceiling effect. Item internal consistency was satisfied for 100% of patients, while discriminant validity showed a few items with higher correlations with other scales. Cronbach's alpha coefficient was 0.96. Test-retest correlations indicated good reliability (Pearson R 0.81). Exploratory factor analysis indicated that the original grouping of the item was suboptimal. The score proved sensitive to disease activity, gender and quality of life as measured by the SF-36. The Italian translation of the McMaster Inflammatory Bowel Disease Questionnaire sounds natural and is easy to understand. A field test gave results comparable to other international validations, supporting its use in cross-national surveys. Copyright © 2011 Editrice Gastroenterologica Italiana S.r.l. Published by Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ferraioli, Luigi; Hueller, Mauro; Vitale, Stefano
The scientific objectives of the LISA Technology Package experiment on board of the LISA Pathfinder mission demand accurate calibration and validation of the data analysis tools in advance of the mission launch. The level of confidence required in the mission outcomes can be reached only by intensively testing the tools on synthetically generated data. A flexible procedure allowing the generation of a cross-correlated stationary noise time series was set up. A multichannel time series with the desired cross-correlation behavior can be generated once a model for a multichannel cross-spectral matrix is provided. The core of the procedure comprises a noise coloring, multichannel filter designed via a frequency-by-frequency eigendecomposition of the model cross-spectral matrix and a subsequent fit in the Z domain. The common problem of initial transients in a filtered time series is solved with a proper initialization of the filter recursion equations. The noise generator performance was tested in a two-dimensional case study of the closed-loop LISA Technology Package dynamics along the two principal degrees of freedom.
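The basic idea of imposing a target cross-correlation on independent noise can be illustrated with a much simpler case than the one described above. The sketch below mixes two independent white Gaussian series with the Cholesky factor of a 2x2 correlation matrix. It is a deliberately simplified stand-in: it produces white, not colored, noise and does not implement the frequency-by-frequency eigendecomposition or the Z-domain fit. All names are assumptions.

```python
import math
import random


def correlated_white_noise(n, rho, seed=0):
    """Two zero-mean, unit-variance white series with correlation rho."""
    if not -1.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [-1, 1]")
    rng = random.Random(seed)
    c = math.sqrt(1.0 - rho * rho)
    xs, ys = [], []
    for _ in range(n):
        z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        # Rows of the Cholesky factor [[1, 0], [rho, c]] applied to (z1, z2).
        xs.append(z1)
        ys.append(rho * z1 + c * z2)
    return xs, ys


def sample_correlation(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


xs, ys = correlated_white_noise(20000, 0.6, seed=1)
print(round(sample_correlation(xs, ys), 2))
```

In the frequency-dependent setting of the abstract, the analogous factorization is applied to the cross-spectral matrix at each frequency instead of to a single constant matrix.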
ERIC Educational Resources Information Center
Acar, Tülin
2014-01-01
In literature, it has been observed that many enhanced criteria are limited by factor analysis techniques. Besides examinations of statistical structure and/or psychological structure, such validity studies as cross validation and classification-sequencing studies should be performed frequently. The purpose of this study is to examine cross…
2011-01-01
Background Health-related quality of life (HRQoL) assessment, encompassing the adolescents' perceptions of their mental, physical, and social health and well-being is increasingly considered an important outcome to be used to identify population health needs and to provide targeted medical care. Although validated instruments are essential for accurately assessing HRQoL outcomes, there are few cross-culturally adapted tools for use in Brazil, and none designed exclusively for use among adolescents. The Vécu et Santé Perçue de l'Adolescent (VSP-A) is a generic, multidimensional self-reported instrument originally developed and validated in France that evaluates HRQoL of ill and healthy adolescents. Purpose To cross-culturally adapt and validate the Brazilian-Portuguese version of the VSP-A, a generic HRQoL measure for adolescents originally developed in France. Methods The VSP-A was translated following a well-validated forward-backward process leading to the Brazilian version. The psychometric evaluation was conducted in a sample of 446 adolescents (14-18 years) attending 2 public high schools of São Gonçalo City. The adolescents self-reported the Brazilian VSP-A, the validated Psychosomatic Symptom Checklist and socio-demographic information. A retest evaluation was carried out on a sub-sample (n = 195) at a two-week interval. The internal construct validity was assessed through confirmatory factor analysis (CFA), multi-trait scaling analyses, Rasch analysis evaluating unidimensionality of each scale and Cronbach's alpha coefficients. The reproducibility was evaluated by intra-class correlation coefficients (ICC). Zumbo's ordinal logistic regression analysis was used to detect differential item functioning (DIF) between the Brazilian and the French items. External construct validity was investigated testing expected differences between groups using one-way analysis of variance (ANOVA), Mann-Whitney tests and the univariate general regression linear model. 
Results CFA showed an acceptable fit (RMSEA=0.05; CFI=0.93); 94% of scaling success was found for item-internal consistency and 98% for item discriminant validity. The items showed good fit to the Rasch model except 3 items with an INFIT at the upper threshold. Cronbach's Alpha ranged from 0.60 to 0.85. Test-retest reliability was moderate to good (ICC=0.55-0.82). DIF was evidenced in 4 out of 36 items. Expected patterns of differences were confirmed with significantly lower physical, psychological well being and vitality reported by symptomatic adolescents. Conclusions Although DIF in few items and responsiveness must be further explored, the Brazilian version of VSP-A demonstrated an acceptable validity and reliability in adolescents attending school and might serve as a starting point for more specific clinical investigations. PMID:21272317
Chabrera, Carolina; Areal, Joan; Font, Albert; Caro, Mónica; Bonet, Marta; Zabalegui, Adelaida
2015-01-01
The aim of this study is to develop a Spanish version of the Satisfaction With Decision scale (SWDs) and analyse its psychometric properties of validity and reliability. This was an observational, descriptive study to validate a tool for measuring satisfaction with the decision, conducted in the Urology, Radiation oncology, and Medical oncology Departments of the Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia and the Institut Oncològic del Vallès - Hospital General de Catalunya. The participants were 170 people diagnosed with prostate cancer who could read and write in Spanish and gave their informed consent. A translation, back-translation and cross-cultural adaptation to Spanish was performed on the SWDs. The content validity, criterion validity, construct validity and reliability (internal consistency and stability) of the Spanish version were evaluated. The SWDs contains 6 items with 5-item Likert scales. A Spanish version (ESD) was obtained that was linguistically and conceptually equivalent to the original version. For criterion validity, the correlation between the ESD and "satisfaction with the decision" measured on a linear analogue scale was significant (r=0.63, P<.01) for all items. The factorial analysis showed a single dimension explaining 82.08% of the variance. The ESD showed excellent results in terms of internal consistency (Cronbach alpha=0.95) and good test-retest reliability with an intraclass correlation coefficient of 0.711. The ESD is a validated Spanish scale to measure satisfaction with the decisions taken in health, and demonstrates adequate validity and reliability. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Aldrees, Turki; Almubarak, Zaid; Hassouneh, Basil; Albosaily, Ahamed; Aloulah, Mohammad; Almasoud, Mai; Alsaleh, Saad
2018-01-01
Disease-specific quality of life instruments assess the impact of chronic rhinosinusitis on patients' quality of life (QoL). To the extent of our knowledge, there are no Arabic versions of two instruments, the Rhinosinusitis Disability Index (RSDI) and the Chronic Sinusitis Survey (CSS). Develop an Arabic-validated version of both instruments, thus allowing their use among the Arabic population. Prospective cross-sectional study for instrument validation. Tertiary university hospital. This study was conducted between September 2015 and October 2016. We followed the international comprehensive guidelines for translation and cross-cultural adaptation of QoL instruments. Test-retest reliability, discriminant validity, and responsiveness ability of both the RSDI and CSS Arabic versions. The sample comprised 124 participants: 75 patients diagnosed with chronic rhinosinusitis and 49 healthy control subjects. The Arabic version of both instruments showed high internal consistency (Cronbach's alpha: RSDI=0.97, CSS=.88) and the ability to differentiate between diseased and healthy volunteers (P < .0001). The translated versions also detected significant change in response to an intervention (P < .0001). These Arabic validated versions of the RSDI and CSS can be used for both clinical and research purposes. This study was performed in only one tertiary hospital. None.
Schnyder, Manuela; Deplazes, Peter
2012-11-13
Dirofilaria immitis and Angiostrongylus vasorum are both important potentially fatal canine nematodes with overlapping endemic areas, especially in Europe. The preadult and adult stages of both species are living in the Arteria pulmonalis and the right heart, and diagnostically detectable circulating parasite antigens have been demonstrated for both species. For the detection of D. immitis infections, a variety of commercial tests have been developed, however, they have not been evaluated for cross-reactions against circulating antigens of A. vasorum. In this study, potential cross-reactions of sera from 16 dogs, which were experimentally infected with A. vasorum and which had circulating antigens as confirmed by a species-specific ELISA, were evaluated for the detection of A. vasorum antigen in six commercially available D. immitis test kits. In three fast tests (Witness® Dirofilaria, SensPERT® Canine Heartworm, SNAP® 4Dx® Plus), all sera were negative. One fast membrane ELISA (SNAP® HTWM RT Test) was positive with four sera (25%), and one serum delivered a non-valid result twice. In the PetChek® HTWM PF Test, depending on the interpretation protocol, 5 or 8 dogs (31.2 - 50%) were positive. With the DiroCHEK®-ELISA, a single A. vasorum-infected dog (6.2%) tested positive. Due to potential cross-reactions with A. vasorum in commercially available test kits for the detection of D. immitis antigen, the simultaneous use of highly specific diagnostic methods for the differentiation of these two canine heart worms is recommended.
Flosadottir, Vala; Roos, Ewa M.; Ageberg, Eva
2017-01-01
Background: The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). Purpose: To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. Results: The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. Conclusion: The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury. PMID:28979920
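The standard error of measurement (SEM) and smallest detectable change (SDC) reported above are usually derived from the score standard deviation and the ICC. The sketch below uses commonly cited formulas, SEM = SD * sqrt(1 - ICC) and SDC = 1.96 * sqrt(2) * SEM, with the group-level SDC obtained by dividing by sqrt(n). The abstract does not say which variant the authors used, so this is an assumption, and the input numbers are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the score standard deviation and a reliability coefficient."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, n=1):
    """SDC at the 95% level; n=1 is the individual level, n>1 the group level."""
    return 1.96 * math.sqrt(2.0) * sem / math.sqrt(n)


# Hypothetical inputs, not the study's data.
sem = standard_error_of_measurement(sd=4.0, icc=0.91)
sdc_individual = smallest_detectable_change(sem)
sdc_group = smallest_detectable_change(sem, n=100)
print(round(sem, 2), round(sdc_individual, 2), round(sdc_group, 2))
```

The sqrt(2) factor reflects that a change score combines the measurement error of two administrations, which is why the individual-level SDC is always larger than the SEM.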
Cross-Validating Chinese Language Mental Health Recovery Measures in Hong Kong
ERIC Educational Resources Information Center
Bola, John; Chan, Tiffany Hill Ching; Chen, Eric HY; Ng, Roger
2016-01-01
Objectives: Promoting recovery in mental health services is hampered by a shortage of reliable and valid measures, particularly in Hong Kong. We seek to cross validate two Chinese language measures of recovery and one of recovery-promoting environments. Method: A cross-sectional survey of people recovering from early episode psychosis (n = 121)…
Helgadóttir, Halla; Gudmundsson, Ólafur Ó; Baldursson, Gísli; Magnússon, Páll; Blin, Nicolas; Brynjólfsdóttir, Berglind; Emilsdóttir, Ásdís; Gudmundsdóttir, Gudrún B; Lorange, Málfrídur; Newman, Paula K; Jóhannesson, Gísli H; Johnsen, Kristinn
2015-01-01
Objectives The aim of this study was to develop and test, for the first time, a multivariate diagnostic classifier of attention deficit hyperactivity disorder (ADHD) based on EEG coherence measures and chronological age. Setting The participants were recruited in two specialised centres and three schools in Reykjavik. Participants The data are from a large cross-sectional cohort of 310 patients with ADHD and 351 controls, covering an age range from 5.8 to 14 years. ADHD was diagnosed according to the Diagnostic and Statistical Manual of Mental Disorders fourth edition (DSM-IV) criteria using the K-SADS-PL semistructured interview. Participants in the control group were reported to be free of any mental or developmental disorders by their parents and had a score of less than 1.5 SDs above the age-appropriate norm on the ADHD Rating Scale-IV. Other than moderate or severe intellectual disability, no additional exclusion criteria were applied in order that the cohort reflected the typical cross section of patients with ADHD. Results Diagnostic classifiers were developed using statistical pattern recognition for the entire age range and for specific age ranges and were tested using cross-validation and by application to a separate cohort of recordings not used in the development process. The age-specific classification approach was more accurate (76% accuracy in the independent test cohort; 81% cross-validation accuracy) than the age-independent version (76%; 73%). Chronological age was found to be an important classification feature. Conclusions The novel application of EEG-based classification methods presented here can offer significant benefit to the clinician by improving both the accuracy of initial diagnosis and ongoing monitoring of children and adolescents with ADHD. 
The most accurate possible diagnosis at a single point in time can be obtained by the age-specific classifiers, but the age-independent classifiers are also useful as they enable longitudinal monitoring of brain function. PMID:25596195
Harris, Alex Hs; Kuo, Alfred C; Bowe, Thomas; Gupta, Shalini; Nordin, David; Giori, Nicholas J
2018-05-01
Statistical models to preoperatively predict patients' risk of death and major complications after total joint arthroplasty (TJA) could improve the quality of preoperative management and informed consent. Although risk models for TJA exist, they have limitations including poor transparency and/or unknown or poor performance. Thus, it is currently impossible to know how well currently available models predict short-term complications after TJA, or if newly developed models are more accurate. We sought to develop and conduct cross-validation of predictive risk models, and report details and performance metrics as benchmarks. Over 90 preoperative variables were used as candidate predictors of death and major complications within 30 days for Veterans Health Administration patients with osteoarthritis who underwent TJA. Data were split into 3 samples-for selection of model tuning parameters, model development, and cross-validation. C-indexes (discrimination) and calibration plots were produced. A total of 70,569 patients diagnosed with osteoarthritis who received primary TJA were included. C-statistics and bootstrapped confidence intervals for the cross-validation of the boosted regression models were highest for cardiac complications (0.75; 0.71-0.79) and 30-day mortality (0.73; 0.66-0.79) and lowest for deep vein thrombosis (0.59; 0.55-0.64) and return to the operating room (0.60; 0.57-0.63). Moderately accurate predictive models of 30-day mortality and cardiac complications after TJA in Veterans Health Administration patients were developed and internally cross-validated. By reporting model coefficients and performance metrics, other model developers can test these models on new samples and have a procedure and indication-specific benchmark to surpass. Published by Elsevier Inc.
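For a binary outcome such as 30-day mortality, the C-statistic reported above is the probability that a randomly chosen patient with the event received a higher predicted risk than a randomly chosen patient without it. A minimal pairwise sketch, with ties counted as one half; the function name and example values are illustrative assumptions:

```python
def c_index(risks, events):
    """Concordance for a binary outcome: share of (event, non-event) pairs
    in which the event case has the higher predicted risk (ties count 0.5)."""
    event_risks = [r for r, e in zip(risks, events) if e]
    other_risks = [r for r, e in zip(risks, events) if not e]
    if not event_risks or not other_risks:
        raise ValueError("need at least one event and one non-event")
    concordant = 0.0
    for a in event_risks:
        for b in other_risks:
            if a > b:
                concordant += 1.0
            elif a == b:
                concordant += 0.5
    return concordant / (len(event_risks) * len(other_risks))


# Hypothetical predicted risks and observed outcomes (1 = event occurred).
risks = [0.9, 0.8, 0.3, 0.2, 0.5]
events = [1, 0, 1, 0, 0]
print(round(c_index(risks, events), 3))
```

A value of 0.5 corresponds to chance-level discrimination and 1.0 to perfect ranking. This quadratic-time loop is fine for illustration, though a rank-based computation would be preferable for a cohort of tens of thousands.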
Multivariate pattern analysis for MEG: A comparison of dissimilarity measures.
Guggenmos, Matthias; Sterzer, Philipp; Cichy, Radoslaw Martin
2018-06-01
Multivariate pattern analysis (MVPA) methods such as decoding and representational similarity analysis (RSA) are growing rapidly in popularity for the analysis of magnetoencephalography (MEG) data. However, little is known about the relative performance and characteristics of the specific dissimilarity measures used to describe differences between evoked activation patterns. Here we used a multisession MEG data set to qualitatively characterize a range of dissimilarity measures and to quantitatively compare them with respect to decoding accuracy (for decoding) and between-session reliability of representational dissimilarity matrices (for RSA). We tested dissimilarity measures from a range of classifiers (Linear Discriminant Analysis - LDA, Support Vector Machine - SVM, Weighted Robust Distance - WeiRD, Gaussian Naïve Bayes - GNB) and distances (Euclidean distance, Pearson correlation). In addition, we evaluated three key processing choices: 1) preprocessing (noise normalisation, removal of the pattern mean), 2) weighting decoding accuracies by decision values, and 3) computing distances in three different partitioning schemes (non-cross-validated, cross-validated, within-class-corrected). Four main conclusions emerged from our results. First, appropriate multivariate noise normalization substantially improved decoding accuracies and the reliability of dissimilarity measures. Second, LDA, SVM and WeiRD yielded high peak decoding accuracies and nearly identical time courses. Third, while using decoding accuracies for RSA was markedly less reliable than continuous distances, this disadvantage was ameliorated by decision-value-weighting of decoding accuracies. Fourth, the cross-validated Euclidean distance provided unbiased distance estimates and highly replicable representational dissimilarity matrices. 
Overall, we strongly advise the use of multivariate noise normalisation as a general preprocessing step, recommend LDA, SVM and WeiRD as classifiers for decoding and highlight the cross-validated Euclidean distance as a reliable and unbiased default choice for RSA. Copyright © 2018 Elsevier Inc. All rights reserved.
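The idea behind a cross-validated Euclidean distance can be illustrated with a minimal sketch. It assumes the common formulation in which the condition difference is estimated separately in two independent data partitions and the two estimates are multiplied, so that independent noise averages out to zero instead of inflating the distance. The function names and toy data are hypothetical and are not taken from the study.

```python
def mean_pattern(trials):
    """Average a list of equal-length trial vectors, channel by channel."""
    n = len(trials)
    return [sum(channel) / n for channel in zip(*trials)]


def cv_sq_euclidean(a_part1, b_part1, a_part2, b_part2):
    """Cross-validated squared Euclidean distance between conditions a and b.

    The a-minus-b difference pattern is estimated once per partition and the
    two estimates are combined with a dot product (assumed formulation).
    """
    diff1 = [x - y for x, y in zip(mean_pattern(a_part1), mean_pattern(b_part1))]
    diff2 = [x - y for x, y in zip(mean_pattern(a_part2), mean_pattern(b_part2))]
    return sum(d1 * d2 for d1, d2 in zip(diff1, diff2))


# Toy data: two channels, two partitions (e.g. two halves of the trials).
a_part1 = [[1.0, 2.0], [3.0, 2.0]]
b_part1 = [[0.0, 0.0], [0.0, 2.0]]
a_part2 = [[2.0, 1.0]]
b_part2 = [[1.0, 1.0]]
distance = cv_sq_euclidean(a_part1, b_part1, a_part2, b_part2)
```

Unlike the ordinary squared distance, this estimate can be negative when the two partitions disagree, which is what allows its expected value to be zero when there is no true difference.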
Sex estimation from measurements of the first rib in a contemporary Polish population.
Kubicka, Anna Maria; Piontek, Janusz
2016-01-01
The aim of this study was to evaluate the accuracy of sex assessment using measurements of the first rib from computed tomography (CT) to develop a discriminant formula. Four discriminant formulae were derived based on CT imaging of the right first rib of 85 female and 91 male Polish patients of known age and sex. In direct discriminant analysis, the first equation consisted of all first rib variables; the second included measurements of the rib body; the third comprised only two measurements of the sternal end of the first rib. The stepwise method selected the four best variables from all measurements. The discriminant function equation was then tested on a cross-validated group consisting of 23 females and 24 males. The direct discriminant analysis showed that sex assessment was possible in 81.5% of cases in the first group and in 91.5% in the cross-validated group when all variables for the first rib were included. The average accuracy for the original group for rib body and sternal end was 80.9 and 67.9%, respectively. The percentages of correctly assigned individuals for the functions based on the rib body and sternal end in the cross-validated group were 76.6 and 85.0%, respectively. Higher average accuracies were obtained for stepwise discriminant analysis: 83.1% for the original group and 91.2% for the cross-validated group. The exterior edge, anterior-posterior of the sternal end, and depth of the arc were the most reliable parameters. Our results suggest that the first rib is dimorphic and that the described method can be used for sex assessment.
Validity and cross-cultural adaptation of the Persian version of the Oxford Elbow Score.
Ebrahimzadeh, Mohammad H; Kachooei, Amir Reza; Vahedi, Ehsan; Moradi, Ali; Mashayekhi, Zeinab; Hallaj-Moghaddam, Mohammad; Azami, Mehran; Birjandinejad, Ali
2014-01-01
The Oxford Elbow Score (OES) is a patient-reported questionnaire used to assess outcomes after elbow surgery. The aim of this study was to validate the OES and adapt it into the Persian language. After forward-backward translation of the OES into Persian, a total of 92 patients who had undergone elbow surgery completed the Persian OES along with the Persian DASH and SF-36. To assess test-retest reliability, 31 randomly selected patients (34%) completed the Persian OES again after three days while abstaining from all forms of therapeutic regimens. Reliability of the Persian OES was assessed by measuring the intraclass correlation coefficient (ICC) for test-retest reliability and Cronbach's alpha for internal consistency. Spearman's correlation coefficient was used to test construct validity. Cronbach's alpha coefficient was 0.92, showing excellent reliability. Cronbach's alpha for the function, pain, and social-psychological subscales was 0.95, 0.86, and 0.85, respectively. The ICC was 0.85 for the overall questionnaire and 0.90, 0.76, and 0.75 for the function, pain, and social-psychological subscales, respectively. Construct validity was confirmed, as the Spearman correlation between OES and DASH was 0.80. The Persian OES is a valid and reliable patient-reported outcome measure to assess postsurgical elbow status in the Persian-speaking population.
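For readers unfamiliar with the internal-consistency statistic reported here, the following is a minimal sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy responses are illustrative assumptions; they are not the study's data or software.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for the items and the totals.
    """
    k = len(scores[0])
    item_variances = [variance(item) for item in zip(*scores)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Three hypothetical respondents answering two items.
toy_scores = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy_scores)
```

Alpha approaches 1 when the items rise and fall together across respondents, and it drops as the items become less correlated.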
Features of Cross-Correlation Analysis in a Data-Driven Approach for Structural Damage Assessment
Camacho Navarro, Jhonatan; Ruiz, Magda; Villamizar, Rodolfo; Mujica, Luis
2018-01-01
This work discusses the advantage of using cross-correlation analysis in a data-driven approach based on principal component analysis (PCA) and piezodiagnostics to obtain successful diagnosis of events in structural health monitoring (SHM). In this sense, the identification of noisy data and outliers, as well as the management of data cleansing stages can be facilitated through the implementation of a preprocessing stage based on cross-correlation functions. Additionally, this work evidences an improvement in damage detection when the cross-correlation is included as part of the whole damage assessment approach. The proposed methodology is validated by processing data measurements from piezoelectric devices (PZT), which are used in a piezodiagnostics approach based on PCA and baseline modeling. Thus, the influence of cross-correlation analysis used in the preprocessing stage is evaluated for damage detection by means of statistical plots and self-organizing maps. Three laboratory specimens were used as test structures in order to demonstrate the validity of the methodology: (i) a carbon steel pipe section with leak and mass damage types, (ii) an aircraft wing specimen, and (iii) a blade of a commercial aircraft turbine, where damages are specified as mass-added. As the main concluding remark, the suitability of cross-correlation features combined with a PCA-based piezodiagnostic approach in order to achieve a more robust damage assessment algorithm is verified for SHM tasks. PMID:29762505
Features of Cross-Correlation Analysis in a Data-Driven Approach for Structural Damage Assessment.
Camacho Navarro, Jhonatan; Ruiz, Magda; Villamizar, Rodolfo; Mujica, Luis; Quiroga, Jabid
2018-05-15
Mikkonen, Kristina; Elo, Satu; Miettunen, Jouko; Saarikoski, Mikko; Kääriäinen, Maria
2017-08-01
The purpose of this study was to develop and test the psychometric properties of the new Cultural and Linguistic Diversity scale, which is designed to be used with the newly validated Clinical Learning Environment, Supervision and Nurse Teacher scale for assessing international nursing students' clinical learning environments. In various developed countries, clinical placements are known to present challenges in the professional development of international nursing students. A cross-sectional survey. Data were collected from eight Finnish universities of applied sciences offering nursing degree courses taught in English during 2015-2016. All the relevant students (N = 664) were invited and 50% chose to participate. Of the total data submitted by the participants, 28% were used for scale validation. The construct validity of the two scales was tested by exploratory factor analysis, while their validity with respect to convergence and discriminability was assessed using Spearman's correlation. Construct validation of the Clinical Learning Environment, Supervision and Nurse Teacher scale yielded an eight-factor model with 34 items, while validation of the Cultural and Linguistic Diversity scale yielded a five-factor model with 21 items. A new scale was developed to improve evidence-based mentorship of international nursing students in clinical learning environments. The instrument will be useful to educators seeking to identify factors that affect the learning of international students. © 2017 John Wiley & Sons Ltd.
Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam
2016-03-03
The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study, 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two-way random effects model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for the test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at the 95% confidence level (MDC95). For assessing construct validity, it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SEMdiff of 0.36 and an MDC95 of 0.7. The correlation between the CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r = 0.69). Conclusions: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish-speaking parts of the world. The use of standardized outcome measures across countries will also facilitate comparison of research results in carpal tunnel syndrome.
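The precision measures named in this abstract are commonly computed with the textbook relations SEM = SD * sqrt(1 - reliability), SEMdiff = sqrt(2) * SEM, and MDC95 = 1.96 * SEMdiff. The sketch below assumes those relations; the abstract does not state which standard deviation or reliability coefficient was used, so the function names and the example inputs are illustrative only.

```python
import math


def sem(sd, reliability):
    """Standard error of measurement from a score SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - reliability)


def sem_diff(sem_value):
    """Standard error of the difference between two measurements."""
    return math.sqrt(2.0) * sem_value


def mdc95(sem_diff_value):
    """Minimal detectable change at the 95% confidence level."""
    return 1.96 * sem_diff_value


# Hypothetical inputs: SD of 1.0 scale points and a reliability of 0.75.
example_sem = sem(1.0, 0.75)
example_mdc = mdc95(sem_diff(example_sem))
```

Under the assumed relation, the reported difference value of 0.36 gives 1.96 * 0.36 = 0.71, which agrees with the reported MDC95 of 0.7 after rounding.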
Real-time sensor data validation
NASA Technical Reports Server (NTRS)
Bickmore, Timothy W.
1994-01-01
This report describes the status of an on-going effort to develop software capable of detecting sensor failures on rocket engines in real time. This software could be used in a rocket engine controller to prevent the erroneous shutdown of an engine due to sensor failures which would otherwise be interpreted as engine failures by the control software. The approach taken combines analytical redundancy with Bayesian belief networks to provide a solution which has well defined real-time characteristics and well-defined error rates. Analytical redundancy is a technique in which a sensor's value is predicted by using values from other sensors and known or empirically derived mathematical relations. A set of sensors and a set of relations among them form a network of cross-checks which can be used to periodically validate all of the sensors in the network. Bayesian belief networks provide a method of determining if each of the sensors in the network is valid, given the results of the cross-checks. This approach has been successfully demonstrated on the Technology Test Bed Engine at the NASA Marshall Space Flight Center. Current efforts are focused on extending the system to provide a validation capability for 100 sensors on the Space Shuttle Main Engine.
Casemix classification payment for sub-acute and non-acute inpatient care, Thailand.
Khiaocharoen, Orathai; Pannarunothai, Supasit; Zungsontiporn, Chairoj; Riewpaiboon, Wachara
2010-07-01
Thailand needs casemix classifications other than DRG as a payment mechanism for sub-acute and non-acute inpatient care. The objective was to develop a casemix classification for sub-acute and non-acute inpatient services. The study began with developing a classification system, analyzing cost, and assigning payment weights, and ended with testing the validity of this new casemix system. Coefficient of variation, reduction in variance, linear regression, and split-half cross-validation were employed. The casemix for sub-acute and non-acute inpatient services contained 98 groups. Two percent of the groups had a coefficient of variation of cost higher than 1.5. The reduction in variance of cost after the classification was 32%. Two classification variables (physical function and the rehabilitation impairment categories) were key determinants of the cost (adjusted R² = 0.749, p = .001). Validity results of split-half cross-validation of sub-acute and non-acute inpatient service were high. The present study indicated that the casemix for sub-acute and non-acute inpatient services closely predicted hospital resource use and should be further developed for payment of inpatient care in the sub-acute and non-acute phases.
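Two of the statistics used here can be sketched briefly, assuming their usual definitions: the coefficient of variation is the standard deviation of cost divided by mean cost within a group, and the reduction in variance is one minus the ratio of within-group to total sum of squares. The function names and the toy costs are hypothetical, and the choice of the population standard deviation is an assumption.

```python
from statistics import mean, pstdev


def coefficient_of_variation(costs):
    """Population standard deviation of costs divided by their mean."""
    return pstdev(costs) / mean(costs)


def reduction_in_variance(costs_by_group):
    """Share of total cost variance explained by grouping (1 - within SS / total SS)."""
    all_costs = [c for costs in costs_by_group.values() for c in costs]
    grand_mean = mean(all_costs)
    total_ss = sum((c - grand_mean) ** 2 for c in all_costs)
    within_ss = sum(
        (c - mean(costs)) ** 2
        for costs in costs_by_group.values()
        for c in costs
    )
    return 1.0 - within_ss / total_ss


# Two hypothetical casemix groups with two patients each.
toy_groups = {"group_a": [1.0, 3.0], "group_b": [5.0, 7.0]}
riv = reduction_in_variance(toy_groups)
cv_a = coefficient_of_variation(toy_groups["group_a"])
```

A classification with homogeneous groups yields small coefficients of variation and a reduction in variance close to 1.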
Some Long-Standing and Emerging Research Lines in Africa
ERIC Educational Resources Information Center
Serpell, Robert; Marfo, Kofi
2014-01-01
Early research on child development in Africa was dominated by expatriates and was primarily addressed to the topics of testing the cross-cultural validity of theories developed "in the West," and the search for universals. After a brief review of the outcome of that research, we propose two additional types of motivation that seem…
The Development and Testing of a Tool for Analysis of Computer-Mediated Conferencing Transcripts.
ERIC Educational Resources Information Center
Fahy, Patrick J.; Crawford, Gail; Ally, Mohamed; Cookson, Peter; Keller, Verna; Prosser, Frank
2000-01-01
The Zhu model for analyzing computer mediated communications was further developed by an Athabasca University (Alberta) distance education research team based on ease of use, reliability, validity, theoretical support, and cross-discipline utility. Five classification categories of the new model are vertical questioning, horizontal questioning,…
Knowledge Restructuring in Biology: Testing a Punctuated Model of Conceptual Change
ERIC Educational Resources Information Center
Mintzes, Joel; Quinn, Heather J.
2007-01-01
Emerging from a human constructivist view of learning and a punctuated model of conceptual change, these studies explored differences in the structural complexity and content validity of knowledge about prehistoric life depicted in concept maps by learners ranging in age from approximately 10 to 20 years. Study 1 (cross-age) explored the…
Force Project Technology Presentation to the NRCC
2014-02-04
Functional Bridge components Smart Odometer Adv Pretreatment Smart Bridge Multi-functional Gap Crossing Fuel Automated Tracking System Adv...comprehensive matrix of candidate composite material systems and textile reinforcement architectures via modeling/analyses and testing. Product(s...Validated Dynamic Modeling tool based on parametric study using material models to reliably predict the textile mechanics of the hose
ERIC Educational Resources Information Center
Anderson, Ross C.; Pitts, Christine; Smolkowski, Keith
2017-01-01
This study examines measurement of creative ideational behaviors alongside factors of student engagement that may play a role in the development of students' creative potential during early adolescence in school. Two studies used exploratory and confirmatory factor analyses, cross-validation, and invariance testing of 2 extant measures with…
Zumpano, Camila Eugênia; Mendonça, Tânia Maria da Silva; Silva, Carlos Henrique Martins da; Correia, Helena; Arnold, Benjamin; Pinto, Rogério de Melo Costa
2017-01-23
This study aimed to perform the cross-cultural adaptation and validation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Global Health scale in the Portuguese language. The ten Global Health items were cross-culturally adapted by the method proposed in the Functional Assessment of Chronic Illness Therapy (FACIT). The instrument's final version in Portuguese was self-administered by 1,010 participants in Brazil. The scale's precision was verified by floor and ceiling effects analysis, reliability of internal consistency, and test-retest reliability. Exploratory and confirmatory factor analyses were used to assess the construct's validity and the instrument's dimensionality. Calibration of the items used the Graded Response Model proposed by Samejima. Four global items required adjustments after the pretest. Analysis of the psychometric properties showed that the Global Health scale has good reliability, with Cronbach's alpha of 0.83 and intra-class correlation of 0.89. Exploratory and confirmatory factor analyses showed good fit in the previously established two-dimensional model. The Global Physical Health and Global Mental Health scales showed good latent trait coverage according to the Graded Response Model. The PROMIS Global Health items showed equivalence in Portuguese compared to the original version and satisfactory psychometric properties for application in clinical practice and research in the Brazilian population.
Guedes, Keyte; Pereira, Cecília; Pavan, Karina; Valério, Berenice Cataldo Oliveira
2010-02-01
The aim of this study was the cross-cultural adaptation and validation in the Portuguese language of the Amyotrophic Lateral Sclerosis Functional Rating Scale - Revised (ALSFRS-R). We performed a prospective study of individuals with clinically defined amyotrophic lateral sclerosis (ALS). After the final Portuguese version was obtained, the scale was administered to 22 individuals and re-administered three weeks later. There were no significant differences between the application and reapplication of the scale (p=0.069). The linear regression and internal consistency, measured by Pearson correlation and Cronbach's alpha, were significant, with r=0.975 and alpha=0.934. Test-retest reliability, demonstrated by the intraclass correlation coefficient, was strong, with ICC=0.975. Therefore, this version proved to be applicable, reliable and easy to use in clinical practice and research.
Progress on China nuclear data processing code system
NASA Astrophysics Data System (ADS)
Liu, Ping; Wu, Xiaofei; Ge, Zhigang; Li, Songyang; Wu, Haicheng; Wen, Lili; Wang, Wenming; Zhang, Huanyu
2017-09-01
China is developing the nuclear data processing code Ruler, which can be used for producing multi-group cross sections and related quantities from evaluated nuclear data in the ENDF format [1]. Ruler includes modules for reconstructing cross sections over the entire energy range, generating Doppler-broadened cross sections for a given temperature, producing effective self-shielded cross sections in the unresolved energy range, calculating scattering cross sections in the thermal energy range, generating group cross sections and matrices, and preparing WIMS-D format data files for the reactor physics code WIMS-D [2]. Ruler is written in Fortran-90 and has been tested on 32-bit computers with Windows-XP and Linux operating systems. Ruler has been verified by comparison with calculation results obtained by the NJOY99 [3] processing code, and validated using the WIMSD5B code.
Diminutives facilitate word segmentation in natural speech: cross-linguistic evidence.
Kempe, Vera; Brooks, Patricia J; Gillis, Steven; Samson, Graham
2007-06-01
Final-syllable invariance is characteristic of diminutives (e.g., doggie), which are a pervasive feature of the child-directed speech registers of many languages. Invariance in word endings has been shown to facilitate word segmentation (Kempe, Brooks, & Gillis, 2005) in an incidental-learning paradigm in which synthesized Dutch pseudonouns were used. To broaden the cross-linguistic evidence for this invariance effect and to increase its ecological validity, adult English speakers (n=276) were exposed to naturally spoken Dutch or Russian pseudonouns presented in sentence contexts. A forced choice test was given to assess target recognition, with foils comprising unfamiliar syllable combinations in Experiments 1 and 2 and syllable combinations straddling word boundaries in Experiment 3. A control group (n=210) received the recognition test with no prior exposure to targets. Recognition performance improved with increasing final-syllable rhyme invariance, with larger increases for the experimental group. This confirms that word ending invariance is a valid segmentation cue in artificial, as well as naturalistic, speech and that diminutives may aid segmentation in a number of languages.
LeDell, Erin; Petersen, Maya; van der Laan, Mark
In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC.
Empirical Performance of Cross-Validation With Oracle Methods in a Genomics Context.
Martinez, Josue G; Carroll, Raymond J; Müller, Samuel; Sampson, Joshua N; Chatterjee, Nilanjan
2011-11-01
When employing model selection methods with oracle properties such as the smoothly clipped absolute deviation (SCAD) and the Adaptive Lasso, it is typical to estimate the smoothing parameter by m-fold cross-validation, for example, m = 10. In problems where the true regression function is sparse and the signals large, such cross-validation typically works well. However, in regression modeling of genomic studies involving Single Nucleotide Polymorphisms (SNP), the true regression functions, while thought to be sparse, do not have large signals. We demonstrate empirically that in such problems, the number of selected variables using SCAD and the Adaptive Lasso, with 10-fold cross-validation, is a random variable that has considerable and surprising variation. Similar remarks apply to non-oracle methods such as the Lasso. Our study strongly questions the suitability of performing only a single run of m-fold cross-validation with any oracle method, and not just the SCAD and Adaptive Lasso.
Petersen, Maya; van der Laan, Mark
2015-01-01
In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC. PMID:26279737
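As background for the quantity whose variance is being estimated, the following minimal sketch computes a cross-validated AUC as the average of held-out fold AUCs. It does not implement the influence-curve variance estimator described in the abstract. The interleaved fold assignment, the `fit` callback, and the toy one-feature model are all illustrative assumptions.

```python
def auc(labels, scores):
    """AUC as the chance that a random positive outscores a random negative (ties count 1/2)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def cross_validated_auc(x, y, n_folds, fit):
    """Average held-out AUC; `fit(train_x, train_y)` must return a scoring function.

    Sample i goes to fold i % n_folds, so every fold must contain both classes.
    """
    fold_aucs = []
    for k in range(n_folds):
        test_idx = [i for i in range(len(x)) if i % n_folds == k]
        train_idx = [i for i in range(len(x)) if i % n_folds != k]
        score = fit([x[i] for i in train_idx], [y[i] for i in train_idx])
        fold_aucs.append(
            auc([y[i] for i in test_idx], [score(x[i]) for i in test_idx])
        )
    return sum(fold_aucs) / n_folds


def fit_mean_difference(train_x, train_y):
    """Toy model: orient the single feature towards the class with the larger mean."""
    pos = [v for v, lab in zip(train_x, train_y) if lab == 1]
    neg = [v for v, lab in zip(train_x, train_y) if lab == 0]
    sign = 1.0 if sum(pos) / len(pos) >= sum(neg) / len(neg) else -1.0
    return lambda v: sign * v


x = [0.1, 0.2, 0.9, 0.8, 0.3, 0.4, 0.7, 0.6]
y = [0, 0, 1, 1, 0, 0, 1, 1]
cv_auc = cross_validated_auc(x, y, n_folds=2, fit=fit_mean_difference)
```

Each fold requires a full model fit, so repeating the whole procedure on bootstrap resamples multiplies that cost, which is the motivation the abstract gives for a cheaper variance estimate.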
A cross-validation package driving Netica with python
Fienen, Michael N.; Plant, Nathaniel G.
2014-01-01
Bayesian networks (BNs) are powerful tools for probabilistically simulating natural systems and emulating process models. Cross validation is a technique to avoid overfitting resulting from overly complex BNs. Overfitting reduces predictive skill. Cross-validation for BNs is known but rarely implemented due partly to a lack of software tools designed to work with available BN packages. CVNetica is open-source, written in Python, and extends the Netica software package to perform cross-validation and read, rebuild, and learn BNs from data. Insights gained from cross-validation and implications on prediction versus description are illustrated with: a data-driven oceanographic application; and a model-emulation application. These examples show that overfitting occurs when BNs become more complex than allowed by supporting data and overfitting incurs computational costs as well as causing a reduction in prediction skill. CVNetica evaluates overfitting using several complexity metrics (we used level of discretization) and its impact on performance metrics (we used skill).
Nijdam-Jones, Alicia; Rosenfeld, Barry
2017-11-01
The cross-cultural validity of feigning instruments and cut-scores is a critical concern for forensic mental health clinicians. This systematic review evaluated feigning classification accuracy and effect sizes across instruments and languages by summarizing 45 published peer-reviewed articles and unpublished doctoral dissertations conducted in Europe, Asia, and North America using linguistically, ethnically, and culturally diverse samples. The most common psychiatric symptom measures used with linguistically, ethnically, and culturally diverse samples included the Structured Inventory of Malingered Symptomatology, the Miller Forensic Assessment of Symptoms Test, and the Minnesota Multiphasic Personality Inventory (MMPI). The most frequently studied cognitive effort measures included the Word Recognition Test, the Test of Memory Malingering, and the Rey 15-item Memory test. The classification accuracy of these measures is compared and the implications of this research literature are discussed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
High yields of hydrogen production from methanol steam reforming with a cross-U type reactor
Zhang, Shubin; Chen, Junyu; Zhang, Xuelin; Liu, Xiaowei
2017-01-01
This paper presents a numerical and experimental study on the performance of a methanol steam reformer integrated with a hydrogen/air combustion reactor for hydrogen production. A CFD-based 3D model with mass and momentum transport and temperature characteristics is established. The simulation results show that better performance is achieved in the cross-U type reactor compared to either a tubular reactor or a parallel-U type reactor because of more effective heat transfer characteristics. Furthermore, Cu-based micro reformers of both cross-U and parallel-U type reactors are designed, fabricated and tested for experimental validation. Under the same condition for reforming and combustion, the results demonstrate that higher methanol conversion is achievable in cross-U type reactor. However, it is also found in cross-U type reactor that methanol reforming selectivity is the lowest due to the decreased water gas shift reaction under high temperature, thereby carbon monoxide concentration is increased. Furthermore, the reformed gas generated from the reactors is fed into a high temperature proton exchange membrane fuel cell (PEMFC). In the test of discharging for 4 h, the fuel cell fed by cross-U type reactor exhibits the most stable performance. PMID:29121067
High yields of hydrogen production from methanol steam reforming with a cross-U type reactor.
Zhang, Shubin; Zhang, Yufeng; Chen, Junyu; Zhang, Xuelin; Liu, Xiaowei
2017-01-01
NASA Astrophysics Data System (ADS)
Schratz, Patrick; Herrmann, Tobias; Brenning, Alexander
2017-04-01
Computational and statistical prediction methods such as the support vector machine have gained popularity in remote-sensing applications in recent years and are often compared to more traditional approaches like maximum-likelihood classification. However, the accuracy assessment of such predictive models in a spatial context needs to account for the presence of spatial autocorrelation in geospatial data by using spatial cross-validation and bootstrap strategies instead of their now more widely used non-spatial equivalent. The R package sperrorest by A. Brenning [IEEE International Geoscience and Remote Sensing Symposium, 1, 374 (2012)] provides a generic interface for performing (spatial) cross-validation of any statistical or machine-learning technique available in R. Since spatial statistical models as well as flexible machine-learning algorithms can be computationally expensive, parallel computing strategies are required to perform cross-validation efficiently. The most recent major release of sperrorest therefore comes with two new features (aside from improved documentation): The first one is the parallelized version of sperrorest(), parsperrorest(). This function features two parallel modes to greatly speed up cross-validation runs. Both parallel modes are platform independent and provide progress information. par.mode = 1 relies on the pbapply package and calls interactively (depending on the platform) parallel::mclapply() or parallel::parApply() in the background. While forking is used on Unix-Systems, Windows systems use a cluster approach for parallel execution. par.mode = 2 uses the foreach package to perform parallelization. This method uses a different way of cluster parallelization than the parallel package does. In summary, the robustness of parsperrorest() is increased with the implementation of two independent parallel modes. A new way of partitioning the data in sperrorest is provided by partition.factor.cv(). 
This function gives the user the possibility to perform cross-validation at the level of some grouping structure. As an example, in remote sensing of agricultural land uses, pixels from the same field contain nearly identical information and will thus be jointly placed in either the test set or the training set. Other spatial resampling strategies are already available and can be extended by the user.
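Partitioning at the level of a grouping structure can be illustrated with a short language-agnostic sketch, written here in Python rather than R. It is not the sperrorest API: the function name, its arguments, and the field labels are hypothetical, and the only property it demonstrates is that all samples sharing a group label land in the same fold.

```python
import random


def group_folds(groups, n_folds, seed=0):
    """Return a fold index per sample so that each group stays within one fold."""
    unique_groups = sorted(set(groups))
    rng = random.Random(seed)
    rng.shuffle(unique_groups)  # random but reproducible group order
    fold_of_group = {g: i % n_folds for i, g in enumerate(unique_groups)}
    return [fold_of_group[g] for g in groups]


# Hypothetical pixels labelled by the agricultural field they belong to.
fields = ["f1", "f1", "f2", "f2", "f3", "f3", "f4"]
folds = group_folds(fields, n_folds=2)
```

Because pixels from one field never appear on both sides of a split, the held-out error is not flattered by near-duplicate observations in the training set.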
Wang, Yao; Xiao, Lily Dongxia; He, Guo-Ping
2015-02-01
Suboptimal care for people with dementia in hospital settings has been reported and is attributed to the lack of knowledge and inadequate attitudes in dementia care among health professionals. Educational interventions have been widely used to improve care outcomes; however, Chinese-language instruments used in dementia educational interventions for health professionals are lacking. The aims of this study were to select, translate and evaluate instruments used in dementia educational interventions for Chinese health professionals in acute-care hospitals. A cross-sectional study design was used. A modified stratified random sampling was used to recruit 442 participants from different levels of hospitals in Changsha, China. Dementia care competence was used as a framework for the selection and evaluation of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale for health professionals in the study. These two scales were translated into Chinese using forward and back translation method. Content validity, test-retest reliability and internal consistency were assessed. Construct validity was tested using exploratory factor analysis. Known-group validity was established by comparing scores of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale in two sub-groups. A person-centred care scale was utilised as a gold standard to establish concurrent validity of these two scales. Results demonstrated acceptable content validity, internal consistency, test-retest reliability and concurrent validity. Exploratory factor analysis presented a single-factor structure of the Chinese Alzheimer's Disease Knowledge Scale and a two-factor structure of the Chinese Dementia Care Attitudes Scale, supporting the conceptual dimensions of the original scales. 
The Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale demonstrated known-group validity evidenced by significantly higher scores identified from the sub-group with a longer work experience compared to those in the sub-group with less work experience. The use of dementia care competence as a framework to inform the selection and evaluation of instruments used in dementia educational interventions for health professionals has wide applicability in other areas. The results support that Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale are reliable and valid instruments for health professionals to use in acute-care settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Ni, Meng; Brown, Lorna G; Lawler, Danielle; Bean, Jonathan F
2017-07-01
Stair climb power is an important clinical measure of lower-extremity power. The stair climb power test (SCPT) was validated by requiring individuals to climb a full flight of stairs. A 4-step SCPT (4SCPT) would be more clinically feasible and easier to perform, yet its reliability and validity are unknown. To evaluate reliability, validity, and minimal detectable change of 4SCPT among community-dwelling older adults. This study is a cross-sectional analysis of baseline data from a clinical trial. Fifty older adults ≥65 years of age, at risk for mobility decline, consented to participate in this ancillary study. Test-retest reliability was derived from 2 measurements within each participant measured by a single assessor. Pearson correlation analyses among leg power measures (4SCPT, SCPT, single leg press power at 40% and 70% of the 1-repetition maximum [SLP40, SLP70]) were performed. Separate multivariate linear regressions were conducted evaluating the associations between each leg power measure and 2 mobility outcomes, the Short Physical Performance Battery (SPPB) and habitual gait speed (HGS). Minimal detectable change was based on a 90% confidence interval (MDC90). The 4SCPT had excellent test-retest reliability (ICC(2,1) = 0.951), and strong correlation with SCPT, SLP40, and SLP70 (r = 0.85-0.96). The 4SCPT explained a greater amount of variance in the SPPB (R² = 0.31) than other leg power measurements (R² = 0.23-0.25). The 4SCPT (R² = 0.41) and SCPT (R² = 0.42) described equivalent amounts of variance in HGS, and greater than that with SLP40 (R² = 0.28) and SLP70 (R² = 0.30). The MDC90 for 4SCPT was 44.0 watts. This was a cross-sectional analysis within a small, nonrepresentative sample. Interrater reliability was not evaluated. The 4SCPT shows scientific promise as a valid and reliable leg power measurement among community-dwelling older adults. © 2017 American Physical Therapy Association
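The abstract does not say how the MDC90 was computed. A common convention, assumed here, derives it from the standard error of measurement: SEM = SD × √(1 − ICC) and MDC90 = 1.645 × SEM × √2. The sketch below applies that convention; the function name `mdc` and the SD of 100 watts are placeholders, not values from the study.

```python
import math

def mdc(sd, icc, z=1.645):
    """Minimal detectable change from a between-subject SD and a test-retest ICC.

    Assumes the conventional formula z * SEM * sqrt(2), with
    SEM = SD * sqrt(1 - ICC); z = 1.645 corresponds to a 90% interval.
    """
    sem = sd * math.sqrt(1.0 - icc)   # standard error of measurement
    return z * sem * math.sqrt(2.0)   # sqrt(2): a change compares two measurements

# ICC taken from the abstract; the SD of 100 watts is a made-up placeholder.
example_mdc90 = mdc(sd=100.0, icc=0.951)
```

Under this convention a higher ICC shrinks the SEM and therefore the smallest change that can be distinguished from measurement noise.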
Chen, Xin-Lin; Zhong, Liang-Huan; Wen, Yi; Liu, Tian-Wen; Li, Xiao-Ying; Hou, Zheng-Kun; Hu, Yue; Mo, Chuan-Wei; Liu, Feng-Bin
2017-09-15
This review aims to critically appraise and compare the measurement properties of inflammatory bowel disease (IBD)-specific health-related quality of life instruments. Medline, EMBASE and ISI Web of Knowledge were searched from their inception to May 2016. IBD-specific instruments for patients with Crohn's disease, ulcerative colitis or IBD were eligible. The basic characteristics and domains of the instruments were collected. The methodological quality of the studies on measurement properties and the measurement properties of the instruments themselves were assessed. Fifteen IBD-specific instruments were included: twelve for adult IBD patients and three for paediatric IBD patients. All of the instruments were developed in North American and European countries. The following common domains were identified: IBD-related symptoms, physical, emotional and social domains. The methodological quality was satisfactory for content validity; fair in internal consistency, reliability, structural validity, hypotheses testing and criterion validity; and poor in measurement error, cross-cultural validity and responsiveness. For adult IBD patients, the IBDQ-32 and its short version (SIBDQ) had good measurement properties and were the most widely used worldwide. For paediatric IBD patients, the IMPACT-III had good measurement properties and had more translated versions. The methodological quality of most measurement properties needs improvement, especially for measurement error, cross-cultural validity and responsiveness. The IBDQ-32 was the most widely used instrument with good reliability and validity, followed by the SIBDQ and IMPACT-III. Further validation studies are necessary to support the use of other instruments.
Denhartigh, Andrew; Reynolds, Lindsay; Palmer, Katherine; Klein, Frank; Rice, Jennifer; Rejman, John J
2018-05-18
A validation study was conducted for an immunochromatographic method (BetaStar® Advanced for Beta-lactams) for the detection of beta-lactam residues in raw, commingled bovine milk. The assay detected amoxicillin, ampicillin, cloxacillin, penicillin, cephapirin, and ceftiofur below the U.S. Food and Drug Administration tolerance levels but above the maximum sensitivity thresholds established by the National Conference on Interstate Milk Shipments. The results of internal and independent laboratory dose-response studies employing spiked samples were in agreement. The test detected all six drugs at the approximate 90/95% sensitivity levels in milk from cows treated with each drug. Selectivity of the assay was 100%, as no false-positive results were obtained in testing 1148 control milk samples. Testing the estimated 90/95% sensitivity level for amoxicillin (8.5 ppb), ampicillin (6.9 ppb), cloxacillin (8.9 ppb), penicillin (4.2 ppb), and cephapirin (17.6 ppb), and at 100 ppb for each antibiotic, resulted in 94-100% positive tests for each of the beta-lactam drugs. The results of ruggedness experiments established the operating parameter tolerances for the assay. Cross-reactivity testing established that the assay detects certain other beta-lactam drugs, but it does not cross-react with any of 30 drugs belonging to seven different drug classes. Abnormally high bacterial or somatic cell counts in raw milk produced no assay interference.
Automated Cross-Sectional Measurement Method of Intracranial Dural Venous Sinuses.
Lublinsky, S; Friedman, A; Kesler, A; Zur, D; Anconina, R; Shelef, I
2016-03-01
MRV is an important blood vessel imaging and diagnostic tool for the evaluation of stenosis, occlusions, or aneurysms. However, an accurate image-processing tool for vessel comparison is unavailable. The purpose of this study was to develop and test an automated technique for vessel cross-sectional analysis. An algorithm for vessel cross-sectional analysis was developed that included 7 main steps: 1) image registration, 2) masking, 3) segmentation, 4) skeletonization, 5) cross-sectional planes, 6) clustering, and 7) cross-sectional analysis. Phantom models were used to validate the technique. The method was also tested on a control subject and a patient with idiopathic intracranial hypertension (4 large sinuses tested: right and left transverse sinuses, superior sagittal sinus, and straight sinus). The cross-sectional area and shape measurements were evaluated before and after lumbar puncture in patients with idiopathic intracranial hypertension. The vessel-analysis algorithm had a high degree of stability with <3% of cross-sections manually corrected. All investigated principal cranial blood sinuses had a significant cross-sectional area increase after lumbar puncture (P ≤ .05). The average triangularity of the transverse sinuses was increased, and the mean circularity of the sinuses was decreased by 6% ± 12% after lumbar puncture. Comparison of phantom and real data showed that all computed errors were <1 voxel unit, which confirmed that the method provided a very accurate solution. In this article, we present a novel automated imaging method for cross-sectional vessel analysis. The method can provide an efficient quantitative detection of abnormalities in the dural sinuses. © 2016 by American Journal of Neuroradiology.
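The abstract does not define its shape measures. As an illustration of the final cross-sectional analysis step only, the sketch below computes the area of a closed outline with the shoelace formula and a circularity index 4πA/P², which is one common definition and an assumption here. The function name and the example outline are hypothetical.

```python
import math

def area_and_circularity(outline):
    """Area (shoelace formula) and circularity 4*pi*A / P^2 of a closed polygon.

    `outline` is a list of (x, y) vertices in order around the cross-section.
    Circularity is 1.0 for a perfect circle and smaller for other shapes.
    """
    n = len(outline)
    twice_area = 0.0
    perimeter = 0.0
    for i in range(n):
        x0, y0 = outline[i]
        x1, y1 = outline[(i + 1) % n]   # wrap around to close the polygon
        twice_area += x0 * y1 - x1 * y0
        perimeter += math.hypot(x1 - x0, y1 - y0)
    area = abs(twice_area) / 2.0
    return area, 4.0 * math.pi * area / perimeter ** 2

# Hypothetical outline: a unit square.
square_area, square_circularity = area_and_circularity([(0, 0), (1, 0), (1, 1), (0, 1)])
```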
Ruiz, Jonatan R; Ortega, Francisco B; Castro-Piñero, Jose
2014-11-30
We investigated the criterion-related validity and the reliability of the 1/4 mile run-walk test (MRWT) in children and adolescents. A total of 86 children (n=42 girls) completed a maximal graded treadmill test using a gas analyzer and the 1/4MRWT. We investigated the test-retest reliability of the 1/4MRWT in a different group of children and adolescents (n=995, n=418 girls). The 1/4MRWT time, sex, and BMI significantly contributed to predict measured VO2peak (R² = 0.32). There was no systematic bias in the cross-validation group (P>0.1). The root mean sum of squared errors (RMSE) and the percentage error were 6.9 ml/kg/min and 17.7%, respectively, and the accurate prediction (i.e. the percentage of estimations within ±4.5 ml/kg/min of VO2peak) was 48.8%. The reliability analysis showed that the mean inter-trial difference ranged from 0.6 seconds in children aged 6-11 years to 1.3 seconds in adolescents aged 12-17 years (all P. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
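The three accuracy figures reported above (RMSE, percentage error, and the share of estimates within ±4.5 ml/kg/min) can be computed as in the following sketch. The abstract does not define "percentage error"; expressing the RMSE relative to the mean measured value is an assumption, and the function name and data are hypothetical.

```python
import math

def prediction_errors(measured, predicted, tolerance=4.5):
    """Return (RMSE, percentage error, % of predictions within +/- tolerance)."""
    n = len(measured)
    diffs = [p - m for m, p in zip(measured, predicted)]
    rmse = math.sqrt(sum(d * d for d in diffs) / n)
    # Assumed definition: RMSE as a percentage of the mean measured value.
    pct_error = 100.0 * rmse / (sum(measured) / n)
    within = 100.0 * sum(abs(d) <= tolerance for d in diffs) / n
    return rmse, pct_error, within

# Made-up VO2peak values in ml/kg/min, for illustration only.
rmse, pct_error, within = prediction_errors([40, 45, 50, 55], [43, 39, 52, 55])
```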
Van de Vossenberg, B T L H; Van der Straten, M J
2014-08-01
The genus Spodoptera comprises 31 species, 4 of which are listed as quarantine pests for the European Union: Spodoptera eridania (Cramer), Spodoptera frugiperda (Smith), Spodoptera littoralis (Boisduval), and Spodoptera litura (F.). In international trade, the earlier life stages (eggs and larvae) are the ones most frequently intercepted at points of inspection, which limits the possibilities of morphological identification. To realize a rapid and reliable identification for all stages, we developed and validated four simplex real-time polymerase chain reaction identification tests based on the mitochondrial cytochrome b gene using dual-labeled hydrolysis probes. Method validation on dilutions of extracted DNA of the target organisms showed that low levels of template (up to 0.2-100 pg) can reliably be identified. No cross-reactivity was observed with 14 nontarget Spodoptera and 5 non-Spodoptera species in the specific Spodoptera tests. The tests proved to be repeatable, reproducible (both 100%), and robust. The new Spodoptera tests have proven to be suitable tools for routine identification of all life stages of S. eridania, S. frugiperda, S. littoralis, and S. litura.
Olderbak, Sally; Wilhelm, Oliver; Olaru, Gabriel; Geiger, Mattis; Brenneman, Meghan W.; Roberts, Richard D.
2015-01-01
The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum). However, little is known about the test's psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test's reliance on one's vocabulary and evaluate the short-form structure in clinical populations. PMID:26500578
Sobański, Jerzy A; Klasa, Katarzyna; Rutkowski, Krzysztof; Dembińska, Edyta; Müldner-Nieckowski, Łukasz; Cyranka, Katarzyna
2013-01-01
Assessment of reliability, cross-validity and usefulness in everyday clinical practice of two related tools: Social Avoidance and Distress Scale (SAD) and Fear of Negative Evaluation Scale (FNE). Analysis of test results of 453 females and 172 males diagnosed in the years 2008-2010 in the Outpatient Clinic for Neurotic and Behavioral Disorders of the Cracow University Hospital, including, inter alia, results of the questionnaires SAD and FNE. The scales have been, with the consent of their authors (R. Friend) and the copyright holder (APA), translated into Polish and back-translated. Subjects also completed the symptom checklist KO '0' (n = 512), and neurotic personality questionnaire KON-2006 (n = 505), as well as the NEO-PI-R personality inventory (n = 46). The reliability and cross-validity coefficients of Polish versions were assessed in the patient population and their results were compared with those of the group of 75 medical students. The translation was verified by retranslation. The reliability coefficients of the Polish versions of the SAD and FNE scales turned out to be high: Cronbach's alpha coefficient was 0.94 for both scales, and Guttman's split-half reliability coefficient was 0.93. Correlations with symptom checklist KO '0' and neurotic personality questionnaire KON-2006, as well as with the NEO-PI-R personality inventory, were significant and indicate a good cross-validity of the analyzed tools. The average results in the patient population for both scales were significantly higher than the results in the preliminary control group of medical students. Polish versions of SAD and FNE questionnaires, like their other translations from English, proved to be reliable and have a high cross-validity with other original Polish tools used in the diagnosis of neurotic disorders, which supports recommending them for use in further studies, including comparisons of healthy persons with those suffering from a variety of neurotic disorders.
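Cronbach's alpha, the internal-consistency coefficient reported above, is computed from the item variances and the variance of the total score: α = k/(k − 1) × (1 − Σ item variances / total variance). The sketch below is a generic illustration with made-up responses; the function name is hypothetical and nothing here reproduces the study data.

```python
from statistics import pvariance

def cronbach_alpha(item_scores):
    """Cronbach's alpha for a table with one row per respondent, one column per item."""
    k = len(item_scores[0])
    item_vars = [pvariance([row[j] for row in item_scores]) for j in range(k)]
    total_var = pvariance([sum(row) for row in item_scores])
    # The same variance estimator is used in numerator and denominator,
    # so population vs. sample variance does not change the ratio.
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)

# Made-up responses: two items that rank respondents identically.
alpha_consistent = cronbach_alpha([[1, 1], [2, 2], [3, 3]])
```

The function assumes at least two items and a non-zero variance of the total score.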
Joint multifractal analysis based on wavelet leaders
NASA Astrophysics Data System (ADS)
Jiang, Zhi-Qiang; Yang, Yan-Hong; Wang, Gang-Jin; Zhou, Wei-Xing
2017-12-01
Mutually interacting components form complex systems and these components usually have long-range cross-correlated outputs. Using wavelet leaders, we propose a method for characterizing the joint multifractal nature of these long-range cross correlations; we call this method joint multifractal analysis based on wavelet leaders (MF-X-WL). We test the validity of the MF-X-WL method by performing extensive numerical experiments on dual binomial measures with multifractal cross correlations and bivariate fractional Brownian motions (bFBMs) with monofractal cross correlations. Both experiments indicate that MF-X-WL is capable of detecting cross correlations in synthetic data with acceptable estimating errors. We also apply the MF-X-WL method to pairs of series from financial markets (returns and volatilities) and online worlds (online numbers of different genders and different societies) and determine intriguing joint multifractal behavior.
We propose a Phase 2 (large cross-sectional) PRoBE-compliant validation trial of stool-based and serum-based tests for the detection of colorectal neoplasia (1). The trial is powered to detect early stage colorectal adenocarcinoma or high grade dysplasia. This is the most stringent, conservative approach to the early diagnosis of colonic neoplasia and addresses the most important endpoint of identifying individuals with curable, early stage cancer and those with very high risk non-invasive neoplasia (high grade dysplasia).
da Costa, Filipa Alves; Ribeiro, Manuel Castro; Braga, Sofia; Carvalho, Elisabete; Francisco, Fátima; Miranda, Ana Costa; Moreira, António; Fallowfield, Lesley
2016-09-01
The increasing survivor population of breast cancer has shifted research and practice interests toward the impact of the disease and its treatment on quality of life. The lack of tools available in Portuguese to objectively evaluate sexual function led to the development of this study, which aimed to cross-culturally adapt and validate the Sexual Activity Questionnaire for use in Portugal. The questionnaire was translated and back-translated, refined following face-to-face interviews with seven breast cancer survivors, and then self-administered by a larger sample at baseline and a fortnight later to test validity and reliability. Following cognitive debriefing (n = 7), minor changes were made and the Sexual Activity Questionnaire was then tested with 134 breast cancer survivors. A 3-factor structure explained 75.5% of the variance, comprising the Pleasure, Habit and Discomfort scales, all yielding good internal consistency (Cronbach's α > 0.70). Concurrent validity with the FACT-An and the BCPT checklist was good (Spearman's r > 0.65; p-value < 0.001) and reliability acceptable (Cohen's κ > 0.444). The Sexual Activity Questionnaire allowed the identification of 23.9% of sexually inactive women, for whom the main reasons were lack of interest or motivation and not having a partner. Patient-reported outcomes led to a more comprehensive and improved approach to cancer, tackling areas previously abandoned. Future research should focus on the validation of this scale in samples with different characteristics and even in the overall population to enable generalizability of the findings. The adapted Sexual Activity Questionnaire is a valid tool for assessing sexual function in breast cancer survivors in Portugal.
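Test-retest agreement on categorical items, as summarized above by Cohen's kappa, corrects the observed agreement for the agreement expected by chance: κ = (p_o − p_e) / (1 − p_e). The following sketch uses made-up answers from two administrations; the function name and data are illustrative only.

```python
from collections import Counter

def cohens_kappa(first, second):
    """Chance-corrected agreement between two categorical ratings of the same subjects."""
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    c1, c2 = Counter(first), Counter(second)
    # Chance agreement: product of the marginal proportions, summed over categories.
    expected = sum(c1[c] * c2[c] for c in c1) / (n * n)
    return (observed - expected) / (1.0 - expected)

# Made-up answers at baseline and at retest.
kappa = cohens_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"])
```

The coefficient is undefined when both administrations give the same single category to everyone, because the expected agreement is then 1.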
Zammit, Andrea R; Hall, Charles B; Lipton, Richard B; Katz, Mindy J; Muniz-Terrera, Graciela
2018-05-01
The aim of this study was to identify natural subgroups of older adults based on cognitive performance, and to establish each subgroup's characteristics based on demographic factors, physical function, psychosocial well-being, and comorbidity. We applied latent class (LC) modeling to identify subgroups in baseline assessments of 1345 Einstein Aging Study (EAS) participants free of dementia. The EAS is a community-dwelling cohort study of 70+ year-old adults living in the Bronx, NY. We used 10 neurocognitive tests and 3 covariates (age, sex, education) to identify latent subgroups. We used goodness-of-fit statistics to identify the optimal class solution and assess model adequacy. We also validated our model using two-fold split-half cross-validation. The sample had a mean age of 78.0 (SD=5.4) and a mean of 13.6 years of education (SD=3.5). A 9-class solution based on cognitive performance at baseline was the best-fitting model. We characterized the 9 identified classes as (i) disadvantaged, (ii) poor language, (iii) poor episodic memory and fluency, (iv) poor processing speed and executive function, (v) low average, (vi) high average, (vii) average, (viii) poor executive and poor working memory, (ix) elite. The cross validation indicated stable class assignment with the exception of the average and high average classes. LC modeling in a community sample of older adults revealed 9 cognitive subgroups. Assignment of subgroups was reliable and associated with external validators. Future work will test the predictive validity of these groups for outcomes such as Alzheimer's disease, vascular dementia and death, as well as markers of biological pathways that contribute to cognitive decline. (JINS, 2018, 24, 511-523).
Validation of the Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM).
Willis, Michael; Johansen, Pierre; Nilsson, Andreas; Asseburg, Christian
2017-03-01
The Economic and Health Outcomes Model of Type 2 Diabetes Mellitus (ECHO-T2DM) was developed to address study questions pertaining to the cost-effectiveness of treatment alternatives in the care of patients with type 2 diabetes mellitus (T2DM). Naturally, the usefulness of a model is determined by the accuracy of its predictions. A previous version of ECHO-T2DM was validated against actual trial outcomes and the model predictions were generally accurate. However, there have been recent upgrades to the model, which modify model predictions and necessitate an update of the validation exercises. The objectives of this study were to extend the methods available for evaluating model validity, to conduct a formal model validation of ECHO-T2DM (version 2.3.0) in accordance with the principles espoused by the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) and the Society for Medical Decision Making (SMDM), and secondarily to evaluate the relative accuracy of four sets of macrovascular risk equations included in ECHO-T2DM. We followed the ISPOR/SMDM guidelines on model validation, evaluating face validity, verification, cross-validation, and external validation. Model verification involved 297 'stress tests', in which specific model inputs were modified systematically to ascertain correct model implementation. Cross-validation consisted of a comparison between ECHO-T2DM predictions and those of the seminal National Institutes of Health model. In external validation, study characteristics were entered into ECHO-T2DM to replicate the clinical results of 12 studies (including 17 patient populations), and model predictions were compared to observed values using established statistical techniques as well as measures of average prediction error, separately for the four sets of macrovascular risk equations supported in ECHO-T2DM. Sub-group analyses were conducted for dependent vs. independent outcomes and for microvascular vs. macrovascular vs. 
mortality endpoints. All stress tests were passed. ECHO-T2DM replicated the National Institutes of Health cost-effectiveness application with numerically similar results. In external validation of ECHO-T2DM, model predictions agreed well with observed clinical outcomes. For all sets of macrovascular risk equations, the results were close to the intercept and slope coefficients corresponding to a perfect match, resulting in high R² and failure to reject concordance using an F test. The results were similar for sub-groups of dependent and independent validation, with some degree of under-prediction of macrovascular events. ECHO-T2DM continues to match health outcomes in clinical trials in T2DM, with prediction accuracy similar to other leading models of T2DM.
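The concordance check described above (intercept near 0, slope near 1, high R², and an F test that fails to reject a perfect match) can be sketched as follows. This is a generic illustration, not the authors' procedure: regressing observed on predicted values, the function name, and the data are assumptions. The F statistic tests the joint hypothesis intercept = 0 and slope = 1 on 2 and n − 2 degrees of freedom; obtaining a p-value would require an F distribution, which is left out here.

```python
def concordance_fit(observed, predicted):
    """Regress observed on predicted values.

    Returns (intercept, slope, R^2, F) where F is the statistic for the
    joint hypothesis intercept = 0 and slope = 1 (a perfect match).
    """
    n = len(observed)
    mx = sum(predicted) / n
    my = sum(observed) / n
    sxx = sum((x - mx) ** 2 for x in predicted)
    sxy = sum((x - mx) * (y - my) for x, y in zip(predicted, observed))
    slope = sxy / sxx
    intercept = my - slope * mx

    pairs = list(zip(predicted, observed))
    rss_free = sum((y - intercept - slope * x) ** 2 for x, y in pairs)  # fitted line
    rss_null = sum((y - x) ** 2 for x, y in pairs)                      # identity line
    tss = sum((y - my) ** 2 for y in observed)

    r2 = 1.0 - rss_free / tss
    # Undefined if the fitted line passes through every point (rss_free == 0).
    f_stat = ((rss_null - rss_free) / 2.0) / (rss_free / (n - 2))
    return intercept, slope, r2, f_stat

# Made-up predicted and observed outcome values.
intercept, slope, r2, f_stat = concordance_fit([1.1, 1.9, 3.2, 3.8], [1, 2, 3, 4])
```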
2012-01-01
Background: Currently, no validated instruments are available to measure the health status of Bangladeshi patients with fibromyalgia (FM). The aims of this study were to cross-culturally adapt the modified Fibromyalgia Impact Questionnaire (FIQ) into Bengali (B-FIQ) and to test its validity and reliability in Bangladeshi patients with FM. Methods: The FIQ was translated following cross-cultural adaptation guidelines and pretested in 30 female patients with FM. Next, the adapted B-FIQ was physician-administered to 102 consecutive female FM patients together with the Health Assessment Questionnaire (HAQ), selected subscales of the SF-36, and visual analog scales for current clinical symptoms. A tender point count (TPC) was performed by an experienced rheumatologist. Forty randomly selected patients completed the B-FIQ again after 7 days. Two control groups of 50 healthy people and 50 rheumatoid arthritis (RA) patients also completed the B-FIQ. Results: For the final B-FIQ, five physical function sub-items were replaced with culturally appropriate equivalents. Internal consistency was adequate for both the 11-item physical function subscale (α = 0.73) and the total scale (α = 0.83). With the exception of the physical function subscale, expected correlations were generally observed between the B-FIQ items and selected subscales of the SF-36, HAQ, clinical symptoms, and TPC. The B-FIQ was able to discriminate between FM patients and healthy controls and between FM patients and RA patients. Test-retest reliability was adequate for the physical function subscale (r = 0.86) and individual items (r = 0.73-0.86), except anxiety (r = 0.27) and morning tiredness (r = 0.64). Conclusion: This study supports the reliability and validity of the B-FIQ as a measure of functional disability and health status in Bangladeshi women with FM. PMID:22925458
Cross-modal face recognition using multi-matcher face scores
NASA Astrophysics Data System (ADS)
Zheng, Yufeng; Blasch, Erik
2015-05-01
The performance of face recognition can be improved using information fusion of multimodal images and/or multiple algorithms. When multimodal face images are available, cross-modal recognition is meaningful for security and surveillance applications. For example, a probe face is a thermal image (especially at nighttime), while only visible face images are available in the gallery database. Matching a thermal probe face onto the visible gallery faces requires cross-modal matching approaches. A few such studies were implemented in facial feature space with medium recognition performance. In this paper, we propose a cross-modal recognition approach, where multimodal faces are cross-matched in feature space and the recognition performance is enhanced with stereo fusion at image, feature and/or score level. In the proposed scenario, there are two cameras for stereo imaging, two face imagers (visible and thermal images) in each camera, and three recognition algorithms (circular Gaussian filter, face pattern byte, linear discriminant analysis). A score vector is formed with three cross-matched face scores from the aforementioned three algorithms. A classifier (e.g., k-nearest neighbor, support vector machine, binomial logistic regression [BLR]) is trained and then tested with the score vectors by using 10-fold cross-validation. The proposed approach was validated with a multispectral stereo face dataset from 105 subjects. Our experiments show very promising results: ACR (accuracy rate) = 97.84%, FAR (false accept rate) = 0.84% when cross-matching the fused thermal faces onto the fused visible faces by using three face scores and the BLR classifier.
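The evaluation protocol above (train a classifier on three-element score vectors, test with 10-fold cross-validation) can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' pipeline: a 1-nearest-neighbour rule stands in for the classifiers named in the abstract, the function names are hypothetical, and the score vectors are synthetic.

```python
import random

def nearest_label(train_x, train_y, x):
    """1-nearest-neighbour prediction by squared Euclidean distance."""
    dists = [sum((a - b) ** 2 for a, b in zip(row, x)) for row in train_x]
    return train_y[dists.index(min(dists))]

def kfold_accuracy(x, y, k=10, seed=0):
    """Pooled accuracy over k folds; every sample is tested exactly once."""
    order = list(range(len(x)))
    random.Random(seed).shuffle(order)
    folds = [order[i::k] for i in range(k)]
    correct = 0
    for test_idx in folds:
        held_out = set(test_idx)
        train_x = [x[i] for i in order if i not in held_out]
        train_y = [y[i] for i in order if i not in held_out]
        correct += sum(nearest_label(train_x, train_y, x[i]) == y[i] for i in test_idx)
    return correct / len(x)

# Synthetic, well-separated score vectors: 1 = genuine match, 0 = impostor.
rng = random.Random(1)
genuine = [[0.9 + rng.uniform(-0.05, 0.05) for _ in range(3)] for _ in range(20)]
impostor = [[0.1 + rng.uniform(-0.05, 0.05) for _ in range(3)] for _ in range(20)]
scores = genuine + impostor
labels = [1] * 20 + [0] * 20
accuracy = kfold_accuracy(scores, labels, k=10)
```

Holding each fold out of training before it is scored is what keeps the reported accuracy from being inflated by samples the classifier has already seen.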
Sobral, Maria P; Costa, Maria E; Schmidt, Lone; Martins, Mariana V
2017-02-01
Are the Copenhagen Multi-Centre Psychosocial Infertility research program Fertility Problem Stress Scales (COMPI-FPSS) a reliable and valid measure across gender and culture? The COMPI-FPSS is a valid and reliable measure, presenting excellent or good fit in the majority of the analyzed countries, and demonstrating full invariance across genders and partial invariance across cultures. Cross-cultural and gender validation is needed to consider a measure as standard care within fertility. The present study is the first attempting to establish comparability of fertility-related stress across genders and countries. Cross-sectional study. First, we tested the structure of the COMPI-FPSS. Then, reliability and validity (convergent and discriminant) were examined for the final model. Finally, measurement invariance both across genders and cultures was tested. Our final sample had 3923 fertility patients (1691 men and 2232 women) recruited in clinical settings from seven different countries: Denmark, China, Croatia, Germany, Greece, Hungary and Sweden. Participants had a mean age of 34 years and the majority (84%) were childless. Findings confirmed the original three-factor structure of the COMPI-FPSS, although suggesting a shortened measurement model using fewer items that fitted the data better than the full version model. While data from the Chinese and Croatian subsamples did not fit, all other countries presented good fit (χ²/df ≤ 5.4; comparative fit index ≥ 0.94; root-mean-square error of approximation ≤ 0.07; modified expected cross-validation index ≤ 0.77). In general, reliability, convergent validity, and discriminant validity were observed in all subscales from each country (composite reliability ≥ 0.63; average variance extracted ≥ 0.38; squared correlation ≥ 0.13). Full invariance was established across genders, and partial invariance was demonstrated across countries. 
The validation of the COMPI-FPSS cannot be generalized to infertile individuals not seeking treatment or to non-European patients. This study did not investigate predictive validity, and hence the capability of this instrument in detecting changes in fertility-specific adjustment over time and predicting the psychological impact needs to be established in future research. Besides extending knowledge on the psychometric properties of one of the most used fertility stress questionnaires, this study demonstrates both research and clinical usefulness of the COMPI-FPSS. This study was supported by European Union Funds (FEDER/COMPETE-Operational Competitiveness Program) and by national funds (FCT-Portuguese Foundation for Science and Technology) under the projects PTDC/MHC-PSC/4195/2012 and SFRH/BPD/85789/2012. There are no conflicts of interest to declare. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved.
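Composite reliability (CR) and average variance extracted (AVE), two of the indices reported for the COMPI-FPSS subscales, are usually derived from the standardized factor loadings λ of a subscale: CR = (Σλ)² / ((Σλ)² + Σ(1 − λ²)) and AVE = Σλ² / n. The abstract does not state the formulas used, so these conventional definitions are an assumption, and the loadings below are made up.

```python
def composite_reliability(loadings):
    """Composite reliability from the standardized loadings of one factor."""
    total = sum(loadings) ** 2
    error = sum(1.0 - l * l for l in loadings)   # summed error variances
    return total / (total + error)

def average_variance_extracted(loadings):
    """Mean squared standardized loading of one factor."""
    return sum(l * l for l in loadings) / len(loadings)

# Made-up standardized loadings for a three-item subscale.
example_loadings = [0.7, 0.7, 0.7]
cr = composite_reliability(example_loadings)
ave = average_variance_extracted(example_loadings)
```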
Oliveira, Indiara Soares; da Cunha Menezes Costa, Lucíola; Fagundes, Felipe Ribeiro Cabral; Cabral, Cristina Maria Nunes
2015-05-01
To assess the procedures of translation, cross-cultural adaptation, and measurement properties of breast cancer-specific quality-of-life questionnaires. Searches were conducted in the databases MEDLINE, EMBASE, CINAHL, and SciELO using the keywords: "Questionnaires," "Quality of life," and "Breast cancer." The studies were analyzed in terms of methodological quality according to the guidelines for the procedure of cross-cultural adaptation and the quality criteria for measurement properties of questionnaires. We found 24 eligible studies. Most of the articles assessed the translation and measurement properties of the instrument EORTC QLQ-BR23. Description about translation and cross-cultural adaptation was incomplete in 11 studies. Translation and back translation were the most tested phases, and synthesis of the translation was the most omitted phase in the articles. Information on assessing measurement properties was provided incompletely in 23 articles. Internal consistency was the most tested property in all of the eligible articles, but none of them provided information on agreement. Construct validity was adequately tested in only three studies that used the FACT-B and QLQ-BR23. Eight articles provided information on reliability; however, only four found positive classification. Responsiveness was tested in four articles, and ceiling and floor effects were tested in only three articles. None of the instruments showed fully adequate quality. There is limited evidence on cross-cultural adaptations and measurement properties; therefore, it is recommended that caution be exercised when using breast cancer-specific quality-of-life questionnaires that have been translated, adapted, and tested.
Lin, Ying-Tsong; Collis, Jon M; Duda, Timothy F
2012-11-01
An alternating direction implicit (ADI) three-dimensional fluid parabolic equation solution method with enhanced accuracy is presented. The method uses a square-root Helmholtz operator splitting algorithm that retains cross-multiplied operator terms that have been previously neglected. With these higher-order cross terms, the valid angular range of the parabolic equation solution is improved. The method is tested for accuracy against an image solution in an idealized wedge problem. Computational efficiency improvements resulting from the ADI discretization are also discussed.
Kaneko, Hiromasa; Funatsu, Kimito
2013-09-23
We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regression models are updated, whereas cross-validation cannot be performed in such a situation. The proposed method is effective and helpful in handling big data when cross-validation cannot be applied. By analyzing data from numerical simulations and quantitative structural relationships, we confirm that the proposed criteria enable the predictive ability of the nonlinear regression models to be appropriately quantified.
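The midpoint criteria described above can be sketched roughly as follows. This is a minimal illustration rather than the authors' implementation: the function name `midknn_criteria`, the use of Euclidean distance, and in particular the choice to take the mean of the two neighbors' targets as the reference value at each midpoint are assumptions made here for the example.

```python
import math

def midknn_criteria(X, y, predict, k=2):
    """Return (r2, rmse) evaluated at midpoints between k-nearest-neighbor pairs.

    X: list of feature vectors, y: list of targets,
    predict: callable mapping a feature vector to a predicted target.
    """
    n = len(X)
    pairs = set()
    for i in range(n):
        # Rank all other points by Euclidean distance to point i.
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: math.dist(X[i], X[j]))
        for j in others[:k]:
            pairs.add((min(i, j), max(i, j)))  # store each pair only once

    ordered = sorted(pairs)
    mid_X = [[(a + b) / 2 for a, b in zip(X[i], X[j])] for i, j in ordered]
    # Assumption: the reference target at a midpoint is the mean of the pair's targets.
    mid_y = [(y[i] + y[j]) / 2 for i, j in ordered]
    pred = [predict(x) for x in mid_X]

    ss_res = sum((t - p) ** 2 for t, p in zip(mid_y, pred))
    mean_y = sum(mid_y) / len(mid_y)
    ss_tot = sum((t - mean_y) ** 2 for t in mid_y)
    return 1 - ss_res / ss_tot, math.sqrt(ss_res / len(mid_y))

# Toy data: a model that matches the generating function exactly.
X = [[0.0], [1.0], [2.0], [3.0], [4.0]]
y = [2 * x[0] + 1 for x in X]
r2, rmse = midknn_criteria(X, y, predict=lambda x: 2 * x[0] + 1, k=2)
print(r2, rmse)
```

Because no data are held out, a check of this kind can be rerun each time the model is updated, which is the situation the abstract describes as out of reach for cross-validation.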
Validating Neuro-QoL short forms and targeted scales with people who have multiple sclerosis.
Miller, Deborah M; Bethoux, Francois; Victorson, David; Nowinski, Cindy J; Buono, Sarah; Lai, Jin-Shei; Wortman, Katy; Burns, James L; Moy, Claudia; Cella, David
2016-05-01
Multiple sclerosis (MS) is a chronic, progressive, and disabling disease of the central nervous system with dramatic variations in the combination and severity of symptoms it can produce. The lack of reliable disease-specific health-related quality of life (HRQL) measures for use in clinical trials prompted the development of the Neurology Quality of Life (Neuro-QOL) instrument, which includes 13 scales that assess physical, emotional, cognitive, and social domains, for use in a variety of neurological illnesses. The objective of this research paper is to conduct an initial assessment of the reliability and validation of the Neuro-QOL short forms (SFs) in MS. We assessed reliability, concurrent validity, known groups validity, and responsiveness between cross-sectional and longitudinal data in 161 recruited MS patients. Internal consistency was high for all measures (α = 0.81-0.95) and ICCs were within the acceptable range (0.76-0.91); concurrent and known groups validity were highest with the Global HRQL question. Longitudinal assessment was limited by the lack of disease progression in the group. The Neuro-QOL SFs demonstrate good internal consistency, test-re-test reliability, and concurrent and known groups validity in this MS population, supporting the validity of Neuro-QOL in adults with MS. © The Author(s), 2015.
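For readers unfamiliar with the internal-consistency statistic reported above, the following sketch computes Cronbach's α from a respondents-by-items table using the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The response data are made up for illustration and have no connection to the Neuro-QOL sample.

```python
from statistics import variance

def cronbach_alpha(rows):
    """rows: one list of item scores per respondent (all the same length)."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical responses: 5 respondents, 3 items.
responses = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4], [3, 3, 3]]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```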
Paiva, Bianca Sakamoto Ribeiro; de Camargos, Mayara Goulart; Demarzo, Marcelo Marcos Piva; Hervás, Gonzalo; Vázquez, Carmelo; Paiva, Carlos Eduardo
2016-09-01
The Pemberton Happiness Index (PHI) is a recently developed integrative measure of well-being that includes components of hedonic, eudaimonic, social, and experienced well-being. The PHI has been validated in several languages, but not in Portuguese. Our aim was to cross-culturally adapt the Universal Portuguese version of the PHI and to assess its psychometric properties in a sample of the Brazilian population using online surveys. An expert committee evaluated 2 versions of the PHI previously translated into Portuguese by the original authors using a standardized form for assessment of semantic/idiomatic, cultural, and conceptual equivalence. A pretesting was conducted employing cognitive debriefing methods. In sequence, the expert committee evaluated all the documents and reached a final Universal Portuguese PHI version. For the evaluation of the psychometric properties, the data were collected using online surveys in a cross-sectional study. The study population included healthcare professionals and users of the social network site Facebook from several Brazilian geographic areas. In addition to the PHI, participants completed the Satisfaction with Life Scale (SWLS), Diener and Emmons' Positive and Negative Experience Scale (PNES), Psychological Well-being Scale (PWS), and the Subjective Happiness Scale (SHS). Internal consistency, convergent validity, known-group validity, and test-retest reliability were evaluated. Satisfaction with the previous day was correlated with the 10 items assessing experienced well-being using the Cramer V test. Additionally, a cut-off value of PHI to identify a "happy individual" was defined using receiver-operating characteristic (ROC) curve methodology. Data from 1035 Brazilian participants were analyzed (health professionals = 180; Facebook users = 855). 
Regarding reliability results, the internal consistency (Cronbach alpha = 0.890 and 0.914) and test-retest reliability (intraclass correlation coefficient = 0.814) were both considered adequate. Most of the validity hypotheses formulated a priori (convergent and known-group) were confirmed. A cut-off value of higher than 7 in remembered PHI was identified (AUC = 0.780, sensitivity = 69.2%, specificity = 78.2%) as the best one to identify a happy individual. We concluded that the Universal Portuguese version of the PHI is valid and reliable for use in the Brazilian population using online surveys.
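A ROC-based cut-off of the kind reported above can be illustrated with a short sketch. The abstract does not say which criterion was used to pick the "best" cut-off, so maximizing Youden's J (sensitivity + specificity − 1) is an assumption here, and the scores and labels are invented toy values, not study data.

```python
def split(scores, labels):
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    return pos, neg

def auc(scores, labels):
    """Area under the ROC curve: P(positive score > negative score), ties count 0.5."""
    pos, neg = split(scores, labels)
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def best_cutoff(scores, labels):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A case is classified positive when its score is strictly higher than the cutoff.
    """
    pos, neg = split(scores, labels)
    best = None
    for c in sorted(set(scores)):
        sens = sum(p > c for p in pos) / len(pos)
        spec = sum(n <= c for n in neg) / len(neg)
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, c, sens, spec)
    return best[1], best[2], best[3]

# Hypothetical index scores; label 1 marks the "happy" reference group.
toy_scores = [3, 5, 6, 7, 7, 7, 8, 9, 9, 10]
toy_labels = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
toy_auc = auc(toy_scores, toy_labels)
cutoff, sens, spec = best_cutoff(toy_scores, toy_labels)
print(toy_auc, cutoff, sens, spec)
```

Other criteria, such as the point closest to the top-left corner of the ROC plot, can select a different cut-off from the same data.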
Xu, Ximing; Wang, Fei; Yang, Mingyuan; Huang, Qikai; Chang, Yifan; Wei, Xianzhao; Bai, Yushu; Li, Ming
2015-08-01
Bad Sobernheim Stress Questionnaire (BSSQ)-Deformity and BSSQ-Brace are the most widely used instruments for evaluating stress levels in adolescent idiopathic scoliosis (AIS) patients under brace treatment, and good reliability and validity have been demonstrated across different cultures. Great stress has been found among many adolescents, becoming a major concern for professionals. However, no previous research has addressed the cultural adaptations and psychometric testing of BSSQ-Deformity and BSSQ-Brace in China or the stress levels in AIS patients. The purposes of our study were to evaluate the cross-cultural adaptation and validation of the BSSQ-Deformity and BSSQ-Brace and to investigate stress levels in Chinese AIS patients under brace treatment. The original (German) versions of BSSQ-Deformity and BSSQ-Brace were cross-culturally translated according to international guidelines. Psychometric properties such as reliability and construct validity were tested. Eighty-six AIS patients were included in our study, and 50 patients paid a second visit 3 to 7 days later to test reproducibility. Cronbach α and the intraclass coefficient were determined to assess internal consistency and reproducibility. Scoliosis Research Society patient questionnaire-22 (SRS-22) was applied to evaluate construct validity. The mean BSSQ-Deformity and BSSQ-Brace scores were 15.3 and 13.4 points, respectively. Severe stress was observed in 12% of patients due to brace treatment. Item analysis demonstrated that each item was scored under a normal distribution with no redundancy. Psychometric analysis revealed excellent internal consistency (Cronbach α = 0.85 and 0.80, respectively) and reproducibility (intraclass correlation coefficient = 0.85 and 0.90, respectively) for BSSQ-Deformity and BSSQ-Brace. 
The correlation coefficients of BSSQ-Deformity, BSSQ-Brace and SRS-22 were 0.48 and 0.63, respectively. In conclusion, BSSQ-Deformity and BSSQ-Brace have been successfully adapted to a Chinese background and psychometrically validated with excellent reliability and construct validity. Brace wearing is considered the main cause of stress in AIS patients under brace treatment.
PMID:26252283
Abeare, Christopher A; Messa, Isabelle; Zuccato, Brandon G; Merker, Bradley; Erdodi, Laszlo
2018-03-12
Estimated base rates of invalid performance on baseline testing (base rates of failure) for the management of sport-related concussion range from 6.1% to 40.0%, depending on the validity indicator used. The instability of this key measure represents a challenge in the clinical interpretation of test results that could undermine the utility of baseline testing. To determine the prevalence of invalid performance on baseline testing and to assess whether the prevalence varies as a function of age and validity indicator. This retrospective, cross-sectional study included data collected between January 1, 2012, and December 31, 2016, from a clinical referral center in the Midwestern United States. Participants included 7897 consecutively tested, equivalently proportioned male and female athletes aged 10 to 21 years, who completed baseline neurocognitive testing for the purpose of concussion management. Baseline assessment was conducted with the Immediate Postconcussion Assessment and Cognitive Testing (ImPACT), a computerized neurocognitive test designed for assessment of concussion. Base rates of failure on published ImPACT validity indicators were compared within and across age groups. Hypotheses were developed after data collection but prior to analyses. Of the 7897 study participants, 4086 (51.7%) were male, mean (SD) age was 14.71 (1.78) years, 7820 (99.0%) were primarily English speaking, and the mean (SD) educational level was 8.79 (1.68) years. The base rate of failure ranged from 6.4% to 47.6% across individual indicators. Most of the sample (55.7%) failed at least 1 of 4 validity indicators. The base rate of failure varied considerably across age groups (117 of 140 [83.6%] for those aged 10 years to 14 of 48 [29.2%] for those aged 21 years), representing a risk ratio of 2.86 (95% CI, 2.60-3.16; P < .001). 
The results for base rate of failure were surprisingly high overall and varied widely depending on the specific validity indicator and the age of the examinee. The strong age association, with 3 of 4 participants aged 10 to 12 years failing validity indicators, suggests that the clinical interpretation and utility of baseline testing in this age group is questionable. These findings underscore the need for close scrutiny of performance validity indicators on baseline testing across age groups.
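The risk ratio quoted above can be checked against the counts given in the abstract (117 of 140 at age 10 versus 14 of 48 at age 21). The sketch below reproduces only the point estimate. It does not attempt the reported confidence interval, because the abstract does not state how that interval was computed.

```python
def risk_ratio(events_a, n_a, events_b, n_b):
    """Ratio of the event proportion in group A to that in group B."""
    return (events_a / n_a) / (events_b / n_b)

# Counts taken from the abstract: failures among 10-year-olds vs 21-year-olds.
rr = risk_ratio(117, 140, 14, 48)
print(f"{117 / 140:.1%} vs {14 / 48:.1%}, risk ratio = {rr:.3f}")
```

The unrounded ratio is about 2.865, which agrees with the reported 2.86 up to rounding.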
Cross-validation to select Bayesian hierarchical models in phylogenetics.
Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C
2016-05-26
Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.
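The general idea of comparing models by predictive performance on held-out data can be sketched generically. The example below is not a phylogenetic analysis and not the authors' procedure: two toy Gaussian models (a constant mean and a linear trend) stand in for competing models, the data are simulated, and every name in the code is hypothetical.

```python
import math
import random

def fit_constant(xs, ys):
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y

def fit_linear(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return lambda x: my + slope * (x - mx)

def heldout_loglik(xs, ys, fit, n_folds=5):
    """Sum of Gaussian log-likelihoods of held-out points over all folds."""
    total = 0.0
    for fold in range(n_folds):
        train = [i for i in range(len(xs)) if i % n_folds != fold]
        test = [i for i in range(len(xs)) if i % n_folds == fold]
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        # The noise variance is estimated from the training residuals only.
        sigma2 = sum((ys[i] - model(xs[i])) ** 2 for i in train) / len(train)
        for i in test:
            resid = ys[i] - model(xs[i])
            total += -0.5 * math.log(2 * math.pi * sigma2) - resid ** 2 / (2 * sigma2)
    return total

# Simulated data with a genuine trend, so the linear model should predict better.
rng = random.Random(0)
xs = [float(i) for i in range(40)]
ys = [0.5 * x + rng.gauss(0, 1) for x in xs]
cv_scores = {"constant": heldout_loglik(xs, ys, fit_constant),
             "linear": heldout_loglik(xs, ys, fit_linear)}
selected = max(cv_scores, key=cv_scores.get)
print(cv_scores, selected)
```

Because the score depends only on how well each fitted model predicts unseen data, no marginal likelihood has to be computed, which is the property the abstract highlights.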
Validation of the Perceived Stigmatization Questionnaire for Brazilian adult burn patients.
Freitas, Noélle de Oliveira; Forero, Carlos García; Caltran, Marina Paes; Alonso, Jordi; Dantas, Rosana A Spadoti; Piccolo, Monica Sarto; Farina, Jayme Adriano; Lawrence, John W; Rossi, Lidia A
2018-01-01
Currently, there is no questionnaire to assess perceived stigmatization among people with visible differences in Brazil. The Perceived Stigmatization Questionnaire (PSQ), developed in the United States, is a valid instrument to assess the perception of stigmatizing behaviours among burn survivors. The objective of this cross-sectional and multicentre study was to assess the factor structure, reliability and validity of the Brazilian Portuguese version of the PSQ in burn patients. A Brazilian version of the 21-item PSQ was answered by 240 adult burn patients, undergoing rehabilitation in two burns units in Brazil. We tested its construct validity by correlating PSQ scores with depression (Beck Depression Index-BDI) and self-esteem (Rosenberg Self-Esteem Scale-RSE), as well as with two domains of the Revised Burn Specific Health Scale-BSHS-R: affect and body image, and interpersonal relationships. We used Confirmatory Item Factor Analysis (CIFA) to test whether the data fit a measurement model involving a three-factor structure (absence of friendly behaviour; confusing/staring behaviour; and hostile behaviour). We conducted Exploratory Factor Analyses (EFA) of the subscale in a 50% random sample of individuals (training split), treating items as ordinal categorical using unweighted least squares estimation. To assess discriminant validity of the Brazilian version of the PSQ we correlated PSQ scores with known groups (sex, total body surface area burned, and visibility of the scars) and assessed its reliability by means of Cronbach's alpha and using test-retest. Goodness-of-fit indices for confirmatory factor analysis were satisfactory for the PSQ, but not for the hostile behaviour subscale, which was modified to improve fit by eliminating 3 items. Cronbach's alphas for the PSQ refined version (PSQ-R) ranged from 0.65 to 0.88, with test-retest reliability 0.87 for the total score. 
The PSQ-R scores correlated strongly with depression (0.63; p < 0.001), self-esteem (-0.57; p < 0.001), body image (-0.63; p < 0.001), and interpersonal relationships (-0.55; p < 0.001). PSQ-R total scores were significantly lower for patients with visible scars (effect size = 0.51, p = 0.029). The PSQ-R showed reliability and validity comparable to the original version. However, the cross-cultural structure of the subscale "hostile behaviour" and sensitivity to change of the PSQ should be further evaluated.
De Silva Weliange, Shreenika H; Fernando, Dulitha; Gunatilake, Jagath
2014-05-03
Environmental characteristics are known to be associated with patterns of physical activity (PA). Although several validated tools exist to measure environmental characteristics, these instruments are not necessarily suitable for application in all settings, especially in a developing country. This study was carried out to develop and validate an instrument named the "Physical And Social Environment Scale (PASES)" to assess the physical and social environmental factors associated with PA. This will enable identification of various physical and social environmental factors affecting PA in Sri Lanka, which will help in the development of more tailored intervention strategies for promoting higher PA levels in Sri Lanka. The PASES was developed using a scientific approach of defining the construct, item generation, analysis of content of items and item reduction. Both qualitative and quantitative methods were used: key informant interviews, in-depth interviews and expert rating of the generated items. A cross-sectional survey among 180 adults was carried out to assess the factor structure through principal component analysis. Another cross-sectional survey among a different group of 180 adults was carried out to assess the construct validity through confirmatory factor analysis. Reliability was assessed with test-retest reliability and internal consistency using Spearman r and Cronbach's alpha, respectively. Thirty-six items were selected after the expert ratings and were developed into interviewer-administered questions. Exploration of the factor structure of the 34 factorable items through principal component analysis with Quartimax rotation extracted 8 factors. The 34-item instrument was assessed for construct validity with confirmatory factor analysis, which confirmed an 8-factor model (χ² = 339.9, GFI = 0.90). 
The identified factors were infrastructure for walking, aesthetics and facilities for cycling, vehicular traffic safety, access and connectivity, recreational facilities for PA, safety, social cohesion and social acceptance of PA, along with the two non-factorable factors, residential density and land use mix. The PASES also showed good test-retest reliability and a moderate level of internal consistency. The PASES is a valid and reliable tool which could be used to assess the physical and social environment associated with PA in Sri Lanka.
PMID:29381711
Cross-Cultural Adaptation and Validation of the Back Beliefs Questionnaire to the Arabic Language.
Alamrani, Samia; Alsobayel, Hana; Alnahdi, Ali H; Moloney, Niamh; Mackey, Martin
2016-06-01
Translation, cross-cultural adaptation, and psychometric testing. To translate the Back Beliefs Questionnaire (BBQ) into Arabic and investigate its psychometric properties in an Arabic-speaking sample of individuals with low back pain (LBP). Back pain beliefs are associated with pain chronicity and disability in people with LBP. The BBQ is a recognized and frequently used tool for measuring these beliefs. To date the BBQ has not been translated into Arabic. The English version of the BBQ was translated and culturally adapted into Arabic (BBQ-Ar) according to published guidelines. The BBQ-Ar was then tested in a sample of 115 Arabic-speaking individuals with LBP. Reliability was evaluated through internal consistency (Cronbach α) and test-retest reliability (intraclass correlation coefficient), the latter in a subgroup of 25. Construct validity was assessed using exploratory factor analysis and by examining the correlation between the BBQ-Ar, the Oswestry Disability Index and a Numerical Pain Rating Scale. Internal consistency of the BBQ-Ar was good (Cronbach α = 0.77). Test-retest reliability was good (intraclass correlation coefficient [2,1] = 0.88). Exploratory factor analysis revealed a three-factor structure, explaining 46% of total variance, with the first factor alone explaining 24%. Eight of the nine scoring items loaded on the first factor, thus forming a unidimensional scale. A significant negative correlation was found between Oswestry Disability Index and BBQ-Ar scores (r = -0.307; P < 0.01), whereas no significant correlation was found between BBQ-Ar and Pain Rating Scale scores. No floor or ceiling effects were observed. The BBQ-Ar is a valid and reliable tool that can be used to assess back pain beliefs in Arabic-speaking individuals.
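The intraclass correlation coefficient [2,1] mentioned above is the two-way random-effects, absolute-agreement, single-measurement form. A minimal sketch of the usual mean-squares formula follows, assuming a complete subjects-by-sessions table with no missing values. The rating table is a small illustrative example, not BBQ-Ar data.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a table with one row per subject and one column per session or rater."""
    n, k = len(ratings), len(ratings[0])
    grand = sum(map(sum, ratings)) / (n * k)
    row_means = [sum(r) / k for r in ratings]
    col_means = [sum(r[j] for r in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for r in ratings for v in r)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)               # between-subjects mean square
    msc = ss_cols / (k - 1)               # between-sessions mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)

# Illustrative table: 6 subjects measured on 4 occasions.
table = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
icc = icc_2_1(table)
print(round(icc, 2))
```

Because this form penalizes systematic differences between sessions, it can be much lower than a consistency-type coefficient computed on the same table.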
Halm, Margo A
2018-05-14
Proficiency in evidence-based practice (EBP) is essential for relevant research findings to be integrated into clinical care when congruent with patient preferences. Few valid and reliable tools are available to evaluate the effectiveness of educational programs in advancing EBP attitudes, knowledge, skills, or behaviors, and ongoing competency. The Fresno test is one objective method to evaluate EBP knowledge and skills; however, the original and modified versions were validated with family physicians, physical therapists, and speech and language therapists. To adapt the Modified Fresno-Acute Care Nursing test and develop a psychometrically sound tool for use in academic and practice settings. In Phase 1, modified Fresno (Tilson, 2010) items were adapted for acute care nursing. In Phase 2, content validity was established with an expert panel. Item content validity indices (I-CVI) ranged from .75 to 1.0. Scale CVI was .95. A cross-sectional convenience sample of acute care nurses (n = 90) in novice, master, and expert cohorts completed the Modified Fresno-Acute Care Nursing test administered electronically via SurveyMonkey. Total scores were significantly different between training levels (p < .0001). Novice nurses scored significantly lower than master or expert nurses, but differences were not found between the latter cohorts. Total score interrater reliability was acceptable (ICC [2, 1] = .88). Cronbach's alpha was 0.70. Psychometric properties of most modified items were satisfactory; however, six require further revision and testing to meet acceptable standards. The Modified Fresno-Acute Care Nursing test is a 14-item test for objectively assessing EBP knowledge and skills of acute care nurses. While preliminary psychometric properties for this new EBP knowledge measure for acute care nursing are promising, further validation of some of the items and scoring rubric is needed. © 2018 Sigma Theta Tau International.
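Content validity indices like those reported above are commonly computed as simple proportions. The sketch below assumes the widely used convention in which experts rate each item on a 4-point relevance scale, the item-level index (I-CVI) is the share of experts giving a 3 or 4, and the scale-level index is the average of the I-CVIs. The abstract does not state which convention was used, and the ratings are invented.

```python
def item_cvi(item_ratings, relevant_min=3):
    """Proportion of experts rating the item as relevant (>= relevant_min)."""
    return sum(r >= relevant_min for r in item_ratings) / len(item_ratings)

def scale_cvi_ave(all_ratings, relevant_min=3):
    """Average of the item-level indices across all items."""
    cvis = [item_cvi(r, relevant_min) for r in all_ratings]
    return sum(cvis) / len(cvis)

# Hypothetical ratings: 3 items, each rated by 4 experts on a 1-4 scale.
expert_ratings = [
    [4, 4, 3, 4],
    [4, 3, 2, 4],
    [3, 4, 4, 4],
]
i_cvis = [item_cvi(r) for r in expert_ratings]
s_cvi = scale_cvi_ave(expert_ratings)
print(i_cvis, round(s_cvi, 3))
```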
Bernabeu-Mora, Roberto; Medina-Mirapeix, Françesc; Llamazares-Herrán, Eduardo; García-Guillamón, Gloria; Giménez-Giménez, Luz María; Sánchez-Nieto, Juan Miguel
2015-01-01
Limited mobility is a risk factor for developing chronic obstructive pulmonary disease (COPD)-related disabilities. Little is known about the validity of the Short Physical Performance Battery (SPPB) for identifying mobility limitations in patients with COPD. To determine the clinical validity of the SPPB summary score and its three components (standing balance, 4-meter gait speed, and five-repetition sit-to-stand) for identifying mobility limitations in patients with COPD. This cross-sectional study included 137 patients with COPD, recruited from a hospital in Spain. Muscle strength tests and SPPB were measured; then, patients were surveyed for self-reported mobility limitations. The validity of SPPB scores was analyzed by developing receiver operating characteristic curves to analyze the sensitivity and specificity for identifying patients with mobility limitations; by examining group differences in SPPB scores across categories of mobility activities; and by correlating SPPB scores to strength tests. Only the SPPB summary score and the five-repetition sit-to-stand components showed good discriminative capabilities; both showed areas under the receiver operating characteristic curves greater than 0.7. Patients with limitations had significantly lower SPPB scores than patients without limitations in nine different mobility activities. SPPB scores were moderately correlated with the quadriceps test (r>0.40), and less correlated with the handgrip test (r<0.30), which reinforced convergent and divergent validities. A SPPB summary score cutoff of 10 provided the best accuracy for identifying mobility limitations. This study provided evidence for the validity of the SPPB summary score and the five-repetition sit-to-stand test for assessing mobility in patients with COPD. These tests also showed potential as a screening test for identifying patients with COPD that have mobility limitations.
PMID:26664110
International physical activity questionnaire: reliability and validity of the Turkish version.
Saglam, Melda; Arikan, Hulya; Savci, Sema; Inal-Ince, Deniz; Bosnak-Guclu, Meral; Karabulut, Erdem; Tokgozoglu, Lale
2010-08-01
Physical inactivity is a global problem which is related to many chronic health disorders. Physical activity scales which allow cross-cultural comparisons have been developed. The goal was to assess the reliability and validity of a Turkish version of the International Physical Activity Questionnaire (IPAQ). 1,097 university students (721 women, 376 men; ages 18-32) volunteered. Short and long forms of the IPAQ gave good agreement and comparable 1-wk. test-retest reliabilities. Caltrac accelerometer data were compared with IPAQ scores in 80 participants with good agreement for short and long forms. Turkish versions of the IPAQ short and long forms are reliable and valid in assessment of physical activity.
Vanderploeg, Rodney D; Cooper, Douglas B; Belanger, Heather G; Donnell, Alison J; Kennedy, Jan E; Hopewell, Clifford A; Scott, Steven G
2014-01-01
To develop and cross-validate internal validity scales for the Neurobehavioral Symptom Inventory (NSI). Four existing data sets were used: (1) outpatient clinical traumatic brain injury (TBI)/neurorehabilitation database from a military site (n = 403), (2) National Department of Veterans Affairs TBI evaluation database (n = 48 175), (3) Florida National Guard nonclinical TBI survey database (n = 3098), and (4) a cross-validation outpatient clinical TBI/neurorehabilitation database combined across 2 military medical centers (n = 206). Secondary analysis of existing cohort data to develop (study 1) and cross-validate (study 2) internal validity scales for the NSI. The NSI, Mild Brain Injury Atypical Symptoms, and Personality Assessment Inventory scores. Study 1: Three NSI validity scales were developed, composed of 5 unusual items (Negative Impression Management [NIM5]), 6 low-frequency items (LOW6), and the combination of 10 nonoverlapping items (Validity-10). Cut scores maximizing sensitivity and specificity on these measures were determined, using a Mild Brain Injury Atypical Symptoms score of 8 or more as the criterion for invalidity. Study 2: The same validity scale cut scores again resulted in the highest classification accuracy and optimal balance between sensitivity and specificity in the cross-validation sample, using a Personality Assessment Inventory Negative Impression Management scale with a T score of 75 or higher as the criterion for invalidity. The NSI is widely used in the Department of Defense and Veterans Affairs as a symptom-severity assessment following TBI, but is subject to symptom overreporting or exaggeration. This study developed embedded NSI validity scales to facilitate the detection of invalid response styles. The NSI Validity-10 scale appears to hold considerable promise for validity assessment when the NSI is used as a population-screening tool.
Development and validation of the Work Conflict Appraisal Scale (WCAS).
González-Navarro, Pilar; Llinares-Insa, Lucía; Zurriaga-Llorens, Rosario; Lloret-Segura, Susana
2017-05-01
In the context of cognitive appraisal, the Work Conflict Appraisal Scale (WCAS) was developed to assess work conflict in terms of threat and challenge. In the first study, the factorial structure of the scale was tested using confirmatory factor analysis with a Spanish multi-occupational employee sample (N = 296). In the second study, we used multi-sampling confirmatory factor analysis (N = 815) to cross-validate the results. The analyses confirm the validity of the scale and are consistent with the tri-dimensional conflict classification. The findings support the distinction between the challenge and threat appraisals of work conflict, highlighting the importance of measuring these two types of appraisal separately. This scale is a valid and reliable instrument to measure conflict appraisal in organizations.
Extending the validity of the Feeding Practices and Structure Questionnaire.
Jansen, Elena; Mallan, Kimberley M; Daniels, Lynne A
2015-06-30
Feeding practices are commonly examined as potentially modifiable determinants of children's eating behaviours and weight status. Although a variety of questionnaires exist to assess different feeding aspects, many lack thorough reliability and validity testing. The Feeding Practices and Structure Questionnaire (FPSQ) is a tool designed to measure early feeding practices related to non-responsive feeding and structure of the meal environment. Face validity, factorial validity, internal reliability and cross-sectional correlations with children's eating behaviours have been established in mothers with 2-year-old children. The aim of the present study was to further extend the validity of the FPSQ by examining factorial, construct and predictive validity, and stability. Participants were from the NOURISH randomised controlled trial which evaluated an intervention with first-time mothers designed to promote protective feeding practices. Maternal feeding practices (FP) and child eating behaviours were assessed when children were aged 2 years and 3.7 years (n = 388). Confirmatory factor analysis, group differences, predictive relationships, and stability were tested. The original 9-factor structure was confirmed when children were aged 3.7 ± 0.3 years. Cronbach's alpha was above the recommended 0.70 cut-off for all factors except Structured Meal Timing, Over Restriction and Distrust in Appetite, which were 0.58, 0.67 and 0.66 respectively. Allocated group differences reflected behaviour consistent with intervention content and all feeding practices were stable across both time points (range of r = 0.45-0.70). There was some evidence for the predictive validity of factors, with 2 FP showing expected relationships, 2 FP showing expected and unexpected relationships and 5 FP showing no relationship. Reliability and validity were demonstrated for most subscales of the FPSQ. Future validation is warranted with culturally diverse samples and with fathers and other caregivers. The use of additional outcomes to further explore predictive validity is recommended, as well as testing the test-retest reliability of the questionnaire.
Allometric scaling of biceps strength before and after resistance training in men.
Zoeller, Robert F; Ryan, Eric D; Gordish-Dressman, Heather; Price, Thomas B; Seip, Richard L; Angelopoulos, Theodore J; Moyna, Niall M; Gordon, Paul M; Thompson, Paul D; Hoffman, Eric P
2007-06-01
The purposes of this study were to 1) derive allometric scaling models of isometric biceps muscle strength using pretraining body mass (BM) and muscle cross-sectional area (CSA) as scaling variables in adult males, 2) test model appropriateness using regression diagnostics, and 3) cross-validate the models before and after 12 wk of resistance training. A subset of FAMuSS (Functional SNP Associated with Muscle Size and Strength) study data (N = 136) was randomly split into two groups (A and B). Allometric scaling models using pretraining BM and CSA were derived and tested for group A. The scaling exponents determined from these models were then applied to and tested on group B pretraining data. Finally, these scaling exponents were applied to and tested on group A and B posttraining data. BM and CSA models produced scaling exponents of 0.64 and 0.71, respectively. Regression diagnostics determined both models to be appropriate. Cross-validation of the models to group B showed that the BM model, but not the CSA model, was appropriate. Removal of the largest six subjects (CSA > 30 cm²) from group B resulted in an appropriate fit for the CSA model. Application of the models to group A posttraining data showed that both models were appropriate, but only the body mass model was successful for group B. These data suggest that scaling exponents of 0.64 and 0.71, using BM and CSA, respectively, are appropriate for scaling isometric biceps strength in adult males. However, the scaling exponent using CSA may not be appropriate for individuals with biceps CSA > 30 cm². Finally, 12 wk of resistance training does not alter the relationship between BM, CSA, and muscular strength as assessed by allometric scaling.
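As an illustration of the allometric approach described in this abstract, the model strength = a · size^b becomes linear after taking logarithms, ln(strength) = ln(a) + b · ln(size), so the exponent b can be estimated by ordinary least squares on log-transformed data and then carried over to a second group. The sketch below assumes that simple log-log fit; the function names and all numbers are hypothetical and synthetic, not the study's data or its exact procedure.

```python
import math


def fit_allometric(size, strength):
    """Fit strength = a * size**b by least squares on log-transformed data.

    Returns (a, b). All values must be positive.
    """
    lx = [math.log(v) for v in size]
    ly = [math.log(v) for v in strength]
    n = len(lx)
    mx = sum(lx) / n
    my = sum(ly) / n
    sxx = sum((u - mx) ** 2 for u in lx)
    sxy = sum((u - mx) * (v - my) for u, v in zip(lx, ly))
    b = sxy / sxx                 # slope in log-log space = scaling exponent
    a = math.exp(my - b * mx)     # intercept back-transformed to the raw scale
    return a, b


def scaled_strength(strength, size, b):
    """Size-adjusted strength index: strength / size**b."""
    return [s / (m ** b) for s, m in zip(strength, size)]


# Synthetic, noise-free values (NOT study data), generated with exponent 0.64.
mass_a = [60.0, 70.0, 80.0, 90.0, 100.0]
strength_a = [5.0 * m ** 0.64 for m in mass_a]
a_hat, b_hat = fit_allometric(mass_a, strength_a)

# Cross-validation step: apply the group A exponent to a hypothetical group B.
mass_b = [65.0, 85.0, 95.0]
strength_b = [5.0 * m ** 0.64 for m in mass_b]
index_b = scaled_strength(strength_b, mass_b, b_hat)
```

If the exponent transfers well, the scaled index in group B should show no remaining trend with body size; checking for such a residual trend is one plausible form of the "appropriateness" test, although the abstract does not spell out which diagnostics were used.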
Validation in the Absence of Observed Events
Lathrop, John; Ezell, Barry
2015-07-22
Here our paper addresses the problem of validating models in the absence of observed events, in the area of Weapons of Mass Destruction terrorism risk assessment. We address that problem with a broadened definition of “Validation,” based on “backing up” to the reason why modelers and decision makers seek validation, and from that basis re-define validation as testing how well the model can advise decision makers in terrorism risk management decisions. We develop that into two conditions: Validation must be based on cues available in the observable world; and it must focus on what can be done to affect that observable world, i.e. risk management. That in turn leads to two foci: 1.) the risk generating process, 2.) best use of available data. Based on our experience with nine WMD terrorism risk assessment models, we then describe three best use of available data pitfalls: SME confidence bias, lack of SME cross-referencing, and problematic initiation rates. Those two foci and three pitfalls provide a basis from which we define validation in this context in terms of four tests -- Does the model: … capture initiation? … capture the sequence of events by which attack scenarios unfold? … consider unanticipated scenarios? … consider alternative causal chains? Finally, we corroborate our approach against three key validation tests from the DOD literature: Is the model a correct representation of the simuland? To what degree are the model results comparable to the real world? Over what range of inputs are the model results useful?
Yang, Baoqi; Chen, Guo; Yang, Qing; Yan, Xiaoxiao; Zhang, Zhaoxia; Murrell, Dédée F; Zhang, Furen
2017-02-02
The autoimmune bullous diseases quality of life (ABQOL) questionnaire was recently developed by an Australian group and has been validated in Australian and North American patient cohorts. It is a 17-item, multidimensional, self-administered English questionnaire. The study aimed to validate the Chinese version of the ABQOL questionnaire and evaluate its reliability in Chinese patients. The Chinese version of the ABQOL questionnaire was produced by forward-backward translation and cross-cultural adaptation of the original English version. The ABQOL questionnaire was then distributed to a total of 101 patients with autoimmune bullous diseases (AIBDs) together with the Dermatology Life Quality Index (DLQI) and the 36-item Short Form Health Survey (SF-36). Validity was analyzed across a range of indices and reliability was assessed using internal consistency and test-retest methods. The Chinese version of the ABQOL questionnaire has a high internal consistency (Cronbach's alpha coefficient, 0.88) and test-retest reliability (intraclass correlation coefficient, 0.87). Face and content validity were satisfactory. Convergent validity testing showed that the correlation coefficient between the ABQOL and DLQI was 0.77 and that between the ABQOL and SF-36 was -0.62. In terms of discriminant validity, there was no significant difference between the proportions of insensitive items in ABQOL and DLQI (p = 0.236), nor between the proportions of insensitive items in ABQOL and SF-36 (p = 0.823). The Chinese version of the ABQOL questionnaire has adequate validity and reliability. It may constitute a useful instrument to measure disease burden in Chinese patients with AIBDs.
Validation in the Absence of Observed Events.
Lathrop, John; Ezell, Barry
2016-04-01
This article addresses the problem of validating models in the absence of observed events, in the area of weapons of mass destruction terrorism risk assessment. We address that problem with a broadened definition of "validation," based on stepping "up" a level to considering the reason why decisionmakers seek validation, and from that basis redefine validation as testing how well the model can advise decisionmakers in terrorism risk management decisions. We develop that into two conditions: validation must be based on cues available in the observable world; and it must focus on what can be done to affect that observable world, i.e., risk management. That leads to two foci: (1) the real-world risk generating process, and (2) best use of available data. Based on our experience with nine WMD terrorism risk assessment models, we then describe three best use of available data pitfalls: SME confidence bias, lack of SME cross-referencing, and problematic initiation rates. Those two foci and three pitfalls provide a basis from which we define validation in this context in terms of four tests--Does the model: … capture initiation? … capture the sequence of events by which attack scenarios unfold? … consider unanticipated scenarios? … consider alternative causal chains? Finally, we corroborate our approach against three validation tests from the DOD literature: Is the model a correct representation of the process to be simulated? To what degree are the model results comparable to the real world? Over what range of inputs are the model results useful? © 2015 Society for Risk Analysis.
QSAR study of curcumine derivatives as HIV-1 integrase inhibitors.
Gupta, Pawan; Sharma, Anju; Garg, Prabha; Roy, Nilanjan
2013-03-01
A QSAR study was performed on curcumine derivatives as HIV-1 integrase inhibitors using multiple linear regression. The statistically significant model was developed with a squared correlation coefficient (r²) of 0.891 and a cross-validated r² (r²cv) of 0.825. The developed model revealed that electronic, shape, size, geometry, substitution's information and hydrophilicity were important atomic properties for determining the inhibitory activity of these molecules. The model was also tested successfully for external validation (r²pred = 0.849) as well as Tropsha's test for model predictability. Furthermore, the domain analysis was carried out to evaluate the prediction reliability of external set molecules. The model was statistically robust and had good predictive power which can be successfully utilized for screening of new molecules.
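To make the distinction between the fitted r² and the cross-validated r² concrete, the sketch below computes both for a one-descriptor linear model using leave-one-out cross-validation, with q² = 1 − PRESS / SS_tot. This is a deliberate simplification: the abstract reports a multiple linear regression and does not state which cross-validation scheme was used, so the single descriptor, the leave-one-out choice, and the numbers are all assumptions made for illustration.

```python
def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((u - mx) ** 2 for u in x)
    slope = sum((u - mx) * (v - my) for u, v in zip(x, y)) / sxx
    return slope, my - slope * mx


def r_squared(x, y):
    """Coefficient of determination of the model fitted on all data."""
    slope, intercept = fit_line(x, y)
    my = sum(y) / len(y)
    ss_res = sum((v - (slope * u + intercept)) ** 2 for u, v in zip(x, y))
    ss_tot = sum((v - my) ** 2 for v in y)
    return 1.0 - ss_res / ss_tot


def loo_q_squared(x, y):
    """Leave-one-out cross-validated r^2: 1 - PRESS / SS_tot."""
    my = sum(y) / len(y)
    press = 0.0
    for i in range(len(x)):
        # Refit without compound i, then predict compound i.
        slope, intercept = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        press += (y[i] - (slope * x[i] + intercept)) ** 2
    ss_tot = sum((v - my) ** 2 for v in y)
    return 1.0 - press / ss_tot


# Hypothetical descriptor values and activities (not from the study).
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
activity = [1.1, 1.9, 3.2, 3.8, 5.1, 6.0]
r2 = r_squared(descriptor, activity)
q2 = loo_q_squared(descriptor, activity)
```

For a least-squares model the leave-one-out prediction errors are never smaller than the fitted residuals, so q² cannot exceed r² under this definition; a large gap between the two is usually read as a sign of overfitting.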
Brodie, Nicholas I.; Popov, Konstantin I.; Petrotchenko, Evgeniy V.; Dokholyan, Nikolay V.; Borchers, Christoph H.
2017-01-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein—models for α helix–rich and β sheet–rich proteins, respectively—and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures. PMID:28695211
Brodie, Nicholas I; Popov, Konstantin I; Petrotchenko, Evgeniy V; Dokholyan, Nikolay V; Borchers, Christoph H
2017-07-01
We present an integrated experimental and computational approach for de novo protein structure determination in which short-distance cross-linking data are incorporated into rapid discrete molecular dynamics (DMD) simulations as constraints, reducing the conformational space and achieving the correct protein folding on practical time scales. We tested our approach on myoglobin and FK506 binding protein (models for α helix-rich and β sheet-rich proteins, respectively) and found that the lowest-energy structures obtained were in agreement with the crystal structure, hydrogen-deuterium exchange, surface modification, and long-distance cross-linking validation data. Our approach is readily applicable to other proteins with unknown structures.
Jeyashree, Kathiresan; Shewade, Hemant Deepak; Kathirvel, Soundappan
2018-04-17
Dundee Ready Educational Environment Measure (DREEM) is a 50-item tool to assess the educational environment of medical institutions as perceived by the students. This cross-sectional study developed and validated an abridged version of the DREEM-50 with an aim to have a less resource-intensive (time, manpower), yet valid and reliable, version of DREEM-50 while also avoiding respondent fatigue. A methodology similar to that used in the development of WHO-BREF was adopted to develop the abridged version of DREEM. Medical students (n = 418) from a private teaching hospital in Madurai, India, were divided into two groups. Group I (n = 277) participated in the development of the abridged version. This was performed by domain-wise selection of items that had the highest item-total correlation. Group II (n = 141) participated in the testing of the abridged version for construct validity, internal consistency and test-retest reliability. Confirmatory factor analysis was performed to assess the construct validity of DREEM-12. The abridged version had 12 items (DREEM-12) spread over all five domains in DREEM-50. DREEM-12 explained 77.4% of the variance in DREEM-50 scores. Correlation between total scores of DREEM-50 and DREEM-12 was 0.88 (p < 0.001). Confirmatory factor analysis of DREEM-12 construct was statistically significant (LR test of model vs. saturated p = 0.0006). The internal consistency of DREEM-12 was 0.83. The test-retest reliability of DREEM-12 was 0.595, p < 0.001. DREEM-12 is a valid and reliable tool for use in educational research. Future research using DREEM-12 will establish its validity and reliability across different settings.
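The item-reduction step and the internal-consistency figure mentioned in this abstract can be sketched as follows. The code ranks items within a domain by their corrected item-total correlation (each item against the sum of the other items) and computes Cronbach's alpha from item and total-score variances. Whether the authors used the corrected or uncorrected correlation is not stated, so that choice, the helper names, and the toy response matrix are assumptions.

```python
import statistics


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def corrected_item_total(responses, item):
    """Correlation of one item with the sum of all the other items."""
    item_scores = [row[item] for row in responses]
    rest_scores = [sum(row) - row[item] for row in responses]
    return pearson(item_scores, rest_scores)


def top_items(responses, domain_items, n_keep):
    """Keep the n_keep items of a domain with the highest item-total correlation."""
    ranked = sorted(domain_items,
                    key=lambda j: corrected_item_total(responses, j),
                    reverse=True)
    return sorted(ranked[:n_keep])


def cronbach_alpha(responses):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / total variance)."""
    k = len(responses[0])
    item_vars = [statistics.variance([row[j] for row in responses])
                 for j in range(k)]
    total_var = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents (rows) x 4 items (columns).
responses = [
    [1, 1, 2, 3],
    [2, 2, 2, 1],
    [3, 3, 4, 4],
    [4, 4, 3, 2],
    [5, 5, 5, 3],
]
kept = top_items(responses, [0, 1, 2, 3], n_keep=3)
alpha = cronbach_alpha(responses)
```

In this toy matrix the fourth item tracks the others only weakly, so it is the one dropped when three items are kept.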
Measurement of the static and dynamic coefficients of a cross-type parachute in subsonic flow
NASA Technical Reports Server (NTRS)
Shpund, Zalman; Levin, Daniel
1991-01-01
An experimental parametric investigation of the aerodynamic qualities of cross-type parachutes was performed in a subsonic wind tunnel, using a new experimental technique. This investigation included the measurement of the static and dynamic aerodynamic coefficients, utilizing the measuring apparatus modified specifically for this type of testing. It is shown that the static aerodynamic coefficients of several configurations are in good agreement with available data, and assisted in validating the experimental technique employed. Two configuration parameters were varied in the static tests, the cord length and the canopy aspect ratio, with both parameters having a similar effect on the drag measurement, i.e., any increase in either of them increased the effective blocking area, and therefore the axial force.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roper, J; Ghavidel, B; Godette, K
Purpose: To validate a knowledge-based algorithm for prostate LDR brachytherapy treatment planning. Methods: A dataset of 100 cases was compiled from an active prostate seed implant service. Cases were randomized into 10 subsets. For each subset, the 90 remaining library cases were registered to a common reference frame and then characterized on a point by point basis using principal component analysis (PCA). Each test case was converted to PCA vectors using the same process and compared with each library case using a Mahalanobis distance to evaluate similarity. Rank order PCA scores were used to select the best-matched library case. The seed arrangement was extracted from the best-matched case and used as a starting point for planning the test case. Any subsequent modifications were recorded that required input from a treatment planner to achieve V100>95%, V150<60%, V200<20%. To simulate operating-room planning constraints, seed activity was held constant, and the seed count could not increase. Results: The computational time required to register test-case contours and evaluate PCA similarity across the library was 10s. Preliminary analysis of 2 subsets shows that 9 of 20 test cases did not require any seed modifications to obtain an acceptable plan. Five test cases required fewer than 10 seed modifications or a grid shift. Another 5 test cases required approximately 20 seed modifications. An acceptable plan was not achieved for 1 outlier, which was substantially larger than its best match. Modifications took between 5s and 6min. Conclusion: A knowledge-based treatment planning algorithm for prostate LDR brachytherapy is being cross validated using 100 prior cases. Preliminary results suggest that for this size library, acceptable plans can be achieved without planner input in about half of the cases while varying amounts of planner input are needed in remaining cases. Computational time and planning time are compatible with clinical practice.
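The matching step in this abstract (compare a test case with every library case by Mahalanobis distance in PCA space, then take the closest) can be sketched as below. The sketch assumes each case has already been reduced to a vector of PCA scores from a PCA fitted on the library; under that assumption the components are uncorrelated, the covariance matrix is diagonal, and the Mahalanobis distance reduces to a Euclidean distance scaled by each component's variance. The function names and score values are hypothetical, and the abstract's exact ranking rule is not reproduced here.

```python
import math
import statistics


def component_variances(library_scores):
    """Sample variance of each PCA component across the library cases."""
    n_components = len(library_scores[0])
    return [statistics.variance([case[j] for case in library_scores])
            for j in range(n_components)]


def mahalanobis_pca(a, b, variances):
    """Mahalanobis distance between two PCA score vectors.

    Assumes uncorrelated components (diagonal covariance), which holds
    when the scores come from a PCA fitted on the library itself.
    """
    return math.sqrt(sum((x - y) ** 2 / v
                         for x, y, v in zip(a, b, variances)))


def best_match(test_scores, library_scores):
    """Index of the library case closest to the test case."""
    variances = component_variances(library_scores)
    distances = [mahalanobis_pca(test_scores, case, variances)
                 for case in library_scores]
    return min(range(len(library_scores)), key=distances.__getitem__)


# Hypothetical two-component scores; the columns are zero-mean and uncorrelated.
library = [[2.0, 1.0], [-2.0, 1.0], [2.0, -1.0], [-2.0, -1.0]]
match_a = best_match([1.5, 0.8], library)
match_b = best_match([-0.4, -0.9], library)
```

In a leave-one-subset-out evaluation such as the one described, the variances would be recomputed from the 90 library cases of each fold so that the held-out cases never influence the distance metric.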
Advanced orbiting systems test-bedding and protocol verification
NASA Technical Reports Server (NTRS)
Noles, James; De Gree, Melvin
1989-01-01
The Consultative Committee for Space Data Systems (CCSDS) has begun the development of a set of protocol recommendations for Advanced Orbiting Systems (AOS). The AOS validation program and formal definition of AOS protocols are reviewed, and the configuration control of the AOS formal specifications is summarized. Independent implementations of the AOS protocols by NASA and ESA are discussed, and cross-support/interoperability tests which will allow the space agencies of various countries to share AOS communication facilities are addressed.
Bjorner, Jakob Bue; Pejtersen, Jan Hyld
2010-02-01
To evaluate the construct validity of the Copenhagen Psychosocial Questionnaire II (COPSOQ II) by means of tests for differential item functioning (DIF) and differential item effect (DIE). We used a Danish general population postal survey (n = 4,732 with 3,517 wage earners) with a one-year register based follow up for long-term sickness absence. DIF was evaluated against age, gender, education, social class, public/private sector employment, and job type using ordinal logistic regression. DIE was evaluated against job satisfaction and self-rated health (using ordinal logistic regression), against depressive symptoms, burnout, and stress (using multiple linear regression), and against long-term sick leave (using a proportional hazards model). We used a cross-validation approach to counter the risk of significant results due to multiple testing. Out of 1,052 tests, we found 599 significant instances of DIF/DIE, 69 of which showed both practical and statistical significance across two independent samples. Most DIF occurred for job type (in 20 cases), while we found little DIF for age, gender, education, social class and sector. DIE seemed to pertain to particular items, which showed DIE in the same direction for several outcome variables. The results allowed a preliminary identification of items that have a positive impact on construct validity and items that have negative impact on construct validity. These results can be used to develop better shortform measures and to improve the conceptual framework, items and scales of the COPSOQ II. We conclude that tests of DIF and DIE are useful for evaluating construct validity.
Alhajj, Mohammed Nasser; Amran, Abdullah Ghalib; Halboub, Esam; Al-Basmi, Abdulghani Ali; Al-Ghabri, Fawaz Abdullah
2017-07-01
This study aimed to develop the Arabic version of the Orofacial Esthetic Scale (OES-Ar) and to investigate its psychometric properties among Arabic-speaking populations with and without esthetic impairments. Translation and cross-cultural adaptation were done according to the standard guidelines. Internal consistency was assessed on 230 participants. For test-retest reliability, 50 subjects with natural teeth were recalled within a period of 2 weeks. Validity of the OES-Ar was tested by construct, convergent, and discriminant validity tests. Responsiveness to esthetic changes was assessed in 60 patients. The results showed excellent internal consistency, with a Cronbach's alpha value of 0.92 and an average inter-item correlation of 0.60. The ICC values ranged from 0.87 to 0.96, which indicated excellent agreement. Construct validity of the OES-Ar was confirmed as a one-factor (one-dimensional) structure. For convergent validity, a significant correlation was found between the OES summary score and the overall impression of orofacial esthetics, as well as between the OES summary score and the summary score of the three questions of the OHIP-49Ar related to esthetics. The discriminant validity test revealed significant differences between different study groups (P<0.001). Responsiveness to treatment was confirmed by significant differences between pre- and post-treatment OES total summary scores (P<0.001). The OES-Ar has excellent psychometric properties, making it a valuable instrument to assess orofacial esthetics in Arabic-speaking patients. Copyright © 2016 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Papadopoulou, Soultana L.; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam
2016-01-01
Introduction: The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective: The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods: Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and one-way ANOVA test. Results: According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The one-way ANOVA test in the two measurement times showed statistically significant correlation in both measurements (p = 0.02 and p = 0.016). Conclusion: The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in Greek-speaking patients. PMID:28050209
Papadopoulou, Soultana L; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam
2017-01-01
Introduction: The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective: The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods: Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and one-way ANOVA test. Results: According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The one-way ANOVA test in the two measurement times showed statistically significant correlation in both measurements (p = 0.02 and p = 0.016). Conclusion: The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in Greek-speaking patients.
Bohu, Y; Klouche, S; Lefevre, N; Webster, K; Herman, S
2015-04-01
The aim of this study was to translate, adapt and validate in French the Anterior Cruciate Ligament-Return to Sport after Injury (ACL-RSI), a 12-item English language scale assessing the psychological impact of returning to sports after ACL reconstruction. The ACL-RSI scale was forward and back translated, cross-culturally adapted and validated using international guidelines. The study population included all patients who were active in sports and underwent primary arthroscopic ACL reconstruction. The control group included subjects with no history of knee trauma. At the 6-month follow-up, the study population completed the ACL-RSI scale twice within 3-4 days, along with the Knee injury and Osteoarthritis Outcome Score (KOOS) and subjective International Knee Documentation Committee (IKDC) scores. Statistical tests assessed the construct validity, discriminant validity, internal consistency, reliability and feasibility of the ACL-RSI scale. Ninety-one patients with ACL tears and 98 control subjects were included: mean age 31.7 ± 8.1 and 21.8 ± 2 years, respectively. The ACL-RSI scores were correlated with all KOOS sub-categories (r = 0.22-0.64, p < 0.05) as well as the subjective IKDC score (r = 0.42, p < 0.00001). The mean scores of the study and control groups were significantly different (62.8 ± 19.4 vs. 89.6 ± 11.5, p < 0.00001), and scores were significantly better in patients who returned to the same sport (72.1 ± 21.4 vs. 60.3 ± 18.1, p = 0.008). Internal consistency was high (α = 0.96). Test-retest reproducibility was excellent: ρ = 0.90 (0.86-0.94), p < 0.00001. Administration time was 1.32 ± 0.7 min, and all items were answered. This study showed that the cross-cultural adaptation of the English version of the ACL-RSI was successful and validated in a French-speaking population. The discriminant capacity of the scale between patients who underwent reconstruction and healthy subjects was confirmed. Level of evidence: II.