objective test items: Topics by Science.gov

Sample records for objective test items

Objective and Item Banking Computer Software and Its Use in Comprehensive Achievement Monitoring.

ERIC Educational Resources Information Center

Schriber, Peter E.; Gorth, William P.

The current emphasis on objectives and test item banks for constructing more effective tests is being augmented by increasingly sophisticated computer software. Items can be catalogued in numerous ways for retrieval. The items as well as instructional objectives can be stored and test forms can be selected and printed by the computer. It is also…
Item Specifications, Science Grade 8. Blue Prints for Testing Minimum Performance Test.

ERIC Educational Resources Information Center

Arkansas State Dept. of Education, Little Rock.

These item specifications were developed as a part of the Arkansas "Minimum Performance Testing Program" (MPT). There is one item specification for each instructional objective included in the MPT. The purpose of an item specification is to provide an overview of the general content and format of test items used to measure an…
Item Specifications, Science Grade 6. Blue Prints for Testing Minimum Performance Test.

ERIC Educational Resources Information Center

Arkansas State Dept. of Education, Little Rock.

These item specifications were developed as a part of the Arkansas "Minimum Performance Testing Program" (MPT). There is one item specification for each instructional objective included in the MPT. The purpose of an item specification is to provide an overview of the general content and format of test items used to measure an…
Identification of metallic items that caused nickel dermatitis in Danish patients.

PubMed

Thyssen, Jacob P; Menné, Torkil; Johansen, Jeanne D

2010-09-01

Nickel allergy is prevalent as assessed by epidemiological studies. In an attempt to further identify and characterize sources that may result in nickel allergy and dermatitis, we analysed items identified by nickel-allergic dermatitis patients as causative of nickel dermatitis by using the dimethylglyoxime (DMG) test. Dermatitis patients with nickel allergy of current relevance were identified over a 2-year period in a tertiary referral patch test centre. When possible, their work tools and personal items were examined with the DMG test. Among 95 nickel-allergic dermatitis patients, 70 (73.7%) had metallic items investigated for nickel release. A total of 151 items were investigated, and 66 (43.7%) gave positive DMG test reactions. Objects were nearly all purchased or acquired after the introduction of the EU Nickel Directive. Only one object had been inherited, and only two objects had been purchased outside of Denmark. DMG testing is valuable as a screening test for nickel release and should be used to identify relevant exposures in nickel-allergic patients. Mainly consumer items, but also work tools used in an occupational setting, released nickel in dermatitis patients. This study confirmed 'risk items' from previous studies, including mobile phones.
Item Analysis in Introductory Economics Testing.

ERIC Educational Resources Information Center

Tinari, Frank D.

1979-01-01

Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)
Standards for Evaluating Criterion-Referenced Tests.

ERIC Educational Resources Information Center

Walker, Clinton B.

Standards for evaluating criterion-referenced tests are presented. Twenty-one standards, grouped in three categories, are discussed. Category one is defined as measurement properties and is comprised of conceptual validity, including description of the domain, test item agreement with objectives, and item representativeness of the objectives; and…
General Metals: Grades 7-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Ninety objectives and related test items for use in grades 7 through 12 are presented. Each sample contains an objective, test items, and criteria for judging the adequacy of the response. Objectives are organized into the following categories: (1) property of metals; (2) operations and functions; (3) cutting and shearing; (4) filing; (5) cutting…
A Multiple Objective Test Assembly Approach for Exposure Control Problems in Computerized Adaptive Testing

ERIC Educational Resources Information Center

Veldkamp, Bernard P.; Verschoor, Angela J.; Eggen, Theo J. H. M.

2010-01-01

Overexposure and underexposure of items in the bank are serious problems in operational computerized adaptive testing (CAT) systems. These exposure problems might result in item compromise, or point at a waste of investments. The exposure control problem can be viewed as a test assembly problem with multiple objectives. Information in the test has…
Implications of Changing Answers on Objective Test Items

ERIC Educational Resources Information Center

Mueller, Daniel J.; Wasser, Virginia

1977-01-01

Eighteen studies of the effects of changing initial answers to objective test items are reviewed. While students throughout the total test score range tended to gain more points than they lost, higher scoring students gain more than did lower scoring students. Suggestions for further research are made. (Author/JKS)
Test Bias: An Objective Definition for Test Items.

ERIC Educational Resources Information Center

Durovic, Jerry J.

A test bias definition, applicable at the item-level of a test is presented. The definition conceptually equates test bias with measuring different things in different groups, and operationally equates test bias with a difference in item fit to the Rasch Model, greater than one, between groups. It is suggested that the proposed definition avoids…
An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

ERIC Educational Resources Information Center

Ali, Usama S.; Chang, Hua-Hua

2014-01-01

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…
U.S. History: Grades 7-9. Revised Edition.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Sixty-three behavioral objectives and related test items for United States history in grades seven through nine are presented. Each sample contains the objective, sample test items and directions, and criteria for judging the adequacy of student responses. Fourteen of the 15 categories are content oriented and presented chronologically: (1)…
U.S. History: Grades 10-12. Revised Edition.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Seventy-seven behavioral objectives and related test items for United States history in grades 10 through 12 are presented. Each sample contains the objective, sample test items, and criteria for judging the adequacy of student responses. Fourteen of the 15 categories are content-oriented, and presented in chronological groups: (1) discovery of…
Dual-Objective Item Selection Criteria in Cognitive Diagnostic Computerized Adaptive Testing

ERIC Educational Resources Information Center

Kang, Hyeon-Ah; Zhang, Susu; Chang, Hua-Hua

2017-01-01

The development of cognitive diagnostic-computerized adaptive testing (CD-CAT) has provided a new perspective for gaining information about examinees' mastery on a set of cognitive attributes. This study proposes a new item selection method within the framework of dual-objective CD-CAT that simultaneously addresses examinees' attribute mastery…
The Nature of Objectivity with the Rasch Model.

ERIC Educational Resources Information Center

Whitely, Susan E.; Dawis, Rene V.

Although it has been claimed that the Rasch model leads to a higher degree of objectivity in measurement than has been previously possible, this model has had little impact on test development. Population-invariant item and ability calibrations along with the statistical equivalency of any two item subsets are supposedly possible if the item pool…
Assessment of item-writing flaws in multiple-choice questions.

PubMed

Nedeau-Cayo, Rosemarie; Laughlin, Deborah; Rus, Linda; Hall, John

2013-01-01

This study evaluated the quality of multiple-choice questions used in a hospital's e-learning system. Constructing well-written questions is fraught with difficulty, and item-writing flaws are common. Study results revealed that most items contained flaws and were written at the knowledge/comprehension level. Few items had linked objectives, and no association was found between the presence of objectives and flaws. Recommendations include education for writing test questions.
Item Selection in Multidimensional Computerized Adaptive Testing--Gaining Information from Different Angles

ERIC Educational Resources Information Center

Wang, Chun; Chang, Hua-Hua

2011-01-01

Over the past thirty years, obtaining diagnostic information from examinees' item responses has become an increasingly important feature of educational and psychological testing. The objective can be achieved by sequentially selecting multidimensional items to fit the class of latent traits being assessed, and therefore Multidimensional…
Distinctions between Item Format and Objectivity in Scoring.

ERIC Educational Resources Information Center

Terwilliger, James S.

This paper clarifies important distinctions in item writing and item scoring and considers the implications of these distinctions for developing guidelines related to test construction for training teachers. The terminology used to describe and classify paper and pencil test questions frequently confuses two distinct features of questions:…
ITEM SELECTION TECHNIQUES AND EVALUATION OF INSTRUCTIONAL OBJECTIVES.

ERIC Educational Resources Information Center

COX, RICHARD C.

THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…
A Rigorous Test of the Fit of the Circumplex Model to Big Five Personality Data: Theoretical and Methodological Issues and Two Large Sample Empirical Tests.

PubMed

DeGeest, David Scott; Schmidt, Frank

2015-01-01

Our objective was to apply the rigorous test developed by Browne (1992) to determine whether the circumplex model fits Big Five personality data. This test has yet to be applied to personality data. Another objective was to determine whether blended items explained correlations among the Big Five traits. We used two working adult samples, the Eugene-Springfield Community Sample and the Professional Worker Career Experience Survey. Fit to the circumplex was tested via Browne's (1992) procedure. Circumplexes were graphed to identify items with loadings on multiple traits (blended items), and to determine whether removing these items changed five-factor model (FFM) trait intercorrelations. In both samples, the circumplex structure fit the FFM traits well. Each sample had items with dual-factor loadings (8 items in the first sample, 21 in the second). Removing blended items had little effect on construct-level intercorrelations among FFM traits. We conclude that rigorous tests show that the fit of personality data to the circumplex model is good. This finding means the circumplex model is competitive with the factor model in understanding the organization of personality traits. The circumplex structure also provides a theoretically and empirically sound rationale for evaluating intercorrelations among FFM traits. Even after eliminating blended items, FFM personality traits remained correlated.

Difficulty and Discriminability of Introductory Psychology Test Items.

ERIC Educational Resources Information Center

Scialfa, Charles; Legare, Connie; Wenger, Larry; Dingley, Louis

2001-01-01

Analyzes multiple-choice questions provided in test banks for introductory psychology textbooks. Study 1 offered a consistent picture of the objective difficulty of multiple-choice tests for introductory psychology students, while both studies 1 and 2 indicated that test items taken from commercial test banks have poor psychometric properties.…
Repetition Blindness for Rotated Objects

ERIC Educational Resources Information Center

Hayward, William G.; Zhou, Guomei; Man, Wai-Fung; Harris, Irina M.

2010-01-01

Repetition blindness (RB) is the finding that observers often miss the repetition of an item within a rapid stream of words or objects. Recent studies have shown that RB for objects is largely unaffected by variations in viewpoint between the repeated items. In 5 experiments, we tested RB under different axes of rotation, with different types of…
Meatcutting Testbook, Part 2.

ERIC Educational Resources Information Center

California State Dept. of Education, Sacramento. Bureau of Publications.

This document contains objective tests for each topic in the Meatcutting Workbook, Part 2, which is designed for apprenticeship meatcutting programs in California. Each of the 30 tests consists of from 5 to 65 multiple-choice items with most tests containing approximately 10 items. The tests are grouped according to the eight units of the…
FIRST GRADE CHILDREN'S CONCEPT OF ADDITION OF NATURAL NUMBERS.

ERIC Educational Resources Information Center

STEFFE, LESLIE; VAN ENGEN, HENRY

MIDDLE-CLASS, FIRST-GRADE STUDENTS (100) WERE TESTED INDIVIDUALLY ON 4 ITEMS OF CONCEPT OF ADDITION AND CONSERVATION OF NUMBER. THE TEST ITEMS WERE IDENTICAL EXCEPT FOR THE NUMBER OF OBJECTS INVOLVED. FOR EACH ITEM, TWO PILES OF CANDY WERE PLACED BEFORE EACH CHILD AND THEN MOVED TOGETHER. THE STUDY SHOWED NO MAJOR DIFFERENCE IN THE MEAN…
Developing Parallel Career and Occupational Development Objectives and Exercise (Test) Items in Spanish for Assessment and Evaluation.

ERIC Educational Resources Information Center

Muratti, Jose E.; And Others

A parallel Spanish edition was developed of released objectives and objective-referenced items used in the National Assessment of Educational Progress (NAEP) in the field of Career and Occupational Development (COD). The Spanish edition was designed to assess the identical skills, attitudes, concepts, and knowledge of Spanish-dominant students…
Adult Roles & Functions. Objective Based Evaluation System.

ERIC Educational Resources Information Center

West Virginia State Vocational Curriculum Lab., Cedar Lakes.

This book of objective-based test items is designed to be used with the Adult Roles and Functions curriculum for a non-laboratory home economic course for grades eleven and twelve. It contains item banks for each cognitive objective in the curriculum. In addition, there is a form for the table of specifications to be developed for each unit. This…
Post-encoding emotional arousal enhances consolidation of item memory, but not reality-monitoring source memory.

PubMed

Wang, Bo; Sun, Bukuan

2017-03-01

The current study examined whether the effect of post-encoding emotional arousal on item memory extends to reality-monitoring source memory and, if so, whether the effect depends on emotionality of learning stimuli and testing format. In Experiment 1, participants encoded neutral words and imagined or viewed their corresponding object pictures. Then they watched a neutral, positive, or negative video. The 24-hour delayed test showed that emotional arousal had little effect on both item memory and reality-monitoring source memory. Experiment 2 was similar except that participants encoded neutral, positive, and negative words and imagined or viewed their corresponding object pictures. The results showed that positive and negative emotional arousal induced after encoding enhanced consolidation of item memory, but not reality-monitoring source memory, regardless of emotionality of learning stimuli. Experiment 3, identical to Experiment 2 except that participants were tested only on source memory for all the encoded items, still showed that post-encoding emotional arousal had little effect on consolidation of reality-monitoring source memory. Taken together, regardless of emotionality of learning stimuli and regardless of testing format of source memory (conjunction test vs. independent test), the facilitatory effect of post-encoding emotional arousal on item memory does not generalize to reality-monitoring source memory.
Item analysis of university-wide multiple choice objective examinations: the experience of a Nigerian private university.

PubMed

Odukoya, Jonathan A; Adekeye, Olajide; Igbinoba, Angie O; Afolabi, A

2018-01-01

Teachers and Students worldwide often dance to the tune of tests and examinations. Assessments are powerful tools for catalyzing the achievement of educational goals, especially if done rightly. One of the tools for 'doing it rightly' is item analysis. The core objectives for this study, therefore, were: ascertaining the item difficulty and distractive indices of the university wide courses. A range of 112-1956 undergraduate students participated in this study. With the use of secondary data, the ex-post facto design was adopted for this project. In virtually all cases, majority of the items (ranging between 65% and 97% of the 70 items fielded in each course) did not meet psychometric standard in terms of difficulty and distractive indices and consequently needed to be moderated or deleted. Considering the importance of these courses, the need to apply item analyses when developing these tests was emphasized.
Mechanical Drawing: Grades 7-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Eighty-five behavioral objectives and related evaluation items for mechanical drawing in grades 7 through 12 are presented. Each sample contains the objective, test items, and means for judging the adequacy of the response. The following categories are included: (1) basic drafting skills; (2) beginning lettering; (3) drawing; (4) orthographic…
Woodworking: Grades 7-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

The woodworking collection is composed of 55 objectives and related evaluation items for use in grades 7 through 12. Each sample contains the objective, test items, and criteria for judging the adequacy of the response. Woodworking categories being measured include sharpening, adjusting, using and caring for tools; reading a working drawing; stock…
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test*

PubMed Central

Tepe, Rodger; Tepe, Chabha

2015-01-01

Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736
To call a cloud 'cirrus': sound symbolism in names for categories or items.

PubMed

Ković, Vanja; Sučević, Jelena; Styles, Suzy J

2017-01-01

The aim of the present paper is to experimentally test whether sound symbolism has selective effects on labels with different ranges-of-reference within a simple noun-hierarchy. In two experiments, adult participants learned the make up of two categories of unfamiliar objects ('alien life forms'), and were passively exposed to either category-labels or item-labels, in a learning-by-guessing categorization task. Following category training, participants were tested on their visual discrimination of object pairs. For different groups of participants, the labels were either congruent or incongruent with the objects. In Experiment 1, when trained on items with individual labels, participants were worse (made more errors) at detecting visual object mismatches when trained labels were incongruent. In Experiment 2, when participants were trained on items in labelled categories, participants were faster at detecting a match if the trained labels were congruent, and faster at detecting a mismatch if the trained labels were incongruent. This pattern of results suggests that sound symbolism in category labels facilitates later similarity judgments when congruent, and discrimination when incongruent, whereas for item labels incongruence generates error in judgements of visual object differences. These findings reveal that sound symbolic congruence has a different outcome at different levels of labelling within a noun hierarchy. These effects emerged in the absence of the label itself, indicating subtle but pervasive effects on visual object processing.
Memory for Items and Relationships among Items Embedded in Realistic Scenes: Disproportionate Relational Memory Impairments in Amnesia

PubMed Central

Hannula, Deborah E.; Tranel, Daniel; Allen, John S.; Kirchhoff, Brenda A.; Nickel, Allison E.; Cohen, Neal J.

2014-01-01

Objective The objective of this study was to examine the dependence of item memory and relational memory on medial temporal lobe (MTL) structures. Patients with amnesia, who either had extensive MTL damage or damage that was relatively restricted to the hippocampus, were tested, as was a matched comparison group. Disproportionate relational memory impairments were predicted for both patient groups, and those with extensive MTL damage were also expected to have impaired item memory. Method Participants studied scenes, and were tested with interleaved two-alternative forced-choice probe trials. Probe trials were either presented immediately after the corresponding study trial (lag 1), five trials later (lag 5), or nine trials later (lag 9) and consisted of the studied scene along with a manipulated version of that scene in which one item was replaced with a different exemplar (item memory test) or was moved to a new location (relational memory test). Participants were to identify the exact match of the studied scene. Results As predicted, patients were disproportionately impaired on the test of relational memory. Item memory performance was marginally poorer among patients with extensive MTL damage, but both groups were impaired relative to matched comparison participants. Impaired performance was evident at all lags, including the shortest possible lag (lag 1). Conclusions The results are consistent with the proposed role of the hippocampus in relational memory binding and representation, even at short delays, and suggest that the hippocampus may also contribute to successful item memory when items are embedded in complex scenes. PMID:25068665
Math: Figure and Object Characteristics. Measurement and Geometry. Grades K-9. Revised Edition.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

To help classroom teachers construct mathematics tests, thirty-seven general objectives, corresponding sub-objectives, sample test items, and answers are presented. In general, sub-objectives are arranged in increasing order of difficulty. The objectives were written to comprehensively cover two categories: measurement and geometry. Measurement…
An Attempt to Influence Selected Portions of Student Learning.

ERIC Educational Resources Information Center

Anderson, Edwin R.

In an attempt to selectively improve student performance, one-half of a set of difficult test items from a FORTRAN programming class had handouts explaining the concepts underlying the items distributed to the students. Each handout contained a written learning objective, a short prose passage explaining the objective, and one or more practice…
The Impact of Escape Alternative Position Change in Multiple-Choice Test on the Psychometric Properties of a Test and Its Items Parameters

ERIC Educational Resources Information Center

Hamadneh, Iyad Mohammed

2015-01-01

This study aimed at investigating the impact changing of escape alternative position in multiple-choice test on the psychometric properties of a test and it's items parameters (difficulty, discrimination & guessing), and estimation of examinee ability. To achieve the study objectives, a 4-alternative multiple choice type achievement test…
Competency Test Items for Applied Principles of Agribusiness and Natural Resources Occupations. Ornamental Horticulture Component. A Report of Research.

ERIC Educational Resources Information Center

Cheek, Jimmy G.; McGhee, Max B.

The central purpose of this study was to develop and field test written criterion-referenced tests for the ornamental horticulture component of applied principles of agribusiness and natural resources occupations programs. The test items were to be used by secondary agricultural education students in Florida. Based upon the objectives identified…
Differential item functioning analysis of the Vanderbilt Expertise Test for cars.

PubMed

Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W; Van Gulick, Ana Beth; Gauthier, Isabel

2015-01-01

The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.
Home Economics. Sample Test Items. Levels I and II.

ERIC Educational Resources Information Center

New York State Education Dept., Albany. Bureau of Elementary and Secondary Educational Testing.

A sample of behavioral objectives and related test items that could be developed for content modules in Home Economics levels I and II, this book is intended to enable teachers to construct more valid and reliable test materials. Forty-eight one-page modules are presented, and opposite each module are listed two to seven specific behavioral…
Measuring psychological trauma after spinal cord injury: Development and psychometric characteristics of the SCI-QOL Psychological Trauma item bank and short form

PubMed Central

Kisala, Pamela A.; Victorson, David; Pace, Natalie; Heinemann, Allen W.; Choi, Seung W.; Tulsky, David S.

2015-01-01

Objective To describe the development and psychometric properties of the SCI-QOL Psychological Trauma item bank and short form. Design Using a mixed-methods design, we developed and tested a Psychological Trauma item bank with patient and provider focus groups, cognitive interviews, and item response theory based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. Setting We tested a 31-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Veterans Administration hospital. Participants A total of 716 individuals with SCI completed the trauma items Results The 31 items fit a unidimensional model (CFI=0.952; RMSEA=0.061) and demonstrated good precision (theta range between 0.6 and 2.5). Nine items demonstrated negligible DIF with little impact on score estimates. The final calibrated item bank contains 19 items Conclusion The SCI-QOL Psychological Trauma item bank is a psychometrically robust measurement tool from which a short form and a computer adaptive test (CAT) version are available. PMID:26010967

Standardization in the Handling and Evaluation of Objective Examinations.

ERIC Educational Resources Information Center

Sass, M. Burke

1978-01-01

In response to requests for standardization on testing and grading, a pilot program for the administration and evaluation of objective examinations was instituted. Outlined are objectives, initial test item collection, procedural flow for examinations, faculty responsibilities, support staff responsibilities, and project coordinator services. (LBH)
A signal detection-item response theory model for evaluating neuropsychological measures.

PubMed

Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G

2018-02-05

Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory-which permits the modeling of item difficulty and examinee ability-and from signal detection theory-which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias. We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the development of computerized adaptive tests and integration with mixture and random-effects models.
Constructing a question bank based on script concordance approach as a novel assessment methodology in surgical education.

PubMed

Aldekhayel, Salah A; Alselaim, Nahar A; Magzoub, Mohi Eldin; Al-Qattan, Mohammad M; Al-Namlah, Abdullah M; Tamim, Hani; Al-Khayal, Abdullah; Al-Habdan, Sultan I; Zamakhshary, Mohammed F

2012-10-24

Script Concordance Test (SCT) is a new assessment tool that reliably assesses clinical reasoning skills. Previous descriptions of developing SCT-question banks were merely subjective. This study addresses two gaps in the literature: 1) conducting the first phase of a multistep validation process of SCT in Plastic Surgery, and 2) providing an objective methodology to construct a question bank based on SCT. After developing a test blueprint, 52 test items were written. Five validation questions were developed and a validation survey was established online. Seven reviewers were asked to answer this survey. They were recruited from two countries, Saudi Arabia and Canada, to improve the test's external validity. Their ratings were transformed into percentages. Analysis was performed to compare reviewers' ratings by looking at correlations, ranges, means, medians, and overall scores. Scores of reviewers' ratings were between 76% and 95% (mean 86% ± 5). We found poor correlations between reviewers (Pearson's: +0.38 to -0.22). Ratings of individual validation questions ranged between 0 and 4 (on a scale 1-5). Means and medians of these ranges were computed for each test item (mean: 0.8 to 2.4; median: 1 to 3). A subset of test items comprising 27 items was generated based on a set of inclusion and exclusion criteria. This study proposes an objective methodology for validation of SCT-question bank. Analysis of validation survey is done from all angles, i.e., reviewers, validation questions, and test items. Finally, a subset of test items is generated based on a set of criteria.
Effects of motor congruence on visual working memory.

PubMed

Quak, Michel; Pecher, Diane; Zeelenberg, Rene

2014-10-01

Grounded-cognition theories suggest that memory shares processing resources with perception and action. The motor system could be used to help memorize visual objects. In two experiments, we tested the hypothesis that people use motor affordances to maintain object representations in working memory. Participants performed a working memory task on photographs of manipulable and nonmanipulable objects. The manipulable objects were objects that required either a precision grip (i.e., small items) or a power grip (i.e., large items) to use. A concurrent motor task that could be congruent or incongruent with the manipulable objects caused no difference in working memory performance relative to nonmanipulable objects. Moreover, the precision- or power-grip motor task did not affect memory performance on small and large items differently. These findings suggest that the motor system plays no part in visual working memory.
Differential age-related effects on conjunctive and relational visual short-term memory binding.

PubMed

Bastin, Christine

2017-12-28

An age-related associative deficit has been described in visual short-term binding memory tasks. However, separate studies have suggested that ageing disrupts relational binding (to associate distinct items or item and context) more than conjunctive binding (to integrate features within an object). The current study directly compared relational and conjunctive binding with a short-term memory task for object-colour associations in 30 young and 30 older adults. Participants studied a number of object-colour associations corresponding to their individual object span level in a relational task in which objects were associated to colour patches and a conjunctive task where colour was integrated into the object. Memory for individual items and for associations was tested with a recognition memory test. Evidence for an age-related associative deficit was observed in the relational binding task, but not in the conjunctive binding task. This differential impact of ageing on relational and conjunctive short-term binding is discussed by reference to two underlying age-related cognitive difficulties: diminished hippocampally dependent binding and attentional resources.
An Internal Construct Validation Study of the "Iowa Tests of Basic Skills" (Level 12, Form G) Reading Comprehension Test Items.

ERIC Educational Resources Information Center

Perkins, Kyle; Duncan, Ann

An assessment analysis was performed to determine whether sets of items designed to measure three different subskills of reading comprehension of the Iowa Tests of Basic Skills (ITBSs) did, in fact, distinguish among these subskills. The three major skills objectives were: (1) facts; (2) generalizations; and (3) inferences. Data from…
Food and Nutrition (Intermediate). Performance Objectives and Criterion-Referenced Test Items.

ERIC Educational Resources Information Center

Missouri Univ., Columbia. Instructional Materials Lab.

This document contains competencies and criterion-referenced test items for the Intermediate Food and Nutrition semester course in Missouri that were derived from the duties and tasks of the Missouri homemaker and identified and validated by home economics teachers and subject matter specialists. The guide is designed to assist home economics…
Differential item functioning analysis of the Vanderbilt Expertise Test for cars

PubMed Central

Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W.; Van Gulick, Ana Beth; Gauthier, Isabel

2015-01-01

The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge. PMID:26418499
[A test to measure the degree of knowledge on food and nutrition at the onset of elementary school].

PubMed

Ivanovic Marincovich, D; Castro Gómez, C G; Ivanovic Marincovich, R

1997-06-01

The objective of this work was to design a test to measure the degree of knowledge on food and nutrition in school-age children from elementary first and second grades. A graphic instrument was designed according to the psychological child development and was based on the specific objectives pursued by the curriculum programs of the Ministry of Education. The test was developed around the following topics through 15 items: Area 1: Basic Concepts on Food and Nutrition (9 items) and Area 2: Food, Personal and Environmental Hygiene (9 items). The test was pilot tested on 103 school-age children of both grades (1:1), of both sexes (1:1), belonging to Peñalolén and Las Condes counties from Chile's Metropolitan Region and from high and low socioeconomic status (SES) (1:1), measured through the Graffar's Modified Method. The final version of the test was applied in a representative sample of 1.482 school-age children from Chile's Metropolitan Region from elementary first and second grades during 1986-1987. Content validity was assured by a team of judges and by the curriculum programs. Reliability was assessed by the Spearman correlation with the Spearman-Brown correction. Item-test consistency was determined by the Pearson correlation coefficient. Data were processed by the statistical analysis system (SAS) package. Results showed that reliability coefficient was 0.84 and item-test consistency was equal or above 0.25 in all items. It can be concluded that this test can be useful to determine the degree of knowledge on food and nutrition at the onset of elementary school, both in Chile and in other countries.
Math: Data Relationships. Graphs, Ratios and Proportions, Statistics and Probability. Grades K-9. Revised Edition.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

To help classroom teachers in grades K-9 construct mathematics tests, fifteen general objectives, corresponding sub-objectives, sample test items, and answers are presented. In general, sub-objectives are arranged in increasing order of difficulty. The objectives were written to comprehensively cover three categories. The first, graphs, covers the…
Development, Validation, and Use of an Item Bank for Police Promotion Examinations.

ERIC Educational Resources Information Center

Enger, John M.

In Arkansas, in reaction to complaints about traditional methods of selection for promotion, the civil service commission has chosen to base promotions in the police department solely on scores on locally-developed objective tests. Items developed and loaded into a computerized test bank were selected from six areas of responsibility: (1) criminal…
Using Empirical Data to Set Cutoff Scores.

ERIC Educational Resources Information Center

Hills, John R.

Six experimental approaches to the problems of setting cutoff scores and choosing proper test length are briefly mentioned. Most of these methods share the premise that a test is a random sample of items, from a domain associated with a carefully specified objective. Each item is independent and is scored zero or one, with no provision for…
An Analysis of Variance Approach for the Estimation of Response Time Distributions in Tests

ERIC Educational Resources Information Center

Attali, Yigal

2010-01-01

Generalizability theory and analysis of variance methods are employed, together with the concept of objective time pressure, to estimate response time distributions and the degree of time pressure in timed tests. By estimating response time variance components due to person, item, and their interaction, and fixed effects due to item types and…
Transportable Applications Environment (TAE) Tenth Users' Conference

NASA Technical Reports Server (NTRS)

Rouff, Chris (Editor); Harris, Elfrieda (Editor); Yeager, Arleen (Editor)

1993-01-01

Conference proceedings are represented in graphic visual-aid form. Presentation and panel discussion topics include user experiences with C++ and Ada; the design and interaction of the user interface; the history and goals of TAE; commercialization and testing of TAE Plus; Computer-Human Interaction Models (CHIMES); data driven objects; item-to-item connections and object dependencies; and integration with other software. There follows a list of conference attendees.
Parent Ratings of ADHD Symptoms: Generalized Partial Credit Model Analysis of Differential Item Functioning across Gender

ERIC Educational Resources Information Center

Gomez, Rapson

2012-01-01

Objective: Generalized partial credit model, which is based on item response theory (IRT), was used to test differential item functioning (DIF) for the "Diagnostic and Statistical Manual of Mental Disorders" (4th ed.), inattention (IA), and hyperactivity/impulsivity (HI) symptoms across boys and girls. Method: To accomplish this, parents completed…
Clinical instruments: reliability and validity critical appraisal.

PubMed

Brink, Yolandi; Louw, Quinette A

2012-12-01

RATIONALE, AIM AND OBJECTIVES: There is a lack of health care practitioners using objective clinical tools with sound psychometric properties. There is also a need for researchers to improve their reporting of the validity and reliability results of these clinical tools. Therefore, to promote the use of valid and reliable tools or tests for clinical evaluation, this paper reports on the development of a critical appraisal tool to assess the psychometric properties of objective clinical tools. A five-step process was followed to develop the new critical appraisal tool: (1) preliminary conceptual decisions; (2) defining key concepts; (3) item generation; (4) assessment of face validity; and (5) formulation of the final tool. The new critical appraisal tool consists of 13 items, of which five items relate to both validity and reliability studies, four items to validity studies only and four items to reliability studies. The 13 items could be scored as 'yes', 'no' or 'not applicable'. This critical appraisal tool will aid both the health care practitioner to critically appraise the relevant literature and researchers to improve the quality of reporting of the validity and reliability of objective clinical tools. © 2011 Blackwell Publishing Ltd.
An objective measure of physical function of elderly outpatients. The Physical Performance Test.

PubMed

Reuben, D B; Siu, A L

1990-10-01

Direct observation of physical function has the advantage of providing an objective, quantifiable measure of functional capabilities. We have developed the Physical Performance Test (PPT), which assesses multiple domains of physical function using observed performance of tasks that simulate activities of daily living of various degrees of difficulty. Two versions are presented: a nine-item scale that includes writing a sentence, simulated eating, turning 360 degrees, putting on and removing a jacket, lifting a book and putting it on a shelf, picking up a penny from the floor, a 50-foot walk test, and climbing stairs (scored as two items); and a seven-item scale that does not include stairs. The PPT can be completed in less than 10 minutes and requires only a few simple props. We then tested the validity of PPT using 183 subjects (mean age, 79 years) in six settings including four clinical practices (one of Parkinson's disease patients), a board-and-care home, and a senior citizens' apartment. The PPT was reliable (Cronbach's alpha = 0.87 and 0.79, interrater reliability = 0.99 and 0.93 for the nine-item and seven-item tests, respectively) and demonstrated concurrent validity with self-reported measures of physical function. Scores on the PPT for both scales were highly correlated (.50 to .80) with modified Rosow-Breslau, Instrumental and Basic Activities of Daily Living scales, and Tinetti gait score. Scores on the PPT were more moderately correlated with self-reported health status, cognitive status, and mental health (.24 to .47), and negatively with age (-.24 and -.18). Thus, the PPT also demonstrated construct validity. The PPT is a promising objective measurement of physical function, but its clinical and research value for screening, monitoring, and prediction will have to be determined.
Item Development and Validity Testing for a Self- and Proxy Report: The Safe Driving Behavior Measure

PubMed Central

Classen, Sherrilene; Winter, Sandra M.; Velozo, Craig A.; Bédard, Michel; Lanford, Desiree N.; Brumback, Babette; Lutz, Barbara J.

2010-01-01

OBJECTIVE We report on item development and validity testing of a self-report older adult safe driving behaviors measure (SDBM). METHOD On the basis of theoretical frameworks (Precede–Proceed Model of Health Promotion, Haddon’s matrix, and Michon’s model), existing driving measures, and previous research and guided by measurement theory, we developed items capturing safe driving behavior. Item development was further informed by focus groups. We established face validity using peer reviewers and content validity using expert raters. RESULTS Peer review indicated acceptable face validity. Initial expert rater review yielded a scale content validity index (CVI) rating of 0.78, with 44 of 60 items rated ≥0.75. Sixteen unacceptable items (≤0.5) required major revision or deletion. The next CVI scale average was 0.84, indicating acceptable content validity. CONCLUSION The SDBM has relevance as a self-report to rate older drivers. Future pilot testing of the SDBM comparing results with on-road testing will define criterion validity. PMID:20437917
Item Analyses of Memory Differences

PubMed Central

Salthouse, Timothy A.

2017-01-01

Objective Although performance on memory and other cognitive tests is usually assessed with a score aggregated across multiple items, potentially valuable information is also available at the level of individual items. Method The current study illustrates how analyses of variance with item as one of the factors, and memorability analyses in which item accuracy in one group is plotted as a function of item accuracy in another group, can provide a more detailed characterization of the nature of group differences in memory. Data are reported for two memory tasks, word recall and story memory, across age, ability, repetition, delay, and longitudinal contrasts. Results The item-level analyses revealed evidence for largely uniform differences across items in the age, ability, and longitudinal contrasts, but differential patterns across items in the repetition contrast, and unsystematic item relations in the delay contrast. Conclusion Analyses at the level of individual items have the potential to indicate the manner by which group differences in the aggregate test score are achieved. PMID:27618285
Instructional Sensitivity Statistics Appropriate for Objectives-Based Test Items. CSE Report No. 91.

ERIC Educational Resources Information Center

Kosecoff, Jacqueline B.; Klein, Stephen P.

Two types of sensitivity indices were developed in this paper, one internal to the total test and the second external. To evaluate the success of these statistics the three criteria suggested for a satisfactory index of item quality were considered. The Internal Sensitivity Index appears to meet these demands. Certainly it is easily computed. In…

An Item Response Theory-Based, Computerized Adaptive Testing Version of the MacArthur-Bates Communicative Development Inventory: Words & Sentences (CDI:WS)

ERIC Educational Resources Information Center

Makransky, Guido; Dale, Philip S.; Havmose, Philip; Bleses, Dorthe

2016-01-01

Purpose: This study investigated the feasibility and potential validity of an item response theory (IRT)-based computerized adaptive testing (CAT) version of the MacArthur-Bates Communicative Development Inventory: Words & Sentences (CDI:WS; Fenson et al., 2007) vocabulary checklist, with the objective of reducing length while maintaining…
Measuring self-esteem after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Self-esteem item bank and short form

PubMed Central

Kalpakjian, Claire Z.; Tate, Denise G.; Kisala, Pamela A.; Tulsky, David S.

2015-01-01

Objective To describe the development and psychometric properties of the Spinal Cord Injury-Quality of Life (SCI-QOL) Self-esteem item bank. Design Using a mixed-methods design, we developed and tested a self-esteem item bank through the use of focus groups with individuals with SCI and clinicians with expertise in SCI, cognitive interviews, and item-response theory- (IRT) based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. Setting We tested a pool of 30 items at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital, and the James J. Peters/Bronx Department of Veterans Affairs hospital. Participants A total of 717 individuals with SCI completed the self-esteem items. Results A unidimensional model was observed (CFI = 0.946; RMSEA = 0.087) and measurement precision was good (theta range between −2.7 and 0.7). Eleven items were flagged for DIF; however, effect sizes were negligible with little practical impact on score estimates. The final calibrated item bank resulted in 23 retained items. Conclusion This study indicates that the SCI-QOL Self-esteem item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available. PMID:26010972
Measuring resilience after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Resilience item bank and short form

PubMed Central

Victorson, David; Tulsky, David S.; Kisala, Pamela A.; Kalpakjian, Claire Z.; Weiland, Brian; Choi, Seung W.

2015-01-01

Objective To describe the development and psychometric properties of the Spinal Cord Injury - Quality of Life (SCI-QOL) Resilience item bank and short form. Design Using a mixed-methods design, we developed and tested a resilience item bank through the use of focus groups with individuals with SCI and clinicians with expertise in SCI, cognitive interviews, and item-response theory based analytic approaches, including tests of model fit and differential item functioning (DIF). Setting We tested a 32-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Department of Veterans Affairs medical center. Participants A total of 717 individuals with SCI completed the Resilience items. Results A unidimensional model was observed (CFI = 0.968; RMSEA = 0.074) and measurement precision was good (theta range between −3.1 and 0.9). Ten items were flagged for DIF, however, after examination of effect sizes we found this to be negligible with little practical impact on score estimates. The final calibrated item bank resulted in 21 retained items. Conclusion This study indicates that the SCI-QOL Resilience item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available. PMID:26010971
Standards in C.S.E. and G.C.E.: English and Mathematics. Working Paper No. 9.

ERIC Educational Resources Information Center

Schools Council, London (England).

Attainment tests in English and mathematics were administered to a total sample of 2,011/15-year old students. The English test consisted of a composition and a test battery of objective items. Marking of the composition was made by the test designer on a rapid first-impression reading. The objective test battery consisted of a comprehension test,…
Culture impacts the magnitude of the emotion-induced memory trade-off effect.

PubMed

Gutchess, Angela; Garner, Lauryn; Ligouri, Laura; Konuk, Ayse Isilay; Boduroglu, Aysecan

2017-10-04

The present study assessed the extent to which culture impacts the emotion-induced memory trade-off effect. This trade-off effect occurs because emotional items are better remembered than neutral ones, but this advantage comes at the expense of memory for backgrounds such that neutral backgrounds are remembered worse when they occurred with an emotional item than with a neutral one. Cultures differ in their prioritisation of focal object versus contextual background information, with Westerners focusing more on objects and Easterners focusing more on backgrounds. Americans, a Western culture, and Turks, an Eastern-influenced culture, incidentally encoded positive, negative, and neutral items placed against neutral backgrounds, and then completed a surprise memory test with the items and backgrounds tested separately. Results revealed a reduced trade-off for Turks compared to Americans. Although both groups exhibited an emotional enhancement in item memory, Turks did not show a decrement in memory for backgrounds that had been paired with emotional items. These findings complement prior ones showing reductions in trade-off effects as a result of task instructions. Here, we suggest that a contextual-focus at the level of culture can mitigate trade-off effects in emotional memory.
A critique of Rasch residual fit statistics.

PubMed

Karabatsos, G

2000-01-01

In test analysis involving the Rasch model, a large degree of importance is placed on the "objective" measurement of individual abilities and item difficulties. The degree to which the objectivity properties are attained, of course, depends on the degree to which the data fit the Rasch model. It is therefore important to utilize fit statistics that accurately and reliably detect the person-item response inconsistencies that threaten the measurement objectivity of persons and items. Given this argument, it is somewhat surprising that there is far more emphasis placed in the objective measurement of person and items than there is in the measurement quality of Rasch fit statistics. This paper provides a critical analysis of the residual fit statistics of the Rasch model, arguably the most often used fit statistics, in an effort to illustrate that the task of Rasch fit analysis is not as simple and straightforward as it appears to be. The faulty statistical properties of the residual fit statistics do not allow either a convenient or a straightforward approach to Rasch fit analysis. For instance, given a residual fit statistic, the use of a single minimum critical value for misfit diagnosis across different testing situations, where the situations vary in sample and test properties, leads to both the overdetection and underdetection of misfit. To improve this situation, it is argued that psychometricians need to implement residual-free Rasch fit statistics that are based on the number of Guttman response errors, or use indices that are statistically optimal in detecting measurement disturbances.
Influence of the wording of evaluation items on outcome-based evaluation results for large-group teaching in anatomy, biochemistry and legal medicine.

PubMed

Anders, Sven; Pyka, Katharina; Mueller, Tjark; von Streinbuechel, Nicole; Raupach, Tobias

2016-11-01

Student learning outcome is an important dimension of teaching quality in undergraduate medical education. Measuring an increase in knowledge during teaching requires repetitive objective testing which is usually not feasible. As an alternative, student learning outcome can be calculated from student self-ratings. Comparative self-assessment (CSA) gain reflects the performance difference before and after teaching, adjusted for initial knowledge. It has been shown to be a valid proxy measure of actual learning outcome derived from objective tests. However, student self-ratings are prone to a number of confounding factors. In the context of outcome-based evaluation, the wording of self-rating items is crucial to the validity of evaluation results. This randomized trial assessed whether including qualifiers in these statements impacts on student ratings and CSA gain. First-year medical students self-rated their initial (then-test) and final (post-test) knowledge for lectures in anatomy, biochemistry and legal medicine, respectively, and 659 questionnaires were retrieved. Six-point scales were used for self-ratings with 1 being the most positive option. Qualifier use did not affect then-test ratings but was associated with slightly less favorable post-test ratings. Consecutively, mean CSA gain was smaller for items containing qualifiers than for items lacking qualifiers (50.6±15.0% vs. 56.3±14.6%, p=0.079). The effect was more pronounced (Cohen's d=0.82) for items related to anatomy. In order to increase fairness of outcome-based evaluation and increase the comparability of CSA gain data across subjects, medical educators should agree on a consistent approach (qualifiers for all items or no qualifiers at all) when drafting self-rating statements for outcome-based evaluation. Copyright © 2016 Elsevier GmbH. All rights reserved.
Students’ understanding of forces: Force diagrams on horizontal and inclined plane

NASA Astrophysics Data System (ADS)

Sirait, J.; Hamdani; Mursyid, S.

2018-03-01

This study aims to analyse students’ difficulties in understanding force diagrams on horizontal surfaces and inclined planes. Physics education students (pre-service physics teachers) of Tanjungpura University, who had completed a Basic Physics course, took a Force concept test which has six questions covering three concepts: an object at rest, an object moving at constant speed, and an object moving at constant acceleration both on a horizontal surface and on an inclined plane. The test is in a multiple-choice format. It examines the ability of students to select appropriate force diagrams depending on the context. The results show that 44% of students have difficulties in solving the test (these students only could solve one or two items out of six items). About 50% of students faced difficulties finding the correct diagram of an object when it has constant speed and acceleration in both contexts. In general, students could only correctly identify 48% of the force diagrams on the test. The most difficult task for the students in terms was identifying the force diagram representing forces exerted on an object on in an inclined plane.
Constructing objective tests

NASA Astrophysics Data System (ADS)

Aubrecht, Gordon J.; Aubrecht, Judith D.

1983-07-01

True-false or multiple-choice tests can be useful instruments for evaluating student progress. We examine strategies for planning objective tests which serve to test the material covered in science (physics) courses. We also examine strategies for writing questions for tests within a test blueprint. The statistical basis for judging the quality of test items are discussed. Reliability, difficulty, and discrimination indices are defined and examples presented. Our recommendation are rather easily put into practice.
Constructing and Implementing a Four Tier Test about Static Electricity to Diagnose Pre-service Elementary School Teacher’ Misconceptions

NASA Astrophysics Data System (ADS)

Hermita, N.; Suhandi, A.; Syaodih, E.; Samsudin, A.; Isjoni; Johan, H.; Rosa, F.; Setyaningsih, R.; Sapriadil; Safitri, D.

2017-09-01

We have already constructed and implemented the diagnostic test formed in the four tier test to diagnose pre-service elementary teachers’ misconceptions about static electricity. The method which is utilized in this study is 3D-1I (Define, Design, Develop and Implementation) conducted to the pre-service elementary school teachers. The number of respondents involved in the study is 78 students of PGSD FKIP Universitas Riau. The data was collected by administering diagnostic test items in the form of four tier test. The result indicates that there are several misconceptions related to static electricity concept, these include: 1) Electrostatic objects cannot attract neutral objects, 2) A neutral object is an object that does not contain an electrical charge, and 3) the magnitude of the tensile force between two charged objects depends on the size of the charge. Moreover, the research’s results establish that the diagnostic test is able to analyse number of misconceptions and classify level of understanding pre-service elementary school teachers that is scientific knowledge, misconception, lack knowledge, and error. In conclusion, the diagnostic test item in the form of four tier test has already been constructed and implemented to diagnose students’ conceptions on static electricity.
Selective attention modulates neural substrates of repetition priming and "implicit" visual memory: suppressions and enhancements revealed by FMRI.

PubMed

Vuilleumier, Patrik; Schwartz, Sophie; Duhoux, Stéphanie; Dolan, Raymond J; Driver, Jon

2005-08-01

Attention can enhance processing for relevant information and suppress this for ignored stimuli. However, some residual processing may still arise without attention. Here we presented overlapping outline objects at study, with subjects attending to those in one color but not the other. Attended objects were subsequently recognized on a surprise memory test, whereas there was complete amnesia for ignored items on such direct explicit testing; yet reliable behavioral priming effects were found on indirect testing. Event-related fMRI examined neural responses to previously attended or ignored objects, now shown alone in the same or mirror-reversed orientation as before, intermixed with new items. Repetition-related decreases in fMRI responses for objects previously attended and repeated in the same orientation were found in the right posterior fusiform, lateral occipital, and left inferior frontal cortex. More anterior fusiform regions also showed some repetition decreases for ignored objects, irrespective of orientation. View-specific repetition decreases were found in the striate cortex, particularly for previously attended items. In addition, previously ignored objects produced some fMRI response increases in the bilateral lingual gyri, relative to new objects. Selective attention at exposure can thus produce several distinct long-term effects on processing of stimuli repeated later, with neural response suppression stronger for previously attended objects, and some response enhancement for previously ignored objects, with these effects arising in different brain areas. Although repetition decreases may relate to positive priming phenomena, the repetition increases for ignored objects shown here for the first time might relate to processes that can produce "negative priming" in some behavioral studies. These results reveal quantitative and qualitative differences between neural substrates of long-term repetition effects for attended versus unattended objects.
Comprehensive Achievement Monitoring for Science. Symposium; National Association of Biology Teachers, San Francisco, California, October 27, 1972.

ERIC Educational Resources Information Center

White, Mona E.; And Others

Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of the course, computerized analysis and reporting of results after…
Computerized training management system

DOEpatents

Rice, H.B.; McNair, R.C.; White, K.; Maugeri, T.

1998-08-04

A Computerized Training Management System (CTMS) is disclosed for providing a procedurally defined process that is employed to develop accreditable performance based training programs for job classifications that are sensitive to documented regulations and technical information. CTMS is a database that links information needed to maintain a five-phase approach to training-analysis, design, development, implementation, and evaluation independent of training program design. CTMS is designed using R-Base{trademark}, an-SQL compliant software platform. Information is logically entered and linked in CTMS. Each task is linked directly to a performance objective, which, in turn, is linked directly to a learning objective; then, each enabling objective is linked to its respective test items. In addition, tasks, performance objectives, enabling objectives, and test items are linked to their associated reference documents. CTMS keeps all information up to date since it automatically sorts, files and links all data; CTMS includes key word and reference document searches. 18 figs.
Computerized training management system

DOEpatents

Rice, Harold B.; McNair, Robert C.; White, Kenneth; Maugeri, Terry

1998-08-04

A Computerized Training Management System (CTMS) for providing a procedurally defined process that is employed to develop accreditable performance based training programs for job classifications that are sensitive to documented regulations and technical information. CTMS is a database that links information needed to maintain a five-phase approach to training-analysis, design, development, implementation, and evaluation independent of training program design. CTMS is designed using R-Base.RTM., an-SQL compliant software platform. Information is logically entered and linked in CTMS. Each task is linked directly to a performance objective, which, in turn, is linked directly to a learning objective; then, each enabling objective is linked to its respective test items. In addition, tasks, performance objectives, enabling objectives, and test items are linked to their associated reference documents. CTMS keeps all information up to date since it automatically sorts, files and links all data; CTMS includes key word and reference document searches.
Rasch-family models are more valuable than score-based approaches for analysing longitudinal patient-reported outcomes with missing data.

PubMed

de Bock, Élodie; Hardouin, Jean-Benoit; Blanchin, Myriam; Le Neel, Tanguy; Kubis, Gildas; Bonnaud-Antignac, Angélique; Dantan, Étienne; Sébille, Véronique

2016-10-01

The objective was to compare classical test theory and Rasch-family models derived from item response theory for the analysis of longitudinal patient-reported outcomes data with possibly informative intermittent missing items. A simulation study was performed in order to assess and compare the performance of classical test theory and Rasch model in terms of bias, control of the type I error and power of the test of time effect. The type I error was controlled for classical test theory and Rasch model whether data were complete or some items were missing. Both methods were unbiased and displayed similar power with complete data. When items were missing, Rasch model remained unbiased and displayed higher power than classical test theory. Rasch model performed better than the classical test theory approach regarding the analysis of longitudinal patient-reported outcomes with possibly informative intermittent missing items mainly for power. This study highlights the interest of Rasch-based models in clinical research and epidemiology for the analysis of incomplete patient-reported outcomes data. © The Author(s) 2013.
Development of Self-Report Measures of Social Attitudes that Act as Environmental Barriers and Facilitators for People with Disabilities

PubMed Central

Garcia, Sofia F.; Hahn, Elizabeth A.; Magasi, Susan; Lai, Jin-Shei; Semik, Patrick; Hammel, Joy; Heinemann, Allen W.

2014-01-01

Objective To describe the development of new self-report measures of social attitudes that act as environmental facilitators or barriers to the participation of people with disabilities in society. Design A mixed methods approach included a literature review; item classification, selection and writing; cognitive interviews and field testing with participants with spinal cord injury (SCI), traumatic brain injury (TBI) or stroke; and rating scale analysis to evaluate initial psychometric properties. Setting General community. Participants Nine individuals with SCI, TBI or stroke participated in cognitive interviews; 305 community residents with those same conditions participated in field testing. Interventions None. Main Outcome Measure(s) Self-report item pool of social attitudes that act as facilitators or barriers to people with disabilities participating in society. Results An interdisciplinary team of experts classified 710 existing social environment items into content areas and wrote 32 new items. Additional qualitative item review included item refinement and winnowing of the pool prior to cognitive interviews and field testing 82 items. Field test data indicated that the pool satisfies a one-parameter item response theory measurement model and would be appropriate for development into a calibrated item bank. Conclusions Our qualitative item review process supported a social environment conceptual framework that includes both social support and social attitudes. We developed a new social attitudes self-report item pool. Calibration testing of that pool is underway with a larger sample in order to develop a social attitudes item bank for persons with disabilities. PMID:25045803
Validity of Computer Adaptive Tests of Daily Routines for Youth with Spinal Cord Injury

PubMed Central

Haley, Stephen M.

2013-01-01

Objective: To evaluate the accuracy of computer adaptive tests (CATs) of daily routines for child- and parent-reported outcomes following pediatric spinal cord injury (SCI) and to evaluate the validity of the scales. Methods: One hundred ninety-six daily routine items were administered to 381 youths and 322 parents. Pearson correlations, intraclass correlation coefficients (ICC), and 95% confidence intervals (CI) were calculated to evaluate the accuracy of simulated 5-item, 10-item, and 15-item CATs against the full-item banks and to evaluate concurrent validity. Independent samples t tests and analysis of variance were used to evaluate the ability of the daily routine scales to discriminate between children with tetraplegia and paraplegia and among 5 motor groups. Results: ICC and 95% CI demonstrated that simulated 5-, 10-, and 15-item CATs accurately represented the full-item banks for both child- and parent-report scales. The daily routine scales demonstrated discriminative validity, except between 2 motor groups of children with paraplegia. Concurrent validity of the daily routine scales was demonstrated through significant relationships with the FIM scores. Conclusion: Child- and parent-reported outcomes of daily routines can be obtained using CATs with the same relative precision of a full-item bank. Five-item, 10-item, and 15-item CATs have discriminative and concurrent validity. PMID:23671380
Memorable objects are more susceptible to forgetting: Evidence for the inhibitory account of retrieval-induced forgetting.

PubMed

Reppa, I; Williams, K E; Worth, E R; Greville, W J; Saunders, J

2017-11-01

Retrieval of target information can cause forgetting for related, but non-retrieved, information - retrieval-induced forgetting (RIF). The aim of the current studies was to examine a key prediction of the inhibitory account of RIF - interference dependence - whereby 'strong' non-retrieved items are more likely to interfere during retrieval and therefore, are more susceptible to RIF. Using visual objects allowed us to examine and contrast one index of item strength -object typicality, that is, how typical of its category an object is. Experiment 1 provided proof of concept for our variant of the recognition practice paradigm. Experiment 2 tested the prediction of the inhibitory account that the magnitude of RIF for natural visual objects would be dependent on item strength. Non-typical objects were more memorable overall than typical objects. We found that object memorability (as determined by typicality) influenced RIF with significant forgetting occurring for the memorable (non-typical), but not non-memorable (typical), objects. The current findings strongly support an inhibitory account of retrieval-induced forgetting. Copyright © 2017 Elsevier B.V. All rights reserved.
Electronic Quality of Life Assessment Using Computer-Adaptive Testing

PubMed Central

2016-01-01

Background Quality of life (QoL) questionnaires are desirable for clinical practice but can be time-consuming to administer and interpret, making their widespread adoption difficult. Objective Our aim was to assess the performance of the World Health Organization Quality of Life (WHOQOL)-100 questionnaire as four item banks to facilitate adaptive testing using simulated computer adaptive tests (CATs) for physical, psychological, social, and environmental QoL. Methods We used data from the UK WHOQOL-100 questionnaire (N=320) to calibrate item banks using item response theory, which included psychometric assessments of differential item functioning, local dependency, unidimensionality, and reliability. We simulated CATs to assess the number of items administered before prespecified levels of reliability was met. Results The item banks (40 items) all displayed good model fit (P>.01) and were unidimensional (fewer than 5% of t tests significant), reliable (Person Separation Index>.70), and free from differential item functioning (no significant analysis of variance interaction) or local dependency (residual correlations < +.20). When matched for reliability, the item banks were between 45% and 75% shorter than paper-based WHOQOL measures. Across the four domains, a high standard of reliability (alpha>.90) could be gained with a median of 9 items. Conclusions Using CAT, simulated assessments were as reliable as paper-based forms of the WHOQOL with a fraction of the number of items. These properties suggest that these item banks are suitable for computerized adaptive assessment. These item banks have the potential for international development using existing alternative language versions of the WHOQOL items. PMID:27694100
Electronics 7-12 [Instructional Objectives Exchange].

ERIC Educational Resources Information Center

California Univ., Los Angeles. Center for the Study of Evaluation.

Included are instructional objectives which can be considered for use in a classroom or laboratory. The objectives are followed by measurement items meant to test if a certain objective is accomplished. Means for judging the adequacy of student responses are given. The areas of electronics included in this publication are: Fundamentals, Block…

Modifying the test of understanding graphs in kinematics

NASA Astrophysics Data System (ADS)

Zavala, Genaro; Tejeda, Santa; Barniol, Pablo; Beichner, Robert J.

2017-12-01

In this article, we present several modifications to the Test of Understanding Graphs in Kinematics. The most significant changes are (i) the addition and removal of items to achieve parallelism in the objectives (dimensions) of the test, thus allowing comparisons of students' performance that were not possible with the original version, and (ii) changes to the distractors of some of the original items that represent the most frequent alternative conceptions. The final modified version (after an iterative process involving four administrations of test variations over two years) was administered to 471 students of an introductory university physics course at a large private university in Mexico. When analyzing the final modified version of the test it was found that the added items satisfied the statistical tests of difficulty, discriminatory power, and reliability; also, that the great majority of the modified distractors were effective in terms of their frequency selection and discriminatory power; and, that the final modified version of the test satisfied the reliability and discriminatory power criteria as well as the original test. Here, we also show the use of the new version of the test, presenting a new analysis of students' understanding not possible to do before with the original version of the test, specifically regarding the objectives and items that in the new version meet parallelisms. Finally, in the PhysPort project (physport.org), we present the final modified version of the test. It can be used by teachers and researchers to assess students' understanding of graphs in kinematics, as well as their learning about them.
Ten Issues in Criterion-Referenced Testing: A Response to Commonly Heard Criticisms.

ERIC Educational Resources Information Center

Curlette, William L.; Stallings, William M.

1979-01-01

The 10 criticisms of criterion-referenced tests addressed in this paper are: the domains tested; pedagogical influence; difficulty of items; cumbersome reports; reliability; arbitrary criteria; local objectives; labeling; predictive validity; and repeated testing. (SJL)
Multiple, correlated covariates associated with differential item functioning (DIF): Accounting for language DIF when education levels differ across languages.

PubMed

Gibbons, Laura E; Crane, Paul K; Mehta, Kala M; Pedraza, Otto; Tang, Yuxiao; Manly, Jennifer J; Narasimhalu, Kaavya; Teresi, Jeanne; Jones, Richard N; Mungas, Dan

2011-04-28

Differential item functioning (DIF) occurs when a test item has different statistical properties in subgroups, controlling for the underlying ability measured by the test. DIF assessment is necessary when evaluating measurement bias in tests used across different language groups. However, other factors such as educational attainment can differ across language groups, and DIF due to these other factors may also exist. How to conduct DIF analyses in the presence of multiple, correlated factors remains largely unexplored. This study assessed DIF related to Spanish versus English language in a 44-item object naming test. Data come from a community-based sample of 1,755 Spanish- and English-speaking older adults. We compared simultaneous accounting, a new strategy for handling differences in educational attainment across language groups, with existing methods. Compared to other methods, simultaneously accounting for language- and education-related DIF yielded salient differences in some object naming scores, particularly for Spanish speakers with at least 9 years of education. Accounting for factors that vary across language groups can be important when assessing language DIF. The use of simultaneous accounting will be relevant to other cross-cultural studies in cognition and in other fields, including health-related quality of life.
Multiple, correlated covariates associated with differential item functioning (DIF): Accounting for language DIF when education levels differ across languages

PubMed Central

Gibbons, Laura E.; Crane, Paul K.; Mehta, Kala M.; Pedraza, Otto; Tang, Yuxiao; Manly, Jennifer J.; Narasimhalu, Kaavya; Teresi, Jeanne; Jones, Richard N.; Mungas, Dan

2012-01-01

Differential item functioning (DIF) occurs when a test item has different statistical properties in subgroups, controlling for the underlying ability measured by the test. DIF assessment is necessary when evaluating measurement bias in tests used across different language groups. However, other factors such as educational attainment can differ across language groups, and DIF due to these other factors may also exist. How to conduct DIF analyses in the presence of multiple, correlated factors remains largely unexplored. This study assessed DIF related to Spanish versus English language in a 44-item object naming test. Data come from a community-based sample of 1,755 Spanish- and English-speaking older adults. We compared simultaneous accounting, a new strategy for handling differences in educational attainment across language groups, with existing methods. Compared to other methods, simultaneously accounting for language- and education-related DIF yielded salient differences in some object naming scores, particularly for Spanish speakers with at least 9 years of education. Accounting for factors that vary across language groups can be important when assessing language DIF. The use of simultaneous accounting will be relevant to other cross-cultural studies in cognition and in other fields, including health-related quality of life. PMID:22900138
Sleep enhances a spatially mediated generalization of learned values

PubMed Central

Tolat, Anisha; Spiers, Hugo J.

2015-01-01

Sleep is thought to play an important role in memory consolidation. Here we tested whether sleep alters the subjective value associated with objects located in spatial clusters that were navigated to in a large-scale virtual town. We found that sleep enhances a generalization of the value of high-value objects to the value of locally clustered objects, resulting in an impaired memory for the value of high-valued objects. Our results are consistent with (a) spatial context helping to bind items together in long-term memory and serve as a basis for generalizing across memories and (b) sleep mediating memory effects on salient/reward-related items. PMID:26373834
Using Comprehensive Achievement Monitoring in the Classroom. Symposium; California Educational Research Association, San Jose, California, November 9, 1972.

ERIC Educational Resources Information Center

Easter, John; And Others

Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of the course, computerized analysis and reporting of results after…
Comprehensive Achievement Monitoring in the Sequoia Union High School District. Symposium, California Educational Data Processing Association, December 8, 1972.

ERIC Educational Resources Information Center

Easter, John; And Others

Comprehensive Achievement Monitoring (CAM) is a system designed to provide a curriculum defined in terms of performance objectives, test items to measure student performance on each objective, a set of comparable test forms to evaluate performance, testing throughout the period of course, computerized analysis and reporting of results after test…
An Update of "Implications of Changing Answers on Objective Test Items".

ERIC Educational Resources Information Center

Mercer, Maryann

In a 1977 review of the literature on test answer changing, Mueller and Wasser (EJ 163 236) cited 17 studies and concluded that students changing answers on objective tests gain more points than they lost by so doing. Higher scoring students tend to gain more than do the lower scoring students. Six additional studies not reported in the Mueller…
PREDICTION OF RELIABILITY IN BIOGRAPHICAL QUESTIONNAIRES.

ERIC Educational Resources Information Center

STARRY, ALLAN R.

THE OBJECTIVES OF THIS STUDY WERE (1) TO DEVELOP A GENERAL CLASSIFICATION SYSTEM FOR LIFE HISTORY ITEMS, (2) TO DETERMINE TEST-RETEST RELIABILITY ESTIMATES, AND (3) TO ESTIMATE RESISTANCE TO EXAMINEE FAKING, FOR REPRESENTATIVE BIOGRAPHICAL QUESTIONNAIRES. TWO 100-ITEM QUESTIONNAIRES WERE CONSTRUCTED THROUGH RANDOM ASSIGNMENT BY CONTENT AREA OF 200…
Consequences of screening in lung cancer: development and dimensionality of a questionnaire.

PubMed

Brodersen, John; Thorsen, Hanne; Kreiner, Svend

2010-08-01

The objective of this study was to extend the Consequences of Screening (COS) Questionnaire for use in a lung cancer screening by testing for comprehension, content coverage, dimensionality, and reliability. In interviews, the suitability, content coverage, and relevance of the COS were tested on participants in a lung cancer screening program. The results were thematically analyzed to identify the key consequences of abnormal and false-positive screening results. Item Response Theory and Classical Test Theory were used to analyze data. Dimensionality, objectivity, and reliability were established by item analysis, examining the fit between item responses and Rasch models. Eight themes specifically relevant for participants in lung cancer screening results were identified: "self-blame,"focus on symptoms,"stigmatization,"introvert,"harm of smoking,"impulsivity,"empathy," and "regretful of still smoking." Altogether, 26 new items for part I and 16 new items for part II were generated. These themes were confirmed to fit a partial-credit Rasch model measuring different constructs including several of the new items. In conclusion, the reliability and the dimensionality of a condition-specific measure with high content validity for persons having abnormal or false-positive lung cancer screening results have been demonstrated. This new questionnaire called Consequences of Screening in Lung Cancer (COS-LC) covers in two parts the psychosocial experience in lung cancer screening. Part I: "anxiety,"behavior,"dejection,"sleep,"self-blame,"focus on airway symptoms,"stigmatization,"introvert," and "harm of smoking." Part II: "calm/relax,"social network,"existential values,"impulsivity,"empathy," and "regretful of still smoking."
Research applications for an Object and Action Naming Battery to assess naming skills in adult Spanish-English bilingual speakers.

PubMed

Edmonds, Lisa A; Donovan, Neila J

2014-06-01

Virtually no valid materials are available to evaluate confrontation naming in Spanish-English bilingual adults in the U.S. In a recent study, a large group of young Spanish-English bilingual adults were evaluated on An Object and Action Naming Battery (Edmonds & Donovan in Journal of Speech, Language, and Hearing Research 55:359-381, 2012). Rasch analyses of the responses resulted in evidence for the content and construct validity of the retained items. However, the scope of that study did not allow for extensive examination of individual item characteristics, group analyses of participants, or the provision of testing and scoring materials or raw data, thereby limiting the ability of researchers to administer the test to Spanish-English bilinguals and to score the items with confidence. In this study, we present the in-depth information described above on the basis of further analyses, including (1) online searchable spreadsheets with extensive empirical (e.g., accuracy and name agreeability) and psycholinguistic item statistics; (2) answer sheets and instructions for scoring and interpreting the responses to the Rasch items; (3) tables of alternative correct responses for English and Spanish; (4) ability strata determined for all naming conditions (English and Spanish nouns and verbs); and (5) comparisons of accuracy across proficiency groups (i.e., Spanish dominant, English dominant, and balanced). These data indicate that the Rasch items from An Object and Action Naming Battery are valid and sensitive for the evaluation of naming in young Spanish-English bilingual adults. Additional information based on participant responses for all of the items on the battery can provide researchers with valuable information to aid in stimulus development and response interpretation for experimental studies in this population.
Ability evaluation by binary tests: Problems, challenges & recent advances

NASA Astrophysics Data System (ADS)

Bashkansky, E.; Turetsky, V.

2016-11-01

Binary tests designed to measure abilities of objects under test (OUTs) are widely used in different fields of measurement theory and practice. The number of test items in such tests is usually very limited. The response to each test item provides only one bit of information per OUT. The problem of correct ability assessment is even more complicated, when the levels of difficulty of the test items are unknown beforehand. This fact makes the search for effective ways of planning and processing the results of such tests highly relevant. In recent years, there has been some progress in this direction, generated by both the development of computational tools and the emergence of new ideas. The latter are associated with the use of so-called “scale invariant item response models”. Together with maximum likelihood estimation (MLE) approach, they helped to solve some problems of engineering and proficiency testing. However, several issues related to the assessment of uncertainties, replications scheduling, the use of placebo, as well as evaluation of multidimensional abilities still present a challenge for researchers. The authors attempt to outline the ways to solve the above problems.
The Instructional Quality Inventory. I. Introduction and Overview

DTIC Science & Technology

1978-11-01

level objectives, "hands-on" performance tests are usually most appropriate. T IS PAQ IS BEST QUA After a test item is consistent with its objective, the...idea. When the statement is separated, the key points stand out, and are not buried in the presentation. There are several ways to accomplish this goal
Problems in Criterion-Referenced Measurement. CSE Monograph Series in Evaluation, 3.

ERIC Educational Resources Information Center

Harris, Chester W., Ed.; And Others

Six essays on technical measurement problems in criterion referenced tests and four essays by psychometricians proposing solutions are presented: (1) "Criterion-Referenced Measurement" and Other Such Terms, by Marvin C. Alkin which is an overview of the first six papers; (2) Selecting Objectives and Generating Test Items for Objectives-Based…
A knowledge-based theory of rising scores on "culture-free" tests.

PubMed

Fox, Mark C; Mitchum, Ainsley L

2013-08-01

Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Interservice Procedures for Instructional Systems Development. Phase 3. Develop

DTIC Science & Technology

1975-08-01

Occur at wide intervals to be learned *Reads about the actions to *Occur at the end, but before be learned tests or on-the-job performance *Watches a...the particular sub-category. Use the learning objective action statement, conditions, standards, and the test item to help select which guidelines to...objective. EXAMPLE If you have a CLASSIFYING objective like "identifying poisonous plants,’ when you get to guideline 16. "To test learning, require the
Assessment of nasalance and nasality in patients with a repaired cleft palate.

PubMed

Sinko, Klaus; Gruber, Maike; Jagsch, Reinhold; Roesner, Imme; Baumann, Arnulf; Wutzl, Arno; Denk-Linnert, Doris-Maria

2017-07-01

In patients with a repaired cleft palate, nasality is typically diagnosed by speech language pathologists. In addition, there are various instruments to objectively diagnose nasalance. To explore the potential of nasalance measurements after cleft palate repair by NasalView ® , we correlated perceptual nasality and instrumentally measured nasalance of eight speech items and determined the relationship between sensitivity and specificity of the nasalance measures by receiver-operating characteristics (ROC) analyses and AUC (area under the curve) computation for each single test item and specific item groups. We recruited patients with a primarily repaired cleft palate receiving speech therapy during follow-up. During a single day visit, perceptive and instrumental assessments were obtained in 36 patients and analyzed. The individual perceptual nasality was assigned to one of four categories; the corresponding instrumental nasalance measures for the eight specific speech items were expressed on a metric scale (1-100). With reference to the perceptual diagnoses, we observed 3 nasal and one oral test item with high sensitivity. However, the specificity of the nasality indicating measures was rather low. The four best speech items with the highest sensitivity provided scores ranging from 96.43 to 100%, while the averaged sensitivity of all eight items was below 90%. We conclude that perceptive evaluation of nasality remains state of the art. For clinical follow-up, instrumental nasalance assessment can objectively document subtle changes by analysis of four speech items only. Further studies are warranted to determine the applicability of instrumental nasalance measures in the clinical routine, using discriminative items only.
Social influences on reality-monitoring decisions.

PubMed

Hoffman, H G; Granhag, P A; Kwong See, S T; Loftus, E F

2001-04-01

A modified Asch (1951) conformity paradigm was used to study the impact of social influence on reality-monitoring decisions about new items. Subjects studied pictures of some objects and imagined others. In a later test phase, they judged whether items had been perceived in the study phase, had been imagined, or were new. Critically, for some items, the subjects were informed of a confederate's response before rendering a judgment. Although the confederate was always correct when they responded to old items, for new items, the confederate responded perceived, imagined, or new, or did not respond (baseline). In two experiments, we show that memory for new items was influenced by an erroneous response of the confederate. Social conformity was reduced by undermining the credibility of the confederate (Experiments 1A and 1B), and the confederate's influence was evident even after there was only a 20-min delay between study and test (Experiment 2), when the subjects were 87% accurate on new baseline items. These experiments reveal the power of social influence on reality-monitoring accuracy and confidence.
Do you remember where sounds, pictures and words came from? The role of the stimulus format in object location memory.

PubMed

Delogu, Franco; Lilla, Christopher C

2017-11-01

Contrasting results in visual and auditory spatial memory stimulate the debate over the role of sensory modality and attention in identity-to-location binding. We investigated the role of sensory modality in the incidental/deliberate encoding of the location of a sequence of items. In 4 separated blocks, 88 participants memorised sequences of environmental sounds, spoken words, pictures and written words, respectively. After memorisation, participants were asked to recognise old from new items in a new sequence of stimuli. They were also asked to indicate from which side of the screen (visual stimuli) or headphone channel (sounds) the old stimuli were presented in encoding. In the first block, participants were not aware of the spatial requirement while, in blocks 2, 3 and 4 they knew that their memory for item location was going to be tested. Results show significantly lower accuracy of object location memory for the auditory stimuli (environmental sounds and spoken words) than for images (pictures and written words). Awareness of spatial requirement did not influence localisation accuracy. We conclude that: (a) object location memory is more effective for visual objects; (b) object location is implicitly associated with item identity during encoding and (c) visual supremacy in spatial memory does not depend on the automaticity of object location binding.
Analysis instrument test on mathematical power the material geometry of space flat side for grade 8

NASA Astrophysics Data System (ADS)

Kusmaryono, Imam; Suyitno, Hardi; Dwijanto, Karomah, Nur

2017-08-01

The main problem of research to determine the quality of test items on the material side of flat geometry to assess students' mathematical power. The method used is quantitative descriptive. The subjects were students of class 8 as many as 20 students. The object of research is the quality of test items in terms of the power of mathematics: validity, reliability, level of difficulty and power differentiator. Instrument mathematical power ratings are tested include: written tests and questionnaires about the disposition of mathematical power. Data were obtained from the field, in the form of test data on the material geometry of space flat side and questionnaires. The results of the test instrument to the reliability of the test item is influenced by many factors. Factors affecting the reliability of the instrument is the number of items, homogeneity test questions, the time required, the uniformity of conditions of the test taker, the homogeneity of the group, the variability problem, and motivation of the individual (person taking the test). Overall, the evaluation results of this study stated that the test instrument can be used as a tool to measure students' mathematical power.

Objective and Subjective Cancer Knowledge Among Faith-Based Chinese Adults.

PubMed

Hou, Su-I; Liu, Ling Jie

2017-10-01

This study examined cancer knowledge between church-going younger versus older Chinese adults. Hou's 8-item validated cancer screening knowledge test (CSKT) and a new 14-item cancer warning signs test (CWST) were used to assess objective knowledge. Subjective knowledge was measured by one overall 5-point Likert scale item. A total of 372 Taiwanese and Chinese Americans from nine churches participated. Although there were no significant differences by age on either the CSKT scores (younger = 5.89 vs. older = 5.71; p = .297) or the CWST (younger = 6.27 vs. older = 5.86; p = .245), subjective knowledge was higher among older Chinese adults (younger = 2.44 vs. older = 3.05, p < .001). Older Chinese adults were also more likely to identify cancer warning signs correctly, while younger adults were more likely to identify false warning signs correctly. Results have implication on tailoring cancer knowledge type (subjective vs. objective) and content domain (screening vs. warning signs). Findings can help health educators better understand cancer education needs among Chinese adults.
Development of the Online Assessment of Athletic Training Education (OAATE) Instrument

ERIC Educational Resources Information Center

Carr, W. David; Frey, Bruce B.; Swann, Elizabeth

2009-01-01

Objective: To establish the validity and reliability of an online assessment instrument's items developed to track educational outcomes over time. Design and Setting: A descriptive study of the validation arguments and reliability testing of the assessment items. The instrument is available to graduating students enrolled in entry-level Athletic…
An Introduction to Item Response Theory for Health Behavior Researchers

ERIC Educational Resources Information Center

Warne, Russell T.; McKyer, E. J. Lisako; Smith, Matthew L.

2012-01-01

Objective: To introduce item response theory (IRT) to health behavior researchers by contrasting it with classical test theory and providing an example of IRT in health behavior. Method: Demonstrate IRT by fitting the 2PL model to substance-use survey data from the Adolescent Health Risk Behavior questionnaire (n = 1343 adolescents). Results: An…
General Business: Grades 10-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Thirty-five objectives and related test items assessing general business skills taught in grades 10 through 12 are included in this collection. Each objective is stated in operational terms and identified by a subject area within the hood category of general business. Objectives include the desired behavior and subject content so that students are…
Measuring stigma after spinal cord injury: Development and psychometric characteristics of the SCI-QOL Stigma item bank and short form

PubMed Central

Kisala, Pamela A.; Tulsky, David S.; Pace, Natalie; Victorson, David; Choi, Seung W.; Heinemann, Allen W.

2015-01-01

Objective To develop a calibrated item bank and computer adaptive test (CAT) to assess the effects of stigma on health-related quality of life in individuals with spinal cord injury (SCI). Design Grounded-theory based qualitative item development methods, large-scale item calibration field testing, confirmatory factor analysis, and item response theory (IRT)-based psychometric analyses. Setting Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Participants Adults with traumatic SCI. Main Outcome Measures SCI-QOL Stigma Item Bank Results A sample of 611 individuals with traumatic SCI completed 30 items assessing SCI-related stigma. After 7 items were iteratively removed, factor analyses confirmed a unidimensional pool of items. Graded Response Model IRT analyses were used to estimate slopes and thresholds for the final 23 items. Conclusions The SCI-QOL Stigma item bank is unique not only in the assessment of SCI-related stigma but also in the inclusion of individuals with SCI in all phases of its development. Use of confirmatory factor analytic and IRT methods provide flexibility and precision of measurement. The item bank may be administered as a CAT or as a 10-item fixed-length short form and can be used for research and clinical applications. PMID:26010973
Test Program for Assessing Vulnerability of Industrial Equipment to Nuclear Air Blast.

DTIC Science & Technology

1983-10-01

PROJECT. TASK 4Scientific Servic, Inc. AREA & WORK UNIT NUMBERS 517 East Bayshore Work Unit 1124F Redwood City, CA 94063___ __________ 11. CONTROLLING ...vulnerability, but perhaps less expensive, to be selected and substituted, with an eye to cost control . 5. MODELING AND SCALING CONSIDERATIONS Reiterating...behavior and properties of the test items and Interfaces that control behavior (e4g., test objects/flow field, test objects/interfacing surface of
Face validity and reliability of a pictorial instrument for assessing fundamental movement skill perceived competence in young children.

PubMed

Barnett, Lisa M; Ridgers, Nicola D; Zask, Avigdor; Salmon, Jo

2015-01-01

To determine reliability and face validity of an instrument to assess young children's perceived fundamental movement skill competence. Validation and reliability study. A pictorial instrument based on the Test Gross Motor Development-2 assessed perceived locomotor (six skills) and object control (six skills) competence using the format and item structure from the physical competence subscale of the Pictorial Scale of Perceived Competence and Acceptance for Young Children. Sample 1 completed object control items in May (n=32) and locomotor items in October 2012 (n=23) at two time points seven days apart. Children were asked at the end of the test-retest their understanding of what was happening in each picture to determine face validity. Sample 2 (n=58) completed 12 items in November 2012 on a single occasion to test internal reliability only. Sample 1 children were aged 5-7 years (M=6.0, SD=0.8) at object control assessment and 5-8 years at locomotor assessment (M=6.5, SD=0.9). Sample 2 children were aged 6-8 years (M=7.2, SD=0.73). Intra-class correlations assessed in Sample 1 children were excellent for object control (intra-class correlation=0.78), locomotor (intra-class correlation=0.82) and all 12 skills (intra-class correlations=0.83). Face validity was acceptable. Internal consistency was adequate in both samples for each subscale and all 12 skills (alpha range 0.60-0.81). This study has provided preliminary evidence for instrument reliability and face validity. This enables future alignment between the measurement of perceived and actual fundamental movement skill competence in young children. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.
Measuring grief and loss after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Grief and Loss item bank and short form

PubMed Central

Kalpakjian, Claire Z.; Tulsky, David S.; Kisala, Pamela A.; Bombardier, Charles H.

2015-01-01

Objective To develop an item response theory (IRT) calibrated Grief and Loss item bank as part of the Spinal Cord Injury – Quality of Life (SCI-QOL) measurement system. Design A literature review guided framework development of grief/loss. New items were created from focus groups. Items were revised based on expert review and patient feedback and were then field tested. Analyses included confirmatory factor analysis (CFA), graded response IRT modeling and evaluation of differential item functioning (DIF). Setting We tested a 20-item pool at several rehabilitation centers across the United States, including the University of Michigan, Kessler Foundation, Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Department of Veterans Affairs hospital. Participants A total of 717 individuals with SCI answered the grief and loss questions. Results The final calibrated item bank resulted in 17 retained items. A unidimensional model was observed (CFI = 0.976; RMSEA = 0.078) and measurement precision was good (theta range between −1.48 to 2.48). Ten items were flagged for DIF, however, after examination of effect sizes found this to be negligible with little practical impact on score estimates. Conclusions This study indicates that the SCI-QOL Grief and Loss item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available. PMID:26010969
Modeling the Severity of Drinking Consequences in First-Year College Women: An Item Response Theory Analysis of the Rutgers Alcohol Problem Index*

PubMed Central

Cohn, Amy M.; Hagman, Brett T.; Graff, Fiona S.; Noel, Nora E.

2011-01-01

Objective: The present study examined the latent continuum of alcohol-related negative consequences among first-year college women using methods from item response theory and classical test theory. Method: Participants (N = 315) were college women in their freshman year who reported consuming any alcohol in the past 90 days and who completed assessments of alcohol consumption and alcohol-related negative consequences using the Rutgers Alcohol Problem Index. Results: Item response theory analyses showed poor model fit for five items identified in the Rutgers Alcohol Problem Index. Two-parameter item response theory logistic models were applied to the remaining 18 items to examine estimates of item difficulty (i.e., severity) and discrimination parameters. The item difficulty parameters ranged from 0.591 to 2.031, and the discrimination parameters ranged from 0.321 to 2.371. Classical test theory analyses indicated that the omission of the five misfit items did not significantly alter the psychometric properties of the construct. Conclusions: Findings suggest that those consequences that had greater severity and discrimination parameters may be used as screening items to identify female problem drinkers at risk for an alcohol use disorder. PMID:22051212
Criterion-Referenced Testing in Foreign Language Teaching.

ERIC Educational Resources Information Center

Takala, Sauli

A review of literature serves as the basis for a discussion of various aspects of criterion-referenced tests. The aspects discussed are: teaching and evaluation objectives, criterion- and norm-referenced measurement, stages in construction of criterion-referenced tests, construction and selection of items, test validity, and test reliability.…
Development of knowledge tests for multi-disciplinary emergency training: a review and an example.

PubMed

Sørensen, J L; Thellesen, L; Strandbygaard, J; Svendsen, K D; Christensen, K B; Johansen, M; Langhoff-Roos, P; Ekelund, K; Ottesen, B; Van Der Vleuten, C

2015-01-01

The literature is sparse on written test development in a post-graduate multi-disciplinary setting. Developing and evaluating knowledge tests for use in multi-disciplinary post-graduate training is challenging. The objective of this study was to describe the process of developing and evaluating a multiple-choice question (MCQ) test for use in a multi-disciplinary training program in obstetric-anesthesia emergencies. A multi-disciplinary working committee with 12 members representing six professional healthcare groups and another 28 participants were involved. Recurrent revisions of the MCQ items were undertaken followed by a statistical analysis. The MCQ items were developed stepwise, including decisions on aims and content, followed by testing for face and content validity, construct validity, item-total correlation, and reliability. To obtain acceptable content validity, 40 out of originally 50 items were included in the final MCQ test. The MCQ test was able to distinguish between levels of competence, and good construct validity was indicated by a significant difference in the mean score between consultants and first-year trainees, as well as between first-year trainees and medical and midwifery students. Evaluation of the item-total correlation analysis in the 40 items set revealed that 11 items needed re-evaluation, four of which addressed content issues in local clinical guidelines. A Cronbach's alpha of 0.83 for reliability was found, which is acceptable. Content and construct validity and reliability were acceptable. The presented template for the development of this MCQ test could be useful to others when developing knowledge tests and may enhance the overall quality of test development. © 2014 The Acta Anaesthesiologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Visual short-term memory binding deficit in familial Alzheimer's disease.

PubMed

Liang, Yuying; Pertzov, Yoni; Nicholas, Jennifer M; Henley, Susie M D; Crutch, Sebastian; Woodward, Felix; Leung, Kelvin; Fox, Nick C; Husain, Masud

2016-05-01

Long-term episodic memory deficits in Alzheimer's disease (AD) are well characterised but, until recently, short-term memory (STM) function has attracted far less attention. We employed a recently-developed, delayed reproduction task which requires participants to reproduce precisely the remembered location of items they had seen only seconds previously. This paradigm provides not only a continuous measure of localization error in memory, but also an index of relational binding by determining the frequency with which an object is misplaced to the location of one of the other items held in memory. Such binding errors in STM have previously been found on this task to be sensitive to medial temporal lobe (MTL) damage in focal lesion cases. Twenty individuals with pathological mutations in presenilin 1 or amyloid precursor protein genes for familial Alzheimer's disease (FAD) were tested together with 62 healthy controls. Participants were assessed using the delayed reproduction memory task, a standard neuropsychological battery and structural MRI. Overall, FAD mutation carriers were worse than controls for object identity as well as in gross localization memory performance. Moreover, they showed greater misbinding of object identity and location than healthy controls. Thus they would often mislocalize a correctly-identified item to the location of one of the other items held in memory. Significantly, asymptomatic gene carriers - who performed similarly to healthy controls on standard neuropsychological tests - had a specific impairment in object-location binding, despite intact memory for object identity and location. Consistent with the hypothesis that the hippocampus is critically involved in relational binding regardless of memory duration, decreased hippocampal volume across FAD participants was significantly associated with deficits in object-location binding but not with recall precision for object identity or localization. Object-location binding may therefore provide a sensitive cognitive biomarker for MTL dysfunction in a range of diseases including AD. Copyright © 2016. Published by Elsevier Ltd.
Item-location binding in working memory: is it hippocampus-dependent?

PubMed

Allen, Richard J; Vargha-Khadem, Faraneh; Baddeley, Alan D

2014-07-01

A general consensus is emerging that the hippocampus has an important and active role in the creation of new long-term memory representations of associations or bindings between elements. However, it is less clear whether this contribution can be extended to the creation of temporary bound representations in working memory, involving the retention of small numbers of items over short delays. We examined this by administering a series of recognition and recall tests of working memory for colour-location binding and object-location binding to a patient with highly selective hippocampal damage (Jon), and groups of control participants. Jon achieved high levels of accuracy in all working memory tests of recognition and recall binding across retention intervals of up to 10s. In contrast, Jon performed at chance on an unexpected delayed test of the same object-location binding information. These findings indicate a clear dissociation between working memory and long-term memory, with no evidence for a critical hippocampal contribution to item-location binding in working memory. Copyright © 2014 Elsevier Ltd. All rights reserved.
Electricity-Electronics for Industrial Arts. Instructors Lesson Plans. Industrial Arts Series, Publication Number 10,010.

ERIC Educational Resources Information Center

Hinrichs, Roy S., Comp.

Thirty-one lesson plans on electricity-electronics are presented in this guide designed for industrial arts instructors. Each lesson plan is organized into the following format: (1) lesson objective; (2) supplementary teaching items; (3) presentation; (4) demonstration; (5) laboratory or other activities; and (6) test items (oral, written, or…
Role of the Dorsal Hippocampus in Object Memory Load

ERIC Educational Resources Information Center

Sannino, Sara; Russo, Fabio; Torromino, Giulia; Pendolino, Valentina; Calabresi, Paolo; De Leonibus, Elvira

2012-01-01

The dorsal hippocampus is crucial for mammalian spatial memory, but its exact role in item memory is still hotly debated. Recent evidence in humans suggested that the hippocampus might be selectively involved in item short-term memory to deal with an increasing memory load. In this study, we sought to test this hypothesis. To this aim we developed…
Similarity, not complexity, determines visual working memory performance.

PubMed

Jackson, Margaret C; Linden, David E J; Roberts, Mark V; Kriegeskorte, Nikolaus; Haenschel, Corinna

2015-11-01

A number of studies have shown that visual working memory (WM) is poorer for complex versus simple items, traditionally accounted for by higher information load placing greater demands on encoding and storage capacity limits. Other research suggests that it may not be complexity that determines WM performance per se, but rather increased perceptual similarity between complex items as a result of a large amount of overlapping information. Increased similarity is thought to lead to greater comparison errors between items encoded into WM and the test item(s) presented at retrieval. However, previous studies have used different object categories to manipulate complexity and similarity, raising questions as to whether these effects are simply due to cross-category differences. For the first time, here the relationship between complexity and similarity in WM using the same stimulus category (abstract polygons) are investigated. The authors used a delayed discrimination task to measure WM for 1-4 complex versus simple simultaneously presented items and manipulated the similarity between the single test item at retrieval and the sample items at encoding. WM was poorer for complex than simple items only when the test item was similar to 1 of the encoding items, and not when it was dissimilar or identical. The results provide clear support for reinterpretation of the complexity effect in WM as a similarity effect and highlight the importance of the retrieval stage in governing WM performance. The authors discuss how these findings can be reconciled with current models of WM capacity limits. (c) 2015 APA, all rights reserved).
Trading Up: Chimpanzees (Pan troglodytes) Show Self-Control Through Their Exchange Behavior

PubMed Central

Beran, Michael J.; Rossettie, Mattea S.; Parrish, Audrey E.

2015-01-01

Self-control is defined as the ability or capacity to obtain an objectively more valuable outcome rather than an objectively less valuable outcome though tolerating a longer delay or a greater effort requirement (or both) in obtaining that more valuable outcome. A number of tests have been devised to assess self-control in nonhuman animals, including exchange tasks. In this study, three chimpanzees (Pan troglodytes) participated in a delay of gratification task that required food exchange as the behavioral response that reflected self-control. The chimpanzees were offered opportunities to inhibit eating and instead exchange a currently possessed food item for a different (and sometimes better) item, often needing to exchange several food items before obtaining the highest-valued reward. We manipulated reward type, reward size, reward visibility, delay to exchange, and location of the highest-valued reward in the sequence of exchange events to compare performance within the same individuals. The chimpanzees successfully traded until obtaining the best item in most cases, although there were individual differences among participants in some variations of the test. These results support the idea that self-control is robust in chimpanzees even in contexts in which they perhaps anticipate future rewards and sustain delay of gratification until they can obtain the ultimately most-valuable item. PMID:26325355
Trading up: chimpanzees (Pan troglodytes) show self-control through their exchange behavior.

PubMed

Beran, Michael J; Rossettie, Mattea S; Parrish, Audrey E

2016-01-01

Self-control is defined as the ability or capacity to obtain an objectively more valuable outcome rather than an objectively less valuable outcome though tolerating a longer delay or a greater effort requirement (or both) in obtaining that more valuable outcome. A number of tests have been devised to assess self-control in non-human animals, including exchange tasks. In this study, three chimpanzees (Pan troglodytes) participated in a delay of gratification task that required food exchange as the behavioral response that reflected self-control. The chimpanzees were offered opportunities to inhibit eating and instead exchange a currently possessed food item for a different (and sometimes better) item, often needing to exchange several food items before obtaining the highest valued reward. We manipulated reward type, reward size, reward visibility, delay to exchange, and location of the highest valued reward in the sequence of exchange events to compare performance within the same individuals. The chimpanzees successfully traded until obtaining the best item in most cases, although there were individual differences among participants in some variations of the test. These results support the idea that self-control is robust in chimpanzees even in contexts in which they perhaps anticipate future rewards and sustain delay of gratification until they can obtain the ultimately most valuable item.
Item and source memory for emotional associates is mediated by different retrieval processes.

PubMed

Ventura-Bort, Carlos; Dolcos, Florin; Wendt, Julia; Wirkner, Janine; Hamm, Alfons O; Weymar, Mathias

2017-12-12

Recent event-related potential (ERP) data showed that neutral objects encoded in emotional background pictures were better remembered than objects encoded in neutral contexts, when recognition memory was tested one week later. In the present study, we investigated whether this long-term memory advantage for items is also associated with correct memory for contextual source details. Furthermore, we were interested in the possibly dissociable contribution of familiarity and recollection processes (using a Remember/Know procedure). The results revealed that item memory performance was mainly driven by the subjective experience of familiarity, irrespective of whether the objects were previously encoded in emotional or neutral contexts. Correct source memory for the associated background picture, however, was driven by recollection and enhanced when the content was emotional. In ERPs, correctly recognized old objects evoked frontal ERP Old/New effects (300-500ms), irrespective of context category. As in our previous study (Ventura-Bort et al., 2016b), retrieval for objects from emotional contexts was associated with larger parietal Old/New differences (600-800ms), indicating stronger involvement of recollection. Thus, the results suggest a stronger contribution of recollection-based retrieval to item and contextual background source memory for neutral information associated with an emotional event. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nouns referring to tools and natural objects differentially modulate the motor system.

PubMed

Gough, Patricia M; Riggio, Lucia; Chersi, Fabian; Sato, Marc; Fogassi, Leonardo; Buccino, Giovanni

2012-01-01

While increasing evidence points to a critical role for the motor system in language processing, the focus of previous work has been on the linguistic category of verbs. Here we tested whether nouns are effective in modulating the motor system and further whether different kinds of nouns - those referring to artifacts or natural items, and items that are graspable or ungraspable - would differentially modulate the system. A Transcranial Magnetic Stimulation (TMS) study was carried out to compare modulation of the motor system when subjects read nouns referring to objects which are Artificial or Natural and which are Graspable or Ungraspable. TMS was applied to the primary motor cortex representation of the first dorsal interosseous (FDI) muscle of the right hand at 150 ms after noun presentation. Analyses of Motor Evoked Potentials (MEPs) revealed that across the duration of the task, nouns referring to graspable artifacts (tools) were associated with significantly greater MEP areas. Analyses of the initial presentation of items revealed a main effect of graspability. The findings are in line with an embodied view of nouns, with MEP measures modulated according to whether nouns referred to natural objects or artifacts (tools), confirming tools as a special class of items in motor terms. Additionally our data support a difference for graspable versus non graspable objects, an effect which for natural objects is restricted to initial presentation of items. Copyright © 2011 Elsevier Ltd. All rights reserved.

An object location memory paradigm for older adults with and without mild cognitive impairment.

PubMed

Külzow, Nadine; Kerti, Lucia; Witte, Veronica A; Kopp, Ute; Breitenstein, Caterina; Flöel, Agnes

2014-11-30

Object-location memory is critical in every-day life and known to deteriorate early in the course of neurodegenerative disease. We adapted the previously established learning paradigm "LOCATO" for use in healthy older adults and patients with mild cognitive impairment (MCI). Pictures of real-life buildings were associated with positions on a two-dimensional street map by repetitions of "correct" object-location pairings over the course of five training blocks, followed by a recall task. Correct/incorrect associations were indicated by button presses. The original two 45-item sets were reduced to 15 item-sets, and tested in healthy older adults and MCI for learning curve, recall, and re-test effects. The two 15-item versions showed comparable learning curves and recall scores within each group. While learning curves increased linearly in both groups, MCI patients performed significantly worse on learning and recall compared to healthy controls. Re-testing after 6 month showed small practice effects only. LOCATO is a simple standardized task that overcomes several limitation of previously employed visuospatial task by using real-life stimuli, minimizing verbal encoding, avoiding fine motor responses, combining explicit and implicit statistical learning, and allowing to assess learning curve in addition to recall. Results show that the shortened version of LOCATO meets the requirements for a robust and ecologically meaningful assessment of object-location memory in older adults with and without MCI. It can now be used to systematically assess acquisition of object-location memory and its modulation through adjuvant therapies like pharmacological or non-invasive brain stimulation. Copyright © 2014 Elsevier B.V. All rights reserved.
The value of item response theory in clinical assessment: a review.

PubMed

Thomas, Michael L

2011-09-01

Item response theory (IRT) and related latent variable models represent modern psychometric theory, the successor to classical test theory in psychological assessment. Although IRT has become prevalent in the measurement of ability and achievement, its contributions to clinical domains have been less extensive. Applications of IRT to clinical assessment are reviewed to appraise its current and potential value. Benefits of IRT include comprehensive analyses and reduction of measurement error, creation of computer adaptive tests, meaningful scaling of latent variables, objective calibration and equating, evaluation of test and item bias, greater accuracy in the assessment of change due to therapeutic intervention, and evaluation of model and person fit. The theory may soon reinvent the manner in which tests are selected, developed, and scored. Although challenges remain to the widespread implementation of IRT, its application to clinical assessment holds great promise. Recommendations for research, test development, and clinical practice are provided.
Neural Overlap in Item Representations Across Episodes Impairs Context Memory.

PubMed

Kim, Ghootae; Norman, Kenneth A; Turk-Browne, Nicholas B

2018-06-12

We frequently encounter the same item in different contexts, and when that happens, memories of earlier encounters can get reactivated. We examined how existing memories are changed as a result of such reactivation. We hypothesized that when an item's initial and subsequent neural representations overlap, this allows the initial item to become associated with novel contextual information, interfering with later retrieval of the initial context. Specifically, we predicted a negative relationship between representational similarity across repeated experiences of an item and subsequent source memory for the initial context. We tested this hypothesis in an fMRI study, in which objects were presented multiple times during different tasks. We measured the similarity of the neural patterns in lateral occipital cortex that were elicited by the first and second presentations of objects, and related this neural overlap score to subsequent source memory. Consistent with our hypothesis, greater item-specific pattern similarity was linked to worse source memory for the initial task. In contrast, greater reactivation of the initial context was associated with better source memory. Our findings suggest that the influence of novel experiences on an existing context memory depends on how reliably a shared component (i.e., item) is represented across these episodes.
Attention Effects During Visual Short-Term Memory Maintenance: Protection or Prioritization?

PubMed Central

Matsukura, Michi; Luck, Steven J.; Vecera, Shaun P.

2007-01-01

Interactions between visual attention and visual short-term memory (VSTM) play a central role in cognitive processing. For example, attention can assist in selectively encoding items into visual memory. Attention appears to be able to influence items already stored in visual memory as well; cues that appear long after the presentation of an array of objects can affect memory for those objects (Griffin & Nobre, 2003). In five experiments, we distinguished two possible mechanisms for the effects of cues on items currently stored in VSTM. A protection account proposes that attention protects the cued item from becoming degraded during the retention interval. By contrast, a prioritization account suggests that attention increases a cued item’s priority during the comparison process that occurs when memory is tested. The results of the experiments were consistent with the first of these possibilities, suggesting that attention can serve to protect VSTM representations while they are being maintained. PMID:18078232
The Time-Course of Lexical Activation During Sentence Comprehension in People With Aphasia

PubMed Central

Ferrill, Michelle; Love, Tracy; Walenski, Matthew; Shapiro, Lewis P.

2012-01-01

Purpose To investigate the time-course of processing of lexical items in auditorily presented canonical (subject–verb–object) constructions in young, neurologically unimpaired control participants and participants with left-hemisphere damage and agrammatic aphasia. Method A cross modal picture priming (CMPP) paradigm was used to test 114 control participants and 8 participants with agrammatic aphasia for priming of a lexical item (direct object noun) immediately after it is initially encountered in the ongoing auditory stream and at 3 additional time points at 400-ms intervals. Results The control participants demonstrated immediate activation of the lexical item, followed by a rapid loss (decay). The participants with aphasia demonstrated delayed activation of the lexical item. Conclusion This evidence supports the hypothesis of a delay in lexical activation in people with agrammatic aphasia. The delay in lexical activation feeds syntactic processing too slowly, contributing to comprehension deficits in people with agrammatic aphasia. PMID:22355007
Pursuing the Qualities of a "Good" Test

ERIC Educational Resources Information Center

Coniam, David

2014-01-01

This article examines the issue of the quality of teacher-produced tests, limiting itself in the current context to objective, multiple-choice tests. The article investigates a short, two-part 20-item English language test. After a brief overview of the key test qualities of reliability and validity, the article examines the two subtests in terms…
Assessment of Functional Abilities of Moderate Learning Students.

ERIC Educational Resources Information Center

Brundage, Elvira; And Others

The "Assessment of Functional Abilities of Moderate Learning Students" test is presented, along with a list of objectives being tested and brief test administration instructions. The test, which was developed by special education teachers of Cortland-Madison, New York, contains test items for the following 12 strands of a curriculum for secondary…
Objective Assessment of Activity Limitation in Glaucoma with Smartphone Virtual Reality Goggles: A Pilot Study.

PubMed

Goh, Rachel L Z; Kong, Yu Xiang George; McAlinden, Colm; Liu, John; Crowston, Jonathan G; Skalicky, Simon E

2018-01-01

To evaluate the use of smartphone-based virtual reality to objectively assess activity limitation in glaucoma. Cross-sectional study of 93 patients (54 mild, 22 moderate, 17 severe glaucoma). Sociodemographics, visual parameters, Glaucoma Activity Limitation-9 and Visual Function Questionnaire - Utility Index (VFQ-UI) were collected. Mean age was 67.4 ± 13.2 years; 52.7% were male; 65.6% were driving. A smartphone placed inside virtual reality goggles was used to administer the Virtual Reality Glaucoma Visual Function Test (VR-GVFT) to participants, consisting of three parts: stationary, moving ball, driving. Rasch analysis and classical validity tests were conducted to assess performance of VR-GVFT. Twenty-four of 28 stationary test items showed acceptable fit to the Rasch model (person separation 3.02, targeting 0). Eleven of 12 moving ball test items showed acceptable fit (person separation 3.05, targeting 0). No driving test items showed acceptable fit. Stationary test person scores showed good criterion validity, differentiating between glaucoma severity groups ( P = 0.014); modest convergence validity, with mild to moderate correlation with VFQ-UI, better eye (BE) mean deviation, BE pattern deviation, BE central scotoma, worse eye (WE) visual acuity, and contrast sensitivity (CS) in both eyes ( R = 0.243-0.381); and suboptimal divergent validity. Multivariate analysis showed that lower WE CS ( P = 0.044) and greater age ( P = 0.009) were associated with worse stationary test person scores. Smartphone-based virtual reality may be a portable objective simulation test of activity limitation related to glaucomatous visual loss. The use of simulated virtual environments could help better understand the activity limitations that affect patients with glaucoma.
Objective Assessment of Activity Limitation in Glaucoma with Smartphone Virtual Reality Goggles: A Pilot Study

PubMed Central

Goh, Rachel L. Z.; McAlinden, Colm; Liu, John; Crowston, Jonathan G.; Skalicky, Simon E.

2018-01-01

Purpose To evaluate the use of smartphone-based virtual reality to objectively assess activity limitation in glaucoma. Methods Cross-sectional study of 93 patients (54 mild, 22 moderate, 17 severe glaucoma). Sociodemographics, visual parameters, Glaucoma Activity Limitation-9 and Visual Function Questionnaire – Utility Index (VFQ-UI) were collected. Mean age was 67.4 ± 13.2 years; 52.7% were male; 65.6% were driving. A smartphone placed inside virtual reality goggles was used to administer the Virtual Reality Glaucoma Visual Function Test (VR-GVFT) to participants, consisting of three parts: stationary, moving ball, driving. Rasch analysis and classical validity tests were conducted to assess performance of VR-GVFT. Results Twenty-four of 28 stationary test items showed acceptable fit to the Rasch model (person separation 3.02, targeting 0). Eleven of 12 moving ball test items showed acceptable fit (person separation 3.05, targeting 0). No driving test items showed acceptable fit. Stationary test person scores showed good criterion validity, differentiating between glaucoma severity groups (P = 0.014); modest convergence validity, with mild to moderate correlation with VFQ-UI, better eye (BE) mean deviation, BE pattern deviation, BE central scotoma, worse eye (WE) visual acuity, and contrast sensitivity (CS) in both eyes (R = 0.243–0.381); and suboptimal divergent validity. Multivariate analysis showed that lower WE CS (P = 0.044) and greater age (P = 0.009) were associated with worse stationary test person scores. Conclusions Smartphone-based virtual reality may be a portable objective simulation test of activity limitation related to glaucomatous visual loss. Translational Relevance The use of simulated virtual environments could help better understand the activity limitations that affect patients with glaucoma. PMID:29372112
Neurophysiological indices of perceptual object priming in the absence of explicit recognition memory.

PubMed

Harris, Jill D; Cutmore, Tim R H; O'Gorman, John; Finnigan, Simon; Shum, David

2009-02-01

The aim of this study was to identify ERP correlates of perceptual object priming that are insensitive to factors affecting explicit, episodic memory. EEG was recorded from 21 participants while they performed a visual object recognition test on a combination of unstudied items and old items that were previously encountered during either a 'deep' or 'shallow' levels-of-processing (LOP) study task. The results demonstrated a midline P150 old/new effect which was sensitive only to objects' old/new status and not to the accuracy of recognition responses to old items, or to the LOP manipulation. Similar outcomes were observed for the subsequent P200 and N400 effects, the former of which had a parietal scalp maximum and the latter, a broadly distributed topography. In addition an LPC old/new effect typical of those reported in past ERP recognition studies was observed. These outcomes support the proposal that the P150 effect is reflective of perceptual object priming and moreover, provide novel evidence that this and the P200 effect are independent of explicit recognition memory process(es).
Magical thinking and memory: distinctiveness effect for tv commercials with magical content.

PubMed

Subbotsky, Eugene; Mathews, Jayne

2011-10-01

The aim of this study was to examine whether memorizing advertised products of television advertisements with magical effects (i.e., talking animals, inanimate objects which turn into humans, objects that appear from thin air or instantly turn into other objects) is easier than memorizing products of advertisements without such effects, by testing immediate and delayed retention. Adolescents and adults viewed two films containing television advertisements and were asked to recall and recognize the films' characters, events, and advertised products. Film 1 included magical effects, but Film 2 did not. On a free-recall test, no differences in the number of items recalled were noted for the two films. On the immediate recognition test, adolescents, but not adults, showed significantly better recognition for the magical than the nonmagical film. When this test was repeated two weeks later, results were reversed: adults, but not adolescents, recognized a significantly larger number of items from the magical film than the nonmagical one. These results are interpreted to accentuate the role of magical thinking in cognitive processes.
A Multidimensional Tool Based on the eHealth Literacy Framework: Development and Initial Validity Testing of the eHealth Literacy Questionnaire (eHLQ)

PubMed Central

Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H

2018-01-01

Background For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user’s needs, resources, and competence. Objective The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Methods Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). Results CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior productive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: −63.7 to 133.8). The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. Conclusions The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people’s interaction with digital health services. PMID:29434011
Functional relations trump implied motion in recovery from extinction: evidence from the effects of animacy on extinction.

PubMed

Riddoch, M Jane; Riveros, Rodrigo; Humphreys, Glyn W

2011-02-01

Patients with extinction show a characteristic impairment in the identification of objects when two items are presented simultaneously, typically reporting the ipsilesional item only. The effect is thought to be due to a spatial bias advantaging the ipsilesional item under conditions of competing concurrent stimulation. Action relations between objects can result in recovery from extinction as the object pair may be perceived as a single group rather than competing perceptual units. However, objects interacting together can also have implied motion. Here we test whether implied motion is necessary to generate recovery from extinction. We varied orthogonally whether animate and inanimate objects were paired together in positions related or unrelated to action. Implied motion was greater when an animate object was present than when both stimuli were inanimate. Despite this, recovery from extinction was greater when actions were shown between inanimate objects. We suggest that actions between inanimate objects are perceived more easily due to the surfaces of these stimuli being designed for functional goals (e.g., the flat surface of a hammer head is designed to hit the flattened head of a nail). Attention is sensitive to the fit between potential action and the functional properties of objects, and not just to implied motion between stimuli.
Launch Deployment Assembly Extravehicular Activity Neutral Buoyancy Development Test Report

NASA Technical Reports Server (NTRS)

Loughead, T.

1996-01-01

This test evaluated the Launch Deployment Assembly (LDA) design for Extravehicular Activity (EVA) work sites (setup, igress, egress), reach and visual access, and translation required for cargo item removal. As part of the LDA design, this document describes the method and results of the LDA EVA Neutral Buoyancy Development Test to ensure that the LDA hardware support the deployment of the cargo items from the pallet. This document includes the test objectives, flight and mockup hardware description, descriptions of procedures and data collection used in the testing, and the results of the development test at the National Aeronautics and Space Administrations (NASA) Marshall Space Flight Center (MSFC) Neutral Buoyancy Simulator (NBS).
Item-saving assessment of self-care performance in children with developmental disabilities: A prospective caregiver-report computerized adaptive test

PubMed Central

Chen, Cheng-Te; Chen, Yu-Lan; Lin, Yu-Ching; Hsieh, Ching-Lin; Tzeng, Jeng-Yi

2018-01-01

Objective The purpose of this study was to construct a computerized adaptive test (CAT) for measuring self-care performance (the CAT-SC) in children with developmental disabilities (DD) aged from 6 months to 12 years in a content-inclusive, precise, and efficient fashion. Methods The study was divided into 3 phases: (1) item bank development, (2) item testing, and (3) a simulation study to determine the stopping rules for the administration of the CAT-SC. A total of 215 caregivers of children with DD were interviewed with the 73-item CAT-SC item bank. An item response theory model was adopted for examining the construct validity to estimate item parameters after investigation of the unidimensionality, equality of slope parameters, item fitness, and differential item functioning (DIF). In the last phase, the reliability and concurrent validity of the CAT-SC were evaluated. Results The final CAT-SC item bank contained 56 items. The stopping rules suggested were (a) reliability coefficient greater than 0.9 or (b) 14 items administered. The results of simulation also showed that 85% of the estimated self-care performance scores would reach a reliability higher than 0.9 with a mean test length of 8.5 items, and the mean reliability for the rest was 0.86. Administering the CAT-SC could reduce the number of items administered by 75% to 84%. In addition, self-care performances estimated by the CAT-SC and the full item bank were very similar to each other (Pearson r = 0.98). Conclusion The newly developed CAT-SC can efficiently measure self-care performance in children with DD whose performances are comparable to those of TD children aged from 6 months to 12 years as precisely as the whole item bank. The item bank of the CAT-SC has good reliability and a unidimensional self-care construct, and the CAT can estimate self-care performance with less than 25% of the items in the item bank. Therefore, the CAT-SC could be useful for measuring self-care performance in children with DD in clinical and research settings. PMID:29561879
Evaluating the Impact of EBP Education: Development of a Modified Fresno Test for Acute Care Nursing.

PubMed

Halm, Margo A

2018-05-14

Proficiency in evidence-based practice (EBP) is essential for relevant research findings to be integrated into clinical care when congruent with patient preferences. Few valid and reliable tools are available to evaluate the effectiveness of educational programs in advancing EBP attitudes, knowledge, skills, or behaviors, and ongoing competency. The Fresno test is one objective method to evaluate EBP knowledge and skills; however, the original and modified versions were validated with family physicians, physical therapists, and speech and language therapists. To adapt the Modified Fresno-Acute Care Nursing test and develop a psychometrically sound tool for use in academic and practice settings. In Phase 1, modified Fresno (Tilson, 2010) items were adapted for acute care nursing. In Phase 2, content validity was established with an expert panel. Content validity indices (I-CVI) ranged from .75 to 1.0. Scale CVI was .95%. A cross-sectional convenience sample of acute care nurses (n = 90) in novice, master, and expert cohorts completed the Modified Fresno-Acute Care Nursing test administered electronically via SurveyMonkey. Total scores were significantly different between training levels (p < .0001). Novice nurses scored significantly lower than master or expert nurses, but differences were not found between the latter cohorts. Total score reliability was acceptable: (interrater [ICC (2, 1)]) = .88. Cronbach's alpha was 0.70. Psychometric properties of most modified items were satisfactory; however, six require further revision and testing to meet acceptable standards. The Modified Fresno-Acute Care Nursing test is a 14-item test for objectively assessing EBP knowledge and skills of acute care nurses. While preliminary psychometric properties for this new EBP knowledge measure for acute care nursing are promising, further validation of some of the items and scoring rubric is needed. © 2018 Sigma Theta Tau International.
Behavioral Objectives and Related Test Items for Selected Units in Automotive Mechanics.

ERIC Educational Resources Information Center

Hill, Richard K., Ed.; And Others

This is a catalog of behavioral objectives for Vocational Automotive Mechanics organized by units of instruction as listed in the State curriculum guide. Each unit contains a suggested outline of content, a goal statement, and general and specific objectives. The units taught are: introduction to the automobile; basic hand tools--fasteners and…
AI-MSG modification work plan. [LMFBR

DOE Office of Scientific and Technical Information (OSTI.GOV)

Page, J.P.

1973-08-20

This document contains the Work Plan for the modification of the AI Steam Generator for tests in Large Leak Test Rig. This Work Plan describes the objectives, scope of work, schedule and manpower, end items, and meetings and reports required for the modification.
Development and psychometric testing of an instrument designed to measure chronic pain in dogs with osteoarthritis

PubMed Central

Boston, Raymond C.; Coyne, James C.; Farrar, John T.

2010-01-01

Objective To develop and psychometrically test an owner self-administered questionnaire designed to assess severity and impact of chronic pain in dogs with osteoarthritis. Sample Population 70 owners of dogs with osteoarthritis and 50 owners of clinically normal dogs. Procedures Standard methods for the stepwise development and testing of instruments designed to assess subjective states were used. Items were generated through focus groups and an expert panel. Items were tested for readability and ambiguity, and poorly performing items were removed. The reduced set of items was subjected to factor analysis, reliability testing, and validity testing. Results Severity of pain and interference with function were 2 factors identified and named on the basis of the items contained in them. Cronbach’s α was 0.93 and 0.89, respectively, suggesting that the items in each factor could be assessed as a group to compute factor scores (ie, severity score and interference score). The test-retest analysis revealed κ values of 0.75 for the severity score and 0.81 for the interference score. Scores correlated moderately well (r = 0.51 and 0.50, respectively) with the overall quality-of-life (QOL) question, such that as severity and interference scores increased, QOL decreased. Clinically normal dogs had significantly lower severity and interference scores than dogs with osteoarthritis. Conclusions and Clinical Relevance A psychometrically sound instrument was developed. Responsiveness testing must be conducted to determine whether the questionnaire will be useful in reliably obtaining quantifiable assessments from owners regarding the severity and impact of chronic pain and its treatment on dogs with osteoarthritis. PMID:17542696
Measuring the Test-Wiseness of Medical Students.

ERIC Educational Resources Information Center

Harvill, Leo M.

The objectives for this study were to: (1) develop a valid, reliable measure of test-wiseness with equivalent forms for use with students in the health sciences; and (2) determine the level of test-wiseness of entering medical students. The test-wiseness areas included in this study were: similar options, umbrella term, item give-away, convergence…

Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

PubMed Central

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Meatcutting Testbook, Part I.

ERIC Educational Resources Information Center

Strazicich, Mirko, Ed.

This document contains objective tests for each lesson in the Meatcutting Workbook, Part I (see note), which is designed for apprenticeship programs in meatcutting in California. Each of the 36 tests contains from 10 to 45 multiple-choice items. The tests are grouped according to the eight units of the workbook: the apprentice meatcutter; applied…
Development and Validation of a Test for Bulimia.

ERIC Educational Resources Information Center

Smith, Marcia C.; Thelen, Mark H.

1984-01-01

Developed the Bulimia Test (BULIT) based on responses of clinically identified females (N=18) and normal female college students (N=119) to preliminary test items. Results showed that the BULIT provided an objective, reliable, and valid measure by which to identify individuals with symptoms of bulimia. (Instrument is appended.) (LLL)
The Subjective and Objective Interface of Bias Detection on Language Tests

ERIC Educational Resources Information Center

Ross, Steven J.; Okabe, Junko

2006-01-01

Test validity is predicated on there being a lack of bias in tasks, items, or test content. It is well-known that factors such as test candidates' mother tongue, life experiences, and socialization practices of the wider community may serve to inject subtle interactions between individuals' background and the test content. When the gender of the…
A Single-System Model Predicts Recognition Memory and Repetition Priming in Amnesia

PubMed Central

Kessels, Roy P.C.; Wester, Arie J.; Shanks, David R.

2014-01-01

We challenge the claim that there are distinct neural systems for explicit and implicit memory by demonstrating that a formal single-system model predicts the pattern of recognition memory (explicit) and repetition priming (implicit) in amnesia. In the current investigation, human participants with amnesia categorized pictures of objects at study and then, at test, identified fragmented versions of studied (old) and nonstudied (new) objects (providing a measure of priming), and made a recognition memory judgment (old vs new) for each object. Numerous results in the amnesic patients were predicted in advance by the single-system model, as follows: (1) deficits in recognition memory and priming were evident relative to a control group; (2) items judged as old were identified at greater levels of fragmentation than items judged new, regardless of whether the items were actually old or new; and (3) the magnitude of the priming effect (the identification advantage for old vs new items) overall was greater than that of items judged new. Model evidence measures also favored the single-system model over two formal multiple-systems models. The findings support the single-system model, which explains the pattern of recognition and priming in amnesia primarily as a reduction in the strength of a single dimension of memory strength, rather than a selective explicit memory system deficit. PMID:25122896
78 FR 50108 - Notice of Intent To Repatriate Cultural Item: Rochester Museum & Science Center, Rochester, NY

Federal Register 2010, 2011, 2012, 2013, 2014

2013-08-16

... that the cultural item listed in this notice meets the definition of a sacred object and an object of... definition of a sacred object and an object of cultural patrimony under 25 U.S.C. 3001. This notice is... Item(s) The one sacred object and object of cultural patrimony is a Chilkat blanket (27.92.1/AE 580...
The development of a computer assisted instruction and assessment system in pharmacology.

PubMed

Madsen, B W; Bell, R C

1977-01-01

We describe the construction of a computer based system for instruction and assessment in pharmacology, utilizing a large bank of multiple choice questions. Items were collected from many sources, edited and coded for student suitability, topic, taxonomy and difficulty and text references. Students reserve a time during the day, specify the type of test desired and questions are presented randomly from the subset satisfying their criteria. Answers are scored after each question and a summary given at the end of every test; details on item performance are recorded automatically. The biggest hurdle in implementation was the assembly, review, classification and editing of items, while the programming was relatively straight-forward. A number of modifications had to be made to the initial plans and changes will undoubtedly continue with further experience. When fully operational the system will possess a number of advantages including: elimination of test preparation, editing and marking; facilitated item review opportunities; increased objectivity, feedback, flexibility and descreased anxiety in students.
Attention during memory retrieval enhances future remembering.

PubMed

Dudukovic, Nicole M; Dubrow, Sarah; Wagner, Anthony D

2009-10-01

Memory retrieval is a powerful learning event that influences whether an experience will be remembered in the future. Although retrieval can succeed in the presence of distraction, dividing attention during retrieval may reduce the power of remembering as an encoding event. In the present experiments, participants studied pictures of objects under full attention and then engaged in item recognition and source memory retrieval under full or divided attention. Two days later, a second recognition and source recollection test assessed the impact of attention during initial retrieval on long-term retention. On this latter test, performance was superior for items that had been tested initially under full versus divided attention. More importantly, even when items were correctly recognized on the first test, divided attention reduced the likelihood of subsequent recognition on the second test. The same held true for source recollection. Additionally, foils presented during the first test were also less likely to be later recognized if they had been encountered initially under divided attention. These findings demonstrate that attentive retrieval is critical for learning through remembering.
Role of Cognitive Testing in the Development of the CAHPS® Hospital Survey

PubMed Central

Levine, Roger E; Fowler, Floyd J; Brown, Julie A

2005-01-01

Objective To describe how cognitive testing results were used to inform the modification and selection of items for the Consumer Assessment of Health Providers and Systems (CAHPS®) Hospital Survey pilot test instrument. Data Sources Cognitive interviews were conducted on 31 subjects in two rounds of testing: in December 2002–January 2003 and in February 2003. In both rounds, interviews were conducted in northern California, southern California, Massachusetts, and North Carolina. Study Design A common protocol served as the basis for cognitive testing activities in each round. This protocol was modified to enable testing of the items as interviewer-administered and self-administered items and to allow members of each of three research teams to use their preferred cognitive research tools. Data Collection/Extraction Methods Each research team independently summarized, documented, and reported their findings. Item-specific and general issues were noted. The results were reviewed and discussed by senior staff from each research team after each round of testing, to inform the acceptance, modification, or elimination of candidate items. Principal Findings Many candidate items required modification because respondents lacked the information required to answer them, respondents failed to understand them consistently, the items were not measuring the constructs they were intended to measure, the items were based on erroneous assumptions about what respondents wanted or experienced during their hospitalization, or the items were asking respondents to make distinctions that were too fine for them to make. Cognitive interviewing enabled the detection of these problems; an understanding of the etiology of the problem informed item revisions. However, for some constructs, the revisions proved to be inadequate. Accordingly, items could not be developed to provide acceptable measures of certain constructs such as shared decision making, coordination of care, and delays in the admissions process. Conclusions Cognitive testing is the most direct way of finding out whether respondents understand questions consistently, have the information needed to answer the questions, and can use the response alternatives provided to describe their experiences or their opinions accurately. Many of the candidate questions failed to meet these standards. Cognitive testing only evaluates the way in which respondents understand and answer questions. Although it does not directly assess the validity of the answers, it is a reasonable premise that cognitive problems will seriously compromise validity and reliability. PMID:16316437
Medial Temporal Lobe Contributions to Cued Retrieval of Items and Contexts

PubMed Central

Hannula, Deborah E.; Libby, Laura A.; Yonelinas, Andrew P.; Ranganath, Charan

2013-01-01

Several models have proposed that different regions of the medial temporal lobes contribute to different aspects of episodic memory. For instance, according to one view, the perirhinal cortex represents specific items, parahippocampal cortex represents information regarding the context in which these items were encountered, and the hippocampus represents item-context bindings. Here, we used event-related functional magnetic resonance imaging (fMRI) to test a specific prediction of this model – namely, that successful retrieval of items from context cues will elicit perirhinal recruitment and that successful retrieval of contexts from item cues will elicit parahippocampal cortex recruitment. Retrieval of the bound representation in either case was expected to elicit hippocampal engagement. To test these predictions, we had participants study several item-context pairs (i.e., pictures of objects and scenes, respectively), and then had them attempt to recall items from associated context cues and contexts from associated item cues during a scanned retrieval session. Results based on both univariate and multivariate analyses confirmed a role for hippocampus in content-general relational memory retrieval, and a role for parahippocampal cortex in successful retrieval of contexts from item cues. However, we also found that activity differences in perirhinal cortex were correlated with successful cued recall for both items and contexts. These findings provide partial support for the above predictions and are discussed with respect to several models of medial temporal lobe function. PMID:23466350
Selective Maintenance in Visual Working Memory Does Not Require Sustained Visual Attention

PubMed Central

Hollingworth, Andrew; Maxcey-Richard, Ashleigh M.

2012-01-01

In four experiments, we tested whether sustained visual attention is required for the selective maintenance of objects in VWM. Participants performed a color change-detection task. During the retention interval, a valid cue indicated the item that would be tested. Change detection performance was higher in the valid-cue condition than in a neutral-cue control condition. To probe the role of visual attention in the cuing effect, on half of the trials, a difficult search task was inserted after the cue, precluding sustained attention on the cued item. The addition of the search task produced no observable decrement in the magnitude of the cuing effect. In a complementary test, search efficiency was not impaired by simultaneously prioritizing an object for retention in VWM. The results demonstrate that selective maintenance in VWM can be dissociated from the locus of visual attention. PMID:23067118
Selective maintenance in visual working memory does not require sustained visual attention.

PubMed

Hollingworth, Andrew; Maxcey-Richard, Ashleigh M

2013-08-01

In four experiments, we tested whether sustained visual attention is required for the selective maintenance of objects in visual working memory (VWM). Participants performed a color change-detection task. During the retention interval, a valid cue indicated the item that would be tested. Change-detection performance was higher in the valid-cue condition than in a neutral-cue control condition. To probe the role of visual attention in the cuing effect, on half of the trials, a difficult search task was inserted after the cue, precluding sustained attention on the cued item. The addition of the search task produced no observable decrement in the magnitude of the cuing effect. In a complementary test, search efficiency was not impaired by simultaneously prioritizing an object for retention in VWM. The results demonstrate that selective maintenance in VWM can be dissociated from the locus of visual attention. 2013 APA, all rights reserved
Methodology for the development and calibration of the SCI-QOL item banks

PubMed Central

Tulsky, David S.; Kisala, Pamela A.; Victorson, David; Choi, Seung W.; Gershon, Richard; Heinemann, Allen W.; Cella, David

2015-01-01

Objective To develop a comprehensive, psychometrically sound, and conceptually grounded patient reported outcomes (PRO) measurement system for individuals with spinal cord injury (SCI). Methods Individual interviews (n = 44) and focus groups (n = 65 individuals with SCI and n = 42 SCI clinicians) were used to select key domains for inclusion and to develop PRO items. Verbatim items from other cutting-edge measurement systems (i.e. PROMIS, Neuro-QOL) were included to facilitate linkage and cross-population comparison. Items were field tested in a large sample of individuals with traumatic SCI (n = 877). Dimensionality was assessed with confirmatory factor analysis. Local item dependence and differential item functioning were assessed, and items were calibrated using the item response theory (IRT) graded response model. Finally, computer adaptive tests (CATs) and short forms were administered in a new sample (n = 245) to assess test-retest reliability and stability. Participants and Procedures A calibration sample of 877 individuals with traumatic SCI across five SCI Model Systems sites and one Department of Veterans Affairs medical center completed SCI-QOL items in interview format. Results We developed 14 unidimensional calibrated item banks and 3 calibrated scales across physical, emotional, and social health domains. When combined with the five Spinal Cord Injury – Functional Index physical function banks, the final SCI-QOL system consists of 22 IRT-calibrated item banks/scales. Item banks may be administered as CATs or short forms. Scales may be administered in a fixed-length format only. Conclusions The SCI-QOL measurement system provides SCI researchers and clinicians with a comprehensive, relevant and psychometrically robust system for measurement of physical-medical, physical-functional, emotional, and social outcomes. All SCI-QOL instruments are freely available on Assessment CenterSM. PMID:26010963
Psychometrics of the preschooler physical activity parenting practices instrument among a Latino sample.

PubMed

O'Connor, Teresia M; Cerin, Ester; Hughes, Sheryl O; Robles, Jessica; Thompson, Deborah I; Mendoza, Jason A; Baranowski, Tom; Lee, Rebecca E

2014-01-15

Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach's alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children's objectively measured PA. The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children's PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children.
Business Education--Business Law: Grades 10-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Thirty-seven objectives and related test items for business law courses taught in grades 10 through 12 are organized into the following categories: (1) foundations of law; (2) law of contracts, property, and negotiable instruments; (3) business relations and business organizations; and (4) vocabulary. Each objective contains three elements: the…
Effects of Varied Enhancement Strategies (Chunking, Feedback, Gaming) in Complementing Animated Instruction in Facilitating Different Types of Learning Objectives

ERIC Educational Resources Information Center

Munyofu, Mine

2008-01-01

The purpose of this study was to examine the instructional effectiveness of different levels of chunking (simple visual/text and complex visual/text), different forms of feedback (item-by-item feedback, end-of-test feedback and no feedback), and use of instructional gaming (game and no game) in complementing animated programmed instruction on a…
The medial dorsal thalamic nucleus and the medial prefrontal cortex of the rat function together to support associative recognition and recency but not item recognition.

PubMed

Cross, Laura; Brown, Malcolm W; Aggleton, John P; Warburton, E Clea

2012-12-21

In humans recognition memory deficits, a typical feature of diencephalic amnesia, have been tentatively linked to mediodorsal thalamic nucleus (MD) damage. Animal studies have occasionally investigated the role of the MD in single-item recognition, but have not systematically analyzed its involvement in other recognition memory processes. In Experiment 1 rats with bilateral excitotoxic lesions in the MD or the medial prefrontal cortex (mPFC) were tested in tasks that assessed single-item recognition (novel object preference), associative recognition memory (object-in-place), and recency discrimination (recency memory task). Experiment 2 examined the functional importance of the interactions between the MD and mPFC using disconnection techniques. Unilateral excitotoxic lesions were placed in both the MD and the mPFC in either the same (MD + mPFC Ipsi) or opposite hemispheres (MD + mPFC Contra group). Bilateral lesions in the MD or mPFC impaired object-in-place and recency memory tasks, but had no effect on novel object preference. In Experiment 2 the MD + mPFC Contra group was significantly impaired in the object-in-place and recency memory tasks compared with the MD + mPFC Ipsi group, but novel object preference was intact. Thus, connections between the MD and mPFC are critical for recognition memory when the discriminations involve associative or recency information. However, the rodent MD is not necessary for single-item recognition memory.
Measuring quality of life in patients with head and neck cancer: Update of the EORTC QLQ-H&N Module, Phase III.

PubMed

Singer, Susanne; Araújo, Cláudia; Arraras, Juan Ignacio; Baumann, Ingo; Boehm, Andreas; Brokstad Herlofson, Bente; Castro Silva, Joaquim; Chie, Wei-Chu; Fisher, Sheila; Guntinas-Lichius, Orlando; Hammerlid, Eva; Irarrázaval, María Elisa; Jensen Hjermstad, Marianne; Jensen, Kenneth; Kiyota, Naomi; Licitra, Lisa; Nicolatou-Galitis, Ourania; Pinto, Monica; Santos, Marcos; Schmalz, Claudia; Sherman, Allen C; Tomaszewska, Iwona M; Verdonck de Leeuw, Irma; Yarom, Noam; Zotti, Paola; Hofmeister, Dirk

2015-09-01

The objective of this study was to pilot test an updated version of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Head and Neck Module (EORTC QLQ-H&N60). Patients with head and neck cancer were asked to complete a list of 60 head and neck cancer-specific items comprising the updated EORTC head and neck module and the core questionnaire EORTC QLQ-C30. Debriefing interviews were conducted to identify any irrelevant items and confusing or upsetting wording. Interviews were performed with 330 patients from 17 countries, representing different head and neck cancer sites and treatments. Forty-one of the 60 items were retained according to the predefined EORTC criteria for module development, for another 2 items the wording was refined, and 17 items were removed. The preliminary EORTC QLQ-H&N43 can now be used in academic research. Psychometrics will be tested in a larger field study. © 2014 Wiley Periodicals, Inc.
Item response theory detects differential item functioning between healthy and ill children in QoL measures

PubMed Central

Langer, Michelle M.; Hill, Cheryl D.; Thissen, David; Burwinkle, Tasha M.; Varni, James W.; DeWalt, Darren A.

2008-01-01

Objective To demonstrate the value of item response theory (IRT) and differential item functioning (DIF) methods in examining a health-related quality of life (HRQOL) measure in children and adolescents. Study Design and Setting This illustration uses data from 5,429 children using the four subscales of the PedsQL™ 4.0 Generic Core Scales. The IRT model-based likelihood ratio test was used to detect and evaluate DIF between healthy children and children with a chronic condition. Results DIF was detected for a majority of items but cancelled out at the total test score level due to opposing directions of DIF. Post-hoc analysis indicated that this pattern of results may be due to multidimensionality. We discuss issues in detecting and handling DIF. Conclusion This paper describes how to perform DIF analyses in validating a questionnaire to ensure that scores have equivalent meaning across subgroups. It offers insight into ways information gained through the analysis can be used to evaluate an existing scale. PMID:18226750
Development of and Field-Test Results for the CAHPS PCMH Survey

PubMed Central

Scholle, Sarah Hudson; Vuong, Oanh; Ding, Lin; Fry, Stephanie; Gallagher, Patricia; Brown, Julie A.; Hays, Ron D.; Cleary, Paul D.

2017-01-01

Objective To develop and evaluate survey questions that assess processes of care relevant to Patient-Centered Medical Homes (PCMHs). Research Design We convened expert panels, reviewed evidence on effective care practices and existing surveys, elicited broad public input, and conducted cognitive interviews and a field test to develop items relevant to PCMHs that could be added to the CAHPS® Clinician & Group (CG-CAHPS) 1.0 Survey. Surveys were tested using a two-contact mail protocol in 10 adult and 33 pediatric practices (both private and community health centers) in Massachusetts. A total of 4,875 completed surveys were received (overall response rate of 25%). Analyses We calculated the rate of valid responses for each item. We conducted exploratory factor analyses and estimated item-to-total correlations, individual and site level reliability, and correlations among proposed multi-item composites. Results Ten items in four new domains (Comprehensiveness, Information, Self-Management Support, and Shared Decision-Making) and four items in two existing domains (Access and Coordination of Care) were selected to be supplemental items to be used in conjunction with the adult CG-CAHPS 1.0 survey. For the child version, four items in each of two new domains (Information and Self-Management Support) and five items in existing domains (Access, Comprehensiveness-Prevention, Coordination of Care) were selected. Conclusions This study provides support for the reliability and validity of new items to supplement the CG-CAHPS 1.0 survey to assess aspects of primary care that are important attributes of Patient-Centered Medical Homes. PMID:23064272

Item and scale differential functioning of the Mini-Mental State Exam assessed using the Differential Item and Test Functioning (DFIT) Framework.

PubMed

Morales, Leo S; Flowers, Claudia; Gutierrez, Peter; Kleinman, Marjorie; Teresi, Jeanne A

2006-11-01

To illustrate the application of the Differential Item and Test Functioning (DFIT) method using English and Spanish versions of the Mini-Mental State Examination (MMSE). Study participants were 65 years of age or older and lived in North Manhattan, New York. Of the 1578 study participants who were administered the MMSE 665 completed it in Spanish. : The MMSE contains 20 items that measure the degree of cognitive impairment in the areas of orientation, attention and calculation, registration, recall and language, as well as the ability to follow verbal and written commands. After assessing the dimensionality of the MMSE scale, item response theory person and item parameters were estimated separately for the English and Spanish sample using Samejima's 2-parameter graded response model. Then the DFIT framework was used to assess differential item functioning (DIF) and differential test functioning (DTF). Nine items were found to show DIF; these were items that ask the respondent to name the correct season, day of the month, city, state, and 2 nearby streets, recall 3 objects, repeat the phrase no ifs, no ands, no buts, follow the command, "close your eyes," and the command, "take the paper in your right hand, fold the paper in half with both hands, and put the paper down in your lap." At the scale level, however, the MMSE did not show differential functioning. Respondents to the English and Spanish versions of the MMSE are comparable on the basis of scale scores. However, assessments based on individual MMSE items may be misleading.
ATLS-stowage and deployment testing of medical supplies and pharmaceuticals

NASA Technical Reports Server (NTRS)

Gosbee, John; Benz, Darren; Lloyd, Charles W.; Bueker, Richard; Orsak, Debra

1991-01-01

The objective is to evaluate stowage and deployment methods for the Health Maintenance Facility (HMF) during microgravity. The specific objectives of this experiment are: (1) to evaluate the stowage and deployment mechanisms for the medical supplies; and (2) to evaluate the procedures for performing medical scenarios. To accomplish these objectives, the HMF test mini-racks will contain medical equipment mounted in the racks; and self-contained drawers with various mechanisms for stowing and deploying items. The medical supplies and pharmaceuticals will be destowed, handled, and restowed. The in-flight test procedures and other aspects of the KC-135 parabolic flight test to simulate weightlessness are presented.
Construction of Valid and Reliable Test for Assessment of Students

ERIC Educational Resources Information Center

Osadebe, P. U.

2015-01-01

The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Construction of Economics Achievement Test for Assessment of Students

ERIC Educational Resources Information Center

Osadebe, P. U.

2014-01-01

The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Conceptualising computerized adaptive testing for measurement of latent variables associated with physical objects

NASA Astrophysics Data System (ADS)

Camargo, F. R.; Henson, B.

2015-02-01

The notion of that more or less of a physical feature affects in different degrees the users' impression with regard to an underlying attribute of a product has frequently been applied in affective engineering. However, those attributes exist only as a premise that cannot directly be measured and, therefore, inferences based on their assessment are error-prone. To establish and improve measurement of latent attributes it is presented in this paper the concept of a stochastic framework using the Rasch model for a wide range of independent variables referred to as an item bank. Based on an item bank, computerized adaptive testing (CAT) can be developed. A CAT system can converge into a sequence of items bracketing to convey information at a user's particular endorsement level. It is through item banking and CAT that the financial benefits of using the Rasch model in affective engineering can be realised.
Developmental changes in visual short-term memory in infancy: evidence from eye-tracking.

PubMed

Oakes, Lisa M; Baumgartner, Heidi A; Barrett, Frederick S; Messenger, Ian M; Luck, Steven J

2013-01-01

We assessed visual short-term memory (VSTM) for color in 6- and 8-month-old infants (n = 76) using a one-shot change detection task. In this task, a sample array of two colored squares was visible for 517 ms, followed by a 317-ms retention period and then a 3000-ms test array consisting of one unchanged item and one item in a new color. We tracked gaze at 60 Hz while infants looked at the changed and unchanged items during test. When the two sample items were different colors (Experiment 1), 8-month-old infants exhibited a preference for the changed item, indicating memory for the colors, but 6-month-olds exhibited no evidence of memory. When the two sample items were the same color and did not need to be encoded as separate objects (Experiment 2), 6-month-old infants demonstrated memory. These results show that infants can encode information in VSTM in a single, brief exposure that simulates the timing of a single fixation period in natural scene viewing, and they reveal rapid developmental changes between 6 and 8 months in the ability to store individuated items in VSTM.
Designing and Testing an Inventory for Measuring Social Media Competency of Certified Health Education Specialists

PubMed Central

Bernhardt, Jay M; Stellefson, Michael; Weiler, Robert M; Anderson-Lewis, Charkarra; Miller, M David; MacInnes, Jann

2015-01-01

Background Social media can promote healthy behaviors by facilitating engagement and collaboration among health professionals and the public. Thus, social media is quickly becoming a vital tool for health promotion. While guidelines and trainings exist for public health professionals, there are currently no standardized measures to assess individual social media competency among Certified Health Education Specialists (CHES) and Master Certified Health Education Specialists (MCHES). Objective The aim of this study was to design, develop, and test the Social Media Competency Inventory (SMCI) for CHES and MCHES. Methods The SMCI was designed in three sequential phases: (1) Conceptualization and Domain Specifications, (2) Item Development, and (3) Inventory Testing and Finalization. Phase 1 consisted of a literature review, concept operationalization, and expert reviews. Phase 2 involved an expert panel (n=4) review, think-aloud sessions with a small representative sample of CHES/MCHES (n=10), a pilot test (n=36), and classical test theory analyses to develop the initial version of the SMCI. Phase 3 included a field test of the SMCI with a random sample of CHES and MCHES (n=353), factor and Rasch analyses, and development of SMCI administration and interpretation guidelines. Results Six constructs adapted from the unified theory of acceptance and use of technology and the integrated behavioral model were identified for assessing social media competency: (1) Social Media Self-Efficacy, (2) Social Media Experience, (3) Effort Expectancy, (4) Performance Expectancy, (5) Facilitating Conditions, and (6) Social Influence. The initial item pool included 148 items. After the pilot test, 16 items were removed or revised because of low item discrimination (r<.30), high interitem correlations (Ρ>.90), or based on feedback received from pilot participants. During the psychometric analysis of the field test data, 52 items were removed due to low discrimination, evidence of content redundancy, low R-squared value, or poor item infit or outfit. Psychometric analyses of the data revealed acceptable reliability evidence for the following scales: Social Media Self-Efficacy (alpha=.98, item reliability=.98, item separation=6.76), Social Media Experience (alpha=.98, item reliability=.98, item separation=6.24), Effort Expectancy(alpha =.74, item reliability=.95, item separation=4.15), Performance Expectancy (alpha =.81, item reliability=.99, item separation=10.09), Facilitating Conditions (alpha =.66, item reliability=.99, item separation=16.04), and Social Influence (alpha =.66, item reliability=.93, item separation=3.77). There was some evidence of local dependence among the scales, with several observed residual correlations above |.20|. Conclusions Through the multistage instrument-development process, sufficient reliability and validity evidence was collected in support of the purpose and intended use of the SMCI. The SMCI can be used to assess the readiness of health education specialists to effectively use social media for health promotion research and practice. Future research should explore associations across constructs within the SMCI and evaluate the ability of SMCI scores to predict social media use and performance among CHES and MCHES. PMID:26399428
Using Optimal Test Assembly Methods for Shortening Patient-Reported Outcome Measures: Development and Validation of the Cochin Hand Function Scale-6: A Scleroderma Patient-Centered Intervention Network Cohort Study.

PubMed

Levis, Alexander W; Harel, Daphna; Kwakkenbos, Linda; Carrier, Marie-Eve; Mouthon, Luc; Poiraudeau, Serge; Bartlett, Susan J; Khanna, Dinesh; Malcarne, Vanessa L; Sauve, Maureen; van den Ende, Cornelia H M; Poole, Janet L; Schouffoer, Anne A; Welling, Joep; Thombs, Brett D

2016-11-01

To develop and validate a short form of the Cochin Hand Function Scale (CHFS), which measures hand disability, for use in systemic sclerosis, using objective criteria and reproducible techniques. Responses on the 18-item CHFS were obtained from English-speaking patients enrolled in the Scleroderma Patient-Centered Intervention Network Cohort. CHFS unidimensionality was verified using confirmatory factor analysis, and an item response theory model was fit to CHFS items. Optimal test assembly (OTA) methods identified a maximally precise short form for each possible form length between 1 and 17 items. The final short form selected was the form with the least number of items that maintained statistically equivalent convergent validity, compared to the full-length CHFS, with the Health Assessment Questionnaire (HAQ) disability index (DI) and the physical function domain of the 29-item Patient-Reported Outcomes Measurement Information System (PROMIS-29). There were 601 patients included. A 6-item short form of the CHFS (CHFS-6) was selected. The CHFS-6 had a Cronbach's alpha of 0.93. Correlations of the CHFS-6 summed score with HAQ DI (r = 0.79) and PROMIS-29 physical function (r = -0.54) were statistically equivalent to the CHFS (r = 0.81 and r = -0.56). The correlation with the full CHFS was high (r = 0.98). The OTA procedure generated a valid short form of the CHFS with minimal loss of information compared to the full-length form. The OTA method used was based on objective, prespecified criteria, but should be further studied for viability as a general procedure for shortening patient-reported outcome measures in health research. © 2016, American College of Rheumatology.
Cognitive Process Modeling of Spatial Ability: The Assembling Objects Task

ERIC Educational Resources Information Center

Ivie, Jennifer L.; Embretson, Susan E.

2010-01-01

Spatial ability tasks appear on many intelligence and aptitude tests. Although the construct validity of spatial ability tests has often been studied through traditional correlational methods, such as factor analysis, less is known about the cognitive processes involved in solving test items. This study examines the cognitive processes involved in…
Modifying the Test of Understanding Graphs in Kinematics

ERIC Educational Resources Information Center

Zavala, Genaro; Tejeda, Santa; Barniol, Pablo; Beichner, Robert J.

2017-01-01

In this article, we present several modifications to the Test of Understanding Graphs in Kinematics. The most significant changes are (i) the addition and removal of items to achieve parallelism in the objectives (dimensions) of the test, thus allowing comparisons of students' performance that were not possible with the original version, and (ii)…
Dimensional analyses of frontal posed smile attractiveness in Japanese female patients.

PubMed

Hata, Kyoko; Arai, Kazuhito

2016-01-01

To identify appropriate dimensional items in objective diagnostic analysis for attractiveness of frontal posed smile in Japanese female patients by comparing with the result of human judgments. Photographs of frontal posed smiles of 100 Japanese females after orthodontic treatment were evaluated by 20 dental students (10 males and 10 females) using a visual analogue scale (VAS). The photographs were ranked based on the VAS evaluations and the 25 photographs with the highest evaluations were selected as group A, and the 25 photos with the lowest evaluations were designated group B. Then 12 dimensional items of objective analysis selected from a literature review were measured. Means and standard deviations for measurements of the dimensional items were compared between the groups using the unpaired t-test with a significance level of P < .05. Mean values were significantly smaller in group A than in group B for interlabial gap, intervermilion distance, maxillary gingival display, maximum incisor exposure, and lower lip to incisor (P < .05). Significant differences were observed only in the vertical dimension, not in the transverse dimension. Five of the 12 objective diagnostic items were correlated with human judgments of the attractiveness of frontal posed smile in Japanese females after orthodontic treatment.
Do healthier foods cost more in Saudi Arabia than less healthier options?

PubMed Central

Gosadi, Ibrahim M.; Alshehri, Muner A.; Alawad, Saud H.

2016-01-01

Objectives: To investigate whether healthy foods in Saudi Arabia cost more compared with less healthy options. Method: This is a cross-sectional study conducted in Riyadh, Saudi Arabia during June and July 2015. The study targeted well-known market chains in the city of Riyadh. The selection of food items was purposive to include healthy and less healthy food items in each category. Price, caloric value, salt, fat, sugar, and fiber contents for each food item were collected. To test for the correlation between nutritional contents and average price, Spearman’s correlation coefficients were calculated. The Mann-Whitney U test was used to test for the presence of average price difference between healthy and less healthy food items. Results: A total of 162 food items were collected. Sixty-six food items were classified as healthy compared with 96 less healthier options. The calculated correlation coefficients indicate an association between increased cost of food with increased caloric values (0.649 p=0.0000001), increased fat content (0.610 p=0.0000003), and increased salt contents (0.273 p=0.001). Prices of food items with higher fiber contents showed a weaker association (0.191 p=0.015). The overall average cost of healthy food was approximately 10 Saudi riyals cheaper than less healthy food (p=0.000001). Conclusion: The findings of the study suggest that the cost of healthy food is lower than that of less healthy items in the Saudi market. PMID:27570859
Family Living and Parenthood. Performance Objectives and Criterion-Referenced Test Items.

ERIC Educational Resources Information Center

Missouri Univ., Columbia. Instructional Materials Lab.

This guide was developed to assist home economics teachers in implementing the Missouri Vocational Instructional Management System into the home economics curriculum at the local level through a family living and parenthood semester course. The course contains a minimum of two performance objectives for each competency developed and validated by…
Using the Rasch Measurement Model in Psychometric Analysis of the Family Effectiveness Measure

PubMed Central

McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.

2013-01-01

Background Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID:23636342
Retrieving self-vocalized information: An event-related potential (ERP) study on the effect of retrieval orientation.

PubMed

Rosburg, Timm; Johansson, Mikael; Sprondel, Volker; Mecklinger, Axel

2014-11-18

Retrieval orientation refers to a pre-retrieval process and conceptualizes the specific form of processing that is applied to a retrieval cue. In the current event-related potential (ERP) study, we sought to find evidence for an involvement of the auditory cortex when subjects attempt to retrieve vocalized information, and hypothesized that adopting retrieval orientation would be beneficial for retrieval accuracy. During study, participants saw object words that they subsequently vocalized or visually imagined. At test, participants had to identify object names of one study condition as targets and to reject object names of the second condition together with new items. Target category switched after half of the test trials. Behaviorally, participants responded less accurately and more slowly to targets of the vocalize condition than to targets of the imagine condition. ERPs to new items varied at a single left electrode (T7) between 500 and 800ms, indicating a moderate retrieval orientation effect in the subject group as a whole. However, whereas the effect was strongly pronounced in participants with high retrieval accuracy, it was absent in participants with low retrieval accuracy. A current source density (CSD) mapping of the retrieval orientation effect indicated a source over left temporal regions. Independently from retrieval accuracy, the ERP retrieval orientation effect was surprisingly also modulated by test order. Findings are suggestive for an involvement of the auditory cortex in retrieval attempts of vocalized information and confirm that adopting retrieval orientation is potentially beneficial for retrieval accuracy. The effects of test order on retrieval-related processes might reflect a stronger focus on the newness of items in the more difficult test condition when participants started with this condition. Copyright © 2014 Elsevier Inc. All rights reserved.
Medial temporal lobe contributions to cued retrieval of items and contexts.

PubMed

Hannula, Deborah E; Libby, Laura A; Yonelinas, Andrew P; Ranganath, Charan

2013-10-01

Several models have proposed that different regions of the medial temporal lobes contribute to different aspects of episodic memory. For instance, according to one view, the perirhinal cortex represents specific items, parahippocampal cortex represents information regarding the context in which these items were encountered, and the hippocampus represents item-context bindings. Here, we used event-related functional magnetic resonance imaging (fMRI) to test a specific prediction of this model-namely, that successful retrieval of items from context cues will elicit perirhinal recruitment and that successful retrieval of contexts from item cues will elicit parahippocampal cortex recruitment. Retrieval of the bound representation in either case was expected to elicit hippocampal engagement. To test these predictions, we had participants study several item-context pairs (i.e., pictures of objects and scenes, respectively), and then had them attempt to recall items from associated context cues and contexts from associated item cues during a scanned retrieval session. Results based on both univariate and multivariate analyses confirmed a role for hippocampus in content-general relational memory retrieval, and a role for parahippocampal cortex in successful retrieval of contexts from item cues. However, we also found that activity differences in perirhinal cortex were correlated with successful cued recall for both items and contexts. These findings provide partial support for the above predictions and are discussed with respect to several models of medial temporal lobe function. Copyright © 2013 Elsevier Ltd. All rights reserved.
The Dutch-Flemish PROMIS Physical Function item bank exhibited strong psychometric properties in patients with chronic pain.

PubMed

Crins, Martine H P; Terwee, Caroline B; Klausch, Thomas; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis A; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Roorda, Leo D

2017-07-01

The objective of this study was to assess the psychometric properties of the Dutch-Flemish Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank in Dutch patients with chronic pain. A bank of 121 items was administered to 1,247 Dutch patients with chronic pain. Unidimensionality was assessed by fitting a one-factor confirmatory factor analysis and evaluating resulting fit statistics. Items were calibrated with the graded response model and its fit was evaluated. Cross-cultural validity was assessed by testing items for differential item functioning (DIF) based on language (Dutch vs. English). Construct validity was evaluated by calculation correlations between scores on the Dutch-Flemish PROMIS Physical Function measure and scores on generic and disease-specific measures. Results supported the Dutch-Flemish PROMIS Physical Function item bank's unidimensionality (Comparative Fit Index = 0.976, Tucker Lewis Index = 0.976) and model fit. Item thresholds targeted a wide range of physical function construct (threshold-parameters range: -4.2 to 5.6). Cross-cultural validity was good as four items only showed DIF for language and their impact on item scores was minimal. Physical Function scores were strongly associated with scores on all other measures (all correlations ≤ -0.60 as expected). The Dutch-Flemish PROMIS Physical Function item bank exhibited good psychometric properties. Development of a computer adaptive test based on the large bank is warranted. Copyright © 2017 Elsevier Inc. All rights reserved.
Item bank development, calibration and validation for patient-reported outcomes in female urinary incontinence

PubMed Central

Sung, Vivian W.; Griffith, James W.; Rogers, Rebecca G.; Raker, Christina A.; Clark, Melissa A.

2016-01-01

Purpose Current patient-reported outcomes for female urinary incontinence (UI) are limited by their inability to be tailored. Our objective is to describe the development and field-testing of 7 item banks designed to measure domains identified as important UI in females (UIf). We also describe the calibration and validation properties of the UIf-item banks, which allow for more efficient computerized-adaptive testing (CAT) in the future. METHODS The UIf-measures included 168 items covering 7 domains: Stress UI (SUI), Overactive Bladder (OAB), Urinary Frequency, Physical, Social and Emotional Health Impact, and Adaptation. Items underwent rigorous qualitative development and psychometric testing across 2 sites. Items were calibrated using item response theory and evaluated for internal consistency, construct validity and responsiveness. RESULTS 750 women (249 SUI, 249 OAB, and 252 mixed UI) participated. Mean age was 55±14 years ,23% were Hispanic, 80% white. In addition to face and content validity, the measures demonstrated good internal consistency (coefficient alpha 0.92-0.98) and unidimensionality. There was evidence for construct validity with moderate to strong correlations with the UDI (r’s ≥ 0.6) and IIQ (r’s = ≥ 0.6) scales. The measures were responsive to change for SUI treatment (paired t-test p <.001, ES range=1.3 to 2.9; SRM range=1.3 to 2.5) and OAB treatment (paired t-test p <.05 for all domains except Social Health Impact and Adaptation, ES range=.3 to 1.5, SRM range=0.4 to 1.0). The measures were responsive based on concurrent changes with the UDI and IIQ (p < 0.05). CAT versions were developed and pilot tested. CONCLUSIONS The UIf-item banks demonstrate good psychometric characteristics and are a sufficiently valid set of customizable tools for measuring UI symptoms and life impact. PMID:26732514
Identification and Development of Items Comprising Organizational Citizenship Behaviors Among Pharmacy Faculty

PubMed Central

Semsick, Gretchen R.

2016-01-01

Objective. Identify behaviors that can compose a measure of organizational citizenship by pharmacy faculty. Methods. A four-round, modified Delphi procedure using open-ended questions (Round 1) was conducted with 13 panelists from pharmacy academia. The items generated were evaluated and refined for inclusion in subsequent rounds. A consensus was reached after completing four rounds. Results. The panel produced a set of 26 items indicative of extra-role behaviors by faculty colleagues considered to compose a measure of citizenship, which is an expressed manifestation of collegiality. Conclusions. The items generated require testing for validation and reliability in a large sample to create a measure of organizational citizenship. Even prior to doing so, the list of items can serve as a resource for mentorship of junior and senior faculty alike. PMID:28179717
Advancing the efficiency and efficacy of patient reported outcomes with multivariate computer adaptive testing.

PubMed

Morris, Scott; Bass, Mike; Lee, Mirinae; Neapolitan, Richard E

2017-09-01

The Patient Reported Outcomes Measurement Information System (PROMIS) initiative developed an array of patient reported outcome (PRO) measures. To reduce the number of questions administered, PROMIS utilizes unidimensional item response theory and unidimensional computer adaptive testing (UCAT), which means a separate set of questions is administered for each measured trait. Multidimensional item response theory (MIRT) and multidimensional computer adaptive testing (MCAT) simultaneously assess correlated traits. The objective was to investigate the extent to which MCAT reduces patient burden relative to UCAT in the case of PROs. One MIRT and 3 unidimensional item response theory models were developed using the related traits anxiety, depression, and anger. Using these models, MCAT and UCAT performance was compared with simulated individuals. Surprisingly, the root mean squared error for both methods increased with the number of items. These results were driven by large errors for individuals with low trait levels. A second analysis focused on individuals aligned with item content. For these individuals, both MCAT and UCAT accuracies improved with additional items. Furthermore, MCAT reduced the test length by 50%. For the PROMIS Emotional Distress banks, neither UCAT nor MCAT provided accurate estimates for individuals at low trait levels. Because the items in these banks were designed to detect clinical levels of distress, there is little information for individuals with low trait values. However, trait estimates for individuals targeted by the banks were accurate and MCAT asked substantially fewer questions. By reducing the number of items administered, MCAT can allow clinicians and researchers to assess a wider range of PROs with less patient burden. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com

How "implicit" are implicit color effects in memory?

PubMed

Zimmer, Hubert D; Steiner, Astrid; Ecker, Ullrich K H

2002-01-01

Processing colored pictures of objects results in a preference to choose the former color for a specific object in a subsequent color choice test (Wippich & Mecklenbräuker, 1998). We tested whether this implicit memory effect is independent of performances in episodic color recollection (recognition). In the study phase of Experiment 1, the color of line drawings was either named or its appropriateness was judged. We found only weak implicit memory effects for categorical color information. In Experiment 2, silhouettes were colored by subjects during the study phase. Performances in both the implicit and the explicit test were good. Selections of "old" colors in the implicit test, though, were almost completely confined to items for which the color was also remembered explicitly. In Experiment 3, we applied the opposition technique in order to check whether we could find any implicit effects regarding items for which no explicit color recollection was possible. This was not the case. We therefore draw the conclusion that implicit color preference effects are not independent of explicit recollection, and that they are probably based on the same episodic memory traces that are used in explicit tests.
Development and Psychometric Evaluation of a Health-Related Quality of Life Instrument for Individuals with Adult-Onset Hearing Loss

PubMed Central

Stika, Carren J.; Hays, Ron D.

2016-01-01

Objective Self-reports of “hearing handicap” are available, but a comprehensive measure of health-related quality of life (HRQOL) for individuals with adult-onset hearing loss (AOHL) does not exist. Our objective was to develop and evaluate a multidimensional HRQOL instrument for individuals with AOHL. Design The Impact of Hearing Loss Inventory Tool (IHEAR-IT) was developed using results of focus groups, a literature review, Advisory Expert Panel input, and cognitive interviews. Study Sample The 73-item field-test instrument was completed by 409 adults (22-91 years old) with varying degrees of AOHL and from different areas of the US. Results Multitrait scaling analysis supported four multi-item scales and five individual items. Internal consistency reliabilities ranged from 0.93 to 0.96 for the scales. Construct validity was supported by correlations between the IHEAR-IT scales and scores on the 36-Item Short Form Health Survey, Version 2.0 (SF-36v2) Mental Composite Summary (r’s = 0.32 – 0.64) and the Hearing Handicap Inventory for the Elderly/Adults (HHIE/HHIA) (r’s > −0.70). Conclusions The field test provide initial support for the reliability and construct validity of the IHEAR-IT for evaluating HRQOL of individuals with AOHL. Further research is needed to evaluate the responsiveness to change of the IHEAR-IT scales and identify items for a short-form. PMID:27104754
Personalized objects can optimize the diagnosis of EMCS in the assessment of functional object use in the CRS-R: a double blind, randomized clinical trial.

PubMed

Sun, Yuxiao; Wang, Jianan; Heine, Lizette; Huang, Wangshan; Wang, Jing; Hu, Nantu; Hu, Xiaohua; Fang, Xiaohui; Huang, Supeng; Laureys, Steven; Di, Haibo

2018-04-12

Behavioral assessment has been acted as the gold standard for the diagnosis of disorders of consciousness (DOC) patients. The item "Functional Object Use" in the motor function sub-scale in the Coma Recovery Scale-Revised (CRS-R) is a key item in differentiating between minimally conscious state (MCS) and emergence from MCS (EMCS). However, previous studies suggested that certain specific stimuli, especially something self-relevant can affect DOC patients' scores of behavioral assessment scale. So, we attempted to find out if personalized objects can improve the diagnosis of EMCS in the assessment of Functional Object Use by comparing the use of patients' favorite objects and other common objects in MCS patients. Twenty-one post-comatose patients diagnosed as MCS were prospectively included. The item "Functional Object Use" was assessed by using personalized objects (e.g., cigarette, paper) and non-personalized objects, which were presented in a random order. The rest assessments were performed following the standard protocol of the CRS-R. The differences between functional uses of the two types of objects were analyzed by the McNemar test. The incidence of Functional Object Use was significantly higher using personalized objects than non-personalized objects in the CRS-R. Five out of the 21 MCS studied patients, who were assessed with non-personalized objects, were re-diagnosed as EMCS with personalized objects (χ 2 = 5, df = 1, p < 0.05). Personalized objects employed here seem to be more effective to elicit patients' responses as compared to non-personalized objects during the assessment of Functional Object Use in DOC patients. Clinical Trials.gov: NCT02988206 ; Date of registration: 2016/12/12.
Evaluation: Test Construction and Use. An Instructional Model for Undergraduate Teacher Education in the RAFT Program at Mississippi State University.

ERIC Educational Resources Information Center

Handley, Herbert M., Ed.

This module developed by the Research Applications for Teaching (RAFT) project assists the preservice teacher in constructing test items to better measure the outcomes of instructional objectives. Student teachers are also assisted in the interpretation of results of a student's performance on a standardized test. Students also…
Practical Procedures for Constructing Mastery Tests to Minimize Errors of Classification and to Maximize or Optimize Decision Reliability.

ERIC Educational Resources Information Center

Byars, Alvin Gregg

The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…
78 FR 21413 - Notice of Intent To Repatriate Cultural Items: The Field Museum of Natural History, Chicago, IL

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-10

... cultural items listed in this notice meet the definition of sacred objects and objects of cultural... Natural History, Chicago, IL, that meet the definition of sacred objects and objects of cultural patrimony... items have been identified as Native American sacred objects and objects of cultural patrimony through...
Development and validation of the Cancer Exercise Stereotypes Scale.

PubMed

Falzon, Charlène; Sabiston, Catherine; Bergamaschi, Alessandro; Corrion, Karine; Chalabaev, Aïna; D'Arripe-Longueville, Fabienne

2014-01-01

The objective of this study was to develop and validate a French-language questionnaire measuring stereotypes related to exercise in cancer patients: The Cancer Exercise Stereotypes Scale (CESS). Four successive steps were carried out with 806 participants. First, a preliminary version was developed on the basis of the relevant literature and qualitative interviews. A test of clarity then led to the reformulation of six of the 30 items. Second, based on the modification indices of the first confirmatory factorial analysis, 11 of the 30 initial items were deleted. A new factorial structure analysis showed a good fit and validated a 19-item instrument with five subscales. Third, the stability of the instrument was tested over time. Last, tests of construct validity were conducted to examine convergent validity and discriminant validity. The French-language CESS appears to have good psychometric qualities and can be used to test theoretical tenets and inform intervention strategies on ways to foster exercise in cancer patients.
Testing for lead in toys at day care centers.

PubMed

Sanders, Martha; Stolz, Julie; Chacon-Baker, Ashley

2013-01-01

Exposure to lead-based paint or material has been found to impact children's cognitive and behavioral development at blood lead levels far below current standards. The purpose of the project was to screen for lead in toy items in daycare centers in order to raise awareness of inside environmental lead exposures and minimize lead-based exposures for children. Occupational therapy students in a service learning class tested for lead in ten daycare or public centers using the XRF Thermo Scientific Niton XL3t, a method accepted by the Consumer Product Safety Commission (CPSC). A total of 460 items were tested over a two-month period for an average of 66 toys per setting. Fifty six (56) items tested > 100 ppm, which represented 12% of the entire sample. Items with high lead levels included selected toys constructed with lead-based paint, lead metals, plastics using lead as a color enhancer, and decorative objects. While the actual number of lead-based products is small, the cumulative exposure or habitual use may pose an unnecessary risk to children. Indoor exposures occurred for all day care centers regardless of socio-economic levels. Recommendations to minimize exposures are provided.
Semantic representation in the white matter pathway

PubMed Central

Fang, Yuxing; Wang, Xiaosha; Zhong, Suyu; Song, Luping; Han, Zaizhu; Gong, Gaolang

2018-01-01

Object conceptual processing has been localized to distributed cortical regions that represent specific attributes. A challenging question is how object semantic space is formed. We tested a novel framework of representing semantic space in the pattern of white matter (WM) connections by extending the representational similarity analysis (RSA) to structural lesion pattern and behavioral data in 80 brain-damaged patients. For each WM connection, a neural representational dissimilarity matrix (RDM) was computed by first building machine-learning models with the voxel-wise WM lesion patterns as features to predict naming performance of a particular item and then computing the correlation between the predicted naming score and the actual naming score of another item in the testing patients. This correlation was used to build the neural RDM based on the assumption that if the connection pattern contains certain aspects of information shared by the naming processes of these two items, models trained with one item should also predict naming accuracy of the other. Correlating the neural RDM with various cognitive RDMs revealed that neural patterns in several WM connections that connect left occipital/middle temporal regions and anterior temporal regions associated with the object semantic space. Such associations were not attributable to modality-specific attributes (shape, manipulation, color, and motion), to peripheral picture-naming processes (picture visual similarity, phonological similarity), to broad semantic categories, or to the properties of the cortical regions that they connected, which tended to represent multiple modality-specific attributes. That is, the semantic space could be represented through WM connection patterns across cortical regions representing modality-specific attributes. PMID:29624578
Development and psychometric testing of the Canine Owner-Reported Quality of Life questionnaire, an instrument designed to measure quality of life in dogs with cancer.

PubMed

Giuffrida, Michelle A; Brown, Dorothy Cimino; Ellenberg, Susan S; Farrar, John T

2018-05-01

OBJECTIVE To describe development and initial psychometric testing of an owner-reported questionnaire designed to standardize measurement of general quality of life (QOL) in dogs with cancer. DESIGN Key-informant interviews, questionnaire development, and field trial. SAMPLE Owners of 25 dogs with cancer for item development and pretesting and owners of 90 dogs with cancer for reliability and validity testing. PROCEDURES Standard methods for development and testing of questionnaire instruments intended to measure subjective states were used. Items were generated, selected, scaled, and pretested for content, meaning, and readability. Response items were evaluated with exploratory factor analysis and by assessing internal consistency (Cronbach α) and convergence with global QOL as determined with a visual analog scale. Preliminary tests of stability and responsiveness were performed. RESULTS The final questionnaire-which was named the Canine Owner-Reported Quality of Life (CORQ) questionnaire-contained 17 items related to observable behaviors commonly used by owners to evaluate QOL in their dogs. Several items pertaining to physical symptoms performed poorly and were omitted. The 17 items were assigned to 4 factors-vitality, companionship, pain, and mobility-on the basis of the items they contained. The CORQ questionnaire and its factors had high internal consistency (Cronbach α = 0.68 to 0.90) and moderate to strong correlations (r = 0.49 to 0.71) with global QOL as measured on a visual analog scale. Preliminary testing indicated good test-retest reliability and responsiveness to improvements in overall QOL. CONCLUSIONS AND CLINICAL RELEVANCE The CORQ questionnaire was a valid, reliable owner-reported questionnaire that measured general QOL in dogs with cancer and showed promise as a clinical trial outcome measure for quantifying changes in individual dog QOL occurring in response to cancer treatment and progression.
Objective Structured Clinical Examination as an Assessment Tool for Clinical Skills in Dermatology.

PubMed

Saceda-Corralo, D; Fonda-Pascual, P; Moreno-Arrones, Ó M; Alegre-Sánchez, A; Hermosa-Gelbard, Á; Jiménez-Gómez, N; Vañó-Galván, S; Jaén-Olasolo, P

2017-04-01

Objective Structured Clinical Evaluation (OSCE) is an excellent method to evaluate student's abilities, but there are no previous reports implementing it in dermatology. To determine the feasibility of implementation of a dermatology OSCE in the medical school. Five stations with standardized patients and image-based assessment were designed. A specific checklist was elaborated in each station with different items which evaluated one competency and were classified into five groups (medical history, physical examination, technical skills, case management and prevention). A total of 28 students were tested. Twenty-five of them (83.3%) passed the exam globally. Concerning each group of items tested: medical interrogation had a mean score of 71.0; physical examination had a mean score of 63.0; management had a mean score of 58.0; and prevention had a mean score of 58.0 points. The highest results were obtained in interpersonal skills items with 91.8 points. Testing a small sample of voluntary students may hinder generalization of our study. OSCE is an useful tool for assessing clinical skills in dermatology and it is possible to carry it out. Our experience enhances that medical school curriculum needs to establish OSCE as an assessment tool in dermatology. Copyright © 2016 AEDV. Publicado por Elsevier España, S.L.U. All rights reserved.
Judgment: Analyzing Fallacies and Weaknesses in Arguments: Grades 7-12.

ERIC Educational Resources Information Center

Instructional Objectives Exchange, Los Angeles, CA.

Objectives, with sample test items and explanations of answers are presented for instruction in judgment and logic in analyzing fallacies and weaknesses in arguments. This type of material is not usually taught in pre-college curricula, but has been geared for the secondary grades. Each fallacy is explained after the stated objective, and answers…
The Prevalence of Olfactory Dysfunction in Chronic Rhinosinusitis

PubMed Central

Kohli, Preeti; Naik, Akash N.; Harruff, E. Emily; Nguyen, Shaun A.; Schlosser, Rodney J.; Soler, Zachary M.

2016-01-01

Objective Many studies have reported that olfactory dysfunction frequently occurs in chronic rhinosinusitis (CRS) populations; however, the prevalence and degree of olfactory loss has not been systematically studied. The aims of this study are to use combined data to report the prevalence of olfactory dysfunction and to calculate weighted averages of olfactory test scores in CRS patients. Data Sources A search was conducted in PubMed and Scopus, following the methods of Preferred Reporting Items for Systematic Review and Meta-Analysis guidelines. Review Methods Studies reporting the prevalence of olfactory dysfunction using objective measures or olfactory test scores using validated scales were included. Results A total of 47 articles were included in systematic review and 35 in the pooled data analysis. The prevalence of olfactory dysfunction in chronic rhinosinusitis was found to be 30.0% using the Brief Smell Identification Test, 67.0% using the 40-item Smell Identification Test, and 78.2% using the total Sniffin’ Sticks score. Weighted averages ± standard deviation of olfactory test scores were 25.96±7.11 using the 40-item Smell Identification Test, 8.60±2.81 using the Brief Smell Identification Test, 21.96±8.88 using total Sniffin’ sticks score, 5.65±1.51 using Sniffin’ Sticks threshold, 9.21±4.63 using Sniffin’ Sticks discrimination, 9.47±3.92 using Sniffin’ Sticks Identification, and 8.90±5.14 using the questionnaire for olfactory disorders-negative statements. Conclusion In chronic rhinosinusitis populations, a significant percentage of patients experience olfactory dysfunction and mean olfactory scores are within the dysosmic range. PMID:27873345
Associative memory in aging: the effect of unitization on source memory.

PubMed

Bastin, Christine; Diana, Rachel A; Simon, Jessica; Collette, Fabienne; Yonelinas, Andrew P; Salmon, Eric

2013-03-01

In normal aging, memory for associations declines more than memory for individual items. Unitization is an encoding process defined by creation of a new single entity to represent a new arbitrary association. The current study tested the hypothesis that age-related differences in associative memory can be reduced by encoding instructions that promote unitization. In two experiments, groups of 20 young and 20 older participants learned new associations between a word and a background color under two conditions. In the item detail condition, they had to imagine that the item is the same color as the background-an instruction promoting unitization of the associations. In the context detail condition, which did not promote unitization, they had to imagine that the item interacted with another colored object. At test, they had to retrieve the color that was associated with each word (source memory). In both experiments, the results showed an age-related decrement in source memory performance in the context detail but not in the item detail condition. Moreover, Experiment 2 examined receiver operating characteristics in older participants and indicated that familiarity contributed more to source memory performance in the item detail than in the context detail condition. These findings suggest that unitization of new associations can overcome the associative memory deficit observed in aging, at least for item-color associations.
Analogical reasoning in amazons.

PubMed

Obozova, Tanya; Smirnova, Anna; Zorina, Zoya; Wasserman, Edward

2015-11-01

Two juvenile orange-winged amazons (Amazona amazonica) were initially trained to match visual stimuli by color, shape, and number of items, but not by size. After learning these three identity matching-to-sample tasks, the parrots transferred discriminative responding to new stimuli from the same categories that had been used in training (other colors, shapes, and numbers of items) as well as to stimuli from a different category (stimuli varying in size). In the critical testing phase, both parrots exhibited reliable relational matching-to-sample (RMTS) behavior, suggesting that they perceived and compared the relationship between objects in the sample stimulus pair to the relationship between objects in the comparison stimulus pairs, even though no physical matches were possible between items in the sample and comparison pairs. The parrots spontaneously exhibited this higher-order relational responding without having ever before been trained on RMTS tasks, therefore joining apes and crows in displaying this abstract cognitive behavior.
The Influence of Similarity on Visual Working Memory Representations

PubMed Central

Lin, Po-Han; Luck, Steven J.

2007-01-01

In verbal memory, similarity between items in memory often leads to interference and impaired memory performance. The present study sought to determine whether analogous interference effects would be observed in visual working memory by varying the similarity of the to-be-remembered objects in a color change-detection task. Instead of leading to interference and impaired performance, increased similarity among the items being held in memory led to improved performance. Moreover, when two similar colors were presented along with one dissimilar color, memory performance was better for the similar colors than for the dissimilar color. Similarity produced better performance even when the objects were presented sequentially and even when memory for the first item in the sequence was tested. These findings show that similarity does not lead to interference between representations in visual working memory. Instead, similarity may lead to improved task performance, possibly due to increased stability or precision of the memory representations during maintenance. PMID:19430536
Multiple balance tests improve the assessment of postural stability in subjects with Parkinson's disease

PubMed Central

Jacobs, J V; Horak, F B; Tran, V K; Nutt, J G

2006-01-01

Objectives Clinicians often base the implementation of therapies on the presence of postural instability in subjects with Parkinson's disease (PD). These decisions are frequently based on the pull test from the Unified Parkinson's Disease Rating Scale (UPDRS). We sought to determine whether combining the pull test, the one‐leg stance test, the functional reach test, and UPDRS items 27–29 (arise from chair, posture, and gait) predicts balance confidence and falling better than any test alone. Methods The study included 67 subjects with PD. Subjects performed the one‐leg stance test, the functional reach test, and the UPDRS motor exam. Subjects also responded to the Activities‐specific Balance Confidence (ABC) scale and reported how many times they fell during the previous year. Regression models determined the combination of tests that optimally predicted mean ABC scores or categorised fall frequency. Results When all tests were included in a stepwise linear regression, only gait (UPDRS item 29), the pull test (UPDRS item 30), and the one‐leg stance test, in combination, represented significant predictor variables for mean ABC scores (r2 = 0.51). A multinomial logistic regression model including the one‐leg stance test and gait represented the model with the fewest significant predictor variables that correctly identified the most subjects as fallers or non‐fallers (85% of subjects were correctly identified). Conclusions Multiple balance tests (including the one‐leg stance test, and the gait and pull test items of the UPDRS) that assess different types of postural stress provide an optimal assessment of postural stability in subjects with PD. PMID:16484639
Constructing three emotion knowledge tests from the invariant measurement approach

PubMed Central

Prieto, Gerardo; Burin, Debora I.

2017-01-01

Background Psychological constructionist models like the Conceptual Act Theory (CAT) postulate that complex states such as emotions are composed of basic psychological ingredients that are more clearly respected by the brain than basic emotions. The objective of this study was the construction and initial validation of Emotion Knowledge measures from the CAT frame by means of an invariant measurement approach, the Rasch Model (RM). Psychological distance theory was used to inform item generation. Methods Three EK tests—emotion vocabulary (EV), close emotional situations (CES) and far emotional situations (FES)—were constructed and tested with the RM in a community sample of 100 females and 100 males (age range: 18–65), both separately and conjointly. Results It was corroborated that data-RM fit was sufficient. Then, the effect of type of test and emotion on Rasch-modelled item difficulty was tested. Significant effects of emotion on EK item difficulty were found, but the only statistically significant difference was that between “happiness” and the remaining emotions; neither type of test, nor interaction effects on EK item difficulty were statistically significant. The testing of gender differences was carried out after corroborating that differential item functioning (DIF) would not be a plausible alternative hypothesis for the results. No statistically significant sex-related differences were found out in EV, CES, FES, or total EK. However, the sign of d indicate that female participants were consistently better than male ones, a result that will be of interest for future meta-analyses. Discussion The three EK tests are ready to be used as components of a higher-level measurement process. PMID:28929013
How does aging affect the types of error made in a visual short-term memory ‘object-recall’ task?

PubMed Central

Sapkota, Raju P.; van der Linde, Ian; Pardhan, Shahina

2015-01-01

This study examines how normal aging affects the occurrence of different types of incorrect responses in a visual short-term memory (VSTM) object-recall task. Seventeen young (Mean = 23.3 years, SD = 3.76), and 17 normally aging older (Mean = 66.5 years, SD = 6.30) adults participated. Memory stimuli comprised two or four real world objects (the memory load) presented sequentially, each for 650 ms, at random locations on a computer screen. After a 1000 ms retention interval, a test display was presented, comprising an empty box at one of the previously presented two or four memory stimulus locations. Participants were asked to report the name of the object presented at the cued location. Errors rates wherein participants reported the names of objects that had been presented in the memory display but not at the cued location (non-target errors) vs. objects that had not been presented at all in the memory display (non-memory errors) were compared. Significant effects of aging, memory load and target recency on error type and absolute error rates were found. Non-target error rate was higher than non-memory error rate in both age groups, indicating that VSTM may have been more often than not populated with partial traces of previously presented items. At high memory load, non-memory error rate was higher in young participants (compared to older participants) when the memory target had been presented at the earliest temporal position. However, non-target error rates exhibited a reversed trend, i.e., greater error rates were found in older participants when the memory target had been presented at the two most recent temporal positions. Data are interpreted in terms of proactive interference (earlier examined non-target items interfering with more recent items), false memories (non-memory items which have a categorical relationship to presented items, interfering with memory targets), slot and flexible resource models, and spatial coding deficits. PMID:25653615
How does aging affect the types of error made in a visual short-term memory 'object-recall' task?

PubMed

Sapkota, Raju P; van der Linde, Ian; Pardhan, Shahina

2014-01-01

This study examines how normal aging affects the occurrence of different types of incorrect responses in a visual short-term memory (VSTM) object-recall task. Seventeen young (Mean = 23.3 years, SD = 3.76), and 17 normally aging older (Mean = 66.5 years, SD = 6.30) adults participated. Memory stimuli comprised two or four real world objects (the memory load) presented sequentially, each for 650 ms, at random locations on a computer screen. After a 1000 ms retention interval, a test display was presented, comprising an empty box at one of the previously presented two or four memory stimulus locations. Participants were asked to report the name of the object presented at the cued location. Errors rates wherein participants reported the names of objects that had been presented in the memory display but not at the cued location (non-target errors) vs. objects that had not been presented at all in the memory display (non-memory errors) were compared. Significant effects of aging, memory load and target recency on error type and absolute error rates were found. Non-target error rate was higher than non-memory error rate in both age groups, indicating that VSTM may have been more often than not populated with partial traces of previously presented items. At high memory load, non-memory error rate was higher in young participants (compared to older participants) when the memory target had been presented at the earliest temporal position. However, non-target error rates exhibited a reversed trend, i.e., greater error rates were found in older participants when the memory target had been presented at the two most recent temporal positions. Data are interpreted in terms of proactive interference (earlier examined non-target items interfering with more recent items), false memories (non-memory items which have a categorical relationship to presented items, interfering with memory targets), slot and flexible resource models, and spatial coding deficits.

Psychometrics of the preschooler physical activity parenting practices instrument among a Latino sample

PubMed Central

2014-01-01

Background Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Methods Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. Results The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach’s alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children’s objectively measured PA. Conclusion The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children’s PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children. PMID:24428935
Aging, culture, and memory for socially meaningful item-context associations: an East-West cross-cultural comparison study.

PubMed

Yang, Lixia; Li, Juan; Spaniol, Julia; Hasher, Lynn; Wilkinson, Andrea J; Yu, Jing; Niu, Yanan

2013-01-01

Research suggests that people in Eastern interdependent cultures process information more holistically and attend more to contextual information than do people in Western independent cultures. The current study examined the effects of culture and age on memory for socially meaningful item-context associations in 71 Canadians of Western European descent (35 young and 36 older) and 72 native Chinese citizens (36 young and 36 older). All participants completed two blocks of context memory tasks. During encoding, participants rated pictures of familiar objects. In one block, objects were rated either for their meaningfulness in the independent living context or their typicality in daily life. In the other block, objects were rated for their meaningfulness in the context of fostering relationships with others or for their typicality in daily life. The encoding in each block was followed by a recognition test in which participants identified pictures and their associated contexts. The results showed that Chinese outperformed Canadians in context memory, though both culture groups showed similar age-related deficits in item and context memory. The results suggest that Chinese are at an advantage in memory for socially meaningful item-context associations, an advantage that continues from young adulthood into old age.
Aging, Culture, and Memory for Socially Meaningful Item-Context Associations: An East-West Cross-Cultural Comparison Study

PubMed Central

Yang, Lixia; Li, Juan; Spaniol, Julia; Hasher, Lynn; Wilkinson, Andrea J.; Yu, Jing; Niu, Yanan

2013-01-01

Research suggests that people in Eastern interdependent cultures process information more holistically and attend more to contextual information than do people in Western independent cultures. The current study examined the effects of culture and age on memory for socially meaningful item-context associations in 71 Canadians of Western European descent (35 young and 36 older) and 72 native Chinese citizens (36 young and 36 older). All participants completed two blocks of context memory tasks. During encoding, participants rated pictures of familiar objects. In one block, objects were rated either for their meaningfulness in the independent living context or their typicality in daily life. In the other block, objects were rated for their meaningfulness in the context of fostering relationships with others or for their typicality in daily life. The encoding in each block was followed by a recognition test in which participants identified pictures and their associated contexts. The results showed that Chinese outperformed Canadians in context memory, though both culture groups showed similar age-related deficits in item and context memory. The results suggest that Chinese are at an advantage in memory for socially meaningful item-context associations, an advantage that continues from young adulthood into old age. PMID:23593288
Memory Performance for Everyday Motivational and Neutral Objects Is Dissociable from Attention

PubMed Central

Schomaker, Judith; Wittmann, Bianca C.

2017-01-01

Episodic memory is typically better for items coupled with monetary reward or punishment during encoding. It is yet unclear whether memory is also enhanced for everyday objects with appetitive or aversive values learned through a lifetime of experience, and to what extent episodic memory enhancement for motivational and neutral items is attributable to attention. In a first experiment, we investigated attention to everyday motivational objects using eye-tracking during free-viewing and subsequently tested episodic memory using a remember/know procedure. Attention was directed more to aversive stimuli, as evidenced by longer viewing durations, whereas recollection was higher for both appetitive and aversive objects. In the second experiment, we manipulated the visual contrast of neutral objects through changes of contrast to further dissociate attention and memory encoding. While objects presented with high visual contrast were looked at longer, recollection was best for objects presented in unmodified, medium contrast. Generalized logistic mixed models on recollection performance showed that attention as measured by eye movements did not enhance subsequent memory, while motivational value (Experiment 1) and visual contrast (Experiment 2) had quadratic effects in opposite directions. Our findings suggest that an enhancement of incidental memory encoding for appetitive items can occur without an increase in attention and, vice versa, that enhanced attention towards salient neutral objects is not necessarily associated with memory improvement. Together, our results provide evidence for a double dissociation of attention and memory effects under certain conditions. PMID:28694774
The GRACE checklist for rating the quality of observational studies of comparative effectiveness: a tale of hope and caution.

PubMed

Dreyer, Nancy A; Velentgas, Priscilla; Westrich, Kimberly; Dubois, Robert

2014-03-01

While there is growing demand for information about comparative effectiveness (CE), there is substantial debate about whether and when observational studies have sufficient quality to support decision making. To develop and test an item checklist that can be used to qualify those observational CE studies sufficiently rigorous in design and execution to contribute meaningfully to the evidence base for decision support. An 11-item checklist about data and methods (the GRACE checklist) was developed through literature review and consultation with experts from professional societies, payer groups, the private sector, and academia. Since no single gold standard exists for validation, checklist item responses were compared with 3 different types of external quality ratings (N=88 articles). The articles compared treatment effectiveness and/or safety of drugs, medical devices, and medical procedures. We validated checklist item responses 3 ways against external quality ratings, using published articles of observational CE or safety studies: (a) Systematic Review-quality assessment from a published systematic review; (b) Single Expert Review-quality assessment made according to the solicited "expert opinion" of a senior researcher; and (c) Concordant Expert Review-quality assessments from 2 experts for which there was concordance. Volunteers (N=113) from 5 continents completed 280 article assessments using the checklist. Positive and negative predictive values (PPV, NPV, respectively) of individual items were estimated to compare testers' assessments with those of experts. Taken as a whole, the scale had better NPV than PPV, for both data and methods. The most consistent predictor of quality relates to the validity of the primary outcomes measurement for the study purpose. Other consistent markers of quality relate to using concurrent comparators, minimizing the effects of bias by prudent choice of covariates, and using sensitivity analysis to test robustness of results. Concordance of expert opinion on the quality of the rated articles was 52%; most checklist items performed better. The 11-item GRACE checklist provides guidance to help determine which observational studies of CE have used strong scientific methods and good data that are fit for purpose and merit consideration for decision making. The checklist contains a parsimonious set of elements that can be objectively assessed in published studies, and user testing shows that it can be successfully applied to studies of drugs, medical devices, and clinical and surgical interventions. Although no scoring is provided, study reports that rate relatively well across checklist items merit in-depth examination to understand applicability, effect size, and likelihood of residual bias. The current testing and validation efforts did not achieve clear discrimination between studies fit for purpose and those not, but we have identified a critical, though remediable, limitation in our approach. Not specifying a specific granular decision for evaluation, or not identifying a single study objective in reports that included more than one, left reviewers with too broad an assessment challenge. We believe that future efforts will be more successful if reviewers are asked to focus on a specific objective or question. Despite the challenges encountered in this testing, an agreed upon set of assessment elements, checklists, or score cards is critical for the maturation of this field. Substantial resources will be expended on studies of real-world effectiveness, and if the rigor of these observational assessments cannot be assessed, then the impact of the studies will be suboptimal. Similarly, agreement on key elements of quality will ensure that budgets are appropriately directed toward those elements. Given the importance of this task and the lessons learned from these extensive efforts at validation and user testing, we are optimistic about the potential for improved assessments that can be used for diverse situations by people with a wide range of experience and training. Future testing would benefit by directing reviewers to address a single, granular research question, which would avoid problems that arose by using the checklist to evaluate multiple objectives, by using other types of validation test sets, and by employing further multivariate analysis to see if any combination or sequence of item responses has particularly high predictive validity.
Forgotten but not gone: savings for pictures and words in long-term memory.

PubMed

MacLeod, C M

1988-04-01

Five experiments examined the relearning of words, simple line-drawing pictures, and complex photographic pictures after retention intervals of 1 to 10 weeks. For those items that were neither recalled nor recognized, the identical item was relearned better than an unrelated control item, as measured by a recall test following relearning. This relearning advantage in recall held for all three classes of material and extended to the cross-modality case (i.e., picture-word and word-picture) and the same-referent case (i.e., two pictures of the same object). However, recognition tests of relearning failed to detect this same relearning advantage for apparently forgotten items. Taken together, these findings conflict with the existing account of savings. Most fundamental, the classic argument that relearning serves a trace-strengthening function is undetermined by the observed recall-recognition contrast. An alternative explanation of savings is suggested wherein relearning assists retrieval of information, thereby affecting recall in particular.
[Development of an Atypical Response Scale.

ERIC Educational Resources Information Center

Mendelsohn, Mark; Linden, James

The development of an objective diagnostic scale to measure atypical behavior is discussed. The Atypical Response Scale (ARS) is a structured projective test consisting of 17 items, each weighted 1, 2, or 3, that were tested for convergence and reliability. ARS may be individually or group administered in 10-15 minutes; hand scoring requires 90…
15 CFR Appendix A to Part 946 - National Weather Service Modernization Criteria

Code of Federal Regulations, 2010 CFR

2010-01-01

... specifically in Addendum I, Appendix D of the ASOS Site Component Commissioning Evaluation Package (the ASOS Package). Criteria: a. ASOS Acceptance Test: The site component acceptance test, which includes objective..., has been successfully completed in accordance with item 1a, p. D-2 of Appendix D of the ASOS Package...
15 CFR Appendix A to Part 946 - National Weather Service Modernization Criteria

Code of Federal Regulations, 2013 CFR

2013-01-01

... specifically in Addendum I, Appendix D of the ASOS Site Component Commissioning Evaluation Package (the ASOS Package). Criteria: a. ASOS Acceptance Test: The site component acceptance test, which includes objective..., has been successfully completed in accordance with item 1a, p. D-2 of Appendix D of the ASOS Package...
15 CFR Appendix A to Part 946 - National Weather Service Modernization Criteria

Code of Federal Regulations, 2012 CFR

2012-01-01

... specifically in Addendum I, Appendix D of the ASOS Site Component Commissioning Evaluation Package (the ASOS Package). Criteria: a. ASOS Acceptance Test: The site component acceptance test, which includes objective..., has been successfully completed in accordance with item 1a, p. D-2 of Appendix D of the ASOS Package...
15 CFR Appendix A to Part 946 - National Weather Service Modernization Criteria

Code of Federal Regulations, 2011 CFR

2011-01-01

... specifically in Addendum I, Appendix D of the ASOS Site Component Commissioning Evaluation Package (the ASOS Package). Criteria: a. ASOS Acceptance Test: The site component acceptance test, which includes objective..., has been successfully completed in accordance with item 1a, p. D-2 of Appendix D of the ASOS Package...
Reliability and Validity Testing of the Physical Resilience Measure

ERIC Educational Resources Information Center

Resnick, Barbara; Galik, Elizabeth; Dorsey, Susan; Scheve, Ann; Gutkin, Susan

2011-01-01

Objective: The purpose of this study was to test reliability and validity of the Physical Resilience Scale. Methods: A single-group repeated measure design was used and 130 older adults from three different housing sites participated. Participants completed the Physical Resilience Scale, Hardy-Gill Resilience Scale, 14-item Resilience Scale,…
Remembered but Unused: The Accessory Items in Working Memory that Do Not Guide Attention

ERIC Educational Resources Information Center

Peters, Judith C.; Goebel, Rainer; Roelfsema, Pieter R.

2009-01-01

If we search for an item, a representation of this item in our working memory guides attention to matching items in the visual scene. We can hold multiple items in working memory. Do all these items guide attention in parallel? We asked participants to detect a target object in a stream of objects while they maintained a second item in memory for…
Find the Hidden Object. Understanding Play in Psychological Assessments.

PubMed

Fasulo, Alessandra; Shukla, Janhavi; Bennett, Stephanie

2017-01-01

Standardized psychological assessments are extensively used by practitioners to determine rate and level of development in different domains of ability in both typical and atypical children. The younger the children, the more likely the trials will resemble play activities. However, mode of administration, timing and use of objects involved are constrained. The purpose of this study is to explore what kind of play is play in psychological assessments, what are the expectations about children's performance and what are the abilities supporting the test activities. Conversation Analysis (CA) was applied to the videorecording of an interaction between a child and a practitioner during the administration of the Bayley Scale of Infant and Toddler Development, III edition. The analysis focuses on a 2'07″ long sequence relative to the administration of the test item "Find the hidden object" to a 23 months old child with Down syndrome. The analysis of the sequence shows that the assessor promotes the child's engagement by couching the actions required to administer the item in utterances with marked child-directed features. The analysis also shows that the objects constituting the test item did not suggest to the child a unique course of action, leading to the assessor's modeling of the successful sequence. We argue that when a play frame is activated by an interactional partner, the relational aspect of the activity is foregrounded and the co-player becomes a source of cues for ways in which playing can develop. We discuss the assessment interaction as orienting the child toward a right-or-wrong interpretation, leaving the realm of play, which is inherently exploratory and inventive, to enter that of instructional activities. Finally, we argue that the sequential analysis of the interaction and of the mutual sense-making procedures that partners put in place during the administration of an assessment could be used in the design and evaluation of tests for a finer understanding of the abilities involved.
76 FR 80391 - Notice of Intent to Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-12-23

... cultural items meet the definition of sacred objects and repatriation to the lineal descendant stated below... descendants of the individual who owned these sacred objects and who wish to claim the items should contact... descendants of the individual who owned these sacred objects and who wish to claim the items should contact...
76 FR 80390 - Notice of Intent To Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-12-23

... cultural items meet the definition of sacred objects and repatriation to the lineal descendant stated below... descendants of the individual who owned these sacred objects and who wish to claim the items should contact... descendants of the individual who owned these sacred objects who wish to claim the items should contact Little...
Cognitive testing of tobacco use items for administration to patients with cancer and cancer survivors in clinical research.

PubMed

Land, Stephanie R; Warren, Graham W; Crafts, Jennifer L; Hatsukami, Dorothy K; Ostroff, Jamie S; Willis, Gordon B; Chollette, Veronica Y; Mitchell, Sandra A; Folz, Jasmine N M; Gulley, James L; Szabo, Eva; Brandon, Thomas H; Duffy, Sonia A; Toll, Benjamin A

2016-06-01

To the authors' knowledge, there are currently no standardized measures of tobacco use and secondhand smoke exposure in patients diagnosed with cancer, and this gap hinders the conduct of studies examining the impact of tobacco on cancer treatment outcomes. The objective of the current study was to evaluate and refine questionnaire items proposed by an expert task force to assess tobacco use. Trained interviewers conducted cognitive testing with cancer patients aged ≥21 years with a history of tobacco use and a cancer diagnosis of any stage and organ site who were recruited at the National Institutes of Health Clinical Center in Bethesda, Maryland. Iterative rounds of testing and item modification were conducted to identify and resolve cognitive issues (comprehension, memory retrieval, decision/judgment, and response mapping) and instrument navigation issues until no items warranted further significant modification. Thirty participants (6 current cigarette smokers, 1 current cigar smoker, and 23 former cigarette smokers) were enrolled from September 2014 to February 2015. The majority of items functioned well. However, qualitative testing identified wording ambiguities related to cancer diagnosis and treatment trajectory, such as "treatment" and "surgery"; difficulties with lifetime recall; errors in estimating quantities; and difficulties with instrument navigation. Revisions to item wording, format, order, response options, and instructions resulted in a questionnaire that demonstrated navigational ease as well as good question comprehension and response accuracy. The Cancer Patient Tobacco Use Questionnaire (C-TUQ) can be used as a standardized item set to accelerate the investigation of tobacco use in the cancer setting. Cancer 2016;122:1728-34. © 2016 American Cancer Society. © 2016 American Cancer Society.
The dialysis orders objective structured clinical examination (OSCE): a formative assessment for nephrology fellows.

PubMed

Prince, Lisa K; Campbell, Ruth C; Gao, Sam W; Kendrick, Jessica; Lebrun, Christopher J; Little, Dustin J; Mahoney, David L; Maursetter, Laura A; Nee, Robert; Saddler, Mark; Watson, Maura A; Yuan, Christina M

2018-04-01

Few quantitative nephrology-specific simulations assess fellow competency. We describe the development and initial validation of a formative objective structured clinical examination (OSCE) assessing fellow competence in ordering acute dialysis. The three test scenarios were acute continuous renal replacement therapy, chronic dialysis initiation in moderate uremia and acute dialysis in end-stage renal disease-associated hyperkalemia. The test committee included five academic nephrologists and four clinically practicing nephrologists outside of academia. There were 49 test items (58 points). A passing score was 46/58 points. No item had median relevance less than 'important'. The content validity index was 0.91. Ninety-five percent of positive-point items were easy-medium difficulty. Preliminary validation was by 10 board-certified volunteers, not test committee members, a median of 3.5 years from graduation. The mean score was 49 [95% confidence interval (CI) 46-51], κ = 0.68 (95% CI 0.59-0.77), Cronbach's α = 0.84. We subsequently administered the test to 25 fellows. The mean score was 44 (95% CI 43-45); 36% passed the test. Fellows scored significantly less than validators (P < 0.001). Of evidence-based questions, 72% were answered correctly by validators and 54% by fellows (P = 0.018). Fellows and validators scored least well on the acute hyperkalemia question. In self-assessing proficiency, 71% of fellows surveyed agreed or strongly agreed that the OSCE was useful. The OSCE may be used to formatively assess fellow proficiency in three common areas of acute dialysis practice. Further validation studies are in progress.
Differential Item Functioning in Primary Healthcare Evaluation Instruments by French/English Version, Educational Level and Urban/Rural Location

PubMed Central

Haggerty, Jeannie L.; Bouharaoui, Fatima; Santor, Darcy A.

2011-01-01

Evaluating the extent to which groups or subgroups of individuals differ with respect to primary healthcare experience depends on first ruling out the possibility of bias. Objective: To determine whether item or subscale performance differs systematically between French/English, high/low education subgroups and urban/rural residency. Method: A sample of 645 adult users balanced by French/English language (in Quebec and Nova Scotia, respectively), high/low education and urban/rural residency responded to six validated instruments: the Primary Care Assessment Survey (PCAS); the Primary Care Assessment Tool – Short Form (PCAT-S); the Components of Primary Care Index (CPCI); the first version of the EUROPEP (EUROPEP-I); the Interpersonal Processes of Care Survey, version II (IPC-II); and part of the Veterans Affairs National Outpatient Customer Satisfaction Survey (VANOCSS). We normalized subscale scores to a 0-to-10 scale and tested for between-group differences using ANOVA tests. We used a parametric item response model to test for differences between subgroups in item discriminability and item difficulty. We re-examined group differences after removing items with differential item functioning. Results: Experience of care was assessed more positively in the English-speaking (Nova Scotia) than in the French-speaking (Quebec) respondents. We found differential English/French item functioning in 48% of the 153 items: discriminability in 20% and differential difficulty in 28%. English items were more discriminating generally than the French. Removing problematic items did not change the differences in French/English assessments. Differential item functioning by high/low education status affected 27% of items, with items being generally more discriminating in high-education groups. Between-group comparisons were unchanged. In contrast, only 9% of items showed differential item functioning by geography, affecting principally the accessibility attribute. Removing problematic items reversed a previously non-significant finding, revealing poorer first-contact access in rural than in urban areas. Conclusion: Differential item functioning does not bias or invalidate French/English comparisons on subscales, but additional development is required to make French and English items equivalent. These instruments are relatively robust by educational status and geography, but results suggest potential differences in the underlying construct in low-education and rural respondents. PMID:23205035
Greater loss of object than spatial mnemonic discrimination in aged adults.

PubMed

Reagh, Zachariah M; Ho, Huy D; Leal, Stephanie L; Noche, Jessica A; Chun, Amanda; Murray, Elizabeth A; Yassa, Michael A

2016-04-01

Previous studies across species have established that the aging process adversely affects certain memory-related brain regions earlier than others. Behavioral tasks targeted at the function of vulnerable regions can provide noninvasive methods for assessing the integrity of particular components of memory throughout the lifespan. The present study modified a previous task designed to separately but concurrently test detailed memory for object identity and spatial location. Memory for objects or items is thought to rely on perirhinal and lateral entorhinal cortices, among the first targets of Alzheimer's related neurodegeneration. In line with prior work, we split an aged adult sample into "impaired" and "unimpaired" groups on the basis of a standardized word-learning task. The "impaired" group showed widespread difficulty with memory discrimination, whereas the "unimpaired" group showed difficulty with object, but not spatial memory discrimination. These findings support the hypothesized greater age-related impacts on memory for objects or items in older adults, perhaps even with healthy aging. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

Dissociating electrophysiological correlates of subjective, objective, and correct memory in investigating the emotion-induced recognition bias.

PubMed

Windmann, Sabine; Hill, Holger

2014-10-01

Performance on tasks requiring discrimination of at least two stimuli can be viewed either from an objective perspective (referring to actual stimulus differences), or from a subjective perspective (corresponding to participant's responses). Using event-related potentials recorded during an old/new recognition memory test involving emotionally laden and neutral words studied either blockwise or randomly intermixed, we show here how the objective perspective (old versus new items) yields late effects of blockwise emotional item presentation at parietal sites that the subjective perspective fails to find, whereas the subjective perspective ("old" versus "new" responses) is more sensitive to early effects of emotion at anterior sites than the objective perspective. Our results demonstrate the potential advantage of dissociating the subjective and the objective perspective onto task performance (in addition to analyzing trials with correct responses), especially for investigations of illusions and information processing biases, in behavioral and cognitive neuroscience studies. Copyright © 2014 Elsevier Inc. All rights reserved.
Capacity and precision in an animal model of visual short-term memory.

PubMed

Lara, Antonio H; Wallis, Jonathan D

2012-03-14

Temporary storage of information in visual short-term memory (VSTM) is a key component of many complex cognitive abilities. However, it is highly limited in capacity. Understanding the neurophysiological nature of this capacity limit will require a valid animal model of VSTM. We used a multiple-item color change detection task to measure macaque monkeys' VSTM capacity. Subjects' performance deteriorated and reaction times increased as a function of the number of items in memory. Additionally, we measured the precision of the memory representations by varying the distance between sample and test colors. In trials with similar sample and test colors, subjects made more errors compared to trials with highly discriminable colors. We modeled the error distribution as a Gaussian function and used this to estimate the precision of VSTM representations. We found that as the number of items in memory increases the precision of the representations decreases dramatically. Additionally, we found that focusing attention on one of the objects increases the precision with which that object is stored and degrades the precision of the remaining. These results are in line with recent findings in human psychophysics and provide a solid foundation for understanding the neurophysiological nature of the capacity limit of VSTM.
Strategic retrieval in a reality monitoring task.

PubMed

Rosburg, Timm; Mecklinger, Axel; Johansson, Mikael

2011-08-01

Strategic recollection refers to control processes that allow the retrieval of information that is relevant for a specific situation. These processes can be studied in memory exclusion tasks, which require the retrieval of particular kinds of episodic information. In the current study, we investigated strategic recollection in reality monitoring by event-related potentials (ERPs). Participants studied object words, followed by a picture of the denoted object (perceive condition) or followed by the instruction to imagine such a picture (imagine condition). At test, subjects had to identify words of one study condition and to reject words of the second study condition together with newly presented items. Data analysis showed that object names were better identified when items of the perceive condition were targeted. In this test condition, a left parietal old/new effect (the ERP correlate of recollection) was observed only in response to targets. In contrast, both targets and nontargets elicited this old/new effect when items of the imagine condition were targeted. The magnitude of the left parietal old/new effect to nontargets in this condition (but no other left parietal old/new effect) correlated positively with the discrimination indices of both test conditions. In addition, ERPs to targets and nontargets differed at right frontal electrode sites at longer latencies (1500-1800 ms), with more positive ERPs for targets. Findings indicate that subjects retrieved nontarget information in the more difficult task condition, while they relied on target information alone in the less difficult task. This kind of strategic retrieval was not mirrored in other old/new effects. The correlation between the left parietal old/new effect for nontargets in the imagined item target condition and the discrimination indices of both conditions may indicate that the ease of nontarget retrieval, rather than the difficulty of target retrieval, increases the likelihood that nontarget information is actually retrieved. Copyright © 2011 Elsevier Ltd. All rights reserved.
Cancer Health Literacy Test-30-Spanish (CHLT-30-DKspa), a new Spanish- language version of the Cancer Health Literacy Test (CHLT-30) for Spanish-speaking Latinos

PubMed Central

Echeverri, Margarita; Anderson, David; Nápoles, Anna María

2016-01-01

Objective Describe adaptation and initial validation of the Cancer Health Literacy Test (CHLT) for Spanish-speakers. Methods Cross-sectional field test of the CHLT Spanish version (CHLT-30-DKspa) among healthy Latinos in Louisiana. Diagonally Weighted Least Squares were used to confirm the factor structure. Item-Response Analysis using 2-parameter logistic estimates were used to identify questions that may require modification to avoid bias. Cronbach's alpha coefficients estimated scale internal consistency reliability. Analysis of variance was used to test for significant differences in CHLT-30-DKspa scores by gender, origin, age and education. Results Mean CHLT-30-DKspa score (N=400) was 17.13 (range 0 to 30; SD 6.65). Results confirmed a unidimensional structure (X2[405] =461.55, p=.027, CFI=.993; TLI=.992, RMSEA=.0180). Cronbach's alpha was 0.88. Items Q1-High calorie and Q15-Tumor spread had the lowest item-scale correlations (.148 and .288) and standardized factor loadings (.152 and .302). Items Q1-High Calories, Q8-Palliative Care, and Q19-Smoking Risk had the highest item-difficulty parameters (diff=1.12, 1.21, and 2.40). Conclusions Results generally supported the applicability of the CHLT-30-DKspa for Spanish-speaking healthy populations, with the exception of four items that need to be deleted or revised and further studied Q1, Q8, Q15, and Q19). Practical Implications The CHLT-30-DKspa can be used to assess cancer health literacy among Spanish-speaking populations to advance research on cancer health literacy and outcomes. PMID:27043760
14 CFR Section 17 - Objective Classification-Extraordinary Items

Code of Federal Regulations, 2010 CFR

2010-01-01

... 14 Aeronautics and Space 4 2010-01-01 2010-01-01 false Objective Classification-Extraordinary Items Section 17 Section 17 Aeronautics and Space OFFICE OF THE SECRETARY, DEPARTMENT OF TRANSPORTATION... AIR CARRIERS Profit and Loss Classification Section 17 Objective Classification—Extraordinary Items...
Moving Knowledge Acquisition From the Lecture Hall to the Student Home: A Prospective Intervention Study

PubMed Central

Grefe, Clemens; Brown, Jamie; Meyer, Katharina; Schuelper, Nikolai; Anders, Sven

2015-01-01

Background Podcasts are popular with medical students, but the impact of podcast use on learning outcomes in undergraduate medical education has not been studied in detail. Objective Our aim was to assess the impact of podcasts accompanied by quiz questions and lecture attendance on short- and medium-term knowledge retention. Methods Students enrolled for a cardio-respiratory teaching module were asked to prepare for 10 specific lectures by watching podcasts and submitting answers to related quiz questions before attending live lectures. Performance on the same questions was assessed in a surprise test and a retention test. Results Watching podcasts and submitting answers to quiz questions (versus no podcast/quiz use) was associated with significantly better test performance in all items in the surprise test and 7 items in the retention test. Lecture attendance (versus no attendance) was associated with higher test performance in 3 items and 1 item, respectively. In a linear regression analysis adjusted for age, gender, and overall performance levels, both podcast/quiz use and lecture attendance were significant predictors of student performance. However, the variance explained by podcast/quiz use was greater than the variance explained by lecture attendance in the surprise test (38.7% vs 2.2%) and retention test (19.1% vs 4.0%). Conclusions When used in conjunction with quiz questions, podcasts have the potential to foster knowledge acquisition and retention over and above the effect of live lectures. PMID:26416467
76 FR 80389 - Notice of Intent To Repatriate a Cultural Item: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-12-23

... cultural item meets the definition of sacred object and repatriation to the lineal descendant stated below... descendants of the individual who owned the sacred object and who wish to claim the item should contact Little... descendants of the individual who owned the sacred object and who wish to claim the item should contact Little...
Rasch Analysis of the Power as Knowing Participation in Change Tool--the Brazilian version.

PubMed

Guedes, Erika de Souza; Orozco-Vargas, Luiz Carlos; Turrini, Ruth Natália Teresa; de Sousa, Regina Márcia Cardoso; dos Santos, Mariana Alvina; da Cruz, Diná de Almeida Lopes Monteiro

2013-01-01

the objective of this study was to evaluate the items contained in the Brazilian version of the Power as Knowing Participation in Change Tool (PKPCT). investigation of the psychometric properties of the mentioned questionnaire through Rasch analysis. the data from 952 nursing assistants and 627 baccalaureate nurses were analyzed (average age 44.1 (SD=9.5); 13.0% men). The subscales Choices, Awareness, Freedom and Involvement were tested separately and presented unidimensionality; the categories of the responses given to the items were compiled from 7 to 3 levels and the items fit the model well, except for the following/leading item, in which the infit and outfit values were above 1.4; this item has also presented Differential Item Functioning (DIF) according to the participant's role. The reliability of the items was of 0.99 and the reliability of the participants ranged from 0.80 to 0.84 in the subscales. Items with extremely high levels of difficulty were not identified. the PKPCT should not be viewed as unidimensional, items with extremely high levels of difficulty in the scale need to be created and the differential functioning of some items has to be further investigated.
Assessing Patients’ Experiences with Communication Across the Cancer Care Continuum

PubMed Central

Mazor, Kathleen M.; Street, Richard L.; Sue, Valerie M.; Williams, Andrew E.; Rabin, Borsika A.; Arora, Neeraj K.

2016-01-01

Objective To evaluate the relevance, performance and potential usefulness of the Patient Assessment of cancer Communication Experiences (PACE) items. Methods Items focusing on specific communication goals related to exchanging information, fostering healing relationships, responding to emotions, making decisions, enabling self-management, and managing uncertainty were tested via a retrospective, cross-sectional survey of adults who had been diagnosed with cancer. Analyses examined response frequencies, inter-item correlations, and coefficient alpha. Results A total of 366 adults were included in the analyses. Relatively few selected “Does Not Apply”, suggesting that items tap relevant communication experiences. Ratings of whether specific communication goals were achieved were strongly correlated with overall ratings of communication, suggesting item content reflects important aspects of communication. Coefficient alpha was ≥.90 for each item set, indicating excellent reliability. Variations in the percentage of respondents selecting the most positive response across items suggest results can identify strengths and weaknesses. Conclusion The PACE items tap relevant, important aspects of communication during cancer care, and may be useful to cancer care teams desiring detailed feedback. PMID:26979476
ORES - Objective Referenced Evaluation in Science.

ERIC Educational Resources Information Center

Shaw, Terry

Science process skills considered important in making decisions and solving problems include: observing, classifying, measuring, using numbers, using space/time relationships, communicating, predicting, inferring, manipulating variables, making operational definitions, forming hypotheses, interpreting data, and experimenting. This 60-item test,…
Development and validation of an objective instrument to measure surgical performance at tonsillectomy.

PubMed

Roberson, David W; Kentala, Erna; Forbes, Peter

2005-12-01

The goals of this project were 1) to develop and validate an objective instrument to measure surgical performance at tonsillectomy, 2) to assess its interobserver and interobservation reliability and construct validity, and 3) to select those items with best reliability and most independent information to design a simplified form suitable for routine use in otolaryngology surgical evaluation. Prospective, observational data collection for an educational quality improvement project. The evaluation instrument was based on previous instruments developed in general surgery with input from attending otolaryngologic surgeons and experts in medical education. It was pilot tested and subjected to iterative improvements. After the instrument was finalized, a total of 55 tonsillectomies were observed and scored during academic year 2002 to 2003: 45 cases by residents at different points during their rotation, 5 by fellows, and 5 by faculty. Results were assessed for interobserver reliability, interobservation reliability, and construct validity. Factor analysis was used to identify items with independent information. Interobserver and interobservation reliability was high. On technical items, faculty substantially outperformed fellows, who in turn outperformed residents (P < .0001 for both comparisons). On the "global" scale (overall assessment), residents improved an average of 1 full point (on a 5 point scale) during a 3 month rotation (P = .01). In the subscale of "patient care," results were less clear cut: fellows outperformed residents, who in turn outperformed faculty, but only the fellows to faculty comparison was statistically significant (P = .04), and residents did not clearly improve over time (P = .36). Factor analysis demonstrated that technical items and patient care items factor separately and thus represent separate skill domains in surgery. It is possible to objectively measure surgical skill at tonsillectomy with high reliability and good construct validity. Factor analysis demonstrated that patient care is a distinct domain in surgical skill. Although the interobserver reliability for some patient care items reached statistical significance, it was not high enough for "high stakes testing" purposes. Using reliability and factor analysis results, we propose a simplified instrument for use in evaluating trainees in otolaryngologic surgery.
Development and validation of a socioculturally competent trust in physician scale for a developing country setting

PubMed Central

Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar

2015-01-01

Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. Objectives To develop and validate a new trust in physician scale for a developing country setting. Methods Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2 parameter logistic Semajima's graded response model was fit and item characteristics assessed. Results Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. Conclusions The final 12 item trust in physician scale has a good construct validity and internal consistency. PMID:25941182
Harmonizing Screening for Gambling Problems in Epidemiological Surveys – Development of the Rapid Screener for Problem Gambling (RSPG)

PubMed Central

Challet-Bouju, Gaëlle; Perrot, Bastien; Romo, Lucia; Valleur, Marc; Magalon, David; Fatséas, Mélina; Chéreau-Boudet, Isabelle; Luquiens, Amandine; Grall-Bronnec, Marie; Hardouin, Jean-Benoit

2016-01-01

Background and aims The aim of this study was to test the screening properties of several combinations of items from gambling scales, in order to harmonize screening of gambling problems in epidemiological surveys. The objective was to propose two brief screening tools (three items or less) for a use in interviews and self-administered questionnaires. Methods We tested the screening properties of combinations of items from several gambling scales, in a sample of 425 gamblers (301 non-problem gamblers and 124 disordered gamblers). Items tested included interview-based items (Pathological Gambling section of the DSM-IV, lifetime history of problem gambling, monthly expenses in gambling, and abstinence of 1 month or more) and self-report items (South Oaks Gambling Screen, Gambling Attitudes, and Beliefs Survey). The gold standard used was the diagnosis of a gambling disorder according to the DSM-5. Results Two versions of the Rapid Screener for Problem Gambling (RSPG) were developed: the RSPG-Interview (RSPG-I), being composed of two interview items (increasing bets and loss of control), and the RSPG-Self-Assessment (RSPG-SA), being composed of three self-report items (chasing, guiltiness, and perceived inability to stop). Discussion and conclusions We recommend using the RSPG-SA/I for screening problem gambling in epidemiological surveys, with the version adapted for each purpose (RSPG-I for interview-based surveys and RSPG-SA for self-administered surveys). This first triage of potential problem gamblers must be supplemented by further assessment, as it may overestimate the proportion of problem gamblers. However, a first triage has the great advantage of saving time and energy in large-scale screening for problem gambling. PMID:27348558
Test blueprints for psychiatry residency in-training written examinations in Riyadh, Saudi Arabia

PubMed Central

Gaffas, Eisha M; Sequeira, Reginald P; Namla, Riyadh A Al; Al-Harbi, Khalid S

2012-01-01

Background The postgraduate training program in psychiatry in Saudi Arabia, which was established in 1997, is a 4-year residency program. Written exams comprising of multiple choice questions (MCQs) are used as a summative assessment of residents in order to determine their eligibility for promotion from one year to the next. Test blueprints are not used in preparing examinations. Objective To develop test blueprints for the written examinations used in the psychiatry residency program. Methods Based on the guidelines of four professional bodies, documentary analysis was used to develop global and detailed test blueprints for each year of the residency program. An expert panel participated during piloting and final modification of the test blueprints. Their opinion about the content, weightage for each content domain, and proportion of test items to be sampled in each cognitive category as defined by modified Bloom’s taxonomy were elicited. Results Eight global and detailed test blueprints, two for each year of the psychiatry residency program, were developed. The global test blueprints were reviewed by experts and piloted. Six experts participated in the final modification of test blueprints. Based on expert consensus, the content, total weightage for each content domain, and proportion of test items to be included in each cognitive category were determined for each global test blueprint. Experts also suggested progressively decreasing the weightage for recall test items and increasing problem solving test items in examinations, from year 1 to year 4 of the psychiatry residence program. Conclusion A systematic approach using a documentary and content analysis technique was used to develop test blueprints with additional input from an expert panel as appropriate. Test blueprinting is an important step to ensure the test validity in all residency programs. PMID:23762000
Spacelab mission development tests

NASA Technical Reports Server (NTRS)

Dalton, B. P.

1978-01-01

The paper describes Spacelab Mission Development Test III (SMD III) whose principal scientific objective was to demonstrate the feasibility of conducting biological research in the Life Sciences Spacelab. The test also provided an opportunity to try out several items of Common Operational Research Equipment (CORE) hardware being developed for operational use in Shuttle/Spacelab, such as rodent and primate handling, transportation units, and a 'zero-g' surgical bench. Operational concepts planned for Spacelab were subjected to evaluation, including animal handling procedures, animal logistics, crew selection and training, and a 'remote' ground station concept. It is noted that all the objectives originally proposed for SMD III were accomplished
Validation of a scale for assessing attitudes towards outcomes of genetic cancer testing among primary care providers and breast specialists

PubMed Central

N’Diaye, Khadim; Evans, D. Gareth; Harris, Hilary; Tibben, Aad; van Asperen, Christi; Schmidtke, Joerg; Nippert, Irmgard; Mancini, Julien; Julian-Reynier, Claire

2017-01-01

Objective To develop a generic scale for assessing attitudes towards genetic testing and to psychometrically assess these attitudes in the context of BRCA1/2 among a sample of French general practitioners, breast specialists and gyneco-obstetricians. Study design and setting Nested within the questionnaire developed for the European InCRisC (International Cancer Risk Communication Study) project were 14 items assessing expected benefits (8 items) and drawbacks (6 items) of the process of breast/ovarian genetic cancer testing (BRCA1/2). Another item assessed agreement with the statement that, overall, the expected health benefits of BRCA1/2 testing exceeded its drawbacks, thereby justifying its prescription. The questionnaire was mailed to a sample of 1,852 French doctors. Of these, 182 breast specialists, 275 general practitioners and 294 gyneco-obstetricians completed and returned the questionnaire to the research team. Principal Component Analysis, Cronbach’s α coefficient, and Pearson’s correlation coefficients were used in the statistical analyses of collected data. Results Three dimensions emerged from the respondents’ responses, and were classified under the headings: “Anxiety, Conflict and Discrimination”, “Risk Information”, and “Prevention and Surveillance”. Cronbach’s α coefficient for the 3 dimensions was 0.79, 0.76 and 0.62, respectively, and each dimension exhibited strong correlation with the overall indicator of agreement (criterion validity). Conclusions The validation process of the 15 items regarding BRCA1/2 testing revealed satisfactory psychometric properties for the creation of a new scale entitled the Attitudes Towards Genetic Testing for BRCA1/2 (ATGT-BRCA1/2) Scale. Further testing is required to confirm the validity of this tool which could be used generically in other genetic contexts. PMID:28570656
Components of a Measure to Describe Organizational Culture in Academic Pharmacy.

PubMed

Desselle, Shane; Rosenthal, Meagen; Holmes, Erin R; Andrews, Brienna; Lui, Julia; Raja, Leela

2017-12-01

Objective. To develop a measure of organizational culture in academic pharmacy and identify characteristics of an academic pharmacy program that would be impactful for internal (eg, students, employees) and external (eg, preceptors, practitioners) clients of the program. Methods. A three-round Delphi procedure of 24 panelists from pharmacy schools in the U.S. and Canada generated items based on the Organizational Culture Profile (OCP), which were then evaluated and refined for inclusion in subsequent rounds. Items were assessed for appropriateness and impact. Results. The panel produced 35 items across six domains that measured organizational culture in academic pharmacy: competitiveness, performance orientation, social responsibility, innovation, emphasis on collegial support, and stability. Conclusion. The items generated require testing for validation and reliability in a large sample to finalize this measure of organizational culture.
Evaluation and simplification of the occupational slip, trip and fall risk-assessment test

PubMed Central

NAKAMURA, Takehiro; OYAMA, Ichiro; FUJINO, Yoshihisa; KUBO, Tatsuhiko; KADOWAKI, Koji; KUNIMOTO, Masamizu; ODOI, Haruka; TABATA, Hidetoshi; MATSUDA, Shinya

2016-01-01

Objective: The purpose of this investigation is to evaluate the efficacy of the occupational slip, trip and fall (STF) risk assessment test developed by the Japan Industrial Safety and Health Association (JISHA). We further intended to simplify the test to improve efficiency. Methods: A previous cohort study was performed using 540 employees aged ≥50 years who took the JISHA’s STF risk assessment test. We conducted multivariate analysis using these previous results as baseline values and answers to questionnaire items or score on physical fitness tests as variables. The screening efficiency of each model was evaluated based on the obtained receiver operating characteristic (ROC) curve. Results: The area under the ROC obtained in multivariate analysis was 0.79 when using all items. Six of the 25 questionnaire items were selected for stepwise analysis, giving an area under the ROC curve of 0.77. Conclusion: Based on the results of follow-up performed one year after the initial examination, we successfully determined the usefulness of the STF risk assessment test. Administering a questionnaire alone is sufficient for screening subjects at risk of STF during the subsequent one-year period. PMID:27021057
Industrial Arts Test Development, Book III. Resource Items for Graphics Technology, Power Technology, Production Technology.

ERIC Educational Resources Information Center

New York State Education Dept., Albany.

This booklet is designed to assist teachers in developing examinations for classroom use. It is a collection of 955 objective test questions, mostly multiple choice, for industrial arts students in the three areas of graphics technology, power technology, and production technology. Scoring keys are provided. There are no copyright restrictions,…
The Development of a Test to Assess Drug Using Behavior.

ERIC Educational Resources Information Center

Althoff, Michael E.

The objective of the study was to develop a test which could measure both the qualitative and quantitative aspects of drug-using behavior, including such factors as attitudes toward drugs, experience with drugs, and knowledge about drugs. The Drug Use Scale was developed containing 134 items and dealing with five classes of drugs: marijuana,…

Cross-Cultural Comparisons of the Motivation of Young Children to Achieve in School.

ERIC Educational Resources Information Center

Adkins, Dorothy C.

Research on the differences in motivation to achieve in school among 10 groups of four-year-olds utilized a new, 75-item objective projective test called Gumpgookies. This test was individually administered to approximately 2000 children mainly from low economic backgrounds. The various ethnic and religious groups were compared with respect to…
A Computer-Assisted Test Design and Diagnosis System for Use by Classroom Teachers

ERIC Educational Resources Information Center

He, Q.; Tymms, P.

2005-01-01

Computer-assisted assessment (CAA) has become increasingly important in education in recent years. A variety of computer software systems have been developed to help assess the performance of students at various levels. However, such systems are primarily designed to provide objective assessment of students and analysis of test items, and focus…
Measuring pain in the context of homelessness

PubMed Central

Matter, Rebecca; Kline, Susan; Cook, Karon F.; Amtmann, Dagmar

2009-01-01

Purpose The primary objective of this study was to inform the development of measures of pain impact appropriate for all respondents, including homeless individuals, so that they can be used in clinical research and practice. The secondary objective was to increase understanding about the unique experience of homeless people with pain. Methods Seventeen homeless individuals with chronic health conditions (often associated with pain) participated in cognitive interviews to test the functioning of 56 pain measurement items and provided information about their experience living with and accessing treatment for pain. Results The most common problems identified with items were that they lacked clarity or were irrelevant in the context of homelessness. Items that were unclear, irrelevant and/or had other identified problems made it difficult for participants to respond. Participants also described multiple ways in which their pain was exacerbated by conditions of homelessness and identified barriers to accessing appropriate treatment. Conclusions Results suggested that the majority of items were problematic for the homeless and require substantial modifications to make the pain impact bank relevant to this population. Additional recommendations include involving homeless in future item bank development, conducting research on the topic of pain and homelessness, and using cognitive interviewing in other types of health disparities research. PMID:19582592
Development and Field Test of an Audit Tool and Tracer Methodology for Clinician Assessment of Quality in End-of-Life Care.

PubMed

Bookbinder, Marilyn; Hugodot, Amandine; Freeman, Katherine; Homel, Peter; Santiago, Elisabeth; Riggs, Alexa; Gavin, Maggie; Chu, Alice; Brady, Ellen; Lesage, Pauline; Portenoy, Russell K

2018-02-01

Quality improvement in end-of-life care generally acquires data from charts or caregivers. "Tracer" methodology, which assesses real-time information from multiple sources, may provide complementary information. The objective of this study was to develop a valid brief audit tool that can guide assessment and rate care when used in a clinician tracer to evaluate the quality of care for the dying patient. To identify items for a brief audit tool, 248 items were created to evaluate overall quality, quality in specific content areas (e.g., symptom management), and specific practices. Collected into three instruments, these items were used to interview professional caregivers and evaluate the charts of hospitalized patients who died. Evidence that this information could be validly captured using a small number of items was obtained through factor analyses, canonical correlations, and group comparisons. A nurse manager field tested tracer methodology using candidate items to evaluate the care provided to other patients who died. The survey of 145 deaths provided chart data and data from 445 interviews (26 physicians, 108 nurses, 18 social workers, and nine chaplains). The analyses yielded evidence of construct validity for a small number of items, demonstrating significant correlations between these items and content areas identified as latent variables in factor analyses. Criterion validity was suggested by significant differences in the ratings on these items between the palliative care unit and other units. The field test evaluated 127 deaths, demonstrated the feasibility of tracer methodology, and informed reworking of the candidate items into the 14-item Tracer EoLC v1. The Tracer EoLC v1 can be used with tracer methodology to guide the assessment and rate the quality of end-of-life care. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Domestic violence on children: development and validation of an instrument to evaluate knowledge of health professionals 1

PubMed Central

Oliveira, Lanuza Borges; Soares, Fernanda Amaral; Silveira, Marise Fagundes; de Pinho, Lucinéia; Caldeira, Antônio Prates; Leite, Maísa Tavares de Souza

2016-01-01

ABSTRACT Objective: to develop and validate an instrument to evaluate the knowledge of health professionals about domestic violence on children. Method: this was a study conducted with 194 physicians, nurses and dentists. A literature review was performed for preparation of the items and identification of the dimensions. Apparent and content validation was performed using analysis of three experts and 27 professors of the pediatric health discipline. For construct validation, Cronbach's alpha was used, and the Kappa test was applied to verify reproducibility. The criterion validation was conducted using the Student's t-test. Results: the final instrument included 56 items; the Cronbach alpha was 0.734, the Kappa test showed a correlation greater than 0.6 for most items, and the Student t-test showed a statistically significant value to the level of 5% for the two selected variables: years of education and using the Family Health Strategy. Conclusion: the instrument is valid and can be used as a promising tool to develop or direct actions in public health and evaluate knowledge about domestic violence on children. PMID:27556878
An Academic-Government-Faith Partnership to Build Disaster Mental Health Preparedness and Community Resilience

PubMed Central

Semon, Natalie L.; Lating, Jeffrey M.; Everly, George S.; Perry, Charlene J.; Moore, Suzanne Straub; Mosley, Adrian M.; Thompson, Carol B.; Links, Jonathan M.

2014-01-01

Objectives Faculty and affiliates of the Johns Hopkins Preparedness and Emergency Response Research Center partnered with local health departments and faith-based organizations to develop a dual-intervention model of capacity-building for public mental health preparedness and community resilience. Project objectives included (1) determining the feasibility of the tri-partite collaborative concept; (2) designing, delivering, and evaluating psychological first aid (PFA) training and guided preparedness planning (GPP); and (3) documenting preliminary evidence of the sustainability and impact of the model. Methods We evaluated intervention effectiveness by analyzing pre- and post-training changes in participant responses on knowledge-acquisition tests administered to three urban and four rural community cohorts. Changes in percent of correct items and mean total correct items were evaluated. Criteria for model sustainability and impact were, respectively, observations of nonacademic partners engaging in efforts to advance post-project preparedness alliances, and project-attributable changes in preparedness-related practices of local or state governments. Results The majority (11 of 14) test items addressing technical or practical PFA content showed significant improvement; we observed comparable testing results for GPP training. Government and faith partners developed ideas and tools for sustaining preparedness activities, and numerous project-driven changes in local and state government policies were documented. Conclusions Results suggest that the model could be an effective approach to promoting public health preparedness and community resilience. PMID:25355980
Psychometric characteristics of Clinical Reasoning Problems (CRPs) and its correlation with routine multiple choice question (MCQ) in Cardiology department.

PubMed

Derakhshandeh, Zahra; Amini, Mitra; Kojuri, Javad; Dehbozorgian, Marziyeh

2018-01-01

Clinical reasoning is one of the most important skills in the process of training a medical student to become an efficient physician. Assessment of the reasoning skills in a medical school program is important to direct students' learning. One of the tests for measuring the clinical reasoning ability is Clinical Reasoning Problems (CRPs). The major aim of this study is to measure psychometric qualities of CRPs and define correlation between this test and routine MCQ in cardiology department of Shiraz medical school. This study was a descriptive study conducted on total cardiology residents of Shiraz Medical School. The study population consists of 40 residents in 2014. The routine CRPs and the MCQ tests was designed based on similar objectives and were carried out simultaneously. Reliability, item difficulty, item discrimination, and correlation between each item and the total score of CRPs were all measured by Excel and SPSS software for checking psycometeric CRPs test. Furthermore, we calculated the correlation between CRPs test and MCQ test. The mean differences of CRPs test score between residents' academic year [second, third and fourth year] were also evaluated by Analysis of variances test (One Way ANOVA) using SPSS software (version 20)(α=0.05). The mean and standard deviation of score in CRPs was 10.19 ±3.39 out of 20; in MCQ, it was 13.15±3.81 out of 20. Item difficulty was in the range of 0.27-0.72; item discrimination was 0.30-0.75 with question No.3 being the exception (that was 0.24). The correlation between each item and the total score of CRP was 0.26-0.87; the correlation between CRPs test and MCQ test was 0.68 (p<0.001). The reliability of the CRPs was 0.72 as calculated by using Cronbach's alpha. The mean score of CRPs was different among residents based on their academic year and this difference was statistically significant (p<0.001). The results of this present investigation revealed that CRPs could be reliable test for measuring clinical reasoning in residents. It can be included in cardiology residency assessment programs.
Exploring the effects of ownership and choice on self-memory biases.

PubMed

Cunningham, Sheila J; Brady-Van den Bos, Mirjam; Turk, David J

2011-07-01

Objects encoded in the context of temporary ownership by self enjoy a memorial advantage over objects owned by other people. This memory effect has been linked to self-referential encoding processes. The current inquiry explored the extent to which the effects of ownership are influenced by the degree of personal choice involved in assigning ownership. In three experiments pairs of participants chose objects to keep for ownership by self, and rejected objects that were given to the other participant to own. Recognition memory for the objects was then assessed. Experiment 1 showed that participants recognised more items encoded as "self-owned" than "other-owned", but only when they had been chosen by self. Experiment 2 replicated this pattern when participants' sense of choice was illusory. A source memory test in Experiment 3 showed that self-chosen items were most likely to be correctly attributed to ownership by self. These findings are discussed with reference to the link between owned objects and the self, and the routes through which self-referential operations can impact on cognition.
Development and psychometric testing of the childhood obesity perceptions (COP) survey among African American caregivers: A tool for obesity prevention program planning.

PubMed

Alexander, Dayna S; Alfonso, Moya L; Cao, Chunhua

2016-12-01

Currently, public health practitioners are analyzing the role that caregivers play in childhood obesity efforts. Assessing African American caregiver's perceptions of childhood obesity in rural communities is an important prevention effort. This article's objective is to describe the development and psychometric testing of a survey tool to assess childhood obesity perceptions among African American caregivers in a rural setting, which can be used for obesity prevention program development or evaluation. The Childhood Obesity Perceptions (COP) survey was developed to reflect the multidimensional nature of childhood obesity including risk factors, health complications, weight status, built environment, and obesity prevention strategies. A 97-item survey was pretested and piloted with the priority population. After pretesting and piloting, the survey was reduced to 59-items and administered to 135 African American caregivers. An exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) was conducted to test how well the survey items represented the number of Social Cognitive Theory constructs. Twenty items were removed from the original 59-item survey and acceptable internal consistency of the six factors (α=0.70-0.85) was documented for all scales in the final COP instrument. CFA resulted in a less than adequate fit; however, a multivariate Lagrange multiplier test identified modifications to improve the model fit. The COP survey represents a promising approach as a potentially comprehensive assessment for implementation or evaluation of childhood obesity programs. Copyright © 2016 Elsevier Ltd. All rights reserved.
Measuring the effects of online health information for patients: Item generation for an e-health impact questionnaire

PubMed Central

Kelly, Laura; Jenkinson, Crispin; Ziebland, Sue

2013-01-01

Objective The internet is a valuable resource for accessing health information and support. We are developing an instrument to assess the effects of websites with experiential and factual health information. This study aimed to inform an item pool for the proposed questionnaire. Methods Items were informed through a review of relevant literature and secondary qualitative analysis of 99 narrative interviews relating to patient and carer experiences of health. Statements relating to identified themes were re-cast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n = 21) were used to assess items for face and content validity. Results Eighty-two generic items were identified following secondary qualitative analysis and expert review. Cognitive interviewing confirmed the questionnaire instructions, 62 items and the response options were acceptable to patients and carers. Conclusion Using a clear conceptual basis to inform item generation, 62 items have been identified as suitable to undergo further psychometric testing. Practice implications The final questionnaire will initially be used in a randomized controlled trial examining the effects of online patient's experiences. This will inform recommendations on the best way to present patients’ experiences within health information websites. PMID:23598293
Exogenous temporal cues enhance recognition memory in an object-based manner.

PubMed

Ohyama, Junji; Watanabe, Katsumi

2010-11-01

Exogenous attention enhances the perception of attended items in both a space-based and an object-based manner. Exogenous attention also improves recognition memory for attended items in the space-based mode. However, it has not been examined whether object-based exogenous attention enhances recognition memory. To address this issue, we examined whether a sudden visual change in a task-irrelevant stimulus (an exogenous cue) would affect participants' recognition memory for items that were serially presented around a cued time. The results showed that recognition accuracy for an item was strongly enhanced when the visual cue occurred at the same location and time as the item (Experiments 1 and 2). The memory enhancement effect occurred when the exogenous visual cue and an item belonged to the same object (Experiments 3 and 4) and even when the cue was counterpredictive of the timing of an item to be asked about (Experiment 5). The present study suggests that an exogenous temporal cue automatically enhances the recognition accuracy for an item that is presented at close temporal proximity to the cue and that recognition memory enhancement occurs in an object-based manner.
Study protocol of psychometric properties of the Spanish translation of a competence test in evidence based practice: the Fresno test.

PubMed

Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther

2009-02-24

There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Concept Area Three Objectives and Test Items (Rev.). Part One and Part Two. Economic Analysis Course. Segments 50 - 84.

ERIC Educational Resources Information Center

Sterling Inst., Washington, DC. Educational Technology Center.

A multimedia course in economic analysis was developed and used in conjunction with the United States Naval Academy. (See ED 043 790 and ED 043 791 for final reports of the project evaluation and development model.) This report deals with concept area three of the course, which focuses on microeconomics. The behavioral objectives, hierarchy…
Effects of hypnosis and level of processing on repeated recall of line drawings.

PubMed

McKelvie, S J; Pullara, M

1988-07-01

Moderately susceptible subjects (N = 30) initially judged 30 line drawings of objects for pleasantness (deep processing) and 30 line drawings for visual complexity (shallow processing), after which they were given two immediate recall tests. Following a 48-hr delay, subjects were allocated randomly to hypnosis, simulation, or neutral control conditions and were tested four more times. Subjects produced more correct and incorrect responses over the six trials and gave a higher number of correct responses for deep items than for shallow items. Over the last four trials, hypnosis had no general facilitative effect relative to the other two treatments, but the effect of depth was strongest for hypnotized subjects, who recalled more deep items than did the controls. Finally, both hypnotized and simulating subjects rated their recall as more involuntary and their experimental treatment as more helpful than did the controls. Caution is urged in the forensic use of hypnosis as a retrieval device.
Development and Validity Testing of the Worksite Health Index: An Assessment Tool to Help and Improve Korean Employees' Health-Related Outcome.

PubMed

Yun, Young Ho; Sim, Jin Ah; Lim, Ye Jin; Lim, Cheol Il; Kang, Sung-Choon; Kang, Joon-Ho; Park, Jun Dong; Noh, Dong Young

2016-06-01

The objective of this study was to develop the Worksite Health Index (WHI) and validate its psychometric properties. The development of the WHI questionnaire included item generation, item construction, and field testing. To assess the instrument's reliability and validity, we recruited 30 different Korean worksites. We developed the WHI questionnaire of 136 items categorized into five domains, namely Governance and Infrastructure, Need Assessment and Planning, Health Prevention and Promotion Program, Occupational Safety, and Monitoring and Feedback. All WHI domains demonstrated a high reliability with good internal consistency. The total WHI scores differentiated worksite groups effectively according to firm size. Each domain was associated significantly with employees' health status, absence, and financial outcome. The WHI can assess comprehensive worksite health programs. This tool is publicly available for addressing the growing need for worksite health programs.
Improving language mapping in clinical fMRI through assessment of grammar.

PubMed

Połczyńska, Monika; Japardi, Kevin; Curtiss, Susan; Moody, Teena; Benjamin, Christopher; Cho, Andrew; Vigil, Celia; Kuhn, Taylor; Jones, Michael; Bookheimer, Susan

2017-01-01

Brain surgery in the language dominant hemisphere remains challenging due to unintended post-surgical language deficits, despite using pre-surgical functional magnetic resonance (fMRI) and intraoperative cortical stimulation. Moreover, patients are often recommended not to undergo surgery if the accompanying risk to language appears to be too high. While standard fMRI language mapping protocols may have relatively good predictive value at the group level, they remain sub-optimal on an individual level. The standard tests used typically assess lexico-semantic aspects of language, and they do not accurately reflect the complexity of language either in comprehension or production at the sentence level. Among patients who had left hemisphere language dominance we assessed which tests are best at activating language areas in the brain. We compared grammar tests (items testing word order in actives and passives, wh -subject and object questions, relativized subject and object clauses and past tense marking) with standard tests (object naming, auditory and visual responsive naming), using pre-operative fMRI. Twenty-five surgical candidates (13 females) participated in this study. Sixteen patients presented with a brain tumor, and nine with epilepsy. All participants underwent two pre-operative fMRI protocols: one including CYCLE-N grammar tests (items testing word order in actives and passives, wh-subject and object questions, relativized subject and object clauses and past tense marking); and a second one with standard fMRI tests (object naming, auditory and visual responsive naming). fMRI activations during performance in both protocols were compared at the group level, as well as in individual candidates. The grammar tests generated more volume of activation in the left hemisphere (left/right angular gyrus, right anterior/posterior superior temporal gyrus) and identified additional language regions not shown by the standard tests (e.g., left anterior/posterior supramarginal gyrus). The standard tests produced more activation in left BA 47. Ten participants had more robust activations in the left hemisphere in the grammar tests and two in the standard tests. The grammar tests also elicited substantial activations in the right hemisphere and thus turned out to be superior at identifying both right and left hemisphere contribution to language processing. The grammar tests may be an important addition to the standard pre-operative fMRI testing.
Development and initial evaluation of the SCI-FI/AT

PubMed Central

Jette, Alan M.; Slavin, Mary D.; Ni, Pengsheng; Kisala, Pamela A.; Tulsky, David S.; Heinemann, Allen W.; Charlifue, Susie; Tate, Denise G.; Fyffe, Denise; Morse, Leslie; Marino, Ralph; Smith, Ian; Williams, Steve

2015-01-01

Objectives To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain. Design Cross sectional survey followed by computerized adaptive test (CAT) simulations. Setting Inpatient and community settings. Participants A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury. Interventions None Main outcome measure SCI-FI/AT Results Confirmatory factor analysis (CFA) and Item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items) Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT item difficulties in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks. Conclusion With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI. PMID:26010975
Scale Refinement and Initial Evaluation of a Behavioral Health Function Measurement Tool for Work Disability Evaluation

PubMed Central

Marfeo, Elizabeth E.; Ni, Pengsheng; Bogusz, Kara; Meterko, Mark; McDonough, Christine M.; Chan, Leighton; Rasch, Elizabeth K.; Brandt, Diane E.; Jette, Alan M.

2014-01-01

Objectives To use item response theory (IRT) data simulations to construct and perform initial psychometric testing of a newly developed instrument, the Social Security Administration Behavioral Health Function (SSA-BH) instrument, that aims to assess behavioral health functioning relevant to the context of work. Design Cross-sectional survey followed by item response theory (IRT) calibration data simulations Setting Community Participants A sample of individuals applying for SSA disability benefits, claimants (N=1015), and a normative comparative sample of US adults (N=1000) Interventions None. Main Outcome Measure Social Security Administration Behavioral Health Function (SSA-BH) measurement instrument Results Item response theory analyses supported the unidimensionality of four SSA-BH scales: Mood and Emotions (35 items), Self-Efficacy (23 items), Social Interactions (6 items), and Behavioral Control (15 items). All SSA-BH scales demonstrated strong psychometric properties including reliability, accuracy, and breadth of coverage. High correlations of the simulated 5- or 10- item CATs with the full item bank indicated robust ability of the CAT approach to comprehensively characterize behavioral health function along four distinct dimensions. Conclusions Initial testing and evaluation of the SSA-BH instrument demonstrated good accuracy, reliability, and content coverage along all four scales. Behavioral function profiles of SSA claimants were generated and compared to age and sex matched norms along four scales: Mood and Emotions, Behavioral Control, Social Interactions, and Self-Efficacy. Utilizing the CAT based approach offers the ability to collect standardized, comprehensive functional information about claimants in an efficient way, which may prove useful in the context of the SSA’s work disability programs. PMID:23542404
Calibration of context-specific survey items to assess youth physical activity behaviour.

PubMed

Saint-Maurice, Pedro F; Welk, Gregory J; Bartee, R Todd; Heelan, Kate

2017-05-01

This study tests calibration models to re-scale context-specific physical activity (PA) items to accelerometer-derived PA. A total of 195 4th-12th grades children wore an Actigraph monitor and completed the Physical Activity Questionnaire (PAQ) one week later. The relative time spent in moderate-to-vigorous PA (MVPA % ) obtained from the Actigraph at recess, PE, lunch, after-school, evening and weekend was matched with a respective item score obtained from the PAQ's. Item scores from 145 participants were calibrated against objective MVPA % using multiple linear regression with age, and sex as additional predictors. Predicted minutes of MVPA for school, out-of-school and total week were tested in the remaining sample (n = 50) using equivalence testing. The results showed that PAQ β-weights ranged from 0.06 (lunch) to 4.94 (PE) MVPA % (P < 0.05) and models root mean square error ranged from 4.2% (evening) to 20.2% (recess). When applied to an independent sample, differences between PAQ and accelerometer MVPA at school and out-of-school ranged from -15.6 to +3.8 min and the PAQ was within 10-15% of accelerometer measured activity. This study demonstrated that context-specific items can be calibrated to predict minutes of MVPA in groups of youth during in- and out-of-school periods.
Development of a wheelchair mobility skills test for children and adolescents: combining evidence with clinical expertise.

PubMed

Sol, Marleen Elisabeth; Verschuren, Olaf; de Groot, Laura; de Groot, Janke Frederike

2017-02-13

Wheelchair mobility skills (WMS) training is regarded by children using a manual wheelchair and their parents as an important factor to improve participation and daily physical activity. Currently, there is no outcome measure available for the evaluation of WMS in children. Several wheelchair mobility outcome measures have been developed for adults, but none of these have been validated in children. Therefore the objective of this study is to develop a WMS outcome measure for children using the current knowledge from literature in combination with the clinical expertise of health care professionals, children and their parents. Mixed methods approach. Phase 1: Item identification of WMS items through a systematic review using the 'COnsensus-based Standards for the selection of health Measurement Instruments' (COSMIN) recommendations. Phase 2: Item selection and validation of relevant WMS items for children, using a focus group and interviews with children using a manual wheelchair, their parents and health care professionals. Phase 3: Feasibility of the newly developed Utrecht Pediatric Wheelchair Mobility Skills Test (UP-WMST) through pilot testing. Phase 1: Data analysis and synthesis of nine WMS related outcome measures showed there is no widely used outcome measure with levels of evidence across all measurement properties. However, four outcome measures showed some levels of evidence on reliability and validity for adults. Twenty-two WMS items with the best clinimetric properties were selected for further analysis in phase 2. Phase 2: Fifteen items were deemed as relevant for children, one item needed adaptation and six items were considered not relevant for assessing WMS in children. Phase 3: Two health care professionals administered the UP-WMST in eight children. The instructions of the UP-WMST were clear, but the scoring method of the height difference items needed adaptation. The outdoor items for rolling over soft surface and the side slope item were excluded in the final version of the UP-WMST due to logistic reasons. The newly developed 15 item UP-WMST is a validated outcome measure which is easy to administer in children using a manual wheelchair. More research regarding reliability, construct validity and responsiveness is warranted before the UP-WMST can be used in practice.

Face Validity of the Single Work Ability Item: Comparison with Objectively Measured Heart Rate Reserve over Several Days

PubMed Central

Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen; Carneiro, Isabella Gomes; Christiansen, Caroline Stordal; Hanisch, Christiana; Holtermann, Andreas

2014-01-01

Purpose: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. Methods: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18–65 years from the cross-sectional “New method for Objective Measurements of physical Activity in Daily living (NOMAD)” study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3–4 days with a total of 5,810 h, including 2,640 working hours. Results: A significant moderate correlation between work ability and %HRR was observed among males (R = −0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR and gender was observed (P = 0.03). Conclusions: The observed association between work ability and objectively measured %HRR over several days among male blue-collar workers supports the face validity of the single work ability item. It is a useful and valid measure of the relation between physical work demands and resources among male blue-collar workers. The contrasting association among females needs to be further investigated. PMID:24840350
The dialysis orders objective structured clinical examination (OSCE): a formative assessment for nephrology fellows

PubMed Central

Prince, Lisa K; Campbell, Ruth C; Gao, Sam W; Kendrick, Jessica; Lebrun, Christopher J; Little, Dustin J; Mahoney, David L; Maursetter, Laura A; Nee, Robert; Saddler, Mark; Watson, Maura A

2018-01-01

Abstract Background Few quantitative nephrology-specific simulations assess fellow competency. We describe the development and initial validation of a formative objective structured clinical examination (OSCE) assessing fellow competence in ordering acute dialysis. Methods The three test scenarios were acute continuous renal replacement therapy, chronic dialysis initiation in moderate uremia and acute dialysis in end-stage renal disease-associated hyperkalemia. The test committee included five academic nephrologists and four clinically practicing nephrologists outside of academia. There were 49 test items (58 points). A passing score was 46/58 points. No item had median relevance less than ‘important’. The content validity index was 0.91. Ninety-five percent of positive-point items were easy–medium difficulty. Preliminary validation was by 10 board-certified volunteers, not test committee members, a median of 3.5 years from graduation. The mean score was 49 [95% confidence interval (CI) 46–51], κ = 0.68 (95% CI 0.59–0.77), Cronbach’s α = 0.84. Results We subsequently administered the test to 25 fellows. The mean score was 44 (95% CI 43–45); 36% passed the test. Fellows scored significantly less than validators (P < 0.001). Of evidence-based questions, 72% were answered correctly by validators and 54% by fellows (P = 0.018). Fellows and validators scored least well on the acute hyperkalemia question. In self-assessing proficiency, 71% of fellows surveyed agreed or strongly agreed that the OSCE was useful. Conclusions The OSCE may be used to formatively assess fellow proficiency in three common areas of acute dialysis practice. Further validation studies are in progress. PMID:29644053
78 FR 13889 - Notice of Intent To Repatriate Cultural Items: Arizona State Museum, University of Arizona...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-03-01

... the Hopi Tribe gives a positive identification to substantiate ownership of these sacred and religious... and religious items as described. These items are identified as sacred and religious objects, and are... definition of sacred objects and objects of cultural patrimony, and repatriation to the Indian tribe stated...
78 FR 64436 - Disposition of Unclaimed Human Remains and Other Cultural Items Discovered on Federal Lands After...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-10-29

..., funerary objects, sacred objects, or objects of cultural patrimony discovered on Federal lands after..., funerary objects, sacred objects, or objects of cultural patrimony (``cultural items'' under NAGPRA) not... responsibility to care for human remains, funerary objects, sacred objects, or objects of cultural patrimony, the...
Imagery encoding and false recognition errors: Examining the role of imagery process and imagery content on source misattributions.

PubMed

Foley, Mary Ann; Foy, Jeffrey; Schlemmer, Emily; Belser-Ehrlich, Janna

2010-11-01

Imagery encoding effects on source-monitoring errors were explored using the Deese-Roediger-McDermott paradigm in two experiments. While viewing thematically related lists embedded in mixed picture/word presentations, participants were asked to generate images of objects or words (Experiment 1) or to simply name the items (Experiment 2). An encoding task intended to induce spontaneous images served as a control for the explicit imagery instruction conditions (Experiment 1). On the picture/word source-monitoring tests, participants were much more likely to report "seeing" a picture of an item presented as a word than the converse particularly when images were induced spontaneously. However, this picture misattribution error was reversed after generating images of words (Experiment 1) and was eliminated after simply labelling the items (Experiment 2). Thus source misattributions were sensitive to the processes giving rise to imagery experiences (spontaneous vs deliberate), the kinds of images generated (object vs word images), and the ways in which materials were presented (as pictures vs words).
Evaluation of Modified Essay Questions (MEQ) and Multiple Choice Questions (MCQ) as a tool for Assessing the Cognitive Skills of Undergraduate Medical Students

PubMed Central

Khan, Moeen-uz-Zafar; Aljarallah, Badr Muhammad

2011-01-01

Objectives: Developing and testing the cognitive skills and abstract thinking of undergraduate medical students are the main objectives of problem based learning. Modified Essay Questions (MEQ) and Multiple Choice Questions (MCQ) may both be designed to test these skills. The objectives of this study were to assess the effectiveness of both forms of questions in testing the different levels of the cognitive skills of undergraduate medical students and to detect any item writing flaws in the questions. Methods: A total of 50 MEQs and 50 MCQs were evaluated. These questions were chosen randomly from various examinations given to different batches of undergraduate medical students taking course MED 411–412 at the Department of Medicine, Qassim University from the years 2005 to 2009. The effectiveness of the questions was determined by two assessors and was defined by the question’s ability to measure higher cognitive skills, as determined by modified Bloom’s taxonomy, and its quality as determined by the presence of item writing flaws. ‘SPSS15’ and ‘Medcalc’ programs were used to tabulate and analyze the data. Results: The percentage of questions testing the level III (problem solving) cognitive skills of the students was 40% for MEQs and 60% for the MCQs; the remaining questions merely assessed the recall and comprehension. No significant difference was found between MEQ and MCQ in relation to the type of questions (recall; comprehension or problem solving x2 = 5.3, p = 0.07).The agreement between the two assessors was quite high in case of MCQ (kappa=0.609; SE 0.093; 95%CI 0.426 – 0.792) but lower in case of MEQ (kappa=0.195; SE 0.073; 95%CI 0.052 – 0.338). 16% of the MEQs and 12% of the MCQs had item writing flaws. Conclusion: A well constructed MCQ is superior to MEQ in testing the higher cognitive skills of undergraduate medical students in a problem based learning setup. Constructing an MEQ for assessing the cognitive skills of a student is not a simple task and is more frequently associated with item writing flaws. PMID:22489228
Competitive control of cognition in rhesus monkeys.

PubMed

Kowaguchi, Mayuka; Patel, Nirali P; Bunnell, Megan E; Kralik, Jerald D

2016-12-01

The brain has evolved different approaches to solve problems, but the mechanisms that determine which approach to take remain unclear. One possibility is that control progresses from simpler processes, such as associative learning, to more complex ones, such as relational reasoning, when the simpler ones prove inadequate. Alternatively, control could be based on competition between the processes. To test between these possibilities, we posed the support problem to rhesus monkeys using a tool-use paradigm, in which subjects could pull an object (the tool) toward themselves to obtain an otherwise out-of-reach goal item. We initially provided one problem exemplar as a choice: for the correct option, a food item placed on the support tool; for the incorrect option, the food item placed off the tool. Perceptual cues were also correlated with outcome: e.g., red, triangular tool correct, blue, rectangular tool incorrect. Although the monkeys simply needed to touch the tool to register a response, they immediately pulled it, reflecting a relational reasoning process between themselves and another object (R self-other ), rather than an associative one between the arbitrary touch response and reward (A resp-reward ). Probe testing then showed that all four monkeys used a conjunction of perceptual features to select the correct option, reflecting an associative process between stimuli and reward (A stim-reward ). We then added a second problem exemplar and subsequent testing revealed that the monkeys switched to using the on/off relationship, reflecting a relational reasoning process between two objects (R other-other ). Because behavior appeared to reflect R self-other rather than A resp-reward , and A stim-reward prior to R other-other , our results suggest that cognitive processes are selected via competitive control dynamics. Copyright © 2016 Elsevier B.V. All rights reserved.
A drop in pediatric subject examination scores after curriculum changes that emphasize general pediatric topics.

PubMed

Potts, M J; Phelan, K W

1997-09-01

To determine whether emphasizing a limited number of general pediatric objectives and using a test based on them would improve student knowledge of the topic areas. Before-after trial. Community-based medical school. Third-year medical students on a required clerkship in pediatrics. Six core objectives: recognizing the seriously ill child, stabilizing such a child, fluid and electrolyte requirements and therapy, newborn care, well child care, and variability of normal vital signs in children based on their age were defined and a modified essay examination was constructed. The test was given to pediatric students close to the end of their clerkship. In study year 1, no warning was given about the examination and results did not affect student grades. In study year 2, passing all items was a requirement and failure required remedial oral examination of any missed items. All students completed the National Board of Medical Examiners pediatric subject examination. For 7 of 8 essay items, significant increases in numbers of students passing were seen in study year 2, but students scored 51 points lower on the National Board of Medical Examiners pediatric subject examination (P=.002). The decrease in scores was not seen in any other clerkship or among pediatric students from a different campus of the medical school. Emphasis on core objectives and an essay examination significantly improved students' knowledge of the defined topics but decreased the scores on the National Board of Medical Examiners subject examination. This may be attributable to a difference in content between the 2 tests. Faculty proposing new curriculum guidelines need to review student assessment methods to avoid such unexpected changes in scores.
Small-Item Vapor Test Method, FY11 Release

DTIC Science & Technology

2012-07-01

to this test procedure is provided alphabetically in the following list: absorption: The uptake of a contaminant INTO the volume of a material. The... powders , wipes), or gas-phase (fumigants, including aerosols). decontamination process: The process of making any person, object, or area safe by...with another contaminant. Generally, bare metals and glass are nonsorptive materials for some agents. operational decontamination: Decontamination
Design and development of food safety knowledge and attitude scales for consumer food safety education.

PubMed

Medeiros, Lydia C; Hillers, Virginia N; Chen, Gang; Bergmann, Verna; Kendall, Patricia; Schroeder, Mary

2004-11-01

The objective of this study was to design and develop food safety knowledge and attitude scales based on food-handling guidelines developed by a national panel of food safety experts. Knowledge (n=43) and attitude (n=49) questions were developed and pilot-tested with a variety of consumer groups. Final questions were selected based on item analysis and on validity and reliability statistical tests. Knowledge questions were tested in Washington State with participants in low-income nutrition education programs (pretest/posttest n=58, test/retest n=19) and college students (pretest/posttest n=34). Attitude questions were tested in Ohio with nutrition education program participants (n=30) and college students (non-nutrition majors n=138, nutrition majors n=57). Item analysis, paired sample t tests, Pearson's correlation coefficients, and Cronbach's alpha were used. Reliability and validity tests of individual items and the question sets were used to reduce the scales to 18 knowledge questions and 10 attitude questions. The knowledge and attitude scales covered topics ranked as important by a national panel of experts and met most validity and reliability standards. The 18-item knowledge questionnaire had instructional sensitivity (mean score increase of more than three points after instruction), internal reliability (Cronbach's alpha >.75), and produced similar results in test-retest without intervention (coefficient of stability=.81). Knowledge of correct procedures for hand washing and avoiding cross-contamination was widespread before instruction. Knowledge was limited regarding avoiding food preparation while ill, cooking hamburgers, high-risk foods, and whether cooked rice and potatoes could be stored at room temperature. The 10-item attitude scale had an appropriate range of responses (item difficulty) and produced similar results in test-retest ( P
Components of a Measure to Describe Organizational Culture in Academic Pharmacy

PubMed Central

Rosenthal, Meagen; Holmes, Erin R.; Andrews, Brienna; Lui, Julia; Raja, Leela

2017-01-01

Objective. To develop a measure of organizational culture in academic pharmacy and identify characteristics of an academic pharmacy program that would be impactful for internal (eg, students, employees) and external (eg, preceptors, practitioners) clients of the program. Methods. A three-round Delphi procedure of 24 panelists from pharmacy schools in the U.S. and Canada generated items based on the Organizational Culture Profile (OCP), which were then evaluated and refined for inclusion in subsequent rounds. Items were assessed for appropriateness and impact. Results. The panel produced 35 items across six domains that measured organizational culture in academic pharmacy: competitiveness, performance orientation, social responsibility, innovation, emphasis on collegial support, and stability. Conclusion. The items generated require testing for validation and reliability in a large sample to finalize this measure of organizational culture. PMID:29367768
Measurement in Sensory Modulation: The Sensory Processing Scale Assessment

PubMed Central

Miller, Lucy J.; Sullivan, Jillian C.

2014-01-01

OBJECTIVE. Sensory modulation issues have a significant impact on participation in daily life. Moreover, understanding phenotypic variation in sensory modulation dysfunction is crucial for research related to defining homogeneous groups and for clinical work in guiding treatment planning. We thus evaluated the new Sensory Processing Scale (SPS) Assessment. METHOD. Research included item development, behavioral scoring system development, test administration, and item analyses to evaluate reliability and validity across sensory domains. RESULTS. Items with adequate reliability (internal reliability >.4) and discriminant validity (p < .01) were retained. Feedback from the expert panel also contributed to decisions about retaining items in the scale. CONCLUSION. The SPS Assessment appears to be a reliable and valid measure of sensory modulation (scale reliability >.90; discrimination between group effect sizes >1.00). This scale has the potential to aid in differential diagnosis of sensory modulation issues. PMID:25184464
The picture superiority effect in a cross-modality recognition task.

PubMed

Stenbert, G; Radeborg, K; Hedman, L R

1995-07-01

Words and pictures were studied and recognition tests given in which each studied object was to be recognized in both word and picture format. The main dependent variable was the latency of the recognition decision. The purpose was to investigate the effects of study modality (word or picture), of congruence between study and test modalities, and of priming resulting from repeated testing. Experiments 1 and 2 used the same basic design, but the latter also varied retention interval. Experiment 3 added a manipulation of instructions to name studied objects, and Experiment 4 deviated from the others by presenting both picture and word referring to the same object together for study. The results showed that congruence between study and test modalities consistently facilitated recognition. Furthermore, items studied as pictures were more rapidly recognized than were items studied as words. With repeated testing, the second instance was affected by its predecessor, but the facilitating effect of picture-to-word priming exceeded that of word-to-picture priming. The finds suggest a two- stage recognition process, in which the first is based on perceptual familiarity and the second uses semantic links for a retrieval search. Common-code theories that grant privileged access to the semantic code for pictures or, alternatively, dual-code theories that assume mnemonic superiority for the image code are supported by the findings. Explanations of the picture superiority effect as resulting from dual encoding of pictures are not supported by the data.
The objects of visuospatial short-term memory: Perceptual organization and change detection.

PubMed

Nikolova, Atanaska; Macken, Bill

2016-01-01

We used a colour change-detection paradigm where participants were required to remember colours of six equally spaced circles. Items were superimposed on a background so as to perceptually group them within (a) an intact ring-shaped object, (b) a physically segmented but perceptually completed ring-shaped object, or (c) a corresponding background segmented into three arc-shaped objects. A nonpredictive cue at the location of one of the circles was followed by the memory items, which in turn were followed by a test display containing a probe indicating the circle to be judged same/different. Reaction times for correct responses revealed a same-object advantage; correct responses were faster to probes on the same object as the cue than to equidistant probes on a segmented object. This same-object advantage was identical for physically and perceptually completed objects, but was only evident in reaction times, and not in accuracy measures. Not only, therefore, is it important to consider object-level perceptual organization of stimulus elements when assessing the influence of a range of factors (e.g., number and complexity of elements) in visuospatial short-term memory, but a more detailed picture of the structure of information in memory may be revealed by measuring speed as well as accuracy.
The objects of visuospatial short-term memory: Perceptual organization and change detection

PubMed Central

Nikolova, Atanaska; Macken, Bill

2016-01-01

We used a colour change-detection paradigm where participants were required to remember colours of six equally spaced circles. Items were superimposed on a background so as to perceptually group them within (a) an intact ring-shaped object, (b) a physically segmented but perceptually completed ring-shaped object, or (c) a corresponding background segmented into three arc-shaped objects. A nonpredictive cue at the location of one of the circles was followed by the memory items, which in turn were followed by a test display containing a probe indicating the circle to be judged same/different. Reaction times for correct responses revealed a same-object advantage; correct responses were faster to probes on the same object as the cue than to equidistant probes on a segmented object. This same-object advantage was identical for physically and perceptually completed objects, but was only evident in reaction times, and not in accuracy measures. Not only, therefore, is it important to consider object-level perceptual organization of stimulus elements when assessing the influence of a range of factors (e.g., number and complexity of elements) in visuospatial short-term memory, but a more detailed picture of the structure of information in memory may be revealed by measuring speed as well as accuracy. PMID:26286369
Development and testing of the Multidimensional Trust in Health Care Systems Scale.

PubMed

Egede, Leonard E; Ellis, Charles

2008-06-01

To describe the development and psychometric testing of the Multidimensional Trust in Health Care Systems Scale (MTHCSS). Scale development occurred in 2 phases. In phase 1, a pilot instrument with 70 items was generated from the review of the trust literature, focus groups, and expert opinion. The 70 items were pilot tested in a sample of 256 students. Exploratory factor analysis was used to derive an orthogonal set of correlated factors. In phase 2, the final scale was administered to 301 primary care patients to assess reliability and validity. Phase 2 participants also completed validated measures of patient-centered care, health locus of control, medication nonadherence, social support, and patient satisfaction. In phase 1, a 17-item scale (MTHCSS) was developed with 10 items measuring trust in health care providers, 4 items measuring trust in health care payers, and 3 items measuring trust in health care institutions. In phase 2, the 17-item MTHCSS had a mean score of 63.0 (SD 8.8); the provider subscale had a mean of 40.0 (SD 6.2); the payers subscale had a mean of 12.8 (SD 3.0); and the institutions subscale had a mean of 10.3 (SD 2.1). Cronbach's alpha for the MTHCSS was 0.89 and 0.92, 0.74, and 0.64 for the 3 subscales. The MTHCSS was significantly correlated with patient-centered care (r = .22 to .62), locus of control-chance (r = .42), medication nonadherence (r = -.22), social support (r = .25), and patient satisfaction (r = .67). The MTHCSS is a valid and reliable instrument for measuring the 3 objects of trust in health care and is correlated with patient-level health outcomes.
78 FR 50107 - Notice of Intent To Repatriate Cultural Items: University of Colorado Museum of Natural History...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-08-16

... organizations, has determined that the cultural items listed in this notice meet the definition of sacred... University of Colorado Museum of Natural History, Boulder, CO that meet the definition of sacred objects and.... Dr. Wheat acquired this item from an unknown individual. The sacred object and object of cultural...
Capacity and precision in an animal model of visual short-term memory

PubMed Central

Lara, Antonio H.; Wallis, Jonathan D.

2013-01-01

Temporary storage of information in visual short-term memory (VSTM) is a key component of many complex cognitive abilities. However, it is highly limited in capacity. Understanding the neurophysiological nature of this capacity limit will require a valid animal model of VSTM. We used a multiple-item color change detection task to measure macaque monkeys’ VSTM capacity. Subjects’ performance deteriorated and reaction times increased as a function of the number of items in memory. Additionally, we measured the precision of the memory representations by varying the distance between sample and test colors. In trials with similar sample and test colors, subjects made more errors compared to trials with highly discriminable colors. We modeled the error distribution as a Gaussian function and used this to estimate the precision of VSTM representations. We found that as the number of items in memory increases the precision of the representations decreases dramatically. Additionally, we found that focusing attention on one of the objects increases the precision with which that object is stored and degrading the precision of the remaining. These results are in line with recent findings in human psychophysics and provide a solid foundation for understanding the neurophysiological nature of the capacity limit of VSTM. PMID:22419756
Impaired integration of object knowledge and visual input in a case of ventral simultanagnosia with bilateral damage to area V4.

PubMed

Leek, E Charles; d'Avossa, Giovanni; Tainturier, Marie-Josèphe; Roberts, Daniel J; Yuen, Sung Lai; Hu, Mo; Rafal, Robert

2012-01-01

This study examines how brain damage can affect the cognitive processes that support the integration of sensory input and prior knowledge during shape perception. It is based on the first detailed study of acquired ventral simultanagnosia, which was found in a patient (M.T.) with posterior occipitotemporal lesions encompassing V4 bilaterally. Despite showing normal object recognition for single items in both accuracy and response times (RTs), and intact low-level vision assessed across an extensive battery of tests, M.T. was impaired in object identification with overlapping figures displays. Task performance was modulated by familiarity: Unlike controls, M.T. was faster with overlapping displays of abstract shapes than with overlapping displays of common objects. His performance with overlapping common object displays was also influenced by both the semantic relatedness and visual similarity of the display items. These findings challenge claims that visual perception is driven solely by feedforward mechanisms and show how brain damage can selectively impair high-level perceptual processes supporting the integration of stored knowledge and visual sensory input.
[Ageism: adaptation of the Fraboni of Ageism Scale-Revised to the French language and testing the effects of empathy, social dominance orientation and dogmatism on ageism].

PubMed

Boudjemad, Valérian; Gana, Kamel

2009-12-01

ABSTRACTThis article presents two studies dealing with ageism. The objective of the first study was to adapt to French language and validate the Fraboni of Ageism Scale-Revised (FSA-R) which contains 23 items, while the objective of the second study was to test a structural model containing ageism as measured by the FSA-R and the "Big Three": empathy, social dominance orientation, and dogmatism, controlled for by sex and age. The results of the first study (n = 323) generated a version of the FSA-R comprising 14 items, of which the psychometric properties were very satisfactory. Using structural equation modelling and bootstrap procedure, the results of the second study (n = 284) showed a direct negative and significant effect of empathy on agism. They also showed that this negative effect was mediated by dogmatism and social dominance orientation, which both exerted a positive effect on ageism.

Rasch analysis of the Patient Rated Elbow Evaluation questionnaire.

PubMed

Vincent, Joshua I; MacDermid, Joy C; King, Graham J W; Grewal, Ruby

2015-06-20

The Patient Rated Elbow Evaluation (PREE) was developed as an elbow joint specific measure of pain and disability and validated with classical psychometric methods. More recently, Rasch analysis has contributed new methods for analyzing the clinical measurement properties of self-report outcome measures. The objective of the study was to determine aspects of validity of the PREE using the Rasch model to assess the overall fit of the PREE data, the response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 236 patients (Age range 21-79 years; M: F- 97:139) with elbow disorders were recruited from the Roth│McFarlane Hand and Upper Limb Centre, London, Ontario, Canada. The baseline scores of the PREE were used. Rasch analysis was conducted using RUMM 2030 software on the 3 sub scales of the PREE separately. The 3 sub scales showed misfit initially with disordered thresholds on17 out of 20 items), uniform DIF was observed for two items ("Carrying a 10lbs object" from specific activities subscale for age group; and "household work" from the usual activities subscale for gender); multidimensionality and local dependency. The Pain subscale satisfied Rasch expectations when item 2 "Pain - At rest" was split for age group, while the usual activities subscale readily stood up to Rasch requirements when the item 2 "household work" was split for gender. The specific activities subscale demonstrated fit to the Rasch model when sub test analysis accounted for local dependency. All three subscales of the PREE were well targeted and had high reliability (PSI >0.80). The three subscales of the PREE appear to be robust when tested against the Rasch model when subject to a few alterations. The value of changing the 0-10 format is questionable given its widespread use; further Rasch-based analysis of whether these findings are stable in other samples is warranted.
Rasch analysis of three dry eye questionnaires and correlates with objective clinical tests.

PubMed

McAlinden, Colm; Gao, Rongrong; Wang, Qinmei; Zhu, Senmiao; Yang, Jing; Yu, Ayong; Bron, Anthony J; Huang, Jinhai

2017-04-01

To assess the psychometric properties of Chinese versions of the Ocular Comfort Index (OCI), Ocular Surface Disease Index (OSDI) and McMonnies questionnaires. Further, to assess the correlation between questionnaire scores and objective dry eye disease (DED) clinical tests. Translated versions of the OCI, OSDI and McMonnies questionnaires were completed in a random order by 238 participants with DED. Objective clinical tests included visual acuity (VA), fluorescein tear film break-up time (TBUT), corneal fluorescein staining, Schirmer I testing and meibomian gland grading. Rasch analysis was used to assess questionnaire psychometrics and spearman rank for correlations. For the OCI, the person separation was 2.31, item infit and outfit statistics ranged from 0.74-1.14 and 0.75-1.32, respectively, and targeting 1.54 logits. For the OSDI, person separation was 0.94. None of the three subscales provided valid measurements based on Rasch analysis. For the McMonnies questionnaire, person separation was 1.17, item infit and outfit statistics ranged from 0.7 to 1.21 and 0.51-3.49, respectively. There were weak correlations between questionnaire scores and clinical tests. There were weak correlations between OSDI scores and VA, fluorescein TBUT, Schirmer I testing and corneal fluorescein staining. There were weak correlations between McMonnies scores and VA, fluorescein TBUT, Schirmer I testing, and corneal fluorescein staining and meibomian gland grading. The OCI questionnaire was the only questionnaire that provided valid measurement on the basis of Rasch analysis, although slight multidimensionality was found. There were weak correlations between OCI scores and fluorescein TBUT, Schirmer I testing, and corneal fluorescein staining. Due to this paradoxical disconnect between symptoms and signs and the repeatability of tests, the use of both subjective and objective markers in the clinical management of patients or as endpoints in clinical trials would appear prudent. Copyright © 2017 Elsevier Inc. All rights reserved.
Psychometric properties of the Brisbane Burn Scar Impact Profile in adults with burn scars

PubMed Central

Kimble, Roy; McPhail, Steven; Plaza, Anita; Simons, Megan

2017-01-01

Objective The aim of the study was to determine the longitudinal validity, reproducibility, responsiveness and interpretability of the adult version of the Brisbane Burn Scar Impact Profile, a patient-report measure of health-related quality of life. Methods A prospective longitudinal cohort study of patients with or at risk of burn scarring was conducted at three assessment points (at baseline around the time of wound healing, one to two weeks post-baseline and 1-month post-baseline). Participants attending a major metropolitan adult burn centre at baseline were recruited. Participants completed the Brisbane Burn Scar Impact Profile and the 36-item Short Form Health Survey and Patient Observer Scar Assessment Scale. Intraclass Correlation Coefficients (ICCs), smallest detectable change, percentage of those who improved, stayed the same or worsened and Area under the Receiver Operating Characteristic Curve (AUC) were used to test the aim. Results Data were included for 118 participants at baseline, 68 participants at one to two weeks and 57 participants at 1-month post-baseline. All groups of items had acceptable reproducibility, except for the overall impact of burn scars (ICC = 0.69), the impact of sensations which was not expected to be stable (ICC = 0.63), mobility and daily activities (ICC = 0.63, 0.67 respectively). The responsiveness of six out of seven groups of items able to be tested against external criterion was supported (AUC = 0.72–0.75). Hypothesised correlations of changes in the Brisbane Burn Scar Impact Profile items with changes in criterion measures generally supported longitudinal validity (e.g., nine out of thirteen hypotheses using the SF-36 as an external criterion were supported). Internal consistency estimates, item-total and inter-item correlations indicated there was likely redundancy of some groups of items, particularly in the relationships and social interaction, appearance and emotional reactions items (Chronbach’s alpha range = 0.94–0.95). Conclusion Support was found for the reproducibility, longitudinal validity, responsiveness and interpretability of most groups of Brisbane Burn Scar Impact Profile items and some individual items in the test population. Potential redundancy of items should be investigated further. PMID:28902874
A New Functional Health Literacy Scale for Japanese Young Adults Based on Item Response Theory.

PubMed

Tsubakita, Takashi; Kawazoe, Nobuo; Kasano, Eri

2017-03-01

Health literacy predicts health outcomes. Despite concerns surrounding the health of Japanese young adults, to date there has been no objective assessment of health literacy in this population. This study aimed to develop a Functional Health Literacy Scale for Young Adults (funHLS-YA) based on item response theory. Each item in the scale requires participants to choose the most relevant term from 3 choices in relation to a target item, thus assessing objective rather than perceived health literacy. The 20-item scale was administered to 1816 university students and 1751 responded. Cronbach's α coefficient was .73. Difficulty and discrimination parameters of each item were estimated, resulting in the exclusion of 1 item. Some items showed different difficulty parameters for male and female participants, reflecting that some aspects of health literacy may differ by gender. The current 19-item version of funHLS-YA can reliably assess the objective health literacy of Japanese young adults.
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing

PubMed Central

Huang, Wenhao; Chapman-Novakofski, Karen M

2017-01-01

Background The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. Objective The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps’ educational quality and technical functionality. Methods Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Results Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Conclusions Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps’ qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. PMID:29079554
Validity and reliability of the TED-QOL: a new three-item questionnaire to assess quality of life in thyroid eye disease.

PubMed

Fayers, Tessa; Dolman, Peter J

2011-12-01

To develop and test a user-friendly questionnaire for rapidly assessing quality of life (QOL) in thyroid eye disease (TED). A three-item questionnaire, the TED-QOL, was designed and compared to the 16-item Graves Ophthalmopathy (GO)-QOL and the nine-item GO-Quality of Life Scale (QLS). 100 patients with TED were administered all three questionnaires on two occasions. Results were compared to clinical severity scores (Vision, Inflammation, Strabismus, Appearance (VISA) classification). Main outcomes were construct and criterion validity, test-retest reliability, duration, comprehension and completion rates. TED-QOL correlated strongly with the other questionnaires for corresponding items (Pearson correlation: appearance 0.71, 0.62; functioning 0.69, 0.66; overall QOL 0.53). Test-retest analysis demonstrated good reliability for all three questionnaires (intraclass correlations: TED-QOL 0.81, 0.74, 0.87; GO-QOL 0.81, 0.82; GO-QLS 0.74, 0.86, 0.67). TED-QOL was significantly faster to complete (1.6 min vs GO-QOL 3.1 min, GO-QLS 2.7 min, p<0.0001) and had a higher completion rate (100% vs GO-QOL 78%, GO-QLS 94%). There was only moderate correlation between items on all three questionnaires and VISA scores. The TED-QOL is rapid and easy to complete and analyse and has similar validity and reliability to longer questionnaires. All questionnaires showed only moderate correlation with disease severity, emphasising the discrepancy between objective and subjective assessments and the importance of measuring both.
Development of the Facial Skin Care Index: A Health-Related Outcomes Index for Skin Cancer Patients

PubMed Central

Matthews, B. Alex; Rhee, John S.; Neuburg, Marcy; Burzynski, Mary L.; Nattinger, Ann B.

2006-01-01

BACKGROUND Existing health-related quality-of-life (HRQOL) tools do not appear to capture patients' specific skin cancer concerns. OBJECTIVE To describe the conceptual foundation, item generation, reduction process, and reliability testing for the Facial Skin Cancer Index (FSCI), a HRQOL outcomes tool for skin cancer researchers and clinicians. METHODS Participants in Phases I to III consisted of adult patients (N = 134) diagnosed with biopsy-proven nonmelanoma cervicofacial skin cancer. Data were collected via self-report surveys and clinical records. RESULTS Seventy-one distinct items were generated in Phase I and rated for their importance by an independent sample during Phase II; 36 items representing six theoretical HRQOL domains were retained. Test–retest I results indicated that four subscales showed adequate reliability coefficients (α = 0.60 to 0.91). Twenty-six items remained for test–retest II. Results indicated excellent internal consistency for emotional, social, appearance, and modified financial/work subscales (range 0.79 to 0.95); test–retest correlation coefficients were consistent across time (range 0.81 to 0.97; lifestyle omitted). CONCLUSION Pretesting afforded the opportunity to select items that optimally met our a priori conceptual and psychometric criteria for high data quality. Phase IV testing (validity and sensitivity before surgery and 4 months after Mohs micrographic surgery) for the 20-item FSCI is under way. PMID:16875475
A New Approach to Response Sets in Analysis of a Test of Motivation to Achieve. A Section of the Final Report for 1969-70.

ERIC Educational Resources Information Center

Adkins, Dorothy C.; Ballif, Bonnie L.

Gumpgookies, an objective-projective test of school achievement motivation for children 3 1/2 to 8 year, was reduced from 100 to 75 items following extensive factor analyses. This revised test attempted to dissipate the effects of response sets of the subjects and was prepared in three versions--an individual form, a group form for non-readers,…
Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts.

PubMed

Lawson, Rebecca

2014-02-01

The limits of generalization of our 3-D shape recognition system to identifying objects by touch was investigated by testing exploration at unusual locations and using untrained effectors. In Experiments 1 and 2, people found identification by hand of real objects, plastic 3-D models of objects, and raised line drawings placed in front of themselves no easier than when exploration was behind their back. Experiment 3 compared one-handed, two-handed, one-footed, and two-footed haptic object recognition of familiar objects. Recognition by foot was slower (7 vs. 13 s) and much less accurate (9 % vs. 47 % errors) than recognition by either one or both hands. Nevertheless, item difficulty was similar across hand and foot exploration, and there was a strong correlation between an individual's hand and foot performance. Furthermore, foot recognition was better with the largest 20 of the 80 items (32 % errors), suggesting that physical limitations hampered exploration by foot. Thus, object recognition by hand generalized efficiently across the spatial location of stimuli, while object recognition by foot seemed surprisingly good given that no prior training was provided. Active touch (haptics) thus efficiently extracts 3-D shape information and accesses stored representations of familiar objects from novel modes of input.
Young children's fast mapping and generalization of words, facts, and pictograms.

PubMed

Deák, Gedeon O; Toney, Alexis J

2013-06-01

To test general and specific processes of symbol learning, 4- and 5-year-old children learned three kinds of abstract associates for novel objects: words, facts, and pictograms. To test fast mapping (i.e., one-trial learning) and subsequent learning, comprehension was tested after each of four exposures. Production was also tested, as was children's tendency to generalize learned items to new objects in the same taxon. To test for a bias toward mutually exclusive associations, children learned either one-to-one or many-to-many mappings. In Experiment 1, children learned words, facts (with or without incidental novel words), or pictograms. In Experiment 2, children learned words or pictograms. In both of these experiments, children learned words slower than facts and pictograms. Pictograms and facts were generalized more systematically than words, but only in Experiment 1. Children learned one-to-one mappings faster only in Experiment 2, when cognitive load was increased. In Experiment 3, 3- and 4-year-olds were taught facts (with novel words), words, and pictograms. Children learned facts faster than words; however, they remembered all items equally well a week later. The results suggest that word learning follows non-specialized memory and associative learning processes. Copyright © 2013 Elsevier Inc. All rights reserved.
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project.

PubMed

Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes

2012-08-13

Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement.All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
75 FR 14460 - Notice of Intent to Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2010-03-25

... National Park, WY, that meet the definition of ``sacred objects'' under 25 U.S.C. 3001. This notice is... three cultural items as ``sacred objects'' coming from the Cattaraugus Reservation. The three items are... faces'', are sacred objects which belong to a society which still functions at the Newtown Longhouse on...
76 FR 80388 - Notice of Intent to Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-12-23

... cultural items meet the definition of sacred objects and repatriation to the lineal descendant stated below... descendants of the individual who owned these sacred objects and who wish to claim the items should contact... they are lineal descendants of the individual who owned these sacred objects and who wish to claim the...
77 FR 34986 - Notice of Intent To Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-06-12

... appropriate Indian tribes, has determined that the cultural items meet the definition of sacred objects and... individuals who believe they are lineal descendants of the individual who owned these sacred objects and who... descendants of the individual who owned these sacred objects and who wish to claim the items should contact...
77 FR 23497 - Notice of Intent To Repatriate Cultural Items: Benton County Historical Society and Museum...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-19

... for all nine cultural items and that these cultural items are sacred objects that have religious... sacred objects and repatriation to the Indian tribe stated below may occur if no additional claimants..., Philomath, OR, that meet the definition of sacred objects under 25 U.S.C. 3001. This notice is published as...
Contamination by ten harmful elements in toys and children's jewelry bought on the North American market.

PubMed

Guney, Mert; Zagury, Gerald J

2013-06-04

Toys and children's jewelry may contain metals to which children can be orally exposed. The objectives of this research were (1) to determine total concentrations (TC's) of As, Ba, Cd, Cr, Cu, Mn, Ni, Pb, Sb, and Se in toys and jewelry (n = 72) bought on the North American market and compare TC's to regulatory limits, and (2) to estimate oral metal bioavailability in selected items (n = 4) via bioaccessibility testing. For metallic toys and children's jewelry (n = 24) 20 items had TC's exceeding migratable concentration limits (European Union). Seven of seventeen jewelry items did not comply with TC limits in U.S. and Canadian regulations. Samples included articles with very high Cd (37% [w/w]), Pb (65%), and Cu (71%) concentrations. For plastic toys (n = 18), toys with paint or coating (n = 12), and brittle or pliable toys (n = 18), TC's were below the EU migration limits (except in one toy for each category). Bioaccessibility tests showed that a tested jewelry item strongly leached Pb (gastric: 698 μg, intestinal: 705 μg) and some Cd (1.38 and 1.42 μg). Especially in metallic toys and jewelry, contamination by Pb and Cd, and to a lesser extent by Cu, Ni, As, and Sb, still poses an acute problem in North America.
Reliability and validity of a scale to measure consumer attitudes regarding the private food safety certification of restaurants.

PubMed

Uggioni, Paula Lazzarin; Salay, Elisabete

2012-04-01

Validated and reliable instruments for measuring consumer attitudes regarding food quality certifications are lacking, but the measurement of consumer attitude could be an important tool for understanding consumer behavior. Thus the objective of this study was to develop an instrument for measuring consumer attitudes regarding private food safety certifications for commercial restaurants. To this end, the following steps were carried out: development of the interview items; complete pilot testing; item analyses (influence of social desirability and total-item correlation); reliability test (internal consistency and test-retest); and validity assessment (content and discriminative validity and exploratory and confirmatory factor analysis). The subjects, all over the age of 18 and drawn from six non-probabilistic samples (n=7-350) in the city of Campinas, Brazil, were all subjected to an interview. The final scale included 24 items and had a Cronbach's alpha coefficient of 0.79 and a content validation coefficient of 0.99, both within acceptable limits. The confirmatory factor analysis validated a model with five factors and the final instrument discriminated reasonably well between the groups and showed satisfactory reproducibility (r=0.955). Furthermore, the scale validity and reliability were satisfactory, suggesting it could also be applied to future studies. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Improving Inpatient Surveys: Web-Based Computer Adaptive Testing Accessed via Mobile Phone QR Codes

PubMed Central

2016-01-01

Background The National Health Service (NHS) 70-item inpatient questionnaire surveys inpatients on their perceptions of their hospitalization experience. However, it imposes more burden on the patient than other similar surveys. The literature shows that computerized adaptive testing (CAT) based on item response theory can help shorten the item length of a questionnaire without compromising its precision. Objective Our aim was to investigate whether CAT can be (1) efficient with item reduction and (2) used with quick response (QR) codes scanned by mobile phones. Methods After downloading the 2008 inpatient survey data from the Picker Institute Europe website and analyzing the difficulties of this 70-item questionnaire, we used an author-made Excel program using the Rasch partial credit model to simulate 1000 patients’ true scores followed by a standard normal distribution. The CAT was compared to two other scenarios of answering all items (AAI) and the randomized selection method (RSM), as we investigated item length (efficiency) and measurement accuracy. The author-made Web-based CAT program for gathering patient feedback was effectively accessed from mobile phones by scanning the QR code. Results We found that the CAT can be more efficient for patients answering questions (ie, fewer items to respond to) than either AAI or RSM without compromising its measurement accuracy. A Web-based CAT inpatient survey accessed by scanning a QR code on a mobile phone was viable for gathering inpatient satisfaction responses. Conclusions With advances in technology, patients can now be offered alternatives for providing feedback about hospitalization satisfaction. This Web-based CAT is a possible option in health care settings for reducing the number of survey items, as well as offering an innovative QR code access. PMID:26935793
Development of the Primary Care Quality-Homeless (PCQ-H) Instrument: A Practical Survey of Patients' Experiences in Primary Care

PubMed Central

Kertesz, Stefan. G.; Pollio, David E.; Jones, Richard N.; Steward, Jocelyn; Stringfellow, Erin J.; Gordon, Adam J.; Johnson, Nancy K.; Kim, Theresa A.; Granstaff, Unita; Austin, Erika L.; Young, Alexander S.; Golden, Joya; Davis, Lori L.; Roth, David L.; Holt, Cheryl L.

2015-01-01

Background Homeless patients face unique challenges in obtaining primary care responsive to their needs and context. Patient experience questionnaires could permit assessment of patient-centered medical homes for this population, but standard instruments may not reflect homeless patients' priorities and concerns. Objectives This report describes (a) the content and psychometric properties of a new primary care questionnaire for homeless patients and (b) the methods utilized in its development. Methods Starting with quality-related constructs from the Institute of Medicine, we identified relevant themes by interviewing homeless patients and experts in their care. A multidisciplinary team drafted a preliminary set of 78 items. This was administered to homeless-experienced clients (n=563) across 3 VA facilities and 1 non-VA Health Care for the Homeless Program. Using Item Response Theory, we examined Test Information Function curves to eliminate less informative items and devise plausibly distinct subscales. Results The resulting 33-item instrument (Primary Care Quality-Homeless, PCQ-H) has four subscales: Patient-Clinician Relationship (15 items), Cooperation among Clinicians (3 items), Access/Coordination (11 items) and Homeless-Specific Needs (4 items). Evidence for divergent and convergent validity is provided. Test Information Function (TIF) graphs showed adequate informational value to permit inferences about groups for 3 subscales (Relationship, Cooperation and Access/Coordination). The 3-item Cooperation subscale had lower informational value (TIF<5) but had good internal consistency (alpha=0.75) and patients frequently reported problems in this aspect of care. Conclusions Systematic application of qualitative and quantitative methods supported the development of a brief patient-reported questionnaire focused on the primary care of homeless patients and offers guidance for future population-specific instrument development. PMID:25023918
Mental health in primary care: an evaluation using the Item Response Theory.

PubMed

Rocha, Hugo André da; Santos, Alaneir de Fátima Dos; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal

2018-01-01

OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach's alpha, Spearman's correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach's alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions.

Development of a Brief Questionnaire to Assess Contraceptive Intent

PubMed Central

Raine-Bennett, Tina R; Rocca, Corinne H

2015-01-01

Objective We sought to develop and validate an instrument that can enable providers to identify young women who may be at risk of contraceptive non-adherence. Methods Item response theory based methods were used to evaluate the psychometric properties of the Contraceptive Intent Questionnaire, a 15-item self-administered questionnaire, based on theory and prior qualitative and quantitative research. The questionnaire was administered to 200 women aged 15–24 years who were initiating contraceptives. We assessed item fit to the item response model, internal consistency, internal structure validity, and differential item functioning. Results All items fit a one-dimensional model. The separation reliability coefficient was 0.73. Participants’ overall scores covered the full range of the scale (0–15), and items appropriately matched the range of participants’ contraceptive intent. Items met the criteria for internal structure validity and most items functioned similarly between groups of women. Conclusion The Contraceptive Intent Questionnaire appears to be a reliable and valid tool. Future testing is needed to assess predictive ability and clinical utility. Practice Implications The Contraceptive Intent Questionnaire may serve as a valid tool to help providers identify women who may have problems with contraceptive adherence, as well as to pinpoint areas in which counseling may be directed. PMID:26104994
Validation of a condition-specific measure for women having an abnormal screening mammography.

PubMed

Brodersen, John; Thorsen, Hanne; Kreiner, Svend

2007-01-01

The aim of this study is to assess the validity of a new condition-specific instrument measuring psychosocial consequences of abnormal screening mammography (PCQ-DK33). The draft version of the PCQ-DK33 was completed on two occasions by 184 women who had received an abnormal screening mammography and on one occasion by 240 women who had received a normal screening result. Item Response Theories and Classical Test Theories were used to analyze data. Construct validity, concurrent validity, known group validity, objectivity and reliability were established by item analysis examining the fit between item responses and Rasch models. Six dimensions covering anxiety, behavioral impact, sense of dejection, impact on sleep, breast examination, and sexuality were identified. One item belonging to the dejection dimension had uniform differential item functioning. Two items not fitting the Rasch models were retained because of high face validity. A sick leave item added useful information when measuring side effects and socioeconomic consequences of breast cancer screening. Five "poor items" were identified and should be deleted from the final instrument. Preliminary evidence for a valid and reliable condition-specific measure for women having an abnormal screening mammography was established. The measure includes 27 "good" items measuring different attributes of the same overall latent structure-the psychosocial consequences of abnormal screening mammography.
Find the Hidden Object. Understanding Play in Psychological Assessments

PubMed Central

Fasulo, Alessandra; Shukla, Janhavi; Bennett, Stephanie

2017-01-01

Standardized psychological assessments are extensively used by practitioners to determine rate and level of development in different domains of ability in both typical and atypical children. The younger the children, the more likely the trials will resemble play activities. However, mode of administration, timing and use of objects involved are constrained. The purpose of this study is to explore what kind of play is play in psychological assessments, what are the expectations about children's performance and what are the abilities supporting the test activities. Conversation Analysis (CA) was applied to the videorecording of an interaction between a child and a practitioner during the administration of the Bayley Scale of Infant and Toddler Development, III edition. The analysis focuses on a 2′07″ long sequence relative to the administration of the test item “Find the hidden object” to a 23 months old child with Down syndrome. The analysis of the sequence shows that the assessor promotes the child's engagement by couching the actions required to administer the item in utterances with marked child-directed features. The analysis also shows that the objects constituting the test item did not suggest to the child a unique course of action, leading to the assessor's modeling of the successful sequence. We argue that when a play frame is activated by an interactional partner, the relational aspect of the activity is foregrounded and the co-player becomes a source of cues for ways in which playing can develop. We discuss the assessment interaction as orienting the child toward a right-or-wrong interpretation, leaving the realm of play, which is inherently exploratory and inventive, to enter that of instructional activities. Finally, we argue that the sequential analysis of the interaction and of the mutual sense-making procedures that partners put in place during the administration of an assessment could be used in the design and evaluation of tests for a finer understanding of the abilities involved. PMID:28392771
Mere exposure effect: A consequence of direct and indirect fluency-preference links.

PubMed

Willems, Sylvie; Van der Linden, Martial

2006-06-01

In three experiments, picture quality between test items was manipulated to examine whether subjects' expectations about the fluency normally associated with these different stimuli might influence the effects of fluency on preference or familiarity-based recognition responses. The results showed that fluency due to pre-exposure influenced responses less when objects were presented with high picture quality, suggesting that attributions of fluency to preference and familiarity are adjusted according to expectations about the different test pictures. However, this expectations influence depended on subjects' awareness of these different quality levels. Indeed, imperceptible differences seemed not to induce expectations about the test item fluency. In this context, fluency due to both picture quality and pre-exposure influenced direct responses. Conversely, obvious, and noticed, differences in test picture quality did no affect responses, suggesting that expectations moderated attributions of fluency only when fluency normally associated with these different stimuli was perceptible but difficult to assess.
48 CFR 252.227-7013 - Rights in technical data-Noncommercial items.

Code of Federal Regulations, 2011 CFR

2011-10-01

... causing a computer to perform a specific operation or series of operations. (3) Computer software means computer programs, source code, source code listings, object code listings, design details, algorithms... or will be developed exclusively with Government funds; (ii) Studies, analyses, test data, or similar...
48 CFR 252.227-7013 - Rights in technical data-Noncommercial items.

Code of Federal Regulations, 2012 CFR

2012-10-01

... causing a computer to perform a specific operation or series of operations. (3) Computer software means computer programs, source code, source code listings, object code listings, design details, algorithms... or will be developed exclusively with Government funds; (ii) Studies, analyses, test data, or similar...
48 CFR 252.227-7013 - Rights in technical data-Noncommercial items.

Code of Federal Regulations, 2014 CFR

2014-10-01

... causing a computer to perform a specific operation or series of operations. (3) Computer software means computer programs, source code, source code listings, object code listings, design details, algorithms... or will be developed exclusively with Government funds; (ii) Studies, analyses, test data, or similar...
48 CFR 252.227-7013 - Rights in technical data-Noncommercial items.

Code of Federal Regulations, 2010 CFR

2010-10-01

... causing a computer to perform a specific operation or series of operations. (3) Computer software means computer programs, source code, source code listings, object code listings, design details, algorithms... developed exclusively with Government funds; (ii) Studies, analyses, test data, or similar data produced for...
An Application of the Rasch Measurement Theory to an Assessment of Geometric Thinking Levels

ERIC Educational Resources Information Center

Stols, Gerrit; Long, Caroline; Dunne, Tim

2015-01-01

The purpose of this study is to apply the Rasch model to investigate both the Van Hiele theory for geometric development and an associated test. In terms of the test, the objective is to investigate the functioning of a classic 25-item instrument designed to identify levels of geometric proficiency. The dataset of responses by 244 students (106…
Identifying Dyslexia in Adults: An Iterative Method Using the Predictive Value of Item Scores and Self-Report Questions

ERIC Educational Resources Information Center

Tamboer, Peter; Vorst, Harrie C. M.; Oort, Frans J.

2014-01-01

Methods for identifying dyslexia in adults vary widely between studies. Researchers have to decide how many tests to use, which tests are considered to be the most reliable, and how to determine cut-off scores. The aim of this study was to develop an objective and powerful method for diagnosing dyslexia. We took various methodological measures,…
The Impact of Item Dependency on the Efficiency of Testing and Reliability of Student Scores from a Computer Adaptive Assessment of Reading Comprehension

ERIC Educational Resources Information Center

Petscher, Yaacov; Foorman, Barbara R.; Truckenmiller, Adrea J.

2017-01-01

The objective of the present study was to evaluate the extent to which students who took a computer adaptive test of reading comprehension accounting for testlet effects were administered fewer passages and had a more precise estimate of their reading comprehension ability compared to students in the control condition. A randomized controlled…
77 FR 5839 - Notice of Intent To Repatriate a Cultural Item: University of Denver Department of Anthropology...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-02-06

... that a cultural item meets the definition of sacred object and repatriation to the Indian tribes stated... sacred object under 25 U.S.C. 3001. This notice is published as part of the National Park Service's... storage of sacred items, shell money, beads and other treasured items. Ceremonial baskets were sometimes...
Pancreatitis Quality of Life Instrument: Development of a new instrument

PubMed Central

Bova, Carol; Barton, Bruce; Hartigan, Celia

2014-01-01

Objectives: The goal of this project was to develop the first disease-specific instrument for the evaluation of quality of life in chronic pancreatitis. Methods: Focus groups and interview sessions were conducted, with chronic pancreatitis patients, to identify items felt to impact quality of life which were subsequently formatted into a paper-and-pencil instrument. This instrument was used to conduct an online survey by an expert panel of pancreatologists to evaluate its content validity. Finally, the modified instrument was presented to patients during precognitive testing interviews to evaluate its clarity and appropriateness. Results: In total, 10 patients were enrolled in the focus groups and interview sessions where they identified 50 items. Once redundant items were removed, the 40 remaining items were made into a paper-and-pencil instrument referred to as the Pancreatitis Quality of Life Instrument. Through the processes of content validation and precognitive testing, the number of items in the instrument was reduced to 24. Conclusions: This marks the development of the first disease-specific instrument to evaluate quality of life in chronic pancreatitis. It includes unique features not found in generic instruments (economic factors, stigma, and spiritual factors). Although this marks a giant step forward, psychometric evaluation is still needed prior to its clinical use. PMID:26770703
Validation of the MedUseQ: A Self-Administered Screener for Older Adults to Assess Medication Use Problems.

PubMed

Berman, Rebecca L; Iris, Madelyn; Conrad, Kendon J; Robinson, Carrie

2018-01-01

Older adults taking multiple prescription and nonprescription drugs are at risk for medication use problems, yet there are few brief, self-administered screening tools designed specifically for them. The study objective was to develop and validate a patient-centered screener for community-dwelling older adults. In phase 1, a convenience sample of 57 stakeholders (older adults, pharmacists, nurses, and physicians) participated in concept mapping, using Concept System® Global MAX TM , to identify items for a questionnaire. In phase 2, a 40-item questionnaire was tested with a convenience sample of 377 adults and a 24-item version was tested with 306 older adults, aged 55 and older, using Rasch methodology. In phase 3, stakeholder focus groups provided feedback on the format of questionnaire materials and recommended strategies for addressing problems. The concept map contained 72 statements organized into 6 conceptual clusters or domains. The 24-item screener was unidimensional. Cronbach's alpha was .87, person reliability was acceptable (.74), and item reliability was high (.96). The MedUseQ is a validated, patient-centered tool targeting older adults that can be used to assess a wide range of medication use problems in clinical and community settings and to identify areas for education, intervention, or further assessment.
Quality of life in patients with Parkinson's disease: development of a questionnaire.

PubMed Central

de Boer, A G; Wijker, W; Speelman, J D; de Haes, J C

1996-01-01

OBJECTIVES--To develop and test a questionnaire for measuring quality of life in patients with Parkinson's disease. METHODS--An item pool was developed based on the experience of patients with Parkinson's disease and of neurologists; medical literature on the problems of patients with Parkinson's disease; and other quality of life questionnaires. To reduce the item pool, 13 patients identified items that were a problem to them and rated their importance. Items which were most often chosen and rated most important were included in the Parkinson's disease quality of life questionnaire (PDQL). The PDQL consists of 37 items. To evaluate the discriminant validity of the PDQL three groups of severity of disease were compared. To test for convergent validity, the scores of the PDQL were tested for correlation with standard indices of quality of life. RESULTS--The PDQL was filled out by 384 patients with Parkinson's disease. It consisted of four subscales: parkinsonian symptoms, systemic symptoms, emotional functioning, and social functioning. The internal-consistency reliability coefficients of the PDQL subscales were high (0.80-0.87). Patients with higher disease severity had significantly lower quality of life on all PDQL subscales (P < 0.05). Almost all PDQL subscales correlated highly (P < 0.001) with the corresponding scales of the standard quality of life indices. CONCLUSION--The PDQL is a relevant, reliable, and valid measure of the quality of life of patients with Parkinson's disease. Images PMID:8676165
Psychometric characteristics of Clinical Reasoning Problems (CRPs) and its correlation with routine multiple choice question (MCQ) in Cardiology department

PubMed Central

DERAKHSHANDEH, ZAHRA; AMINI, MITRA; KOJURI, JAVAD; DEHBOZORGIAN, MARZIYEH

2018-01-01

Introduction: Clinical reasoning is one of the most important skills in the process of training a medical student to become an efficient physician. Assessment of the reasoning skills in a medical school program is important to direct students’ learning. One of the tests for measuring the clinical reasoning ability is Clinical Reasoning Problems (CRPs). The major aim of this study is to measure psychometric qualities of CRPs and define correlation between this test and routine MCQ in cardiology department of Shiraz medical school. Methods: This study was a descriptive study conducted on total cardiology residents of Shiraz Medical School. The study population consists of 40 residents in 2014. The routine CRPs and the MCQ tests was designed based on similar objectives and were carried out simultaneously. Reliability, item difficulty, item discrimination, and correlation between each item and the total score of CRPs were all measured by Excel and SPSS software for checking psycometeric CRPs test. Furthermore, we calculated the correlation between CRPs test and MCQ test. The mean differences of CRPs test score between residents’ academic year [second, third and fourth year] were also evaluated by Analysis of variances test (One Way ANOVA) using SPSS software (version 20)(α=0.05). Results: The mean and standard deviation of score in CRPs was 10.19 ±3.39 out of 20; in MCQ, it was 13.15±3.81 out of 20. Item difficulty was in the range of 0.27-0.72; item discrimination was 0.30-0.75 with question No.3 being the exception (that was 0.24). The correlation between each item and the total score of CRP was 0.26-0.87; the correlation between CRPs test and MCQ test was 0.68 (p<0.001). The reliability of the CRPs was 0.72 as calculated by using Cronbach's alpha. The mean score of CRPs was different among residents based on their academic year and this difference was statistically significant (p<0.001). Conclusion: The results of this present investigation revealed that CRPs could be reliable test for measuring clinical reasoning in residents. It can be included in cardiology residency assessment programs. PMID:29344528
77 FR 13622 - Notice of Intent To Repatriate Cultural Items: U.S. Fish and Wildlife Service, Office of Law...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-03-07

... below meet the definition of sacred objects and repatriation to the Indian tribe stated below may occur... definition of sacred objects under 25 U.S.C. 3001. This notice is published as part of the National Park.... Upon review, the USFWS determined that two sacred objects (Item 6: Crow lumpwood dance wand and Item 46...
Contextual consistency facilitates long-term memory of perceptual detail in barely seen images.

PubMed

Gronau, Nurit; Shachar, Meytal

2015-08-01

It is long known that contextual information affects memory for an object's identity (e.g., its basic level category), yet it is unclear whether schematic knowledge additionally enhances memory for the precise visual appearance of an item. Here we investigated memory for visual detail of merely glimpsed objects. Participants viewed pairs of contextually related and unrelated stimuli, presented for an extremely brief duration (24 ms, masked). They then performed a forced-choice memory-recognition test for the precise perceptual appearance of 1 of 2 objects within each pair (i.e., the "memory-target" item). In 3 experiments, we show that memory-target stimuli originally appearing within contextually related pairs are remembered better than targets appearing within unrelated pairs. These effects are obtained whether the target is presented at test with its counterpart pair object (i.e., when reiterating the original context at encoding) or whether the target is presented alone, implying that the contextual consistency effects are mediated predominantly by processes occurring during stimulus encoding, rather than during stimulus retrieval. Furthermore, visual detail encoding is improved whether object relations involve implied action or not, suggesting that, contrary to some prior suggestions, action is not a necessary component for object-to-object associative "grouping" processes. Our findings suggest that during a brief glimpse, but not under long viewing conditions, contextual associations may play a critical role in reducing stimulus competition for attention selection and in facilitating rapid encoding of sensory details. Theoretical implications with respect to classic frame theories are discussed. (PsycINFO Database Record (c) 2015 APA, all rights reserved).
Decoding the content of recollection within the core recollection network and beyond.

PubMed

Thakral, Preston P; Wang, Tracy H; Rugg, Michael D

2017-06-01

Recollection - retrieval of qualitative information about a past event - is associated with enhanced neural activity in a consistent set of neural regions (the 'core recollection network') seemingly regardless of the nature of the recollected content. Here, we employed multi-voxel pattern analysis (MVPA) to assess whether retrieval-related functional magnetic resonance imaging (fMRI) activity in core recollection regions - including the hippocampus, angular gyrus, medial prefrontal cortex, retrosplenial/posterior cingulate cortex, and middle temporal gyrus - contain information about studied content and thus demonstrate retrieval-related 'reinstatement' effects. During study, participants viewed objects and concrete words that were subjected to different encoding tasks. Test items included studied words, the names of studied objects, or unstudied words. Participants judged whether the items were recollected, familiar, or new by making 'remember', 'know', and 'new' responses, respectively. The study history of remembered test items could be reliably decoded using MVPA in most regions, as well as from the dorsolateral prefrontal cortex, a region where univariate recollection effects could not be detected. The findings add to evidence that members of the core recollection network, as well as at least one neural region where mean signal is insensitive to recollection success, carry information about recollected content. Importantly, the study history of recognized items endorsed with a 'know' response could be decoded with equal accuracy. The results thus demonstrate a striking dissociation between mean signal and multi-voxel indices of recollection. Moreover, they converge with prior findings in suggesting that, as it is operationalized by classification-based MVPA, reinstatement is not uniquely a signature of recollection. Copyright © 2016 Elsevier Ltd. All rights reserved.
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the ‘Claim Evaluation Tools’ database using Rasch modelling

PubMed Central

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-01-01

Background The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019

Binding of multiple features in memory by high-functioning adults with autism spectrum disorder.

PubMed

Bowler, Dermot M; Gaigg, Sebastian B; Gardiner, John M

2014-09-01

Diminished episodic memory and diminished use of semantic information to aid recall by individuals with autism spectrum disorder (ASD) are both thought to result from diminished relational binding of elements of complex stimuli. To test this hypothesis, we asked high-functioning adults with ASD and typical comparison participants to study grids in which some cells contained drawings of objects in non-canonical colours. Participants were told at study which features (colour, item, location) would be tested in a later memory test. In a second experiment, participants studied similar grids and were told that they would be tested on object-location or object-colour combinations. Recognition of combinations was significantly diminished in ASD, which survived covarying performance on the Color Trails Test (D'Elia et al. Color trails test. Professional manual. Psychological Assessment Resources, Lutz, 1996), a test of executive difficulties. The findings raise the possibility that medial temporal as well as frontal lobe processes are dysfunctional in ASD.
Validation of the Dutch version of the Swallowing Quality-of-Life Questionnaire (DSWAL-QoL) and the adjusted DSWAL-QoL (aDSWAL-QoL) using item analysis with the Rasch model: a pilot study.

PubMed

Simpelaere, Ingeborg S; Van Nuffelen, Gwen; De Bodt, Marc; Vanderwegen, Jan; Hansen, Tina

2017-04-07

The Swallowing Quality-of-Life Questionnaire (SWAL-QoL) is considered the gold standard for assessing health-related QoL in oropharyngeal dysphagia. The Dutch translation (DSWAL-QoL) and its adjusted version (aDSWAL-QoL) have been validated using classical test theory (CTT). However, these scales have not been tested against the Rasch measurement model, which is required to establish the structural validity and objectivity of the total scale and subscale scores. Thus, the purpose of this study was to examine the psychometric properties of these scales using item analysis according to the Rasch model. Item analysis with the Rasch model was performed using RUMM2030 software with previously collected data from a validation study of 108 patients. The assessment included evaluations of overall model fit, reliability, unidimensionality, threshold ordering, individual item and person fits, differential item functioning (DIF), local item dependency (LID) and targeting. The analysis could not establish the psychometric properties of either of the scales or their subscales because they did not fit the Rasch model, and multidimensionality, disordered thresholds, DIF, and/or LID were found. The reliability and power of fit were high for the total scales (PSI = 0.93) but low for most of the subscales (PSI < 0.70). The targeting of persons and items was suboptimal. The main source of misfit was disordered thresholds for both the total scales and subscales. Based on the results of the analysis, adjustments to improve the scales were implemented as follows: disordered thresholds were rescaled, misfit items were removed and items were split for DIF. However, the multidimensionality and LID could not be resolved. The reliability and power of fit remained low for most of the subscales. This study represents the first analyses of the DSWAL-QoL and aDSWAL-QoL with the Rasch model. Relying on the DSWAL-QoL and aDSWAL-QoL total and subscale scores to make conclusions regarding dysphagia-related HRQoL should be treated with caution before the structural validity and objectivity of both scales have been established. A larger and well-targeted sample is recommended to derive definitive conclusions about the items and scales. Solutions for the psychometric weaknesses suggested by the model and practical implications are discussed.
Scale Development for Perceived School Climate for Girls' Physical Activity

ERIC Educational Resources Information Center

Birnbaum, Amanda S.; Evenson, Kelly R.; Motl, Robert W.; Dishman, Rod K.; Voorhees, Carolyn C.; Sallis, James F.; Elder, John P.; Dowda, Marsha

2005-01-01

Objectives: To test an original scale assessing perceived school climate for girls' physical activity in middle school girls. Methods: Confirmatory factor analysis (CFA) and structural equation modeling (SEM). Results: CFA retained 5 of 14 original items. A model with 2 correlated factors, perceptions about teachers' and boys' behaviors,…
High School Students' Concepts of Acids and Bases.

ERIC Educational Resources Information Center

Ross, Bertram H. B.

An investigation of Ontario high school students' understanding of acids and bases with quantitative and qualitative methods revealed misconceptions. A concept map, based on the objectives of the Chemistry Curriculum Guideline, generated multiple-choice items and interview questions. The multiple-choice test was administered to 34 grade 12…
Trauma Resilience Scale: Validation of Protective Factors Associated with Adaptation following Violence

ERIC Educational Resources Information Center

Madsen, Machelle D.; Abell, Neil

2010-01-01

Objectives: The Trauma Resilience Scale (TRS), assessing protective factors associated with positive adaptation following violence, was tested in three waves of data collection. Empirical and theoretical literature shaped subscale and item formation emphasizing resilience following physical abuse, sexual abuse, intimate partner violence, and/or a…
A Test of the Similar Sequence Hypothesis.

ERIC Educational Resources Information Center

Silverstein, A. B.; And Others

1982-01-01

Scales for object permanence and spatial relationships were administered to 98 severely and profoundly mentally retarded children (mean age 13 years) on three occasions, 6 months apart. Differences in the difficulty of the items were quite stable, but their order of difficulty differed appreciably from that for nonretarded infants. (Author/SB)
48 CFR 252.227-7013 - Rights in technical data-Noncommercial items.

Code of Federal Regulations, 2013 CFR

2013-10-01

... causing a computer to perform a specific operation or series of operations. (3) Computer software means computer programs, source code, source code listings, object code listings, design details, algorithms... funds; (ii) Studies, analyses, test data, or similar data produced for this contract, when the study...
Selecting Items for Criterion-Referenced Tests.

ERIC Educational Resources Information Center

Mellenbergh, Gideon J.; van der Linden, Wim J.

1982-01-01

Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
Negative Symptom Dimensions of the Positive and Negative Syndrome Scale Across Geographical Regions

PubMed Central

Liharska, Lora; Harvey, Philip D.; Atkins, Alexandra; Ulshen, Daniel; Keefe, Richard S.E.

2017-01-01

Objective: Recognizing the discrete dimensions that underlie negative symptoms in schizophrenia and how these dimensions are understood across localities might result in better understanding and treatment of these symptoms. To this end, the objectives of this study were to 1) identify the Positive and Negative Syndrome Scale negative symptom dimensions of expressive deficits and experiential deficits and 2) analyze performance on these dimensions over 15 geographical regions to determine whether the items defining them manifest similar reliability across these regions. Design: Data were obtained for the baseline Positive and Negative Syndrome Scale visits of 6,889 subjects across 15 geographical regions. Using confirmatory factor analysis, we examined whether a two-factor negative symptom structure that is found in schizophrenia (experiential deficits and expressive deficits) would be replicated in our sample, and using differential item functioning, we tested the degree to which specific items from each negative symptom subfactor performed across geographical regions in comparison with the United States. Results: The two-factor negative symptom solution was replicated in this sample. Most geographical regions showed moderate-to-large differential item functioning for Positive and Negative Syndrome Scale expressive deficit items, especially N3 Poor Rapport, as compared with Positive and Negative Syndrome Scale experiential deficit items, showing that these items might be interpreted or scored differently in different regions. Across countries, except for India, the differential item functioning values did not favor raters in the United States. Conclusion: These results suggest that the Positive and Negative Syndrome Scale negative symptom factor can be better represented by a two-factor model than by a single-factor model. Additionally, the results show significant differences in responses to items representing the Positive and Negative Syndrome Scale expressive factors, but not the experiential factors, across regions. This could be due to a lack of equivalence between the original and translated versions, cultural differences with the interpretation of items, dissimilarities in rater training, or diversity in the understanding of scoring anchors. Knowing which items are challenging for raters across regions can help to guide Positive and Negative Syndrome Scale training and improve the results of international clinical trials aimed at negative symptoms. PMID:29410935
A new, female-specific irritability rating scale

PubMed Central

Born, Leslie; Koren, Gideon; Lin, Elizabeth; Steiner, Meir

2008-01-01

Objective Irritability is a prominent symptom in the spectrum of female-specific mood disorders, and in some women, irritability is serious enough to disrupt their lives and warrant treatment. The objective of this research was to develop a new, female-specific state measure of irritability. Methods We constructed self-rating and observer rating scales using items derived from spontaneous descriptions of irritability by women with mood disturbances related to the menstrual cycle, childbearing or menopause. Following a pretest, the scales were shortened to the core items of irritability (annoyance, anger, tension, hostility, sensitivity to noise and touch) and tested on a new cohort of patients. Results The 14-item Self-Rating Scale and the 5-item Observer Rating Scale showed evidence for internal consistency (Self-Rating: n = 36 patients, Cronbach's α = 0.9257, mean interitem correlation = 0.4690; Observer Rating: Cronbach's α = 0.7418, mean interitem correlation = 0.3616), Self-Rating test–retest reliability (n = 29 patients, rs = 0.704, p = 0.01) and interrater reliability (n = 20 patients; τb = 1.000, p = 0.001). Conclusion This new, female-specific scale for rating irritability has the potential to further the evaluation of this prominent symptom cluster and increase specificity in clinical assessments of emotional disturbances related to reproductive cyclicity in women. PMID:18592028
Development of a Computer-Adaptive Physical Function Instrument for Social Security Administration Disability Determination

PubMed Central

Ni, Pengsheng; McDonough, Christine M.; Jette, Alan M.; Bogusz, Kara; Marfeo, Elizabeth E.; Rasch, Elizabeth K.; Brandt, Diane E.; Meterko, Mark; Chan, Leighton

2014-01-01

Objectives To develop and test an instrument to assess physical function (PF) for Social Security Administration (SSA) disability programs, the SSA-PF. Item Response Theory (IRT) analyses were used to 1) create a calibrated item bank for each of the factors identified in prior factor analyses, 2) assess the fit of the items within each scale, 3) develop separate Computer-Adaptive Test (CAT) instruments for each scale, and 4) conduct initial psychometric testing. Design Cross-sectional data collection; IRT analyses; CAT simulation. Setting Telephone and internet survey. Participants Two samples: 1,017 SSA claimants, and 999 adults from the US general population. Interventions None. Main Outcome Measure Model fit statistics, correlation and reliability coefficients, Results IRT analyses resulted in five unidimensional SSA-PF scales: Changing & Maintaining Body Position, Whole Body Mobility, Upper Body Function, Upper Extremity Fine Motor, and Wheelchair Mobility for a total of 102 items. High CAT accuracy was demonstrated by strong correlations between simulated CAT scores and those from the full item banks. Comparing the simulated CATs to the full item banks, very little loss of reliability or precision was noted, except at the lower and upper ranges of each scale. No difference in response patterns by age or sex was noted. The distributions of claimant scores were shifted to the lower end of each scale compared to those of a sample of US adults. Conclusions The SSA-PF instrument contributes important new methodology for measuring the physical function of adults applying to the SSA disability programs. Initial evaluation revealed that the SSA-PF instrument achieved considerable breadth of coverage in each content domain and demonstrated noteworthy psychometric properties. PMID:23578594
Age-related increases in false recognition: the role of perceptual and conceptual similarity.

PubMed

Pidgeon, Laura M; Morcom, Alexa M

2014-01-01

Older adults (OAs) are more likely to falsely recognize novel events than young adults, and recent behavioral and neuroimaging evidence points to a reduced ability to distinguish overlapping information due to decline in hippocampal pattern separation. However, other data suggest a critical role for semantic similarity. Koutstaal et al. [(2003) false recognition of abstract vs. common objects in older and younger adults: testing the semantic categorization account, J. Exp. Psychol. Learn. 29, 499-510] reported that OAs were only vulnerable to false recognition of items with pre-existing semantic representations. We replicated Koutstaal et al.'s (2003) second experiment and examined the influence of independently rated perceptual and conceptual similarity between stimuli and lures. At study, young and OAs judged the pleasantness of pictures of abstract (unfamiliar) and concrete (familiar) items, followed by a surprise recognition test including studied items, similar lures, and novel unrelated items. Experiment 1 used dichotomous "old/new" responses at test, while in Experiment 2 participants were also asked to judge lures as "similar," to increase explicit demands on pattern separation. In both experiments, OAs showed a greater increase in false recognition for concrete than abstract items relative to the young, replicating Koutstaal et al.'s (2003) findings. However, unlike in the earlier study, there was also an age-related increase in false recognition of abstract lures when multiple similar images had been studied. In line with pattern separation accounts of false recognition, OAs were more likely to misclassify concrete lures with high and moderate, but not low degrees of rated similarity to studied items. Results are consistent with the view that OAs are particularly susceptible to semantic interference in recognition memory, and with the possibility that this reflects age-related decline in pattern separation.
Age-related increases in false recognition: the role of perceptual and conceptual similarity

PubMed Central

Pidgeon, Laura M.; Morcom, Alexa M.

2014-01-01

Older adults (OAs) are more likely to falsely recognize novel events than young adults, and recent behavioral and neuroimaging evidence points to a reduced ability to distinguish overlapping information due to decline in hippocampal pattern separation. However, other data suggest a critical role for semantic similarity. Koutstaal et al. [(2003) false recognition of abstract vs. common objects in older and younger adults: testing the semantic categorization account, J. Exp. Psychol. Learn. 29, 499–510] reported that OAs were only vulnerable to false recognition of items with pre-existing semantic representations. We replicated Koutstaal et al.’s (2003) second experiment and examined the influence of independently rated perceptual and conceptual similarity between stimuli and lures. At study, young and OAs judged the pleasantness of pictures of abstract (unfamiliar) and concrete (familiar) items, followed by a surprise recognition test including studied items, similar lures, and novel unrelated items. Experiment 1 used dichotomous “old/new” responses at test, while in Experiment 2 participants were also asked to judge lures as “similar,” to increase explicit demands on pattern separation. In both experiments, OAs showed a greater increase in false recognition for concrete than abstract items relative to the young, replicating Koutstaal et al.’s (2003) findings. However, unlike in the earlier study, there was also an age-related increase in false recognition of abstract lures when multiple similar images had been studied. In line with pattern separation accounts of false recognition, OAs were more likely to misclassify concrete lures with high and moderate, but not low degrees of rated similarity to studied items. Results are consistent with the view that OAs are particularly susceptible to semantic interference in recognition memory, and with the possibility that this reflects age-related decline in pattern separation. PMID:25368576
Conceptual fluency at test shifts recognition response bias in Alzheimer's disease: implications for increased false recognition.

PubMed

Gold, Carl A; Marchant, Natalie L; Koutstaal, Wilma; Schacter, Daniel L; Budson, Andrew E

2007-09-20

The presence or absence of conceptual information in pictorial stimuli may explain the mixed findings of previous studies of false recognition in patients with mild Alzheimer's disease (AD). To test this hypothesis, 48 patients with AD were compared to 48 healthy older adults on a recognition task first described by Koutstaal et al. [Koutstaal, W., Reddy, C., Jackson, E. M., Prince, S., Cendan, D. L., & Schacter D. L. (2003). False recognition of abstract versus common objects in older and younger adults: Testing the semantic categorization account. Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 499-510]. Participants studied and were tested on their memory for categorized ambiguous pictures of common objects. The presence of conceptual information at study and/or test was manipulated by providing or withholding disambiguating semantic labels. Analyses focused on testing two competing theories. The semantic encoding hypothesis, which posits that the inter-item perceptual details are not encoded by AD patients when conceptual information is present in the stimuli, was not supported by the findings. In contrast, the conceptual fluency hypothesis was supported. Enhanced conceptual fluency at test dramatically shifted AD patients to a more liberal response bias, raising their false recognition. These results suggest that patients with AD rely on the fluency of test items in making recognition memory decisions. We speculate that AD patients' over reliance upon fluency may be attributable to (1) dysfunction of the hippocampus, disrupting recollection, and/or (2) dysfunction of prefrontal cortex, disrupting post-retrieval processes.
Development and evaluation of a brief screener to estimate fast-food and beverage consumption among adolescents.

PubMed

Nelson, Melissa C; Lytle, Leslie A

2009-04-01

Sweetened beverage and fast-food intake have been identified as important targets for obesity prevention. However, there are few brief dietary assessment tools available to evaluate these behaviors among adolescents. The objective of this research was to examine reliability and validity of a 22-item dietary screener assessing adolescent consumption of specific energy-containing and non-energy-containing beverages (nine items) and fast food (13 items). The screener was administered to adolescents (ages 11 to 18 years) recruited from the Minneapolis/St Paul, MN, metro region. One sample of adolescents completed test-retest reliability of the screener (n=33, primarily white adolescents). Another adolescent sample completed the screener along with three 24-hour dietary recalls to assess criterion validity (n=59 white adolescents). Test-retest assessments were completed approximately 7 to 14 days apart, and agreement between the two administrations of the screener was substantial, with most items yielding Spearman correlations and kappa statistics that were >0.60. When compared to the gold standard dietary recall data, findings indicate that the validity of the screener items assessing adolescents' intake of regular soda, sports drinks, milk, and water was fair. However, the differential assessment periods captured by the two methods (ie, 1 month for the screener vs 3 days for the recalls) posed challenges in analysis and made it impossible to assess the validity of some screener items. Overall while these screener items largely represent reliable measures with fair validity, our findings highlight the challenges inherent in the validation of brief dietary assessment tools.
The cost of proactive interference is constant across presentation conditions.

PubMed

Endress, Ansgar D; Siddique, Aneela

2016-10-01

Proactive interference (PI) severely constrains how many items people can remember. For example, Endress and Potter (2014a) presented participants with sequences of everyday objects at 250ms/picture, followed by a yes/no recognition test. They manipulated PI by either using new images on every trial in the unique condition (thus minimizing PI among items), or by re-using images from a limited pool for all trials in the repeated condition (thus maximizing PI among items). In the low-PI unique condition, the probability of remembering an item was essentially independent of the number of memory items, showing no clear memory limitations; more traditional working memory-like memory limitations appeared only in the high-PI repeated condition. Here, we ask whether the effects of PI are modulated by the availability of long-term memory (LTM) and verbal resources. Participants viewed sequences of 21 images, followed by a yes/no recognition test. Items were presented either quickly (250ms/image) or sufficiently slowly (1500ms/image) to produce LTM representations, either with or without verbal suppression. Across conditions, participants performed better in the unique than in the repeated condition, and better for slow than for fast presentations. In contrast, verbal suppression impaired performance only with slow presentations. The relative cost of PI was remarkably constant across conditions: relative to the unique condition, performance in the repeated condition was about 15% lower in all conditions. The cost of PI thus seems to be a function of the relative strength or recency of target items and interfering items, but relatively insensitive to other experimental manipulations. Copyright © 2016 Elsevier B.V. All rights reserved.
Frequency of consumption of cariogenic food items by 4-month-old to 24-month-old children: comparison between two rural communities in KwaZulu-Natal, South Africa.

PubMed

MacKeown, Jennifer M; Faber, Mieke

2005-03-01

The objective of the study was to compare the frequency of consumption of cariogenic food items among 4-month-old to 24-month-old children in two neighbouring rural areas in KwaZulu-Natal Province, South Africa: Nyuswa/Embo (Area A) (n = 127) and Ndunakazi (Area B) (n = 105). Dietary intake was assessed using a food frequency questionnaire. Mothers or caregivers were interviewed by a team of Zulu-speaking fieldworkers. The percentage of children consuming the individual food items (consumers) and the weekly consumption for consumers were calculated for the two areas separately. The food items were ranked in descending order according to the combined group of children and reported for each area within five selected food groups (carbohydrates, sugars, fruit and vegetables, milk and milk products, and other foods and snacks). Food items were 'flagged' according to their cariogenic potential. Fisher's exact test on absolute numbers tested for significant differences in the frequency of intake between individual food items between the two groups. Significance was set at P < 0.05. The frequency of consumption of certain listed cariogenic food items showed significant differences between the two areas. A higher percentage of children in Area A than in Area B consumed most of the food items and also more frequently. Children mainly consumed foods with a cariogenic score of 2, solid foods with 8-20% sugars as well as foods high in starch with less than 10% sugars. This knowledge is essential to gain insight into the eating pattern among rural communities and will provide a baseline for developing and adapting dietary advice specifically for young rural South African children with particular emphasis on the prevention of dental caries.
77 FR 74866 - Notice of Intent To Repatriate Cultural Items: New York State Museum, Albany, NY

Federal Register 2010, 2011, 2012, 2013, 2014

2012-12-18

... tribe, has determined that the cultural items meet the definition of sacred objects and objects of... York State Museum that meet the definition of sacred objects and objects of cultural patrimony under 25... Five Nations Alliance Belt as both a sacred object and an object of cultural patrimony as it relates to...
77 FR 13623 - Notice of Intent To Repatriate Cultural Items: U.S. Fish and Wildlife Service, Office of Law...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-03-07

... cultural items listed below meet the definition of sacred objects and/or objects of cultural patrimony and... of Law Enforcement, that meet the definition of sacred objects and/or objects of cultural patrimony.... Upon review, the USFWS determined that three objects of cultural patrimony and five sacred objects are...
History & implementation of Item Unique Identification (IUID) - Has it Improved Asset Visibility?

DTIC Science & Technology

2012-03-27

figure 3) as of November 2011 on their progress in implementing IUID38: 10 Objectives • Policy Updates • Systems Updates (AIS and ERP ...Implement SAP Enhancement Pack – Nov 2013 • Design & build LMP IUID solution – Aug 2014 • Integrate & test with Trading Partners, Army IUID...issues Objectives • Policy Updates • Systems Updates (AIS and ERP ) • Contract Compliance Rate • Physical Marking • Use of IUID Registry IUID Scorecard

Effect of clinically discriminating, evidence-based checklist items on the reliability of scores from an Internal Medicine residency OSCE.

PubMed

Daniels, Vijay J; Bordage, Georges; Gierl, Mark J; Yudkowsky, Rachel

2014-10-01

Objective structured clinical examinations (OSCEs) are used worldwide for summative examinations but often lack acceptable reliability. Research has shown that reliability of scores increases if OSCE checklists for medical students include only clinically relevant items. Also, checklists are often missing evidence-based items that high-achieving learners are more likely to use. The purpose of this study was to determine if limiting checklist items to clinically discriminating items and/or adding missing evidence-based items improved score reliability in an Internal Medicine residency OSCE. Six internists reviewed the traditional checklists of four OSCE stations classifying items as clinically discriminating or non-discriminating. Two independent reviewers augmented checklists with missing evidence-based items. We used generalizability theory to calculate overall reliability of faculty observer checklist scores from 45 first and second-year residents and predict how many 10-item stations would be required to reach a Phi coefficient of 0.8. Removing clinically non-discriminating items from the traditional checklist did not affect the number of stations (15) required to reach a Phi of 0.8 with 10 items. Focusing the checklist on only evidence-based clinically discriminating items increased test score reliability, needing 11 stations instead of 15 to reach 0.8; adding missing evidence-based clinically discriminating items to the traditional checklist modestly improved reliability (needing 14 instead of 15 stations). Checklists composed of evidence-based clinically discriminating items improved the reliability of checklist scores and reduced the number of stations needed for acceptable reliability. Educators should give preference to evidence-based items over non-evidence-based items when developing OSCE checklists.
Vegetable parenting practices scale. Item response modeling analyses

PubMed Central

Chen, Tzu-An; O’Connor, Teresia; Hughes, Sheryl; Beltran, Alicia; Baranowski, Janice; Diep, Cassandra; Baranowski, Tom

2015-01-01

Objective To evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We also tested for differences in the ways item function (called differential item functioning) across child’s gender, ethnicity, age, and household income groups. Method Parents of 3–5 year old children completed a self-reported vegetable parenting practices scale online. Vegetable parenting practices consisted of 14 effective vegetable parenting practices and 12 ineffective vegetable parenting practices items, each with three subscales (responsiveness, structure, and control). Multidimensional polytomous item response modeling was conducted separately on effective vegetable parenting practices and ineffective vegetable parenting practices. Results One effective vegetable parenting practice item did not fit the model well in the full sample or across demographic groups, and another was a misfit in differential item functioning analyses across child’s gender. Significant differential item functioning was detected across children’s age and ethnicity groups, and more among effective vegetable parenting practices than ineffective vegetable parenting practices items. Wright maps showed items only covered parts of the latent trait distribution. The harder- and easier-to-respond ends of the construct were not covered by items for effective vegetable parenting practices and ineffective vegetable parenting practices, respectively. Conclusions Several effective vegetable parenting practices and ineffective vegetable parenting practices scale items functioned differently on the basis of child’s demographic characteristics; therefore, researchers should use these vegetable parenting practices scales with caution. Item response modeling should be incorporated in analyses of parenting practice questionnaires to better assess differences across demographic characteristics. PMID:25895694
76 FR 14048 - Notice of Intent To Repatriate a Cultural Item: Arizona State Museum, University of Arizona...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-03-15

... sacred object and object of cultural patrimony under 25 U.S.C. 3001. This notice is published as part of... item described above is a specific ceremonial object needed by traditional Native American religious... relationship of shared group identity that can be reasonably traced between the sacred object/object of...
76 FR 9049 - Notice of Intent To Repatriate Cultural Items: University of Pennsylvania Museum of Archaeology...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-02-16

... Anthropology, Philadelphia, PA, that meet the definitions of sacred objects and/or objects of cultural..., anthropological literature, and expert opinion, one cultural item is considered to be a sacred object (Wolf Helmet... considered to be both sacred objects and objects of cultural patrimony (Ganook Hat, NA6864; Noble Killer Hat...
Applying Computerized Adaptive Testing to the Negative Acts Questionnaire-Revised: Rasch Analysis of Workplace Bullying

PubMed Central

Ma, Shu-Ching; Li, Yu-Chi; Yui, Mei-Shu

2014-01-01

Background Workplace bullying is a prevalent problem in contemporary work places that has adverse effects on both the victims of bullying and organizations. With the rapid development of computer technology in recent years, there is an urgent need to prove whether item response theory–based computerized adaptive testing (CAT) can be applied to measure exposure to workplace bullying. Objective The purpose of this study was to evaluate the relative efficiency and measurement precision of a CAT-based test for hospital nurses compared to traditional nonadaptive testing (NAT). Under the preliminary conditions of a single domain derived from the scale, a CAT module bullying scale model with polytomously scored items is provided as an example for evaluation purposes. Methods A total of 300 nurses were recruited and responded to the 22-item Negative Acts Questionnaire-Revised (NAQ-R). All NAT (or CAT-selected) items were calibrated with the Rasch rating scale model and all respondents were randomly selected for a comparison of the advantages of CAT and NAT in efficiency and precision by paired t tests and the area under the receiver operating characteristic curve (AUROC). Results The NAQ-R is a unidimensional construct that can be applied to measure exposure to workplace bullying through CAT-based administration. Nursing measures derived from both tests (CAT and NAT) were highly correlated (r=.97) and their measurement precisions were not statistically different (P=.49) as expected. CAT required fewer items than NAT (an efficiency gain of 32%), suggesting a reduced burden for respondents. There were significant differences in work tenure between the 2 groups (bullied and nonbullied) at a cutoff point of 6 years at 1 worksite. An AUROC of 0.75 (95% CI 0.68-0.79) with logits greater than –4.2 (or >30 in summation) was defined as being highly likely bullied in a workplace. Conclusions With CAT-based administration of the NAQ-R for nurses, their burden was substantially reduced without compromising measurement precision. PMID:24534113
An Item Gains and Losses Analysis of False Memories Suggests Critical Items Receive More Item-Specific Processing than List Items

ERIC Educational Resources Information Center

Burns, Daniel J.; Martens, Nicholas J.; Bertoni, Alicia A.; Sweeney, Emily J.; Lividini, Michelle D.

2006-01-01

In a repeated testing paradigm, list items receiving item-specific processing are more likely to be recovered across successive tests (item gains), whereas items receiving relational processing are likely to be forgotten progressively less on successive tests. Moreover, analysis of cumulative-recall curves has shown that item-specific processing…
Development and Validation of the Chinese Attitudes to Starting Insulin Questionnaire (Ch-ASIQ) for Primary Care Patients with Type 2 Diabetes

PubMed Central

Fu, Sau Nga; Chin, Weng Yee; Wong, Carlos King Ho; Yeung, Vincent Tok Fai; Yiu, Ming Pong; Tsui, Hoi Yee; Chan, Ka Hung

2013-01-01

Objectives To develop and evaluate the psychometric properties of a Chinese questionnaire which assesses the barriers and enablers to commencing insulin in primary care patients with poorly controlled Type 2 diabetes. Research Design and Method Questionnaire items were identified using literature review. Content validation was performed and items were further refined using an expert panel. Following translation, back translation and cognitive debriefing, the translated Chinese questionnaire was piloted on target patients. Exploratory factor analysis and item-scale correlations were performed to test the construct validity of the subscales and items. Internal reliability was tested by Cronbach’s alpha. Results Twenty-seven identified items underwent content validation, translation and cognitive debriefing. The translated questionnaire was piloted on 303 insulin naïve (never taken insulin) Type 2 diabetes patients recruited from 10 government-funded primary care clinics across Hong Kong. Sufficient variability in the dataset for factor analysis was confirmed by Bartlett’s Test of Sphericity (P<0.001). Using exploratory factor analysis with varimax rotation, 10 factors were generated onto which 26 items loaded with loading scores > 0.4 and Eigenvalues >1. Total variance for the 10 factors was 66.22%. Kaiser-Meyer-Olkin measure was 0.725. Cronbach’s alpha coefficients for the first four factors were ≥0.6 identifying four sub-scales to which 13 items correlated. Remaining sub-scales and items with poor internal reliability were deleted. The final 13-item instrument had a four scale structure addressing: ‘Self-image and stigmatization’; ‘Factors promoting self-efficacy; ‘Fear of pain or needles’; and ‘Time and family support’. Conclusion The Chinese Attitudes to Starting Insulin Questionnaire (Ch-ASIQ) appears to be a reliable and valid measure for assessing barriers to starting insulin. This short instrument is easy to administer and may be used by healthcare providers and researchers as an assessment tool for Chinese diabetic primary care patients, including the elderly, who are unwilling to start insulin. PMID:24236071
Monitoring population health for Healthy People 2020: evaluation of the NIH PROMIS® Global Health, CDC Healthy Days, and satisfaction with life instruments

PubMed Central

Barile, John P.; Reeve, Bryce B.; Smith, Ashley Wilder; Zack, Matthew M.; Mitchell, Sandra A.; Kobau, Rosemarie; Cella, David F.; Luncheon, Cecily; Thompson, William W.

2015-01-01

Purpose Healthy People 2020 identified health-related quality of life and well-being (WB) as indicators of population health for the next decade. This study examined the measurement properties of the NIH PROMIS® Global Health Scale, the CDC Healthy Days items, and associations with the Satisfaction with Life Scale. Methods A total of 4,184 adults completed the Porter Novelli's HealthStyles mailed survey. Physical and mental health (9 items from PROMIS Global Scale and 3 items from CDC Healthy days measure), and 4 WB factor items were tested for measurement equivalence using multiple-group confirmatory factor analysis. Results The CDC items accounted for similar variance as the PROMIS items on physical and mental health factors; both factors were moderately correlated with WB. Measurement invariance was supported across gender and age; the magnitude of some factor loadings differed between those with and without a chronic medical condition. Conclusions The PROMIS, CDC, and WB items all performed well. The PROMIS items captured a broad range of functioning across the entire continuum of physical and mental health, while the CDC items appear appropriate for assessing burden of disease for chronic conditions and are brief and easily interpretable. All three measures under study appear to be appropriate measures for monitoring several aspects of the Healthy People 2020 goals and objectives. PMID:23404737
Monitoring population health for Healthy People 2020: evaluation of the NIH PROMIS® Global Health, CDC Healthy Days, and satisfaction with life instruments.

PubMed

Barile, John P; Reeve, Bryce B; Smith, Ashley Wilder; Zack, Matthew M; Mitchell, Sandra A; Kobau, Rosemarie; Cella, David F; Luncheon, Cecily; Thompson, William W

2013-08-01

Healthy People 2020 identified health-related quality of life and well-being (WB) as indicators of population health for the next decade. This study examined the measurement properties of the NIH PROMIS(®) Global Health Scale, the CDC Healthy Days items, and associations with the Satisfaction with Life Scale. A total of 4,184 adults completed the Porter Novelli's HealthStyles mailed survey. Physical and mental health (9 items from PROMIS Global Scale and 3 items from CDC Healthy days measure), and 4 WB factor items were tested for measurement equivalence using multiple-group confirmatory factor analysis. The CDC items accounted for similar variance as the PROMIS items on physical and mental health factors; both factors were moderately correlated with WB. Measurement invariance was supported across gender and age; the magnitude of some factor loadings differed between those with and without a chronic medical condition. The PROMIS, CDC, and WB items all performed well. The PROMIS items captured a broad range of functioning across the entire continuum of physical and mental health, while the CDC items appear appropriate for assessing burden of disease for chronic conditions and are brief and easily interpretable. All three measures under study appear to be appropriate measures for monitoring several aspects of the Healthy People 2020 goals and objectives.
Development and psychometric characteristics of the SCI-QOL Pressure Ulcers scale and short form

PubMed Central

Kisala, Pamela A.; Tulsky, David S.; Choi, Seung W.; Kirshblum, Steven C.

2015-01-01

Objective To develop a self-reported measure of the subjective impact of pressure ulcers on health-related quality of life (HRQOL) in individuals with spinal cord injury (SCI) as part of the SCI quality of life (SCI-QOL) measurement system. Design Grounded-theory based qualitative item development methods, large-scale item calibration testing, confirmatory factor analysis (CFA), and item response theory-based psychometric analysis. Setting Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Participants Adults with traumatic SCI. Main Outcome Measures SCI-QOL Pressure Ulcers scale. Results 189 individuals with traumatic SCI who experienced a pressure ulcer within the past 7 days completed 30 items related to pressure ulcers. CFA confirmed a unidimensional pool of items. IRT analyses were conducted. A constrained Graded Response Model with a constant slope parameter was used to estimate item thresholds for the 12 retained items. Conclusions The 12-item SCI-QOL Pressure Ulcers scale is unique in that it is specifically targeted to individuals with spinal cord injury and at every stage of development has included input from individuals with SCI. Furthermore, use of CFA and IRT methods provide flexibility and precision of measurement. The scale may be administered in its entirety or as a 7-item “short form” and is available for both research and clinical practice. PMID:26010965
Hippocampus is required for paired associate memory with neither delay nor trial uniqueness

PubMed Central

Yoon, Jinah; Seo, Yeran; Kim, Jangjin; Lee, Inah

2012-01-01

Cued retrieval of memory is typically examined with delay when testing hippocampal functions, as in delayed matching-to-sample tasks. Equally emphasized in the literature, on the other hand, is the hippocampal involvement in making arbitrary associations. Paired associate memory tasks are widely used for examining this function. However, the two variables (i.e., delay and paired association) were often mixed in paired associate tasks, and this makes it difficult to localize the cognitive source of deficits with hippocampal perturbation. Specifically, a few studies have recently shown that rats can learn arbitrary paired associations between certain locations and nonspatial items (e.g., object or flavor) and later can retrieve the paired location when cued by the item remotely. Such tasks involve both (1) delay between sampling the cue and retrieving the target location and (2) arbitrary association between the cueing object and its paired location. Here, we tested whether delay was necessary in a cued paired associate task by using a task in which no delay existed between object cueing and the choice of its paired associate. Moreover, fixed associative relationships between the cueing objects and their paired locations were repeatedly used, thus involving no trial-unique association. Nevertheless, inactivations of the dorsal hippocampus with muscimol severely disrupted retrieval of paired associates, whereas the same manipulations did not affect discriminating individual objects or locations. The results powerfully demonstrate that the hippocampus is inherently required for retrieving paired associations between objects and places, and that delay and trial uniqueness of the paired associates are not necessarily required. PMID:22174309
Unidimensional IRT Item Parameter Estimates across Equivalent Test Forms with Confounding Specifications within Dimensions

ERIC Educational Resources Information Center

Matlock, Ki Lynn; Turner, Ronna

2016-01-01

When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…
A Multidimensional Tool Based on the eHealth Literacy Framework: Development and Initial Validity Testing of the eHealth Literacy Questionnaire (eHLQ).

PubMed

Kayser, Lars; Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H

2018-02-12

For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user's needs, resources, and competence. The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior productive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: -63.7 to 133.8). The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people's interaction with digital health services. ©Lars Kayser, Astrid Karnoe, Dorthe Furstrand, Roy Batterham, Karl Bang Christensen, Gerald Elsworth, Richard H Osborne. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.02.2018.
Development of an Instrument to Measure Pharmacy Student Attitudes Toward Social Media Professionalism.

PubMed

Chisholm-Burns, Marie A; Spivey, Christina A; Jaeger, Melanie C; Williams, Jennifer; George, Christa

2017-05-01

Objectives. To develop and validate a scale measuring pharmacy students' attitudes toward social media professionalism, and assess the impact of an educational presentation on social media professionalism. Methods. A social media professionalism scale was used in a pre- and post-survey to determine the effects of a social media professionalism presentation. The 26-item scale was administered to 197 first-year pharmacy (P1) students during orientation. Exploratory factor analysis was applied to determine the number of underlying factors responsible for covariation of the data. Principal components analysis was used as the extraction method. Varimax was selected as the rotation method. Cronbach's alpha was estimated. Wilcoxon signed rank test was used to compare pre- and post-scores of each item, subscale, and total scale. Results. There were 187 (95%) students who participated. The final scale had five subscales and 15 items. Subscales were named according to the professionalism tenet they best represented. Scores of items addressing reading/posting to social media during class, an employer's use of social media when making hiring decisions, and a college/university's use of social media as a measure of professional conduct significantly increased from pre-test to post-test. The "honesty and integrity" subscale score also significantly increased. Conclusion. The social media professionalism scale measures five tenets of professionalism and exhibits satisfactory reliability. The presentation improved P1 students' attitudes regarding social media professionalism.
The Dutch motor skills assessment as tool for talent development in table tennis: a reproducibility and validity study.

PubMed

Faber, Irene R; Nijhuis-Van Der Sanden, Maria W G; Elferink-Gemser, Marije T; Oosterveld, Frits G J

2015-01-01

A motor skills assessment could be helpful in talent development by estimating essential perceptuo-motor skills of young players, which are considered requisite to develop excellent technical and tactical qualities. The Netherlands Table Tennis Association uses a motor skills assessment in their talent development programme consisting of eight items measuring perceptuo-motor skills specific to table tennis under varying conditions. This study aimed to investigate this assessment regarding its reproducibility, internal consistency, underlying dimensions and concurrent validity in 113 young table tennis players (6-10 years). Intraclass correlation coefficients of six test items met the criteria of 0.7 with coefficients of variation between 3% and 8%. Cronbach's alpha valued 0.853 for internal consistency. The principal components analysis distinguished two conceptually meaningful factors: "ball control" and "gross motor function." Concurrent validity analyses demonstrated moderate associations between the motor skills assessment's results and national ranking; boys r = -0.53 (P < 0.001) and girls r = -0.45 (P = 0.015). In conclusion, this evaluation demonstrated six test items with acceptable reproducibility, good internal consistency and good prospects for validity. Two test items need revision to upgrade reproducibility. Since the motor skills assessment seems to be a reproducible, objective part of a talent development programme, more longitudinal studies are required to investigate its predictive validity.
An Objective Instrument for Assessment of Erikson's Developmental Conflicts. Presentation Summary.

ERIC Educational Resources Information Center

Speisman, Joseph C.; And Others

An objective measure of Erikson's ego-identity construct is being developed. The total scale includes seven relatively independent subscales designed to reflect the residuals (part conflicts) of Erikson's psychosocial stages of development. An initial item pool of 194 items has been reduced to 113 items by means of judgemental and statistical…
Forecasting in foodservice: model development, testing, and evaluation.

PubMed

Miller, J L; Thompson, P A; Orabella, M M

1991-05-01

This study was designed to develop, test, and evaluate mathematical models appropriate for forecasting menu-item production demand in foodservice. Data were collected from residence and dining hall foodservices at Ohio State University. Objectives of the study were to collect, code, and analyze the data; develop and test models using actual operation data; and compare forecasting results with current methods in use. Customer count was forecast using deseasonalized simple exponential smoothing. Menu-item demand was forecast by multiplying the count forecast by a predicted preference statistic. Forecasting models were evaluated using mean squared error, mean absolute deviation, and mean absolute percentage error techniques. All models were more accurate than current methods. A broad spectrum of forecasting techniques could be used by foodservice managers with access to a personal computer and spread-sheet and database-management software. The findings indicate that mathematical forecasting techniques may be effective in foodservice operations to control costs, increase productivity, and maximize profits.
Erasing and blurring memories: The differential impact of interference on separate aspects of forgetting.

PubMed

Sun, Sol Z; Fidalgo, Celia; Barense, Morgan D; Lee, Andy C H; Cant, Jonathan S; Ferber, Susanne

2017-11-01

Interference disrupts information processing across many timescales, from immediate perception to memory over short and long durations. The widely held similarity assumption states that as similarity between interfering information and memory contents increases, so too does the degree of impairment. However, information is lost from memory in different ways. For instance, studied content might be erased in an all-or-nothing manner. Alternatively, information may be retained but the precision might be degraded or blurred. Here, we asked whether the similarity of interfering information to memory contents might differentially impact these 2 aspects of forgetting. Observers studied colored images of real-world objects, each followed by a stream of interfering objects. Across 4 experiments, we manipulated the similarity between the studied object and the interfering objects in circular color space. After interference, memory for object color was tested continuously on a color wheel, which in combination with mixture modeling, allowed for estimation of how erasing and blurring differentially contribute to forgetting. In contrast to the similarity assumption, we show that highly dissimilar interfering items caused the greatest increase in random guess responses, suggesting a greater frequency of memory erasure (Experiments 1-3). Moreover, we found that observers were generally able to resist interference from highly similar items, perhaps through surround suppression (Experiments 1 and 4). Finally, we report that interference from items of intermediate similarity tended to blur or decrease memory precision (Experiments 3 and 4). These results reveal that the nature of visual similarity can differentially alter how information is lost from memory. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
An object cue is more effective than a word in ERP-based detection of deception.

PubMed

Cutmore, Tim R H; Djakovic, Tatjana; Kebbell, Mark R; Shum, David H K

2009-03-01

Recent studies of deception have used a form of the guilty knowledge test along with the oddball P300 event-related potential (ERP) to uncover hidden memories. These studies typically have used words as the cuing stimuli. In the present study, a mock crime was enacted by participants to prime their episodic memory and different memory cue types (Words, Pictures of Objects and Faces) were created to investigate their relative efficacy in identifying guilt. A peak-to peak (p-p) P300 response was computed for rare known non-guilty item (target), rare guilty knowledge item (probe) and frequently presented unknown items (irrelevant). Difference in this P300 measure between the probe and irrelevant was the key dependent variable. Object cues were found to be the most effective, particularly at the parietal site. A bootstrap procedure commonly used to detect deception in individual participants by comparing their probe and irrelevant P300 p-p showed the object cues to provide the best discrimination. Furthermore, using all three of the cue types together provided high detection accuracy (94%). These results confirm prior findings on the utility of ERPs for detecting deception. More importantly, they provide support for the hypothesis that direct cueing with a picture of the crime object may be more effective than using a word (consistent with the picture superiority effect reported in the literature). Finally, a face cue (e.g., crime victim) may also provide a useful probe for detection of guilty knowledge but this stimulus form needs to be chosen with due caution.
Evolution of a Test Item

ERIC Educational Resources Information Center

Spaan, Mary

2007-01-01

This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…

Readability Level of Standardized Test Items and Student Performance: The Forgotten Validity Variable

ERIC Educational Resources Information Center

Hewitt, Margaret A.; Homan, Susan P.

2004-01-01

Test validity issues considered by test developers and school districts rarely include individual item readability levels. In this study, items from a major standardized test were examined for individual item readability level and item difficulty. The Homan-Hewitt Readability Formula was applied to items across three grade levels. Results of…
Use of Bloom's Taxonomy in Developing Reading Comprehension Specifications

ERIC Educational Resources Information Center

Luebke, Stephen; Lorie, James

2013-01-01

This article is a brief account of the use of Bloom's Taxonomy of Educational Objectives (Bloom, Engelhart, Furst, Hill, & Krathwohl, 1956) by staff of the Law School Admission Council in the 1990 development of redesigned specifications for the Reading Comprehension section of the Law School Admission Test. Summary item statistics for the…
Development and testing of the Solar Control Corporation modular controller and Solarstat subsystem

NASA Technical Reports Server (NTRS)

Hankins, J. D.

1979-01-01

Results of development work on an existing controller and solarstat subsystem for use with solar heating and cooling systems are presented. The deliverable end items, program objectives, and how they were accomplished are described. It is shown that the products developed are marketable and suitable for public use.
Personality Tests: Self-Disclosures or Self-Presentations?

ERIC Educational Resources Information Center

Johnson, John A.

When people talk about themselves, psychologists have noted that their verbal reports can be categorized as simple factual communications about the self, i.e., self-disclosure, or as ways to instruct others about how one is to be regarded, i.e., self-presentation. Responses to items on objective self-report measures of personality similarly can be…
Development of a Drug Use Resistance Self-Efficacy (DURSE) Scale

ERIC Educational Resources Information Center

Carpenter, Carrie M.; Howard, Donna

2009-01-01

Objectives: To develop and evaluate psychometric properties of a new instrument, the drug use resistance self-efficacy (DURSE) scale, designed for young adolescents. Methods: Scale construction occurred in 3 phases: (1) initial development, (2) pilot testing of preliminary items, and (3) final scale administration among a sample of seventh graders…
The Time-Course of Lexical Activation during Sentence Comprehension in People with Aphasia

ERIC Educational Resources Information Center

Ferrill, Michelle; Love, Tracy; Walenski, Matthew; Shapiro, Lewis P.

2012-01-01

Purpose: To investigate the time-course of processing of lexical items in auditorily presented canonical (subject-verb-object) constructions in young, neurologically unimpaired control participants and participants with left-hemisphere damage and agrammatic aphasia. Method: A cross modal picture priming (CMPP) paradigm was used to test 114 control…
Measuring the Institutional Stance on Matters of Student Conduct.

ERIC Educational Resources Information Center

Seligman, Richard

This study is concerned with the initial testing of a questionnaire designed to examine student discipline policies in terms of student perception of goals and objectives, scope, procedures, and sanctions. "Institutional Procedures in Colleges and Universities" (IPCU), the questionnaire, containing 45 items, was sent to students and deans at 5…
Psychometric Properties of an Arabic Version of the Depression Anxiety Stress Scales (DASS)

ERIC Educational Resources Information Center

Moussa, Miriam Taouk; Lovibond, Peter; Laube, Roy; Megahead, Hamido A.

2017-01-01

Objective: To translate and evaluate the psychometric properties of an Arabic-language version of the Depression Anxiety Stress Scales (DASS). Method: The items were translated, back translated, refined, and tested in an Australian immigrant sample (N = 220). Results: Confirmatory factor analysis showed that the Arabic DASS discriminates between…
Development and psychometric evaluation of a health-related quality of life instrument for individuals with adult-onset hearing loss.

PubMed

Stika, Carren J; Hays, Ron D

2015-07-01

Self-reports of 'hearing handicap' are available, but a comprehensive measure of health-related quality of life (HRQOL) for individuals with adult-onset hearing loss (AOHL) does not exist. Our objective was to develop and evaluate a multidimensional HRQOL instrument for individuals with AOHL. The Impact of Hearing Loss Inventory Tool (IHEAR-IT) was developed using results of focus groups, a literature review, advisory expert panel input, and cognitive interviews. The 73-item field-test instrument was completed by 409 adults (22-91 years old) with varying degrees of AOHL and from different areas of the USA. Multitrait scaling analysis supported four multi-item scales and five individual items. Internal consistency reliabilities ranged from 0.93 to 0.96 for the scales. Construct validity was supported by correlations between the IHEAR-IT scales and scores on the 36-item Short Form Health Survey, version 2.0 (SF-36v2) mental composite summary (r = 0.32-0.64) and the Hearing Handicap Inventory for the Elderly/Adults (HHIE/HHIA) (r ≥ -0.70). The field test provides initial support for the reliability and construct validity of the IHEAR-IT for evaluating HRQOL of individuals with AOHL. Further research is needed to evaluate the responsiveness to change of the IHEAR-IT scales and identify items for a short-form.
76 FR 58032 - Notice of Intent To Repatriate a Cultural Item: State Historical Society of Wisconsin, Madison, WI

Federal Register 2010, 2011, 2012, 2013, 2014

2011-09-19

... Indian Tribe, has determined a cultural item meets the definitions of sacred object and object of..., that meets the definitions of sacred object and object of cultural patrimony under 25 U.S.C. 3001. This... ceremonial object needed by Ho-Chunk religious leaders for the practice of traditional Native American...
Different impairments of semantic cognition in semantic dementia and semantic aphasia: evidence from the non-verbal domain

PubMed Central

Corbett, Faye; Jefferies, Elizabeth; Ehsan, Sheeba

2009-01-01

Disorders of semantic cognition in different neuropsychological conditions result from diverse areas of brain damage and may have different underlying causes. This study used a comparative case-series design to examine the hypothesis that relatively circumscribed bilateral atrophy of the anterior temporal lobe in semantic dementia (SD) produces a gradual degradation of core semantic representations, whilst a deficit of cognitive control produces multi-modal semantic impairment in a subset of patients with stroke aphasia following damage involving the left prefrontal cortex or regions in and around the temporoparietal area; this condition, which transcends traditional aphasia classifications, is referred to as ‘semantic aphasia’ (SA). There have been very few direct comparisons of these patient groups to date and these previous studies have focussed on verbal comprehension. This study used a battery of object-use tasks to extend this line of enquiry into the non-verbal domain for the first time. A group of seven SA patients were identified who failed both word and picture versions of a semantic association task. These patients were compared with eight SD cases. Both groups showed significant deficits in object use but these impairments were qualitatively different. Item familiarity correlated with performance on object-use tasks for the SD group, consistent with the view that core semantic representations are degrading in this condition. In contrast, the SA participants were insensitive to the familiarity of the objects. Further, while the SD patients performed consistently across tasks that tapped different aspects of knowledge and object use for the same items, the performance of the SA participants reflected the control requirements of the tasks. Single object use was relatively preserved in SA but performance on complex mechanical puzzles was substantially impaired. Similarly, the SA patients were able to complete straightforward item matching tasks, such as word-picture matching, but performed more poorly on associative picture-matching tasks, even when the tests involved the same items. The two groups of patients also showed a different pattern of errors in object use. SA patients made substantial numbers of erroneous intrusions in their demonstrations, such as inappropriate object movements. In contrast, response omissions were more common in SD. This study provides converging evidence for qualitatively different impairments of semantic cognition in SD and SA, and uniquely demonstrates this pattern in a non-verbal expressive domain—object use. PMID:19506072
Predictors of nutrition label viewing during food purchase decision making: an eye tracking investigation

PubMed Central

Graham, Dan J; Jeffery, Robert W

2015-01-01

Objective Nutrition label use could help consumers eat healthfully. Despite consumers reporting label use, diets are not very healthful and obesity rates continue to rise. The present study investigated whether self-reported label use matches objectively measured label viewing by monitoring the gaze of individuals viewing labels. Design The present study monitored adults viewing sixty-four food items on a computer equipped with an eye-tracking camera as they made simulated food purchasing decisions. ANOVA and t tests were used to compare label viewing across various subgroups (e.g. normal weight υ. overweight υ. obese; married υ. unmarried) and also across various types of foods (e.g. snacks υ. fruits and vegetables). Setting Participants came to the University of Minnesota’s Epidemiology Clinical Research Center in spring 2010. Subjects The 203 participants were ≥18 years old and capable of reading English words on a computer 76 cm (30 in) away. Results Participants looked longer at labels for ‘meal’ items like pizza, soup and yoghurt compared with fruits and vegetables, snack items like crackers and nuts, and dessert items like ice cream and cookies. Participants spent longer looking at labels for foods they decided to purchase compared with foods they decided not to purchase. There were few between-group differences in nutrition label viewing across sex, race, age, BMI, marital status, income or educational attainment. Conclusions Nutrition label viewing is related to food purchasing, and labels are viewed more when a food’s healthfulness is ambiguous. Objectively measuring nutrition label viewing provides new insight into label use by various sociodemographic groups. PMID:21733280
Development and validation of a new questionnaire for the assessment of subjective physical performance in adult patients with haemophilia--the HEP-Test-Q.

PubMed

von Mackensen, S; Czepa, D; Herbsleb, M; Hilberg, T

2010-01-01

Specific research studies for the investigation of physical performance in haemophilic patients are rare. However, these instruments become increasingly more important to evaluate therapeutic treatments. Within the frame of the Haemophilia & Exercise Project (HEP), a new questionnaire, namely HEP-Test-Q, has been developed for the assessment of subjective physical performance in haemophilic adults. In this article, the development and validation of the HEP-Test-Q is described. The development consisted of different phases including item collection, pilot testing and field testing. The preliminary version was pilot-tested in 24 German HEP-participants. Following evaluation and preliminary psychometric analysis, the HEP-Test-Q was revised. The final version consists of 25 items pertaining to the domains 'mobility', 'strength & coordination', 'endurance' and 'body perception', which was administered to 43 German haemophilic patients (43.8 +/- 11.2 years). Psychometric analysis included reliability and validity testing. Convergent validity was tested correlating the HEP-Test-Q with SF-36, Haem-A-QoL, HAL and the Orthopaedic Joint Score. Discriminant validity tested different clinical subgroups. Patients accepted the questionnaire and found it easy to fill in. Psychometric testing revealed good values for reliability in terms of internal consistency (Cronbach's alpha = 0.96) and test-retest reliability (r = 0.90) as well as for convergent validity correlating highly with Haem-A-QoL, HAL and SF-36. Discriminant validity testing showed significant differences for age, hepatitis A and hepatitis B and the number of target joints. HEP-Test-Q is a short and well-accepted questionnaire, assessing subjective physical performance of haemophiliacs, which might be combined with objective assessments to reveal aspects, which cannot be measured objectively, such as body perception.
[Effect of object consistency in a spatial contextual cueing paradigm].

PubMed

Takeda, Yuji

2008-04-01

Previous studies demonstrated that attention can be quickly guided to a target location in a visual search task when the spatial configurations of search items and/or the object identities were repeated in the previous trials. This phenomenon is termed contextual cueing. Recently, it was reported that spatial configuration learning and object identity learning occurred independently, when novel contours were used as search items. The present study examined whether this learning occurred independently even when the search items were meaningful. The results showed that the contextual cueing effect was observed even if the relationships between the spatial locations and object identities were jumbled (Experiment 1). However, it disappeared when the search items were changed into geometric patterns (Experiment 2). These results suggest that the spatial configuration can be learned independent of the object identities; however, the use of the learned configuration is restricted by the learning situations.
Impressions of functional food consumers.

PubMed

Saher, Marieke; Arvola, Anne; Lindeman, Marjaana; Lähteenmäki, Liisa

2004-02-01

Functional foods provide a new way of expressing healthiness in food choices. The objective of this study was to apply an indirect measure to explore what kind of impressions people form of users of functional foods. Respondents (n=350) received one of eight versions of a shopping list and rated the buyer of the foods on 66 bipolar attributes on 7-point scales. The shopping lists had either healthy or neutral background items, conventional or functional target items and the buyer was described either as a 40-year-old woman or man. The attribute ratings revealed three factors: disciplined, innovative and gentle. Buyers with healthy background items were perceived as more disciplined than those having neutral items on the list, users of functional foods were rated as more disciplined than users of conventional target items only when the background list consisted of neutral items. Buyers of functional foods were regarded as more innovative and less gentle, but gender affected the ratings on gentle dimension. The impressions of functional food users clearly differ from those formed of users of conventional foods with a healthy image. The shopping list method performed well as an indirect method, but further studies are required to test its feasibility in measuring other food-related impressions.
The Effect of the Position of an Item within a Test on the Item Difficulty Value.

ERIC Educational Resources Information Center

Rubin, Lois S.; Mott, David E. W.

An investigation of the effect on the difficulty value of an item due to position placement within a test was made. Using a 60-item operational test comprised of 5 subtests, 60 items were placed as experimental items on a number of spiralled test forms in three different positions (first, middle, last) within the subtest composed of like items.…
Relevance of Item Analysis in Standardizing an Achievement Test in Teaching of Physical Science in B.Ed Syllabus

ERIC Educational Resources Information Center

Marie, S. Maria Josephine Arokia; Edannur, Sreekala

2015-01-01

This paper focused on the analysis of test items constructed in the paper of teaching Physical Science for B.Ed. class. It involved the analysis of difficulty level and discrimination power of each test item. Item analysis allows selecting or omitting items from the test, but more importantly item analysis is a tool to help the item writer improve…
Rapid and Accurate Behavioral Health Diagnostic Screening: Initial Validation Study of a Web-Based, Self-Report Tool (the SAGE-SR)

PubMed Central

Purcell, Susan E; Rhea, Karen; Maier, Philip; First, Michael; Zweede, Lisa; Sinisterra, Manuela; Nunn, M Brad; Austin, Marie-Paule; Brodey, Inger S

2018-01-01

Background The Structured Clinical Interview for DSM (SCID) is considered the gold standard assessment for accurate, reliable psychiatric diagnoses; however, because of its length, complexity, and training required, the SCID is rarely used outside of research. Objective This paper aims to describe the development and initial validation of a Web-based, self-report screening instrument (the Screening Assessment for Guiding Evaluation-Self-Report, SAGE-SR) based on the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) and the SCID-5-Clinician Version (CV) intended to make accurate, broad-based behavioral health diagnostic screening more accessible within clinical care. Methods First, study staff drafted approximately 1200 self-report items representing individual granular symptoms in the diagnostic criteria for the 8 primary SCID-CV modules. An expert panel iteratively reviewed, critiqued, and revised items. The resulting items were iteratively administered and revised through 3 rounds of cognitive interviewing with community mental health center participants. In the first 2 rounds, the SCID was also administered to participants to directly compare their Likert self-report and SCID responses. A second expert panel evaluated the final pool of items from cognitive interviewing and criteria in the DSM-5 to construct the SAGE-SR, a computerized adaptive instrument that uses branching logic from a screener section to administer appropriate follow-up questions to refine the differential diagnoses. The SAGE-SR was administered to healthy controls and outpatient mental health clinic clients to assess test duration and test-retest reliability. Cutoff scores for screening into follow-up diagnostic sections and criteria for inclusion of diagnoses in the differential diagnosis were evaluated. Results The expert panel reduced the initial 1200 test items to 664 items that panel members agreed collectively represented the SCID items from the 8 targeted modules and DSM criteria for the covered diagnoses. These 664 items were iteratively submitted to 3 rounds of cognitive interviewing with 50 community mental health center participants; the expert panel reviewed session summaries and agreed on a final set of 661 clear and concise self-report items representing the desired criteria in the DSM-5. The SAGE-SR constructed from this item pool took an average of 14 min to complete in a nonclinical sample versus 24 min in a clinical sample. Responses to individual items can be combined to generate DSM criteria endorsements and differential diagnoses, as well as provide indices of individual symptom severity. Preliminary measures of test-retest reliability in a small, nonclinical sample were promising, with good to excellent reliability for screener items in 11 of 13 diagnostic screening modules (intraclass correlation coefficient [ICC] or kappa coefficients ranging from .60 to .90), with mania achieving fair test-retest reliability (ICC=.50) and other substance use endorsed too infrequently for analysis. Conclusions The SAGE-SR is a computerized adaptive self-report instrument designed to provide rigorous differential diagnostic information to clinicians. PMID:29572204
Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

ERIC Educational Resources Information Center

Wang, Wei

2013-01-01

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Test item linguistic complexity and assessments for deaf students.

PubMed

Cawthon, Stephanie

2011-01-01

Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64 students completed 52 multiple-choice items, 32 in mathematics and 20 in reading. These items were coded for linguistic complexity components of vocabulary, syntax, and discourse. Mathematics items had higher linguistic complexity ratings than reading items, but there were no significant relationships between item linguistic complexity scores and student performance on the test items. The discussion addresses issues related to the subject area, student proficiency levels in the test content, factors to look for in determining a "linguistic complexity effect," and areas for further research in test item development and deaf students.

Source Memory for Self and Other in Patients With Mild Cognitive Impairment due to Alzheimer’s Disease

PubMed Central

Deason, Rebecca G.; Budson, Andrew E.; Gutchess, Angela H.

2016-01-01

Objectives. The present study examined the role of enactment in source memory in a cognitively impaired population. As seen in healthy older adults, it was predicted that source memory in people with mild cognitive impairment due to Alzheimer’s disease (MCI-AD) would benefit from the self-reference aspect of enactment. Method. Seventeen participants with MCI-AD and 18 controls worked in small groups to pack a picnic basket and suitcase and were later tested for their source memory for each item. Results. For item memory, self-referencing improved corrected recognition scores for both MCI-AD and control participants. The MCI-AD group did not demonstrate the same benefit as controls in correct source memory for self-related items. However, those with MCI-AD were relatively less likely to misattribute new items to the self and more likely to misattribute new items to others when committing errors, compared with controls. Discussion. The enactment effect and self-referencing did not enhance accurate source memory more than other referencing for patients with MCI-AD. However, people with MCI-AD benefited in item memory and source memory, being less likely to falsely claim new items as their own, indicating some self-reference benefit occurs for people with MCI-AD. PMID:24904049
78 FR 11679 - Notice of Intent To Repatriate a Cultural Item: Binghamton University, State University of New...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-02-19

... with the appropriate Indian tribes, has determined that a cultural item meets the definition of sacred... the definition of sacred object under 25 U.S.C. 3001. This notice is published as part of the National... item described above is a specific ceremonial object needed by traditional Native American religious...
School environments and physical activity: the development and testing of an audit tool

PubMed Central

Jones, Natalia R; Jones, Andy; van Sluijs, Esther MF; Panter, Jenna; Harrison, Flo; Griffin, Simon J

2013-01-01

The aim of this study was to develop, test, and employ an audit tool to objectively assess the opportunities for physical activity within school environments. A 44 item tool was developed and tested at 92 primary schools in the county of Norfolk, England, during summer term of 2007. Scores from the tool covering 6 domains of facility provision were examined against objectively measured hourly moderate to vigorous physical activity levels in 1868 9-10 year old pupils attending the schools. The tool was found to have acceptable reliability and good construct validity, differentiating the physical activity levels of children attending the highest and lowest scoring schools. The characteristics of school grounds may influence pupil’s physical activity levels. PMID:20435506
Evaluation of the automatic optical authentication technologies for control systems of objects

NASA Astrophysics Data System (ADS)

Averkin, Vladimir V.; Volegov, Peter L.; Podgornov, Vladimir A.

2000-03-01

The report considers the evaluation of the automatic optical authentication technologies for the automated integrated system of physical protection, control and accounting of nuclear materials at RFNC-VNIITF, and for providing of the nuclear materials nonproliferation regime. The report presents the nuclear object authentication objectives and strategies, the methodology of the automatic optical authentication and results of the development of pattern recognition techniques carried out under the ISTC project #772 with the purpose of identification of unique features of surface structure of a controlled object and effects of its random treatment. The current decision of following functional control tasks is described in the report: confirmation of the item authenticity (proof of the absence of its substitution by an item of similar shape), control over unforeseen change of item state, control over unauthorized access to the item. The most important distinctive feature of all techniques is not comprehensive description of some properties of controlled item, but unique identification of item using minimum necessary set of parameters, properly comprising identification attribute of the item. The main emphasis in the technical approach is made on the development of rather simple technological methods for the first time intended for use in the systems of physical protection, control and accounting of nuclear materials. The developed authentication devices and system are described.
The Selection of Test Items for Decision Making with a Computer Adaptive Test.

ERIC Educational Resources Information Center

Spray, Judith A.; Reckase, Mark D.

The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…
Comparison of Objective and Subjective Methods on Determination of Differential Item Functioning

ERIC Educational Resources Information Center

Sahin, Melek Gülsah

2017-01-01

Research objective is comparing the objective methods often used in literature for determination of differential item functioning (DIF) and the subjective method based on the opinions of the experts which are not used so often in literature. Mantel-Haenszel (MH), Logistic Regression (LR) and SIBTEST are chosen as objective methods. While the data…
77 FR 23500 - Notice of Intent To Repatriate Cultural Items: Milwaukee Public Museum, Milwaukee, WI

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-19

... determined that the cultural items meet the definition of sacred objects and repatriation to the Indian tribe... the control of the Milwaukee Public Museum that meet the definition of sacred object under 25 U.S.C... sacred object based on the documented use of these objects during the Midewiwin ceremonies...
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

PubMed

Tepe, Rodger; Tepe, Chabha

2015-03-01

To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
A New Item Selection Procedure for Mixed Item Type in Computerized Classification Testing.

ERIC Educational Resources Information Center

Lau, C. Allen; Wang, Tianyou

This paper proposes a new Information-Time index as the basis for item selection in computerized classification testing (CCT) and investigates how this new item selection algorithm can help improve test efficiency for item pools with mixed item types. It also investigates how practical constraints such as item exposure rate control, test…
A Process for Reviewing and Evaluating Generated Test Items

ERIC Educational Resources Information Center

Gierl, Mark J.; Lai, Hollis

2016-01-01

Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Validation of psychosocial scales for physical activity in university students

PubMed Central

Tassitano, Rafael Miranda; de Farias, José Cazuza; Rech, Cassiano Ricardo; Tenório, Maria Cecília Marinho; Cabral, Poliana Coelho; da Silva, Giselia Alves Pontes

2015-01-01

OBJECTIVE Translate the Patient-centered Assessment and Counseling for Exercise questionnaire, adapt it cross-culturally and identify the psychometric properties of the psychosocial scales for physical activity in young university students. METHODS The Patient-centered Assessment and Counseling for Exercise questionnaire is made up of 39 items divided into constructs based on the social cognitive theory and the transtheoretical model. The analyzed constructs were, as follows: behavior change strategy (15 items), decision-making process (10), self-efficacy (6), support from family (4), and support from friends (4). The validation procedures were conceptual, semantic, operational, and functional equivalences, in addition to the equivalence of the items and of measurements. The conceptual, of items and semantic equivalences were performed by a specialized committee. During measurement equivalence, the instrument was applied to 717 university students. Exploratory factor analysis was used to verify the loading of each item, explained variance and internal consistency of the constructs. Reproducibility was measured by means of intraclass correlation coefficient. RESULTS The two translations were equivalent and back-translation was similar to the original version, with few adaptations. The layout, presentation order of the constructs and items from the original version were kept in the same form as the original instrument. The sample size was adequate and was evaluated by the Kaiser-Meyer-Olkin test, with values between 0.72 and 0.91. The correlation matrix of the items presented r < 0.8 (p < 0.05). The factor loadings of the items from all the constructs were satisfactory (> 0.40), varying between 0.43 and 0.80, which explained between 45.4% and 59.0% of the variance. Internal consistency was satisfactory (α ≥ 0.70), with support from friends being 0.70 and 0.92 for self-efficacy. Most items (74.3%) presented values above 0.70 for the reproducibility test. CONCLUSIONS The validation process steps were considered satisfactory and adequate for applying to the population. PMID:26270013
Validation of psychosocial scales for physical activity in university students.

PubMed

Tassitano, Rafael Miranda; de Farias Júnior, José Cazuza; Rech, Cassiano Ricardo; Tenório, Maria Cecília Marinho; Cabral, Poliana Coelho; da Silva, Giselia Alves Pontes

2015-01-01

OBJECTIVE Translate the Patient-centered Assessment and Counseling for Exercise questionnaire, adapt it cross-culturally and identify the psychometric properties of the psychosocial scales for physical activity in young university students. METHODS The Patient-centered Assessment and Counseling for Exercise questionnaire is made up of 39 items divided into constructs based on the social cognitive theory and the transtheoretical model. The analyzed constructs were, as follows: behavior change strategy (15 items), decision-making process (10), self-efficacy (6), support from family (4), and support from friends (4). The validation procedures were conceptual, semantic, operational, and functional equivalences, in addition to the equivalence of the items and of measurements. The conceptual, of items and semantic equivalences were performed by a specialized committee. During measurement equivalence, the instrument was applied to 717 university students. Exploratory factor analysis was used to verify the loading of each item, explained variance and internal consistency of the constructs. Reproducibility was measured by means of intraclass correlation coefficient. RESULTS The two translations were equivalent and back-translation was similar to the original version, with few adaptations. The layout, presentation order of the constructs and items from the original version were kept in the same form as the original instrument. The sample size was adequate and was evaluated by the Kaiser-Meyer-Olkin test, with values between 0.72 and 0.91. The correlation matrix of the items presented r < 0.8 (p < 0.05). The factor loadings of the items from all the constructs were satisfactory (> 0.40), varying between 0.43 and 0.80, which explained between 45.4% and 59.0% of the variance. Internal consistency was satisfactory (α ≥ 0.70), with support from friends being 0.70 and 0.92 for self-efficacy. Most items (74.3%) presented values above 0.70 for the reproducibility test. CONCLUSIONS The validation process steps were considered satisfactory and adequate for applying to the population.
Mental health in primary care: an evaluation using the Item Response Theory

PubMed Central

da Rocha, Hugo André; dos Santos, Alaneir de Fátima; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal

2018-01-01

ABSTRACT OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach’s alpha, Spearman’s correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach’s alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions. PMID:29489992
Influence of dominant- as compared with nondominant-side symptoms on Disabilities of the Arm, Shoulder and Hand and Western Ontario Rotator Cuff scores in patients with rotator cuff tendinopathy.

PubMed

Christiansen, David Høyrup; Michener, Lori; Roy, Jean-Sébastien

2018-02-13

The Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire and the Western Ontario Rotator Cuff (WORC) index are 2 widely used patient-reported questionnaires in individuals with rotator cuff (RC) tendinopathy. In contrast to the WORC index, for which the items are specific to the affected shoulder, the items of the DASH questionnaire assess the ability to perform activities regardless of the arm used. The objective of this study is to determine whether scores on the DASH questionnaire and WORC index are affected if the symptoms are on the dominant or nondominant side in individuals with RC tendinopathy. Given the number of items that can be influenced by dominance, the hypothesis is that DASH scores will be impacted by the side of the symptoms. Individuals with RC tendinopathy (N = 149) completed questions on symptomatology and hand dominance, the DASH questionnaire, and the WORC index. Differences in total scores (independent t test) and single items (Wilcoxon rank sum test) were compared between groups of participants with dominant-side symptoms and those without dominant-side symptoms. No significant differences were observed for WORC or DASH total scores when comparing participants with and without symptoms on their dominant side. Single-item comparison revealed more items being affected by symptom side on the DASH questionnaire (6 of 30 items) than on the WORC index (2 of 21 items). The side of the symptoms does not influence the DASH and WORC total scores, as there are no systematic differences between individuals with and without symptoms in their dominant shoulder. However, the presence of dominant symptoms does influence item scores more on the DASH questionnaire than on the WORC index. Copyright © 2018 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Confirming the cognition of rising scores: Fox and Mitchum (2013) predicts violations of measurement invariance in series completion between age-matched cohorts.

PubMed

Fox, Mark C; Mitchum, Ainsley L

2014-01-01

The trend of rising scores on intelligence tests raises important questions about the comparability of variation within and between time periods. Descriptions of the processes that mediate selection of item responses provide meaningful psychological criteria upon which to base such comparisons. In a recent paper, Fox and Mitchum presented and tested a cognitive theory of rising scores on analogical and inductive reasoning tests that is specific enough to make novel predictions about cohort differences in patterns of item responses for tests such as the Raven's Matrices. In this paper we extend the same proposal in two important ways by (1) testing it against a dataset that enables the effects of cohort to be isolated from those of age, and (2) applying it to two other inductive reasoning tests that exhibit large Flynn effects: Letter Series and Word Series. Following specification and testing of a confirmatory item response model, predicted violations of measurement invariance are observed between two age-matched cohorts that are separated by only 20 years, as members of the later cohort are found to map objects at higher levels of abstraction than members of the earlier cohort who possess the same overall level of ability. Results have implications for the Flynn effect and cognitive aging while underscoring the value of establishing psychological criteria for equating members of distinct groups who achieve the same scores.
The associative memory deficit in aging is related to reduced selectivity of brain activity during encoding

PubMed Central

Saverino, Cristina; Fatima, Zainab; Sarraf, Saman; Oder, Anita; Strother, Stephen C.; Grady, Cheryl L.

2016-01-01

Human aging is characterized by reductions in the ability to remember associations between items, despite intact memory for single items. Older adults also show less selectivity in task-related brain activity, such that patterns of activation become less distinct across multiple experimental tasks. This reduced selectivity, or dedifferentiation, has been found for episodic memory, which is often reduced in older adults, but not for semantic memory, which is maintained with age. We used functional magnetic resonance imaging (fMRI) to investigate whether there is a specific reduction in selectivity of brain activity during associative encoding in older adults, but not during item encoding, and whether this reduction predicts associative memory performance. Healthy young and older adults were scanned while performing an incidental-encoding task for pictures of objects and houses under item or associative instructions. An old/new recognition test was administered outside the scanner. We used agnostic canonical variates analysis and split-half resampling to detect whole brain patterns of activation that predicted item vs. associative encoding for stimuli that were later correctly recognized. Older adults had poorer memory for associations than did younger adults, whereas item memory was comparable across groups. Associative encoding trials, but not item encoding trials, were predicted less successfully in older compared to young adults, indicating less distinct patterns of associative-related activity in the older group. Importantly, higher probability of predicting associative encoding trials was related to better associative memory after accounting for age and performance on a battery of neuropsychological tests. These results provide evidence that neural distinctiveness at encoding supports associative memory and that a specific reduction of selectivity in neural recruitment underlies age differences in associative memory. PMID:27082043
What's in a Topic? Exploring the Interaction between Test-Taker Age and Item Content in High-Stakes Testing

ERIC Educational Resources Information Center

Banerjee, Jayanti; Papageorgiou, Spiros

2016-01-01

The research reported in this article investigates differential item functioning (DIF) in a listening comprehension test. The study explores the relationship between test-taker age and the items' language domains across multiple test forms. The data comprise test-taker responses (N = 2,861) to a total of 133 unique items, 46 items of which were…
Item response theory analysis of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised in the Pooled Resource Open-Access ALS Clinical Trials Database.

PubMed

Bacci, Elizabeth D; Staniewska, Dorota; Coyne, Karin S; Boyer, Stacey; White, Leigh Ann; Zach, Neta; Cedarbaum, Jesse M

2016-01-01

Our objective was to examine dimensionality and item-level performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) across time using classical and modern test theory approaches. Confirmatory factor analysis (CFA) and Item Response Theory (IRT) analyses were conducted using data from patients with amyotrophic lateral sclerosis (ALS) Pooled Resources Open-Access ALS Clinical Trials (PRO-ACT) database with complete ALSFRS-R data (n = 888) at three time-points (Time 0, Time 1 (6-months), Time 2 (1-year)). Results demonstrated that in this population of 888 patients, mean age was 54.6 years, 64.4% were male, and 93.7% were Caucasian. The CFA supported a 4* individual-domain structure (bulbar, gross motor, fine motor, and respiratory domains). IRT analysis within each domain revealed misfitting items and overlapping item response category thresholds at all time-points, particularly in the gross motor and respiratory domain items. Results indicate that many of the items of the ALSFRS-R may sub-optimally distinguish among varying levels of disability assessed by each domain, particularly in patients with less severe disability. Measure performance improved across time as patient disability severity increased. In conclusion, modifications to select ALSFRS-R items may improve the instrument's specificity to disability level and sensitivity to treatment effects.
A Computer-Adaptive Disability Instrument for Lower Extremity Osteoarthritis Research Demonstrated Promising Breadth, Precision and Reliability

PubMed Central

Jette, Alan M.; McDonough, Christine M.; Haley, Stephen M.; Ni, Pengsheng; Olarsch, Sippy; Latham, Nancy; Hambleton, Ronald K.; Felson, David; Kim, Young-jo; Hunter, David

2012-01-01

Objective To develop and evaluate a prototype measure (OA-DISABILITY-CAT) for osteoarthritis research using Item Response Theory (IRT) and Computer Adaptive Test (CAT) methodologies. Study Design and Setting We constructed an item bank consisting of 33 activities commonly affected by lower extremity (LE) osteoarthritis. A sample of 323 adults with LE osteoarthritis reported their degree of limitation in performing everyday activities and completed the Health Assessment Questionnaire-II (HAQ-II). We used confirmatory factor analyses to assess scale unidimensionality and IRT methods to calibrate the items and examine the fit of the data. Using CAT simulation analyses, we examined the performance of OA-DISABILITY-CATs of different lengths compared to the full item bank and the HAQ-II. Results One distinct disability domain was identified. The 10-item OA-DISABILITY-CAT demonstrated a high degree of accuracy compared with the full item bank (r=0.99). The item bank and the HAQ-II scales covered a similar estimated scoring range. In terms of reliability, 95% of OA-DISABILITY reliability estimates were over 0.83 versus 0.60 for the HAQ-II. Except at the highest scores the 10-item OA-DISABILITY-CAT demonstrated superior precision to the HAQ-II. Conclusion The prototype OA-DISABILITY-CAT demonstrated promising measurement properties compared to the HAQ-II, and is recommended for use in LE osteoarthritis research. PMID:19216052
Reliability of a store observation tool in measuring availability of alcohol and selected foods.

PubMed

Cohen, Deborah A; Schoeff, Diane; Farley, Thomas A; Bluthenthal, Ricky; Scribner, Richard; Overton, Adrian

2007-11-01

Alcohol and food items can compromise or contribute to health, depending on the quantity and frequency with which they are consumed. How much people consume may be influenced by product availability and promotion in local retail stores. We developed and tested an observational tool to objectively measure in-store availability and promotion of alcoholic beverages and selected food items that have an impact on health. Trained observers visited 51 alcohol outlets in Los Angeles and southeastern Louisiana. Using a standardized instrument, two independent observations were conducted documenting the type of outlet, the availability and shelf space for alcoholic beverages and selected food items, the purchase price of standard brands, the placement of beer and malt liquor, and the amount of in-store alcohol advertising. Reliability of the instrument was excellent for measures of item availability, shelf space, and placement of malt liquor. Reliability was lower for alcohol advertising, beer placement, and items that measured the "least price" of apples and oranges. The average kappa was 0.87 for categorical items and the average intraclass correlation coefficient was 0.83 for continuous items. Overall, systematic observation of the availability and promotion of alcoholic beverages and food items was feasible, acceptable, and reliable. Measurement tools such as the one we evaluated should be useful in studies of the impact of availability of food and beverages on consumption and on health outcomes.

Development and validation of brief scales to measure emotional and behavioural problems among Chinese adolescents

PubMed Central

Shen, Minxue; Hu, Ming; Sun, Zhenqiu

2017-01-01

Objectives To develop and validate brief scales to measure common emotional and behavioural problems among adolescents in the examination-oriented education system and collectivistic culture of China. Setting Middle schools in Hunan province. Participants 5442 middle school students aged 11–19 years were sampled. 4727 valid questionnaires were collected and used for validation of the scales. The final sample included 2408 boys and 2319 girls. Primary and secondary outcome measures The tools were assessed by the item response theory, classical test theory (reliability and construct validity) and differential item functioning. Results Four scales to measure anxiety, depression, study problem and sociality problem were established. Exploratory factor analysis showed that each scale had two solutions. Confirmatory factor analysis showed acceptable to good model fit for each scale. Internal consistency and test–retest reliability of all scales were above 0.7. Item response theory showed that all items had acceptable discrimination parameters and most items had appropriate difficulty parameters. 10 items demonstrated differential item functioning with respect to gender. Conclusions Four brief scales were developed and validated among adolescents in middle schools of China. The scales have good psychometric properties with minor differential item functioning. They can be used in middle school settings, and will help school officials to assess the students’ emotional/behavioural problems. PMID:28062469
Rover nuclear rocket engine program: Overview of rover engine tests

NASA Technical Reports Server (NTRS)

Finseth, J. L.

1991-01-01

The results of nuclear rocket development activities from the inception of the ROVER program in 1955 through the termination of activities on January 5, 1973 are summarized. This report discusses the nuclear reactor test configurations (non cold flow) along with the nuclear furnace demonstrated during this time frame. Included in the report are brief descriptions of the propulsion systems, test objectives, accomplishments, technical issues, and relevant test results for the various reactor tests. Additionally, this document is specifically aimed at reporting performance data and their relationship to fuel element development with little or no emphasis on other (important) items.
The Effects of Similarity on High-Level Visual Working Memory Processing.

PubMed

Yang, Li; Mo, Lei

2017-01-01

Similarity has been observed to have opposite effects on visual working memory (VWM) for complex images. How can these discrepant results be reconciled? To answer this question, we used a change-detection paradigm to test visual working memory performance for multiple real-world objects. We found that working memory for moderate similarity items was worse than that for either high or low similarity items. This pattern was unaffected by manipulations of stimulus type (faces vs. scenes), encoding duration (limited vs. self-paced), and presentation format (simultaneous vs. sequential). We also found that the similarity effects differed in strength in different categories (scenes vs. faces). These results suggest that complex real-world objects are represented using a centre-surround inhibition organization . These results support the category-specific cortical resource theory and further suggest that centre-surround inhibition organization may differ by category.
Age-Related Differences in Recognition Memory for Items and Associations: Contribution of Individual Differences in Working Memory and Metamemory

PubMed Central

Bender, Andrew R.; Raz, Naftali

2012-01-01

Ability to form new associations between unrelated items is particularly sensitive to aging, but the reasons for such differential vulnerability are unclear. In this study, we examined the role of objective and subjective factors (working memory and beliefs about memory strategies) on differential relations of age with recognition of items and associations. Healthy adults (N = 100, age 21 to 79) studied word pairs, completed item and association recognition tests, and rated the effectiveness of shallow (e.g., repetition) and deep (e.g., imagery or sentence generation) encoding strategies. Advanced age was associated with reduced working memory (WM) capacity and poorer associative recognition. In addition, reduced WM capacity, beliefs in the utility of ineffective encoding strategies, and lack of endorsement of effective ones were independently associated with impaired associative memory. Thus, maladaptive beliefs about memory in conjunction with reduced cognitive resources account in part for differences in associative memory commonly attributed to aging. PMID:22251381
[Investigation of the process of personal hygiene items biodegradation by cellulose-fermenting microorganisms].

PubMed

Il'in, V K; Starkov, L V; Kostrov, S V; Belikodvorskaia, G A; Chuvil'skaia, N A; Mukhamedieva, L N; Mikos, K N

2004-01-01

Cellulose-containing wastes are one of the heaviest and biggest ingredients of solid domestic wastes piling up during spaceflight. For the most part these are disposable personal hygiene items used in large quantities in the absence of shower. These wastes contain human body products which are very dangerous from the sanitary-epidemiological standpoint. The purpose was to explore potentiality of microbial biodegradation of cellulose-containing hygiene items anaerobically with dry mass transformation into liquid and biogas. Among specific objectives were test cultivation of active strains of reference cultures of cellulose-fermenting anaerobic thermophilic bacteria on hygiene items as the only source of carbon, evaluation of ways and need of pretreatment of gauze pads to stimulate biodegradation, and chemical analysis of resulting biogas. From the investigation it was concluded that gauze pads are susceptible to biodegradation by anaerobic bacteria producing a low toxicity gas fraction. Therefore, the proposed technology can be considered as a candidate for integration into the spacecrew life support system.
Item validity vs. item discrimination index: a redundancy?

NASA Astrophysics Data System (ADS)

Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

2018-03-01

In several literatures about evaluation and test analysis, it is common to find that there are calculations of item validity as well as item discrimination index (D) with different formula for each. Meanwhile, other resources said that item discrimination index could be obtained by calculating the correlation between the testee’s score in a particular item and the testee’s score on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses tend to include both item validity and item discrimination index in the instrument analysis. It seems that these concepts might overlap for both reflect the test quality on measuring the examinees’ ability. In this paper, examples of some results of data processing on item validity and item discrimination index were compared. It would be discussed whether item validity and item discrimination index can be represented by one of them only or it should be better to present both calculations for simple test analysis, especially in undergraduate theses where test analyses were included.
A Comparison of Three Types of Test Development Procedures Using Classical and Latent Trait Methods.

ERIC Educational Resources Information Center

Benson, Jeri; Wilson, Michael

Three methods of item selection were used to select sets of 38 items from a 50-item verbal analogies test and the resulting item sets were compared for internal consistency, standard errors of measurement, item difficulty, biserial item-test correlations, and relative efficiency. Three groups of 1,500 cases each were used for item selection. First…
Examining Differential Item Functions of Different Item Ordered Test Forms According to Item Difficulty Levels

ERIC Educational Resources Information Center

Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem

2016-01-01

The study aims to examine whether differential item function is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…
The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

ERIC Educational Resources Information Center

Sahin, Alper; Anil, Duygu

2017-01-01

This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test comprised of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…
78 FR 22283 - Notice of Intent To Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-15

.... History and Description of the Cultural Items In 1982, two cultural items were removed from the Mosquito... objects were collected from Mosquito Island. The human remains and 41 objects were described in a Notice... that the Mosquito Island Site was a Miccosukee campsite during the mid-20th century. Determinations...
[Perceptions on item disclosure for the Korean medical licensing examination].

PubMed

Yang, Eunbae B

2015-09-01

This study analyzed the perceptions of medical students and faculty regarding disclosure of test items on the Korean medical licensing examination. I conducted a survey of medical students from medical colleges and professional medical schools nationwide. Responses were analyzed from 718 participants as well as 69 faculty members who participated in creating the medical licensing examination item sets. Data were analyzed using descriptive statistics and the chi-square test. It is important to maintain test quality and to keep the test items unavailable to the public. There are also concerns among students that disclosure of test items would prompt increasing difficulty of test items (48.3%). Further, few students found it desirable to disclose test items regardless of any considerations (28.5%). The professors, who had experience in designing the test items, also expressed their opposition to test item disclosure (60.9%). It is desirable not to disclose the test items of the Korean medical licensing examination to the public on the condition that students are provided with a sufficient amount of information regarding the examination. This is so that the exam can appropriately identify candidates with the required qualifications.
Crowded and Sparse Domains in Object Recognition: Consequences for Categorization and Naming

ERIC Educational Resources Information Center

Gale, Tim M.; Laws, Keith R.; Foley, Kerry

2006-01-01

Some models of object recognition propose that items from structurally crowded categories (e.g., living things) permit faster access to superordinate semantic information than structurally dissimilar categories (e.g., nonliving things), but slower access to individual object information when naming items. We present four experiments that utilize…
The positive mental health instrument: development and validation of a culturally relevant scale in a multi-ethnic Asian population.

PubMed

Vaingankar, Janhavi Ajit; Subramaniam, Mythily; Chong, Siow Ann; Abdin, Edimansyah; Orlando Edelen, Maria; Picco, Louisa; Lim, Yee Wei; Phua, Mei Yen; Chua, Boon Yiang; Tee, Joseph Y S; Sherbourne, Cathy

2011-10-31

Instruments to measure mental health and well-being are largely developed and often used within Western populations and this compromises their validity in other cultures. A previous qualitative study in Singapore demonstrated the relevance of spiritual and religious practices to mental health, a dimension currently not included in exiting multi-dimensional measures. The objective of this study was to develop a self-administered measure that covers all key and culturally appropriate domains of mental health, which can be applied to compare levels of mental health across different age, gender and ethnic groups. We present the item reduction and validation of the Positive Mental Health (PMH) instrument in a community-based adult sample in Singapore. Surveys were conducted among adult (21-65 years) residents belonging to Chinese, Malay and Indian ethnicities. Exploratory and confirmatory factor analysis (EFA, CFA) were conducted and items were reduced using item response theory tests (IRT). The final version of the PMH instrument was tested for internal consistency and criterion validity. Items were tested for differential item functioning (DIF) to check if items functioned in the same way across all subgroups. EFA and CFA identified six first-order factor structure (General coping, Personal growth and autonomy, Spirituality, Interpersonal skills, Emotional support, and Global affect) under one higher-order dimension of Positive Mental Health (RMSEA=0.05, CFI=0.96, TLI=0.96). A 47-item self-administered multi-dimensional instrument with a six-point Likert response scale was constructed. The slope estimates and strength of the relation to the theta for all items in each six PMH subscales were high (range:1.39 to 5.69), suggesting good discrimination properties. The threshold estimates for the instrument ranged from -3.45 to 1.61 indicating that the instrument covers entire spectrums for the six dimensions. The instrument demonstrated high internal consistency and had significant and expected correlations with other well-being measures. Results confirmed absence of DIF. The PMH instrument is a reliable and valid instrument that can be used to measure and compare level of mental health across different age, gender and ethnic groups in Singapore.
The medial temporal lobes distinguish between within-item and item-context relations during autobiographical memory retrieval.

PubMed

Sheldon, Signy; Levine, Brian

2015-12-01

During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.
The cross-cultural adaptation of the DASH questionnaire in Thai (DASH-TH).

PubMed

Tongprasert, Siam; Rapipong, Jeeranan; Buntragulpoontawee, Montana

2014-01-01

Clinical measurement. Currently there are no self-report questionnaires in Thai to evaluate disability levels in patients suffering from upper extremity musculoskeletal disorders. To translate and cross-cultural adaptation the disabilities of the arm, shoulder and hand (DASH) questionnaire to Thai version and to evaluate content validity, construct validity and internal consistency of the questionnaire. The DASH-TH was produced by following cross-cultural adaptation guidelines stated by the Institute for Work and Health (IWH). Forty Thai patients with arm, shoulder or hand problems participated in field testing of the questionnaire. Content validity was determined by obtaining the item-objective congruence (IOC) value for each questionnaire item. Correlation between the DASH-TH score and numeric rating scale was used to assess construct validity. Internal consistency of DASH-TH was measured using Cronbach's alpha coefficient. Forty patients (14 males, 26 females) with arm, shoulder or hand problems enrolled in the present study. The average age of patients was 44.8 years. The index of item-objective congruence (IOC) of each item ranged from 0.7 to 1.0. The Cronbach's alpha coefficient of the questionnaire was 0.938. There was no correlation between DASH-TH score and numeric rating scale. The DASH-TH has high content validity and internal consistency. N/A. Copyright © 2014 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
The contributions of visual and central attention to visual working memory.

PubMed

Souza, Alessandra S; Oberauer, Klaus

2017-10-01

We investigated the role of two kinds of attention-visual and central attention-for the maintenance of visual representations in working memory (WM). In Experiment 1 we directed attention to individual items in WM by presenting cues during the retention interval of a continuous delayed-estimation task, and instructing participants to think of the cued items. Attending to items improved recall commensurate with the frequency with which items were attended (0, 1, or 2 times). Experiments 1 and 3 further tested which kind of attention-visual or central-was involved in WM maintenance. We assessed the dual-task costs of two types of distractor tasks, one tapping sustained visual attention and one tapping central attention. Only the central attention task yielded substantial dual-task costs, implying that central attention substantially contributes to maintenance of visual information in WM. Experiment 2 confirmed that the visual-attention distractor task was demanding enough to disrupt performance in a task relying on visual attention. We combined the visual-attention and the central-attention distractor tasks with a multiple object tracking (MOT) task. Distracting visual attention, but not central attention, impaired MOT performance. Jointly, the three experiments provide a double dissociation between visual and central attention, and between visual WM and visual object tracking: Whereas tracking multiple targets across the visual filed depends on visual attention, visual WM depends mostly on central attention.
A secondstep in development of a checklist for screening risk for violence in acute psychiatric patients: evaluation of interrater reliability of the Preliminary Scheme 33.

PubMed

Bjørkly, Stål; Moger, Tron A

2007-12-01

The Acute Project is a research project conducted on acute psychiatric admission wards in Norway. The objective is to develop and validate a structured, easy-to-use screening checklist for assessment of risk for violence in patients both during their stay in the ward and after discharge. The Preliminary Scheme 33 is a 33-item screening checklist with content domain inspired by the Historical-Clinical-Risk Management Scheme (HCR-20), the Brøset Violence Checklist, and eight risk factors extracted from the literature on risk assessment. The Preliminary Scheme 33 was designed and tested in two steps by a research group which includes the authors. The common aim of both steps was to develop this into a time economical, reliable, and valid checklist. In the first step in 2006 the predictive validity of the individual items was tested. The present work presents results from the second step, a study conducted to assess the interrater reliability of the 33 items. Eight clinicians working in an acute psychiatric unit volunteered to be raters and were trained to score the 33 items on a three-point scale in relation to 15 clinical vignettes, which contained information from 15 acute psychiatric patients' files. Analysis showed high interrater reliability for the total score with an intraclass correlation coefficient (ICC) of .86 (95% CI: 0.74-0.94). However, a substantial proportion of the items had medium to low ICCs. Consequences of this finding for further development of these items into a brief screen are discussed.
Attitude of dental hygienists, general practitioners and periodontists towards preventive oral care: an exploratory study.

PubMed

Thevissen, Eric; De Bruyn, Hugo; Colman, Roos; Koole, Sebastiaan

2017-08-01

Promoting oral hygiene and stimulating patient's responsibility for his/her personal health remain challenging objectives. The presence of dental hygienists has led to delegation of preventive tasks. However, in some countries, such as Belgium, this profession is not yet legalized. The aim of this exploratory study was to compare the attitude towards oral-hygiene instructions and patient motivational actions by dental hygienists and by general practitioners/periodontists in a context without dental hygienists. A questionnaire on demographics (six items), oral-hygiene instructions (eight items) and patient motivational actions (six items) was distributed to 241 Dutch dental hygienists, 692 general practitioners and 32 periodontists in Flanders/Belgium. Statistical analysis included Fisher's exact-test, Pearson's chi-square test and multiple (multinomial) logistic regression analysis to observe the influence of profession, age, workload, practice area and chair-assistance. Significant variance was found between general practitioners and dental hygienists (in 13 of 14 items), between general practitioners and periodontists (in nine of 14 items) and between dental hygienists and periodontists (in five of 14 items). In addition to qualification, chair-assistance was also identified as affecting the attitude towards preventive oral care. The present study identified divergence in the application of, and experienced barriers and opinions about, oral-hygiene instructions and patient motivational actions between dental hygienists and general practitioners/periodontists in a context without dental hygienists. In response to the barriers reported it is suggested that preventive oriented care may benefit from the deployment of dental hygienists to increase access to qualified preventive oral care. © 2017 FDI World Dental Federation.
Using Compressed Speech to Measure Simultaneous Processing in Persons with and without Visual Impairment

ERIC Educational Resources Information Center

Marks, William J.; Jones, W. Paul; Loe, Scott A.

2013-01-01

This study investigated the use of compressed speech as a modality for assessment of the simultaneous processing function for participants with visual impairment. A 24-item compressed speech test was created using a sound editing program to randomly remove sound elements from aural stimuli, holding pitch constant, with the objective to emulate the…
Effects of Text Illustration on Children's Learning of a School Science Topic.

ERIC Educational Resources Information Center

Reid, D. J.; Beveridge, M.

1986-01-01

This study of 272 13-year-old science students in England focuses on the effect of varied text and picture content on learning. A criterion-referenced objective items test was used to measure the effect of pictures on students of varying abilities and compare the effectiveness of traditional worksheet presentation and microcomputer presentation.…

The Brief Impairment Scale (Bis): A Multidimensional Scale of Functional Impairment for Children and Adolescents.

ERIC Educational Resources Information Center

Bird, Hector R.; Canino, Glorisa J.; Davies, Mark; Ramirez, Rafael; Chavez, Ligia; Duarte, Cristiane; Shen, Sa

2005-01-01

Objective: This article provides the results of the psychometric testing of the Brief Impairment Scale (BIS). The BIS is a 23-item instrument that evaluates three domains of functioning: interpersonal relations, school/work functioning, and self-care/self-fulfilment. It capitalizes on the strengths of existing global measures while addressing some…
Validity Study in Multidimensional Latent Space and Efficient Computerized Adaptive Testing. Final Report.

ERIC Educational Resources Information Center

Samejima, Fumiko

This paper is the final report of a multi-year project sponsored by the Office of Naval Research (ONR) in 1987 through 1990. The main objectives of the research summarized were to: investigate the non-parametric approach to the estimation of the operating characteristics of discrete item responses; revise and strengthen the package computer…
Self-Efficacy Scale for Weight Loss among Multi-Ethnic Women of Lower Income: A Psychometric Evaluation

ERIC Educational Resources Information Center

Latimer, Lara; Walker, Lorraine O.; Kim, Sunghun; Pasch, Keryn E.; Sterling, Bobbie Sue

2011-01-01

Objective: This study examined test-retest reliability, internal consistency, and construct and predictive validity of the Physical Activity and Nutrition Self-Efficacy (PANSE) scale, an 11-item instrument to assess weight-loss self-efficacy among postpartum women of lower income. Methods: Seventy-one women completed the PANSE scale and…
The Item-Specific Deficit Approach to evaluating verbal memory dysfunction: rationale, psychometrics, and application.

PubMed

Wright, Matthew J; Woo, Ellen; Schmitter-Edgecombe, Maureen; Hinkin, Charles H; Miller, Eric N; Gooding, Amanda L

2009-10-01

In the current study, we introduce the Item-Specific Deficit Approach (ISDA), a novel method for characterizing memory process deficits in list-learning data. To meet this objective, we applied the ISDA to California Verbal Learning Test (CVLT) data collected from a sample of 132 participants (53 healthy participants and 79 neurologically compromised participants). Overall, the ISDA indices measuring encoding, consolidation, and retrieval deficits demonstrated advantages over some traditional indices and indicated acceptable reliability and validity. Currently, the ISDA is intended for experimental use, although further research may support its utility for characterizing memory impairments in clinical assessments.
A Review of Classical Methods of Item Analysis.

ERIC Educational Resources Information Center

French, Christine L.

Item analysis is a very important consideration in the test development process. It is a statistical procedure to analyze test items that combines methods used to evaluate the important characteristics of test items, such as difficulty, discrimination, and distractibility of the items in a test. This paper reviews some of the classical methods for…
Modeling Item-Position Effects within an IRT Framework

ERIC Educational Resources Information Center

Debeer, Dries; Janssen, Rianne

2013-01-01

Changing the order of items between alternate test forms to prevent copying and to enhance test security is a common practice in achievement testing. However, these changes in item order may affect item and test characteristics. Several procedures have been proposed for studying these item-order effects. The present study explores the use of…
ACER Chemistry Test Item Collection. ACER Chemtic Year 12.

ERIC Educational Resources Information Center

Australian Council for Educational Research, Hawthorn.

The chemistry test item banks contains 225 multiple-choice questions suitable for diagnostic and achievement testing; a three-page teacher's guide; answer key with item facilities; an answer sheet; and a 45-item sample achievement test. Although written for the new grade 12 chemistry course in Victoria, Australia, the items are widely applicable.…
A short-form version of the Boston Naming Test for language screening in dementia in a bilingual rural community in Galicia (Spain).

PubMed

Nebreda, M C; García-Caballero, A; Asensio, E; Revilla, P; Rodriguez-Girondo, M; Mateos, R

2011-04-01

Aphasia, one of the core symptoms of cortical dementia, is routinely evaluated using graded naming tests like the Boston Naming Test (BNT). However, the application of this 60-item test is time-consuming and shortened versions have been devised for screening. The hypothesis of this research is that a specifically designed shortened version of the BNT could replace the original 60-item BNT as part of a mini-battery for screening for dementia. The objective of this study was to design a short version of the BNT for a rural population in Galicia (Spain). A clinic group of 102 patients including 43 with dementia was recruited along with 78 healthy volunteers. The clinic and control groups were scored on the Spanish version of the Mini-mental State Examination (MMSE) and BNT. In addition, the clinic group was tested with standard neuropsychological instruments and underwent brain investigations and routine neurological examination. BNT items with specificity and sensitivity above 0.5 were selected to compose a short battery of 11 pictures named BNTOu11. ANOVA and mean comparisons were made for MMSE and BNT versions. Receiver operating characteristics (ROC) curves and internal consistency were calculated. Areas under ROC curves (AUC) did not show statistically significant differences; therefore BNTOu11's AUC (0.814) was similar to the 60-item BNT versions (0.785 and 0.779), to the short versions from Argentina (0.772) and Andalusia (0.799) and to the Spanish MMSE (0.866). BNTOu11 had higher internal consistency than the other short versions. BNTOu11 is a useful and time-saving method as part of a battery for screening for dementia in a psychogeriatric outpatient unit.
Efficacy of the alcohol use disorders identification test as a screening tool for hazardous alcohol intake and related disorders in primary care: a validity study.

PubMed Central

Piccinelli, M.; Tessari, E.; Bortolomasi, M.; Piasere, O.; Semenzin, M.; Garzotto, N.; Tansella, M.

1997-01-01

OBJECTIVE: To determine the properties of the alcohol use disorders identification test in screening primary care attenders for alcohol problems. DESIGN: A validity study among consecutive primary care attenders aged 18-65 years. Every third subject completed the alcohol use disorders identification test (a 10 item self report questionnaire on alcohol intake and related problems) and was interviewed by an investigator with the composite international diagnostic interview alcohol use module (a standardised interview for the independent assessment of alcohol intake and related disorders). SETTING: 10 primary care clinics in Verona, north eastern Italy. PATIENTS: 500 subjects were approached and 482 (96.4%) completed evaluation. RESULTS: When the alcohol use disorders identification test was used to detect subjects with alcohol problems the area under the receiver operating characteristic curve was 0.95. The cut off score of 5 was associated with a sensitivity of 0.84, a specificity of 0.90, and a positive predictive value of 0.60. The screening ability of the total score derived from summing the responses to the five items minimising the probability of misclassification between subjects with and without alcohol problems provided an area under the receiver operating characteristic curve of 0.93. A score of 5 or more on the five items was associated with a sensitivity of 0.79, a specificity of 0.95, and a positive predictive value of 0.73. CONCLUSIONS: The alcohol use disorders identification test performs well in detecting subjects with formal alcohol disorders and those with hazardous alcohol intake. Using five of the 10 items on the questionnaire gives reasonable accuracy, and these are recommended as questions of choice to screen patients for alcohol problems. PMID:9040389
Negative Symptom Dimensions of the Positive and Negative Syndrome Scale Across Geographical Regions: Implications for Social, Linguistic, and Cultural Consistency.

PubMed

Khan, Anzalee; Liharska, Lora; Harvey, Philip D; Atkins, Alexandra; Ulshen, Daniel; Keefe, Richard S E

2017-12-01

Objective: Recognizing the discrete dimensions that underlie negative symptoms in schizophrenia and how these dimensions are understood across localities might result in better understanding and treatment of these symptoms. To this end, the objectives of this study were to 1) identify the Positive and Negative Syndrome Scale negative symptom dimensions of expressive deficits and experiential deficits and 2) analyze performance on these dimensions over 15 geographical regions to determine whether the items defining them manifest similar reliability across these regions. Design: Data were obtained for the baseline Positive and Negative Syndrome Scale visits of 6,889 subjects across 15 geographical regions. Using confirmatory factor analysis, we examined whether a two-factor negative symptom structure that is found in schizophrenia (experiential deficits and expressive deficits) would be replicated in our sample, and using differential item functioning, we tested the degree to which specific items from each negative symptom subfactor performed across geographical regions in comparison with the United States. Results: The two-factor negative symptom solution was replicated in this sample. Most geographical regions showed moderate-to-large differential item functioning for Positive and Negative Syndrome Scale expressive deficit items, especially N3 Poor Rapport, as compared with Positive and Negative Syndrome Scale experiential deficit items, showing that these items might be interpreted or scored differently in different regions. Across countries, except for India, the differential item functioning values did not favor raters in the United States. Conclusion: These results suggest that the Positive and Negative Syndrome Scale negative symptom factor can be better represented by a two-factor model than by a single-factor model. Additionally, the results show significant differences in responses to items representing the Positive and Negative Syndrome Scale expressive factors, but not the experiential factors, across regions. This could be due to a lack of equivalence between the original and translated versions, cultural differences with the interpretation of items, dissimilarities in rater training, or diversity in the understanding of scoring anchors. Knowing which items are challenging for raters across regions can help to guide Positive and Negative Syndrome Scale training and improve the results of international clinical trials aimed at negative symptoms.
Science Library of Test Items. Volume Nineteen. A Collection of Multiple Choice Test Items Relating Mainly to Geology.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Seventeen. A Collection of Multiple Choice Test Items Relating Mainly to Biology.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Eighteen. A Collection of Multiple Choice Test Items Relating Mainly to Chemistry.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Bias effects in the possible/impossible object decision test with matching objects.

PubMed

Soldan, Anja; Hilton, H John; Stern, Yaakov

2009-03-01

In the possible/impossible object decision test, priming has consistently been found for structurally possible, but not impossible, objects, leading Schacter, Cooper, and Delaney (1990) to suggest that priming relies on a system that represents the global 3-D structure of objects. Using a modified design with matching objects to control for the influence of episodic memory, Ratcliff and McKoon (1995) and Williams and Tarr (1997) found negative priming for impossible objects (i.e., lower performance for old than for new items). Both teams argued that priming derives from (1) episodic memory for object features and (2) bias to respond "possible" to encoded objects or their possible parts. The present study applied the matched-objects design to the original Schacter and Cooper stimuli-same possible objects and matching impossible figures-with minimal procedural variation. The data from Experiment 1 only partially supported the bias models and suggested that priming was mediated by both local and global structural descriptions. Experiment 2 showed that negative priming for impossible objects derived from the structural properties of these objects, not from the influence of episodic memory on task performance. Supplemental materials for this study may be downloaded from mc.psychonomic-journals.org/content/supplemental.
Neural Architecture for Feature Binding in Visual Working Memory.

PubMed

Schneegans, Sebastian; Bays, Paul M

2017-04-05

Binding refers to the operation that groups different features together into objects. We propose a neural architecture for feature binding in visual working memory that employs populations of neurons with conjunction responses. We tested this model using cued recall tasks, in which subjects had to memorize object arrays composed of simple visual features (color, orientation, and location). After a brief delay, one feature of one item was given as a cue, and the observer had to report, on a continuous scale, one or two other features of the cued item. Binding failure in this task is associated with swap errors, in which observers report an item other than the one indicated by the cue. We observed that the probability of swapping two items strongly correlated with the items' similarity in the cue feature dimension, and found a strong correlation between swap errors occurring in spatial and nonspatial report. The neural model explains both swap errors and response variability as results of decoding noisy neural activity, and can account for the behavioral results in quantitative detail. We then used the model to compare alternative mechanisms for binding nonspatial features. We found the behavioral results fully consistent with a model in which nonspatial features are bound exclusively via their shared location, with no indication of direct binding between color and orientation. These results provide evidence for a special role of location in feature binding, and the model explains how this special role could be realized in the neural system. SIGNIFICANCE STATEMENT The problem of feature binding is of central importance in understanding the mechanisms of working memory. How do we remember not only that we saw a red and a round object, but that these features belong together to a single object rather than to different objects in our environment? Here we present evidence for a neural mechanism for feature binding in working memory, based on encoding of visual information by neurons that respond to the conjunction of features. We find clear evidence that nonspatial features are bound via space: we memorize directly where a color or an orientation appeared, but we memorize which color belonged with which orientation only indirectly by virtue of their shared location. Copyright © 2017 Schneegans and Bays.
Positional priming of visual pop-out search is supported by multiple spatial reference frames

PubMed Central

Gokce, Ahu; Müller, Hermann J.; Geyer, Thomas

2015-01-01

The present study investigates the representations(s) underlying positional priming of visual ‘pop-out’ search (Maljkovic and Nakayama, 1996). Three search items (one target and two distractors) were presented at different locations, in invariant (Experiment 1) or random (Experiment 2) cross-trial sequences. By these manipulations it was possible to disentangle retinotopic, spatiotopic, and object-centered priming representations. Two forms of priming were tested: target location facilitation (i.e., faster reaction times – RTs– when the trial n target is presented at a trial n-1 target relative to n-1 blank location) and distractor location inhibition (i.e., slower RTs for n targets presented at n-1 distractor compared to n-1 blank locations). It was found that target locations were coded in positional short-term memory with reference to both spatiotopic and object-centered representations (Experiment 1 vs. 2). In contrast, distractor locations were maintained in an object-centered reference frame (Experiments 1 and 2). We put forward the idea that the uncertainty induced by the experiment manipulation (predictable versus random cross-trial item displacements) modulates the transition from object- to space-based representations in cross-trial memory for target positions. PMID:26136718
Analysis of the psychometric properties of the Multiple Sclerosis Impact Scale-29 (MSIS-29) in relapsing–remitting multiple sclerosis using classical and modern test theory

PubMed Central

Wyrwich, KW; Phillips, GA; Vollmer, T; Guo, S

2016-01-01

Background Investigations using classical test theory support the psychometric properties of the original version of the Multiple Sclerosis Impact Scale (MSIS-29v1), a disease-specific measure of multiple sclerosis (MS) impact (physical and psychological subscales). Later, assessments of the MSIS-29v1 in an MS community-based sample using Rasch analysis led to revisions of the instrument’s response options (MSIS-29v2). Objective The objective of this paper is to evaluate the psychometric properties of the MSIS-29v1 in a clinical trial cohort of relapsing–remitting MS patients (RRMS). Methods Data from 600 patients with RRMS enrolled in the SELECT clinical trial were used. Assessments were performed at baseline and at Weeks 12, 24, and 52. In addition to traditional psychometric analyses, Item Response Theory (IRT) and Rasch analysis were used to evaluate the measurement properties of the MSIS-29v1. Results Both MSIS-29v1 subscales demonstrated strong reliability, construct validity, and responsiveness. The IRT and Rasch analysis showed overall support for response category threshold ordering, person-item fit, and item fit for both subscales. Conclusions Both MSIS-29v1 subscales demonstrated robust measurement properties using classical, IRT, and Rasch techniques. Unlike previous research using a community-based sample, the MSIS-29v1 was found to be psychometrically sound to assess physical and psychological impairments in a clinical trial sample of patients with RRMS. PMID:28607741
Has the butcher on the bus dyed his hair? When color changes modulate ERP correlates of familiarity and recollection.

PubMed

Groh-Bordin, Christian; Zimmer, Hubert D; Ecker, Ullrich K H

2006-10-01

Recognition memory is usually thought of as comprising two distinct memory processes, namely familiarity and recollection. This distinction is reflected in specific event-related potential (ERP) components associated with both subprocesses. A mid-frontal attenuated negativity for correctly recognized old items relative to new ones around 400 ms has been typically linked to familiarity, whereas a parietally accentuated, more pronounced positivity for old items from 500 to 800 ms has been connected with recollection. Recently, this classification has been challenged by relating the mid-frontal old/new effect to conceptual priming mechanisms. Moreover, the perceptual sensitivity of both old/new effects is still under debate. The present study used a recognition memory task for visual objects and nonsense figures in order to investigate the functional significance of both ERP old/new effects. With respect to study presentation, all items were either presented in a perceptually identical or a color-modified version at test. Old nonsense figures, despite being meaningless, elicited a reliable mid-frontal old/new effect, thereby strongly suggesting a close relationship to familiarity processes rather than conceptual priming. Additionally, both the mid-frontal and the parietal old/new effect for real objects were graded with respect to the perceptual similarity between study and test. We argue that not only recollection, but also familiarity processes can provide information about perceptual atttributes, which is used in the course of recognition memory decisions.
Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests

ERIC Educational Resources Information Center

van der Linden, Wim J.; Ariel, Adelaide; Veldkamp, Bernard P.

2006-01-01

Test-item writing efforts typically results in item pools with an undesirable correlational structure between the content attributes of the items and their statistical information. If such pools are used in computerized adaptive testing (CAT), the algorithm may be forced to select items with less than optimal information, that violate the content…
Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

ERIC Educational Resources Information Center

Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi

2016-01-01

High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…

The zebrafish world of colors and shapes: preference and discrimination.

PubMed

Oliveira, Jessica; Silveira, Mayara; Chacon, Diana; Luchiari, Ana

2015-04-01

Natural environment imposes many challenges to animals, which have to use cognitive abilities to cope with and exploit it to enhance their fitness. Since zebrafish is a well-established model for cognitive studies and high-throughput screening for drugs and diseases that affect cognition, we tested their ability for ambient color preference and 3D objects discrimination to establish a protocol for memory evaluation. For the color preference test, zebrafish were observed in a multiple-chamber tank with different environmental color options. Zebrafish showed preference for blue and green, and avoided yellow and red. For the 3D objects discrimination, zebrafish were allowed to explore two equal objects and then observed in a one-trial test in which a new color, size, or shape of the object was presented. Zebrafish showed discrimination for color, shape, and color+shape combined, but not size. These results imply that zebrafish seem to use some categorical system to discriminate items, and distracters affect their ability for discrimination. The type of variables available (color and shape) may favor zebrafish objects perception and facilitate discrimination processing. We suggest that this easy and simple memory test could serve as a useful screening tool for cognitive dysfunction and neurotoxicological studies.
Criterion-Referenced Test Items for Welding.

ERIC Educational Resources Information Center

Davis, Diane, Ed.

This test item bank on welding contains test questions based upon competencies found in the Missouri Welding Competency Profile. Some test items are keyed for multiple competencies. These criterion-referenced test items are designed to work with the Vocational Instructional Management System. Questions have been statistically sampled and validated…
Decomposing the interaction between retention interval and study/test practice: The role of retrievability

PubMed Central

Jang, Yoonhee; Wixted, John T.; Pecher, Diane; Zeelenberg, René; Huber, David E.

2012-01-01

Even without feedback, test practice enhances delayed performance compared to study practice, but the size of the effect is variable across studies. We investigated the benefit of testing, separating initially retrievable items from initially non-retrievable items. In two experiments, an initial test determined item retrievability. Retrievable or non-retrievable items were subsequently presented for repeated study or test practice. Collapsing across items, in Experiment 1, we obtained the typical crossover interaction between retention interval and practice type. For retrievable items, however, the crossover interaction was quantitatively different, with a small study benefit for an immediate test and a larger testing benefit after a delay. For non-retrievable items, there was a large study benefit for an immediate test, but one week later there was no difference between the study and test practice conditions. In Experiment 2, initially non-retrievable items were given additional study followed by either an immediate test or even more additional study, and one week later performance did not differ between the two conditions. These results indicate that the effect size of study/test practice is due to the relative contribution of retrievable and non-retrievable items. PMID:22304454
Decomposing the interaction between retention interval and study/test practice: the role of retrievability.

PubMed

Jang, Yoonhee; Wixted, John T; Pecher, Diane; Zeelenberg, René; Huber, David E

2012-01-01

Even without feedback, test practice enhances delayed performance compared to study practice, but the size of the effect is variable across studies. We investigated the benefit of testing, separating initially retrievable items from initially nonretrievable items. In two experiments, an initial test determined item retrievability. Retrievable or nonretrievable items were subsequently presented for repeated study or test practice. Collapsing across items, in Experiment 1, we obtained the typical cross-over interaction between retention interval and practice type. For retrievable items, however, the cross-over interaction was quantitatively different, with a small study benefit for an immediate test and a larger testing benefit after a delay. For nonretrievable items, there was a large study benefit for an immediate test, but one week later there was no difference between the study and test practice conditions. In Experiment 2, initially nonretrievable items were given additional study followed by either an immediate test or even more additional study, and one week later performance did not differ between the two conditions. These results indicate that the effect size of study/test practice is due to the relative contribution of retrievable and nonretrievable items.
Optimal Test Design with Rule-Based Item Generation

ERIC Educational Resources Information Center

Geerlings, Hanneke; van der Linden, Wim J.; Glas, Cees A. W.

2013-01-01

Optimal test-design methods are applied to rule-based item generation. Three different cases of automated test design are presented: (a) test assembly from a pool of pregenerated, calibrated items; (b) test generation on the fly from a pool of calibrated item families; and (c) test generation on the fly directly from calibrated features defining…
Normative data for Chinese compound remote associate problems.

PubMed

Wu, Ching-Lin; Chen, Hsueh-Chih

2017-12-01

The Remote Associates Test (RAT) is a well-known measure of creativity, with each item on the RAT is composed of three unrelated stimulus words. The participant's task is to find an answer in the form of a word that could combine with each of the stimulus words, thus forming three new actual nouns. Researchers have modified the RAT to develop compound remote associate problems that emphasize combining vocabulary to form compound words. In the field of creativity research for Mandarin speakers, the Chinese RAT has been widely applied for over 10 years. The original RAT, compound remote associate problems, and Chinese RAT have various common advantages, such as being convenient to use and having objective scoring; additionally, the development of items for certain tests is easy and satisfies the requirements of psychological assessments in terms of the quantity of items. Currently, many language editions of the RAT and compound remote associate problems already exist. In particular, the English and Italian versions of these tests already have derived normative data. Because approximately 20% of the world's population are native Mandarin speakers, and because increasing numbers of people are choosing Mandarin as a second language, the need to increase Mandarin-language resources is growing; however, normative data for the Chinese RAT still do not exist. To address this issue, in the present study we developed Chinese compound remote associate problems and analyzed the passing rates by items, problem solving times, and various normative data, using the responses of 253 subjects in three experiments.
Development of an Instrument to Measure Pharmacy Student Attitudes Toward Social Media Professionalism

PubMed Central

Spivey, Christina A.; Jaeger, Melanie C.; Williams, Jennifer; George, Christa

2017-01-01

Objectives. To develop and validate a scale measuring pharmacy students’ attitudes toward social media professionalism, and assess the impact of an educational presentation on social media professionalism. Methods. A social media professionalism scale was used in a pre- and post-survey to determine the effects of a social media professionalism presentation. The 26-item scale was administered to 197 first-year pharmacy (P1) students during orientation. Exploratory factor analysis was applied to determine the number of underlying factors responsible for covariation of the data. Principal components analysis was used as the extraction method. Varimax was selected as the rotation method. Cronbach’s alpha was estimated. Wilcoxon signed rank test was used to compare pre- and post-scores of each item, subscale, and total scale. Results. There were 187 (95%) students who participated. The final scale had five subscales and 15 items. Subscales were named according to the professionalism tenet they best represented. Scores of items addressing reading/posting to social media during class, an employer’s use of social media when making hiring decisions, and a college/university’s use of social media as a measure of professional conduct significantly increased from pre-test to post-test. The “honesty and integrity” subscale score also significantly increased. Conclusion. The social media professionalism scale measures five tenets of professionalism and exhibits satisfactory reliability. The presentation improved P1 students’ attitudes regarding social media professionalism. PMID:28630506
The reliability and validity of the standardized Mensendieck test in relation to disability in patients with chronic pain.

PubMed

Keessen, Paul; Maaskant, Jolanda; Visser, Bart

2018-08-01

The standardized Mensendieck test (SMT) was developed to quantify posture, movement, gait, and respiration. In the hands of an experienced therapist, the SMT is proven to be a reliable tool. It is unclear whether posture, movement, gait, and respiration are related to the degree of functional disability in patients with chronic pain. The objective of this study was to assess the reliability and convergent validity of the SMT in a heterogeneous sample of 50 patients with chronic pain. Internal consistency was determined by Cronbach's α and interrater reliability by the intraclass correlation coefficient (ICC). Convergent validity was assessed by determining the Spearman rank correlation coefficient between the movement quality measured in the SMT and functional limitation measured on the disability rating index (DRI). The internal consistency was Cronbach's α 0.91. Substantial reliability was found for the items: movement (ICC = 0.68), gait (ICC = 0.69), sitting posture (ICC = 0.63), and respiration (ICC = 0.64). Insufficient reliability was found for standing posture (ICC = 0.23). A moderate correlation was found between average test score SMT and the DRI (r = -0.37) and respiration and DRI (r = -0.45). The SMT is a reasonably reliable tool to assess movement, gait, sitting posture, and respiration. None of the items in the domain standing posture has sufficient reliability. A thorough study of this domain should be considered. The results show little evidence for convergent validity. Several items of the SMT correlated moderately with functional limitation with the DRI. These items were global movement, hip flexion, pelvis rotation, and all respiration items.
Questionnaire for low back pain in the garment industry workers

PubMed Central

Bindra, Supreet; Sinha, A. G. K.; Benjamin, A. I.

2013-01-01

Low back pain affects up to 90% of the world's population at some point in their lives. Until date no questionnaire has been designed for back pain in the garment industry workers. Therefore, the objective of this study is to design a questionnaire to determine the prevalence, risk factors, impact, health care service utilization and back pain features in the garment industry workers and gain preliminary experience of its use. The content validity and reliability of the questionnaire was established. Items showing acceptable internal consistency and moderate to high test re-test reliability were retained in the questionnaire. Items showing unacceptable internal consistency, low test re-test reliability or poor differentiation were reworded, redrafted and re-tested on the workers. It took 20 min to complete one interview schedule. Environmental factors such as the absence of the garment industry owner/supervisor or co-workers at the time of the interview and interview during leisure hours need to be standardized. Thus, final questionnaire is ready for use after necessary amendments and will be used on the larger sample size in the main study. PMID:24421591
Questionnaire for low back pain in the garment industry workers.

PubMed

Bindra, Supreet; Sinha, A G K; Benjamin, A I

2013-05-01

Low back pain affects up to 90% of the world's population at some point in their lives. Until date no questionnaire has been designed for back pain in the garment industry workers. Therefore, the objective of this study is to design a questionnaire to determine the prevalence, risk factors, impact, health care service utilization and back pain features in the garment industry workers and gain preliminary experience of its use. The content validity and reliability of the questionnaire was established. Items showing acceptable internal consistency and moderate to high test re-test reliability were retained in the questionnaire. Items showing unacceptable internal consistency, low test re-test reliability or poor differentiation were reworded, redrafted and re-tested on the workers. It took 20 min to complete one interview schedule. Environmental factors such as the absence of the garment industry owner/supervisor or co-workers at the time of the interview and interview during leisure hours need to be standardized. Thus, final questionnaire is ready for use after necessary amendments and will be used on the larger sample size in the main study.
Emergence of the benefits and costs of grouping for visual search.

PubMed

Wu, Rachel; McGee, Brianna; Rubenstein, Madelyn; Pruitt, Zoe; Cheung, Olivia S; Aslin, Richard N

2018-04-16

The present study investigated how grouping related items leads to the emergence of benefits (facilitation when related items are search targets) and costs (interference when related items are distractors) in visual search. Participants integrated different views (related items) of a novel Lego object via (a) assembling the object, (b) disassembling the object, or (c) sitting quietly without explicit instructions. An omnibus ANOVA revealed that neural responses (N2pc ERP) for attentional selection increased between pretest to posttest regardless of the training condition when a specific target view appeared (benefit) and when a nontarget view from the same object as the target view appeared (cost). Bonferroni-corrected planned comparisons revealed that assembling the object (but not disassembling the object or no training) had a significant impact from pretest to posttest, although the ANOVA did not reveal any interaction effects, suggesting that the effects might not differ across training conditions. This study is one of the first to demonstrate the emergence of the costs and benefits of grouping novel targets on visual search efficiency. © 2018 Society for Psychophysiological Research.
Science Library of Test Items. Volume Twenty. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 1.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Twenty-One. A Collection of Multiple Choice Test Items Relating Mainly to Physics, 2.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Science Library of Test Items. Volume Twenty-Two. A Collection of Multiple Choice Test Items Relating Mainly to Skills.

ERIC Educational Resources Information Center

New South Wales Dept. of Education, Sydney (Australia).

As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
Criterion-Referenced Test Items for Small Engines.

ERIC Educational Resources Information Center

Herd, Amon

This notebook contains criterion-referenced test items for testing students' knowledge of small engines. The test items are based upon competencies found in the Missouri Small Engine Competency Profile. The test item bank is organized in 18 sections that cover the following duties: shop procedures; tools and equipment; fasteners; servicing fuel…
An Investigation of the Impact of Guessing on Coefficient α and Reliability

PubMed Central

2014-01-01

Guessing is known to influence the test reliability of multiple-choice tests. Although there are many studies that have examined the impact of guessing, they used rather restrictive assumptions (e.g., parallel test assumptions, homogeneous inter-item correlations, homogeneous item difficulty, and homogeneous guessing levels across items) to evaluate the relation between guessing and test reliability. Based on the item response theory (IRT) framework, this study investigated the extent of the impact of guessing on reliability under more realistic conditions where item difficulty, item discrimination, and guessing levels actually vary across items with three different test lengths (TL). By accommodating multiple item characteristics simultaneously, this study also focused on examining interaction effects between guessing and other variables entered in the simulation to be more realistic. The simulation of the more realistic conditions and calculations of reliability and classical test theory (CTT) item statistics were facilitated by expressing CTT item statistics, coefficient α, and reliability in terms of IRT model parameters. In addition to the general negative impact of guessing on reliability, results showed interaction effects between TL and guessing and between guessing and test difficulty.
Validation of 5-item and 2-item questionnaires in Chinese version of Dizziness Handicap Inventory for screening objective benign paroxysmal positional vertigo.

PubMed

Chen, Wei; Shu, Liang; Wang, Qian; Pan, Hui; Wu, Jing; Fang, Jie; Sun, Xu-Hong; Zhai, Yu; Dong, You-Rong; Liu, Jian-Ren

2016-08-01

As possible candidate screening instruments for benign paroxysmal positional vertigo (BPPV), studies to validate the Dizziness Handicap Inventory (DHI) sub-scale (5-item and 2-item) and total scores are rare in China. From May 2014 to December 2014, 108(55 with and 53 without BPPV) patients complaining of episodic vertigo in the past week from a vertigo outpatient clinic were enrolled for DHI evaluation, as well as demographic and other clinical data. Objective BPPV was subsequently determined by positional evoking maneuvers under the record of optical Frenzel glasses. Cronbach's coefficient α was used to evaluate the reliability of psychometric scales. The validity of DHI total, 5-item and 2-item questionnaires to screen for BPPV was assessed by receiver operating characteristic (ROC) curves. It revealed that the DHI 5-item questionnaire had good internal consistency (Cronbach's coefficient α = 0.72). Area under the curve of total DHI, 5-item and 2-item scores for discriminating BPPV from those without was 0.678 (95 % CI 0.578-0.778), 0.873(95 % CI 0.807-0.940) and 0.895(95 % CI 0.836-0.953), respectively. It revealed 74.5 % sensitivity and 88.7 % specificity in separating BPPV and those without, with a cutoff value of 12 in the 5-item questionnaire. The corresponding rate of sensitivity and specificity was 78.2 and 88.7 %, respectively, with a cutoff value of 6 in 2-item questionnaire. The present study indicated that both 5-item and 2-item questionnaires in the Chinese version of DHI may be more valid than DHI total score for screening objective BPPV and merit further application in clinical practice in China.
Rasch validation of the Arabic version of the lower extremity functional scale.

PubMed

Alnahdi, Ali H

2018-02-01

The purpose of this study was to examine the internal construct validity of the Arabic version of the Lower Extremity Functional Scale (20-item Arabic LEFS) using Rasch analysis. Patients (n = 170) with lower extremity musculoskeletal dysfunction were recruited. Rasch analysis of 20-item Arabic LEFS was performed. Once the initial Rasch analysis indicated that the 20-item Arabic LEFS did not fit the Rasch model, follow-up analyses were conducted to improve the fit of the scale to the Rasch measurement model. These modifications included removing misfitting individuals, changing item scoring structure, removing misfitting items, addressing bias caused by response dependency between items and differential item functioning (DIF). Initial analysis indicated deviation of the 20-item Arabic LEFS from the Rasch model. Disordered thresholds in eight items and response dependency between six items were detected with the scale as a whole did not meet the requirement of unidimensionality. Refinements led to a 15-item Arabic LEFS that demonstrated excellent internal consistency (person separation index [PSI] = 0.92) and satisfied all the requirement of the Rasch model. Rasch analysis did not support the 20-item Arabic LEFS as a unidimensional measure of lower extremity function. The refined 15-item Arabic LEFS met all the requirement of the Rasch model and hence is a valid objective measure of lower extremity function. The Rasch-validated 15-item Arabic LEFS needs to be further tested in an independent sample to confirm its fit to the Rasch measurement model. Implications for Rehabilitation The validity of the 20-item Arabic Lower Extremity Functional Scale to measure lower extremity function is not supported. The 15-item Arabic version of the LEFS is a valid measure of lower extremity function and can be used to quantify lower extremity function in patients with lower extremity musculoskeletal disorders.
The Content Validity of a Chemotherapy-Induced Peripheral Neuropathy Patient-Reported Outcome Measure

PubMed Central

Lavoie Smith, Ellen M.; Haupt, Rylie; Kelly, James P.; Lee, Deborah; Kanzawa-Lee, Grace; Knoerl, Robert; Bridges, Celia; Alberti, Paola; Prasertsri, Nusara; Donohoe, Clare

2018-01-01

Purpose/Objectives To test the content validity of a 16-item version of the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire–Chemotherapy-Induced Peripheral Neuropathy (QLQ-CIPN20). Research Approach Cross-sectional, prospective, qualitative design. Setting Six outpatient oncology clinics within the University of Michigan Health System’s comprehensive cancer center in Ann Arbor. Participants 25 adults with multiple myeloma or breast, gynecologic, gastrointestinal, or head and neck malignancies experiencing peripheral neuropathy caused by neurotoxic chemotherapy. Methodologic Approach Cognitive interviewing methodology was used to evaluate the content validity of a 16-item version of the QLQ-CIPN20 instrument. Findings Minor changes were made to three questions to enhance readability. Twelve questions were revised to define unfamiliar terminology, clarify the location of neuropathy, and emphasize important aspects. One question was deleted because of clinical and conceptual redundancy with other items, as well as concerns regarding generalizability and social desirability. Interpretation Cognitive interviewing methodology revealed inconsistencies between patients’ understanding and researchers’ intent, along with points that required clarification to avoid misunderstanding. Implications for Nursing Patients’ interpretations of the instrument’s items were inconsistent with the intended meanings of the questions. One item was dropped and others were revised, resulting in greater consistency in how patients, clinicians, and researchers interpreted the items’ meanings and improving the instrument’s content validity. Following additional revision and psychometric testing, the QLQ-CIPN20 could evolve into a gold-standard CIPN patient-reported outcome measure. PMID:28820525
Identification of high school students' ability level of constructing free body diagrams to solve restricted and structured response items in force matter

NASA Astrophysics Data System (ADS)

Rahmaniar, Andinisa; Rusnayati, Heni; Sutiadi, Asep

2017-05-01

While solving physics problem particularly in force matter, it is needed to have the ability of constructing free body diagrams which can help students to analyse every force which acts on an object, the length of its vector and the naming of its force. Mix method was used to explain the result without any special treatment to participants. The participants were high school students in first grade totals 35 students. The purpose of this study is to identify students' ability level of constructing free body diagrams in solving restricted and structured response items. Considering of two types of test, every student would be classified into four levels ability of constructing free body diagrams which is every level has different characteristic and some students were interviewed while solving test in order to know how students solve the problem. The result showed students' ability of constructing free body diagrams on restricted response items about 34.86% included in no evidence of level, 24.11% inadequate level, 29.14% needs improvement level and 4.0% adequate level. On structured response items is about 16.59% included no evidence of level, 23.99% inadequate level, 36% needs improvement level, and 13.71% adequate level. Researcher found that students who constructed free body diagrams first and constructed free body diagrams correctly were more successful in solving restricted and structured response items.

In search of memory tests equivalent for experiments on animals and humans.

PubMed

Brodziak, Andrzej; Kołat, Estera; Różyk-Myrta, Alicja

2014-12-19

Older people often exhibit memory impairments. Contemporary demographic trends cause aging of the society. In this situation, it is important to conduct clinical trials of drugs and use training methods to improve memory capacity. Development of new memory tests requires experiments on animals and then clinical trials in humans. Therefore, we decided to review the assessment methods and search for tests that evaluate analogous cognitive processes in animals and humans. This review has enabled us to propose 2 pairs of tests of the efficiency of working memory capacity in animals and humans. We propose a basic set of methods for complex clinical trials of drugs and training methods to improve memory, consisting of 2 pairs of tests: 1) the Novel Object Recognition Test - Sternberg Item Recognition Test and 2) the Object-Location Test - Visuospatial Memory Test. We postulate that further investigations of methods that are equivalent in animals experiments and observations performed on humans are necessary.
Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

ERIC Educational Resources Information Center

Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

2016-01-01

Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
The New Jettison Policy for the International Space Station

NASA Technical Reports Server (NTRS)

Johnson, Nicholas L.

2006-01-01

During more than seven years of operations by the International Space Station (ISS), approximately three dozen pieces of debris were released and subsequently cataloged by the U.S. Space Surveillance Network (SSN). The individual mass of these objects ranged from less than 1 kg to 70 kg. Although some of these debris were separated from the ISS accidentally, some were intentionally cast-off, especially the larger items. In addition, small operational satellites are candidates for launch from the ISS, such as the TNS-O satellite deployed from ISS in March 2005. Recently an official ISS Jettison Policy was developed to ensure that decisions to deliberately release objects in the future were based upon a complete evaluation of the benefits and risks to the ISS, other resident space objects, and people on the Earth. The policy identifies four categories of items which might be considered for release: (1) items that pose a safety issue for return on-board a visiting vehicle, (2) items that negatively impact ISS utilization, return, or on-orbit stowage manifests, (3) items that represent an EVA timeline savings, and (4) items that are designed for jettison. Some of the principal issues to be addressed during this evaluation process are the potential for the object to recontact the ISS within the first two days after jettison, the potential of the object to breakup prior to reentry, the ability of the SSN to track the object, and the risk to people on Earth from components which might survive reentry. This paper summarizes the history of objects released from ISS, examines the specifics of the ISS jettison policy, and addresses the overall impact of ISS debris on the space environment.
Integrating Test-Form Formatting into Automated Test Assembly

ERIC Educational Resources Information Center

Diao, Qi; van der Linden, Wim J.

2013-01-01

Automated test assembly uses the methodology of mixed integer programming to select an optimal set of items from an item bank. Automated test-form generation uses the same methodology to optimally order the items and format the test form. From an optimization point of view, production of fully formatted test forms directly from the item pool using…
Instructional Topics in Educational Measurement (ITEMS) Module: Using Automated Processes to Generate Test Items

ERIC Educational Resources Information Center

Gierl, Mark J.; Lai, Hollis

2013-01-01

Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
Computerized Adaptive Testing Provides Reliable and Efficient Depression Measurement Using the CES-D Scale

PubMed Central

2017-01-01

Background The Center for Epidemiologic Studies Depression Scale (CES-D) is a measure of depressive symptomatology which is widely used internationally. Though previous attempts were made to shorten the CES-D scale, few have attempted to develop a Computerized Adaptive Test (CAT) version for the CES-D. Objective The aim of this study was to provide evidence on the efficiency and accuracy of the CES-D when administered using CAT using an American sample group. Methods We obtained a sample of 2060 responses to the CESD-D from US participants using the myPersonality application. The average age of participants was 26 years (range 19-77). We randomly split the sample into two groups to evaluate and validate the psychometric models. We used evaluation group data (n=1018) to assess dimensionality with both confirmatory factor and Mokken analysis. We conducted further psychometric assessments using item response theory (IRT), including assessments of item and scale fit to Samejima’s graded response model (GRM), local dependency and differential item functioning. We subsequently conducted two CAT simulations to evaluate the CES-D CAT using the validation group (n=1042). Results Initial CFA results indicated a poor fit to the model and Mokken analysis revealed 3 items which did not conform to the same dimension as the rest of the items. We removed the 3 items and fit the remaining 17 items to GRM. We found no evidence of differential item functioning (DIF) between age and gender groups. Estimates of the level of CES-D trait score provided by the simulated CAT algorithm and the original CES-D trait score derived from original scale were correlated highly. The second CAT simulation conducted using real participant data demonstrated higher precision at the higher levels of depression spectrum. Conclusions Depression assessments using the CES-D CAT can be more accurate and efficient than those made using the fixed-length assessment. PMID:28931496
Development and Validation of a Novel Generic Health-related Quality of Life Instrument With 20 Items (HINT-20)

PubMed Central

2017-01-01

Objectives Few attempts have been made to develop a generic health-related quality of life (HRQoL) instrument and to examine its validity and reliability in Korea. We aimed to do this in our present study. Methods After a literature review of existing generic HRQoL instruments, a focus group discussion, in-depth interviews, and expert consultations, we selected 30 tentative items for a new HRQoL measure. These items were evaluated by assessing their ceiling effects, difficulty, and redundancy in the first survey. To validate the HRQoL instrument that was developed, known-groups validity and convergent/discriminant validity were evaluated and its test-retest reliability was examined in the second survey. Results Of the 30 items originally assessed for the HRQoL instrument, four were excluded due to high ceiling effects and six were removed due to redundancy. We ultimately developed a HRQoL instrument with a reduced number of 20 items, known as the Health-related Quality of Life Instrument with 20 items (HINT-20), incorporating physical, mental, social, and positive health dimensions. The results of the HINT-20 for known-groups validity were poorer in women, the elderly, and those with a low income. For convergent/discriminant validity, the correlation coefficients of items (except vitality) in the physical health dimension with the physical component summary of the Short Form 36 version 2 (SF-36v2) were generally higher than the correlations of those items with the mental component summary of the SF-36v2, and vice versa. Regarding test-retest reliability, the intraclass correlation coefficient of the total HINT-20 score was 0.813 (p<0.001). Conclusions A novel generic HRQoL instrument, the HINT-20, was developed for the Korean general population and showed acceptable validity and reliability. PMID:28173686
Reducing calories, fat, saturated fat and sodium in restaurant menu items: Effects on consumer acceptance

PubMed Central

Patel, Anjali A.; Lopez, Nanette V.; Lawless, Harry T.; Njike, Valentine; Beleche, Mariana; Katz, David L.

2016-01-01

OBJECTIVE This study assessed consumer acceptance of reductions of calories, fat, saturated fat, and sodium to current restaurant recipes. METHODS Twenty-four menu items, from six restaurant chains, were slightly modified and moderately modified by reducing targeted ingredients. Restaurant customers (n=1,838) were recruited for a taste test and were blinded to the recipe version as well as the purpose of the study. Overall consumer acceptance was measured using a 9-point hedonic (like/dislike) scale, likelihood to purchase scale, Just-About-Right (JAR) 5-point scale, penalty analysis and alienation analysis. RESULTS Overall, modified recipes of 19 menu items were scored similar to (or better than) their respective current versions. Eleven menu items were found to be acceptable at the slightly modified recipe version and eight menu items were found to be acceptable at the moderately modified recipe version. Acceptable ingredient reductions resulted in a reduction of up to 26% in calories and a reduction of up to 31% in sodium per serving. CONCLUSIONS The majority of restaurant menu items with small reductions of calories, fat, saturated fat and sodium were acceptable. Given the frequency of eating foods away from home, these reductions could be effective in creating dietary improvements for restaurant diners. PMID:27891828
A Procedure To Detect Test Bias Present Simultaneously in Several Items.

ERIC Educational Resources Information Center

Shealy, Robin; Stout, William

A statistical procedure is presented that is designed to test for unidirectional test bias existing simultaneously in several items of an ability test, based on the assumption that test bias is incipient within the two groups' ability differences. The proposed procedure--Simultaneous Item Bias (SIB)--is based on a multidimensional item response…
An Item Response Theory Model for Test Bias.

ERIC Educational Resources Information Center

Shealy, Robin; Stout, William

This paper presents a conceptualization of test bias for standardized ability tests which is based on multidimensional, non-parametric, item response theory. An explanation of how individually-biased items can combine through a test score to produce test bias is provided. It is contended that bias, although expressed at the item level, should be…
77 FR 19699 - Notice of Intent to Repatriate Cultural Items: Rochester Museum & Science Center, Rochester, NY

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-02

... Indian tribe, has determined that the cultural items meet the definition of both sacred objects and... Rochester Museum & Science Center that meet the definition of both sacred objects and [[Page 19700
Development and validation of a brief screening instrument for psychosocial risk associated with genetic testing: a pan-Canadian cohort study

PubMed Central

Esplen, Mary Jane; Cappelli, Mario; Wong, Jiahui; Bottorff, Joan L; Hunter, Jon; Carroll, June; Dorval, Michel; Wilson, Brenda; Allanson, Judith; Semotiuk, Kara; Aronson, Melyssa; Bordeleau, Louise; Charlemagne, Nicole; Meschino, Wendy

2013-01-01

Objectives To develop a brief, reliable and valid instrument to screen psychosocial risk among those who are undergoing genetic testing for Adult-Onset Hereditary Disease (AOHD). Design A prospective two-phase cohort study. Setting 5 genetic testing centres for AOHD, such as cancer, Huntington's disease or haemochromatosis, in ambulatory clinics of tertiary hospitals across Canada. Participants 141 individuals undergoing genetic testing were approached and consented to the instrument development phase of the study (Phase I). The Genetic Psychosocial Risk Instrument (GPRI) developed in Phase I was tested in Phase II for item refinement and validation. A separate cohort of 722 individuals consented to the study, 712 completed the baseline package and 463 completed all follow-up assessments. Most participants were female, at the mid-life stage. Individuals in advanced stages of the illness or with cognitive impairment or a language barrier were excluded. Interventions Phase I: GPRI items were generated from (1) a review of the literature, (2) input from genetic counsellors and (3) phase I participants. Phase II: further item refinement and validation were conducted with a second cohort of participants who completed the GPRI at baseline and were followed for psychological distress 1-month postgenetic testing results. Primary and secondary outcome measures GPRI, Hamilton Depression Rating Scale (HAM-D), Hamilton Anxiety Rating Scale (HAM-A), Brief Symptom Inventory (BSI) and Impact of Event Scale (IES). Results The final 20-item GPRI had a high reliability—Cronbach's α at 0.81. The construct validity was supported by high correlations between GPRI and BSI and IES. The predictive value was demonstrated by a receiver operating characteristic curve of 0.78 plotting GPRI against follow-up assessments using HAM-D and HAM-A. Conclusions With a cut-off score of 50, GPRI identified 84% of participants who displayed distress postgenetic testing results, supporting its potential usefulness in a clinical setting. PMID:23485718
Using Reliability and Item Analysis to Evaluate a Teacher-Developed Test in Educational Measurement and Evaluation

ERIC Educational Resources Information Center

Quaigrain, Kennedy; Arhin, Ato Kwamina

2017-01-01

Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…
Measurement characteristics for two health-related quality of life measures in older adults: The SF-36 and the CDC Healthy Days items

PubMed Central

Barile, John P.; Horner-Johnson, Willi; Krahn, Gloria; Zack, Matthew; Miranda, David; DeMichele, Kimberly; Ford, Derek; Thompson, William W.

2017-01-01

Background The Short Form Health Survey (SF-36) and the Centers for Disease Control and Prevention (CDC) Healthy Days items are well known measures of health-related quality of life. The validity of the SF-36 for older adults and those with disabilities has been questioned. Objective Assess the extent to which the SF-36 and the Centers for Disease Control and Prevention (CDC) Healthy Days items measure the same aspects of health; whether the SF-36 and the CDC unhealthy days items are invariant across gender, functional status, or the presence of chronic health conditions of older adults; and whether each of the SF-36’s eight subscales is independently associated with the CDC Healthy Days items. Methods We analyzed data from 66,269 adult Medicare advantage members age 65 and older. We used confirmatory factor analyses and regression modeling to test associations between the CDC Healthy Days items and subscales of the SF-36. Results The CDC Healthy Days items were associated with the SF-36 global measures of physical and mental health. The CDC physically unhealthy days item was associated with the SF-36 subscales for bodily pain, physical role limitations, and general health, while the CDC mentally unhealthy days item was associated with the SF-36 subscales for mental health, emotional role limitations, vitality and social functioning. The SF-36 physical functioning subscale was not independently associated with either of the CDC Healthy Days items. Conclusions The CDC Healthy Days items measure similar domains as the SF-36 but appear to assess HRQOL without regard to limitations in functioning. PMID:27259343
Insightful problem solving in an Asian elephant.

PubMed

Foerder, Preston; Galloway, Marie; Barthel, Tony; Moore, Donald E; Reiss, Diana

2011-01-01

The "aha" moment or the sudden arrival of the solution to a problem is a common human experience. Spontaneous problem solving without evident trial and error behavior in humans and other animals has been referred to as insight. Surprisingly, elephants, thought to be highly intelligent, have failed to exhibit insightful problem solving in previous cognitive studies. We tested whether three Asian elephants (Elephas maximus) would use sticks or other objects to obtain food items placed out-of-reach and overhead. Without prior trial and error behavior, a 7-year-old male Asian elephant showed spontaneous problem solving by moving a large plastic cube, on which he then stood, to acquire the food. In further testing he showed behavioral flexibility, using this technique to reach other items and retrieving the cube from various locations to use as a tool to acquire food. In the cube's absence, he generalized this tool utilization technique to other objects and, when given smaller objects, stacked them in an attempt to reach the food. The elephant's overall behavior was consistent with the definition of insightful problem solving. Previous failures to demonstrate this ability in elephants may have resulted not from a lack of cognitive ability but from the presentation of tasks requiring trunk-held sticks as potential tools, thereby interfering with the trunk's use as a sensory organ to locate the targeted food.
Diagnostic value of history and physical examination in patients suspected of lumbosacral nerve root compression

PubMed Central

Vroomen, P; de Krom, M C T F M; Wilmink, J; Kester, A; Knottnerus, J

2002-01-01

Objective: To evaluate patient characteristics, symptoms, and examination findings in the clinical diagnosis of lumbosacral nerve root compression causing sciatica. Methods: The study involved 274 patients with pain radiating into the leg. All had a standardised clinical assessment and magnetic resonance (MR) imaging. The associations between patient characteristics, clinical findings, and lumbosacral nerve root compression on MR imaging were analysed. Results: Nerve root compression was associated with three patient characteristics, three symptoms, and four physical examination findings (paresis, absence of tendon reflexes, a positive straight leg raising test, and increased finger-floor distance). Multivariate analysis, analysing the independent diagnostic value of the tests, showed that nerve root compression was predicted by two patient characteristics, four symptoms, and two signs (increased finger-floor distance and paresis). The straight leg raise test was not predictive. The area under the curve of the receiver-operating characteristic was 0.80 for the history items. It increased to 0.83 when the physical examination items were added. Conclusions: Various clinical findings were found to be associated with nerve root compression on MR imaging. While this set of findings agrees well with those commonly used in daily practice, the tests tended to have lower sensitivity and specificity than previously reported. Stepwise multivariate analysis showed that most of the diagnostic information revealed by physical examination findings had already been revealed by the history items. PMID:11971050
Development, testing, and certification of Owens-Illinois model SEC-601 solar energy collector system

NASA Technical Reports Server (NTRS)

Parker, J. C.

1979-01-01

The final results are presented of the additional development work on the existing air-cooled solar energy collector subsystem for use with solar heating and cooling systems. The report discusses the intended use of the final report, describes the deliverable end items, lists program objectives, relates how they were accomplished, deals with problems encountered during fabrication and testing, and includes a certification statement of performance. The report shows that the products developed are marketable and suitable for public use.
Audio Adapted Assessment Data: Does the Addition of Audio to Written Items Modify the Item Calibration?

ERIC Educational Resources Information Center

Snyder, James

2010-01-01

This dissertation research examined the changes in item RIT calibration that occurred when adding audio to a set of currently calibrated RIT items and then placing these new items as field test items in the modified assessments on the NWEA MAP test platform. The researcher used test results from over 600 students in the Poway School District in…
Vegetable parenting practices scale: Item response modeling analyses

USDA-ARS?s Scientific Manuscript database

Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...
Application of an IRT Polytomous Model for Measuring Health Related Quality of Life

ERIC Educational Resources Information Center

Tejada, Antonio J. Rojas; Rojas, Oscar M. Lozano

2005-01-01

Background: The Item Response Theory (IRT) has advantages for measuring Health Related Quality of Life (HRQOL) as opposed to the Classical Tests Theory (CTT). Objectives: To present the results of the application of a polytomous model based on IRT, specifically, the Rating Scale Model (RSM), to measure HRQOL with the EORTC QLQ-C30. Methods: 103…

A Study on Linear Programming Applications for the Optimization of School Lunch Menus. Summation Report.

ERIC Educational Resources Information Center

Findorff, Irene K.

This document summarizes the results of a project at Tulane University that was designed to adapt, test, and evaluate a computerized information and menu planning system utilizing linear programing techniques for use in school lunch food service operations. The objectives of the menu planning were to formulate menu items into a palatable,…
Nouns Referring to Tools and Natural Objects Differentially Modulate the Motor System

ERIC Educational Resources Information Center

Gough, Patricia M.; Riggio, Lucia; Chersi, Fabian; Sato, Marc; Fogassi, Leonardo; Buccino, Giovanni

2012-01-01

While increasing evidence points to a critical role for the motor system in language processing, the focus of previous work has been on the linguistic category of verbs. Here we tested whether nouns are effective in modulating the motor system and further whether different kinds of nouns--those referring to artifacts or natural items, and items…
Application of a Utility Analysis to Evaluate a Novel Assessment Tool for Clinically Oriented Physiology and Pharmacology

ERIC Educational Resources Information Center

Cramer, Nicholas; Asmar, Abdo; Gorman, Laurel; Gros, Bernard; Harris, David; Howard, Thomas; Hussain, Mujtaba; Salazar, Sergio; Kibble, Jonathan D.

2016-01-01

Multiple-choice questions are a gold-standard tool in medical school for assessment of knowledge and are the mainstay of licensing examinations. However, multiple-choice questions items can be criticized for lacking the ability to test higher-order learning or integrative thinking across multiple disciplines. Our objective was to develop a novel…
Rap-Music Attitude and Perception Scale: A Validation Study

ERIC Educational Resources Information Center

Tyson, Edgar H.

2006-01-01

Objective: This study tests the validity of the Rap-music Attitude and Perception (RAP) Scale, a 1-page, 24-item measure of a person's thoughts and feelings surrounding the effects and content of rap music. The RAP was designed as a rapid assessment instrument for youth programs and practitioners using rap music and hip hop culture in their work…
A NORMATIVE STUDY OF CHILDREN'S HOUSE-TREE-PERSON DRAWINGS.

ERIC Educational Resources Information Center

RAPPAPORT, SHELDON R.

THIS STUDY WAS THE FIRST PHASE OF A THREE-PART PROJECT WHOSE GOAL IS TO ESTABLISH VALID CRITERIA FOR IDENTIFYING THE HOUSE-TREE-PERSON (H-T-P) DRAWINGS OF NORMAL CHILDREN THROUGHOUT THE ELEMENTARY SCHOOL YEARS. THE SPECIFIC OBJECTIVES OF THIS STUDY WERE (1) TO IDENTIFY WHICH ITEMS OF THE H-T-P TEST CHARACTERIZE NORMAL DEVELOPMENT THROUGH GRADES 2,…
77 FR 48533 - Notice of Intent To Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-08-14

... Indian tribes, has determined that the cultural items meet the definition of sacred objects and... individuals who believe they are lineal descendants of the individual who owned these sacred objects and who... individuals who believe they are lineal descendants of the individual who owned these sacred objects and who...
77 FR 19698 - Notice of Intent to Repatriate Cultural Items: Rochester Museum & Science Center, Rochester, NY

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-02

... Indian tribe, has determined that the cultural items meet the definition of both sacred objects and... Rochester Museum & Science Center that meet the definition of both sacred objects and objects of cultural.... Traditional religious leaders of the Seneca Nation of New York have identified these medicine faces as being...
75 FR 14460 - Notice of Intent to Repatriate Cultural Items: U.S. Department of the Interior, National Park...

Federal Register 2010, 2011, 2012, 2013, 2014

2010-03-25

... National Park, WY, that meet the definition of ``sacred objects'' under 25 U.S.C. 3001. This notice is... Indians of the Kickapoo Reservation in Kansas have identified these two cultural items as ``sacred objects... are specific ceremonial objects needed by traditional Native American religious leaders for the...
The discrimination of discrete and continuous amounts in African grey parrots (Psittacus erithacus).

PubMed

Aïn, Syrina Al; Giret, Nicolas; Grand, Marion; Kreutzer, Michel; Bovet, Dalila

2009-01-01

A wealth of research in infants and animals demonstrates discrimination of quantities, in some cases nonverbal numerical perception, and even elementary calculation capacities. We investigated the ability of three African grey parrots (Psittacus erithacus) to select the largest amount of food between two sets, either discrete food items (experiment 1) or as volume of a food substance (experiment 2). The two amounts were presented simultaneously and were visible at the time of choice. Parrots were tested several times for all possible combinations between 1 and 5 seeds or 0.2 and 1 ml of food substance. In both conditions, subjects performed above chance for almost all combinations. Accuracy was negatively correlated with the ratio, that is performance improved with greater differences between amounts. Therefore, these results with both individual items and volume discrimination suggest that parrots use an analogue of magnitude, rather than object-file mechanisms to quantify items and substances.
A cost-effective method to characterize variation in clinical practice.

PubMed

Chang, K; Sauereisen, S; Dlutowski, M; Veloski, J J; Nash, D B

1999-06-01

This study's objective was to measure variation in physicians' practice styles and policies. Family physicians and general internists were surveyed about evidence-based medicine in the areas of asthma, congestive heart failure, and diabetes mellitus. They were asked about clinical recommendations where standards of practice were uncertain, controversial, or changing in response to published guidelines. Also included were items dealing with managed care. Although there was wide variation in responses to 20 of 36 items, some responses were consistent with practice guidelines. Responses to several items indicated a tendency to overuse expensive tests. Overall, the results indicate that a brief, open-ended survey can assess practice variation quickly and economically, as contrasted with more expensive analyses of medical records or claims data. With proper validation such assessments can be used as baselines to guide interventions, as well as measures of the outcomes of these interventions to change practice styles.
Validation of an Empathy Scale in Pharmacy and Nursing Students

PubMed Central

Chen, Aleda M. H.; Yehle, Karen S.; Plake, Kimberly S.

2013-01-01

Objective. To validate an empathy scale to measure empathy in pharmacy and nursing students. Methods. A 15-item instrument comprised of the cognitive and affective empathy domains, was created. Each item was rated using a 7-point Likert scale, ranging from strongly disagree to strongly agree. Concurrent validity was demonstrated with the Jefferson Scale of Empathy – Health Professional Students (JSE-HPS). Results. Reliability analysis of data from 216 students (pharmacy, N=158; nursing, N=58) showed that scores on the empathy scale were positively associated with JSE-HPS scores (p<0.001). Factor analysis confirmed that 14 of the 15 items were significantly associated with their respective domain, but the overall instrument had limited goodness of fit. Conclusions. Results of this study demonstrate the reliability and validity of a new scale for evaluating student empathy. Further testing of the scale at other universities is needed to establish validity. PMID:23788805
Attitude measurement: Judging the emotional intensity of likert-type science attitude statements

NASA Astrophysics Data System (ADS)

Shrigley, Robert L.; Koballa, Thomas R., Jr.

Emotional intensity, that readiness of a teacher to respond favorably or unfavorably toward such psychological objects as science or the teaching of science, is the quality that distinguishes the attitude concept from other related psychological concepts. It would seem, then, that valid attitude statements, if they are to reflect the definition of attitude, would evoke emotional intensity, responses in both a favorable and unfavorable direction by a group of teachers on each item on a science attitude scale. Science educators who design or modify science attitude scales should continue using item-total correlations and other quantitative techniques to test for emotional intensity, but qualitative judgments are necessary, too. In addition, the frequency distribution of data generated by each statement should be examined for skewness and high percentages of neutral responses, both of which can impair the emotional intensity of an item.
[Research of the Epworth sleepiness scale based on ruzzy comprehensive evaluation].

PubMed

Li, P; Lv, Y H; Ma, L; Yang, S H; Xiang, Y; Lei, Q; Du, G D; Huang, D J

2017-03-05

Objective: This research explores the effect of Epworth sleepiness scale (ESS) items on domestic patients. Method: Four thousand six hundred and thirty-three suspected OSAHS patients with snoring were selected from respiratory sleep center in the first people's hospital, Yunnan province, between January 2006 and December 2012. These patients filled in the ESS before PSG test. Firstly, these questionnaires were preprocessed, and the null and incorrect ones were deleted. Then, the fuzzy comprehensive evaluation was applied for the value of each item in ESS. Finally, the reliability was compared between before and after the removal of the lowest values. Result: Fuzzy comprehensive evaluation results show that the total value is 1.016, the item value of Sitting and talking to someone and In a car, while stopped for a few minutes in traffic is the lowest, which is 0.131. The result of reliability analysis shows that the value increases 0.2% after the two items being deleted. Conclusion: Some items of ESS are not suitable for Chinese patients, and they need to be deleted or modified to improve the screening efficiency. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.
Reward associations impact both iconic and visual working memory.

PubMed

Infanti, Elisa; Hickey, Clayton; Turatto, Massimo

2015-02-01

Reward plays a fundamental role in human behavior. A growing number of studies have shown that stimuli associated with reward become salient and attract attention. The aim of the present study was to extend these results into the investigation of iconic memory and visual working memory. In two experiments we asked participants to perform a visual-search task where different colors of the target stimuli were paired with high or low reward. We then tested whether the pre-established feature-reward association affected performance on a subsequent visual memory task, in which no reward was provided. In this test phase participants viewed arrays of 8 objects, one of which had unique color that could match the color associated with reward during the previous visual-search task. A probe appeared at varying intervals after stimulus offset to identify the to-be-reported item. Our results suggest that reward biases the encoding of visual information such that items characterized by a reward-associated feature interfere with mnemonic representations of other items in the test display. These results extend current knowledge regarding the influence of reward on early cognitive processes, suggesting that feature-reward associations automatically interact with the encoding and storage of visual information, both in iconic memory and visual working memory. Copyright © 2014 Elsevier Ltd. All rights reserved.
Expectancies, socioeconomic status, and self-rated health: use of the simplified TOMCATS Questionnaire.

PubMed

Odéen, Magnus; Westerlund, Hugo; Theorell, Töres; Leineweber, Constanze; Eriksen, Hege R; Ursin, Holger

2013-06-01

Coping has traditionally been measured with inventories containing many items meant to identify specific coping strategies. An alternative is to develop a shorter inventory that focusses on coping expectancies which may determine the extent to which an individual attempts to cope actively. This paper explores the usefulness and validity of a simplified seven-item questionnaire (Theoretically Originated Measure of the Cognitive Activation Theory of Stress, TOMCATS) for response outcome expectancies defined either as positive ("coping"), negative ("hopelessness"), or none ("helplessness"). The definitions are based on the Cognitive Activation Theory of Stress (CATS; Ursin and Eriksen, Psychoneuroendocrinology, 29(5):567–92, 2004). The questionnaire was tested in two different samples. First, the questionnaire was compared with a traditional test of coping and then tested for validity in relation to socioeconomic differences in self-reported health. The first study was a comparison of the brief TOMCATS with a short version of the Utrecht Coping List (UCL; Eriksen et al., Scand J Psychol, 38(3):175–82, 1997). Both questionnaires were tested in a population of 1,704 Norwegian municipality workers. The second study was a cross-sectional analysis of TOMCATS, subjective and objective socioeconomic status, and health in a representative sample of the Swedish working population in 2003–2005 (N = 11,441). In the first study, the coping item in the TOMCATS questionnaire showed an expected significant positive correlation with the UCL factors of instrumental mastery-oriented coping and negative correlations with passive and depressive scores. There were also the expected correlations for the helplessness and hopelessness scores, but there was no clear distinction between helplessness and hopelessness in the way they correlated with the UCL. In the second study, the coping item in TOMCATS and the three-item helplessness scores showed clear and monotonous gradients over a subjective socioeconomic status (SES) ladder. Positive response outcome expectancy ("coping") was related to high subjective SES and no expectancy ("helplessness") to low subjective SES. In a model including age and sex, TOMCATS scores explained more variance (r 2 = 0.16) in self-reported health than both subjective (r 2 = 0.08) and objective SES (r 2 = 0.02). The brief TOMCATS questionnaire showed acceptable and significant correlations with a traditional coping questionnaire and is sensitive enough to register systematic differences in response outcome expectancies across the socioeconomic ladder. The results furthermore confirm that psychological and learning factors contribute to the socioeconomic gradient in health.
An international measure of awareness and beliefs about cancer: development and testing of the ABC

PubMed Central

Simon, Alice E; Forbes, Lindsay J L; Boniface, David; Warburton, Fiona; Brain, Kate E; Dessaix, Anita; Donnelly, Michael; Haynes, Kerry; Hvidberg, Line; Lagerlund, Magdalena; Petermann, Lisa; Tishelman, Carol; Vedsted, Peter; Vigmostad, Maria Nyre; Wardle, Jane; Ramirez, Amanda J

2012-01-01

Objectives To develop an internationally validated measure of cancer awareness and beliefs; the awareness and beliefs about cancer (ABC) measure. Design and setting Items modified from existing measures were assessed by a working group in six countries (Australia, Canada, Denmark, Norway, Sweden and the UK). Validation studies were completed in the UK, and cross-sectional surveys of the general population were carried out in the six participating countries. Participants Testing in UK English included cognitive interviewing for face validity (N=10), calculation of content validity indexes (six assessors), and assessment of test–retest reliability (N=97). Conceptual and cultural equivalence of modified (Canadian and Australian) and translated (Danish, Norwegian, Swedish and Canadian French) ABC versions were tested quantitatively for equivalence of meaning (≥4 assessors per country) and in bilingual cognitive interviews (three interviews per translation). Response patterns were assessed in surveys of adults aged 50+ years (N≥2000) in each country. Main outcomes Psychometric properties were evaluated through tests of validity and reliability, conceptual and cultural equivalence and systematic item analysis. Test–retest reliability used weighted-κ and intraclass correlations. Construction and validation of aggregate scores was by factor analysis for (1) beliefs about cancer outcomes, (2) beliefs about barriers to symptomatic presentation, and item summation for (3) awareness of cancer symptoms and (4) awareness of cancer risk factors. Results The English ABC had acceptable test–retest reliability and content validity. International assessments of equivalence identified a small number of items where wording needed adjustment. Survey response patterns showed that items performed well in terms of difficulty and discrimination across countries except for awareness of cancer outcomes in Australia. Aggregate scores had consistent factor structures across countries. Conclusions The ABC is a reliable and valid international measure of cancer awareness and beliefs. The methods used to validate and harmonise the ABC may serve as a methodological guide in international survey research. PMID:23253874
Student science achievement and the integration of Indigenous knowledge on standardized tests

NASA Astrophysics Data System (ADS)

Dupuis, Juliann; Abrams, Eleanor

2017-09-01

In this article, we examine how American Indian students in Montana performed on standardized state science assessments when a small number of test items based upon traditional science knowledge from a cultural curriculum, "Indian Education for All", were included. Montana is the first state in the US to mandate the use of a culturally relevant curriculum in all schools and to incorporate this curriculum into a portion of the standardized assessment items. This study compares White and American Indian student test scores on these particular test items to determine how White and American Indian students perform on culturally relevant test items compared to traditional standard science test items. The connections between student achievement on adapted culturally relevant science test items versus traditional items brings valuable insights to the fields of science education, research on student assessments, and Indigenous studies.
Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

ERIC Educational Resources Information Center

Aybek, Eren Can; Demirtasli, R. Nukhet

2017-01-01

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…
An Effect Size Measure for Raju's Differential Functioning for Items and Tests

ERIC Educational Resources Information Center

Wright, Keith D.; Oshima, T. C.

2015-01-01

This study established an effect size measure for differential functioning for items and tests' noncompensatory differential item functioning (NCDIF). The Mantel-Haenszel parameter served as the benchmark for developing NCDIF's effect size measure for reporting moderate and large differential item functioning in test items. The effect size of…
Detecting a Gender-Related DIF Using Logistic Regression and Transformed Item Difficulty

ERIC Educational Resources Information Center

Abedlaziz, Nabeel; Ismail, Wail; Hussin, Zaharah

2011-01-01

Test items are designed to provide information about the examinees. Difficult items are designed to be more demanding and easy items are less so. However, sometimes, test items carry with their demands other than those intended by the test developer (Scheuneman & Gerritz, 1990). When personal attributes such as gender systematically affect…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.