Science.gov

Sample records for item set mining

  1. Finding neural assemblies with frequent item set mining

    PubMed Central

    Picado-Muiño, David; Borgelt, Christian; Berger, Denise; Gerstein, George; Grün, Sonja

    2013-01-01

    Cell assemblies, defined as groups of neurons exhibiting precise spike coordination, were proposed as a model of network processing in the cortex. Fortunately, in recent years considerable progress has been made in multi-electrode recordings, which enable recording massively parallel spike trains of hundred(s) of neurons simultaneously. However, due to the challenges inherent in multivariate approaches, most studies in favor of cortical cell assemblies still resorted to analyzing pairwise interactions. However, to recover the underlying correlation structures, higher-order correlations need to be identified directly. Inspired by the Accretion method proposed by Gerstein et al. (1978) we propose a new assembly detection method based on frequent item set mining (FIM). In contrast to Accretion, FIM searches effectively and without redundancy for individual spike patterns that exceed a given support threshold. We study different search methods, with which the space of potential cell assemblies may be explored, as well as different test statistics and subset conditions with which candidate assemblies may be assessed and filtered. It turns out that a core challenge of cell assembly detection is the problem of multiple testing, which causes a large number of false discoveries. Unfortunately, criteria that address individual candidate assemblies and try to assess them with statistical tests and/or subset conditions do not help much to tackle this problem. The core idea of our new method is that in order to cope with the multiple testing problem one has to shift the focus of statistical testing from specific assemblies (consisting of a specific set of neurons) to spike patterns of a certain size (i.e., with a certain number of neurons). This significantly reduces the number of necessary tests, thus alleviating the multiple testing problem. We demonstrate that our method is able to reliably suppress false discoveries, while it is still very sensitive in discovering

  2. Gene expression data analysis using closed item set mining for labeled data.

    PubMed

    Rotter, Ana; Novak, Petra Kralj; Baebler, Spela; Toplak, Natasa; Blejec, Andrej; Lavrac, Nada; Gruden, Kristina

    2010-04-01

    This article presents an approach to microarray data analysis using discretised expression values in combination with a methodology of closed item set mining for class labeled data (RelSets). A statistical 2 x 2 factorial design analysis was run in parallel. The approach was validated on two independent sets of two-color microarray experiments using potato plants. Our results demonstrate that the two different analytical procedures, applied on the same data, are adequate for solving two different biological questions being asked. Statistical analysis is appropriate if an overview of the consequences of treatments and their interaction terms on the studied system is needed. If, on the other hand, a list of genes whose expression (upregulation or downregulation) differentiates between classes of data is required, the use of the RelSets algorithm is preferred. The used algorithms are freely available upon request to the authors.

  3. Using Frequent Item Set Mining and Feature Selection Methods to Identify Interacted Risk Factors - The Atrial Fibrillation Case Study.

    PubMed

    Li, Xiang; Liu, Haifeng; Du, Xin; Hu, Gang; Xie, Guotong; Zhang, Ping

    2016-01-01

    Disease risk prediction is highly important for early intervention and treatment, and identification of predictive risk factors is the key point to achieve accurate prediction. In addition to original independent features in a dataset, some interacted features, such as comorbidities and combination therapies, may have non-additive influence on the disease outcome and can also be used in risk prediction to improve the prediction performance. However, it is usually difficult to manually identify the possible interacted risk factors due to the combination explosion of features. In this paper, we propose an automatic approach to identify predictive risk factors with interactions using frequent item set mining and feature selection methods. The proposed approach was applied in the real world case study of predicting ischemic stroke and thromboembolism for atrial fibrillation patients on the Chinese atrial fibrillation registry dataset, and the results show that our approach can not only improve the prediction performance, but also identify the comorbidities and combination therapies that have potential influences on TE occurrence for AF.

  4. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 17 Commodity and Securities Exchanges 2 2013-04-01 2013-04-01 false (Item 104) Mine safety... ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Business § 229.104 (Item 104) Mine safety... other mine shall provide the information specified below for the time period covered by the report: (1...

  5. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 17 Commodity and Securities Exchanges 3 2014-04-01 2014-04-01 false (Item 104) Mine safety... ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Business § 229.104 (Item 104) Mine safety... other mine shall provide the information specified below for the time period covered by the report: (1...

  6. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 17 Commodity and Securities Exchanges 2 2012-04-01 2012-04-01 false (Item 104) Mine safety... ENERGY POLICY AND CONSERVATION ACT OF 1975-REGULATION S-K Business § 229.104 (Item 104) Mine safety... other mine shall provide the information specified below for the time period covered by the report: (1...

  7. Preference Mining Using Neighborhood Rough Set Model on Two Universes

    PubMed Central

    2016-01-01

    Preference mining plays an important role in e-commerce and video websites for enhancing user satisfaction and loyalty. Some classical methods are not available for the cold-start problem when the user or the item is new. In this paper, we propose a new model, called parametric neighborhood rough set on two universes (NRSTU), to describe the user and item data structures. Furthermore, the neighborhood lower approximation operator is used for defining the preference rules. Then, we provide the means for recommending items to users by using these rules. Finally, we give an experimental example to show the details of NRSTU-based preference mining for cold-start problem. The parameters of the model are also discussed. The experimental results show that the proposed method presents an effective solution for preference mining. In particular, NRSTU improves the recommendation accuracy by about 19% compared to the traditional method. PMID:28044074

  8. Preference Mining Using Neighborhood Rough Set Model on Two Universes.

    PubMed

    Zeng, Kai

    2016-01-01

    Preference mining plays an important role in e-commerce and video websites for enhancing user satisfaction and loyalty. Some classical methods are not available for the cold-start problem when the user or the item is new. In this paper, we propose a new model, called parametric neighborhood rough set on two universes (NRSTU), to describe the user and item data structures. Furthermore, the neighborhood lower approximation operator is used for defining the preference rules. Then, we provide the means for recommending items to users by using these rules. Finally, we give an experimental example to show the details of NRSTU-based preference mining for cold-start problem. The parameters of the model are also discussed. The experimental results show that the proposed method presents an effective solution for preference mining. In particular, NRSTU improves the recommendation accuracy by about 19% compared to the traditional method.

  9. Efficient Algorithms for Segmentation of Item-Set Time Series

    NASA Astrophysics Data System (ADS)

    Chundi, Parvathi; Rosenkrantz, Daniel J.

    We propose a special type of time series, which we call an item-set time series, to facilitate the temporal analysis of software version histories, email logs, stock market data, etc. In an item-set time series, each observed data value is a set of discrete items. We formalize the concept of an item-set time series and present efficient algorithms for segmenting a given item-set time series. Segmentation of a time series partitions the time series into a sequence of segments where each segment is constructed by combining consecutive time points of the time series. Each segment is associated with an item set that is computed from the item sets of the time points in that segment, using a function which we call a measure function. We then define a concept called the segment difference, which measures the difference between the item set of a segment and the item sets of the time points in that segment. The segment difference values are required to construct an optimal segmentation of the time series. We describe novel and efficient algorithms to compute segment difference values for each of the measure functions described in the paper. We outline a dynamic programming based scheme to construct an optimal segmentation of the given item-set time series. We use the item-set time series segmentation techniques to analyze the temporal content of three different data sets—Enron email, stock market data, and a synthetic data set. The experimental results show that an optimal segmentation of item-set time series data captures much more temporal content than a segmentation constructed based on the number of time points in each segment, without examining the item set data at the time points, and can be used to analyze different types of temporal data.

  10. Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets.

    ERIC Educational Resources Information Center

    Srinivasan, Padmini; Ruiz, Miguel E.; Kraft, Donald H.; Chen, Jianhua

    2001-01-01

    Explains vocabulary mining in information retrieval and describes a framework for vocabulary mining that allows the use of rough set-based approximations even when documents and queries are described using weighted, or fuzzy, representations. Examines coordination between multiple vocabulary views and applies the framework to the Unified Medical…

  11. Set of Frequent Word Item sets as Feature Representation for Text with Indonesian Slang

    NASA Astrophysics Data System (ADS)

    Sa'adillah Maylawati, Dian; Putri Saptawati, G. A.

    2017-01-01

    Indonesian slang are commonly used in social media. Due to their unstructured syntax, it is difficult to extract their features based on Indonesian grammar for text mining. To do so, we propose Set of Frequent Word Item sets (SFWI) as text representation which is considered match for Indonesian slang. Besides, SFWI is able to keep the meaning of Indonesian slang with regard to the order of appearance sentence. We use FP-Growth algorithm with adding separation sentence function into the algorithm to extract the feature of SFWI. The experiments is done with text data from social media such as Facebook, Twitter, and personal website. The result of experiments shows that Indonesian slang were more correctly interpreted based on SFWI.

  12. Item Selection Strategy for Reducing the Number of Items Rated in an Angoff Standard Setting Study

    ERIC Educational Resources Information Center

    Ferdous, Abdullah A.; Plake, Barbara S.

    2007-01-01

    In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…

  13. Effects of Decreasing the Number of Common Items in Equating Link Item Sets.

    ERIC Educational Resources Information Center

    Hanick, Patricia L.; Huang, Chi-Yu

    The term "equating" refers to a statistical procedure that adjusts test scores on different forms of the same examination so that scores can be interpreted interchangeably. This study examines the impact of equating with fewer items than originally planned when items have been removed from the equating set for a variety of reasons. A…

  14. Identifying and Managing Local Item Dependence in Context-Dependent Item Sets.

    ERIC Educational Resources Information Center

    Allen, Sally; Sudweeks, Richard R.

    A study was conducted to identify local item dependence (LID) in the context-dependent item sets used in an examination prepared for use in an introductory university physics class and to assess the effects of LID on estimates of the reliability and standard error of measurement. Test scores were obtained for 487 students in the physics class. The…

  15. Mining multi-item drug adverse effect associations in spontaneous reporting systems

    PubMed Central

    2010-01-01

    Background Multi-item adverse drug event (ADE) associations are associations relating multiple drugs to possibly multiple adverse events. The current standard in pharmacovigilance is bivariate association analysis, where each single drug-adverse effect combination is studied separately. The importance and difficulty in the detection of multi-item ADE associations was noted in several prominent pharmacovigilance studies. In this paper we examine the application of a well established data mining method known as association rule mining, which we tailored to the above problem, and demonstrate its value. The method was applied to the FDAs spontaneous adverse event reporting system (AERS) with minimal restrictions and expectations on its output, an experiment that has not been previously done on the scale and generality proposed in this work. Results Based on a set of 162,744 reports of suspected ADEs reported to AERS and published in the year 2008, our method identified 1167 multi-item ADE associations. A taxonomy that characterizes the associations was developed based on a representative sample. A significant number (67% of the total) of potential multi-item ADE associations identified were characterized and clinically validated by a domain expert as previously recognized ADE associations. Several potentially novel ADEs were also identified. A smaller proportion (4%) of associations were characterized and validated as known drug-drug interactions. Conclusions Our findings demonstrate that multi-item ADEs are present and can be extracted from the FDA’s adverse effect reporting system using our methodology, suggesting that our method is a valid approach for the initial identification of multi-item ADEs. The study also revealed several limitations and challenges that can be attributed to both the method and quality of data. PMID:21044365

  16. Mining knowledge in astrophysical massive data sets

    NASA Astrophysics Data System (ADS)

    Brescia, Massimo; Longo, Giuseppe; Pasian, Fabio

    2010-11-01

    Modern scientific data mainly consist of huge data sets gathered by a very large number of techniques and stored in much diversified and often incompatible data repositories. More in general, in the e-science environment, it is considered as a critical and urgent requirement to integrate services across distributed, heterogeneous, dynamic “virtual organizations” formed by different resources within a single enterprise. In the last decade, Astronomy has become an immensely data-rich field due to the evolution of detectors (plates to digital to mosaics), telescopes and space instruments. The Virtual Observatory approach consists of the federation under common standards of all astronomical archives available worldwide, as well as data analysis, data mining and data exploration applications. The main drive behind such an effort is that once the infrastructure is complete, it will allow a new type of multi-wavelength, multi-epoch science, which can only be barely imagined. Data mining, or knowledge discovery in databases, while being the main methodology to extract the scientific information contained in such Massive Data Sets (MDS), poses crucial problems since it has to orchestrate complex problems posed by transparent access to different computing environments, scalability of algorithms, reusability of resources, etc. In the present paper we summarize the present status of the MDS in the Virtual Observatory and what is currently done and planned to bring advanced data mining methodologies in the case of the DAME (DAta Mining and Exploration) project.

  17. Confirmatory Factor Analysis of the M5-50: An Implementation of the International Personality Item Pool Item Set

    ERIC Educational Resources Information Center

    Socha, Alan; Cooper, Christopher A.; McCord, David M.

    2010-01-01

    Goldberg's International Personality Item Pool (IPIP; Goldberg, 1999) provides researchers with public-domain, free-access personality measurement scales that are proxies of well-established published scales. One of the more commonly used IPIP sets employs 50 items to measure the 5 broad domains of the 5-factor model, with 10 items per factor. The…

  18. Double Mine Building, general view in setting; view northeast ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    Double Mine Building, general view in setting; view northeast - Fort McKinley, Double Mine Building, East side of East Side Drive, approximately 125 feet south of Weymouth Way, Great Diamond Island, Portland, Cumberland County, ME

  19. Data Mining Using Extensions of the Rough Set Model.

    ERIC Educational Resources Information Center

    Lingras, P. J.; Yao, Y. Y.

    1998-01-01

    Examines basic issues of data mining using the theory of rough sets, a recent proposal for generalizing classical set theory. Demonstrates that a generalized rough set model can be used for generating rules from incomplete databases. Discusses the importance of rule extraction from incomplete databases in data mining. (AEF)

  20. Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

    DOEpatents

    Meyers, Charles E.; Davidson, George S.; Johnson, David K.; Hendrickson, Bruce A.; Wylie, Brian N.

    1999-01-01

    A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.

  1. 48 CFR 552.219-70 - Allocation of Orders-Partially Set-Aside Items.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ...-Partially Set-Aside Items. 552.219-70 Section 552.219-70 Federal Acquisition Regulations System GENERAL... and Clauses 552.219-70 Allocation of Orders—Partially Set-Aside Items. As prescribed in 519.508, insert the following clause: Allocation of Orders—Partially Set-Aside Items (SEP 1999) Where the...

  2. Innovations in Measuring Rater Accuracy in Standard Setting: Assessing "Fit" to Item Characteristic Curves

    ERIC Educational Resources Information Center

    Hurtz, Gregory M.; Jones, J. Patrick

    2009-01-01

    Standard setting methods such as the Angoff method rely on judgments of item characteristics; item response theory empirically estimates item characteristics and displays them in item characteristic curves (ICCs). This study evaluated several indexes of rater fit to ICCs as a method for judging rater accuracy in their estimates of expected item…

  3. Confirmatory factor analysis of the M5-50: an implementation of the International Personality Item Pool item set.

    PubMed

    Socha, Alan; Cooper, Christopher A; McCord, David M

    2010-03-01

    Goldberg's International Personality Item Pool (IPIP; Goldberg, 1999) provides researchers with public-domain, free-access personality measurement scales that are proxies of well-established published scales. One of the more commonly used IPIP sets employs 50 items to measure the 5 broad domains of the 5-factor model, with 10 items per factor. The M5-50 (McCord, 2002) is a specific ordering and presentation of this 50-item set. Using data from a sample of 760 faculty, staff, and students at a midsized university, the authors assessed the reliability and construct validity of the M5-50. Cronbach's alphas on the 5 scales ranged from acceptable to excellent. Confirmatory factor analysis indicated reasonably good model fit. Researchers who wish to measure personality would be well advised to consider using the M5-50. PsycINFO Database Record (c) 2010 APA, all rights reserved.

  4. Multiple-Choice Item Difficulty: The Effects of Language and Distracter Set Similarity.

    ERIC Educational Resources Information Center

    Green, Kathy E.

    The purpose of this study was to determine whether item difficulty is significantly affected by language difficulty and response set convergence. Language difficulty was varied by increasing sentence (stem) length, increasing syntactic complexity, and substituting uncommon words for more familiar terms in the item stem. Item wording ranged from…

  5. Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

    ERIC Educational Resources Information Center

    Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S.

    2014-01-01

    This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

  6. Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

    ERIC Educational Resources Information Center

    Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S.

    2014-01-01

    This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

  7. Identifying an Efficient Set of Items Sensitive to Clinical-Range Externalizing Problems in Children

    PubMed Central

    Petersen, Isaac T.; Bates, John E.; Dodge, Kenneth A.; Lansford, Jennifer E.; Pettit, Gregory S.

    2015-01-01

    The present study applied item response theory to identify an efficient set of items of the Achenbach Externalizing scale from the Child Behavior Checklist (CBCL; 33 items) and Teacher's Report Form (TRF; 35 items) that were sensitive to clinical-range scores. Mothers and teachers rated children's externalizing problems annually from ages 5 to 13 years in two independent samples (Ns = 585 and 1,199). Item properties for each rater across ages 5–8 and 9–13 were examined with item response theory. We identified 10 mother- and teacher-reported items from both samples based on the items' measurement precision for sub-clinical and clinical levels of externalizing problems: externalizing problems that involve meanness to others, destroying others' things, fighting, lying and cheating, attacking people, screaming, swearing/obscene language, temper tantrums, threatening people, and being loud. Scores on the scales using these items had strong reliability and psychometric properties, capturing nearly as much information as the full Externalizing scale for classifying clinical levels of externalizing problems. Scores on the scale with the 10 CBCL items had moderate accuracy, equivalent to the full Externalizing scale, in classifying diagnoses of conduct disorder based on a research diagnostic interview. Of course, comprehensive clinical assessment would consider additional items, dimensions of behavior, and sources of information, too, but it appears that the behaviors tapped by this select set of items may be core to externalizing psychopathology in children. PMID:26322800

  8. Identifying an efficient set of items sensitive to clinical-range externalizing problems in children.

    PubMed

    Petersen, Isaac T; Bates, John E; Dodge, Kenneth A; Lansford, Jennifer E; Pettit, Gregory S

    2016-05-01

    The present study applied item response theory to identify an efficient set of items of the Achenbach Externalizing scale from the Child Behavior Checklist (CBCL; 33 items) and Teacher's Report Form (TRF; 35 items) that were sensitive to clinical-range scores. Mothers and teachers rated children's externalizing problems annually from ages 5 to 13 years in 2 independent samples (Ns = 585 and 1,199). Item properties for each rater across ages 5-8 and 9-13 were examined with item response theory. We identified 10 mother- and teacher-reported items from both samples based on the items' measurement precision for subclinical and clinical levels of externalizing problems: externalizing problems that involve meanness to others, destroying others' things, fighting, lying and cheating, attacking people, screaming, swearing/obscene language, temper tantrums, threatening people, and being loud. Scores on the scales using these items had strong reliability and psychometric properties, capturing nearly as much information as the full Externalizing scale for classifying clinical levels of externalizing problems. Scores on the scale with the 10 CBCL items had moderate accuracy, equivalent to the full Externalizing scale, in classifying diagnoses of conduct disorder based on a research diagnostic interview. Of course, comprehensive clinical assessment would consider additional items, dimensions of behavior, and sources of information, too, but it appears that the behaviors tapped by this select set of items may be core to externalizing psychopathology in children. (PsycINFO Database Record

  9. Mining large heterogeneous data sets in drug discovery.

    PubMed

    Wild, David J

    2009-10-01

    Increasingly, effective drug discovery involves the searching and data mining of large volumes of information from many sources covering the domains of chemistry, biology and pharmacology amongst others. This has led to a proliferation of databases and data sources relevant to drug discovery. This paper provides a review of the publicly-available large-scale databases relevant to drug discovery, describes the kinds of data mining approaches that can be applied to them and discusses recent work in integrative data mining that looks for associations that pan multiple sources, including the use of Semantic Web techniques. The future of mining large data sets for drug discovery requires intelligent, semantic aggregation of information from all of the data sources described in this review, along with the application of advanced methods such as intelligent agents and inference engines in client applications.

  10. Physics Mining of Multi-Source Data Sets

    NASA Technical Reports Server (NTRS)

    Helly, John; Karimabadi, Homa; Sipes, Tamara

    2012-01-01

    Powerful new parallel data mining algorithms can produce diagnostic and prognostic numerical models and analyses from observational data. These techniques yield higher-resolution measures than ever before of environmental parameters by fusing synoptic imagery and time-series measurements. These techniques are general and relevant to observational data, including raster, vector, and scalar, and can be applied in all Earth- and environmental science domains. Because they can be highly automated and are parallel, they scale to large spatial domains and are well suited to change and gap detection. This makes it possible to analyze spatial and temporal gaps in information, and facilitates within-mission replanning to optimize the allocation of observational resources. The basis of the innovation is the extension of a recently developed set of algorithms packaged into MineTool to multi-variate time-series data. MineTool is unique in that it automates the various steps of the data mining process, thus making it amenable to autonomous analysis of large data sets. Unlike techniques such as Artificial Neural Nets, which yield a blackbox solution, MineTool's outcome is always an analytical model in parametric form that expresses the output in terms of the input variables. This has the advantage that the derived equation can then be used to gain insight into the physical relevance and relative importance of the parameters and coefficients in the model. This is referred to as physics-mining of data. The capabilities of MineTool are extended to include both supervised and unsupervised algorithms, handle multi-type data sets, and parallelize it.

  11. Standard setting with dichotomous and constructed response items: some Rasch model approaches.

    PubMed

    MacCann, Robert G

    2009-01-01

    Using real data comprising responses to both dichotomously scored and constructed response items, this paper shows how Rasch modeling may be used to facilitate standard-setting. The modeling uses Andrich's Extended Logistic Model, which is incorporated into the RUMM software package. After a review of the fundamental equations of the model, an application to Bookmark standard setting is given, showing how to calculate the bookmark difficulty location (BDL) for both dichomotous items and tests containing a mixture of item types. An example showing how the bookmark is set is also discussed. The Rasch model is then applied in various ways to the Angoff standard-setting methods. In the first Angoff approach, the judges' item ratings are compared to Rasch model expected scores, allowing the judges to find items where their ratings differ significantly from the Rasch model values. In the second Angoff approach, the distribution of item ratings are converted to a distribution of possible cutscores, from which a final cutscore may be selected. In the third Angoff approach, the Rasch model provides a comprehensive information set to the judges. For every total score on the test, the model provides a column of item ratings (expected scores) for the ability associated with the total score. The judges consider each column of item ratings as a whole and select the column that best fits the expected pattern of responses of a marginal candidate. The total score corresponding to the selected column is then the performance band cutscore.

  12. Development of a multilevel health and safety climate survey tool within a mining setting.

    PubMed

    Parker, Anthony W; Tones, Megan J; Ritchie, Gabrielle E

    2017-09-01

    This study aimed to design, implement and evaluate the reliability and validity of a multifactorial and multilevel health and safety climate survey (HSCS) tool with utility in the Australian mining setting. An 84-item questionnaire was developed and pilot tested on a sample of 302 Australian miners across two open cut sites. A 67-item, 10 factor solution was obtained via exploratory factor analysis (EFA) representing prioritization and attitudes to health and safety across multiple domains and organizational levels. Each factor demonstrated a high level of internal reliability, and a series of ANOVAs determined a high level of consistency in responses across the workforce, and generally irrespective of age, experience or job category. Participants tended to hold favorable views of occupational health and safety (OH&S) climate at the management, supervisor, workgroup and individual level. The survey tool demonstrated reliability and validity for use within an open cut Australian mining setting and supports a multilevel, industry specific approach to OH&S climate. Findings suggested a need for mining companies to maintain high OH&S standards to minimize risks to employee health and safety. Future research is required to determine the ability of this measure to predict OH&S outcomes and its utility within other mine settings. As this tool integrates health and safety, it may have benefits for assessment, monitoring and evaluation in the industry, and improving the understanding of how health and safety climate interact at multiple levels to influence OH&S outcomes. Copyright © 2017 National Safety Council and Elsevier Ltd. All rights reserved.

  13. Equating Multiple Tests via an IRT Linking Design: Utilizing a Single Set of Anchor Items with Fixed Common Item Parameters during the Calibration Process.

    ERIC Educational Resources Information Center

    Li, Yuan H.; Griffith, William D.; Tam, Hak P.

    This study explores the relative merits of a potentially useful item response theory (IRT) linking design: using a single set of anchor items with fixed common item parameters (FCIP) during the calibration process. An empirical study was conducted to investigate the appropriateness of this linking design using 6 groups of students taking 6 forms…

  14. Using Set Covering with Item Sampling to Analyze the Infeasibility of Linear Programming Test Assembly Models

    ERIC Educational Resources Information Center

    Huitzing, Hiddo A.

    2004-01-01

    This article shows how set covering with item sampling (SCIS) methods can be used in the analysis and preanalysis of linear programming models for test assembly (LPTA). LPTA models can construct tests, fulfilling a set of constraints set by the test assembler. Sometimes, no solution to the LPTA model exists. The model is then said to be…

  15. Writing context-dependent item sets that reflect critical thinking learning outcomes.

    PubMed

    Su, Whei Ming

    2007-01-01

    Faculty in a medical-surgical nursing course developed learning outcomes to foster critical thinking skills involved in clinical reasoning. To facilitate students' abilities to make effective clinical judgments, direct teaching of thinking skills was incorporated into nursing content instruction. This instructional design challenged faculty to develop test items that would permit direct observation of students' thought processes. The author describes how context-dependent item sets were developed to meet this evaluation need.

  16. Using Data Mining to Predict K-12 Students' Performance on Large-Scale Assessment Items Related to Energy

    ERIC Educational Resources Information Center

    Liu, Xiufeng; Ruiz, Miguel E.

    2008-01-01

    This article reports a study on using data mining to predict K-12 students' competence levels on test items related to energy. Data sources are the 1995 Third International Mathematics and Science Study (TIMSS), 1999 TIMSS-Repeat, 2003 Trend in International Mathematics and Science Study (TIMSS), and the National Assessment of Educational…

  17. Using Data Mining to Predict K-12 Students' Performance on Large-Scale Assessment Items Related to Energy

    ERIC Educational Resources Information Center

    Liu, Xiufeng; Ruiz, Miguel E.

    2008-01-01

    This article reports a study on using data mining to predict K-12 students' competence levels on test items related to energy. Data sources are the 1995 Third International Mathematics and Science Study (TIMSS), 1999 TIMSS-Repeat, 2003 Trend in International Mathematics and Science Study (TIMSS), and the National Assessment of Educational…

  18. The Consumer Assessment of Healthcare Providers and Systems (CAHPS®) Cultural Competence (CC) Item Set

    PubMed Central

    Weech-Maldonado, Robert; Carle, Adam; Weidmer, Beverly; Hurtado, Margarita; Ngo-Metzger, Quyen; Hays, Ron D.

    2013-01-01

    Background There is a need for reliable and valid measures of cultural competence from the patient’s perspective. Objective This paper evaluates the reliability and validity of the Consumer Assessments of Healthcare Providers and Systems (CAHPS®) Cultural Competence (CC) item set. Research Design Using 2008 survey data, we assessed the internal consistency of the CAHPS CC scales using Cronbach alphas, and examined the validity of the measures using exploratory and confirmatory factor analysis, multitrait scaling analysis, and regression analysis. Subjects A random stratified sample (based on race/ethnicity and language) of 991 enrollees, less than 65 years old, from two Medicaid managed care plans in California and New York. Measures CAHPS CC item set after excluding screener items and ratings. Results Confirmatory factor analysis (CFI= 0.98; TLI= 0.98; RMSEA= 0.06) provided support for a seven-factor structure: Doctor Communication-Positive Behaviors; Doctor Communication-Negative Behaviors; Doctor Communication-Health Promotion; Doctor Communication-Alternative Medicine; Shared Decision Making; Equitable Treatment; and Trust. Item--total correlations (corrected for item overlap) for the 7 scales exceeded 0.40. Exploratory factor analysis showed support for one additional factor: Access to Interpreter Services. Internal consistency reliability estimates ranged from 0.58 (Alternative Medicine) to 0.92 (Positive Behaviors), and was 0.70 or higher for four of the eight composites. All composites were positively and significantly associated with the overall doctor rating. Conclusions The CAHPS CC 26-item set demonstrates adequate measurement properties, and can be used as a supplemental item set to the CAHPS Clinician and Group Surveys in assessing culturally competent care from the patient’s perspective. PMID:22895226

  19. On Item Mappings and Statistical Rules for Selecting Binary Items for Criterion-Referenced Interpretation and Bookmark Standard Settings.

    ERIC Educational Resources Information Center

    Huyhn, Huynh

    Item mappings are widely used in educational assessment for applications such as test administration (through test form assembly and computer assisted testing) and for criterion-referenced (CR) interpretation of test scores or scale anchoring. Item mappings are also used to construct ordered item booklets in the CTB/McGraw Hill Bookmark standard…

  20. Optimal Assembly of Tests with Item Sets. Research Report 98-12.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    Six methods for assembling tests from a pool with an item-set structure are presented. All methods are computational and based on the technique of mixed integer programming. The methods are evaluated using such criteria as the feasibility of their linear programming problems and their expected solution times. The methods are illustrated for two…

  1. Method using a density field for locating related items for data mining

    DOEpatents

    Wylie, Brian N.

    2002-01-01

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method makes use of numeric values as a measure of similarity between each pairing of items. The items are given initial coordinates in the space. An energy is then determined for each item from the item's distance and similarity to other items, and from the density of items assigned coordinates near the item. The distance and similarity component can act to draw items with high similarities close together, while the density component can act to force all items apart. If a terminal condition is not yet reached, then new coordinates can be determined for one or more items, and the energy determination repeated. The iteration can terminate, for example, when the total energy reaches a threshold, when each item's energy is below a threshold, after a certain amount of time or iterations.

  2. The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

    ERIC Educational Resources Information Center

    Öztürk-Gübes, Nese; Kelecioglu, Hülya

    2016-01-01

    The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

  3. Conversion of Proportion-Correct Standard-Setting Judgments to Cutoff Scores on the Item Response Theory Theta Scale

    ERIC Educational Resources Information Center

    Hurtz, Gregory M.; Jones, J. Patrick; Jones, Christian N.

    2008-01-01

    This study compares the efficacy of different strategies for translating item-level, proportion-correct standard-setting judgments into a theta-metric test cutoff score for use with item response theory (IRT) scoring, using Monte Carlo methods. Simulated Angoff-type ratings, consisting of 1,000 independent 75 Item x13 Rater matrices, were…

  4. A Comparison of the Paper Selection Method and the Contrasting Groups Method for Setting Standards on Constructed-Response Items.

    ERIC Educational Resources Information Center

    Webb, Melvin W., II; Miller, Eva R.

    As constructed-response items become an integral part of educational assessments, setting student performance standards on constructed-response items has become an important issue. Two standard-setting methods, one used for setting standards on the National Assessment of Educational Progress (NAEP) in reading in grade 8 and the other used to set…

  5. Dynamic triggering of microseismicity in a mine setting

    NASA Astrophysics Data System (ADS)

    Castellanos, Fernando; Van der Baan, Mirko

    2015-08-01

    We examine spatio-temporal patterns of microseismicity recorded during one month in an underground mine by addressing three key questions: (1) where does the seismicity occur? (2) Why does it occur in these locations? and (3) what triggers it? To obtain accurate locations, we perform a multiplet analysis and use a modified version of the double-difference (DD) relocation method. This approach leads to highly accurate relative event locations and requires groups of multiplets only. Most of the 281 relocated events are close to the main shaft and tunnels; thus we postulate seismicity is facilitated by stresses associated with the potential for subsidence in addition to the hoop stresses acting on the two vertical shafts. Most events occurred during certain hours of the day and there is a 68 per cent correlation with reported rock removal; therefore, it is likely they were triggered by static and dynamic stress perturbations caused by the transportation of debris along tunnels instead of our initial guess that blasting was the principal causative mechanism. Given that seismicity is present around the main shaft but absent close to the second one, we conclude that for seismicity to occur both a favourable stress state and additional external perturbing forces must exist, thus leading to dynamic event triggering in an initially stable stress situation. This analysis provides more insight into anthropogenic processes that might trigger seismicity, thereby facilitating identification of hazardous and potential damage areas in mine settings.

  6. Setting priorities for abandoned-mine land research. Final report

    SciTech Connect

    Not Available

    1987-01-01

    In FY 1987, Congress transferred the abandoned mine land (AML) research program and $1.9 million in funding from the Office of Surface Mining (OSM) to the Bureau of Mines (BOM). Subsequently the BOM asked National Research Council (NRC) to recommend research priorities and criteria for evaluating AML reclamation research proposals. The committee identified the highest-priority research problems which are: water quality; subsidence; mine waste; and revegetation, are ranked as the top four research priorities in all regions.

  7. Discovering Psychological Principles by Mining Naturally Occurring Data Sets.

    PubMed

    Goldstone, Robert L; Lupyan, Gary

    2016-07-01

    The very expertise with which psychologists wield their tools for achieving laboratory control may have had the unwelcome effect of blinding psychologists to the possibilities of discovering principles of behavior without conducting experiments. When creatively interrogated, a diverse range of large, real-world data sets provides powerful diagnostic tools for revealing principles of human judgment, perception, categorization, decision-making, language use, inference, problem solving, and representation. Examples of these data sets include patterns of website links, dictionaries, logs of group interactions, collections of images and image tags, text corpora, history of financial transactions, trends in twitter tag usage and propagation, patents, consumer product sales, performance in high-stakes sporting events, dialect maps, and scientific citations. The goal of this issue is to present some exemplary case studies of mining naturally existing data sets to reveal important principles and phenomena in cognitive science, and to discuss some of the underlying issues involved with conducting traditional experiments, analyses of naturally occurring data, computational modeling, and the synthesis of all three methods.

  8. Method of locating related items in a geometric space for data mining

    DOEpatents

    Hendrickson, Bruce A.

    1999-01-01

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity.

  9. Method of locating related items in a geometric space for data mining

    DOEpatents

    Hendrickson, B.A.

    1999-07-27

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity. 12 figs.

  10. Developing operational items for the International Classification of Functioning, Disability and Health Rehabilitation Set: the experience from China.

    PubMed

    Gao, Yan; Yan, Tiebin; You, Liming; Li, Kun

    2017-09-27

    The International Classification of Functioning, Disability and Health (ICF) Rehabilitation Set is proposed as a tool to measure functioning among a clinical population in Chinese rehabilitation services. It needs to be applied in a transparent and reliable way and considered from a measurement perspective. The aim of this study was to develop items operationalizing the ICF Rehabilitation Set to enhance the standardized and unified assessment in varied clinical settings. A list of proposed operational items was generated by (i) linking the items of four standard assessment instruments; (ii) literature review; and (iii) self-development. The candidate operational items were then evaluated by Chinese experts in rehabilitation medicine. One hundred and sixty-six operational items were generated using the three approaches and 54 were evaluated in the formal expert survey. Finally, 30 experts validated nine items from the assessment instruments, two found in the literature and 19 self-developed items for operationalizing the ICF Rehabilitation Set. The 30 operational items validated could potentially facilitate the implementation of the ICF Rehabilitation Set in clinical settings and provide a reference point for future research.

  11. Functional Independence Measure - Minimum Data Set Motor Item Bank: Short Forms Development and Precision Comparison in Veterans.

    PubMed

    Li, Chih-Ying; Romero, Sergio; Simpson, Annie N; Bonilha, Heather S; Simpson, Kit N; Hong, Ickpyo; Velozo, Craig A

    2017-07-26

    To improve the practical use of the short forms (SFs) developed from the item bank, we compared the measurement precision of the 4- and 8-item SFs generated from a motor item bank composed of the Functional Independence Measure (FIM™) and the Minimum Data Set (MDS). The FIM-MDS motor item bank allowed scores generated from different instruments to be co-calibrated. The 4- and 8-item SFs were developed based on Rasch analysis procedures. This paper compared person strata, ceiling/floor effects, test standard error (SE) plots for each administration form and examined 95% confidence interval (CI) error bands of anchored person measures with the corresponding SFs. We used 0.3 SE as a criterion to reflect a reliability level of 0.90. Veterans' inpatient rehabilitation facilities and community living centers. 2500 Veterans who had both FIM and the MDS data within 6 days during years 2008 through 2010. NA MAIN OUTCOME MEASURE(S): 4- and 8-item SFs of FIM, MDS and FIM-MDS motor item bank. Six SFs were generated with 4- and 8-items across a range of difficulty levels from the FIM-MDS motor item bank. The three 8-item SFs all had higher correlations with the item bank (r=0.82∼0.95), higher person strata and less test error than the corresponding 4-item SFs (r=0.80∼ 0.90). The three 4-item SFs did not meet the criteria of SE less than 0.3 for any theta values. 8-item short forms could improve clinical use of item bank composed of existing instruments across the continuum of care in Veterans. We also found number of items, not test specificity, determines the precision of the instrument. Copyright © 2017. Published by Elsevier Inc.

  12. Effects of Individual Optimization in Setting the Boundaries of Dichotomous Items on the Accuracy of Estimation

    ERIC Educational Resources Information Center

    Samejima, Fumiko

    1977-01-01

    The accuracy of estimation of the subjects' latent ability maintained by tailoring for each testee the order of item presentation and the border of item dichotomization was compared to the information provided by the original graded test items. (RC)

  13. Large-Scale Constraint-Based Pattern Mining

    ERIC Educational Resources Information Center

    Zhu, Feida

    2009-01-01

    We studied the problem of constraint-based pattern mining for three different data formats, item-set, sequence and graph, and focused on mining patterns of large sizes. Colossal patterns in each data formats are studied to discover pruning properties that are useful for direct mining of these patterns. For item-set data, we observed robustness of…

  14. Large-Scale Constraint-Based Pattern Mining

    ERIC Educational Resources Information Center

    Zhu, Feida

    2009-01-01

    We studied the problem of constraint-based pattern mining for three different data formats, item-set, sequence and graph, and focused on mining patterns of large sizes. Colossal patterns in each data formats are studied to discover pruning properties that are useful for direct mining of these patterns. For item-set data, we observed robustness of…

  15. Optimizing the ULV-VFQ for Clinical Use Through Item Set Reduction: Psychometric Properties and Trade-Offs

    PubMed Central

    Dagnelie, Gislin; Jeter, Pamela E.; Adeyemo, Olukemi

    2017-01-01

    Purpose We examine the dimensionality of the 150-item visual functioning questionnaire for individuals with ultralow vision (ULV-VFQ) and develop representative abbreviated versions, facilitating clinical use, while retaining compatibility with a 17-item performance assessment. Methods Subsets with 50 and 23 items covering the full difficulty range were selected, with evenly spaced item measures (IMs) and good representation of visual aspects and functional domains. Person measures (PMs) for the anchored subsets were derived through Rasch analysis of data from 80 respondents. Results Fit statistics for the reduced item sets were similar to those for the full set, with reliabilities at or above 95%. Mean PMs in the reduced sets were within 0.8 standard errors (SEs) of those in the full set. SEs of the PMs increased from the SE for 150 items, roughly in inverse proportion with the square root of the set size. Unexplained variance levels (24%–27%) and variance of the first unexplained factor (3.3%–3.9%) were close to those (30% and 2.6%) for 150 items. Differential item functions for omitted items were negligible. Aspects and domains are adequately represented in the reduced sets. Conclusions Self-reported visual ability can be measured accurately using appropriately chosen anchored subsets of the ULV-VFQ. Functional ability of individuals with ULV is characterized adequately by a single dimension. Translational Relevance The ULV-VFQ50 and ULV-VFQ23, using anchored IMs from the 150-item ULV-VFQ, provide an efficient and reliable self-report assessment of visual ability in individuals whose visual impairment is too severe for assessment with VFQs currently in use. PMID:28573076

  16. A Clarification on the Response Probability Criterion RP67 for Standard Settings Based on Bookmark and Item Mapping

    ERIC Educational Resources Information Center

    Huynh, Huynh

    2006-01-01

    By analyzing the Fisher information allotted to the correct response of a Rasch binary item, Huynh (1994) established the response probability criterion 0.67 (RP67) for standard settings based on bookmarks and item mapping. The purpose of this note is to help clarify the conceptual and psychometric framework of the RP criterion.

  17. Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests

    ERIC Educational Resources Information Center

    van der Linden, Wim J.; Ariel, Adelaide; Veldkamp, Bernard P.

    2006-01-01

    Test-item writing efforts typically results in item pools with an undesirable correlational structure between the content attributes of the items and their statistical information. If such pools are used in computerized adaptive testing (CAT), the algorithm may be forced to select items with less than optimal information, that violate the content…

  18. Bayesian Modal Estimation of the Four-Parameter Item Response Model in Real, Realistic, and Idealized Data Sets.

    PubMed

    Waller, Niels G; Feuerstahler, Leah

    2017-01-01

    In this study, we explored item and person parameter recovery of the four-parameter model (4PM) in over 24,000 real, realistic, and idealized data sets. In the first analyses, we fit the 4PM and three alternative models to data from three Minnesota Multiphasic Personality Inventory-Adolescent form factor scales using Bayesian modal estimation (BME). Our results indicated that the 4PM fits these scales better than simpler item Response Theory (IRT) models. Next, using the parameter estimates from these real data analyses, we estimated 4PM item parameters in 6,000 realistic data sets to establish minimum sample size requirements for accurate item and person parameter recovery. Using a factorial design that crossed discrete levels of item parameters, sample size, and test length, we also fit the 4PM to an additional 18,000 idealized data sets to extend our parameter recovery findings. Our combined results demonstrated that 4PM item parameters and parameter functions (e.g., item response functions) can be accurately estimated using BME in moderate to large samples (N ⩾ 5, 000) and person parameters can be accurately estimated in smaller samples (N ⩾ 1, 000). In the supplemental files, we report annotated [Formula: see text] code that shows how to estimate 4PM item and person parameters in [Formula: see text] (Chalmers, 2012 ).

  19. Data mining in big data sets of spectroscopic surveys

    NASA Astrophysics Data System (ADS)

    Luo, A.-Li

    2015-08-01

    More and more spectroscopic surveys provide many big datasets, in which various rare objects could be found. In this presentation, we suggested two different data mining approaches based on spectral feature by detection and machine learning way respectively. The most difficult thing of data mining is try to understand the result, and most of found outliers were finally identified as bad data. Candidates need follow up observations or multi waveband data to be confirmed. Some successful examples of newly discovered rare objects in SDSS and LAMOST data release are presented in this talk.

  20. Identification of Differential Item Functioning in Multiple-Group Settings: A Multivariate Outlier Detection Approach

    ERIC Educational Resources Information Center

    Magis, David; De Boeck, Paul

    2011-01-01

    We focus on the identification of differential item functioning (DIF) when more than two groups of examinees are considered. We propose to consider items as elements of a multivariate space, where DIF items are outlying elements. Following this approach, the situation of multiple groups is a quite natural case. A robust statistics technique is…

  1. Identification of Differential Item Functioning in Multiple-Group Settings: A Multivariate Outlier Detection Approach

    ERIC Educational Resources Information Center

    Magis, David; De Boeck, Paul

    2011-01-01

    We focus on the identification of differential item functioning (DIF) when more than two groups of examinees are considered. We propose to consider items as elements of a multivariate space, where DIF items are outlying elements. Following this approach, the situation of multiple groups is a quite natural case. A robust statistics technique is…

  2. Item Response Theory-Based Approaches for Computing Minimum Passing Scores from an Angoff-Based Standard-Setting Study

    ERIC Educational Resources Information Center

    Ferdous, Abdullah A.; Plake, Barbara S.

    2008-01-01

    Even when the scoring of an examination is based on item response theory (IRT), standard-setting methods seldom use this information directly when determining the minimum passing score (MPS) for an examination from an Angoff-based standard-setting study. Often, when IRT scoring is used, the MPS value for a test is converted to an IRT-based theta…

  3. Metals and metalloids in nestling tree swallows and their dietary items near oilsands mine operations in Northern Alberta.

    PubMed

    Godwin, Christine M; Smits, Judit E G; Barclay, Robert M R

    2016-08-15

    Tree swallows (Tachycineta bicolor) nesting near oilsands development in northern Alberta are potentially exposed to elevated levels of metals. The objective of this study was to determine whether levels of metals and metalloid elements in dietary items and tissues of nestling tree swallows inhabiting areas near oilsands mine operations were higher compared to those of reference sites. We hypothesized that if there was increased, industry-related exposure to metals, it would be via the diet. We identified the invertebrate prey in the stomach contents of nestlings. We also collected invertebrates using Malaise traps near nest boxes, and analyzed those taxa found in the nestling diet to understand potential variability in metal exposure. For most elements, we found no significant differences in concentrations in the liver, kidney, or stomach contents between sites near to and far from oilsands operations. Concentrations of five elements were positively correlated among tissues and stomach contents. For invertebrates collected from Malaise traps, location differences occurred in some absolute elemental concentrations, which were most often highest at reference sites away from mining operations. We found no evidence that nestling tree swallows accumulated metals approaching toxic levels. Tree swallows consumed relatively high quantities of terrestrial insects, possibly limiting exposure to water borne, food-web-related contaminants. We suggest that annual variability associated with elemental exposure and dietary levels of elements be considered when interpreting concentrations in bird tissues. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  4. Discrete quantity judgments in the great apes (Pan paniscus, Pan troglodytes, Gorilla gorilla, Pongo pygmaeus): the effect of presenting whole sets versus item-by-item.

    PubMed

    Hanus, Daniel; Call, Josep

    2007-08-01

    The authors examined quantity-based judgments for up to 10 items for simultaneous and sequential whole sets as well as for sequentially dropped items in chimpanzees (Pan troglodytes), gorillas (Gorilla gorilla), bonobos (Pan paniscus), and orangutans (Pongo pygmaeus). In Experiment 1, subjects had to choose the larger of 2 quantities presented in 2 separate dishes either simultaneously or 1 dish after the other. Representatives of all species were capable of selecting the larger of 2 quantities in both conditions, even when the quantities were large and the numerical distance between them was small. In Experiment 2, subjects had to select between the same food quantities sequentially dropped into 2 opaque cups so that none of the quantities were ever viewed as a whole. The authors found some evidence (albeit weaker) that subjects were able to select the larger quantity of items. Furthermore, the authors found no performance breakdown with the inclusion of certain quantities. Instead, the ratio between quantities was the best performance predictor. The authors conclude that quantity-based judgments rely on an analogical system, not a discrete object file model or perceptual estimation mechanism, such as subitizing.

  5. Evidence for a Global Sampling Process in Extraction of Summary Statistics of Item Sizes in a Set.

    PubMed

    Tokita, Midori; Ueda, Sachiyo; Ishiguchi, Akira

    2016-01-01

    Several studies have shown that our visual system may construct a "summary statistical representation" over groups of visual objects. Although there is a general understanding that human observers can accurately represent sets of a variety of features, many questions on how summary statistics, such as an average, are computed remain unanswered. This study investigated sampling properties of visual information used by human observers to extract two types of summary statistics of item sets, average and variance. We presented three models of ideal observers to extract the summary statistics: a global sampling model without sampling noise, global sampling model with sampling noise, and limited sampling model. We compared the performance of an ideal observer of each model with that of human observers using statistical efficiency analysis. Results suggest that summary statistics of items in a set may be computed without representing individual items, which makes it possible to discard the limited sampling account. Moreover, the extraction of summary statistics may not necessarily require the representation of individual objects with focused attention when the sets of items are larger than 4.

  6. Evidence for a Global Sampling Process in Extraction of Summary Statistics of Item Sizes in a Set

    PubMed Central

    Tokita, Midori; Ueda, Sachiyo; Ishiguchi, Akira

    2016-01-01

    Several studies have shown that our visual system may construct a “summary statistical representation” over groups of visual objects. Although there is a general understanding that human observers can accurately represent sets of a variety of features, many questions on how summary statistics, such as an average, are computed remain unanswered. This study investigated sampling properties of visual information used by human observers to extract two types of summary statistics of item sets, average and variance. We presented three models of ideal observers to extract the summary statistics: a global sampling model without sampling noise, global sampling model with sampling noise, and limited sampling model. We compared the performance of an ideal observer of each model with that of human observers using statistical efficiency analysis. Results suggest that summary statistics of items in a set may be computed without representing individual items, which makes it possible to discard the limited sampling account. Moreover, the extraction of summary statistics may not necessarily require the representation of individual objects with focused attention when the sets of items are larger than 4. PMID:27242622

  7. Set of Criteria for Efficiency of the Process Forming the Answers to Multiple-Choice Test Items

    ERIC Educational Resources Information Center

    Rybanov, Alexander Aleksandrovich

    2013-01-01

    Is offered the set of criteria for assessing efficiency of the process forming the answers to multiple-choice test items. To increase accuracy of computer-assisted testing results, it is suggested to assess dynamics of the process of forming the final answer using the following factors: loss of time factor and correct choice factor. The model…

  8. Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

    ERIC Educational Resources Information Center

    Wang, Wei

    2013-01-01

    Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

  9. Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

    ERIC Educational Resources Information Center

    Wang, Wei

    2013-01-01

    Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

  10. Improving the Scalability of an Exact Approach for Frequent Item Set Hiding

    ERIC Educational Resources Information Center

    LaMacchia, Carolyn

    2013-01-01

    Technological advances have led to the generation of large databases of organizational data recognized as an information-rich, strategic asset for internal analysis and sharing with trading partners. Data mining techniques can discover patterns in large databases including relationships considered strategically relevant to the owner of the data.…

  11. Improving the Scalability of an Exact Approach for Frequent Item Set Hiding

    ERIC Educational Resources Information Center

    LaMacchia, Carolyn

    2013-01-01

    Technological advances have led to the generation of large databases of organizational data recognized as an information-rich, strategic asset for internal analysis and sharing with trading partners. Data mining techniques can discover patterns in large databases including relationships considered strategically relevant to the owner of the data.…

  12. A gene pattern mining algorithm using interchangeable gene sets for prokaryotes.

    PubMed

    Hu, Meng; Choi, Kwangmin; Su, Wei; Kim, Sun; Yang, Jiong

    2008-02-26

    Mining gene patterns that are common to multiple genomes is an important biological problem, which can lead us to novel biological insights. When family classification of genes is available, this problem is similar to the pattern mining problem in the data mining community. However, when family classification information is not available, mining gene patterns is a challenging problem. There are several well developed algorithms for predicting gene patterns in a pair of genomes, such as FISH and DAGchainer. These algorithms use the optimization problem formulation which is solved using the dynamic programming technique. Unfortunately, extending these algorithms to multiple genome cases is not trivial due to the rapid increase in time and space complexity. In this paper, we propose a novel algorithm for mining gene patterns in more than two prokaryote genomes using interchangeable sets. The basic idea is to extend the pattern mining technique from the data mining community to handle the situation where family classification information is not available using interchangeable sets. In an experiment with four newly sequenced genomes (where the gene annotation is unavailable), we show that the gene pattern can capture important biological information. To examine the effectiveness of gene patterns further, we propose an ortholog prediction method based on our gene pattern mining algorithm and compare our method to the bi-directional best hit (BBH) technique in terms of COG orthologous gene classification information. The experiment show that our algorithm achieves a 3% increase in recall compared to BBH without sacrificing the precision of ortholog detection. The discovered gene patterns can be used for the detecting of ortholog and genes that collaborate for a common biological function.

  13. InfVis--platform-independent visual data mining of multidimensional chemical data sets.

    PubMed

    Oellien, Frank; Ihlenfeldt, Wolf-Dietrich; Gasteiger, Johann

    2005-01-01

    The tremendous increase of chemical data sets, both in size and number, and the simultaneous desire to speed up the drug discovery process has resulted in an increasing need for a new generation of computational tools that assist in the extraction of information from data and allow for rapid and in-depth data mining. During recent years, visual data mining has become an important tool within the life sciences and drug discovery area with the potential to help avoiding data analysis from turning into a bottleneck. In this paper, we present InfVis, a platform-independent visual data mining tool for chemists, who usually only have little experience with classical data mining tools, for the visualization, exploration, and analysis of multivariate data sets. InfVis represents multidimensional data sets by using intuitive 3D glyph information visualization techniques. Interactive and dynamic tools such as dynamic query devices allow real-time, interactive data set manipulations and support the user in the identification of relationships and patterns. InfVis has been implemented in Java and Java3D and can be run on a broad range of platforms and operating systems. It can also be embedded as an applet in Web-based interfaces. We will present in this paper examples detailing the analysis of a reaction database that demonstrate how InfVis assists chemists in identifying and extracting hidden information.

  14. Mining temporal data sets: hypoplastic left heart syndrome case study

    NASA Astrophysics Data System (ADS)

    Kusiak, Andrew; Caldarone, Christopher A.; Kelleher, Michael D.; Lamb, Fred S.; Persoon, Thomas J.; Gan, Yuan; Burns, Alex

    2003-03-01

    Hypoplastic left heart syndrome (HLHS) affects infants and is uniformly fatal without surgery. Post-surgery mortality rates are highly variable and dependent on postoperative management. The high mortality after the first stage surgery usually occurs within the first few days after procedure. Typically, the deaths are attributed to the unstable balance between the pulmonary and systemic circulations. An experienced team of physicians, nurses, and therapists is required to successfully manage the infant. However, even the most experienced teams report significant mortality due to the extremely complex relationships among physiologic parameters in a given patient. A data acquisition system was developed for the simultaneous collection of 73 physiologic, laboratory, and nurse-assessed variables. Data records were created at intervals of 30 seconds. An expert-validated wellness score was computed for each data record. A training data set consisting of over 5000 data records from multiple patients was collected. Preliminary results demonstratd that the knowledge discovery approach was over 94.57% accurate in predicting the "wellness score" of an infant. The discovered knowledge can improve care of complex patients by development of an intelligent simulator that can be used to support decisions.

  15. Diesel exhaust and coal mine dust: lung cancer risk in occupational settings

    SciTech Connect

    Hoffmann, B.; Jockel, K.H.

    2006-09-15

    Conflicting evidence on the carcinogenicity of diesel exhaust (DE) and coal mine dust in occupational settings exist. Exposure measurement in most studies is inferred on the basis of job classifications and may lead to misclassification. Confounding behavioral factors (i.e., smoking) and occupational risk factors (exposure to asbestos, arsenic, radon) need to be considered. We evaluated the epidemiological evidence and current findings of the carcinogenicity of DE and coal mine dust in occupational settings. Pertaining literature was identified through Medline search and recent review articles. Strengths and limitations of recent approaches are discussed. Many epidemiological studies have addressed the question of carcinogenicity in workers exposed to DE, and most showed a low-to-medium increase in the risk of bronchial carcinoma. The pooled relative risk (RR) estimates lie between 1.33 and 1.47, and a consistent rise in risk across various job categories and study designs point to a causal relationship. Data on the carcinogenicity of coal mine dust are less consistent and the potential for confounding by unmeasured risk factors (arsenic, radon, DE) are higher. While silica as one of its components has been evaluated as carcinogenic, there is inadequate evidence for the carcinogenicity of pure coal dust according to the International Agency for Research on Cancer (IARC). There is sufficient evidence for a causal relationship between DE and lung cancer in occupational settings. The evidence for coal mine dust is less convincing, but individual studies show an increase in risk of lung cancer in exposed workers. 4 refs.

  16. Assessing Dimensionality of a Set of Items--Comparison of Different Approaches.

    ERIC Educational Resources Information Center

    Nandakumar, Ratna

    The performance of the following four methodologies for assessing unidimensionality was examined: (1) DIMTEST; (2) the approach of P. W. Holland and P. R. Rosenbaum; (3) linear factor analysis; and (4) non-linear factor analysis. Each method is examined and compared with other methods using simulated data sets and real data sets. Seven data sets,…

  17. Clinical and pharmacogenomic data mining: 4. The FANO program and command set as an example of tools for biomedical discovery and evidence based medicine.

    PubMed

    Robson, Barry

    2008-09-01

    The culmination of methodology explored and developed in the preceding three papers is described in terms of the FANO program (also known as CliniMiner) and specifically in terms of the contemporary command set for data mining. This provides a more detailed account of how strategies were implemented in applications described elsewhere, in the previous papers in the series and in a paper on the analysis of 667 000 patient records. Although it is not customary to think of a command set as the output of research, it represents the elements and strategies for data mining biomedical and clinical data with many parameters, that is, in a high dimensional space that requires skilful navigation. The intent is not to promote FANO per se, but to report its science and methodologies. Typical example rules from traditional data mining are that A and B and C associate, or IF A & B THEN C. We need much higher complexity rules for clinical data especially with inclusion of proteomics and genomics. FANO's specific goal is to be able routinely to extract from clinical record repositories and other data not only the complex rules required for biomedical research and the clinical practice of evidence based medicine, but to quantify their uncertainty, that is, their essentially probabilistic nature. The underlying information and number theoretic basis previously described is less of an issue here, being "under the hood", although the fundamental role and use of the Incomplete (generalized) Riemann Zeta Function as a general surprise measure is highlighted, along with its covariance or multivariance analogue, as it appears to be a unique and powerful feature. Another characteristic described is the very general tactic of the metadata operator ':='. It allows decomposition of diverse data types such as trees, spreadsheets, biosequences, sets of objects, amorphous data collections with repeating items, XML structures, and so forth into universally atomic data items with or without

  18. Toolbox of multiple-item measures aligning with the ICF Core Sets for children and youth with cerebral palsy.

    PubMed

    Schiariti, Verónica; Tatla, Sandy; Sauve, Karen; O'Donnell, Maureen

    2017-03-01

    Selecting appropriate measure(s) for clinical and/or research applications for children and youth with Cerebral Palsy (CP) poses many challenges. The newly developed International Classification of Functioning, Disability and Health (ICF) Core Sets for children and youth with CP serve as universal guidelines for assessment, intervention and follow-up. The aims of this study were: 1) to identify valid and reliable measures used in studies with children and youth with CP, 2) to characterize the content of each measure using the ICF Core Sets for children and youth with CP as a framework, and finally 3) to create a toolbox of psychometrically sound measures covering the content of each ICF Core Set for children and youth with CP. All clearly defined multiple-item measures used in studies with CP between 1998 and 2015 were identified. Psychometric properties were extracted when available. Construct of the measures were linked to the ICF Core Sets. Overall, 83 multiple-item measures were identified. Of these, 68 measures (80%) included reliability and validity testing. The majority of the measures were discriminative, generic and designed for school-aged children. The degree to which measures with proven psychometric properties represented the ICF Core Sets for children and youth with CP varied considerably. Finally, 25 valid and reliable measures aligned highly with the content of the ICF Core Sets, and as such, these measures are proposed as a novel ICF Core Sets-based toolbox of measures for CP. Our results will guide professionals seeking appropriate measures to meet their research and clinical needs worldwide.

  19. Developing and Exploiting a Unique Seismic Data Set from South African Gold Mines for Source Characterization and Wave Propagation

    DTIC Science & Technology

    2007-09-01

    the University of the Witwatersrand . RESEARCH ACCOMPLISHED Three complimentary data sets are being assembled in this project: (1) high...frequency in-mine seismic data from three mines along the northwestern edge of the Witwatersrand basin, (2) seismograms from mine events recorded at...Seismicity The Witwatersrand basin is part of a granite-greenstone complex constituting the basement of the Kaapvaal craton. The basement evolved

  20. Knowledge mining from clinical datasets using rough sets and backpropagation neural network.

    PubMed

    Nahato, Kindie Biredagn; Harichandran, Khanna Nehemiah; Arputharaj, Kannan

    2015-01-01

    The availability of clinical datasets and knowledge mining methodologies encourages the researchers to pursue research in extracting knowledge from clinical datasets. Different data mining techniques have been used for mining rules, and mathematical models have been developed to assist the clinician in decision making. The objective of this research is to build a classifier that will predict the presence or absence of a disease by learning from the minimal set of attributes that has been extracted from the clinical dataset. In this work rough set indiscernibility relation method with backpropagation neural network (RS-BPNN) is used. This work has two stages. The first stage is handling of missing values to obtain a smooth data set and selection of appropriate attributes from the clinical dataset by indiscernibility relation method. The second stage is classification using backpropagation neural network on the selected reducts of the dataset. The classifier has been tested with hepatitis, Wisconsin breast cancer, and Statlog heart disease datasets obtained from the University of California at Irvine (UCI) machine learning repository. The accuracy obtained from the proposed method is 97.3%, 98.6%, and 90.4% for hepatitis, breast cancer, and heart disease, respectively. The proposed system provides an effective classification model for clinical datasets.

  1. Knowledge Mining from Clinical Datasets Using Rough Sets and Backpropagation Neural Network

    PubMed Central

    Nahato, Kindie Biredagn; Harichandran, Khanna Nehemiah; Arputharaj, Kannan

    2015-01-01

    The availability of clinical datasets and knowledge mining methodologies encourages the researchers to pursue research in extracting knowledge from clinical datasets. Different data mining techniques have been used for mining rules, and mathematical models have been developed to assist the clinician in decision making. The objective of this research is to build a classifier that will predict the presence or absence of a disease by learning from the minimal set of attributes that has been extracted from the clinical dataset. In this work rough set indiscernibility relation method with backpropagation neural network (RS-BPNN) is used. This work has two stages. The first stage is handling of missing values to obtain a smooth data set and selection of appropriate attributes from the clinical dataset by indiscernibility relation method. The second stage is classification using backpropagation neural network on the selected reducts of the dataset. The classifier has been tested with hepatitis, Wisconsin breast cancer, and Statlog heart disease datasets obtained from the University of California at Irvine (UCI) machine learning repository. The accuracy obtained from the proposed method is 97.3%, 98.6%, and 90.4% for hepatitis, breast cancer, and heart disease, respectively. The proposed system provides an effective classification model for clinical datasets. PMID:25821508

  2. Keeping it simple: studying grammatical encoding with lexically reduced item sets.

    PubMed

    Veenstra, Alma; Acheson, Daniel J; Meyer, Antje S

    2014-01-01

    Compared to the large body of work on lexical access, little research has been done on grammatical encoding in language production. An exception is the generation of subject-verb agreement. Here, two key findings have been reported: (1) speakers make more agreement errors when the head and local noun of a phrase mismatch in number than when they match [e.g., the key to the cabinet(s)]; and (2) this attraction effect is asymmetric, with stronger attraction for singular than for plural head nouns. Although these findings are robust, the cognitive processes leading to agreement errors and their significance for the generation of correct agreement are not fully understood. We propose that future studies of agreement, and grammatical encoding in general, may benefit from using paradigms that tightly control the variability of the lexical content of the material. We report two experiments illustrating this approach. In both of them, the experimental items featured combinations of four nouns, four color adjectives, and two prepositions. In Experiment 1, native speakers of Dutch described pictures in sentences such as the circle next to the stars is blue. In Experiment 2, they carried out a forced-choice task, where they read subject noun phrases (e.g., the circle next to the stars) and selected the correct verb-phrase (is blue or are blue) with a button press. Both experiments showed an attraction effect, with more errors after subject phrases with mismatching, compared to matching head and local nouns. This effect was stronger for singular than plural heads, replicating the attraction asymmetry. In contrast, the response times recorded in Experiment 2 showed similar attraction effects for singular and plural head nouns. These results demonstrate that critical agreement phenomena can be elicited reliably in lexically reduced contexts. We discuss the theoretical implications of the findings and the potential and limitations of studies using lexically simple materials.

  3. Evaluating the Bookmark Standard Setting Method: The Impact of Random Item Ordering

    ERIC Educational Resources Information Center

    Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack

    2011-01-01

    Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…

  4. The Comparative Translucency of Initial Lexical Items Represented in Five Graphic Symbol Systems and Sets.

    ERIC Educational Resources Information Center

    Bloomberg, Karen; And Others

    1990-01-01

    The comparative translucency within and across 5 aided augmentative and alternative symbol systems for symbols representing 3 parts of speech (nouns, verbs, and modifiers) was investigated with 50 college undergraduates. Results indicated that translucency varies among systems or sets and among parts of speech. (Author/DB)

  5. Testing whether the DSM-5 personality disorder trait model can be measured with a reduced set of items: An item response theory investigation of the Personality Inventory for DSM-5.

    PubMed

    Maples, Jessica L; Carter, Nathan T; Few, Lauren R; Crego, Cristina; Gore, Whitney L; Samuel, Douglas B; Williamson, Rachel L; Lynam, Donald R; Widiger, Thomas A; Markon, Kristian E; Krueger, Robert F; Miller, Joshua D

    2015-12-01

    The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) includes an alternative model of personality disorders (PDs) in Section III, consisting in part of a pathological personality trait model. To date, the 220-item Personality Inventory for DSM-5 (PID-5; Krueger, Derringer, Markon, Watson, & Skodol, 2012) is the only extant self-report instrument explicitly developed to measure this pathological trait model. The present study used item response theory-based analyses in a large sample (n = 1,417) to investigate whether a reduced set of 100 items could be identified from the PID-5 that could measure the 25 traits and 5 domains. This reduced set of PID-5 items was then tested in a community sample of adults currently receiving psychological treatment (n = 109). Across a wide range of criterion variables including NEO PI-R domains and facets, DSM-5 Section II PD scores, and externalizing and internalizing outcomes, the correlational profiles of the original and reduced versions of the PID-5 were nearly identical (rICC = .995). These results provide strong support for the hypothesis that an abbreviated set of PID-5 items can be used to reliably, validly, and efficiently assess these personality disorder traits. The ability to assess the DSM-5 Section III traits using only 100 items has important implications in that it suggests these traits could still be measured in settings in which assessment-related resources (e.g., time, compensation) are limited. (c) 2015 APA, all rights reserved).

  6. An intelligent knowledge mining model for kidney cancer using rough set theory.

    PubMed

    Durai, M A Saleem; Acharjya, D P; Kannan, A; Iyengar, N Ch Sriman Narayana

    2012-01-01

    Medical diagnosis processes vary in the degree to which they attempt to deal with different complicating aspects of diagnosis such as relative importance of symptoms, varied symptom pattern and the relation between diseases themselves. Rough set approach has two major advantages over the other methods. First, it can handle different types of data such as categorical, numerical etc. Secondly, it does not make any assumption like probability distribution function in stochastic modeling or membership grade function in fuzzy set theory. It involves pattern recognition through logical computational rules rather than approximating them through smooth mathematical functional forms. In this paper we use rough set theory as a data mining tool to derive useful patterns and rules for kidney cancer faulty diagnosis. In particular, the historical data of twenty five research hospitals and medical college is used for validation and the results show the practical viability of the proposed approach.

  7. Chemical Topic Modeling: Exploring Molecular Data Sets Using a Common Text-Mining Approach.

    PubMed

    Schneider, Nadine; Fechner, Nikolas; Landrum, Gregory A; Stiefl, Nikolaus

    2017-08-28

    Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: more and more data are being generated, for instance, by technologies such as DNA encoded libraries, peptide libraries, text mining of large literature corpora, and new in silico enumeration methods. Handling those huge sets of molecules effectively is quite challenging and requires compromises that often come at the expense of the interpretability of the results. In order to find an intuitive and meaningful approach to organizing large molecular data sets, we adopted a probabilistic framework called "topic modeling" from the text-mining field. Here we present the first chemistry-related implementation of this method, which allows large molecule sets to be assigned to "chemical topics" and investigating the relationships between those. In this first study, we thoroughly evaluate this novel method in different experiments and discuss both its disadvantages and advantages. We show very promising results in reproducing human-assigned concepts using the approach to identify and retrieve chemical series from sets of molecules. We have also created an intuitive visualization of the chemical topics output by the algorithm. This is a huge benefit compared to other unsupervised machine-learning methods, like clustering, which are commonly used to group sets of molecules. Finally, we applied the new method to the 1.6 million molecules of the ChEMBL22 data set to test its robustness and efficiency. In about 1 h we built a 100-topic model of this large data set in which we could identify interesting topics like "proteins", "DNA", or "steroids". Along with this publication we provide our data sets and an open-source implementation of the new method (CheTo) which

  8. Use of an Item Bank to Develop Two Short-Form FAMCARE Scales to Measure Family Satisfaction With Care in the Setting of Serious Illness.

    PubMed

    Ornstein, Katherine A; Teresi, Jeanne A; Ocepek-Welikson, Katja; Ramirez, Mildred; Meier, Diane E; Morrison, R Sean; Siu, Albert L

    2015-05-01

    Family satisfaction is an important and commonly used research measure. Yet current measures of family satisfaction are lengthy and may be unnecessarily burdensome--particularly in the setting of serious illness. To use an item bank to develop short forms of the Family Satisfaction with End-of-Life Care (FAMCARE) scale, which measures family satisfaction with care. To shorten the existing 20-item FAMCARE measure, item response theory parameters from an item bank were used to select the most informative items. The psychometric properties of the new short-form scales were examined. The item bank was based on data from family members from an ethnically diverse sample of 1983 patients with advanced cancer. Evidence for the new short-form scales supported essential unidimensionality. Reliability estimates from several methods were relatively high, ranging from 0.84 for the five-item scale to 0.94 for the 10-item scale across different age, gender, education, ethnic, and relationship groups. The FAMCARE-10 and FAMCARE-5 short-form scales evidenced high reliability across sociodemographic subgroups and are potentially less burdensome and time-consuming scales for monitoring family satisfaction among seriously ill patients. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.

  9. Consistency of Angoff-Based Standard-Setting Judgments: Are Item Judgments and Passing Scores Replicable across Different Panels of Experts?

    ERIC Educational Resources Information Center

    Tannenbaum, Richard J.; Kannan, Priya

    2015-01-01

    Angoff-based standard setting is widely used, especially for high-stakes licensure assessments. Nonetheless, some critics have claimed that the judgment task is too cognitively complex for panelists, whereas others have explicitly challenged the consistency in (replicability of) standard-setting outcomes. Evidence of consistency in item judgments…

  10. Generalizability and Dependability of a Multi-Item Direct Behavior Rating Scale in a Kindergarten Classroom Setting

    ERIC Educational Resources Information Center

    Wickerd, Garry; Hulac, David

    2017-01-01

    Accurate and rapid identification of students displaying behavioral problems requires instrumentation that is user friendly and reliable. The purpose of the study was to evaluate a multi-item direct behavior rating scale called the Direct Behavior Rating-Multiple Item Scale (DBR-MIS) for disruptive behavior to determine the number of…

  11. Generalizability and Dependability of a Multi-Item Direct Behavior Rating Scale in a Kindergarten Classroom Setting

    ERIC Educational Resources Information Center

    Wickerd, Garry; Hulac, David

    2017-01-01

    Accurate and rapid identification of students displaying behavioral problems requires instrumentation that is user friendly and reliable. The purpose of the study was to evaluate a multi-item direct behavior rating scale called the Direct Behavior Rating-Multiple Item Scale (DBR-MIS) for disruptive behavior to determine the number of…

  12. Setting standards for severity of common symptoms in oncology using the PROMIS item banks and expert judgment.

    PubMed

    Cella, David; Choi, Seung; Garcia, Sofia; Cook, Karon F; Rosenbloom, Sarah; Lai, Jin-Shei; Tatum, Donna Surges; Gershon, Richard

    2014-12-01

    Although the use of patient-reported outcome measures (PROs) has increased markedly, clinical interpretation of scores remains lacking. We developed a method to identify clinical severity thresholds for pain, fatigue, depression, and anxiety in people with cancer. Using available Patient-Reported Outcomes Measurement Information System (PROMIS) item bank response data collected on 840 cancer patients, symptom vignettes across a range of symptom severity were developed and placed on index cards. Cards represented symptom severity at five-point intervals differences on the T score metric [mean = 50; standard deviation (SD) = 10]. Symptom vignettes for each symptom were anchored on these standardized scores at 0.5 SD increments across the full range of severity. Clinical experts, blind to the PROMIS score associated with each vignette, rank-ordered the vignettes by severity, then arrived at consensus regarding which two vignettes were at the upper and lower boundaries of normal and mildly symptomatic for each symptom. The procedure was repeated to identify cut scores separating mildly from moderately symptomatic, and moderately from severely symptomatic scores. Clinician severity rankings were then compared to the T scores upon which the vignettes were based. For each of the targeted PROs, the severity rankings reached by clinician consensus perfectly matched the numerical rankings of their associated T scores. Across all symptoms, the thresholds (cut scores) identified to differentiate normal from mildly symptomatic were near a T score of 50. Cut scores differentiating mildly from moderately symptomatic were at or near 60, and those separating moderately from severely symptomatic were at or near 70. The study results provide empirically generated PROMIS T score thresholds that differentiate levels of symptom severity for pain interference, fatigue, anxiety, and depression. The convergence of clinical judgment with self-reported patient severity scores supports the

  13. Dynamic Triggering of Microseismicity inferred from Spatio/Temporal Patterns in a Mine Setting

    NASA Astrophysics Data System (ADS)

    Castellanos, F.; Van der Baan, M.

    2014-12-01

    We examine spatial and temporal patterns of microseismic events in an underground mine. Our objective is to address three key questions: Where does the seismicity occur? Why does it occur in these locations but not elsewhere? And what triggers it? We take advantage of waveform similarity by performing multiplet analysis based on the double-difference technique to obtain highly accurate relative locations. Seven vertical receiver boreholes that surround the area under investigation have recorded microseismic data during a month, thus providing a good azimuthal coverage. A minimum crosscorrelation level of 85% is used to detect 21 multiplet groups, which represents 60% of the total seismicity. The largest groups are located close to the main shaft and tunnels at orebody levels, thus we postulate seismicity is facilitated by the potential of subsidence if we assume a compacting earth and hoop stresses acting on the vertical shafts. Surprisingly, most events only occur during certain hours of the day but do not relate to blasting. They correlate with scheduled operations of rock removal. Therefore, it is likely they have been triggered by the transportation of the debris along the main shaft instead of blasting, as we initially expected. Given that seismicity is present around the main shaft but absent close to the second air shaft, we conclude that for seismicity to occur both a favourable stress state must exist, as well as additional external forces, causing dynamic triggering. This analysis provides more insight into anthropogenic processes and their roles as major initiators of seismicity during dynamic stress transfer thereby facilitating identification of hazardous areas in mine settings.

  14. Genomic comparison using data mining techniques based on a possibilistic fuzzy sets model.

    PubMed

    Balzano, Walter; Del Sorbo, Maria Rosaria

    2007-04-01

    Current copiousness of genomic information stored in biological databases [Mar Albà, M., Lee, M., Pearl, D., Shepherd, F.M.G., Martin, A.J., Orengo, N., Kellam, C.A., 2001. P. VIDA: a virus database system for the organisation of virus genome open reading frames. Nuleic Acids Res. 133-136] makes ultimately feasible the proposal for an application of knowledge management aimed to discover general rules in subcellular phenomena. The goal of this work is primarily to discover relationships between genes by microarray analysis. The tools exploited come from clustering techniques and are mainly based on Knowledge Discovery in Databases (KDD) concepts [Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., 1996. From data mining to knowledge discovery in databases. AI Magazine 17(3), 37-54]. Starting from a data set, each element can be represented by a characteristic matrix, which sums up all data attributes. In this case data mining is oriented to perform a Pattern Recognition of related sequences, hidden in databases [Hand, D.J., Nicholas, A., 2005. Heard finding groups in gene expression data. J. Biomed. Biotechnol. 215-225]. Following a bottom up approach, the next refinement is to compare retrieved data to gather similar features, by dedicated clustering algorithms [Kaufman, L., Rousseeuw, P.J., 1990. Finding groups in data. An Introduction to Cluster Analysis. John Wiley & Sons, New York; Forman, G., Zhang, B., 2000. Distributed Data clustering can be efficient and exact HP. Laboratories Palo Alto HPL-2000, p. 158], driven by fuzzy logic, allowing us to perceive by intuition a common denominator for various genomic families and to anticipate likely future developments.

  15. Data mining and visualization techniques

    DOEpatents

    Wong, Pak Chung; Whitney, Paul; Thomas, Jim

    2004-03-23

    Disclosed are association rule identification and visualization methods, systems, and apparatus. An association rule in data mining is an implication of the form X.fwdarw.Y where X is a set of antecedent items and Y is the consequent item. A unique visualization technique that provides multiple antecedent, consequent, confidence, and support information is disclosed to facilitate better presentation of large quantities of complex association rules.

  16. The flexibility of context-specific control: evidence for context-driven generalization of item-specific control settings.

    PubMed

    Crump, Matthew J C; Milliken, Bruce

    2009-08-01

    In two experiments we address an ongoing debate concerning the processes driving context-driven modulations to the Stroop effect (Crump, Gong, & Milliken, 2006). In particular, we demonstrate that context-driven processes can modulate the size of the Stroop effect for frequency-unbiased item types. We also clarify the role of item frequency in producing context-driven modulations to the Stroop effect. Taken together, our results provide unambiguous support for the claim that contextual processing can impart fast and flexible control over the operation of selective attention processes during online performance.

  17. Visual data mining of coexpression data to set research priorities in cardiac development research.

    PubMed

    VanBuren, Vincent

    2012-01-01

    Over the past decade, an immense amount of biomedical data have become available in the public domain due to the development of ever-more efficient screening tools such as expression microarrays. To fully leverage this important new resource, it has become imperative to develop new methodologies for mining and visualizing data to make inferences beyond the scope of the original experiments. This need motivated the development of a new freely available web-based application called StarNet ( http://vanburenlab.medicine.tamhsc.edu/starnet2.html ). Here we describe the use of StarNet, which functions primarily as a query tool that draws correlation networks centered about a gene of interest. To support inferences and the development of new hypotheses using the resulting correlation network, StarNet queries all genes in the correlation network against a database of known interactions and displays the results in a second graph and provides a statistical test of Gene Ontology term enrichment (keyword enrichment) to provide tentative summary functional annotations for the correlation network. Finally, StarNet provides additional tools for comparing networks drawn from two different selected data sets, thus providing methods for making inferences and developing new hypotheses about differential wiring for different regulatory domains.

  18. Mining pinyin-to-character conversion rules from large-scale corpus: a rough set approach.

    PubMed

    Wang, Xiaolong; Chen, Qingcai; Yeung, Daniel S

    2004-04-01

    This paper introduces a rough set technique for solving the problem of mining Pinyin-to-character (PTC) conversion rules. It first presents a text-structuring method by constructing a language information table from a corpus for each pinyin, which it will then apply to a free-form textual corpus. Data generalization and rule extraction algorithms can then be used to eliminate redundant information and extract consistent PTC conversion rules. The design of our model also addresses a number of important issues such as the long-distance dependency problem, the storage requirements of the rule base, and the consistency of the extracted rules, while the performance of the extracted rules as well as the effects of different model parameters are evaluated experimentally. These results show that by the smoothing method, high precision conversion (0.947) and recall rates (0.84) can be achieved even for rules represented directly by pinyin rather than words. A comparison with the baseline tri-gram model also shows good complement between our method and the tri-gram language model.

  19. An Investigation of One and Three Parameter Item Response Models in a Field Setting, with Implications for Computerized Adaptive Testing

    DTIC Science & Technology

    1981-05-01

    accu’acy of estimation of their status on the trait becomes progressively more inaccurate as their ability deviates from this point. The result of...Conventional tests typically use number of pro- portion of items discrimination. Two recent developments have enabled the psychometrician to more accurately...is to accurately estimate an individual’s position on the underlying trait the test purports to mearure. Con’ventional paper-and-pencil test

  20. Translation and cross-cultural adaptation of the ASAS Health Index and ASAS Environmental Factors Item Set into European Portuguese Language.

    PubMed

    Cruz, Eduardo B; Ramiro, Sofia; Machado, Pedro; Sousa, Sandra; Aguiar, Renata; Sepriano, Alexandre; Manica, Santiago Rodrigues; Kiltz, Uta; Branco, Jaime C; Pimentel-Santos, Fernando M

    2017-01-01

    There is a lack of outcome measures to assess the impact of axial spondyloarthritis (axSpA) on health, function and quality of life. The Assessment of SpondyloArthritis International Society (ASAS) group developed the ASAS Health Index (ASAS-HI) and the ASAS Environmental Factors Item Set (ASAS-EF) to measure functioning and health across all aspects of health that are typically affected and relevant for patients with axSpA, based on the International Classification of Functioning, Disability and Health (ICF). The aim of this paper was to describe the translation and cross-cultural adaptation of both questionnaires into European Portuguese among patients with radiographic and non-radiographic axial SpA (nr-axSpA) and test the conceptual equivalence of the translated version in the Portuguese context. The ASAS-HI and ASAS-EF were firstly translated into European Portuguese and then back-translated into English, following forward-backward procedure. After the review of the Portuguese version by an expert committee, the field test with cognitive debriefing involved a sample of 10 axSpA patients with different gender, age, disease duration, and educational background. Minor difficulties arose from the translation process of the ASAS-HI. The EF Item Set offered more difficulties indicating that concepts underlying the contextual factors may be more culture-dependent. A total of 10 patients with axSpA [8 males, mean age of 41.4 (±13.7)] participated in the field test. Cognitive debriefing showed that items of the ASAS-HI and EF Item Set of the Portuguese version are clear, relevant, understandable and easy to complete. As a result of cognitive debriefing, the wording of four items had to be changed to avoid misunderstandings or unintended interpretations, and a new response option "not applicable" was added to two items of the ASAS-HI to improve appropriateness. The resulting Portuguese version of the ASAS-HI and ASAS-EF showed acceptable linguistic validity and has

  1. Shared decision making in psychiatric practice and the primary care setting is unique, as measured using a 9-item Shared Decision Making Questionnaire (SDM-Q-9).

    PubMed

    De Las Cuevas, Carlos; Peñate, Wenceslao; Perestelo-Pérez, Lilisbeth; Serrano-Aguilar, Pedro

    2013-01-01

    To measure and compare the extent to which shared a decision making (SDM) process is implemented both in psychiatric outpatient clinical encounters and in the primary care setting from the patient's perspective. A total of 1,477 patients recruited from the Canary Islands Health Service mental health and primary care departments were invited to complete the nine-item Shared Decision Making Questionnaire (SDM-Q-9) immediately after their consultation. MANCOVA, Student's t-test, and Pearson correlations were used to assess the relationship and differences between SDM-Q-9 scores in patient samples. No differences were found in SDM-Q-9 total scores between the two patient samples, but there were relevant differences when item by item analysis was applied; differences were observed according to the different steps of the SDM process. SDM is present to a very limited extent in the routine psychiatric setting compared to primary care. Patients' age, education, type of appointment, and treatment decision all play a specific role in predicting SDM. The study provides evidence that SDM is a complex process that needs to be analyzed according to its different steps. SDM patterns were different in the primary care and psychiatric outpatient care settings and reflect quite a different perspective of the decision making process.

  2. TREatment of ATopic eczema (TREAT) Registry Taskforce: protocol for an international Delphi exercise to identify a core set of domains and domain items for national atopic eczema registries.

    PubMed

    Gerbens, Louise A A; Boyce, Aaron E; Wall, Dmitri; Barbarot, Sebastien; de Booij, Richard J; Deleuran, Mette; Middelkamp-Hup, Maritza A; Roberts, Amanda; Vestergaard, Christian; Weidinger, Stephan; Apfelbacher, Christian J; Irvine, Alan D; Schmitt, Jochen; Williamson, Paula R; Spuls, Phyllis I; Flohr, Carsten

    2017-02-27

    Patients with moderate-to-severe atopic eczema (AE) often require photo- or systemic immunomodulatory therapies to induce disease remission and maintain long-term control. The current evidence to guide clinical management is small, despite the frequent and often off-label use of these treatments. Registries of patients on photo- and systemic immunomodulatory therapies could fill this gap, and the collection of a core set concerning these therapies in AE will allow direct comparisons across registries as well as data sharing and pooling. Using an eDelphi approach, the international TREatment of ATopic eczema (TREAT) Registry Taskforce aims to seek consensus between key stakeholders internationally on a core set of domains and domain items for AE patient registries with a research focus that collect data of children and adults on photo- and systemic immunomodulatory therapies. Participants from six stakeholder groups will be invited: doctors, nurses, non-clinical researchers, patients, as well as industry and regulatory body representatives. The eDelphi will comprise three sequential online rounds, requesting participants to rate the importance of each proposed domain and domain items. Participants will be able to add domains and domain items to the proposed list in round 1. A final consensus meeting will be held with representatives of each stakeholder group. Identifying a uniform core set of domains and domain items to be captured by AE patient registries will increase the utility of individual registries, and provide greater insight into the effectiveness, safety and cost-effectiveness of photo- and systemic immunomodulatory therapies to guide clinical management across dermatology centres and country borders. Not applicable. This eDelphi study was registered in the Core Outcome Measures for Effectiveness Trials (COMET) database.

  3. Handling Dynamic Weights in Weighted Frequent Pattern Mining

    NASA Astrophysics Data System (ADS)

    Ahmed, Chowdhury Farhan; Tanbeer, Syed Khairuzzaman; Jeong, Byeong-Soo; Lee, Young-Koo

    Even though weighted frequent pattern (WFP) mining is more effective than traditional frequent pattern mining because it can consider different semantic significances (weights) of items, existing WFP algorithms assume that each item has a fixed weight. But in real world scenarios, the weight (price or significance) of an item can vary with time. Reflecting these changes in item weight is necessary in several mining applications, such as retail market data analysis and web click stream analysis. In this paper, we introduce the concept of a dynamic weight for each item, and propose an algorithm, DWFPM (dynamic weighted frequent pattern mining), that makes use of this concept. Our algorithm can address situations where the weight (price or significance) of an item varies dynamically. It exploits a pattern growth mining technique to avoid the level-wise candidate set generation-and-test methodology. Furthermore, it requires only one database scan, so it is eligible for use in stream data mining. An extensive performance analysis shows that our algorithm is efficient and scalable for WFP mining using dynamic weights.

  4. DEVELOPING AND EXPLOITING A UNIQUE SEISMIC DATA SET FROM SOUTH AFRICAN GOLD MINES FOR SOURCE CHARACTERIZATION AND WAVE PROPAGATION

    SciTech Connect

    Julia, J; Nyblade, A A; Gok, R; Walter, W R; Linzer, L; Durrheim, R

    2008-07-08

    In this project, we are developing and exploiting a unique seismic data set to address the characteristics of small seismic events and the associated seismic signals observed at local (< 200 km) and regional (< 2000 km) distances. The dataset is being developed using mining-induced events from 3 deep gold mines in South Africa recorded on inmine networks (< 1 km) comprised of tens of high-frequency sensors, a network of 4 broadband stations installed as part of this project at the surface around the mines (1-10 km), and a network of existing broadband seismic stations at local/regional distances (50-1000 km) from the mines. After 1 year of seismic monitoring of mine activity (2007), over 10,000 events in the range -3.4 < ML < 4.4 have been catalogued and recorded by the in-mine networks. Events with positive magnitudes are generally well recorded by the surface-mine stations, while magnitudes 3.0 and larger are seen at regional distances (up to {approx}600 km) in high-pass filtered recordings. We have analyzed in-mine recordings in detail at one of the South African mines (Savuka) to (i) improve on reported hypocentral locations, (ii) verify sensor orientations, and (iii) determine full moment tensor solutions. Hypocentral relocations on all catalogued events have been obtained from P- and S-wave travel-times reported by the mine network operator through an automated procedure that selects travel-times falling on Wadati lines with slopes in the 0.6-0.7 range; sensor orientations have been verified and, when possible, corrected by correlating P-, SV-, and SH-waveforms obtained from theoretical and empirical (polarization filter) rotation angles; full moment tensor solutions have been obtained by inverting P-, SV-, and SH- spectral amplitudes measured on the theoretically rotated waveforms with visually assigned polarities. The relocation procedure has revealed that origin times often necessitate a negative correction of a few tenths of second and that hypocentral

  5. The EMCDDA/Pompidou Group treatment demand indicator protocol: a European core item set for treatment monitoring and reporting.

    PubMed

    Simon, R; Donmall, M; Hartnoll, R; Kokkevi, A; Ouwehand, A W; Stauffacher, M; Vicente, J

    1999-12-01

    Over the last decades inside and outside of Europe, treatment-based data have been used in epidemiological research on drugs and drug abuse. They offer information on hidden populations and allow to follow socially stigmatised behaviour. As this type of research can be done on rather low budgets, there are long-term projects run in many countries. Experts from the national systems in several EU member states have been working together to develop a common standard on the basis of the Pompidou Group (PG) Definitive Protocol. The items and basic definitions of the European Monitoring Centre for Drugs and Drug Addiction (EMCDDA)/PG Treatment Demand Indicator Protocol are described, which plays an important role in the process of harmonisation of data collection for the EMCDDA. Implementation strategies are described, and future steps are discussed.

  6. Concurrent and predictive validity of physical activity measurement items commonly used in clinical settings--data from SCAPIS pilot study.

    PubMed

    Ekblom, Örjan; Ekblom-Bak, Elin; Bolam, Kate A; Ekblom, Björn; Schmidt, Caroline; Söderberg, Stefan; Bergström, Göran; Börjesson, Mats

    2015-09-28

    As the understanding of how different aspects of the physical activity (PA) pattern relate to health and disease, proper assessment is increasingly important. In clinical care, self-reports are the most commonly used assessment technique. However, systematic comparisons between questions regarding concurrent or criterion validity are rare, as are measures of predictive validity. The aim of the study was to examine the concurrent (using accelerometry as reference) and predictive validity (for metabolic syndrome) of five PA questions. A sample of 948 middle-aged Swedish men and women reported their PA patterns via five different questions and wore an accelerometer (Actigraph GT3X) for a minimum of 4 days. Concurrent validity was assessed as correlations and ROC-analyses. Predictive validity was assessed using logistic regression, controlling for potential confounders. Concurrent validity was low-to-moderate (r <0.35 and ROC AUC <0.7) with large misclassifications regarding time spent sitting/sedentary and in moderate-to vigorous PA. The predictive validity of the questions was good, and one question (PHAS) showed an 80 % decreased odds-ratio of having metabolic syndrome, after taking potential confounders into consideration. In this mixed sample of adults, both concurrent and predictive validity vaired between items and between measures of the physical activity pattern. The PHAS and WALK items are proposed for assessment of adherence to PA recommendations. Assessing PA patterns using self-report measures results in methodological problems when trying to predict individual risk for the metabolic syndrome, as the concurrent validity generally was low. However, several of the investigated questions may be useful for assessing risk at a group level, showing better predictive validity.

  7. Environmental Intervention in Carryout Restaurants Increases Sales of Healthy Menu Items in a Low-Income Urban Setting.

    PubMed

    Lee-Kwan, Seung Hee; Bleich, Sara N; Kim, Hyunju; Colantuoni, Elizabeth; Gittelsohn, Joel

    2015-01-01

    To investigate how a pilot environmental intervention changed food sales patterns in carryout restaurants. Quasi-experimental. Low-income neighborhoods of Baltimore, Maryland. Seven carryouts (three intervention, four comparison). Phase 1, menu board revision and healthy menu labeling; phase 2, increase of healthy sides and beverages; and phase 3, promotion of cheaper and healthier combination meals. Weekly handwritten menu orders collected to assess changes in the proportion of units sold and revenue of healthy items (entrée, sides and beverages, and combined). Logistic and Poisson regression models with generalized estimating equations. In the intervention group, odds for healthy entrée units and odds for healthy side and beverage units sold significantly increased in phases 2 and 3; odds for healthy entrée revenue significantly increased in phase 1 (odds ratio [OR] 1.16, 95% confidence interval [CI] 1.08-1.26), phase 2 (OR 1.32, 95% CI 1.25-1.41), and phase 3 (OR 1.39, 95% CI 1.14-1.70); and odds for healthy side and beverage revenues increased significantly in phase 2 (OR 1.62, 95% CI 1.33-1.97) and phase 3 (OR 2.73, 95% CI 2.15-3.47) compared to baseline. Total revenue in the intervention group was significantly higher in all phases than in the comparison group (p < .05). Environmental intervention changes such as menu revision, menu labeling, improved healthy food selection, and competitive pricing can increase availability and sales of healthy items in carryouts.

  8. Construct Validity of a Set of Mathematical Superitems. A Report on the NIE/ECS Item Development Project.

    ERIC Educational Resources Information Center

    Romberg, Thomas A.; And Others

    Procedural documentation is presented for administering, scoring and analyzing data gathered to examine the construct validity of a set of superitems developed to assess student levels of mathematical reasoning ability. Each superitem includes a mathematical situation and a structured set of questions about that situation. The questions were based…

  9. Screening Test Items for Differential Item Functioning

    ERIC Educational Resources Information Center

    Longford, Nicholas T.

    2014-01-01

    A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

  10. Screening Test Items for Differential Item Functioning

    ERIC Educational Resources Information Center

    Longford, Nicholas T.

    2014-01-01

    A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

  11. Comparative Characterization of Crofelemer Samples Using Data Mining and Machine Learning Approaches With Analytical Stability Data Sets.

    PubMed

    Nariya, Maulik K; Kim, Jae Hyun; Xiong, Jian; Kleindl, Peter A; Hewarathna, Asha; Fisher, Adam C; Joshi, Sangeeta B; Schöneich, Christian; Forrest, M Laird; Middaugh, C Russell; Volkin, David B; Deeds, Eric J

    2017-07-22

    There is growing interest in generating physicochemical and biological analytical data sets to compare complex mixture drugs, for example, products from different manufacturers. In this work, we compare various crofelemer samples prepared from a single lot by filtration with varying molecular weight cutoffs combined with incubation for different times at different temperatures. The 2 preceding articles describe experimental data sets generated from analytical characterization of fractionated and degraded crofelemer samples. In this work, we use data mining techniques such as principal component analysis and mutual information scores to help visualize the data and determine discriminatory regions within these large data sets. The mutual information score identifies chemical signatures that differentiate crofelemer samples. These signatures, in many cases, would likely be missed by traditional data analysis tools. We also found that supervised learning classifiers robustly discriminate samples with around 99% classification accuracy, indicating that mathematical models of these physicochemical data sets are capable of identifying even subtle differences in crofelemer samples. Data mining and machine learning techniques can thus identify fingerprint-type attributes of complex mixture drugs that may be used for comparative characterization of products. Copyright © 2017 American Pharmacists Association®. All rights reserved.

  12. A predictive model for ADL dependence in community-living older adults based on a reduced set of cognitive status items.

    PubMed

    Gill, T M; Williams, C S; Richardson, E D; Berkman, L F; Tinetti, M E

    1997-04-01

    To develop and validate a simple tool, based on a reduced set of Mini-Mental State Examination (MMSE) items, that can be used to predict the onset of ADL dependence, and to compare the predictive accuracy of this new tool with that of the MMSE. Two prospective, population-based cohort studies, in tandem. The predictive model developed in the initial cohort was subsequently validated in a separate cohort. General community in New Haven, Connecticut. For the development cohort, 775 community-living persons, 72 years of age and older, who were independent at baseline in their ADI, function. For the validation cohort, 1038 comparable subjects. All subjects underwent a baseline interview and cognitive assessment in their homes by a trained research nurse using standard instrument. Self-reported ADLs were ascertained at 1 year and 3 years for the development cohort and at 1 year and 21/2 years for the validation cohort. ADL dependence developed in 221 (28.5%) subjects in the development cohort. Although the rate of ADL dependence increased within each MMSE domain as the number of incorrect items increased, only orientation and short-term memory remained significantly associated with ADL dependence in multivariable analysis. A predictive model, based on the presence of impairments in these two domains, was developed that stratified subjects into three risk groups. Rates of ADL dependence were 22% (neither domain impaired), 44% (one domain impaired), and 68% (both domains impaired) (P < .001). The corresponding rates in the validation cohort, in which 191 (18.4%) subjects developed ADL dependence, were 15%, 26%, and 45% (P < .001). The area under the ROC curves for the MMSE and the reduced item strategy were nearly identical at 0.63 and 0.62, respectively. A simple and valid six-item strategy, based on the presence of impairments in orientation and short-term memory, predicts the onset of ADL dependence as effectively as does the 30-item MMSE. This new tool may be useful

  13. Item Banking with Embedded Standards

    ERIC Educational Resources Information Center

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  14. A Bayesian method to mine spatial data sets to evaluate the vulnerability of human beings to catastrophic risk.

    PubMed

    Li, Lianfa; Wang, Jinfeng; Leung, Hareton; Zhao, Sisi

    2012-06-01

    Vulnerability of human beings exposed to a catastrophic disaster is affected by multiple factors that include hazard intensity, environment, and individual characteristics. The traditional approach to vulnerability assessment, based on the aggregate-area method and unsupervised learning, cannot incorporate spatial information; thus, vulnerability can be only roughly assessed. In this article, we propose Bayesian network (BN) and spatial analysis techniques to mine spatial data sets to evaluate the vulnerability of human beings. In our approach, spatial analysis is leveraged to preprocess the data; for example, kernel density analysis (KDA) and accumulative road cost surface modeling (ARCSM) are employed to quantify the influence of geofeatures on vulnerability and relate such influence to spatial distance. The knowledge- and data-based BN provides a consistent platform to integrate a variety of factors, including those extracted by KDA and ARCSM to model vulnerability uncertainty. We also consider the model's uncertainty and use the Bayesian model average and Occam's Window to average the multiple models obtained by our approach to robust prediction of the risk and vulnerability. We compare our approach with other probabilistic models in the case study of seismic risk and conclude that our approach is a good means to mining spatial data sets for evaluating vulnerability. © 2012 Society for Risk Analysis.

  15. Evaluation of Item Candidates: The PROMIS Qualitative Item Review

    PubMed Central

    DeWalt, Darren A.; Rothrock, Nan; Yount, Susan; Stone, Arthur A.

    2009-01-01

    One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures. PMID:17443114

  16. Mining remote sensing image data: an integration of fuzzy set theory and image understanding techniques for environmental change detection

    NASA Astrophysics Data System (ADS)

    Eklund, Peter W.; You, Jane; Deer, Peter

    2000-04-01

    This paper presents an image understanding approach to mine remotely sensed image data from different source dates for environmental change detection. It is focused on the immediate needs for knowledge discovery from large sets of image data for environmental monitoring. In contrast to the traditional approaches for change detection, we introduce a wavelet-based hierarchical scheme which integrates fuzzy set theory and image understanding techniques for knowledge discovery of the remote image data. The proposed approach includes algorithms for hierarchical change detection, region representations and classification. The effectiveness of the proposed algorithms is demonstrated throughout the completion of three tasks, namely hierarchial detection of change by fuzzy post classification comparisons, localization of change by B-spline based region representation, and categorization of change by hierarchial texture classification.

  17. Mining unusual and rare stellar spectra from large spectroscopic survey data sets using the outlier-detection method

    NASA Astrophysics Data System (ADS)

    Wei, Peng; Luo, Ali; Li, Yinbi; Pan, Jingchang; Tu, Liangping; Jiang, Bin; Kong, Xiao; Shi, Zhixin; Yi, Zhenping; Wang, Fengfei; Liu, Jie; Zhao, Yongheng

    2013-05-01

    The large number of spectra obtained from sky surveys such as the Sloan Digital Sky Survey (SDSS) and the survey executed by the Large sky Area Multi-Object fibre Spectroscopic Telescope (LAMOST, also called GuoShouJing Telescope) provide us with opportunities to search for peculiar or even unknown types of spectra. In response to the limitations of existing methods, a novel outlier-mining method, the Monte Carlo Local Outlier Factor (MCLOF), is proposed in this paper, which can be used to highlight unusual and rare spectra from large spectroscopic survey data sets. The MCLOF method exposes outliers automatically and efficiently by marking each spectrum with a number, i.e. using outlier index as a flag for an unusual and rare spectrum. The Local Outlier Factor (LOF) represents how unusual and rare a spectrum is compared with other spectra and the Monte Carlo method is used to compute the global LOF for each spectrum by randomly selecting samples in each independent iteration. Our MCLOF method is applied to over half a million stellar spectra (classified as STAR by the SDSS Pipeline) from the SDSS data release 8 (DR8) and a total of 37 033 spectra are selected as outliers with signal-to-noise ratio (S/N) ≥ 3 and outlier index ≥0.85. Some of these outliers are shown to be binary stars, emission-line stars, carbon stars and stars with unusual continuum. The results show that our proposed method can efficiently highlight these unusual spectra from the survey data sets. In addition, some relatively rare and interesting spectra are selected, indicating that the proposed method can also be used to mine rare, even unknown, spectra. The proposed method can be applicable not only to spectral survey data sets but also to other types of survey data sets. The spectra of all peculiar objects selected by our MCLOF method are available from a user-friendly website: http://sciwiki.lamost.org/Miningdr8/.

  18. Mining Hesitation Information by Vague Association Rules

    NASA Astrophysics Data System (ADS)

    Lu, An; Ng, Wilfred

    In many online shopping applications, such as Amazon and eBay, traditional Association Rule (AR) mining has limitations as it only deals with the items that are sold but ignores the items that are almost sold (for example, those items that are put into the basket but not checked out). We say that those almost sold items carry hesitation information, since customers are hesitating to buy them. The hesitation information of items is valuable knowledge for the design of good selling strategies. However, there is no conceptual model that is able to capture different statuses of hesitation information. Herein, we apply and extend vague set theory in the context of AR mining. We define the concepts of attractiveness and hesitation of an item, which represent the overall information of a customer's intent on an item. Based on the two concepts, we propose the notion of Vague Association Rules (VARs). We devise an efficient algorithm to mine the VARs. Our experiments show that our algorithm is efficient and the VARs capture more specific and richer information than do the traditional ARs.

  19. Unlocking Data for Statistical Analyses and Data Mining: Generic Case Extraction of Clinical Items from i2b2 and tranSMART.

    PubMed

    Firnkorn, Daniel; Merker, Sebastian; Ganzinger, Matthias; Muley, Thomas; Knaup, Petra

    2016-01-01

    In medical science, modern IT concepts are increasingly important to gather new findings out of complex diseases. Data Warehouses (DWH) as central data repository systems play a key role by providing standardized, high-quality and secure medical data for effective analyses. However, DWHs in medicine must fulfil various requirements concerning data privacy and the ability to describe the complexity of (rare) disease phenomena. Here, i2b2 and tranSMART are free alternatives representing DWH solutions especially developed for medical informatics purposes. But different functionalities are not yet provided in a sufficient way. In fact, data import and export is still a major problem because of the diversity of schemas, parameter definitions and data quality which are described variously in each single clinic. Further, statistical analyses inside i2b2 and tranSMART are possible, but restricted to the implemented functions. Thus, data export is needed to provide a data basis which can be directly included within statistics software like SPSS and SAS or data mining tools like Weka and RapidMiner. The standard export tools of i2b2 and tranSMART are more or less creating a database dump of key-value pairs which cannot be used immediately by the mentioned tools. They need an instance-based or a case-based representation of each patient. To overcome this lack, we developed a concept called Generic Case Extractor (GCE) which pivots the key-value pairs of each clinical fact into a row-oriented format for each patient sufficient to enable analyses in a broader context. Therefore, complex pivotisation routines where necessary to ensure temporal consistency especially in terms of different data sets and the occurrence of identical but repeated parameters like follow-up data. GCE is embedded inside a comprehensive software platform for systems medicine.

  20. Hydrogeologic setting and simulation of groundwater flow near the Canterbury and Leadville Mine Drainage Tunnels, Leadville, Colorado

    USGS Publications Warehouse

    Wellman, Tristan P.; Paschke, Suzanne S.; Minsley, Burke; Dupree, Jean A.

    2011-01-01

    63 percent of the total water budget, respectively). The largest contributions to the water budget were groundwater entering from the upper reaches of the watershed and the hydrologic interaction of the groundwater with the East Fork Arkansas River. Potentiometric surface maps of the simulated model results were generated for depths of 50, 100, and 250 m. The surfaces revealed a positive trend in hydraulic head with land-surface altitude and evidence of increased control on fluid movement by the fault network structure at progressively greater depths in the aquifer. Results of advective particle-tracking simulations indicate that the sets of simulated flow paths for the Canterbury Tunnel and the Leadville Mine Drainage Tunnel were mutually exclusive of one another, which also suggested that no major hydraulic connection was present between the tunnels. Particle-tracking simulations also revealed that although the fault network generally restricted groundwater movement locally, hydrologic conditions were such that groundwater did cross the fault network at many locations. This cross-fault movement indicates that the fault network controls regional groundwater flow to some degree but is not a complete barrier to flow. The cumulative distributions of adjusted age results for the watershed indicate that approximately 30 percent of the flow pathways transmit groundwater that was younger than 68 years old (post-1941) and that about 70 percent of the flow pathways transmit old groundwater. The particle-tracking results are consistent with the apparent ages and mixing ratios developed from the chlorofluorocarbon and tritium results. The model simulations also indicate that approximately 50 percent of the groundwater flowing through the study area was less than 200 years old and about 50 percent of the groundwater flowing through the study area is old water stored in low-permeability geologic units and fault blocks. As a final examination of model response, the conductance

  1. Data mining of the GAW14 simulated data using rough set theory and tree-based methods.

    PubMed

    Wei, Liang-Ying; Huang, Cheng-Lung; Chen, Chien-Hsiun

    2005-12-30

    Rough set theory and decision trees are data mining methods used for dealing with vagueness and uncertainty. They have been utilized to unearth hidden patterns in complicated datasets collected for industrial processes. The Genetic Analysis Workshop 14 simulated data were generated using a system that implemented multiple correlations among four consequential layers of genetic data (disease-related loci, endophenotypes, phenotypes, and one disease trait). When information of one layer was blocked and uncertainty was created in the correlations among these layers, the correlation between the first and last layers (susceptibility genes and the disease trait in this case), was not easily directly detected. In this study, we proposed a two-stage process that applied rough set theory and decision trees to identify genes susceptible to the disease trait. During the first stage, based on phenotypes of subjects and their parents, decision trees were built to predict trait values. Phenotypes retained in the decision trees were then advanced to the second stage, where rough set theory was applied to discover the minimal subsets of genes associated with the disease trait. For comparison, decision trees were also constructed to map susceptible genes during the second stage. Our results showed that the decision trees of the first stage had accuracy rates of about 99% in predicting the disease trait. The decision trees and rough set theory failed to identify the true disease-related loci.

  2. Nurse staffing levels and outcomes - mining the UK national data sets for insight.

    PubMed

    Leary, Alison; Tomai, Barbara; Swift, Adrian; Woodward, Andrew; Hurst, Keith

    2017-04-18

    Purpose Despite the generation of mass data by the nursing workforce, determining the impact of the contribution to patient safety remains challenging. Several cross-sectional studies have indicated a relationship between staffing and safety. The purpose of this paper is to uncover possible associations and explore if a deeper understanding of relationships between staffing and other factors such as safety could be revealed within routinely collected national data sets. Design/methodology/approach Two longitudinal routinely collected data sets consisting of 30 years of UK nurse staffing data and seven years of National Health Service (NHS) benchmark data such as survey results, safety and other indicators were used. A correlation matrix was built and a linear correlation operation was applied (Pearson product-moment correlation coefficient). Findings A number of associations were revealed within both the UK staffing data set and the NHS benchmarking data set. However, the challenges of using these data sets soon became apparent. Practical implications Staff time and effort are required to collect these data. The limitations of these data sets include inconsistent data collection and quality. The mode of data collection and the itemset collected should be reviewed to generate a data set with robust clinical application. Originality/value This paper revealed that relationships are likely to be complex and non-linear; however, the main contribution of the paper is the identification of the limitations of routinely collected data. Much time and effort is expended in collecting this data; however, its validity, usefulness and method of routine national data collection appear to require re-examination.

  3. A Study of the Homogeneity of Items Produced From Item Forms Across Different Taxonomic Levels.

    ERIC Educational Resources Information Center

    Weber, Margaret B.; Argo, Jana K.

    This study determined whether item forms ( rules for constructing items related to a domain or set of tasks) would enable naive item writers to generate multiple-choice items at three taxonomic levels--knowledge, comprehension, and application. Students wrote 120 multiple-choice items from 20 item forms, corresponding to educational objectives…

  4. A Study of the Homogeneity of Items Produced From Item Forms Across Different Taxonomic Levels.

    ERIC Educational Resources Information Center

    Weber, Margaret B.; Argo, Jana K.

    This study determined whether item forms ( rules for constructing items related to a domain or set of tasks) would enable naive item writers to generate multiple-choice items at three taxonomic levels--knowledge, comprehension, and application. Students wrote 120 multiple-choice items from 20 item forms, corresponding to educational objectives…

  5. Measuring impairments of functioning and health in patients with axial spondyloarthritis by using the ASAS Health Index and the Environmental Item Set: translation and cross-cultural adaptation into 15 languages

    PubMed Central

    Kiltz, U; van der Heijde, D; Boonen, A; Bautista-Molano, W; Burgos-Vargas, R; Chiowchanwisawakit, P; Duruoz, T; El-Zorkany, B; Essers, I; Gaydukova, I; Géher, P; Gossec, L; Grazio, S; Gu, J; Khan, M A; Kim, T J; Maksymowych, W P; Marzo-Ortega, H; Navarro-Compán, V; Olivieri, I; Patrikos, D; Pimentel-Santos, F M; Schirmer, M; van den Bosch, F; Weber, U; Zochling, J; Braun, J

    2016-01-01

    Introduction The Assessments of SpondyloArthritis international society Health Index (ASAS HI) measures functioning and health in patients with spondyloarthritis (SpA) across 17 aspects of health and 9 environmental factors (EF). The objective was to translate and adapt the original English version of the ASAS HI, including the EF Item Set, cross-culturally into 15 languages. Methods Translation and cross-cultural adaptation has been carried out following the forward–backward procedure. In the cognitive debriefing, 10 patients/country across a broad spectrum of sociodemographic background, were included. Results The ASAS HI and the EF Item Set were translated into Arabic, Chinese, Croatian, Dutch, French, German, Greek, Hungarian, Italian, Korean, Portuguese, Russian, Spanish, Thai and Turkish. Some difficulties were experienced with translation of the contextual factors indicating that these concepts may be more culturally-dependent. A total of 215 patients with axial SpA across 23 countries (62.3% men, mean (SD) age 42.4 (13.9) years) participated in the field test. Cognitive debriefing showed that items of the ASAS HI and EF Item Set are clear, relevant and comprehensive. All versions were accepted with minor modifications with respect to item wording and response option. The wording of three items had to be adapted to improve clarity. As a result of cognitive debriefing, a new response option ‘not applicable’ was added to two items of the ASAS HI to improve appropriateness. Discussion This study showed that the items of the ASAS HI including the EFs were readily adaptable throughout all countries, indicating that the concepts covered were comprehensive, clear and meaningful in different cultures. PMID:27752358

  6. Measuring impairments of functioning and health in patients with axial spondyloarthritis by using the ASAS Health Index and the Environmental Item Set: translation and cross-cultural adaptation into 15 languages.

    PubMed

    Kiltz, U; van der Heijde, D; Boonen, A; Bautista-Molano, W; Burgos-Vargas, R; Chiowchanwisawakit, P; Duruoz, T; El-Zorkany, B; Essers, I; Gaydukova, I; Géher, P; Gossec, L; Grazio, S; Gu, J; Khan, M A; Kim, T J; Maksymowych, W P; Marzo-Ortega, H; Navarro-Compán, V; Olivieri, I; Patrikos, D; Pimentel-Santos, F M; Schirmer, M; van den Bosch, F; Weber, U; Zochling, J; Braun, J

    2016-01-01

    The Assessments of SpondyloArthritis international society Health Index (ASAS HI) measures functioning and health in patients with spondyloarthritis (SpA) across 17 aspects of health and 9 environmental factors (EF). The objective was to translate and adapt the original English version of the ASAS HI, including the EF Item Set, cross-culturally into 15 languages. Translation and cross-cultural adaptation has been carried out following the forward-backward procedure. In the cognitive debriefing, 10 patients/country across a broad spectrum of sociodemographic background, were included. The ASAS HI and the EF Item Set were translated into Arabic, Chinese, Croatian, Dutch, French, German, Greek, Hungarian, Italian, Korean, Portuguese, Russian, Spanish, Thai and Turkish. Some difficulties were experienced with translation of the contextual factors indicating that these concepts may be more culturally-dependent. A total of 215 patients with axial SpA across 23 countries (62.3% men, mean (SD) age 42.4 (13.9) years) participated in the field test. Cognitive debriefing showed that items of the ASAS HI and EF Item Set are clear, relevant and comprehensive. All versions were accepted with minor modifications with respect to item wording and response option. The wording of three items had to be adapted to improve clarity. As a result of cognitive debriefing, a new response option 'not applicable' was added to two items of the ASAS HI to improve appropriateness. This study showed that the items of the ASAS HI including the EFs were readily adaptable throughout all countries, indicating that the concepts covered were comprehensive, clear and meaningful in different cultures.

  7. Bibliomining for Automated Collection Development in a Digital Library Setting: Using Data Mining To Discover Web-Based Scholarly Research Works.

    ERIC Educational Resources Information Center

    Nicholson, Scott

    2003-01-01

    Discusses quality issues regarding Web sites and describes research that created an intelligent agent for automated collection development in a digital academic library setting, which uses a predictive model based on facets of each Web page to select scholarly works. Describes the use of bibliomining, or data mining for libraries. (Author/LRW)

  8. On Bayesian Rules for Selecting 3PL Binary Items for Criterion-Referenced Interpretations and Creating Booklets for Bookmark Standard Setting.

    ERIC Educational Resources Information Center

    Huynh, Huynh

    By noting that a Rasch or two parameter logistic (2PL) item belongs to the exponential family of random variables and that the probability density function (pdf) of the correct response (X=1) and the incorrect response (X=0) are symmetric with respect to the vertical line at the item location, it is shown that the conjugate prior for ability is…

  9. An Experimental Study of the Effect of Judges' Knowledge of Item Data on Two Forms of the Angoff Standard Setting Method.

    ERIC Educational Resources Information Center

    Garrido, Mariquita; Payne, David A.

    Minimum competency cut-off scores on a statistics exam were estimated under four conditions: the Angoff judging method with item data (n=20), and without data available (n=19); and the Modified Angoff method with (n=19), and without (n=19) item data available to judges. The Angoff method required free response percentage estimates (0-100) percent,…

  10. Data Mining on Large Data Set for Predicting Salmon Spawning Habitat

    SciTech Connect

    Xie, YuLong; Murray, Christopher J.; Hanrahan, Timothy P.; Geist, David R.

    2008-07-01

    Hydraulic properties related to river flow affect salmon spawning habitat. Accurate prediction of salmon spawning habitat and understanding the influential properties on the spawning behavior are of great interest for hydroelectric dam management. Previous research predicted salmon spawning habitat through deriving river specific spawning suitability indices and employing a function estimate method like logistic regression on several static river flow related properties and had some success. The objective of this study was two-fold. First dynamic river flow properties associated with upstream dam operation were successfully derived from a huge set of time series of both water velocity and water depth for about one fifth of a million habitat cells through principal component analysis (PCA) using nonlinear iterative partial least squares (NIPLAS). The inclusion of dynamic variables in the models greatly improved the model prediction. Secondly, nine machine learning methods were applied to the data and it was found that decision tree and rule induction methods were generally outperformed usually used logistic regression. Specifically random forest, an advanced decision tree algorithm, provided unanimous better results. Over-prediction problem in previous studies were greatly alleviated.

  11. Identifying high-cost patients using data mining techniques and a small set of non-trivial attributes.

    PubMed

    Izad Shenas, Seyed Abdolmotalleb; Raahemi, Bijan; Hossein Tekieh, Mohammad; Kuziemsky, Craig

    2014-10-01

    In this paper, we use data mining techniques, namely neural networks and decision trees, to build predictive models to identify very high-cost patients in the top 5 percentile among the general population. A large empirical dataset from the Medical Expenditure Panel Survey with 98,175 records was used in our study. After pre-processing, partitioning and balancing the data, the refined dataset of 31,704 records was modeled by Decision Trees (including C5.0 and CHAID), and Neural Networks. The performances of the models are analyzed using various measures including accuracy, G-mean, and Area under ROC curve. We concluded that the CHAID classifier returns the best G-mean and AUC measures for top performing predictive models ranging from 76% to 85%, and 0.812 to 0.942 units, respectively. We also identify a small set of 5 non-trivial attributes among a primary set of 66 attributes to identify the top 5% of the high cost population. The attributes are the individual׳s overall health perception, age, history of blood cholesterol check, history of physical/sensory/mental limitations, and history of colonic prevention measures. The small set of attributes are what we call non-trivial and does not include visits to care providers, doctors or hospitals, which are highly correlated with expenditures and does not offer new insight to the data. The results of this study can be used by healthcare data analysts, policy makers, insurer, and healthcare planners to improve the delivery of health services.

  12. Oral/dental items in the resident assessment instrument - minimum Data Set 2.0 lack validity: results of a retrospective, longitudinal validation study.

    PubMed

    Hoben, Matthias; Poss, Jeffrey W; Norton, Peter G; Estabrooks, Carole A

    2016-01-01

    Oral health in nursing home residents is poor. Robust, mandated assessment tools such as the Resident Assessment Instrument - Minimum Data Set (RAI-MDS) 2.0 are key to monitoring and improving quality of oral health care in nursing homes. However, psychometric properties of RAI-MDS 2.0 oral/dental items have been challenged and criterion validity of these items has never been assessed. We used 73,829 RAI-MDS 2.0 records (13,118 residents), collected in a stratified random sample of 30 urban nursing homes in Western Canada (2007-2012). We derived a subsample of all residents (n = 2,711) with an admission and two or more subsequent annual assessments. Using Generalized Estimating Equations, adjusted for known covariates of nursing home residents' oral health, we assessed the association of oral/dental problems with time, dentate status, dementia, debris, and daily cleaning. Prevalence of oral/dental problems fluctuated (4.8 %-5.6 %) with no significant differences across time. This range of prevalence is substantially smaller than the ones reported by studies using clinical assessments by dental professionals. Denture wearers were less likely than dentate residents to have oral/dental problems (adjusted odds ratio [OR] = 0.458, 95 % confidence interval [CI]: 0.308, 0.680). Residents lacking teeth and not wearing dentures had higher odds than dentate residents of oral/dental problems (adjusted OR = 2.718, 95 % CI: 1.845, 4.003). Oral/dental problems were more prevalent in persons with debris (OR = 2.187, 95 % CI: 1.565, 3.057). Of the other variables assessed, only age at assessment was significantly associated with oral/dental problems. Robust, reliable RAI-MDS 2.0 oral health indicators are vital to monitoring and improving oral health related quality and safety in nursing homes. However, severe underdetection of oral/dental problems and lack of association of well-known oral health predictors with oral/dental problems suggest validity problems

  13. Empirical Mining of Large Data Sets Already Helps to Solve Practical Ecological Problems; A Panoply of Working Examples (Invited)

    NASA Astrophysics Data System (ADS)

    Hargrove, W. W.; Hoffman, F. M.; Kumar, J.; Spruce, J.; Norman, S. P.

    2013-12-01

    Here we present diverse examples where empirical mining and statistical analysis of large data sets have already been shown to be useful for a wide variety of practical decision-making problems within the realm of large-scale ecology. Because a full understanding and appreciation of particular ecological phenomena are possible only after hypothesis-directed research regarding the existence and nature of that process, some ecologists may feel that purely empirical data harvesting may represent a less-than-satisfactory approach. Restricting ourselves exclusively to process-driven approaches, however, may actually slow progress, particularly for more complex or subtle ecological processes. We may not be able to afford the delays caused by such directed approaches. Rather than attempting to formulate and ask every relevant question correctly, empirical methods allow trends, relationships and associations to emerge freely from the data themselves, unencumbered by a priori theories, ideas and prejudices that have been imposed upon them. Although they cannot directly demonstrate causality, empirical methods can be extremely efficient at uncovering strong correlations with intermediate "linking" variables. In practice, these correlative structures and linking variables, once identified, may provide sufficient predictive power to be useful themselves. Such correlation "shadows" of causation can be harnessed by, e.g., Bayesian Belief Nets, which bias ecological management decisions, made with incomplete information, toward favorable outcomes. Empirical data-harvesting also generates a myriad of testable hypotheses regarding processes, some of which may even be correct. Quantitative statistical regionalizations based on quantitative multivariate similarity have lended insights into carbon eddy-flux direction and magnitude, wildfire biophysical conditions, phenological ecoregions useful for vegetation type mapping and monitoring, forest disease risk maps (e.g., sudden oak

  14. Data set of world phosphate mines, deposits, and occurrences: Part A. geologic data; Part B. location and mineral economic data

    USGS Publications Warehouse

    Chernoff, Carlotta B.; Orris, G.J.

    2002-01-01

    An inventory of more than 1,600 world phosphate mines, deposits, and occurrences was compiled from smaller data sets collected as part of multiple research efforts by Carlotta Chernoff, University of Arizona, and Greta Orris, U.S. Geological Survey. These data have been utilized during studies of black shale depositional environments and to construct phosphate deposit models. The compiled data have been edited for consistency and additional location information has been added where possible. The database of compiled phosphate information is being released in two sections; the geologic data in one section and the location and mineral economic data in the second. This report, U.S. Geological Survey Open-File Report 02–156–A, contains the geologic data and is best used with the complimentary data contained in Open-File Report 02–156–B. U.S. Geological Survey Open-File Report 02–156–B contains commodity data, location and analytical data, a variety of mineral economic data, reference information, and pointers to related records in the U.S. Geological Survey National mineral databases—MASMILS and MRDS.

  15. Action Items

    ERIC Educational Resources Information Center

    Baker, Mark; Keane, Brian

    2009-01-01

    Maximizing school resources and managing a shrinking budget--these are two important items affected when a building's roofing system does not perform properly. Rather than acting in haste, school and university administrators should do what every teacher tells a student prior to answering any question: think through the research and studies to…

  16. Clinical utility of the MMPI-2-RF SUI items and scale in a forensic inpatient setting: Association with interview self-report and future suicidal behaviors.

    PubMed

    Glassmire, David M; Tarescavage, Anthony M; Burchett, Danielle; Martinez, Jennifer; Gomez, Anthony

    2016-11-01

    In this study, we examined whether the 5 Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008/2011) Suicidal/Death Ideation (SUI) items (93, 120, 164, 251, and 334) would provide incremental suicide-risk assessment information after accounting for information garnered from clinical interview questions. Among 229 forensic inpatients (146 men, 83 women) who were administered the MMPI-2-RF, 34.9% endorsed at least 1 SUI item. We found that patients who endorsed SUI items on the MMPI-2-RF concurrently denied conceptually related suicide-risk information during the clinical interview. For instance, 8% of the sample endorsed Item 93 (indicating recent suicidal ideation), yet denied current suicidal ideation upon interview. Conversely, only 2.2% of the sample endorsed current suicidal ideation during the interview, yet denied recent suicidal ideation on Item 93. The SUI scale, as well as the MMPI-2-RF Demoralization (RCd) and Low Positive Emotions (RC2) scales, correlated significantly and meaningfully with conceptually related suicide-risk information from the interview, including history of suicide attempts, history of suicidal ideation, current suicidal ideation, and months since last suicide attempt. We also found that the SUI scale added incremental variance (after accounting for information garnered from the interview and after accounting for scores on RCd and RC2) to predictions of future suicidal behavior within 1 year of testing. Relative risk ratios indicated that both SUI-item endorsement and the presence of interview-reported risk information significantly and meaningfully increased the risk of suicidal behavior in the year following testing, particularly when endorsement of suicidal ideation occurred for both methods of self-report. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  17. Matching Factors Derived from the Same Set of Items but Different Groups of Respondents: A Review of Methods with an Example.

    ERIC Educational Resources Information Center

    Bowers, John J.

    Procedures devised for computing factor matching statistics in questionnaire construction are described. In a study of the impact of vocational education on social, affective and nontechnical development of students, an attitude questionnaire to measure outcomes was developed. Each item in the final section had a six-point Likert type response…

  18. Accommodations for English Language Learner Students: The Effect of Linguistic Modification of Math Test Item Sets. Final Report. NCEE 2009-4079

    ERIC Educational Resources Information Center

    Sato, Edynn; Rabinowitz, Stanley; Gallagher, Carole; Huang, Chun-Wei

    2010-01-01

    This study examined the effect of linguistic modification on middle school students' ability to show what they know and can do on math assessments. REL West's study on middle school math assessment accommodations found that simplifying the language--or linguistic modification--on standardized math test items made it easier for English Language…

  19. Single-item measures for depression and anxiety: Validation of the Screening Tool for Psychological Distress in an inpatient cardiology setting.

    PubMed

    Young, Quincy-Robyn; Nguyen, Michelle; Roth, Susan; Broadberry, Ann; Mackay, Martha H

    2015-12-01

    Depression and anxiety are common among patients with cardiovascular disease (CVD) and confer significant cardiac risk, contributing to CVD morbidity and mortality. Unfortunately, due to the lack of screening tools that address the specific needs of hospitalized patients, few cardiac inpatient programs offer routine screening for these forms of psychological distress, despite recommendations to do so. The purpose of this study was to validate single-item measures for depression and anxiety among cardiac inpatients. Consecutive inpatients were recruited from the cardiology and cardiac surgery step-down units at a university-affiliated, quaternary-care hospital. Subjects completed a questionnaire that included: (a) demographics, (b) single-item-measures for depression and anxiety (from the Screening Tool for Psychological Distress (STOP-D)), and (c) Hospital Anxiety and Depression Scale (HADS). One hundred and five participants were recruited with a wide variety of cardiac diagnoses, having a mean age of 66 years, and 28% were women. Both STOP-D items were highly correlated with their corresponding validated measures and demonstrated robust receiver-operator characteristic curves. Severity scores on both items correlated well with established severity cut-off scores on the corresponding subscales of the HADS. The STOP-D is a self-administered, self-report measure using two independent items that provide severity scores for depression and anxiety. The tool performs very well compared with other previously validated measures. Requiring no additional scoring and being free, STOP-D offers a simple and valid method for identifying hospitalized cardiac patients who are experiencing psychological distress. This crucial first step triggers initiation of appropriate monitoring and intervention, thus reducing the likelihood of the adverse cardiac outcomes associated with psychological distress. © The European Society of Cardiology 2014.

  20. Optimal Calibration Designs for Tests of Polytomously Scored Items Described by Item Response Theory Models.

    ERIC Educational Resources Information Center

    Holman, Rebecca; Berger, Martijn P. F.

    2001-01-01

    Studied calibration designs that maximize the determinants of Fisher's information matrix on the item parameters for sets of polytomously scored items. Analyzed these items using a number of item response theory models. Results show that for the data and models used, a D-optimal calibration design for an answer or set of answers can reduce the…

  1. Item-Writing Guidelines for Physics

    ERIC Educational Resources Information Center

    Regan, Tom

    2015-01-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…

  2. Item-Writing Guidelines for Physics

    ERIC Educational Resources Information Center

    Regan, Tom

    2015-01-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…

  3. Accuracy Assessment of Geometrical Elements for Setting-Out in Horizontal Plane of Conveying Chambers at the Bauxite Mine "KOSTURI" Srebrenica

    NASA Astrophysics Data System (ADS)

    Milutinović, Aleksandar; Ganić, Aleksandar; Tokalić, Rade

    2014-03-01

    Setting-out of objects on the exploitation field of the mine, both in surface mining and in the underground mines, is determined by the specified setting-out accuracy of reference points, which are best to define spatial position of the object projected. For the purpose of achieving of the specified accuracy, it is necessary to perform a priori accuracy assessment of parameters, which are to be used when performing setting-out. Based on the a priori accuracy assessment, verification of the quality of geometrical setting- -out elements specified in the layout; definition of the accuracy for setting-out of geometrical elements; selection of setting-out method; selection at the type and class of instruments and tools that need to be applied in order to achieve predefined accuracy. The paper displays the accuracy assessment of geometrical elements for setting-out of the main haul gallery, haul downcast and helical conveying downcasts in shape of an inclined helix in horizontal plane, using the example of the underground bauxite mine »Kosturi«, Srebrenica. Wytyczanie obiektów na polu wydobywczym w kopalniach, zarówno podziemnych jak i odkrywkowych, zależy w dużej mierze od określonej dokładności wytyczania punktów referencyjnych, przy pomocy których określane jest następnie położenie przestrzenne pozostałych obiektów. W celu uzyskania założonej dokładności, należy przeprowadzić wstępną analizę dokładności oszacowania parametrów które następnie wykorzystane będą w procesie wytyczania. W oparciu o wyniki wstępnej analizy dokładności dokonuje się weryfikacji jakości geometrycznego wytyczenia elementów zaznaczonych na szkicu, uwzględniając te wyniki dobrać należy odpowiednią metodę wytyczania i rodzaj oraz klasę wykorzystywanych narzędzi i instrumentów, tak by osiągnąć założony poziom dokładności. W pracy przedstawiono oszacowanie dokładności wytyczania elementów geometrycznych dla głównego chodnika transportowego

  4. Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats

    Treesearch

    Richard Trans Mills; Forrest M Hoffman; Jitendra Kumar; William W. Hargrove

    2011-01-01

    We investigate methods for geospatiotemporal data mining of multi-year land surface phenology data (250 m2 Normalized Difference Vegetation Index (NDVI) values derived from the Moderate Resolution Imaging Spectrometer (MODIS) in this study) for the conterminous United States (CONUS) as part of an early warning system for detecting threats to forest ecosystems. The...

  5. Perspectives on Knowledge Discovery Algorithms Recently Introduced in Chemoinformatics: Rough Set Theory, Association Rule Mining, Emerging Patterns, and Formal Concept Analysis.

    PubMed

    Gardiner, Eleanor J; Gillet, Valerie J

    2015-09-28

    Knowledge Discovery in Databases (KDD) refers to the use of methodologies from machine learning, pattern recognition, statistics, and other fields to extract knowledge from large collections of data, where the knowledge is not explicitly available as part of the database structure. In this paper, we describe four modern data mining techniques, Rough Set Theory (RST), Association Rule Mining (ARM), Emerging Pattern Mining (EP), and Formal Concept Analysis (FCA), and we have attempted to give an exhaustive list of their chemoinformatics applications. One of the main strengths of these methods is their descriptive ability. When used to derive rules, for example, in structure-activity relationships, the rules have clear physical meaning. This review has shown that there are close relationships between the methods. Often apparent differences lie in the way in which the problem under investigation has been formulated which can lead to the natural adoption of one or other method. For example, the idea of a structural alert, as a structure which is present in toxic and absent in nontoxic compounds, leads to the natural formulation of an Emerging Pattern search. Despite the similarities between the methods, each has its strengths. RST is useful for dealing with uncertain and noisy data. Its main chemoinformatics applications so far have been in feature extraction and feature reduction, the latter often as input to another data mining method, such as an Support Vector Machine (SVM). ARM has mostly been used for frequent subgraph mining. EP and FCA have both been used to mine both structural and nonstructural patterns for classification of both active and inactive molecules. Since their introduction in the 1980s and 1990s, RST, ARM, EP, and FCA have found wide-ranging applications, with many thousands of citations in Web of Science, but their adoption by the chemoinformatics community has been relatively slow. Advances, both in computer power and in algorithm development

  6. Options for compiling an inventory of mining waste sites throughout Europe by combining Landsat-TM derived information with national and pan-European thematic data sets

    NASA Astrophysics Data System (ADS)

    Vijdea, Anca-Marina; Sommer, Stefan

    2004-10-01

    Presently no reliable synoptic picture of number, extent, distribution and emissions from mining waste sites exists, neither for EU member states, nor for the Accession and Candidate Countries. At EU level, this information is needed to assess the large range of environmental impacts caused by mining wastes and their emissions in a coherent way across the different policies addressing the protection and sustainable use of environmental resources. The core task lies in the harmonised collection and standardised compilation and evaluation of existing data and in connecting them to a geographical reference system compatible with other European data sets. In the proposed approach information from national registers of mining wastes is linked to related standardized spatial data layers such as CORINE Land Cover (the classes of mineral extraction sites, dump sites) or other data sets available in the EUROSTAT GISCO data base, thus adding the spatial dimension at regional scale. Higher level of spatial detail and distinction between mineral extraction site and waste sites with or without accumulation of potentially hazardous material is added by remote sensing, applying a semi-automated principal component analysis (PCA) to selected spectral channels of geo-referenced Landsat-TM full scenes. The method was demonstrated on large areas covering approximately 120000 km2 of Slovakia and Romania and was validated against mining-related features from Pan-European and/or national databases, detailed geological maps, mineral resource maps, as well as by a GIS analysis showing the distribution of anomalous pixels in the above-mentioned features compared to the main land cover classes.

  7. Detection performance assessment of hand-held mine detection systems in a procurement process: test set-up for MDs and MD/GPRs

    NASA Astrophysics Data System (ADS)

    Schoolderman, Arnold J.; Roosenboom, Jacques H. J.

    2005-06-01

    The Engineers Centre of Expertise of the Royal Netherlands Army (RNLA) has conducted a study on countermine in peace operations. This study, finished in 2002, concluded that the final solution to countermine will depend in the first place on better detection of buried low-metal mines, e.g. by direct detection of the explosive components in mines. Until such detection systems are available, intermediate solutions are necessary in order to assure freedom of movement in peace operations. Because countermine operations consist of a number of different activities (area preparation, detection, clearance, etc) and the suitability of the different types of available equipment depends on the scenario, the toolbox concept for countermine equipment was adopted. In 2003 a procurement process was started in order to fill this toolbox with commercial-off-the-shelf and military-off-the-shelf equipment. The paper gives a concise description of the study on countermine operations and the procurement process, and subsequently focuses on the set-up of the tests that were conducted in the framework of the procurement of hand-held mine detection systems, like metal detectors and dual-sensor mine detectors. Programs of requirements for these systems were drawn up, aiming at systems for general use and special purpose systems. Blind tests to check the compliancy to the detection performance requirements were designed and conducted in the short timeframe that was available in the procurement process. These tests are discussed in this paper, including the set-up of the test lanes, the targets used and their depths, and the role of the operator. The tests of the capability of the detectors to discriminate small targets adjacent to large targets were conducted according the guidelines of the CEN Workshop Agreement on metal detector tests. Although the results of the tests are commercially confidential, conclusions and lessons learned from the execution of these tests are presented.

  8. Visual Data Mining of Astronomic Data With Virtual Reality Spaces: Understanding the Underlying Structure of Large Data Sets

    NASA Astrophysics Data System (ADS)

    Valdés, J. J.

    2005-12-01

    The information explosion in astronomy requires the development of data mining procedures that speed up the process of scientific discovery, and the in-depth understanding of the internal structure of the data. This is crucial for the identification of valid, novel, potentially useful, and understandable patterns (regularities, oddities, etc.). A Virtual Reality (VR) approach for large heterogeneous, incomplete and imprecise information is introduced for the problem of visualizing and analyzing astronomic data. The method is based on mappings between one heterogeneous space representing the data, and a homogeneous virtual reality space. This VR-based visual data mining technique allows the incorporation of the unmatched geometric capabilities of the human brain into the knowledge discovery process, and helps in understanding data structure and patterns. This approach has been applied successfully to a wide variety of real-world domains, and it has a large potential in astronomy. Examples are presented from the domain of galaxy research.

  9. Relating a Reading Comprehension Test to the CEFR Levels: A Case of Standard Setting in Practice with Focus on Judges and Items

    ERIC Educational Resources Information Center

    Figueras, Neus; Kaftandjieva, Felianka; Takala, Sauli

    2013-01-01

    The article addresses some problems and options in setting standards on language tests and examinations. More specifically, it reports on a set of three workshops conducted in the European context where standard setting in language education typically concerns linking tests and examinations to the Council of Europe's "Common European…

  10. Using the Advanced Progressive Matrices (Set I) to Assess Fluid Ability in a Short Time Frame: An Item Response Theory-Based Analysis

    ERIC Educational Resources Information Center

    Chiesi, Francesca; Ciancaleoni, Matteo; Galli, Silvia; Primi, Caterina

    2012-01-01

    This article is aimed at evaluating the possibility that Set I of the Advanced Progressive Matrices (APM-Set I) can be employed to assess fluid ability in a short time frame. The APM-Set I was administered to a sample of 1,389 primary and secondary school students. Confirmatory factor analysis attested to the unidimensionality of the scale. Item…

  11. Using the Advanced Progressive Matrices (Set I) to Assess Fluid Ability in a Short Time Frame: An Item Response Theory-Based Analysis

    ERIC Educational Resources Information Center

    Chiesi, Francesca; Ciancaleoni, Matteo; Galli, Silvia; Primi, Caterina

    2012-01-01

    This article is aimed at evaluating the possibility that Set I of the Advanced Progressive Matrices (APM-Set I) can be employed to assess fluid ability in a short time frame. The APM-Set I was administered to a sample of 1,389 primary and secondary school students. Confirmatory factor analysis attested to the unidimensionality of the scale. Item…

  12. Relating a Reading Comprehension Test to the CEFR Levels: A Case of Standard Setting in Practice with Focus on Judges and Items

    ERIC Educational Resources Information Center

    Figueras, Neus; Kaftandjieva, Felianka; Takala, Sauli

    2013-01-01

    The article addresses some problems and options in setting standards on language tests and examinations. More specifically, it reports on a set of three workshops conducted in the European context where standard setting in language education typically concerns linking tests and examinations to the Council of Europe's "Common European…

  13. The Unique Correspondence of the Item Response Function and the Item Category Response Functions in Polytomously Scored Item Response Models.

    ERIC Educational Resources Information Center

    Chang, Hua-Hua; Mazzeo, John

    1994-01-01

    This paper establishes the correspondence between an item response function and a unique set of item category response functions for the partial credit model and the graded response model. Theoretical and practical implications are discussed. (SLD)

  14. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    PubMed Central

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants

  15. Application of Optimal Designs to Item Calibration

    PubMed Central

    Lu, Hung-Yi

    2014-01-01

    In computerized adaptive testing (CAT), examinees are presented with various sets of items chosen from a precalibrated item pool. Consequently, the attrition speed of the items is extremely fast, and replenishing the item pool is essential. Therefore, item calibration has become a crucial concern in maintaining item banks. In this study, a two-parameter logistic model is used. We applied optimal designs and adaptive sequential analysis to solve this item calibration problem. The results indicated that the proposed optimal designs are cost effective and time efficient. PMID:25188318

  16. Item Difficulty Modeling of Paragraph Comprehension Items

    ERIC Educational Resources Information Center

    Gorin, Joanna S.; Embretson, Susan E.

    2006-01-01

    Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

  17. Item Difficulty Modeling of Paragraph Comprehension Items

    ERIC Educational Resources Information Center

    Gorin, Joanna S.; Embretson, Susan E.

    2006-01-01

    Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

  18. Distractor Similarity and Item-Stem Structure: Effects on Item Difficulty

    ERIC Educational Resources Information Center

    Ascalon, M. Evelina; Meyers, Lawrence S.; Davis, Bruce W.; Smits, Niels

    2007-01-01

    This article examined two item-writing guidelines: the format of the item stem and homogeneity of the answer set. Answering the call of Haladyna, Downing, and Rodriguez (2002) for empirical tests of item writing guidelines and extending the work of Smith and Smith (1988) on differential use of item characteristics, a mock multiple-choice driver's…

  19. Creating alliances for disease management in industrial settings: a case study of HIV/AIDS in workers in South African gold mines.

    PubMed

    Williams, B; Campbell, C

    1998-01-01

    The epidemic of HIV/AIDS is at an advanced stage in many African countries, but little attention has been given to the impact that this will have in industrial settings. Using the Southern African mining industry as a case study, the authors consider the state of the HIV epidemic and discuss programs that have been undertaken to manage HIV. They critically analyze the reasons current interventions have had little impact on HIV among mine workers, tracing the lack of success to neglect of the social and community contexts within which HIV transmission takes place, as well as the lack of attention to the psychosocial processes and mechanisms underlying disease transmission. Finally, they present an intervention that aims to address the limitations of existing industrial programs and improve the management of sexually transmitted diseases (STDs), including HIV, in a particular occupational setting through creating alliances between a wide range of community stakeholders. The intervention aims not only to reduce STDs, promote awareness of HIV risks, and distribute condoms, as existing programs have done, but also to address the broader social, cultural, and community contexts that facilitate HIV transmission.

  20. The Unique Correspondence of Item Response Functions and Item Category Response Functions in Polytomously Scored Item Response Models.

    ERIC Educational Resources Information Center

    Chang, Hua-Hua; Mazzeo, John

    The item response function (IRF) for a polytomously scored item is defined as a weighted sum of the item category response functions (ICRF, the probability of getting a particular score for a randomly sampled examinee of ability theta). This paper establishes the correspondence between an IRF and a unique set of ICRFs for two of the most commonly…

  1. SiBIC: a web server for generating gene set networks based on biclusters obtained by maximal frequent itemset mining.

    PubMed

    Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2013-01-01

    Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp.

  2. WWC Review of the Report "Accommodations for English Language Learner Students: The Effect of Linguistic Modification of Math Test Item Sets"

    ERIC Educational Resources Information Center

    What Works Clearinghouse, 2012

    2012-01-01

    The research described in this report is a randomized controlled trial in which seventh- and eighth-grade students were randomly assigned to complete a set of 25 math questions delivered with either standard language or language that had undergone "linguistic modification" by the research team. The purpose of the study was to assess the…

  3. The Development and Validation of a Set of Mathematical Problem-Solving Superitems. Executive Summary of the NIE/ECS Item Development Project.

    ERIC Educational Resources Information Center

    Romberg, Thomas A.

    This document describes seven steps that were followed to develop and validate a pool of mathematical problem-solving situations and a set of questions for each situation which were designed to provide information about students' qualitatively different levels of reasoning ability. For each stage a description is presented of the work that was…

  4. Automatic Item Generation of Probability Word Problems

    ERIC Educational Resources Information Center

    Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina

    2009-01-01

    Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

  5. Automatic Item Generation of Probability Word Problems

    ERIC Educational Resources Information Center

    Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina

    2009-01-01

    Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

  6. Computerized Numerical Control Test Item Bank.

    ERIC Educational Resources Information Center

    Reneau, Fred; And Others

    This guide contains 285 test items for use in teaching a course in computerized numerical control. All test items were reviewed, revised, and validated by incumbent workers and subject matter instructors. Items are provided for assessing student achievement in such aspects of programming and planning, setting up, and operating machines with…

  7. Association Relation Mining in Multimedia Data

    NASA Astrophysics Data System (ADS)

    Wei, Wei; Ye, Bin; Li, Qing

    In media content analysis, the semantic concepts are often treated as a separate class label. However, many visual semantic concepts are semantically related and so class labels are correlated. This paper presents a mining method for media semantic concept. Semantic frequent label set are selected when their occurrence frequency of several labels in one shot is greater than or equal to the support threshold. Then, semantic frequent item sets are used to generate association dependency relation. The correlations of more than two concepts could be directly obtained by this method. Finally, the mining results of association relation are applied to improve classification performance for concepts. Experiments on real-world media data show that the method can effectively extract potential association and dependency relations among semantic concepts.

  8. Quantum algorithm for association rules mining

    NASA Astrophysics Data System (ADS)

    Yu, Chao-Hua; Gao, Fei; Wang, Qing-Le; Wen, Qiao-Yan

    2016-10-01

    Association rules mining (ARM) is one of the most important problems in knowledge discovery and data mining. Given a transaction database that has a large number of transactions and items, the task of ARM is to acquire consumption habits of customers by discovering the relationships between itemsets (sets of items). In this paper, we address ARM in the quantum settings and propose a quantum algorithm for the key part of ARM, finding frequent itemsets from the candidate itemsets and acquiring their supports. Specifically, for the case in which there are Mf(k ) frequent k -itemsets in the Mc(k ) candidate k -itemsets (Mf(k )≤Mc(k ) ), our algorithm can efficiently mine these frequent k -itemsets and estimate their supports by using parallel amplitude estimation and amplitude amplification with complexity O (k/√{Mc(k )Mf(k ) } ɛ ) , where ɛ is the error for estimating the supports. Compared with the classical counterpart, i.e., the classical sampling-based algorithm, whose complexity is O (k/Mc(k ) ɛ2) , our quantum algorithm quadratically improves the dependence on both ɛ and Mc(k ) in the best case when Mf(k )≪Mc(k ) and on ɛ alone in the worst case when Mf(k )≈Mc(k ) .

  9. Validation of the Self Reporting Questionnaire 20-Item (SRQ-20) for Use in a Low- and Middle-Income Country Emergency Centre Setting.

    PubMed

    van der Westhuizen, Claire; Wyatt, Gail; Williams, John K; Stein, Dan J; Sorsdahl, Katherine

    2016-02-01

    Common mental disorders are highly prevalent in emergency centre (EC) patients, yet few brief screening tools have been validated for low- and middle-income country (LMIC) ECs. This study explored the psychometric properties of the SRQ-20 screening tool in South African ECs using the Mini Neuropsychiatric Interview (MINI) as the gold standard comparison tool. Patients (n=200) from two ECs in Cape Town, South Africa were interviewed using the SRQ-20 and the MINI. Internal consistency, screening properties and factorial validity were examined. The SRQ-20 was effective in identifying participants with major depression, anxiety disorders or suicidality and displayed good internal consistency. The optimal cutoff scores were 4/5 and 6/7 for men and women respectively. The factor structure differed by gender. The SRQ-20 is a useful tool for EC settings in South Africa and holds promise for task-shifted approaches to decreasing the LMIC burden of mental disorders.

  10. Geologic setting and water quality of selected basins in the active coal-mining areas of Ohio, 1987-88

    USGS Publications Warehouse

    Sedam, A.C.

    1991-01-01

    This report presents hydrologic data from selected drainage basins in the active coal-mining areas of Ohio from July 1987 through October 1988. The study area is mostly within the unglaciated part of eastern Ohio along the western edge of the Appalachian Plateaus physiographic province. The 1987-88 work is the second phase of a 7-year study to assess baseline water quality in Ohio's coal region. The data collection network consisted of 41 long-term surface-water sites in 21 basins. The sites were measured and sampled twice yearly at low flow. In addition, six individual basins (three each year) selected for a more detailed representation of surface-water and ground-water quality. In 1987, the Sandy Creek, Middle Tuscarawas River and Sugar Creek, and Lower Tuscarawas River basins were chosen. In 1988, the Short and Wheeling Creeks, Upper Wills Creek, and Upper Raccoon Creek basins were chosen. Because of their proximity to the glaciated region and outwash drainage, the basins studied intensively in 1987 contain more shallow productive aquifers than do the basins studied in detail for 1988, in which shallow ground-water sources are very localized. Chemical analyses for 202 surface-water and 24 ground-water samples are presented. For field measurements made at surface-water sites, the specific conductance ranged from 295 to 3150 ? S/cm (microsiemens per centimeter at 25 degrees Celsius). For pH, the range was 2.8 to 8.6. Alkalinity ranged from 5 to 305 mg/L (milligrams per liter) as CaCO3.

  11. Mining environmental high-throughput sequence data sets to identify divergent amplicon clusters for phylogenetic reconstruction and morphotype visualization.

    PubMed

    Gimmler, Anna; Stoeck, Thorsten

    2015-08-01

    Environmental high-throughput sequencing (envHTS) is a very powerful tool, which in protistan ecology is predominantly used for the exploration of diversity and its geographic and local patterns. We here used a pyrosequenced V4-SSU rDNA data set from a solar saltern pond as test case to exploit such massive protistan amplicon data sets beyond this descriptive purpose. Therefore, we combined a Swarm-based blastn network including 11 579 ciliate V4 amplicons to identify divergent amplicon clusters with targeted polymerase chain reaction (PCR) primer design for full-length small subunit of the ribosomal DNA retrieval and probe design for fluorescence in situ hybridization (FISH). This powerful strategy allows to benefit from envHTS data sets to (i) reveal the phylogenetic position of the taxon behind divergent amplicons; (ii) improve phylogenetic resolution and evolutionary history of specific taxon groups; (iii) solidly assess an amplicons (species') degree of similarity to its closest described relative; (iv) visualize the morphotype behind a divergent amplicons cluster; (v) rapidly FISH screen many environmental samples for geographic/habitat distribution and abundances of the respective organism and (vi) to monitor the success of enrichment strategies in live samples for cultivation and isolation of the respective organisms.

  12. Predictors of sinus rhythm after electrical cardioversion of atrial fibrillation: results from a data mining project on the Flec-SL trial data set.

    PubMed

    Oto, Emre; Okutucu, Sercan; Katircioglu-Öztürk, Deniz; Güvenir, Halil Altay; Karaagaoglu, Ergun; Borggrefe, Martin; Breithardt, Günter; Goette, Andreas; Ravens, Ursula; Steinbeck, Gerhard; Wegscheider, Karl; Oto, Ali; Kirchhof, Paulus

    2017-06-01

    Data mining is the computational process to obtain information from a data set and transform it for further use. Herein, through data mining with supportive statistical analyses, we identified and consolidated variables of the Flecainide Short-Long (Flec-SL-AFNET 3) trial dataset that are associated with the primary outcome of the trial, recurrence of persistent atrial fibrillation (AF) or death. The 'Ranking Instances by Maximizing the Area under the ROC Curve' (RIMARC) algorithm was applied to build a classifier that can predict the primary outcome by using variables in the Flec-SL dataset. The primary outcome was time to persistent AF or death. The RIMARC algorithm calculated the predictive weights of each variable in the Flec-SL dataset for the primary outcome. Among the initial 21 parameters, 6 variables were identified by the RIMARC algorithm. In univariate Cox regression analysis of these variables, increased heart rate during AF and successful pharmacological conversion (PC) to sinus rhythm (SR) were found to be significant predictors. Multivariate Cox regression analysis revealed successful PC as the single relevant predictor of SR maintenance. The primary outcome risk was 3.14 times (95% CI:1.7-5.81) lower in those who had successful PC to SR than those who needed electrical cardioversion. Pharmacological conversion of persistent AF with flecainide without the need for electrical cardioversion is a powerful and independent predictor of maintenance of SR. A strategy of flecainide pretreatment for 48 h prior to planned electrical cardioversion may be a useful planning of a strategy of long-term rhythm control.

  13. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    PubMed

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  14. SARANEA: a freely available program to mine structure-activity and structure-selectivity relationship information in compound data sets.

    PubMed

    Lounkine, Eugen; Wawer, Mathias; Wassermann, Anne Mai; Bajorath, Jürgen

    2010-01-01

    We introduce SARANEA, an open-source Java application for interactive exploration of structure-activity relationship (SAR) and structure-selectivity relationship (SSR) information in compound sets of any source. SARANEA integrates various SAR and SSR analysis functions and utilizes a network-like similarity graph data structure for visualization. The program enables the systematic detection of activity and selectivity cliffs and corresponding key compounds across multiple targets. Advanced SAR analysis functions implemented in SARANEA include, among others, layered chemical neighborhood graphs, cliff indices, selectivity trees, editing functions for molecular networks and pathways, bioactivity summaries of key compounds, and markers for bioactive compounds having potential side effects. We report the application of SARANEA to identify SAR and SSR determinants in different sets of serine protease inhibitors. It is found that key compounds can influence SARs and SSRs in rather different ways. Such compounds and their SAR/SSR characteristics can be systematically identified and explored using SARANEA. The program and source code are made freely available under the GNU General Public License.

  15. Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

    ERIC Educational Resources Information Center

    Arce-Ferrer, Alvaro J.; Bulut, Okan

    2017-01-01

    This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

  16. Dissecting Item Misfit on Vocabulary Items.

    ERIC Educational Resources Information Center

    Gershon, Richard C.

    The Johnson O'Connor Research Foundation, which produces vocabulary instructional materials for test takers, is in the process of determining the difficulty values of nontechnical words in the English language. To this end, the Foundation writes test items for vocabulary words and tests them in schools. The items are then calibrated using the…

  17. Approximation Preserving Reductions among Item Pricing Problems

    NASA Astrophysics Data System (ADS)

    Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei

    When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.

  18. Item-Writing Guidelines for Physics

    NASA Astrophysics Data System (ADS)

    Regan, Tom

    2015-11-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines—lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly discussed here. The teacher may also encounter some guidelines that are not applicable in all assessment settings, or that can be applied to physics only with care. Two such guidelines, and their limitations, will be discussed as well.

  19. The Licensing of Negative Sensitive Items in Jordanian Arabic

    ERIC Educational Resources Information Center

    Alsarayreh, Atef

    2012-01-01

    This study investigates the licensing conditions on Negative Sensitive Items (NSIs) in Jordanian Arabic (JA). JA exhibits both types of NSIs that are discussed in the literature: Negative Polarity Items (NPIs) and Negative Concord Items (NCIs). Although these two sets of items seem to form a natural class in the sense that they show certain…

  20. Utilizing Response Time Distributions for Item Selection in CAT

    ERIC Educational Resources Information Center

    Fan, Zhewen; Wang, Chun; Chang, Hua-Hua; Douglas, Jeffrey

    2012-01-01

    Traditional methods for item selection in computerized adaptive testing only focus on item information without taking into consideration the time required to answer an item. As a result, some examinees may receive a set of items that take a very long time to finish, and information is not accrued as efficiently as possible. The authors propose two…

  1. The Licensing of Negative Sensitive Items in Jordanian Arabic

    ERIC Educational Resources Information Center

    Alsarayreh, Atef

    2012-01-01

    This study investigates the licensing conditions on Negative Sensitive Items (NSIs) in Jordanian Arabic (JA). JA exhibits both types of NSIs that are discussed in the literature: Negative Polarity Items (NPIs) and Negative Concord Items (NCIs). Although these two sets of items seem to form a natural class in the sense that they show certain…

  2. Utilizing Response Time Distributions for Item Selection in CAT

    ERIC Educational Resources Information Center

    Fan, Zhewen; Wang, Chun; Chang, Hua-Hua; Douglas, Jeffrey

    2012-01-01

    Traditional methods for item selection in computerized adaptive testing only focus on item information without taking into consideration the time required to answer an item. As a result, some examinees may receive a set of items that take a very long time to finish, and information is not accrued as efficiently as possible. The authors propose two…

  3. Bayesian Analysis of Item Response Curves.

    DTIC Science & Technology

    1984-07-01

    response.a are studied from a Bayesian viewpoint of estimating the item parameters. For the two-parameter logistic model with normally distributed ability...estimating the item parameters. For the two-parameter logistic model with normally distributed ability, restricted bivariate beta priors are used to...responses, Bayesian estimation, EM algorithm. 3 Introduction We will consider dichotomous responses to a set of test items which are designed to measure the

  4. Item Purification Does Not Always Improve DIF Detection: A Counterexample with Angoff's Delta Plot

    ERIC Educational Resources Information Center

    Magis, David; Facon, Bruno

    2013-01-01

    Item purification is an iterative process that is often advocated as improving the identification of items affected by differential item functioning (DIF). With test-score-based DIF detection methods, item purification iteratively removes the items currently flagged as DIF from the test scores to get purified sets of items, unaffected by DIF. The…

  5. Item Purification Does Not Always Improve DIF Detection: A Counterexample with Angoff's Delta Plot

    ERIC Educational Resources Information Center

    Magis, David; Facon, Bruno

    2013-01-01

    Item purification is an iterative process that is often advocated as improving the identification of items affected by differential item functioning (DIF). With test-score-based DIF detection methods, item purification iteratively removes the items currently flagged as DIF from the test scores to get purified sets of items, unaffected by DIF. The…

  6. Trajectory Pattern Mining Using Sequential Pattern Mining and K-Means for Predicting Future Location

    NASA Astrophysics Data System (ADS)

    Kautsar, G.; Akbar, S.

    2017-01-01

    Sequential pattern mining is a method used to find patterns while concerning the sequence of an item set. Sequential pattern mining can be used to find trajectory patterns in moving object data. To implement it in the real life, the spatial attribute of the data needs to be generalized/grouped. In this paper, K-Means is used to group the spatial attribute. In order to group the spatial attribute, the temporal attribute is also considered to see how the patterns are related to time. The resulting trajectory patterns are then used to visualize the habit of the moving object. Therefore, trajectory patterns are used as the reference in this paper to predict the future location of the object. Predicting the future location of the object is performed using the movement history of the object. Result of this research is trajectory pattern which repeat at certain time duration according to its data characteristics.

  7. Simulated Equating Using Several Item Response Curves.

    ERIC Educational Resources Information Center

    Boldt, R. F.

    The comparison of item response theory models for the Test of English as a Foreign Language (TOEFL) was extended to an equating context as simulation trials were used to "equate the test to itself." Equating sample data were generated from administration of identical item sets. Equatings that used procedures based on each model (simple…

  8. Urban Combat Data Mining

    DTIC Science & Technology

    2004-12-01

    We describe an approach and its implementation involving simulation and data mining for improved understanding of the potential relationships among battle parameters and battle outcomes in an urban setting.

  9. 12 CFR 210.12 - Return of cash items and handling of returned checks.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... under § 229.2(aaa) of this chapter; and the item conforms to the technical standards for an electronic...(aaa) of this chapter; and the item conforms with the technical standards for an electronic item set...(aaa) of this chapter; and the item conforms with the technical standards for an electronic item set...

  10. The diatom-inferred pH reconstructions for a naturally neutralized pit lake in south-west Poland using the Mining and the Combined pH training sets.

    PubMed

    Sienkiewicz, Elwira; Gąsiorowski, Michał

    2017-12-15

    Many artificial lakes in various parts of the world were formed as a result of the exploitation of lignite and other mineral deposits. Usually, lakes created after the end of lignite mine activity have extremely acidic or very acidic water, with pH<3.0, caused by the oxidation of pyrite and marcasite and other sulfide compounds. After the closure of mines, the lakes undergo natural neutralization. However, this process is long-term and depends on many different factors, such as the amount of carbonates, the geological structure of bedrock, hydrology and hydrobiology, climate and the method of lignite exploitation. In some types of anthropogenic lakes (i.e., clay pits), water pH is almost neutral or alkaline due to the adsorption properties of clay minerals. Along the Polish-German border in the Łuk Mużakowa Geopark, over 100 post-mining lakes were formed as a result of lignite and clay exploitation. The Mining pH training set of modern diatom samples and water chemistry was created from 69 lakes to quantify the relationship between the diatom assemblages living in these water bodies and the lakes' chemical composition. Then, this database was applied to the reconstruction of diatom-inferred pH in the sediments of a post-mining lake TR-33. The results of reconstruction was compared to similar analysis based on the Combined pH training set containing more variety of diatom taxa from a higher number of lakes. Although the second dataset was larger and gave a smaller root mean square error, the Mining pH training set has a longer gradient of pH including lakes with extremely acidic water and gives more reliable results. The reconstruction of DI-pH in TR-33 which had very acidic water in the past and currently neutralized from acidification was more precise and reliable using the database sensu stricto, which consists of only the post-mining lakes. Copyright © 2017 Elsevier B.V. All rights reserved.

  11. Privacy-Preserving Collaborative Data Mining

    DTIC Science & Technology

    2003-01-01

    Privacy-preserving data mining is an important issue in the areas of data mining and security. In this paper, we study how to conduct association...rule mining, one of the core data mining techniques, on private data in the following scenario: Multiple parties, each having a private data set, want to

  12. Writing better test items.

    PubMed

    Aucoin, Julia W

    2005-01-01

    Professional development specialists have had little opportunity to learn how to write test items to meet the expectations of today's graduate nurse. Schools of nursing have moved away from knowledge-level test items and have had to develop more application and analysis items to prepare graduates for the National Council Licensure Examination (NCLEX). This same type of question can be used effectively to support a competence assessment system and document critical thinking skills.

  13. 48 CFR 852.214-72 - Alternate item(s).

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 48 Federal Acquisition Regulations System 5 2011-10-01 2011-10-01 false Alternate item(s). 852.214-72 Section 852.214-72 Federal Acquisition Regulations System DEPARTMENT OF VETERANS AFFAIRS CLAUSES... Alternate item(s). As prescribed in 814.201-6(b)(2), insert the following provision: Alternate Item(s) (JAN...

  14. 48 CFR 852.214-72 - Alternate item(s).

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 48 Federal Acquisition Regulations System 5 2014-10-01 2014-10-01 false Alternate item(s). 852.214-72 Section 852.214-72 Federal Acquisition Regulations System DEPARTMENT OF VETERANS AFFAIRS CLAUSES... Alternate item(s). As prescribed in 814.201-6(b)(2), insert the following provision: Alternate Item(s) (JAN...

  15. Longwall mining

    SciTech Connect

    1995-03-14

    As part of EIA`s program to provide information on coal, this report, Longwall-Mining, describes longwall mining and compares it with other underground mining methods. Using data from EIA and private sector surveys, the report describes major changes in the geologic, technological, and operating characteristics of longwall mining over the past decade. Most important, the report shows how these changes led to dramatic improvements in longwall mining productivity. For readers interested in the history of longwall mining and greater detail on recent developments affecting longwall mining, the report includes a bibliography.

  16. EnvMine: a text-mining system for the automatic extraction of contextual information.

    PubMed

    Tamames, Javier; de Lorenzo, Victor

    2010-06-01

    For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would be difficult to do otherwise. Also the characterization must include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles). So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieve contextual information (physicochemical variables and geographical locations) from textual sources of any kind. EnvMine is capable of retrieving the physicochemical variables cited in the text, by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved) of 92% with less than 1% error. Also a Bayesian classifier was tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings.Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location includes also the determination of its exact coordinates (latitude and longitude), thus allowing the calculation of distance between the individual locations. EnvMine is a very efficient method for extracting contextual information from different text sources, like published articles or web pages. This tool can help in determining the precise location and physicochemical variables of sampling sites, thus facilitating the performance

  17. EnvMine: A text-mining system for the automatic extraction of contextual information

    PubMed Central

    2010-01-01

    Background For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would be difficult to do otherwise. Also the characterization must include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles). So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieve contextual information (physicochemical variables and geographical locations) from textual sources of any kind. Results EnvMine is capable of retrieving the physicochemical variables cited in the text, by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved) of 92% with less than 1% error. Also a Bayesian classifier was tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings. Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location includes also the determination of its exact coordinates (latitude and longitude), thus allowing the calculation of distance between the individual locations. Conclusion EnvMine is a very efficient method for extracting contextual information from different text sources, like published articles or web pages. This tool can help in determining the precise location and physicochemical variables of sampling sites, thus

  18. The Use of Unidimensional Item Parameter Estimates of Multidimensional Items in Adaptive Testing.

    ERIC Educational Resources Information Center

    Ackerman, Terry A.

    The purpose of this study was to investigate the effect of using multidimensional items in a computer adaptive test (CAT) setting which assumes a unidimensional item response theory (IRT) framework. Previous research has suggested that the composite of multidimensional abilities being estimated by a unidimensional IRT model is not constant…

  19. Evaluating Common Item Block Options When Faced with Practical Constraints

    ERIC Educational Resources Information Center

    Wolkowitz, Amanda; Davis-Becker, Susan

    2015-01-01

    This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…

  20. A Lognormal Model for Response Times on Test Items

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2006-01-01

    A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the two-parameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity and discriminating power of each item. It is…

  1. The Effects of Testlets on Reliability and Differential Item Functioning

    ERIC Educational Resources Information Center

    Teker, Gulsen Tasdelen; Dogan, Nuri

    2015-01-01

    Reliability and differential item functioning (DIF) analyses were conducted on testlets displaying local item dependence in this study. The data set employed in the research was obtained from the answers given by 1,500 students to the 20 items included in six testlets given in English Proficiency Exam by the School of Foreign Languages of a state…

  2. A Lognormal Model for Response Times on Test Items

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2006-01-01

    A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the two-parameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity and discriminating power of each item. It is…

  3. Investigating Factors that Influence Item Performance on ACS Exams

    ERIC Educational Resources Information Center

    Schroeder, Jacob; Murphy, Kristen L.; Holme, Thomas A.

    2012-01-01

    General chemistry tests from the Examinations Institute of the Division of Chemical Education of the American Chemical Society have been analyzed to identify factors that may influence how individual test items perform. In this paper, issues of item order (position within a set of items that comprise a test) and answer order (position of correct…

  4. Mokken Scale Analysis for Dichotomous Items Using Marginal Models

    ERIC Educational Resources Information Center

    van der Ark, L. Andries; Croon, Marcel A.; Sijtsma, Klaas

    2008-01-01

    Scalability coefficients play an important role in Mokken scale analysis. For a set of items, scalability coefficients have been defined for each pair of items, for each individual item, and for the entire scale. Hypothesis testing with respect to these scalability coefficients has not been fully developed. This study introduces marginal modelling…

  5. Developing a Common Metric in Item Response Theory.

    ERIC Educational Resources Information Center

    Stocking, Martha L.; Lord, Frederic M.

    A common problem arises in scale transformation when independent estimates of item parameters from two separate data sets must be expressed in the same metric. These item parameter estimates will be different because the metric or scale defined by each independent calibration of the items is different. The problem is frequently confronted in…

  6. Research on parallel algorithm for sequential pattern mining

    NASA Astrophysics Data System (ADS)

    Zhou, Lijuan; Qin, Bai; Wang, Yu; Hao, Zhongxiao

    2008-03-01

    Sequential pattern mining is the mining of frequent sequences related to time or other orders from the sequence database. Its initial motivation is to discover the laws of customer purchasing in a time section by finding the frequent sequences. In recent years, sequential pattern mining has become an important direction of data mining, and its application field has not been confined to the business database and has extended to new data sources such as Web and advanced science fields such as DNA analysis. The data of sequential pattern mining has characteristics as follows: mass data amount and distributed storage. Most existing sequential pattern mining algorithms haven't considered the above-mentioned characteristics synthetically. According to the traits mentioned above and combining the parallel theory, this paper puts forward a new distributed parallel algorithm SPP(Sequential Pattern Parallel). The algorithm abides by the principal of pattern reduction and utilizes the divide-and-conquer strategy for parallelization. The first parallel task is to construct frequent item sets applying frequent concept and search space partition theory and the second task is to structure frequent sequences using the depth-first search method at each processor. The algorithm only needs to access the database twice and doesn't generate the candidated sequences, which abates the access time and improves the mining efficiency. Based on the random data generation procedure and different information structure designed, this paper simulated the SPP algorithm in a concrete parallel environment and implemented the AprioriAll algorithm. The experiments demonstrate that compared with AprioriAll, the SPP algorithm had excellent speedup factor and efficiency.

  7. Novelty and Promotional Items

    EPA Pesticide Factsheets

    Small novelty or promotional products, primarily used for outreach and educational purposes, must effectively convey a message, and their purchase will only be allowed if the item will contribute to the accomplishment of the Agency's mission.

  8. Web Mining

    NASA Astrophysics Data System (ADS)

    Fürnkranz, Johannes

    The World-Wide Web provides every internet citizen with access to an abundance of information, but it becomes increasingly difficult to identify the relevant pieces of information. Research in web mining tries to address this problem by applying techniques from data mining and machine learning to Web data and documents. This chapter provides a brief overview of web mining techniques and research areas, most notably hypertext classification, wrapper induction, recommender systems and web usage mining.

  9. Text Mining.

    ERIC Educational Resources Information Center

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  10. Data Mining.

    ERIC Educational Resources Information Center

    Benoit, Gerald

    2002-01-01

    Discusses data mining (DM) and knowledge discovery in databases (KDD), taking the view that KDD is the larger view of the entire process, with DM emphasizing the cleaning, warehousing, mining, and visualization of knowledge discovery in databases. Highlights include algorithms; users; the Internet; text mining; and information extraction.…

  11. Surface mining

    Treesearch

    Robert Leopold; Bruce Rowland; Reed Stalder

    1979-01-01

    The surface mining process consists of four phases: (1) exploration; (2) development; (3) production; and (4) reclamation. A variety of surface mining methods has been developed, including strip mining, auger, area strip, open pit, dredging, and hydraulic. Sound planning and design techniques are essential to implement alternatives to meet the myriad of laws,...

  12. Data Mining.

    ERIC Educational Resources Information Center

    Benoit, Gerald

    2002-01-01

    Discusses data mining (DM) and knowledge discovery in databases (KDD), taking the view that KDD is the larger view of the entire process, with DM emphasizing the cleaning, warehousing, mining, and visualization of knowledge discovery in databases. Highlights include algorithms; users; the Internet; text mining; and information extraction.…

  13. Item Banking. Basic Testing Series.

    ERIC Educational Resources Information Center

    Childs, Roy

    This pamphlet describes the exciting potential of item banking--a new approach to testing which combines both comparability of scores with flexibility of test format. Item banks are collections of items where the characteristics of each item is known and these characteristics can be summated to described a test made from such items. The principle…

  14. Evolution of a Test Item

    ERIC Educational Resources Information Center

    Spaan, Mary

    2007-01-01

    This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…

  15. PROMIS Pediatric Pain Interference Scale: An Item Response Theory Analysis of the Pediatric Pain Item Bank

    PubMed Central

    Varni, James W.; Stucky, Brian D.; Thissen, David; DeWitt, Esi Morgan; Irwin, Debra E.; Lai, Jin-Shei; Yeatts, Karin; DeWalt, Darren A.

    2011-01-01

    An aim of the National Institutes of Health (NIH) Patient Reported Outcomes Measurement Information System (PROMIS) initiative is to develop item banks and computerized adaptive tests (CAT) that are applicable across a wide variety of chronic disorders. The PROMIS Pediatric Cooperative Group has concentrated on the development of pediatric self-report item banks for ages 8-17 years. The objective of the present study is to describe the Item Response Theory (IRT) analysis of the NIH PROMIS pediatric pain item bank and the measurement properties of the new unidimensional PROMIS Pediatric Pain Interference Scale. Test forms containing pediatric pain items were completed by a total of 3,048 respondents. IRT analyses regarding scale dimensionality, item local dependence, and differential item functioning were conducted. A pain item pool was developed to yield scores on a T-score scale with a mean of 50 and standard deviation of 10. The recommended 8-item unidimensional short form for the PROMIS Pediatric Pain Interference Scale contains the item set which provides the maximum test information at the mean (50) on the T-score metric. A simulated CAT was computed that provides the most information at five possible score locations (30, 40, 50, 60, and 70 on the T-score metric). PMID:20627819

  16. Speeding-Up Association Rule Mining With Inverted Index Compression.

    PubMed

    Luna, Jose Maria; Cano, Alberto; Pechenizkiy, Mykola; Ventura, Sebastian

    2016-12-01

    The growing interest in data storage has made the data size to be exponentially increased, hampering the process of knowledge discovery from these large volumes of high-dimensional and heterogeneous data. In recent years, many efficient algorithms for mining data associations have been proposed, facing up time and main memory requirements. Nevertheless, this mining process could still become hard when the number of items and records is extremely high. In this paper, the goal is not to propose new efficient algorithms but a new data structure that could be used by a variety of existing algorithms without modifying its original schema. Thus, our aim is to speed up the association rule mining process regardless the algorithm used to this end, enabling the performance of efficient implementations to be enhanced. The structure simplifies, reorganizes, and speeds up the data access by sorting data by means of a shuffling strategy based on the hamming distance, which achieve similar values to be closer, and considering both an inverted index mapping and a run length encoding compression. In the experimental study, we explore the bounds of the algorithms' performance by using a wide number of data sets that comprise either thousands or millions of both items and records. The results demonstrate the utility of the proposed data structure in enhancing the algorithms' runtime orders of magnitude, and substantially reducing both the auxiliary and the main memory requirements.

  17. Item Writer Judgments of Item Difficulty versus Actual Item Difficulty: A Case Study

    ERIC Educational Resources Information Center

    Sydorenko, Tetyana

    2011-01-01

    This study investigates how accurate one item writer can be on item difficulty estimates and whether factors affecting item writer judgments correspond to predictors of actual item difficulty. The items were based on conversational dialogs (presented as videos online) that focus on pragmatic functions. Thirty-five 2nd-, 3rd-, and 4th-year learners…

  18. Apriori Versions Based on MapReduce for Mining Frequent Patterns on Big Data.

    PubMed

    Luna, Jose Maria; Padillo, Francisco; Pechenizkiy, Mykola; Ventura, Sebastian

    2017-09-27

    Pattern mining is one of the most important tasks to extract meaningful and useful information from raw data. This task aims to extract item-sets that represent any type of homogeneity and regularity in data. Although many efficient algorithms have been developed in this regard, the growing interest in data has caused the performance of existing pattern mining techniques to be dropped. The goal of this paper is to propose new efficient pattern mining algorithms to work in big data. To this aim, a series of algorithms based on the MapReduce framework and the Hadoop open-source implementation have been proposed. The proposed algorithms can be divided into three main groups. First, two algorithms [Apriori MapReduce (AprioriMR) and iterative AprioriMR] with no pruning strategy are proposed, which extract any existing item-set in data. Second, two algorithms (space pruning AprioriMR and top AprioriMR) that prune the search space by means of the well-known anti-monotone property are proposed. Finally, a last algorithm (maximal AprioriMR) is also proposed for mining condensed representations of frequent patterns. To test the performance of the proposed algorithms, a varied collection of big data datasets have been considered, comprising up to 3 · 10#x00B9;⁸ transactions and more than 5 million of distinct single-items. The experimental stage includes comparisons against highly efficient and well-known pattern mining algorithms. Results reveal the interest of applying MapReduce versions when complex problems are considered, and also the unsuitability of this paradigm when dealing with small data.

  19. Item response theory - A first approach

    NASA Astrophysics Data System (ADS)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    The Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Responsible Models available to measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related with developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also implied numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (Van der Lindem & Hambleton, 1997). As stated before the Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.

  20. Cytokine profile determined by data-mining analysis set into clusters of non-small-cell lung cancer patients according to prognosis.

    PubMed

    Barrera, L; Montes-Servín, E; Barrera, A; Ramírez-Tirado, L A; Salinas-Parra, F; Bañales-Méndez, J L; Sandoval-Ríos, M; Arrieta, Ó

    2015-02-01

    Immunoregulatory cytokines may play a fundamental role in tumor growth and metastases. Their effects are mediated through complex regulatory networks. Human cytokine profiles could define patient subgroups and represent new potential biomarkers. The aim of this study was to associate a cytokine profile obtained through data mining with the clinical characteristics of patients with advanced non-small-cell lung cancer (NSCLC). We conducted a prospective study of the plasma levels of 14 immunoregulatory cytokines by ELISA and a cytometric bead array assay in 110 NSCLC patients before chemotherapy and 25 control subjects. Cytokine levels and data-mining profiles were associated with clinical, quality of life and pathological outcomes. NSCLC patients had higher levels of interleukin (IL)-6, IL-8, IL-12p70, IL-17a and interferon (IFN)-γ, and lower levels of IL-33 and IL-29 compared with controls. The pro-inflammatory cytokines IL-1b, IL-6 and IL-8 were associated with lower hemoglobin levels, worse functional performance status (Eastern Cooperative Oncology Group, ECOG), fatigue and hyporexia. The anti-inflammatory cytokines IL-4, IL-10 and IL-33 were associated with anorexia and lower body mass index. We identified three clusters of patients according to data-mining analysis with different overall survival (OS; 25.4, 16.8 and 5.09 months, respectively, P = 0.0012). Multivariate analysis showed that ECOG performance status and data-mining clusters were significantly associated with OS (RR 3.59, [95% CI 1.9-6.7], P < 0.001 and 2.2, [1.2-3.8], P = 0.005). Our results provide evidence that complex cytokine networks may be used to identify patient subgroups with different prognoses in advanced NSCLC. These cytokines may represent potential biomarkers, particularly in the immunotherapy era in cancer research. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email

  1. Examining the Impact of Drifted Polytomous Anchor Items on Test Characteristic Curve (TCC) Linking and IRT True Score Equating. Research Report. ETS RR-12-09

    ERIC Educational Resources Information Center

    Li, Yanmei

    2012-01-01

    In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…

  2. The Role of Item Models in Automatic Item Generation

    ERIC Educational Resources Information Center

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  3. The Role of Item Models in Automatic Item Generation

    ERIC Educational Resources Information Center

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  4. Australian Item Bank Program: Handbook for Science Item Bank.

    ERIC Educational Resources Information Center

    Australian Council for Educational Research, Hawthorn.

    This handbook is designed to assist teachers in using the Science Item Bank to construct diagnostic tests and end-of-course achievement tests. The item bank consists of over 2,800 multiple-choice items, and teachers are encouraged to supplement this source of test items with other forms of test questions. Key answers to these questions are arrived…

  5. Investigating an Invariant Item Ordering for Polytomously Scored Items

    ERIC Educational Resources Information Center

    Ligtvoet, Rudy; van der Ark, L. Andries; te Marvelde, Janneke M.; Sijtsma, Klaas

    2010-01-01

    This article discusses the concept of an invariant item ordering (IIO) for polytomously scored items and proposes methods for investigating an IIO in real test data. Method manifest IIO is proposed for assessing whether item response functions intersect. Coefficient H[superscript T] is defined for polytomously scored items. Given that an IIO…

  6. Identifying Unbiased Items for Screening Preschoolers for Disruptive Behavior Problems.

    PubMed

    Studts, Christina R; Polaha, Jodi; van Zyl, Michiel A

    2016-10-25

    OBJECTIVE : Efficient identification and referral to behavioral services are crucial in addressing early-onset disruptive behavior problems. Existing screening instruments for preschoolers are not ideal for pediatric primary care settings serving diverse populations. Eighteen candidate items for a new brief screening instrument were examined to identify those exhibiting measurement bias (i.e., differential item functioning, DIF) by child characteristics. METHOD : Parents/guardians of preschool-aged children (N = 900) from four primary care settings completed two full-length behavioral rating scales. Items measuring disruptive behavior problems were tested for DIF by child race, sex, and socioeconomic status using two approaches: item response theory-based likelihood ratio tests and ordinal logistic regression. RESULTS : Of 18 items, eight were identified with statistically significant DIF by at least one method. CONCLUSIONS : The bias observed in 8 of 18 items made them undesirable for screening diverse populations of children. These items were excluded from the new brief screening tool.

  7. 26 CFR 1.751-1 - Unrealized receivables and inventory items.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ..., also includes potential gain from mining property defined in section 617(f)(2). With respect to each item of partnership mining property so defined, the potential gain is the amount that would be treated... the taxable year, or property held by the partnership primarily for sale to customers in the...

  8. 26 CFR 1.751-1 - Unrealized receivables and inventory items.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ..., also includes potential gain from mining property defined in section 617(f)(2). With respect to each item of partnership mining property so defined, the potential gain is the amount that would be treated... the taxable year, or property held by the partnership primarily for sale to customers in the...

  9. 26 CFR 1.751-1 - Unrealized receivables and inventory items.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ..., also includes potential gain from mining property defined in section 617(f)(2). With respect to each item of partnership mining property so defined, the potential gain is the amount that would be treated... the taxable year, or property held by the partnership primarily for sale to customers in the...

  10. 26 CFR 1.751-1 - Unrealized receivables and inventory items.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ..., also includes potential gain from mining property defined in section 617(f)(2). With respect to each item of partnership mining property so defined, the potential gain is the amount that would be treated... the taxable year, or property held by the partnership primarily for sale to customers in the...

  11. Development of the PROMIS® Coping Expectancies of Smoking Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Stucky, Brian D.; Hansen, Mark; Cai, Li

    2014-01-01

    Objective: Smoking is a coping strategy for many smokers who then have difficulty finding new ways to cope with negative affect when they quit. This paper describes analyses conducted to develop and evaluate item banks for assessing the coping expectancies of smoking for daily and nondaily smokers. Methods: Using data from a large sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning (DIF) analyses (according to gender, age, and ethnicity) to arrive at a unidimensional set of items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) for assessing coping expectancies of smoking. Results: For both daily and nondaily smokers, the unidimensional Coping Expectancies item banks (21 items) are relatively DIF free and are highly reliable (0.96 and 0.97, respectively). A common 4-item SF for daily and nondaily smokers also showed good reliability (0.85). Adaptive tests required an average of 4.3 and 3.7 items for simulated daily and nondaily respondents, respectively, and achieved reliabilities of 0.91 for both when the maximum test length was 10 items. Conclusions: This research provides a new set of items that can be used to reliably assess coping expectancies of smoking, through a SF, CAT, or a tailored set selected for a specific research purpose. PMID:25118227

  12. Extending item response theory to online homework

    NASA Astrophysics Data System (ADS)

    Kortemeyer, Gerd

    2014-06-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination is robust over a wide range with respect to model assumptions and introduced noise. Item difficulty is also robust, but over a narrower range.

  13. Limited life item management

    NASA Technical Reports Server (NTRS)

    Eaglen, R. L.

    1971-01-01

    Plans are available for age-sensitive hardware management. Control plan identifies shelf life or age control requirements for materials considered age sensitive, use sensitive, or time service or shelf life controlled items, and describes methods of arriving at age controls through adherence to detailed specifications.

  14. Reading Current Events Items.

    ERIC Educational Resources Information Center

    Ediger, Marlow

    People who live in a democracy should be well informed of local, state, national, and international happenings. Students should become curious about news items and relate current happenings to the personal self. They must possess skills in word recognition and in diverse kinds of comprehension since reading is an important way to glean current…

  15. Cataloging Green Items

    DTIC Science & Technology

    2010-06-15

    Federal Logistics Information System, Asset Visibility, DOD EMALL , Item Unique Identification Registry, and DRMS Automated Information System... Marketing Service (DRMS) in the CONUS • DLA Customer Interaction Center – 642K Service Tickets in FY09 • U.S. Codification Bureau / International

  16. Development of the PROMIS® Social Motivations for Smoking Item Banks

    PubMed Central

    Shadel, William G.; Edelen, Maria Orlando; Stucky, Brian D.; Kuhfeld, Megan; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Smoking behavior is influenced by social motivations such as the expected social benefits of smoking and the social cues that induce craving. This paper describes development of the PROMIS® Social Motivations for Smoking item banks, which will serve to standardize assessment of these social motivations among daily and nondaily smokers. Methods: Daily (N = 4,201) and nondaily (N =1,183) smokers completed an online survey. Item factor analyses, item response theory analyses, and differential item functioning analyses were conducted to identify a unidimensional set of items for each group. Short forms (SFs) and computer adaptive tests (CATs) were evaluated as tools for more efficiently assessing this construct. Results: A total of 15 items were included in the item banks (9 items common to daily and nondaily smokers, 3 unique to daily, 3 unique to nondaily). Scores based on full item banks are highly reliable (reliability = 0.90–0.91). Additionally, the item banks are strongly unidimensional and perform similarly across gender, age, and race/ethnicity groups. A fixed SF for use with both daily and nondaily smokers consists of 4 items (reliability = 0.80). Results from simulated CATs showed that, on average, fewer than 5 items are needed to assess this construct with adequate precision using the item banks. Conclusions: A new set of items has been identified for assessing the social motivations for smoking in a reliable, standardized manner for daily and nondaily smokers. In addition to using the full item banks, efficient assessment can be achieved by using SFs, employing CATs, or selecting items tailored to specific research or clinical purposes. PMID:25118231

  17. Development of the PROMIS Social Motivations for Smoking item banks.

    PubMed

    Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Kuhfeld, Megan; Hansen, Mark; Cai, Li

    2014-09-01

    Smoking behavior is influenced by social motivations such as the expected social benefits of smoking and the social cues that induce craving. This paper describes development of the PROMIS Social Motivations for Smoking item banks, which will serve to standardize assessment of these social motivations among daily and nondaily smokers. Daily (N = 4,201) and nondaily (N =1,183) smokers completed an online survey. Item factor analyses, item response theory analyses, and differential item functioning analyses were conducted to identify a unidimensional set of items for each group. Short forms (SFs) and computer adaptive tests (CATs) were evaluated as tools for more efficiently assessing this construct. A total of 15 items were included in the item banks (9 items common to daily and nondaily smokers, 3 unique to daily, 3 unique to nondaily). Scores based on full item banks are highly reliable (reliability = 0.90-0.91). Additionally, the item banks are strongly unidimensional and perform similarly across gender, age, and race/ethnicity groups. A fixed SF for use with both daily and nondaily smokers consists of 4 items (reliability = 0.80). Results from simulated CATs showed that, on average, fewer than 5 items are needed to assess this construct with adequate precision using the item banks. A new set of items has been identified for assessing the social motivations for smoking in a reliable, standardized manner for daily and nondaily smokers. In addition to using the full item banks, efficient assessment can be achieved by using SFs, employing CATs, or selecting items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. The Identification of Biased Items.

    ERIC Educational Resources Information Center

    Sinnott, Loraine T.

    A standard method for exploring item bias is the intergroup comparison of item difficulties. This paper describes a refinement and generalization of this technique. In contrast to prior approaches, the proposed method deletes outlying items from the formulation of a criterion for identifying items as deviant. It also extends the mathematical…

  19. Lack of autoantibody induction by mercury exposure in artisanal gold mining settings in Colombia: Findings and a review of the epidemiology literature.

    PubMed

    Sánchez Rodríguez, Luz Helena; Flórez-Vargas, Oscar; Rodríguez-Villamizar, Laura Andrea; Vargas Fiallo, Yolanda; Stashenko, Elena E; Ramírez, Gerardo

    2015-01-01

    Mercury (Hg) has been implicated as an immunotoxicant in experimental animal models, but its role in the induction of human autoimmunity remains unclear due to contradictory findings. Therefore, it has been claimed that it is important to examine other populations in order to clarify the role of Hg in these diseases. The aim of this study was to investigate whether occupational Hg exposure due to artisanal gold mining is associated with the prevalence of autoimmune biomarkers. A cross-sectional study was conducted comparing Hg-exposed gold miners (n = 164) with a control population (n = 127). Hair, blood, and 24-h urine samples were collected for measures of Hg levels, as well as of anti-nuclear antibodies (ANA) and rheumatoid factor (RF). Participants were clinically evaluated by a general practice physician, a rheumatologist, and a toxicologist. Statistically significant differences (p < 0.001) were found between Hg-exposed and non-exposed groups for all Hg biomarkers tested: blood (7.03 versus 2.46 µg Hg/L), urine (3.96 versus 1.48 µg Hg/g creatinine), and hair (0.79 versus 0.39 µg Hg/g). No difference was observed in ANA (cut-off titre of 1:80; PR = 0.93, 95% CI = 0.45-1.90) and RF (cut-off = 30 IU/mL; PR = 0.062, 95% CI = 0.03-1.08) status between the groups. In conclusion, the findings here do not support the hypothesis that Hg exposure due to artisanal gold mining activities had a significant impact on autoantibodies as biomarkers of autoimmune diseases. In a review context, the epidemiological findings were interpreted in light of the conflicting data in the literature about how Hg exposure was linked to development of autoantibodies. Validation of these findings in prospective studies is needed to firmly establish the role of Hg in development of autoimmunity in human populations.

  20. Assessment of differential item functioning.

    PubMed

    Wang, Wen-Chung

    2008-01-01

    This study addresses several important issues in assessment of differential item functioning (DIF). It starts with the definition of DIF, effectiveness of using item fit statistics to detect DIF, and linear modeling of DIF in dichotomous items, polytomous items, facets, and testlet-based items. Because a common metric over groups of test-takers is a prerequisite in DIF assessment, this study reviews three such methods of establishing a common metric: the equal-mean-difficulty method, the all-other-item method, and the constant-item (CI) method. A small simulation demonstrates the superiority of the CI method over the others. As the CI method relies on a correct specification of DIF-free items to serve as anchors, a method of identifying such items is recommended and its effectiveness is illustrated through a simulation. Finally, this study discusses how to assess practical significance of DIF at both item and test levels.

  1. Examining Differential Item Functions of Different Item Ordered Test Forms According to Item Difficulty Levels

    ERIC Educational Resources Information Center

    Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem

    2016-01-01

    The study aims to examine whether differential item function is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…

  2. African mining

    SciTech Connect

    Not Available

    1987-01-01

    This book contains papers presented at a conference addressing the development of the minerals industry in Africa. Topics covered include: A review - past, present and future - of Zimbabwe's mining industry; Geomorphological processes and related mineralization in Tanzania; and Rock mechanics investigations at Mufulira mine, Zambia.

  3. Extending Item Response Theory to Online Homework

    ERIC Educational Resources Information Center

    Kortemeyer, Gerd

    2014-01-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for…

  4. Extending Item Response Theory to Online Homework

    ERIC Educational Resources Information Center

    Kortemeyer, Gerd

    2014-01-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for…

  5. Prevention of retained surgical items.

    PubMed

    Feldman, David L

    2011-01-01

    Reduction in retained surgical items is an important part of any operating room patient-safety effort. Any item used in an operation can result in a retained surgical item, but sponges are the most frequent and the abdomen is the most common location. Retained sponges can cause significant morbidity, and the costs associated with both prevention and treatment of retained surgical items, including legal costs, can be considerable. This review will examine counting, teamwork, radiography, and new technology as methods used to prevent retained surgical items. Even though none of these techniques individually is likely to completely prevent retained surgical items, when used together the numbers can be reduced.

  6. ICCS 2009 User Guide for the International Database. Supplement 5: ICCS 2009 Released Items

    ERIC Educational Resources Information Center

    Brese, Falk; Jung, Michael; Mirazchiyski, Plamen; Schulz, Wolfram; Zuehlke, Olaf

    2011-01-01

    This document contains released items used in the ICCS 2009 main survey. Seven clusters of items were used in the study in a rotated booklet design with three clusters per test booklet. Clusters 1 and 7 comprise the released item set. Cluster 1 comprises items that were newly developed for ICCS 2009 and Cluster 7 is drawn from previously secure…

  7. Component Identification and Item Difficulty of Raven's Matrices Items.

    ERIC Educational Resources Information Center

    Green, Kathy E.; Kluever, Raymond C.

    Item components that might contribute to the difficulty of items on the Raven Colored Progressive Matrices (CPM) and the Standard Progressive Matrices (SPM) were studied. Subjects providing responses to CPM items were 269 children aged 2 years 9 months to 11 years 8 months, most of whom were referred for testing as potentially gifted. A second…

  8. Real and Artificial Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Andrich, David; Hagquist, Curt

    2015-01-01

    Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…

  9. The Impact of Item Characteristics on Item and Scale Validity

    ERIC Educational Resources Information Center

    Johnson, John A.

    2004-01-01

    This study describes the relation between personality items' validities, defined as the items' correlations with acquaintance ratings on the Big 5 personality factors, and other itemmetric properties including ambiguity, syntactic complexity, social desirability, content, and trait indicativity. Five external validity coefficients for each item on…

  10. Item Context Factors Affecting Students' Performance on Mathematics Items

    ERIC Educational Resources Information Center

    Salgado, Felipe Almuna; Stacey, Kaye

    2014-01-01

    This paper reports how the context in which a mathematics item is embedded impacts on students' performance. The performance of Year 10 students on four PISA items was compared with performance on variants with more familiar contexts. Performance was not better when they solved items with more familiar contexts but there was some evidence that…

  11. The Impact of Item Characteristics on Item and Scale Validity

    ERIC Educational Resources Information Center

    Johnson, John A.

    2004-01-01

    This study describes the relation between personality items' validities, defined as the items' correlations with acquaintance ratings on the Big 5 personality factors, and other itemmetric properties including ambiguity, syntactic complexity, social desirability, content, and trait indicativity. Five external validity coefficients for each item on…

  12. IRT Item Parameter Scaling for Developing New Item Pools

    ERIC Educational Resources Information Center

    Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua

    2017-01-01

    Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

  13. Real and Artificial Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Andrich, David; Hagquist, Curt

    2015-01-01

    Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…

  14. IRT Item Parameter Scaling for Developing New Item Pools

    ERIC Educational Resources Information Center

    Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua

    2017-01-01

    Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

  15. Development of the PROMIS® Nicotine Dependence Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Stucky, Brian D.; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. Results: A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Conclusions: Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. PMID:25118226

  16. Development of the PROMIS® Health Expectancies of Smoking Item Banks

    PubMed Central

    Tucker, Joan S.; Shadel, William G.; Stucky, Brian D.; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Smokers’ health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. Results: A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5–6 items adaptively selected from the item banks. Conclusions: Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. PMID:25118229

  17. Determinants and development of a web-based child mortality prediction model in resource-limited settings: A data mining approach.

    PubMed

    Tesfaye, Brook; Atique, Suleman; Elias, Noah; Dibaba, Legesse; Shabbir, Syed-Abdul; Kebede, Mihiretu

    2017-03-01

    Improving child health and reducing child mortality rate are key health priorities in developing countries. This study aimed to identify determinant sand develop, a web-based child mortality prediction model in Ethiopian local language using classification data mining algorithm. Decision tree (using J48 algorithm) and rule induction (using PART algorithm) techniques were applied on 11,654 records of Ethiopian demographic and health survey data. Waikato Environment for Knowledge Analysis (WEKA) for windows version 3.6.8 was used to develop optimal models. 8157 (70%) records were randomly allocated to training group for model building while; the remaining 3496 (30%) records were allocated as the test group for model validation. The validation of the model was assessed using accuracy, sensitivity, specificity and area under Receiver Operating Characteristics (ROC) curve. Using Statistical Package for Social Sciences (SPSS) version 20.0; logistic regressions and Odds Ratio (OR) with 95% Confidence Interval (CI) was used to identify determinants of child mortality. The child mortality rate was 72 deaths per 1000 live births. Breast-feeding (AOR= 1.46, (95% CI [1.22. 1.75]), maternal education (AOR= 1.40, 95% CI [1.11, 1.81]), family planning (AOR= 1.21, [1.08, 1.43]), preceding birth interval (AOR= 4.90, [2.94, 8.15]), presence of diarrhea (AOR= 1.54, 95% CI [1.32, 1.66]), father's education (AOR= 1.4, 95% CI [1.04, 1.78]), low birth weight (AOR= 1.2, 95% CI [0.98, 1.51]) and, age of the mother at first birth (AOR= 1.42, [1.01-1.89]) were found to be determinants for child mortality. The J48 model had better performance, accuracy (94.3%), sensitivity (93.8%), specificity (94.3%), Positive Predictive Value (PPV) (92.2%), Negative Predictive Value (NPV) (94.5%) and, the area under ROC (94.8%). Subsequent to developing an optimal prediction model, we relied on this model to develop a web-based application system for child mortality prediction. In this study

  18. Eagle Mine

    EPA Pesticide Factsheets

    This Web page contains Eagle Mine Superfund site information, site description, site risk, cleanup progress, community involvement, reuse, land use controls, five-year reviews, site documents, contacts and links.

  19. Applying Association Rule of the Data Mining Method for the Network Event Analysis

    NASA Astrophysics Data System (ADS)

    Kim, Wankyung; Soh, Wooyoung

    2007-12-01

    Network event analysis gives useful information on the network status that helps protect from attacks. It involves finding sets of frequently used packet information such as IP addresses and requires real-time processing by its nature. This paper applies association rules to network event analysis. Originally association rules used for data mining can be applied to find frequent item sets. So, if frequent items occur on networks, information system can guess that there is a threat. But existed association rules such as Apriori algorithm are not suitable for analyzing network events on real-time due to the high usage of CPU and memory and thus low processing speed. This paper develops a network event audit module by applying association rules to network events using a new algorithm instead of Apriori algorithm. Test results show that the application of the new algorithm gives drastically low usage of both CPU and memory for network event analysis compared with existing Apriori algorithm.

  20. Methodological Issues in Using the Rasch Model To Select Cross Culturally Equivalent Items in Order To Develop a Quality of Life Index: The Analysis of Four WHOQOL-100 Data Sets (Argentina, France, Hong Kong, United Kingdom).

    ERIC Educational Resources Information Center

    Leplege, Alain; Ecosse, Emmanuel

    2000-01-01

    Describes the source instrument, the database, and the experimental methodology that have been used in developing the World Health Organization Quality of Life (WHOQOL) profiles to allow the comparison of quality of life findings from a variety of cultural settings. Illustrates the precautions necessary in interpreting scores from different…

  1. Real-time intelligent decision making with data mining

    NASA Astrophysics Data System (ADS)

    Gupta, Deepak P.; Gopalakrishnan, Bhaskaran

    2004-03-01

    Database mining, widely known as knowledge discovery and data mining (KDD), has attracted lot of attention in recent years. With the rapid growth of databases in commercial, industrial, administrative and other applications, it is necessary and interesting to extract knowledge automatically from huge amount of data. Almost all the organizations are generating data and information at an unprecedented rate and they need to get some useful information from this data. Data mining is the extraction of non-trivial, previously unknown and potentially useful patterns, trends, dependence and correlation known as association rules among data values in large databases. In last ten to fifteen years, data mining spread out from one company to the other to help them understand more about customers' aspect of quality and response and also distinguish the customers they want from those they do not. A credit-card company found that customers who complete their applications in pencil rather than pen are more likely to default. There is a program that identifies callers by purchase history. The bigger the spender, the quicker the call will be answered. If you feel your call is being answered in the order in which it was received, think again. Many algorithms assume that data is static in nature and mine the rules and relations in that data. But for a dynamic database e.g. in most of the manufacturing industries, the rules and relations thus developed among the variables/items no longer hold true. A simple approach may be to mine the associations among the variables after every fixed period of time. But again, how much the length of this period should be, is a question to be answered. The next problem with the static data mining is that some of the relationships that might be of interest from one period to the other may be lost after a new set of data is used. To reflect the effect of new data set and current status of the association rules where some of the strong rules might become

  2. Microcomputer keeps watch at Emerald Mine

    SciTech Connect

    Not Available

    1987-04-01

    This paper reviews the computerized mine monitoring system set up at the Emerald Mine, SW Pennsylvania, USA. This coal mine has pioneered the automation of many production and safety features and this article covers their work in fire detection and conveyor belt monitoring. A central computer control room can safely watch over the whole underground mining operation using one 25 inch colour monitor. These new data-acquisition systems will lead the way, in the future, to safer move efficient coal mining. Multi-point monitoring of carbon monoxide, heat anomalies, toxic gases and the procedures in conveyor belt operation from start-up to closedown.

  3. The Communicative Participation Item Bank (CPIB): item bank calibration and development of a disorder-generic short form.

    PubMed

    Baylor, Carolyn; Yorkston, Kathryn; Eadie, Tanya; Kim, Jiseon; Chung, Hyewon; Amtmann, Dagmar

    2013-08-01

    The purpose of this study was to calibrate the items for the Communicative Participation Item Bank (CPIB; Baylor, Yorkston, Eadie, Miller, & Amtmann, 2009; Yorkston et al., 2008) using item response theory (IRT). One overriding objective was to examine whether the IRT item parameters would be consistent across different diagnostic groups, thereby allowing creation of a disorder-generic instrument. The intended outcomes were the final item bank and a short form ready for clinical and research applications. Self-report data were collected from 701 individuals representing 4 diagnoses: multiple sclerosis, Parkinson's disease, amyotrophic lateral sclerosis, and head and neck cancer. Participants completed the CPIB and additional self-report questionnaires. CPIB data were analyzed using the IRT graded response model. The initial set of 94 candidate CPIB items were reduced to an item bank of 46 items demonstrating unidimensionality, local independence, good item fit, and good measurement precision. Differential item functioning analyses detected no meaningful differences across diagnostic groups. A 10-item, disorder-generic short form was generated. The CPIB provides speech-language pathologists with a unidimensional, self-report outcomes measurement instrument dedicated to the construct of communicative participation. This instrument may be useful to clinicians and researchers wanting to implement measures of communicative participation in their work.

  4. The Communicative Participation Item Bank (CPIB): Item bank calibration and development of a disorder-generic short form

    PubMed Central

    Baylor, Carolyn; Yorkston, Kathryn; Eadie, Tanya; Kim, Jiseon; Chung, Hyewon; Amtmann, Dagmar

    2015-01-01

    Purpose The purpose of this study was to calibrate the items for the Communicative Participation Item Bank (CPIB) using Item Response Theory (IRT). One overriding objective was to examine if the IRT item parameters would be consistent across different diagnostic groups, thereby allowing creation of a disorder-generic instrument. The intended outcomes were the final item bank and a short form ready for clinical and research applications. Methods Self-report data were collected from 701 individuals representing four diagnoses: multiple sclerosis, Parkinson’s disease, amyotrophic lateral sclerosis and head and neck cancer. Participants completed the CPIB and additional self-report questionnaires. CPIB data were analyzed using the IRT Graded Response Model (GRM). Results The initial set of 94 candidate CPIB items were reduced to an item bank of 46 items demonstrating unidimensionality, local independence, good item fit, and good measurement precision. Differential item function (DIF) analyses detected no meaningful differences across diagnostic groups. A 10-item, disorder-generic short form was generated. Conclusions The CPIB provides speech-language pathologists with a unidimensional, self-report outcomes measurement instrument dedicated to the construct of communicative participation. This instrument may be useful to clinicians and researchers wanting to implement measures of communicative participation in their work. PMID:23816661

  5. Detecting Local Item Dependence in Polytomous Adaptive Data

    ERIC Educational Resources Information Center

    Mislevy, Jessica L.; Rupp, Andre A.; Harring, Jeffrey R.

    2012-01-01

    A rapidly expanding arena for item response theory (IRT) is in attitudinal and health-outcomes survey applications, often with polytomous items. In particular, there is interest in computer adaptive testing (CAT). Meeting model assumptions is necessary to realize the benefits of IRT in this setting, however. Although initial investigations of…

  6. The Quality of Domain-Referenced Test Items.

    ERIC Educational Resources Information Center

    Haladyna, Thomas M.

    The objectives of this study were to first determine whether or not the empirical item analysis of domain referenced tests (DR) was justified; and second, in the event that it was, which of a set of recommended procedures was most effective for determining item quality. The analysis that followed led to the conclusion that empirical procedures…

  7. Costs of Matrix Sampling of Test Items. ERIC Digest.

    ERIC Educational Resources Information Center

    Childs, Ruth A.; Jaciw, Andrew P.

    Matrix sampling of test items, the division of a set of items into different versions of a test form, is used by several large-scale testing programs. This Digest discusses nine categories of costs associated with matrix sampling. These categories are: (1) development costs; (2) materials costs; (3) administration costs; (4) educational costs; (5)…

  8. 17 CFR 229.1100 - (Item 1100) General.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1100 (Item 1100) General. (a) Application of Regulation AB. Regulation AB (§§ 229.1100 through 229.1123) is the... this Regulation AB, including the definition of “asset-backed security,” are set forth in Item 1101. (b...

  9. 17 CFR 229.1100 - (Item 1100) General.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1100 (Item 1100) General. (a) Application of Regulation AB. Regulation AB (§§ 229.1100 through 229.1123) is the... this Regulation AB, including the definition of “asset-backed security,” are set forth in Item 1101. (b...

  10. 17 CFR 229.1100 - (Item 1100) General.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1100 (Item 1100) General. (a) Application of Regulation AB. Regulation AB (§§ 229.1100 through 229.1123) is the... this Regulation AB, including the definition of “asset-backed security,” are set forth in Item 1101. (b...

  11. 17 CFR 229.1100 - (Item 1100) General.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... AND CONSERVATION ACT OF 1975-REGULATION S-K Asset-Backed Securities (Regulation AB) § 229.1100 (Item 1100) General. (a) Application of Regulation AB. Regulation AB (§§ 229.1100 through 229.1123) is the... this Regulation AB, including the definition of “asset-backed security,” are set forth in Item 1101....

  12. Tree versus Geometric Representation of Tests and Items.

    ERIC Educational Resources Information Center

    Beller, Michael

    1990-01-01

    Geometric approaches to representing interrelations among tests and items are compared with an additive tree model (ATM), using 2,644 examinees and 2 other data sets. The ATM's close fit to the data and its coherence of presentation indicate that it is the best means of representing tests and items. (TJH)

  13. Detecting Local Item Dependence in Polytomous Adaptive Data

    ERIC Educational Resources Information Center

    Mislevy, Jessica L.; Rupp, Andre A.; Harring, Jeffrey R.

    2012-01-01

    A rapidly expanding arena for item response theory (IRT) is in attitudinal and health-outcomes survey applications, often with polytomous items. In particular, there is interest in computer adaptive testing (CAT). Meeting model assumptions is necessary to realize the benefits of IRT in this setting, however. Although initial investigations of…

  14. Effects of Ignoring Item Interaction on Item Parameter Estimation and Detection of Interacting Items

    ERIC Educational Resources Information Center

    Chen, Cheng-Te; Wang, Wen-Chung

    2007-01-01

    This study explores the effects of ignoring item interaction on item parameter estimation and the efficiency of using the local dependence index Q[subscript 3] and the SAS NLMIXED procedure to detect item interaction under the three-parameter logistic model and the generalized partial credit model. Through simulations, it was found that ignoring…

  15. Lunar surface mine feasibility study

    NASA Astrophysics Data System (ADS)

    Blair, Brad R.

    This paper describes a lunar surface mine, and demonstrates the economic feasibility of mining oxygen from the moon. The mine will be at the Apollo 16 landing site. Mine design issues include pit size and shape, excavation equipment, muck transport, and processing requirements. The final mine design will be driven by production requirements, and constrained by the lunar environment. This mining scenario assumes the presence of an operating lunar base. Lunar base personnel will set-up a and run the mine. The goal of producing lunar oxygen is to reduce dependence on fuel shipped from Earth. Thus, the lunar base is the customer for the finished product. The perspective of this paper is that of a mining contractor who must produce a specific product at a remote location, pay local labor, and sell the product to an onsite captive market. To make a profit, it must be less costly to build and ship specialized equipment to the site, and pay high labor and operating costs, than to export the product directly to the site.

  16. 78 FR 35974 - Proposed Information Collection; Comment Request; Coal Mine Rescue Teams; Arrangements for...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-06-14

    ... Safety and Health Administration Proposed Information Collection; Comment Request; Coal Mine Rescue Teams... protecting the safety and health of miners. 30 CFR Part 49, Mine Rescue Teams, Subpart B--Mine Rescue Teams for Underground Coal Mines, sets standards related to the availability of mine rescue teams;...

  17. Data mining in spontaneous reports.

    PubMed

    Bate, Andrew; Edwards, I R

    2006-03-01

    The increasing size of spontaneous report data sets and the increasing capability for screening such data due to increases in computational power has led to a recent increase in interest and use of data mining on such data. While data mining plays an important role in the analysis of spontaneous reports, there is general debate on how and when data mining should be best performed. While the cornerstone principles for data mining of spontaneous reports have been in place since the 1960s, several significant changes have occurred to make their use widespread. Superficially the Bayesian methods seem unnecessarily complex, particularly given the nature of the data, but in practice implementation in Bayesian framework gives clear benefits. There are difficulties evaluating the performance of the methods, but they work and save resources in managing large data sets. The use of neural networks allows more sophisticated pattern recognition to be performed.

  18. Mining with backfill

    SciTech Connect

    Granholm, S.

    1983-01-01

    This book reviews the fill mining practice in Sweden and other countries. Research results and technological innovations are presented on mining methods, mining operations, mining machinery and geomechanics. Other topics discussed are fill properties, technology, geomechanics, and new development.

  19. Coastal mining

    NASA Astrophysics Data System (ADS)

    Bell, Peter M.

    The Exclusive Economic Zone (EEZ) declared by President Reagan in March 1983 has met with a mixed response from those who would benefit from a guaranteed, 200-nautical-mile (370-km) protected underwater mining zone off the coasts of the United States and its possessions. On the one hand, the U.S. Department of the Interior is looking ahead and has been very successful in safeguarding important natural resources that will be needed in the coming decades. On the other hand, the mining industry is faced with a depressed metals and mining market.A report of the Exclusive Economic Zone Symposium held in November 1983 by the U.S. Geological Survey, the Mineral Management Service, and the Bureau of Mines described the mixed response as: “ … The Department of Interior … raring to go into promotion of deep-seal mining but industrial consortia being very pessimistic about the program, at least for the next 30 or so years.” (Chemical & Engineering News, February 5, 1983).

  20. 48 CFR 252.246-7006 - Warranty Tracking of Serialized Items.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... means a set of data elements marked on an item that is globally unique and unambiguous. Usage means the... to trace a warranted item from delivery through completion of the effectivity of the warranty....

  1. 48 CFR 252.246-7006 - Warranty Tracking of Serialized Items.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... means a set of data elements marked on an item that is globally unique and unambiguous. Usage means the... to trace a warranted item from delivery through completion of the effectivity of the warranty....

  2. 48 CFR 252.246-7006 - Warranty Tracking of Serialized Items.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... means a set of data elements marked on an item that is globally unique and unambiguous. Usage means the... to trace a warranted item from delivery through completion of the effectivity of the warranty....

  3. 48 CFR 252.246-7006 - Warranty Tracking of Serialized Items.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... means a set of data elements marked on an item that is globally unique and unambiguous. Usage means the... to trace a warranted item from delivery through completion of the effectivity of the warranty....

  4. 17 CFR 229.1122 - (Item 1122) Compliance with applicable servicing criteria.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... (Regulation AB) § 229.1122 (Item 1122) Compliance with applicable servicing criteria. (a) Reports on... (3) or Item 1115 of this Regulation AB, is maintained as set forth in the transaction agreements... 1100(d)(2) of this Regulation AB. ...

  5. 17 CFR 229.1122 - (Item 1122) Compliance with applicable servicing criteria.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... (Regulation AB) § 229.1122 (Item 1122) Compliance with applicable servicing criteria. (a) Reports on... (3) or Item 1115 of this Regulation AB, is maintained as set forth in the transaction agreements... 1100(d)(2) of this Regulation AB. ...

  6. 17 CFR 229.1122 - (Item 1122) Compliance with applicable servicing criteria.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... (Regulation AB) § 229.1122 (Item 1122) Compliance with applicable servicing criteria. (a) Reports on... (3) or Item 1115 of this Regulation AB, is maintained as set forth in the transaction agreements... 1100(d)(2) of this Regulation AB. ...

  7. Item Analysis of Multiple-Choice Questions in Teaching Prosthodontics.

    PubMed

    Madhav, V N V

    2015-11-01

    The aim of this study was to analyze multiple-choice questions used in teaching prosthodontics to dental students based on the items' distractor effectiveness and their difficulty and discrimination capacity. The overall goals were to assist the prosthodontics department in modifying its existing question bank and creating a set of prevalidated questions of known difficulty and discrimination capacity. The items selected for this study were from the three formative examinations in prosthodontics for fourth-year BDS students at Sinhgad Dental College and Hospital, Pune, Maharastra, India, during academic year 2011-12. As all the students' completed exams and scores were available, the task was to evaluate the efficacy of items already administered. Based on the item analysis of the questions, the following results were noted. None of the three examinations had items that were too difficult or defective. However, all three examinations had large percentages of easy items and many items had a low discrimination index. More than 60-70% of the items on all three examinations included ineffective distractors. All items therefore needed modifications based on their distractor effectiveness and difficulty and discrimination indices. After the study was completed, the results were discussed in a meeting with all teachers in the prosthodontics department and the test constructors, and suggestions were made with regard to improving the items for better evaluation of students.

  8. Asteroid mining

    NASA Technical Reports Server (NTRS)

    Gertsch, Richard E.

    1992-01-01

    The earliest studies of asteroid mining proposed retrieving a main belt asteroid. Because of the very long travel times to the main asteroid belt, attention has shifted to the asteroids whose orbits bring them fairly close to the Earth. In these schemes, the asteroids would be bagged and then processed during the return trip, with the asteroid itself providing the reaction mass to propel the mission homeward. A mission to one of these near-Earth asteroids would be shorter, involve less weight, and require a somewhat lower change in velocity. Since these asteroids apparently contain a wide range of potentially useful materials, our study group considered only them. The topics covered include asteroid materials and properties, asteroid mission selection, manned versus automated missions, mining in zero gravity, and a conceptual mining method.

  9. Multisource causal data mining

    NASA Astrophysics Data System (ADS)

    Woodley, Robert; Gosnell, Michael; Shallenberger, Kevin

    2012-06-01

    Analysts are faced with mountains of data, and finding that relevant piece of information is the proverbial needle in a haystack, only with dozens of haystacks. Analysis tools that facilitate identifying causal relationships across multiple data sets are sorely needed. 21st Century Systems, Inc. (21CSi) has initiated research called Causal-View, a causal datamining visualization tool, to address this challenge. Causal-View is built on an agent-enabled framework. Much of the processing that Causal-View will do is in the background. When a user requests information, Data Extraction Agents launch to gather information. This initial search is a raw, Monte Carlo type search designed to gather everything available that may have relevance to an individual, location, associations, and more. This data is then processed by Data- Mining Agents. The Data-Mining Agents are driven by user supplied feature parameters. If the analyst is looking to see if the individual frequents a known haven for insurgents he may request information on his last known locations. Or, if the analyst is trying to see if there is a pattern in the individual's contacts, the mining agent can be instructed with the type and relevance of the information fields to look at. The same data is extracted from the database, but the Data Mining Agents customize the feature set to determine causal relationships the user is interested in. At this point, a Hypothesis Generation and Data Reasoning Agents take over to form conditional hypotheses about the data and pare the data, respectively. The newly formed information is then published to the agent communication backbone of Causal- View to be displayed. Causal-View provides causal analysis tools to fill the gaps in the causal chain. We present here the Causal-View concept, the initial research into data mining tools that assist in forming the causal relationships, and our initial findings.

  10. Item Parameter Drift as an Indication of Differential Opportunity to Learn: An Exploration of Item Flagging Methods & Accurate Classification of Examinees

    ERIC Educational Resources Information Center

    Sukin, Tia M.

    2010-01-01

    The presence of outlying anchor items is an issue faced by many testing agencies. The decision to retain or remove an item is a difficult one, especially when the content representation of the anchor set becomes questionable by item removal decisions. Additionally, the reason for the aberrancy is not always clear, and if the performance of the…

  11. The DIF-Free-Then-DIF Strategy for the Assessment of Differential Item Functioning

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Shih, Ching-Lin; Sun, Guo-Wei

    2012-01-01

    The DIF-free-then-DIF (DFTD) strategy consists of two steps: (a) select a set of items that are the most likely to be DIF-free and (b) assess the other items for DIF (differential item functioning) using the designated items as anchors. The rank-based method together with the computer software IRTLRDIF can select a set of DIF-free polytomous items…

  12. The DIF-Free-Then-DIF Strategy for the Assessment of Differential Item Functioning

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Shih, Ching-Lin; Sun, Guo-Wei

    2012-01-01

    The DIF-free-then-DIF (DFTD) strategy consists of two steps: (a) select a set of items that are the most likely to be DIF-free and (b) assess the other items for DIF (differential item functioning) using the designated items as anchors. The rank-based method together with the computer software IRTLRDIF can select a set of DIF-free polytomous items…

  13. Text Mining for Neuroscience

    NASA Astrophysics Data System (ADS)

    Tirupattur, Naveen; Lapish, Christopher C.; Mukhopadhyay, Snehasis

    2011-06-01

    Text mining, sometimes alternately referred to as text analytics, refers to the process of extracting high-quality knowledge from the analysis of textual data. Text mining has wide variety of applications in areas such as biomedical science, news analysis, and homeland security. In this paper, we describe an approach and some relatively small-scale experiments which apply text mining to neuroscience research literature to find novel associations among a diverse set of entities. Neuroscience is a discipline which encompasses an exceptionally wide range of experimental approaches and rapidly growing interest. This combination results in an overwhelmingly large and often diffuse literature which makes a comprehensive synthesis difficult. Understanding the relations or associations among the entities appearing in the literature not only improves the researchers current understanding of recent advances in their field, but also provides an important computational tool to formulate novel hypotheses and thereby assist in scientific discoveries. We describe a methodology to automatically mine the literature and form novel associations through direct analysis of published texts. The method first retrieves a set of documents from databases such as PubMed using a set of relevant domain terms. In the current study these terms yielded a set of documents ranging from 160,909 to 367,214 documents. Each document is then represented in a numerical vector form from which an Association Graph is computed which represents relationships between all pairs of domain terms, based on co-occurrence. Association graphs can then be subjected to various graph theoretic algorithms such as transitive closure and cycle (circuit) detection to derive additional information, and can also be visually presented to a human researcher for understanding. In this paper, we present three relatively small-scale problem-specific case studies to demonstrate that such an approach is very successful in

  14. Integrating Entropy and Closed Frequent Pattern Mining for Social Network Modelling and Analysis

    NASA Astrophysics Data System (ADS)

    Adnan, Muhaimenul; Alhajj, Reda; Rokne, Jon

    The recent increase in the explicitly available social networks has attracted the attention of the research community to investigate how it would be possible to benefit from such a powerful model in producing effective solutions for problems in other domains where the social network is implicit; we argue that social networks do exist around us but the key issue is how to realize and analyze them. This chapter presents a novel approach for constructing a social network model by an integrated framework that first preparing the data to be analyzed and then applies entropy and frequent closed patterns mining for network construction. For a given problem, we first prepare the data by identifying items and transactions, which arc the basic ingredients for frequent closed patterns mining. Items arc main objects in the problem and a transaction is a set of items that could exist together at one time (e.g., items purchased in one visit to the supermarket). Transactions could be analyzed to discover frequent closed patterns using any of the well-known techniques. Frequent closed patterns have the advantage that they successfully grab the inherent information content of the dataset and is applicable to a broader set of domains. Entropies of the frequent closed patterns arc used to keep the dimensionality of the feature vectors to a reasonable size; it is a kind of feature reduction process. Finally, we analyze the dynamic behavior of the constructed social network. Experiments were conducted on a synthetic dataset and on the Enron corpus email dataset. The results presented in the chapter show that social networks extracted from a feature set as frequent closed patterns successfully carry the community structure information. Moreover, for the Enron email dataset, we present an analysis to dynamically indicate the deviations from each user's individual and community profile. These indications of deviations can be very useful to identify unusual events.

  15. The use of sequential pattern mining to predict next prescribed medications.

    PubMed

    Wright, Aileen P; Wright, Adam T; McCoy, Allison B; Sittig, Dean F

    2015-02-01

    Therapy for certain medical conditions occurs in a stepwise fashion, where one medication is recommended as initial therapy and other medications follow. Sequential pattern mining is a data mining technique used to identify patterns of ordered events. To determine whether sequential pattern mining is effective for identifying temporal relationships between medications and accurately predicting the next medication likely to be prescribed for a patient. We obtained claims data from Blue Cross Blue Shield of Texas for patients prescribed at least one diabetes medication between 2008 and 2011, and divided these into a training set (90% of patients) and test set (10% of patients). We applied the CSPADE algorithm to mine sequential patterns of diabetes medication prescriptions both at the drug class and generic drug level and ranked them by the support statistic. We then evaluated the accuracy of predictions made for which diabetes medication a patient was likely to be prescribed next. We identified 161,497 patients who had been prescribed at least one diabetes medication. We were able to mine stepwise patterns of pharmacological therapy that were consistent with guidelines. Within three attempts, we were able to predict the medication prescribed for 90.0% of patients when making predictions by drug class, and for 64.1% when making predictions at the generic drug level. These results were stable under 10-fold cross validation, ranging from 89.1%-90.5% at the drug class level and 63.5-64.9% at the generic drug level. Using 1 or 2 items in the patient's medication history led to more accurate predictions than not using any history, but using the entire history was sometimes worse. Sequential pattern mining is an effective technique to identify temporal relationships between medications and can be used to predict next steps in a patient's medication regimen. Accurate predictions can be made without using the patient's entire medication history. Copyright © 2014 Elsevier

  16. Differential Item Functioning Analysis Using Rasch Item Information Functions

    ERIC Educational Resources Information Center

    Wyse, Adam E.; Mapuranga, Raymond

    2009-01-01

    Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

  17. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  18. GRE Verbal Analogy Items: Examinee Reasoning on Items.

    ERIC Educational Resources Information Center

    Duran, Richard P.; And Others

    Information about how Graduate Record Examination (GRE) examinees solve verbal analogy problems was obtained in this study through protocol analysis. High- and low-ability subjects who had recently taken the GRE General Test were asked to "think aloud" as they worked through eight analogy items. These items varied factorially on the…

  19. Comparison of CAT Item Selection Criteria for Polytomous Items

    ERIC Educational Resources Information Center

    Choi, Seung W.; Swartz, Richard J.

    2009-01-01

    Item selection is a core component in computerized adaptive testing (CAT). Several studies have evaluated new and classical selection methods; however, the few that have applied such methods to the use of polytomous items have reported conflicting results. To clarify these discrepancies and further investigate selection method properties, six…

  20. Differential Item Functioning Analysis Using Rasch Item Information Functions

    ERIC Educational Resources Information Center

    Wyse, Adam E.; Mapuranga, Raymond

    2009-01-01

    Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

  1. Evaluating Item Fit for Multidimensional Item Response Models

    ERIC Educational Resources Information Center

    Zhang, Bo; Stone, Clement A.

    2008-01-01

    This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…

  2. Assessing the evolution of sustainability reporting in the mining sector.

    PubMed

    Perez, Fabiana; Sanchez, Luis E

    2009-06-01

    Since the 1990s several large companies have been publishing nonfinancial performance reports. Focusing initially on the physical environment, these reports evolved to consider social relations, as well as data on the firm's economic performance. A few mining companies pioneered this trend, and in the last years some of them incorporated the three dimensions of sustainable development, publishing so-called sustainability reports. This article reviews 31 reports published between 2001 and 2006 by four major mining companies. A set of 62 assessment items organized in six categories (namely context and commitment, management, environmental, social and economic performance, and accessibility and assurance) were selected to guide the review. The items were derived from international literature and recommended best practices, including the Global Reporting Initiative G3 framework. A content analysis was performed using the report as a sampling unit, and using phrases, graphics, or tables containing certain information as data collection units. A basic rating scale (0 or 1) was used for noting the presence or absence of information and a final percentage score was obtained for each report. Results show that there is a clear evolution in report's comprehensiveness and depth. Categories "accessibility and assurance" and "economic performance" featured the lowest scores and do not present a clear evolution trend in the period, whereas categories "context and commitment" and "social performance" presented the best results and regular improvement; the category "environmental performance," despite it not reaching the biggest scores, also featured constant evolution. Description of data measurement techniques, besides more comprehensive third-party verification are the items most in need of improvement.

  3. Assessing the Evolution of Sustainability Reporting in the Mining Sector

    NASA Astrophysics Data System (ADS)

    Perez, Fabiana; Sanchez, Luis E.

    2009-06-01

    Since the 1990s several large companies have been publishing nonfinancial performance reports. Focusing initially on the physical environment, these reports evolved to consider social relations, as well as data on the firm’s economic performance. A few mining companies pioneered this trend, and in the last years some of them incorporated the three dimensions of sustainable development, publishing so-called sustainability reports. This article reviews 31 reports published between 2001 and 2006 by four major mining companies. A set of 62 assessment items organized in six categories (namely context and commitment, management, environmental, social and economic performance, and accessibility and assurance) were selected to guide the review. The items were derived from international literature and recommended best practices, including the Global Reporting Initiative G3 framework. A content analysis was performed using the report as a sampling unit, and using phrases, graphics, or tables containing certain information as data collection units. A basic rating scale (0 or 1) was used for noting the presence or absence of information and a final percentage score was obtained for each report. Results show that there is a clear evolution in report’s comprehensiveness and depth. Categories “accessibility and assurance” and “economic performance” featured the lowest scores and do not present a clear evolution trend in the period, whereas categories “context and commitment” and “social performance” presented the best results and regular improvement; the category “environmental performance,” despite it not reaching the biggest scores, also featured constant evolution. Description of data measurement techniques, besides more comprehensive third-party verification are the items most in need of improvement.

  4. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  5. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  6. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  7. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  8. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  9. Intrusion detection: a novel approach that combines boosting genetic fuzzy classifier and data mining techniques

    NASA Astrophysics Data System (ADS)

    Ozyer, Tansel; Alhajj, Reda; Barker, Ken

    2005-03-01

    This paper proposes an intelligent intrusion detection system (IDS) which is an integrated approach that employs fuzziness and two of the well-known data mining techniques: namely classification and association rule mining. By using these two techniques, we adopted the idea of using an iterative rule learning that extracts out rules from the data set. Our final intention is to predict different behaviors in networked computers. To achieve this, we propose to use a fuzzy rule based genetic classifier. Our approach has two main stages. First, fuzzy association rule mining is applied and a large number of candidate rules are generated for each class. Then the rules pass through pre-screening mechanism in order to reduce the fuzzy rule search space. Candidate rules obtained after pre-screening are used in genetic fuzzy classifier to generate rules for the specified classes. Classes are defined as Normal, PRB-probe, DOS-denial of service, U2R-user to root and R2L- remote to local. Second, an iterative rule learning mechanism is employed for each class to find its fuzzy rules required to classify data each time a fuzzy rule is extracted and included in the system. A Boosting mechanism evaluates the weight of each data item in order to help the rule extraction mechanism focus more on data having relatively higher weight. Finally, extracted fuzzy rules having the corresponding weight values are aggregated on class basis to find the vote of each class label for each data item.

  10. A Mixed Effects Randomized Item Response Model

    ERIC Educational Resources Information Center

    Fox, J.-P.; Wyrick, Cheryl

    2008-01-01

    The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…

  11. Evolving Compact Decision Rule Sets

    DTIC Science & Technology

    1999-06-01

    While data mining technology holds the promise of automatically extracting useful patterns (such as decision rules) from data, this potential has yet...to be realized. One of the major technical impediments is that the current generation of data mining tools produce decision rule sets that are very

  12. Radioecological impacts of tin mining.

    PubMed

    Aliyu, Abubakar Sadiq; Mousseau, Timothy Alexander; Ramli, Ahmad Termizi; Bununu, Yakubu Aliyu

    2015-12-01

    The tin mining activities in the suburbs of Jos, Plateau State, Nigeria, have resulted in technical enhancement of the natural background radiation as well as higher activity concentrations of primordial radionuclides in the topsoil of mining sites and their environs. Several studies have considered the radiological human health risks of the mining activity; however, to our knowledge no documented study has investigated the radiological impacts on biota. Hence, an attempt is made to assess potential hazards using published data from the literature and the ERICA Tool. This paper considers the effects of mining and milling on terrestrial organisms like shrubs, large mammals, small burrowing mammals, birds (duck), arthropods (earth worm), grasses, and herbs. The dose rates and risk quotients to these organisms are computed using conservative values for activity concentrations of natural radionuclides reported in Bitsichi and Bukuru mining areas. The results suggest that grasses, herbs, lichens, bryophytes and shrubs receive total dose rates that are of potential concern. The effects of dose rates to specific indicator species of interest are highlighted and discussed. We conclude that further investigation and proper regulations should be set in place in order to reduce the risk posed by the tin mining activity on biota. This paper also presents a brief overview of the impact of mineral mining on biota based on documented literature for other countries.

  13. [Introduction to medical data mining].

    PubMed

    Zhu, Lingyun; Wu, Baoming; Cao, Changxiu

    2003-09-01

    Modern medicine generates a great deal of information stored in the medical database. Extracting useful knowledge and providing scientific decision-making for the diagnosis and treatment of disease from the database increasingly becomes necessary. Data mining in medicine can deal with this problem. It can also improve the management level of hospital information and promote the development of telemedicine and community medicine. Because the medical information is characteristic of redundancy, multi-attribution, incompletion and closely related with time, medical data mining differs from other one. In this paper we have discussed the key techniques of medical data mining involving pretreatment of medical data, fusion of different pattern and resource, fast and robust mining algorithms and reliability of mining results. The methods and applications of medical data mining based on computation intelligence such as artificial neural network, fuzzy system, evolutionary algorithms, rough set, and support vector machine have been introduced. The features and problems in data mining are summarized in the last section.

  14. Data Mining and Homeland Security: An Overview

    DTIC Science & Technology

    2006-01-27

    Data mining has become one of the key features of many homeland security initiatives. Often used as a means for detecting fraud, assessing risk, and...product retailing, data mining involves the use of data analysis tools to discover previously unknown, valid patterns and relationships in large data...sets. In the context of homeland security, data mining can be a potential means to identify terrorist activities, such as money transfers and

  15. IDDM: Intrusion Detection Using Data Mining Techniques

    DTIC Science & Technology

    2001-05-01

    The IDDM project aims to determine the feasibility and effectiveness of data mining techniques in real-time intrusion detection and produce solutions...for this purpose. Traditionally, data mining is designed to operate on large off-line data sets. Previous attempts to apply the discipline in real...time environments met with varying success. In this paper, we overview earlier attempts to employ data mining principles in intrusion detection and

  16. Introduction to data mining for medical informatics.

    PubMed

    Brown, Donald E

    2008-03-01

    Data mining consists of a series of techniques for the discovery of patterns in large databases. This article provides an introduction to common data mining techniques with a view toward their use. The article begins by describing methods for discovering and exploring associations in observations and variables. The discussion then turns to methods for prediction. These techniques discover relationships between sets of variables. The article concludes with a description of evaluative techniques that are useful for assessing the results from data mining.

  17. Methods for Linking Item Parameters.

    DTIC Science & Technology

    1981-08-01

    of Linking Adequacy ... ........... ... 27 Evaluation of Linking Techniques .. ......... ... 28 Rasch model . ................. 28 Three-parameter...class are models for use when the response is dichotomous (Lord & Novick, 1968; lirnbaum, 1968), models for use when the response is polychotomous... dichotomous ability items of a multiple-choice nature. Two IRT models are appropriate for such items: the three-parameter normal and logistic ogive models

  18. Item Veto: Dangerous Constitutional Tinkering.

    ERIC Educational Resources Information Center

    Bellamy, Calvin

    1989-01-01

    In theory, the item veto would empower the President to remove wasteful and unnecessary projects from legislation. Yet, despite its history at the state level, the item veto is a loosely defined concept that may not work well at the federal level. Much more worrisome is the impact on the balance of power. (Author/CH)

  19. Generalizability in Item Response Modeling

    ERIC Educational Resources Information Center

    Briggs, Derek C.; Wilson, Mark

    2007-01-01

    An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…

  20. Generalizability in Item Response Modeling

    ERIC Educational Resources Information Center

    Briggs, Derek C.; Wilson, Mark

    2007-01-01

    An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…

  1. Career and Occupational Development Items.

    ERIC Educational Resources Information Center

    Education Commission of the States, Denver, CO. National Assessment of Educational Progress.

    The career and occupational development items contained in this document are part of a kit consisting of four documents which bring together different types of items that measure a number of career and occupational development (COD) objectives developed by the National Assessment of Educational Progress (NAEP). (NAEP--which completed a national…

  2. Career and Occupational Development Items.

    ERIC Educational Resources Information Center

    Education Commission of the States, Denver, CO. National Assessment of Educational Progress.

    The career and occupational development items contained in this document are part of a kit consisting of four documents which bring together different types of items that measure a number of career and occupational development (COD) objectives developed by the National Assessment of Educational Progress (NAEP). (NAEP--which completed a national…

  3. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 30 Mineral Resources 1 2012-07-01 2012-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  4. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 30 Mineral Resources 1 2013-07-01 2013-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  5. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 30 Mineral Resources 1 2014-07-01 2014-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  6. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 30 Mineral Resources 1 2014-07-01 2014-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  7. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 1 2011-07-01 2011-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  8. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 30 Mineral Resources 1 2013-07-01 2013-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  9. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 30 Mineral Resources 1 2012-07-01 2012-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  10. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 1 2011-07-01 2011-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  11. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  12. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  13. Planning the Mine and Mining the Plan

    NASA Astrophysics Data System (ADS)

    Boucher, D. S.; Chen, N.

    2016-11-01

    Overview of best practices used in the terrestrial mining industry when developing a mine site towards production. The intent is to guide planners towards an effective and well constructed roadmap for the development of ISRU mining activities. A strawman scenario is presented as an illustration for lunar mining of water ice.

  14. Embodied Valuation: Directional Action is Associated with Item Values.

    PubMed

    Green, Danielle J; Harris, Alison; Young, Aleena; Reed, Catherine L

    2017-08-31

    We have a lifetime of experience interacting with objects we value. Although many economic theories represent valuation as a purely cognitive process independent of the sensorimotor system, embodied cognitive theory suggests that our memories for items' value should be linked to actions we use to obtain them. Here we investigated whether the value of real items was associated with specific directional movements toward or away from the body. Participants priced a set of food items to determine their values; they then used directional actions to classify each item as high- or low-value. To determine if value is linked to specific action mappings, movements were referenced either with respect to the object (push toward high-value items; pull away from low-value items) or the self (pull high-value items toward self; push low-value items away). Participants who were assigned (Experiment 1) or chose (Experiment 2) to use an object-referenced action mapping were faster than those using a self-referenced mapping. A control experiment (Experiment 3) using left/right movements found no such difference when action mappings were not toward/away from the body. These results indicate that directional actions towards items are associated with the representation of their value, suggesting an embodied component to economic choice.

  15. Three controversies over item disclosure in medical licensure examinations

    PubMed Central

    Park, Yoon Soo; Yang, Eunbae B.

    2015-01-01

    In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration. PMID:26374693

  16. Three controversies over item disclosure in medical licensure examinations.

    PubMed

    Park, Yoon Soo; Yang, Eunbae B

    2015-01-01

    In response to views on public's right to know, there is growing attention to item disclosure - release of items, answer keys, and performance data to the public - in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations - 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure - by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers' right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration.

  17. [Perceptions on item disclosure for the Korean medical licensing examination].

    PubMed

    Yang, Eunbae B

    2015-09-01

    This study analyzed the perceptions of medical students and faculty regarding disclosure of test items on the Korean medical licensing examination. I conducted a survey of medical students from medical colleges and professional medical schools nationwide. Responses were analyzed from 718 participants as well as 69 faculty members who participated in creating the medical licensing examination item sets. Data were analyzed using descriptive statistics and the chi-square test. It is important to maintain test quality and to keep the test items unavailable to the public. There are also concerns among students that disclosure of test items would prompt increasing difficulty of test items (48.3%). Further, few students found it desirable to disclose test items regardless of any considerations (28.5%). The professors, who had experience in designing the test items, also expressed their opposition to test item disclosure (60.9%). It is desirable not to disclose the test items of the Korean medical licensing examination to the public on the condition that students are provided with a sufficient amount of information regarding the examination. This is so that the exam can appropriately identify candidates with the required qualifications.

  18. Continuous Online Item Calibration: Parameter Recovery and Item Utilization.

    PubMed

    Ren, Hao; van der Linden, Wim J; Diao, Qi

    2017-06-01

    Parameter recovery and item utilization were investigated for different designs for online test item calibration. The design was adaptive in a double sense: it assumed both adaptive testing of examinees from an operational pool of previously calibrated items and adaptive assignment of field-test items to the examinees. Four criteria of optimality for the assignment of the field-test items were used, each of them based on the information in the posterior distributions of the examinee's ability parameter during adaptive testing as well as the sequentially updated posterior distributions of the field-test item parameters. In addition, different stopping rules based on target values for the posterior standard deviations of the field-test parameters and the size of the calibration sample were used. The impact of each of the criteria and stopping rules on the statistical efficiency of the estimates of the field-test parameters and on the time spent by the items in the calibration procedure was investigated. Recommendations as to the practical use of the designs are given.

  19. Data Mining and Analysis

    NASA Technical Reports Server (NTRS)

    Samms, Kevin O.

    2015-01-01

    The Data Mining project seeks to bring the capability of data visualization to NASA anomaly and problem reporting systems for the purpose of improving data trending, evaluations, and analyses. Currently NASA systems are tailored to meet the specific needs of its organizations. This tailoring has led to a variety of nomenclatures and levels of annotation for procedures, parts, and anomalies making difficult the realization of the common causes for anomalies. Making significant observations and realizing the connection between these causes without a common way to view large data sets is difficult to impossible. In the first phase of the Data Mining project a portal was created to present a common visualization of normalized sensitive data to customers with the appropriate security access. The tool of the visualization itself was also developed and fine-tuned. In the second phase of the project we took on the difficult task of searching and analyzing the target data set for common causes between anomalies. In the final part of the second phase we have learned more about how much of the analysis work will be the job of the Data Mining team, how to perform that work, and how that work may be used by different customers in different ways. In this paper I detail how our perspective has changed after gaining more insight into how the customers wish to interact with the output and how that has changed the product.

  20. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    PubMed

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  1. Mining in chemometrics.

    PubMed

    Mutihac, Lucia; Mutihac, Radu

    2008-03-31

    Some of the increasingly spread data mining methods in chemometrics like exploratory data analysis, artificial neural networks, pattern recognition, and digital image processing with their highs and lows along with some of their representative applications are discussed. The development of more complex analytical instruments and the need to cope with larger experimental data sets have demanded for new approaches in data analysis, which have led to advanced methods in experimental design and data processing. Hypothesis-driven methods typified by inferential statistics have been gradually complemented or even replaced by data-driven model-free methods that seek for structure in data without reference to the experimental protocol or prior hypotheses. The emphasis is put on the ability of data mining methods to solve multivariate-multiresponse problems on the basis of experimental data and minimal statistical assumptions only, in contrast to classical methods, which require predefined priors to be tested against some null-hypothesis.

  2. Optimizing data collection for public health decisions: a data mining approach

    PubMed Central

    2014-01-01

    Background Collecting data can be cumbersome and expensive. Lack of relevant, accurate and timely data for research to inform policy may negatively impact public health. The aim of this study was to test if the careful removal of items from two community nutrition surveys guided by a data mining technique called feature selection, can (a) identify a reduced dataset, while (b) not damaging the signal inside that data. Methods The Nutrition Environment Measures Surveys for stores (NEMS-S) and restaurants (NEMS-R) were completed on 885 retail food outlets in two counties in West Virginia between May and November of 2011. A reduced dataset was identified for each outlet type using feature selection. Coefficients from linear regression modeling were used to weight items in the reduced datasets. Weighted item values were summed with the error term to compute reduced item survey scores. Scores produced by the full survey were compared to the reduced item scores using a Wilcoxon rank-sum test. Results Feature selection identified 9 store and 16 restaurant survey items as significant predictors of the score produced from the full survey. The linear regression models built from the reduced feature sets had R2 values of 92% and 94% for restaurant and grocery store data, respectively. Conclusions While there are many potentially important variables in any domain, the most useful set may only be a small subset. The use of feature selection in the initial phase of data collection to identify the most influential variables may be a useful tool to greatly reduce the amount of data needed thereby reducing cost. PMID:24919484

  3. Acid mine drainage on public and private lands, the Walker Mine experience, Plumas County, California

    SciTech Connect

    Croyle, W.A.; Rosenbaum, S.E.

    1996-11-01

    A widespread environmental problem associated with abandoned mines and their tailings is acid mine drainage (AMD). AMID typically has low pH and elevated metal concentrations that are toxic to aquatic life. In Northern California, Iron Mountain and other mines in the Shasta mining districts are the largest sources of AMD. Additional sources lie to the south along a discontinuous belt of copper and zinc mineralization in the western Sierra foothills. Between these areas lies a remote group of copper mines in northeastern Plumas County including the Walker, Engels and Superior mines. Of this group, AMD from Walker Mine has caused the most severe water quality impairment. This paper describes the history and environmental setting of Walker Mine and the approaches used by the Central Valley Regional Water Quality Control Board, a state regulatory agency, to improve water quality at the site. Both the mine and its tailings contribute pollutants to the watershed. The mine has a portal discharge with depressed pH and high copper concentrations. The tailings add fine grained sediment to the creek and generate low but significant concentrations of dissolved copper. The mine is on private property and the tailings are on land managed by the U. S. Forest Service. Because of these differences in pollution problems and ownership, the methods employed by the Regional Board to improve conditions at the mine and tailings have been on different, but parallel tracks. Monitoring shows these efforts have significantly improved water quality in the watershed over the last 10 years.

  4. Data mining

    SciTech Connect

    Lee, K.; Kargupta, H.; Stafford, B.G.; Buescher, K.L.; Ravindran, B.

    1998-12-31

    This is the final report of a one-year, Laboratory Directed Research and Development (LDRD) project at the Los Alamos National Laboratory (LANL). The objective of this project was to develop and implement data mining technology suited to the analysis of large collections of unstructured data. This has taken the form of a software tool, PADMA (Parallel Data Mining Agents), which incorporates parallel data accessing, parallel scalable hierarchical clustering algorithms, and a web-based user interface for submitting Structured Query Language (SQL) queries and interactive data visualization. The authors have demonstrated the viability and scalability of PADMA by applying it to an unstructured text database of 25,000 documents running on an IBM SP2 at Argonne National Laboratory. The utility of PADMA for discovering patterns in data has also been demonstrated by applying it to laboratory test data for Hepatitis C patients and autopsy reports in collaboration with the University of New Mexico School of Medicine.

  5. Mining wavelet transformed boiler data sets

    NASA Astrophysics Data System (ADS)

    Letsche, Terry Lee

    Accurate combustion models provide information that allows increased boiler efficiency optimization, saving money and resources while reducing waste. Boiler combustion processes are noted for being complex, nonstationary and nonlinear. While numerous methods have been used to model boiler processes, data driven approaches reflect actual operating conditions within a particular boiler and do not depend on idealized, complex, or expensive empirical models. Boiler and combustion processes vary in time, requiring a denoising technique that preserves the temporal and frequency nature of the data. Moving average, a common technique, smoothes data---low frequency noise is not removed. This dissertation examines models built with wavelet denoising techniques that remove low and high frequency noise in both time and frequency domains. The denoising process has a number of parameters, including choice of wavelet, threshold value, level of wavelet decomposition, and disposition of attributes that appear to be significant at multiple thresholds. A process is developed to experimentally evaluate the predictive accuracy of these models and compares this result against two benchmarks. The first research hypothesis compares the performance of these wavelet denoised models to the model generated from the original data. The second research hypothesis compares the performance of the models generated with this denoising approach to the most effective model generated from a moving average process. In both experiments it was determined that the Daubechies 4 wavelet was a better choice than the more typically chosen Haar wavelet, wavelet packet decomposition outperforms other levels of wavelet decomposition, and discarding all but the lowest threshold repeating attributes produces superior results. The third research hypothesis examined using a two-dimensional wavelet transform on the data. Another parameter for handling the boundary condition was introduced. In the two-dimensional case, the CDF(4,6) wavelet and zero symmetric padded boundary produced the best results. The two-dimensional transform produces models six percent more accurate than the one-dimensional transform, nearly eight percent more accurate than non-transformed data, and four percent more accurate than moving average transformed model.

  6. Modeling local item dependence with the hierarchical generalized linear model.

    PubMed

    Jiao, Hong; Wang, Shudong; Kamata, Akihito

    2005-01-01

    Local item dependence (LID) can emerge when the test items are nested within common stimuli or item groups. This study proposes a three-level hierarchical generalized linear model (HGLM) to model LID when LID is due to such contextual effects. The proposed three-level HGLM was examined by analyzing simulated data sets and was compared with the Rasch-equivalent two-level HGLM that ignores such a nested structure of test items. The results demonstrated that the proposed model could capture LID and estimate its magnitude. Also, the two-level HGLM resulted in larger mean absolute differences between the true and the estimated item difficulties than those from the proposed three-level HGLM. Furthermore, it was demonstrated that the proposed three-level HGLM estimated the ability distribution variance unaffected by the LID magnitude, while the two-level HGLM with no LID consideration increasingly underestimated the ability variance as the LID magnitude increased.

  7. Text mining.

    PubMed

    Clegg, Andrew B; Shepherd, Adrian J

    2008-01-01

    One of the fastest-growing fields in bioinformatics is text mining: the application of natural language processing techniques to problems of knowledge management and discovery, using large collections of biological or biomedical text such as MEDLINE. The techniques used in text mining range from the very simple (e.g., the inference of relationships between genes from frequent proximity in documents) to the complex and computationally intensive (e.g., the analysis of sentence structures with parsers in order to extract facts about protein-protein interactions from statements in the text). This chapter presents a general introduction to some of the key principles and challenges of natural language processing, and introduces some of the tools available to end-users and developers. A case study describes the construction and testing of a simple tool designed to tackle a task that is crucial to almost any application of text mining in bioinformatics--identifying gene/protein names in text and mapping them onto records in an external database.

  8. The Mechanization of Mining.

    ERIC Educational Resources Information Center

    Marovelli, Robert L.; Karhnak, John M.

    1982-01-01

    Mechanization of mining is explained in terms of its effect on the mining of coal, focusing on, among others, types of mining, productivity, machinery, benefits to retired miners, fatality rate in underground coal mines, and output of U.S. mining industry. (Author/JN)

  9. The Mechanization of Mining.

    ERIC Educational Resources Information Center

    Marovelli, Robert L.; Karhnak, John M.

    1982-01-01

    Mechanization of mining is explained in terms of its effect on the mining of coal, focusing on, among others, types of mining, productivity, machinery, benefits to retired miners, fatality rate in underground coal mines, and output of U.S. mining industry. (Author/JN)

  10. Toward improved branch prediction through data mining.

    SciTech Connect

    Hemmert, K. Scott; Johnson, D. Eric

    2009-09-01

    Data mining and machine learning techniques can be applied to computer system design to aid in optimizing design decisions, improving system runtime performance. Data mining techniques have been investigated in the context of branch prediction. Specifically, a comparison of traditional branch predictor performance has been made to data mining algorithms. Additionally, the possiblity of whether additional features available within the architectural state might serve to further improve branch prediction has been evaluated. Results show that data mining techniques indicate potential for improved branch prediction, especially when register file contents are included as a feature set.

  11. An Efficiency Balanced Information Criterion for Item Selection in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Han, Kyung T.

    2012-01-01

    Successful administration of computerized adaptive testing (CAT) programs in educational settings requires that test security and item exposure control issues be taken seriously. Developing an item selection algorithm that strikes the right balance between test precision and level of item pool utilization is the key to successful implementation…

  12. A Comparison of Two Area Measures for Detecting Differential Item Functioning.

    ERIC Educational Resources Information Center

    Kim, Seock-Ho; Cohen, Allan S.

    1991-01-01

    The exact and closed-interval area measures for detecting differential item functioning are compared for actual data from 1,000 African-American and 1,000 white college students taking a vocabulary test with items intentionally constructed to favor 1 set of examinees. No real differences in detection of biased items were found. (SLD)

  13. Estimation of Item Response Theory Parameters in the Presence of Missing Data

    ERIC Educational Resources Information Center

    Finch, Holmes

    2008-01-01

    Missing data are a common problem in a variety of measurement settings, including responses to items on both cognitive and affective assessments. Researchers have shown that such missing data may create problems in the estimation of item difficulty parameters in the Item Response Theory (IRT) context, particularly if they are ignored. At the same…

  14. The Asymptotic Distribution of Ability Estimates: Beyond Dichotomous Items and Unidimensional IRT Models

    ERIC Educational Resources Information Center

    Sinharay, Sandip

    2015-01-01

    The maximum likelihood estimate (MLE) of the ability parameter of an item response theory model with known item parameters was proved to be asymptotically normally distributed under a set of regularity conditions for tests involving dichotomous items and a unidimensional ability parameter (Klauer, 1990; Lord, 1983). This article first considers…

  15. Diverse Food Items Are Similarly Categorized by 8- to 13-Year-Old Children

    ERIC Educational Resources Information Center

    Beltran, Alicia; Knight Sepulveda, Karina; Watson, Kathy; Baranowski, Tom; Baranowski, Janice; Islam, Noemi; Missaghian, Mariam

    2008-01-01

    Objective: Assess how 8- to 13-year-old children categorized and labeled food items for possible use as part of a food search strategy in a computerized 24-hour dietary recall. Design: A set of 62 cards with pictures and names of food items from 18 professionally defined food groups was sorted by each child into piles of similar food items.…

  16. Item Difficulty in the Evaluation of Computer-Based Instruction: An Example from Neuroanatomy

    ERIC Educational Resources Information Center

    Chariker, Julia H.; Naaz, Farah; Pani, John R.

    2012-01-01

    This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of…

  17. An Alternative Approach for the Analyses and Interpretation of Attachment Sort Items

    ERIC Educational Resources Information Center

    Kirkland, John; Bimler, David; Drawneek, Andrew; McKim, Margaret; Scholmerich, Axel

    2004-01-01

    Attachment Q-Sort (AQS) is a tool for quantifying observations about toddler/caregiver relationships. Previous studies have applied factor analysis to the full 90 AQS item set to explore the structure underlying them. Here we explore that structure by applying multidimensional scaling (MDS) to judgements of inter-item similarity. AQS items are…

  18. Diverse Food Items Are Similarly Categorized by 8- to 13-Year-Old Children

    ERIC Educational Resources Information Center

    Beltran, Alicia; Knight Sepulveda, Karina; Watson, Kathy; Baranowski, Tom; Baranowski, Janice; Islam, Noemi; Missaghian, Mariam

    2008-01-01

    Objective: Assess how 8- to 13-year-old children categorized and labeled food items for possible use as part of a food search strategy in a computerized 24-hour dietary recall. Design: A set of 62 cards with pictures and names of food items from 18 professionally defined food groups was sorted by each child into piles of similar food items.…

  19. Item Difficulty in the Evaluation of Computer-Based Instruction: An Example from Neuroanatomy

    ERIC Educational Resources Information Center

    Chariker, Julia H.; Naaz, Farah; Pani, John R.

    2012-01-01

    This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of…

  20. An Efficiency Balanced Information Criterion for Item Selection in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Han, Kyung T.

    2012-01-01

    Successful administration of computerized adaptive testing (CAT) programs in educational settings requires that test security and item exposure control issues be taken seriously. Developing an item selection algorithm that strikes the right balance between test precision and level of item pool utilization is the key to successful implementation…

  1. Classical Test Theory and Item Response Theory: Analytical and Empirical Comparisons.

    ERIC Educational Resources Information Center

    Hwang, Dae-Yeop

    This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislay and D. Block, 1997). The example was a 15-item test with a sample size of 600…

  2. Addressing Naturally Occurring Asbestos in the Mining Industry

    NASA Astrophysics Data System (ADS)

    Bieber, D. W.

    2012-12-01

    Mining companies deal with naturally occurring asbestos (NOA) issues on their sites in two ways, avoidance and management. Avoidance simply means that to the extent practical, new mines are located in areas where NOA is unlikely to occur. Where mines are located in areas where NOA may be present, mines implement management procedures to identify and control potential sources of NOA. Management practices may include procedures set forth in regulations such as California's Air Toxicity Control Measure that deals with surface mining, voluntary procedures, or a combination of both. The mining industry generally recognizes that addressing NOA issues is a cost of doing business.;

  3. Mining maximal approximate numerical frequent patterns from uncertain data and application for emitter entity resolution

    NASA Astrophysics Data System (ADS)

    Xu, Xin

    2017-06-01

    Numerous fuzzy pattern mining methods have been proposed to address the uncertainty and incompleteness of quantitative data. Traditional fuzzy pattern mining methods generally have to transform the original quantitative values into either crystal items or fuzzy regions first, which is hard to apply without comprehensive domain knowledge. In addition, existing numerical pattern mining methods generally suffer high computational cost. Inspired by the above problems, we put forward an efficient maximal approximate numerical frequent pattern mining (MANFPM) method without fuzzy item or region specification. Experimental results have validated its scalability and effectiveness for application in emitter entity resolution.

  4. Item Disordinality with the Bookmark Standard Setting Procedure.

    ERIC Educational Resources Information Center

    Skaggs, Gary; Tessema, Aster

    This paper presents an application of the bookmark procedure to a test comprised of increasing text difficulty levels. The Test of English Proficiency for Adults (TEPA) was used for this study. Three forms of the TEPA were field tested in 1999 with approximately 1,000 non-native English speaking students enrolled in English-as-a-Second-Language…

  5. Printed Comments With Item Analyses

    ERIC Educational Resources Information Center

    Reid, J. Christopher

    1970-01-01

    The author suggests that a computer program be designed which not only computes item statistics, but also provides a running commetary in English to assist teachers in their analysis of test results. (PR)

  6. Northern Trust Mines

    EPA Pesticide Factsheets

    The United States and the Navajo Nation entered into settlement agreements that provide funds to conduct investigations and any needed cleanup at 16 of the 46 priority mines, including six mines in the Northern Abandoned Uranium Mine Region.

  7. Exploration and Mining Roadmap

    SciTech Connect

    none,

    2002-09-01

    This Exploration and Mining Technology Roadmap represents the third roadmap for the Mining Industry of the Future. It is based upon the results of the Exploration and Mining Roadmap Workshop held May 10 ñ 11, 2001.

  8. Item Banking. ERIC/AE Digest.

    ERIC Educational Resources Information Center

    Rudner, Lawrence

    This digest discusses the advantages and disadvantages of using item banks, and it provides useful information for those who are considering implementing an item banking project in their school districts. The primary advantage of item banking is in test development. Using an item response theory method, such as the Rasch model, items from multiple…

  9. Multilevel Modeling of Item Position Effects

    ERIC Educational Resources Information Center

    Albano, Anthony D.

    2013-01-01

    In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

  10. Computerized Adaptive Testing with Item Cloning.

    ERIC Educational Resources Information Center

    Glas, Cees A. W.; van der Linden, Wim J.

    2003-01-01

    Developed a multilevel item response (IRT) model that allows for differences between the distributions of item parameters of families of item clones. Results from simulation studies based on an item pool from the Law School Admission Test illustrate the accuracy of the item pool calibration and adaptive testing procedures based on the model. (SLD)

  11. 7 CFR 2902.5 - Item designation.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... availability of such items and the economic and technological feasibility of using such items, including life cycle costs. USDA will gather information on individual products within an item and extrapolate that product information to the item level for consideration in designating items. In considering these...

  12. Computerized Adaptive Testing with Item Cloning.

    ERIC Educational Resources Information Center

    Glas, Cees A. W.; van der Linden, Wim J.

    2003-01-01

    Developed a multilevel item response (IRT) model that allows for differences between the distributions of item parameters of families of item clones. Results from simulation studies based on an item pool from the Law School Admission Test illustrate the accuracy of the item pool calibration and adaptive testing procedures based on the model. (SLD)

  13. Effects of Incorporating Humor in Test Items.

    ERIC Educational Resources Information Center

    McMorris, Robert F.; And Others

    1985-01-01

    Two matched forms of a 50 item grammar test were developed. Twenty items designed to be humorous were included in one form. Inclusion of humorous items did not affect grammar scores on matched humorous/nonhumorous items, nor on commmon post-treatment items. Inclusion did not affect results of anxiety measures. (Author/DWH)

  14. Multilevel Modeling of Item Position Effects

    ERIC Educational Resources Information Center

    Albano, Anthony D.

    2013-01-01

    In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

  15. Improved Approximation Algorithms for Item Pricing with Bounded Degree and Valuation

    NASA Astrophysics Data System (ADS)

    Hamane, Ryoso; Itoh, Toshiya

    When a store sells items to customers, the store wishes to decide the prices of the items to maximize its profit. If the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. It would be hard for the store to decide the prices of items. Assume that a store has a set V of n items and there is a set C of m customers who wish to buy those items. The goal of the store is to decide the price of each item to maximize its profit. We refer to this maximization problem as an item pricing problem. We classify the item pricing problems according to how many items the store can sell or how the customers valuate the items. If the store can sell every item i with unlimited (resp. limited) amount, we refer to this as unlimited supply (resp. limited supply). We say that the item pricing problem is single-minded if each customer j∈C wishes to buy a set ej⊆V of items and assigns valuation w(ej)≥0. For the single-minded item pricing problems (in unlimited supply), Balcan and Blum regarded them as weighted k-hypergraphs and gave several approximation algorithms. In this paper, we focus on the (pseudo) degree of k-hypergraphs and the valuation ratio, i. e., the ratio between the smallest and the largest valuations. Then for the single-minded item pricing problems (in unlimited supply), we show improved approximation algorithms (for k-hypergraphs, general graphs, bipartite graphs, etc.) with respect to the maximum (pseudo) degree and the valuation ratio.

  16. The Identification of Neurologically Relevant Items in the MMPI-2.

    ERIC Educational Resources Information Center

    Van Balen, H. G. G.; Van Limbeek, J.; De Mey, H. R. A.

    1997-01-01

    Forty neuropsychologists, neurologists, psychiatrists, and physiatrists identified neurologically relevant items (NRIs) in the Minnesota Multiphasic Personality Inventory-2 (MMPI-2). Raters identified four sets of NRIs: one for brain damage in general and three partially overlapping sets for stroke, traumatic brain damage, and whiplash.…

  17. Mining review

    USGS Publications Warehouse

    McCartan, L.; Morse, D.E.; Plunkert, P.A.; Sibley, S.F.

    2004-01-01

    The average annual growth rate of real gross domestic product (GDP) from the third quarter of 2001 through the second quarter of 2003 in the United States was about 2.6 percent. GDP growth rates in the third and fourth quarters of 2003 were about 8 percent and 4 percent, respectively. The upward trends in many sectors of the U.S. economy in 2003, however, were shared by few of the mineral materials industries. Annual output declined in most nonfuel mining and mineral processing industries, although there was an upward turn toward yearend as prices began to increase.

  18. Surface mining

    SciTech Connect

    Not Available

    1989-06-01

    This paper reports on a GAO study of attorney and expert witness fees awarded as a result of litigation brought under the Surface Mining Control and Reclamation Act. As of March 24, 1989, a total of about $1.4 million had been awarded in attorney fees and expenses - about $1.3 subject to the provisions of the Employee Retirement Income Security Act, a comparison of its features with provisions of ERISA showed that the plan differed from ERISA provisions in areas such as eligibility, funding, and contribution limits.

  19. Binding agent for molding ceramic items

    NASA Technical Reports Server (NTRS)

    Beshentsev, B. D.; Vityuk, N. P.; Volkov, A. V.; Yevdokimov, A. I.; Novikov, M. N.; Piskunov, Y. G.; Pobortsev, E. P.; Sadovnichaya, L. M.

    1983-01-01

    The invention refers to the fabrication of ceramic items by the molding method. It can be used to produce items of complicated configuration, in particular composition of binding agent for electroceramic items.

  20. The Effects of Various Item Selection Methods on the Classification Accuracy and Classification Consistency of Criterion-Referenced Instruments.

    ERIC Educational Resources Information Center

    Smith, Douglas U.

    This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…

  1. Wikipedia Mining

    NASA Astrophysics Data System (ADS)

    Nakayama, Kotaro; Ito, Masahiro; Erdmann, Maike; Shirakawa, Masumi; Michishita, Tomoyuki; Hara, Takahiro; Nishio, Shojiro

    Wikipedia, a collaborative Wiki-based encyclopedia, has become a huge phenomenon among Internet users. It covers a huge number of concepts of various fields such as arts, geography, history, science, sports and games. As a corpus for knowledge extraction, Wikipedia's impressive characteristics are not limited to the scale, but also include the dense link structure, URL based word sense disambiguation, and brief anchor texts. Because of these characteristics, Wikipedia has become a promising corpus and a new frontier for research. In the past few years, a considerable number of researches have been conducted in various areas such as semantic relatedness measurement, bilingual dictionary construction, and ontology construction. Extracting machine understandable knowledge from Wikipedia to enhance the intelligence on computational systems is the main goal of "Wikipedia Mining," a project on CREP (Challenge for Realizing Early Profits) in JSAI. In this paper, we take a comprehensive, panoramic view of Wikipedia Mining research and the current status of our challenge. After that, we will discuss about the future vision of this challenge.

  2. Introduction to the mining of clinical data.

    PubMed

    Harrison, James H

    2008-03-01

    The increasing volume of medical data online, including laboratory data, represents a substantial resource that can provide a foundation for improved understanding of disease presentation, response to therapy, and health care delivery processes. Data mining supports these goals by providing a set of techniques designed to discover similarities and relationships between data elements in large data sets. Currently, medical data have several characteristics that increase the difficulty of applying these techniques, although there have been notable medical data mining successes. Future developments in integrated medical data repositories, standardized data representation, and guidelines for the appropriate research use of medical data will decrease the barriers to mining projects.

  3. Mine seepage problems in drift mine operations

    SciTech Connect

    DeRossett, C.; Johnson, D.E.; Bradshaw, D.B.

    1996-12-31

    Extensive mining in the Eastern Kentucky Coal Region has occurred in coal deposits located above valley floors. Underground mines present unique stability problems resulting from the creation of mine pools in abandoned works. {open_quotes}Blowouts{close_quotes} occur when hydrostatic pressures result in the cataclysmic failure of an outcrop-barrier. Additionally, seepage from flooded works results in saturation of colluvium, which may ultimately mobilize as landslides. Several case studies of both landslides and blowouts illustrate that considerations should be taken into account to control or prevent these problems. Underground mine maps and seepage conditions at the individual sites were examined to determine the mine layouts, outcrop-barrier widths, and structure of the mine floors. Discharge monitoring points were established in and near the landslides. These studies depict how mine layout, operation, and geology influence drainage conditions. The authors suggest that mine designs should incorporate drainage control to insure long-term stability and limit liability. The goal of the post-mining drainage plan is control of the mine drainage, which will reduce the size of mine pools and lower the hydrostatic pressure. Recommendations are made as to several methods that may be useful in controlling mine drainage.

  4. Historical archaeology at the Clarkson Mine, an eastern Ohio mining complex

    SciTech Connect

    Keener, C.S.

    2003-07-01

    This study examines the Clarkson Mine (33BL333), an eastern Ohio coal mine complex dating to the 1910s to 1920s, situated along Wheeling Creek. The results of preliminary surveys and the subsequent mitigation of four structures at the site are presented. The historical archaeology conducted at the site demonstrates the significant research possibilities inherent at many of these early industrial mine complexes. Of particular interest is the findings of depositional patterning around residential structures that revealed the influence of architecture on where and how items were deposited on the land surface. The ceramic and faunal assemblage were analyzed and provide significant details on socioeconomic attributes associated with the workers or staff. Artifacts recovered at the site provide an excellent diagnostic framework from which other similarly aged sites can be compared and dated. The findings at the Clarkson Mine are also placed into a more regional perspective and compared with other contemporary studies.

  5. Thermal response based item identification.

    SciTech Connect

    Smith, M. K.; Hypes, P. A.; Bracken, D. S.

    2001-01-01

    One of the most difficult problems in NDA of nuclear materials is identifying the chemical form of the nuclear material and the surrounding matrix. Recent work analyzing the calorimeter response of sources embedded in a variety of matrices has led to a possible solution to this problem. The wide range of thermal time constants exhibited by typical matrix materials lends itself to permitting the differentiation between materials, based on time constants extracted from the measured response. Potential applications include simple item identification, item fingerprinting as part of shipper-receiver measurements, and distinguishing between Pu metal and Pu oxide as required under certain proposed attribute measurements. The results of applying this technique to a variety of items will be presented and discussed.

  6. Caries Risk Assessment Item Importance

    PubMed Central

    Chaffee, B.W.; Featherstone, J.D.B.; Gansky, S.A.; Cheng, J.; Zhan, L.

    2016-01-01

    Caries risk assessment (CRA) is widely recommended for dental caries management. Little is known regarding how practitioners use individual CRA items to determine risk and which individual items independently predict clinical outcomes in children younger than 6 y. The objective of this study was to assess the relative importance of pediatric CRA items in dental providers’ decision making regarding patient risk and in association with clinically evident caries, cross-sectionally and longitudinally. CRA information was abstracted retrospectively from electronic patient records of children initially aged 6 to 72 mo at a university pediatric dentistry clinic (n = 3,810 baseline; n = 1,315 with follow-up). The 17-item CRA form included caries risk indicators, caries protective items, and clinical indicators. Conditional random forests classification trees were implemented to identify and assign variable importance to CRA items independently associated with baseline high-risk designation, baseline evident tooth decay, and follow-up evident decay. Thirteen individual CRA items, including all clinical indicators and all but 1 risk indicator, were independently and statistically significantly associated with student/resident providers’ caries risk designation. Provider-assigned baseline risk category was strongly associated with follow-up decay, which increased from low (20.4%) to moderate (30.6%) to high/extreme risk patients (68.7%). Of baseline CRA items, before adjustment, 12 were associated with baseline decay and 7 with decay at follow-up; however, in the conditional random forests models, only the clinical indicators (evident decay, dental plaque, and recent restoration placement) and 1 risk indicator (frequent snacking) were independently and statistically significantly associated with future disease, for which baseline evident decay was the strongest predictor. In this predominantly high-risk population under caries-preventive care, more individual CRA items

  7. Bloom filter based frequent patterns mining over data streams

    NASA Astrophysics Data System (ADS)

    Tan, JunShan; Kuang, Zhufang; Yang, Guogui

    2013-03-01

    A data streams synopses structure FPCBF, which is based on bloom filter, is proposed in this paper. Each unit of vector BF[e] in FPCBF is a two-tuples. The transactions insert operation (InsertTran) and 1-frequent patterns pregeneration operation (PreGenFP) is defined. Then, A data streams frequent patterns mining algorithm BFFPM which is based on FPCBF is proposed in this paper. The BFFPM algorithm contains two parts: 1-frequent patterns generation and r-frequent patterns generation. The problem of computing the 1-frequent patterns generation is transformed into the problem of computing the longest common sub-sequence α i LCS of k setting identifier sequence in BF[e] in FPCBF. In the same way, the problem of computing the r-frequent patterns generation is transformed into the problem of computing the longest common sub-sequence LCS r , which is the longest common sequence of the identifier sequence α i LCS ,…, i r LCS α + of r items. The IBM synthesizes data generation which output customers shopping a data are adopted as experiment data. The FPCBF algorithm not only has high precision for mining frequent patterns, but also has low memory requirement

  8. Cost comparison of selected US and Colombian coal mines - analyses of cost-structure differences between steam coal mines in US and Colombia

    SciTech Connect

    Not Available

    1986-01-01

    The report presents the results from a study comparing the mine-mouth costs among selected steam-coal mines in Colombia and the United States. The study was done in response to a Congressional request to identify the differences in production costs, and those factors, including mine health, safety, and environmental requirements, that account for the difference. Three sets of mines are compared to determine the basic cost differences in mining between the two coal countries.

  9. Multidimensional Item Response Theory Parameter Estimation with Nonsimple Structure Items

    ERIC Educational Resources Information Center

    Finch, Holmes

    2011-01-01

    Estimation of multidimensional item response theory (MIRT) model parameters can be carried out using the normal ogive with unweighted least squares estimation with the normal-ogive harmonic analysis robust method (NOHARM) software. Previous simulation research has demonstrated that this approach does yield accurate and efficient estimates of item…

  10. Evaluating the Magnitude of Differential Item Functioning in Polytomous Items.

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Thayer, Dorothy T.

    1996-01-01

    Two possible standard error formulas for the polytomous differential item functioning index proposed by N. J. Dorans and A. P. Schmitt (1991) were derived. These standard errors, and associated hypothesis-testing procedures, were evaluated through simulated data. The standard error that performed better is based on N. Mantel's (1963)…

  11. Australian Item Bank Program: Social Science Item Bank.

    ERIC Educational Resources Information Center

    Australian Council for Educational Research, Hawthorn.

    After vigorous review, editing, and trial testing, this item bank was compiled to help secondary school teachers construct objective tests in the social sciences. Anthropology, economics, ethnic and cultural studies, geography, history, legal studies, politics, and sociology are among the topics represented. The bank consists of multiple choice…

  12. Multidimensional Item Response Theory Parameter Estimation with Nonsimple Structure Items

    ERIC Educational Resources Information Center

    Finch, Holmes

    2011-01-01

    Estimation of multidimensional item response theory (MIRT) model parameters can be carried out using the normal ogive with unweighted least squares estimation with the normal-ogive harmonic analysis robust method (NOHARM) software. Previous simulation research has demonstrated that this approach does yield accurate and efficient estimates of item…

  13. Mine detection training based on expert skill

    NASA Astrophysics Data System (ADS)

    Staszewski, James J.; Davison, Alan

    2000-08-01

    Studies show that soldiers' mine detection capabilities with the PSS-12 hand-held detector are substandard and that their probabilities of detecting (PD) low-metal mines are dangerously low. Highly experienced PSS-12 operators, however, achieve PDS over 0.90 on high- and low-metal anti- tank (AT) and anti-personnel (AP) mines. Significantly, experts' detection techniques differ from conventional military PSS-12 operating procedures. We report three studies investigating whether instruction based on expert skill could bridge the observed performance gap. Basic research on human expertise has shown that instruction based on detailed scientific analyses of experts' behaviors and thought processes boosts skill acquisition dramatically. These studies tested the effects of an experimental detection training program based on knowledge and techniques learned from analysis of PSS-12 expertise. In Study I soldiers who had completed standard mine detection training participated as operators/trainees. This experiment used a pretest-posttest design. Mine simulants served as targets in testing gand training. Targets simulate d5 different mines and represented high- and low-metal AT and AP mine types. Pretest performance failed to distinguish the treatment and control groups. Both achieved very low PDs on low metal mines. Treatment-group soldiers then received approximately 15 hours of experimental, hands-on training. Posttest results showed that the treatment groups PD on minimal metal targets was more than 6 times that of the control group. Study 2 tested a subset of the treated soldiers in the same setting, now wearing body armor. Results replicated those of Study 1. Study 3 tested treatment group soldiers on real mine targets. Several mines from each mine type were used. The surface of the test lanes was expected to increase detection difficulty. Soldiers nonetheless achieved an aggregate PD of 0.97 and showed significant improvement in detecting low-metal mines.

  14. Design risk assessment for burst-prone mines: Application in a Canadian mine

    NASA Astrophysics Data System (ADS)

    Cheung, David J.

    reduce both exposure risk (personnel and equipment), and economical risk (revenue and costs). Fatal and catastrophic consequences can be averted through robust planning and design. Two customized approaches were developed to conduct risk assessment of case studies at Craig Mine. Firstly, the Brownfield Approach utilizes the seismic database to determine the seismic hazard from a rating system that evaluates frequency-magnitude, event size, and event-blast relation. Secondly, the Greenfield Approach utilizes the seismic database, focusing on larger magnitude events, rocktype, and geological structure. The customized Greenfield Approach can also be applied in the evaluation of design risk in deep mines with the same setting and condition as Craig Mine. Other mines with different settings and conditions can apply the principles in the methodology to evaluate design alternatives and risk reduction strategies for burst-prone mines.

  15. Mining machine

    SciTech Connect

    Becker, H.R.

    1984-12-04

    A mining machine is disclosed comprising a mobile base and a cutting head assembly at a forward end of the mobile base having a cutter drum rotatable about an output shaft disposed along the longitudinal axis of the cutter drum. A drive system for the cutting head assembly comprises at least one motor for driving at least one toothed motor pinion and a generally cylindrical combination gear having generally circular end surfaces. A bevel or face gear is formed in at least one of the end surfaces, having teeth adapted to mate with and be driven by the toothed motor pinion. The combination gear has a worm gear formed in the outside cylindrical surface, which is disposed in driving engagement with the teeth of an output gear integrally and coaxially connected to the output shaft of the cutter drum.

  16. Underground coal mining section data

    NASA Technical Reports Server (NTRS)

    Gabrill, C. P.; Urie, J. T.

    1981-01-01

    A set of tables which display the allocation of time for ten personnel and eight pieces of underground coal mining equipment to ten function categories is provided. Data from 125 full shift time studies contained in the KETRON database was utilized as the primary source data. The KETRON activity and delay codes were mapped onto JPL equipment, personnel and function categories. Computer processing was then performed to aggregate the shift level data and generate the matrices. Additional, documented time study data were analyzed and used to supplement the KETRON databased. The source data including the number of shifts are described. Specific parameters of the mines from which there data were extracted are presented. The result of the data processing including the required JPL matrices is presented. A brief comparison with a time study analysis of continuous mining systems is presented. The procedures used for processing the source data are described.

  17. Inactive and abandoned noncoal mines

    SciTech Connect

    Not Available

    1991-08-01

    Volume 1 outlines the environmental, health and safety problems at IAMS (Inactive, Abandoned Mine Sites), remediation technologies, remediation costs, the methodology states used in preparing state reports, and state summary tables. It also describes the broad range of policy options for remediation of problems associated with IAMS. Volume 2 gives state reports for inactive and abandoned noncoal mines for the following states: Alaska, Arizona, California, Colorado, Florida, Idaho, Minnesota, Missouri, Montana, Nevada, New Mexico, Oregon, South Carolina, South Dakota, Utah, Washington, Wisconsin and Wyoming. Volume 3 lists the State reports for the inactive and abandoned noncoal mines for the following states: Alabama, Arkansas, Illinois, Indiana, Iowa, Louisiana, Maine, Maryland, Mississippi, New York, North Carolina, Ohio, Oklahoma, Pennsylvania, Texas, and Virginia. A separate abstract is included for each of the 3 volumes of this set.

  18. Filtering for Medical News Items.

    ERIC Educational Resources Information Center

    Watters, Carolyn; Zheng, Wanhong; Milios, Evangelos

    2002-01-01

    Describes recent work to provide a filtering service for readers interested in medically related news articles from online news sources. The first task is to filter out nonmedical news items; the remaining articles are then assigned MeSH headings for context and then further categorized by intended audience level. (Author/LRW)

  19. Paradoxical Results and Item Bundles

    ERIC Educational Resources Information Center

    Hooker, Giles; Finkelman, Matthew

    2010-01-01

    Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…

  20. Paradoxical Results and Item Bundles

    ERIC Educational Resources Information Center

    Hooker, Giles; Finkelman, Matthew

    2010-01-01

    Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…

  1. Effects of Education on Differential Item Functioning on the 15-Item Modified Korean Version of the Boston Naming Test

    PubMed Central

    Kim, Byung-Soo; Lee, Dong-Woo; Bae, Jae-Nam; Kim, Ji-Hyun; Kim, Shinkyum; Kim, Ki Woong; Park, Jee-Eun; Cho, Maeng Je

    2017-01-01

    Objective Education is expected to have an effect on differential item functioning (DIF) on the 15-item Modified Boston Naming Test in the Korean version of the Consortium to Establish a Registry for Alzheimer's Disease Assessment Packet (BNT-KC). However, no study has examined DIF in the BNT-KC. Methods We used the item response theory to investigate the impact of education on the DIF in the BNT-KC among elderly individuals with or without dementia (n=720). A two-parameter item response model was used to determine the difficulty and discrimination parameters of each item. The Benjamini-Hochberg procedure was used to address the risk of Type I errors on multiple testing. Results Four items, “mermaid,” “acorn,” “compass,” and “pomegranate” continued to demonstrate DIF after controlling for multiple comparisons. Those with low education levels were more likely to error on “mermaid” and “compass,” while those with high education levels were more likely to error on “acorn” and “pomegranate.” “Hand” and “red pepper” were too easily identified to be used for detecting dementia patients. “Monk's hat” and “pomegranate” were less discriminating than other items, limiting their usefulness in clinical setting. Conclusion These findings may provide useful information for the development of a revised version of the BNT-KC to help clinicians make diagnostic decisions more accurately. PMID:28326109

  2. Effects of underground mining and mine collapse on the hydrology of selected basins in West Virginia

    USGS Publications Warehouse

    Hobba, William A.

    1993-01-01

    The effects of underground mining and mine collapse on areal hydrology were determined at one site where the mined bed of coal lies above major streams and at two sites where the bed of coal lies below major streams. Subsidence cracks observed at land surface generally run parallel to predominant joint sets in the rocks. The mining and subsidence cracks increase hydraulic conductivity and interconnection of water-bearing rock units, which in turn cause increased infiltration of precipitation and surface water, decreased evapotranspiration, and higher base flows in some small streams. Water levels in observation wells in mined areas fluctuate as much as 100 ft annually. Both gaining and losing streams are found in mined areas. Mine pumpage and drainage can cause diversion of water underground from one basin to another. Areal and single-well aquifer tests indicated that near-surface rocks have higher transmissivity in a mine-subsided basin than in unmined basins. Increased infiltration and circulation through shallow subsurface rocks increase dissolved mineral loads in streams, as do treated and untreated contributions from mine pumpage and drainage. Abandoned and flooded underground mines make good reservoirs because of their increased transmissivity and storage. Subsidence cracks were not detectable by thermal imagery, but springs and seeps were detectable.

  3. Psychometric Consequences of Subpopulation Item Parameter Drift

    ERIC Educational Resources Information Center

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  4. Psychometric Consequences of Subpopulation Item Parameter Drift

    ERIC Educational Resources Information Center

    Huggins-Manley, Anne Corinne

    2017-01-01

    This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

  5. A Monte Carlo Study Investigating the Influence of Item Discrimination, Category Intersection Parameters, and Differential Item Functioning Patterns on the Detection of Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Thurman, Carol

    2009-01-01

    The increased use of polytomous item formats has led assessment developers to pay greater attention to the detection of differential item functioning (DIF) in these items. DIF occurs when an item performs differently for two contrasting groups of respondents (e.g., males versus females) after controlling for differences in the abilities of the…

  6. Adaptable Learning Assistant for Item Bank Management

    ERIC Educational Resources Information Center

    Nuntiyagul, Atorn; Naruedomkul, Kanlaya; Cercone, Nick; Wongsawang, Damras

    2008-01-01

    We present PKIP, an adaptable learning assistant tool for managing question items in item banks. PKIP is not only able to automatically assist educational users to categorize the question items into predefined categories by their contents but also to correctly retrieve the items by specifying the category and/or the difficulty level. PKIP adapts…

  7. Unidimensional Interpretations for Multidimensional Test Items

    ERIC Educational Resources Information Center

    Kahraman, Nilufer

    2013-01-01

    This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…

  8. Adaptable Learning Assistant for Item Bank Management

    ERIC Educational Resources Information Center

    Nuntiyagul, Atorn; Naruedomkul, Kanlaya; Cercone, Nick; Wongsawang, Damras

    2008-01-01

    We present PKIP, an adaptable learning assistant tool for managing question items in item banks. PKIP is not only able to automatically assist educational users to categorize the question items into predefined categories by their contents but also to correctly retrieve the items by specifying the category and/or the difficulty level. PKIP adapts…

  9. Experimental Item Types to Measure Judgment.

    ERIC Educational Resources Information Center

    Northrop, Lois C.

    The Professional and Administrative Career Examination measures judgment ability using comprehension and logical order of events item types. Since the verbal component of comprehension items is high, and since these items are extremely difficult to write and document, a search was conducted for a simpler item type for assessing judgment. Three…

  10. 7 CFR 2902.5 - Item designation.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... AGRICULTURE GUIDELINES FOR DESIGNATING BIOBASED PRODUCTS FOR FEDERAL PROCUREMENT General § 2902.5 Item... designate items composed of generic groupings of specific products and will identify the minimum biobased content for each listed item. As items are designated for procurement preference, they will be added to...

  11. Item Banking Enables Stand-Alone Measurement of Driving Ability.

    PubMed

    Khadka, Jyoti; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2016-12-01

    To explore whether large item sets, as used in item banking, enable important latent traits, such as driving, to form stand-alone measures. The 88-item activity limitation (AL) domain of the glaucoma module of the Eye-tem Bank was interviewer-administered to patients with glaucoma. Rasch analysis was used to calibrate all items in AL domain on the same interval-level scale and test its psychometric properties. Based on Rasch dimensionality metrics, the AL scale was separated into subscales. These subscales underwent separate Rasch analyses to test whether they could form stand-alone measures. Independence of these measures was tested with Bland and Altman (B&A) Limit of Agreement (LOA). The AL scale was completed by 293 patients (median age, 71 years). It demonstrated excellent precision (3.12). However, Rasch analysis dimensionality metrics indicated that the domain arguably had other dimensions which were driving, luminance, and reading. Once separated, the remaining AL items, driving and luminance subscales, were unidimensional and had excellent precision of 4.25, 2.94, and 2.22, respectively. The reading subscale showed poor precision (1.66), so it was not examined further. The luminance subscale demonstrated excellent agreement (mean bias, 0.2 logit; 95% LOA, -2.2 to 3.3 logit); however, the driving subscale demonstrated poor agreement (mean bias, 1.1 logit; 95% LOA, -4.8 to 7.0 logit) with the AL scale. These findings indicate that driving items in the AL domain of the glaucoma module were perceived and responded to differently from the other AL items, but the reading and luminance items were not. Therefore, item banking enables stand-alone measurement of driving ability in glaucoma.

  12. The impact of item-writing flaws and item complexity on examination item difficulty and discrimination value.

    PubMed

    Rush, Bonnie R; Rankin, David C; White, Brad J

    2016-09-29

    Failure to adhere to standard item-writing guidelines may render examination questions easier or more difficult than intended. Item complexity describes the cognitive skill level required to obtain a correct answer. Higher cognitive examination items promote critical thinking and are recommended to prepare students for clinical training. This study evaluated faculty-authored examinations to determine the impact of item-writing flaws and item complexity on the difficulty and discrimination value of examination items used to assess third year veterinary students. The impact of item-writing flaws and item complexity (cognitive level I-V) on examination item difficulty and discrimination value was evaluated on 1925 examination items prepared by clinical faculty for third year veterinary students. The mean (± SE) percent correct (83.3 % ± 17.5) was consistent with target values in professional education, and the mean discrimination index (0.18 ± 0.17) was slightly lower than recommended (0.20). More than one item-writing flaw was identified in 37.3 % of questions. The most common item-writing flaws were awkward stem structure, implausible distractors, longest response is correct, and responses are series of true-false statements. Higher cognitive skills (complexity level III-IV) were required to correctly answer 38.4 % of examination items. As item complexity increased, item difficulty and discrimination values increased. The probability of writing discriminating, difficult examination items decreased when implausible distractors and all of the above were used, and increased if the distractors were comprised of a series of true/false statements. Items with four distractors were not more difficult or discriminating than items with three distractors. Preparation of examination questions targeting higher cognitive levels will increase the likelihood of constructing discriminating items. Use of implausible distractors to complete a five-option multiple choice

  13. Top-down vertical itemset mining

    NASA Astrophysics Data System (ADS)

    Sohrabi, Mohammad Karim; Ghods, Vahid

    2015-03-01

    Vertical itemset mining is an important frequent pattern mining problem with broad applications. It is challenging since one may need to examine a combinatorial explosive number of possible patterns of items of a dataset in a traditional horizontal algorithm. Since high dimensional datasets typically contain a large number of columns and a small number of rows, vertical itemset mining algorithms, which extract the frequent itemsets of dataset by producing all combination of rows ids, are a good alternative for horizontal algorithms in mining frequent itemsets from high dimensional dataset. Since a rowset can be simply produced from its subsets by adding a new row id to a sub rowset, many bottom up vertical itemset mining algorithms are designed and represented in the literature. However, bottom up vertical mining algorithms suffer from a main drawback. Bottom-up algorithms start the process of generating and testing of rowsets from the small rowsets and go on to the larger rowsets, whereas the small rowsets cannot produce a frequent itemsets because they contain less than minimum support threshold number of rows. In this paper, we described a new efficient vertical top down algorithm called VTD (Vertical Top Down) to conduct mining of frequent itemsets in high dimensional datasets. Our top down approach employed the minimum support threshold to prune the rowsets which any itemset could not be extracted from them. Several experiments on real bioinformatics datasets showed that VTD is orders of magnitude better than previous closed pattern mining algorithms. Our performance study showed that this algorithm outperformed substantially the best former algorithms.

  14. Identifying content for the glaucoma-specific item bank to measure quality-of-life parameters.

    PubMed

    Khadka, Jyoti; McAlinden, Colm; Craig, Jamie E; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2015-01-01

    Patient-reported outcomes (PROs) have become essential clinical trial end points. However, a comprehensive, multidimensional, patient-relevant, and precise glaucoma-specific PRO instrument is not available. Therefore, the purpose of this study was to identify content for a new, glaucoma-specific, quality-of-life (QOL) item bank. Content identification was undertaken in 5 phases: (1) identification of extant items in glaucoma-specific instruments and the qualitative literature; (2) focus groups and interviews with glaucoma patients; (3) item classification and selection; (4) expert review and revision of items; and (5) cognitive interviews with patients. A total of 737 unique items (extant items from PRO instruments, 247; qualitative articles, 14 items; focus groups and semistructured interviews, 476 items) were identified. These items were classified into 10 QOL domains. Four criteria (item redundancy, item inconsistent with domain definition, item content too narrow to have wider applicability, and item clarity) were used to remove and refine the items. After the cognitive interviews, the final minimally representative item set had a total of 342 unique items belonging to 10 domains: activity limitation (88), mobility (20), visual symptoms (19), ocular surface symptoms (22), general symptoms (15), convenience (39), health concerns (45), emotional well-being (49), social issues (23), and economic issues (22). The systematic content identification process identified 10 QOL domains, which were important to patients with glaucoma. The majority of the items were identified from the patient-specific focus groups and semistructured interviews suggesting that the existing PRO instruments do not adequately address QOL issues relevant to individuals with glaucoma.

  15. Simple imputation methods versus direct likelihood analysis for missing item scores in multilevel educational data.

    PubMed

    Kadengye, Damazo T; Cools, Wilfried; Ceulemans, Eva; Van den Noortgate, Wim

    2012-06-01

    Missing data, such as item responses in multilevel data, are ubiquitous in educational research settings. Researchers in the item response theory (IRT) context have shown that ignoring such missing data can create problems in the estimation of the IRT model parameters. Consequently, several imputation methods for dealing with missing item data have been proposed and shown to be effective when applied with traditional IRT models. Additionally, a nonimputation direct likelihood analysis has been shown to be an effective tool for handling missing observations in clustered data settings. This study investigates the performance of six simple imputation methods, which have been found to be useful in other IRT contexts, versus a direct likelihood analysis, in multilevel data from educational settings. Multilevel item response data were simulated on the basis of two empirical data sets, and some of the item scores were deleted, such that they were missing either completely at random or simply at random. An explanatory IRT model was used for modeling the complete, incomplete, and imputed data sets. We showed that direct likelihood analysis of the incomplete data sets produced unbiased parameter estimates that were comparable to those from a complete data analysis. Multiple-imputation approaches of the two-way mean and corrected item mean substitution methods displayed varying degrees of effectiveness in imputing data that in turn could produce unbiased parameter estimates. The simple random imputation, adjusted random imputation, item means substitution, and regression imputation methods seemed to be less effective in imputing missing item scores in multilevel data settings.

  16. German mining equipment

    SciTech Connect

    Not Available

    1993-10-01

    The German mining equipment industry developed to supply machines and services to the local mining industry, i.e., coal, lignite, salt, potash, ore mining, industrial minerals, and quarrying. The sophistication and reliability of its technology also won it worldwide export markets -- which is just as well since former major domestic mining sectors such as coal and potash have declined precipitously, and others such as ore mining have all but disappeared. Today, German mining equipment suppliers focus strongly on export sales, and formerly unique German mining technologies such as continuous mining with bucket wheel excavators and conveyors for open pits, or plowing of underground coal longwalls are widely used abroad. The status of the German mining equipment industry is reviewed.

  17. Kernel-Smoothing Estimation of Item Characteristic Functions for Continuous Personality Items: An Empirical Comparison with the Linear and the Continuous-Response Models

    ERIC Educational Resources Information Center

    Ferrando, Pere J.

    2004-01-01

    This study used kernel-smoothing procedures to estimate the item characteristic functions (ICFs) of a set of continuous personality items. The nonparametric ICFs were compared with the ICFs estimated (a) by the linear model and (b) by Samejima's continuous-response model. The study was based on a conditioned approach and used an error-in-variables…

  18. Kernel-Smoothing Estimation of Item Characteristic Functions for Continuous Personality Items: An Empirical Comparison with the Linear and the Continuous-Response Models

    ERIC Educational Resources Information Center

    Ferrando, Pere J.

    2004-01-01

    This study used kernel-smoothing procedures to estimate the item characteristic functions (ICFs) of a set of continuous personality items. The nonparametric ICFs were compared with the ICFs estimated (a) by the linear model and (b) by Samejima's continuous-response model. The study was based on a conditioned approach and used an error-in-variables…

  19. 1. OVERALL VIEW OF MINE SITE FROM KEETLEY MINE ROAD, ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    1. OVERALL VIEW OF MINE SITE FROM KEETLEY MINE ROAD, SHOWING TAILING DUMP. VIEW TO WEST. - Park Utah Mining Company: Keetley Mine Complex, 1 mile East of U.S. 40 at Keetley, Heber City, Wasatch County, UT

  20. 4. OVERALL VIEW OF MINE SITE, SHOWING MINE CAR TRACKS, ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    4. OVERALL VIEW OF MINE SITE, SHOWING MINE CAR TRACKS, SNOWSHEDS AND TIPPLE (LEFT BACKGROUND). VIEW TO EAST. - Park Utah Mining Company: Keetley Mine Complex, 1 mile East of U.S. 40 at Keetley, Heber City, Wasatch County, UT

  1. Ineffectiveness of reverse wording of questionnaire items: let's learn from cows in the rain.

    PubMed

    van Sonderen, Eric; Sanderman, Robbert; Coyne, James C

    2013-01-01

    We examined the effectiveness of reverse worded items as a means of reducing or preventing response bias. We first distinguished between several types of response bias that are often confused in literature. We next developed arguments why reversing items is probably never a good way to address response bias. We proposed testing whether reverse wording affects response bias with item-level data from the Multidimensional Fatigue Inventory (MFI-20), an instrument that contains reversed worded items. With data from 700 respondents, we compared scores on items that were similar with respect either to content or to direction of wording. Psychometric properties of sets of these items worded in the same direction were compared with sets consisting of both straightforward and reversed worded items. We did not find evidence that ten reverse-worded items prevented response bias. Instead, the data suggest scores were contaminated by respondent inattention and confusion. Using twenty items, balanced for scoring direction, to assess fatigue did not prevent respondents from inattentive or acquiescent answering. Rather, fewer mistakes are made with a 10-item instrument with items posed in the same direction. Such a format is preferable for both epidemiological and clinical studies.

  2. Ineffectiveness of Reverse Wording of Questionnaire Items: Let’s Learn from Cows in the Rain

    PubMed Central

    van Sonderen, Eric; Sanderman, Robbert; Coyne, James C.

    2013-01-01

    Objective We examined the effectiveness of reverse worded items as a means of reducing or preventing response bias. We first distinguished between several types of response bias that are often confused in literature. We next developed arguments why reversing items is probably never a good way to address response bias. We proposed testing whether reverse wording affects response bias with item-level data from the Multidimensional Fatigue Inventory (MFI-20), an instrument that contains reversed worded items. Methods With data from 700 respondents, we compared scores on items that were similar with respect either to content or to direction of wording. Psychometric properties of sets of these items worded in the same direction were compared with sets consisting of both straightforward and reversed worded items. Results We did not find evidence that ten reverse-worded items prevented response bias. Instead, the data suggest scores were contaminated by respondent inattention and confusion. Conclusions Using twenty items, balanced for scoring direction, to assess fatigue did not prevent respondents from inattentive or acquiescent answering. Rather, fewer mistakes are made with a 10-item instrument with items posed in the same direction. Such a format is preferable for both epidemiological and clinical studies. PMID:23935915

  3. Development of the PROMIS® Positive Emotional and Sensory Expectancies of Smoking Item Banks

    PubMed Central

    Shadel, William G.; Edelen, Maria Orlando; Stucky, Brian D.; Li, Zhen; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Methods: Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Results: Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. Conclusions: These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. PMID:25118228

  4. Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.

    PubMed

    Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li

    2014-09-01

    The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  5. Exposures from mining and mine tailings

    NASA Astrophysics Data System (ADS)

    Chambers, Douglas B.; Cassaday, Valerie J.; Lowe, Leo M.

    The mining, milling and tailings management of uranium ores results in environmental radiation exposures. This paper describes the sources of radioactive emissions to the environment associated with these activities, reviews the basic approach used to estimate the resultant radiation exposures and presents examples of typical uranium mind and mill facilities. Similar concepts apply to radiation exposures associated with the mining of non-radioactive ores although the magnitudes of the exposures would normally be smaller than those associated with uranium mining.

  6. A New Item Selection Procedure for Mixed Item Type in Computerized Classification Testing.

    ERIC Educational Resources Information Center

    Lau, C. Allen; Wang, Tianyou

    This paper proposes a new Information-Time index as the basis for item selection in computerized classification testing (CCT) and investigates how this new item selection algorithm can help improve test efficiency for item pools with mixed item types. It also investigates how practical constraints such as item exposure rate control, test…

  7. Mining lease handbook

    SciTech Connect

    Not Available

    1992-01-01

    Mining leases and similar agreements are some of the most common documents encountered by mining attorneys. The mining Lease Handbook contains a collection of mining lease clauses which have been organized and assembled for over 25 years. The clauses in this book have been coordinated and cross-referenced to enable the Handbook user to create a mining lease having a logical structure with consistent terminology throughout. In many cases, alternative clauses are included. The accompanying commentary provides insight into the use of the various clauses while pointing our pitfalls to be avoided. This Handbook is devoted primarily to mining leases, several chapters cover the subjects of options, subleases, and ancillary documents.

  8. Evaluation of measurement equivalence of the Family Satisfaction with the End-of-Life Care in an ethnically diverse cohort: tests of differential item functioning.

    PubMed

    Teresi, Jeanne A; Ocepek-Welikson, Katja; Ramirez, Mildred; Kleinman, Marjorie; Ornstein, Katherine; Siu, Albert

    2015-01-01

    The Family Satisfaction with End-of-Life Care is an internationally used measure of satisfaction with cancer care. However, the Family Satisfaction with End-of-Life Care has not been studied for equivalence of item endorsement across different socio-demographic groups using differential item functioning. The aims of this secondary data analysis were (1) to examine potential differential item functioning in the family satisfaction item set with respect to type of caregiver, race, and patient age, gender, and education and (2) to provide parameters and documentation of differential item functioning for an item bank. A mixed qualitative and quantitative analysis was conducted. A priori hypotheses regarding potential group differences in item response were established. Item response theory and Wald tests were used for the analyses of differential item functioning, accompanied by magnitude and impact measures. Very little significant differential item functioning was observed for patient's age and gender. For race, 13 items showed differential item functioning after multiple comparison adjustment, 10 with non-uniform differential item functioning. No items evidenced differential item functioning of high magnitude, and the impact was negligible. For education, 5 items evidenced uniform differential item functioning after adjustment, none of high magnitude. Differential item functioning impact was trivial. One item evidenced differential item functioning for the caregiver relationship variable. Differential item functioning was observed primarily for race and education. No differential item functioning of high magnitude was observed for any item, and the overall impact of differential item functioning was negligible. One item, satisfaction with "the patient's pain relief," might be singled out for further study, given that this item was both hypothesized and observed to show differential item functioning for race and education. © The Author(s) 2014.

  9. Open-source tools for data mining.

    PubMed

    Zupan, Blaz; Demsar, Janez

    2008-03-01

    With a growing volume of biomedical databases and repositories, the need to develop a set of tools to address their analysis and support knowledge discovery is becoming acute. The data mining community has developed a substantial set of techniques for computational treatment of these data. In this article, we discuss the evolution of open-source toolboxes that data mining researchers and enthusiasts have developed over the span of a few decades and review several currently available open-source data mining suites. The approaches we review are diverse in data mining methods and user interfaces and also demonstrate that the field and its tools are ready to be fully exploited in biomedical research.

  10. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    PubMed

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Numerosity estimates for attended and unattended items in visual search.

    PubMed

    Kelley, Troy D; Cassenti, Daniel N; Marusich, Laura R; Ghirardelli, Thomas G

    2017-07-01

    The goal of this research was to examine memories created for the number of items during a visual search task. Participants performed a visual search task for a target defined by a single feature (Experiment 1A), by a conjunction of features (Experiment 1B), or by a specific spatial configuration of features (Experiment 1C). On some trials following the search task, subjects were asked to recall the total number of items in the previous display. In all search types, participants underestimated the total number of items, but the severity of the underestimation varied depending on the efficiency of the search. In three follow-up studies (Experiments 2A, 2B, and 2C) using the same visual stimuli, the participants' only task was to estimate the number of items on each screen. Participants still underestimated the numerosity of the items, although the degree of underestimation was smaller than in the search tasks and did not depend on the type of visual stimuli. In Experiment 3, participants were asked to recall the number of items in a display only once. Subjects still displayed a tendency to underestimate, indicating that the underestimation effects seen in Experiments 1A-1C were not attributable to knowledge of the estimation task. The degree of underestimation depends on the efficiency of the search task, with more severe underestimation in efficient search tasks. This suggests that the lower attentional demands of very efficient searches leads to less encoding of numerosity of the distractor set.

  12. Applications of Item Response Theory to Partial Credit Scoring.

    ERIC Educational Resources Information Center

    Wise, Steven L., Ed.; And Others

    1988-01-01

    Six papers on the use of partial credit item response theory score models in applied measurement settings are presented. These applications include the scoring of medical certification examinations using computer-based patient simulations, narrative writing tests, and educational diagnosis. (TJH)

  13. Building an Evaluation Scale using Item Response Theory

    PubMed Central

    Lalor, John P.; Wu, Hao; Yu, Hong

    2016-01-01

    Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generation and NLP system evaluation. IRT is able to describe characteristics of individual items - their difficulty and discriminating power - and can account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we demonstrate IRT by generating a gold-standard test set for Recognizing Textual Entailment. By collecting a large number of human responses and fitting our IRT model, we show that our IRT model compares NLP systems with the performance in a human population and is able to provide more insight into system performance than standard evaluation metrics. We show that a high accuracy score does not always imply a high IRT score, which depends on the item characteristics and the response pattern.1 PMID:28004039

  14. Item Response Theory: Overview, Applications, and Promise for Institutional Research

    ERIC Educational Resources Information Center

    Bowman, Nicholas A.; Herzog, Serge; Sharkness, Jessica

    2014-01-01

    Item Response Theory (IRT) is a measurement theory that is ideal for scale and test development in institutional research, but it is not without its drawbacks. This chapter provides an overview of IRT, describes an example of its use, and highlights the pros and cons of using IRT in applied settings.

  15. Scores Based on Dangerous Responses to Multiple-Choice Items.

    ERIC Educational Resources Information Center

    Grosse, Martin E.

    1986-01-01

    Scores based on the number of correct answers were compared with scores based on dangerous responses to items in the same multiple choice test developed by American Board of Orthopaedic Surgery. Results showed construct validity for both sets of scores. However, both scores were redundant when evaluated by correlation coefficient. (Author/JAZ)

  16. Design Patterns for Digital Item Types in Higher Education

    ERIC Educational Resources Information Center

    Draaijer, S.; Hartog, R. J. M.

    2007-01-01

    A set of design patterns for digital item types has been developed in response to challenges identified in various projects by teachers in higher education. The goal of the projects in question was to design and develop formative and summative tests, and to develop interactive learning material in the form of quizzes. The subject domains involved…

  17. Scores Based on Dangerous Responses to Multiple-Choice Items.

    ERIC Educational Resources Information Center

    Grosse, Martin E.

    1986-01-01

    Scores based on the number of correct answers were compared with scores based on dangerous responses to items in the same multiple choice test developed by American Board of Orthopaedic Surgery. Results showed construct validity for both sets of scores. However, both scores were redundant when evaluated by correlation coefficient. (Author/JAZ)

  18. The Consequence of Refreshing for Access to Non-Selected Items in Young and Older Adults

    PubMed Central

    Higgins, Julie A.; Johnson, Marcia K.

    2009-01-01

    We examined the effect of competition on briefly thinking of just-seen items. In Experiment 1, participants saw a set of three related or unrelated words, and then read one of the words again (repeat) or thought briefly of one of the words (refresh). Participants read the set a second time, after which they refreshed a second word from the set or read a new word. Compared to reading a new word, response times were slower to refresh the second item having just refreshed vs. just repeated the first item. This increase was larger for related vs. unrelated words and for older vs. younger adults. In Experiment 2, a negative impact of refreshing was observed when repeating a different word from the set. The pattern of findings suggest that the negative impact of refreshing comes from increased competition from the refreshed item rather than inhibition of the non-refreshed items. PMID:19223566

  19. Detecting Gender Bias Through Test Item Analysis

    NASA Astrophysics Data System (ADS)

    González-Espada, Wilson J.

    2009-03-01

    Many physical science and physics instructors might not be trained in pedagogically appropriate test construction methods. This could lead to test items that do not measure what they are intended to measure. A subgroup of these items might show bias against some groups of students. This paper describes how the author became aware of potentially biased items against females in his examinations, which led to the exploration of fundamental issues related to item validity, gender bias, and differential item functioning, or DIF. A brief discussion of DIF in the context of university courses, as well as practical suggestions to detect possible gender-biased items, follows.

  20. The 4-Item Negative Symptom Assessment (NSA-4) Instrument

    PubMed Central

    Morlock, Robert; Coon, Cheryl; van Willigenburg, Arjen; Panagides, John

    2010-01-01

    Objective. To assess the ability of mental health professionals to use the 4-item Negative Symptom Assessment instrument, derived from the Negative Symptom Assessment-16, to rapidly determine the severity of negative symptoms of schizophrenia. Design. Open participation. Setting. Medical education conferences. Participants. Attendees at two international psychiatry conferences. Measurements. Participants read a brief set of the 4-item Negative Symptom Assessment instructions and viewed a videotape of a patient with schizophrenia. Using the 1 to 6 4-item Negative Symptom Assessment severity rating scale, they rated four negative symptom items and the overall global negative symptoms. These ratings were compared with a consensus rating determination using frequency distributions and Chi-square tests for the proportion of participant ratings that were within one point of the expert rating. Results. More than 400 medical professionals (293 physicians, 50% with a European practice, and 55% who reported past utilization of schizophrenia ratings scales) participated. Between 82.1 and 91.1 percent of the 4-items and the global rating determinations by the participants were within one rating point of the consensus expert ratings. The differences between the percentage of participant rating scores that were within one point versus the percentage that were greater than one point different from those by the consensus experts was significant (p<0.0001). Participants rating of negative symptoms using the 4-item Negative Symptom Assessment did not generally differ among the geographic regions of practice, the professional credentialing, or their familiarity with the use of schizophrenia symptom rating instruments. Conclusion. These findings suggest that clinicians from a variety of geographic practices can, after brief training, use the 4-item Negative Symptom Assessment effectively to rapidly assess negative symptoms in patients with schizophrenia. PMID:20805916

  1. Feedback control and synchronization of Mandelbrot sets

    NASA Astrophysics Data System (ADS)

    Zhang, Yong-Ping

    2013-01-01

    The movement of a particle could be depicted by the Mandelbrot set from the fractal viewpoint. According to the requirement, the movement of the particle needs to show different behaviors. In this paper, the feedback control method is taken on the classical Mandelbrot set. By amending the feedback item in the controller, the control method is applied to the generalized Mandelbrot set and by taking the reference item to be the trajectory of another system, the synchronization of Mandelbrot sets is achieved.

  2. Abandoned Mine Lands

    EPA Pesticide Factsheets

    Abandoned Mine Lands are those lands, waters, and surrounding watersheds where extraction, beneficiation, or processing of ores and minerals (excluding coal) has occurred. These lands also include areas where mining or processing activity is inactive.

  3. Mine drainage and surface mine reclamation. Volume II. Mine reclamation, abandoned mine lands and policy issues

    SciTech Connect

    Not Available

    1988-01-01

    Mine waste and mine reclamation are topics of major interest to the mining industry, the government and the general public. This publication and its companion volume are the proceedings of a conference held in Pittsburgh, April 19-21, 1988. There were nine sessions (50 papers) that dealt with the geochemistry, hydrology and problems of mine waste and mine water, especially acid mine drainage. These comprise Volume 1. The nine sessions (43 papers) that dealt with reclamation and restoration of disturbed lands, as well as related policy issues, are included in volume 2. Volume 2 also contains the ten papers that pertained to control of subsidence and mine fires at abandoned mines. Poster session presentations are, in general, represented by abstracts; these have been placed in the back of both volumes.

  4. Mine drainage and surface mine reclamation. Volume I. Mine water and mine waste

    SciTech Connect

    Not Available

    1988-01-01

    Mine waste and mine reclamation are topics of major interest to the mining industry, the government and the general public. This publication and its companion volume are the proceedings of a conference held in Pittsburgh, April 19-21, 1988. There were nine sessions (50 papers) that dealt with the geochemistry, hydrology and problems of mine waste and mine water, especially acid mine drainage. These comprise Volume 1. The nine sessions (43 papers) that dealt with reclamation and restoration of disturbed lands, as well as related policy issues, are included in volume 2. Volume 2 also contains the ten papers that pertained to control of subsidence and mine fires at abandoned mines. Poster session presentations are, in general, represented by abstracts; these have been placed in the back of both volumes.

  5. Substructure mining using elaborate chemical representation.

    PubMed

    Kazius, Jeroen; Nijssen, Siegfried; Kok, Joost; Bäck, Thomas; Ijzerman, Adriaan P

    2006-01-01

    Substructure mining algorithms are important drug discovery tools since they can find substructures that affect physicochemical and biological properties. Current methods, however, only consider a part of all chemical information that is present within a data set of compounds. Therefore, the overall aim of our study was to enable more exhaustive data mining by designing methods that detect all substructures of any size, shape, and level of chemical detail. A means of chemical representation was developed that uses atomic hierarchies, thus enabling substructure mining to consider general and/or highly specific features. As a proof-of-concept, the efficient, multipurpose graph mining system Gaston learned substructures of any size and shape from a mutagenicity data set that was represented in this manner. From these substructures, we extracted a set of only six nonredundant, discriminative substructures that represent relevant biochemical knowledge. Our results demonstrate the individual and synergistic importance of elaborate chemical representation and mining for nonlinear substructures. We conclude that the combination of elaborate chemical representation and Gaston provides an excellent method for 2D substructure mining as this recipe systematically explores all substructures in different levels of chemical detail.

  6. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    ERIC Educational Resources Information Center

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  7. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    ERIC Educational Resources Information Center

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  8. The enviornmental assessment of a contemporary coal mining system

    NASA Technical Reports Server (NTRS)

    Dutzi, E. J.; Sullivan, P. J.; Hutchinson, C. F.; Stevens, C. M.

    1980-01-01

    A contemporary underground coal mine in eastern Kentucky was assessed in order to determine potential off-site and on-site environmental impacts associated with the mining system in the given environmental setting. A 4 section, continuous room and pillor mine plan was developed for an appropriate site in eastern Kentucky. Potential environmental impacts were identified, and mitigation costs determined. The major potential environmental impacts were determined to be: acid water drainage from the mine and refuse site, uneven subsidence of the surface as a result of mining activity, and alteration of ground water aquifers in the subsidence zone. In the specific case examined, the costs of environmental impact mitigation to levels prescribed by regulations would not exceed $1/ton of coal mined, and post mining land values would not be affected.

  9. Proceedings: Fourth Workshop on Mining Scientific Datasets

    SciTech Connect

    Kamath, C

    2001-07-24

    Commercial applications of data mining in areas such as e-commerce, market-basket analysis, text-mining, and web-mining have taken on a central focus in the JCDD community. However, there is a significant amount of innovative data mining work taking place in the context of scientific and engineering applications that is not well represented in the mainstream KDD conferences. For example, scientific data mining techniques are being developed and applied to diverse fields such as remote sensing, physics, chemistry, biology, astronomy, structural mechanics, computational fluid dynamics etc. In these areas, data mining frequently complements and enhances existing analysis methods based on statistics, exploratory data analysis, and domain-specific approaches. On the surface, it may appear that data from one scientific field, say genomics, is very different from another field, such as physics. However, despite their diversity, there is much that is common across the mining of scientific and engineering data. For example, techniques used to identify objects in images are very similar, regardless of whether the images came from a remote sensing application, a physics experiment, an astronomy observation, or a medical study. Further, with data mining being applied to new types of data, such as mesh data from scientific simulations, there is the opportunity to apply and extend data mining to new scientific domains. This one-day workshop brings together data miners analyzing science data and scientists from diverse fields to share their experiences, learn how techniques developed in one field can be applied in another, and better understand some of the newer techniques being developed in the KDD community. This is the fourth workshop on the topic of Mining Scientific Data sets; for information on earlier workshops, see http://www.ahpcrc.org/conferences/. This workshop continues the tradition of addressing challenging problems in a field where the diversity of applications is

  10. Development of the PROMIS® Negative Psychosocial Expectancies of Smoking Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Shadel, William G.; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. Results: A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Conclusions: Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. PMID:25118230

  11. Using automatic item generation to create multiple-choice test items.

    PubMed

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.

  12. 7 CFR 1767.24 - Extraordinary items.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... AGRICULTURE (CONTINUED) ACCOUNTING REQUIREMENTS FOR RUS ELECTRIC BORROWERS Uniform System of Accounts § 1767... would significantly distort the current year's income computed before extraordinary items, if reported... would significantly distort the current year's income computed before extraordinary items, if reported...

  13. Mountaintop mining consequences

    Treesearch

    M.A. Palmer; E.S. Bernhardt; W.H. Schlesinger; K.N. Eshleman; E. Foufoula-Georgiou; M.S. Hendryx; A.D. Lemly; G.E. Likens; O.L. Loucks; M.E. Power; P.S. White; P.R. Wilcock

    2010-01-01

    There has been a global, 30-year increase in surface mining (1), which is now the dominant driver of land-use change in the central Appalachian ecoregion of the United States (2). One major form of such mining, mountaintop mining with valley fills (MTM/VF) (3), is widespread throughout eastern Kentucky, West Virginia (WV), and southwestern Virginia. Upper elevation...

  14. Orapa Diamond Mine, Botswana

    NASA Image and Video Library

    2015-11-16

    This image from NASA Terra spacecraft shows the Orapa diamond mine, the world largest diamond mine by area. The mine is located in Botswana. It is the oldest of four mines operated by the same company, having begun operations in 1971. Orapa is an open pit style of mine, located on two kimberlite pipes. Currently, the Orapa mine annually produces approximately 11 million carats (2200 kg) of diamonds. The Letlhakane diamond mine is also an open pit construction. In 2003, the Letlhakane mine produced 1.06 million carats of diamonds. The Damtshaa diamond mine is the newest of four mines, located on top of four distinct kimberlite pipes of varying ore grade. The mine is forecast to produce about 5 million carats of diamond over the projected 31 year life of the mine. The image was acquired October 5, 2014, covers an area of 28 by 45 km, and is located at 21.3 degrees south, 25.4 degrees east. http://photojournal.jpl.nasa.gov/catalog/PIA20104

  15. Mountaintop mining update

    SciTech Connect

    Buchsbaum, L.

    2006-07-15

    In a bad year for the US mining industry's safety record and public image, Morehead State University hosted a public meeting titled 'Mountaintop mining, health and safety forum'. This was a balanced event, with representatives from the mining industry as well as activists from the environmental community. A full account is given of the presentations and debate at the forum. 6 photos.

  16. Data Mining for CRM

    NASA Astrophysics Data System (ADS)

    Thearling, Kurt

    Data Mining technology allows marketing organizations to better understand their customers and respond to their needs. This chapter describes how Data Mining can be combined with customer relationship management to help drive improved interactions with customers. An example showing how to use Data Mining to drive customer acquisition activities is presented.

  17. The Disgust Scale: Item Analysis, Factor Structure, and Suggestions for Refinement

    ERIC Educational Resources Information Center

    Olatunji, Bunmi O.; Williams, Nathan L.; Tolin, David F.; Abramowitz, Jonathan S.; Sawchuk, Craig N.; Lohr, Jeffrey M.; Elwood, Lisa S.

    2007-01-01

    In the 4 studies presented (N = 1,939), a converging set of analyses was conducted to evaluate the item adequacy, factor structure, reliability, and validity of the Disgust Scale (DS; J. Haidt, C. McCauley, & P. Rozin, 1994). The results suggest that 7 items (i.e., Items 2, 7, 8, 21, 23, 24, and 25) should be considered for removal from the DS.…

  18. The Disgust Scale: Item Analysis, Factor Structure, and Suggestions for Refinement

    ERIC Educational Resources Information Center

    Olatunji, Bunmi O.; Williams, Nathan L.; Tolin, David F.; Abramowitz, Jonathan S.; Sawchuk, Craig N.; Lohr, Jeffrey M.; Elwood, Lisa S.

    2007-01-01

    In the 4 studies presented (N = 1,939), a converging set of analyses was conducted to evaluate the item adequacy, factor structure, reliability, and validity of the Disgust Scale (DS; J. Haidt, C. McCauley, & P. Rozin, 1994). The results suggest that 7 items (i.e., Items 2, 7, 8, 21, 23, 24, and 25) should be considered for removal from the DS.…

  19. Geologic setting and water quality of selected basins in the active coal-mining areas of Ohio, 1989-91, with a summary of water quality for 1985-91

    USGS Publications Warehouse

    Sedam, A.C.; Francy, D.S.

    1993-01-01

    This report presents streamwater- and ground-water-quality data collected to characterize the baseline water quality for 21 drainage basins in the coal-mining region of eastern Ohio. The study area is mostly within the unglaciated part of eastern Ohio along the western edge of the Appalachian Plateaus Physiographic Province. The data collected from 1989-91 and presented in this report represent the third and final phase of a 7-year study to assess baseline water quality in Ohio's coal region during 1985-1991. During 1989-91, 246 samples from 41 streamwater sites were collected periodically from a long-term site network. Ranges and medians of measurements made at the long-term streamwater sites were following: specific conductance, 270 to 5,170 and 792 microsiemens per centimeter at 25 degrees Celsius; pH, 2.7 to 9.1 and 7.8; alkalinity, 1 to 391 and 116 mg/L (milligrams per liter). Ranges and medians of laboratory analyses of the same samples were the following: dissolved sulfate, 13 to 2,100 and 200 mg/L; dissolved aluminum, <10 to 17,000 and 300 ? /L (micrograms per liter); dissolved iron, <10 to 53,000 and 60 ? /L; and dissolved manganese, <10 to 17,000 and 295 ? /L. The ranges for concentrations of total recoverable aluminum, iron, and manganese were similar to the ranges of concentrations found for dissolved constituents. Medians of total recoverable aluminum and iron were about 10 times greater than the medians of dissolved aluminum and iron. During 1989-91, once-only sample collections were done at 45 streamwater sites in nine basins chosen for synoptic sampling. At several sites in the Middle Hocking River basin and Leading Creek basin, water had low pH and high concentrations of dissolved aluminum, iron and manganese. These water-quality characteristics are commonly associated with ace mine drainage. Throughout the entire 7-year study (1985-91), medians for most constituents at the long-term streamwater-sampling sites were fairly consistent, despite the

  20. A Balance Sheet for Educational Item Banking.

    ERIC Educational Resources Information Center

    Hiscox, Michael D.

    Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…

  1. Using Multimedia Technology To Create Innovative Items.

    ERIC Educational Resources Information Center

    French, Ann; Godwin, Janet

    The development of innovative test item types that use multimedia technology to improve item authenticity and interaction and allow for objective scoring through partial-credit scoring methodologies was studied. Science test items were developed for community college developmental students using "Authorware 3.0," an instructional compact disc. The…

  2. Criterion-Referenced Test Items for Welding.

    ERIC Educational Resources Information Center

    Davis, Diane, Ed.

    This test item bank on welding contains test questions based upon competencies found in the Missouri Welding Competency Profile. Some test items are keyed for multiple competencies. These criterion-referenced test items are designed to work with the Vocational Instructional Management System. Questions have been statistically sampled and validated…

  3. Item Selection in Computerized Classification Testing

    ERIC Educational Resources Information Center

    Thompson, Nathan A.

    2009-01-01

    Several alternatives for item selection algorithms based on item response theory in computerized classification testing (CCT) have been suggested, with no conclusive evidence on the substantial superiority of a single method. It is argued that the lack of sizable effect is because some of the methods actually assess items very similarly through…

  4. Generalized Full-Information Item Bifactor Analysis

    ERIC Educational Resources Information Center

    Cai, Li; Yang, Ji Seung; Hansen, Mark

    2011-01-01

    Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single-group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of…

  5. Differential Item Functioning from a Multilevel Perspective.

    ERIC Educational Resources Information Center

    van den Bergh, Huub; And Others

    The term differential item functioning (DIF) refers to whether or not the same psychological constructs are measured across different groups. If an item does not measure the same skills or subskills in different populations, it is said to function differentially or to display item bias. A multilevel approach to DIF is proposed. In such a model,…

  6. 42 CFR 414.224 - Customized items.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... for the same purpose that the two items cannot be grouped together for pricing purposes. (b) Payment rule. Payment is made on a lump sum basis for the purchase of a customized item based on the carrier's... individual consideration takes into account written documentation on the costs of the item including at least...

  7. Binomial Test Models and Item Difficulty.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    1979-01-01

    The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)

  8. Are All Item Response Functions Monotonically Increasing?

    ERIC Educational Resources Information Center

    Wang, Wenhao

    2012-01-01

    Item response functions of the parametric logistic IRT models follow the logistic form which is monotonically increasing. However, item response functions of some real items are nonmonotonic which might lead to examinees with lower proficiency levels receiving higher scores. This study compared three nonparametric IRF estimation methods--the…

  9. Computer Equipment Repair Test Item Bank.

    ERIC Educational Resources Information Center

    Reneau, Fred; And Others

    This guide contains 321 test items for use in teaching a course in repairing computer equipment. All test items were reviewed, revised, and validated by incumbent workers and subject matter instructors. Items are provided for assessing student achievement in the following skill areas (with selected skills mentioned in brackets): performing…

  10. Processing Polarity Items: Contrastive Licensing Costs

    ERIC Educational Resources Information Center

    Saddy, Douglas; Drenhaus, Heiner; Frisch, Stefan

    2004-01-01

    We describe an experiment that investigated the failure to license polarity items in German using event-related brain potentials (ERPs). The results reveal distinct processing reflexes associated with failure to license positive polarity items in comparison to failure to license negative polarity items. Failure to license both negative and…

  11. Effects of Including Humor in Test Items.

    ERIC Educational Resources Information Center

    McMorris, Robert F.; And Others

    Two 50-item multiple-choice forms of a grammar test were developed differing only in humor being included in 20 items of one form. One hundred twenty-six (126) eighth graders received the test plus alternate forms of a questionnaire. Humor inclusion did not affect grammar scores on matched humorous/nonhumorous items nor on common post-treatment…

  12. 76 FR 60474 - Commercial Item Handbook

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-09-29

    ... Defense Acquisition Regulations System Commercial Item Handbook AGENCY: Defense Acquisition Regulations... Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on the contents before...

  13. Processing Polarity Items: Contrastive Licensing Costs

    ERIC Educational Resources Information Center

    Saddy, Douglas; Drenhaus, Heiner; Frisch, Stefan

    2004-01-01

    We describe an experiment that investigated the failure to license polarity items in German using event-related brain potentials (ERPs). The results reveal distinct processing reflexes associated with failure to license positive polarity items in comparison to failure to license negative polarity items. Failure to license both negative and…

  14. Are All Item Response Functions Monotonically Increasing?

    ERIC Educational Resources Information Center

    Wang, Wenhao

    2012-01-01

    Item response functions of the parametric logistic IRT models follow the logistic form which is monotonically increasing. However, item response functions of some real items are nonmonotonic which might lead to examinees with lower proficiency levels receiving higher scores. This study compared three nonparametric IRF estimation methods--the…

  15. Real and Artificial Differential Item Functioning

    ERIC Educational Resources Information Center

    Andrich, David; Hagquist, Curt

    2012-01-01

    The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…

  16. Effects of Assigning Raters to Items

    ERIC Educational Resources Information Center

    Sykes, Robert C.; Ito, Kyoko; Wang, Zhen

    2008-01-01

    Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…

  17. FACTOR STRUCTURE OF MF SCALES AND ITEMS.

    ERIC Educational Resources Information Center

    LUNNEBORG, CLIFFORD E.; LUNNEBORG, PATRICIA W.

    FACTOR ANALYSES WERE PERFORMED UPON FOUR MASCULINITY-FEMINITY (MF) SCALES AND UPON THE 136-ITEMS COMPRISING THESE SCALES. RESULTS OF THE FIRST ANALYSIS ILLUSTRATED THE DIFFICULTY OF INTERPRETING FACTORS BASED ON HETEROGENEOUS SCALES. THE ITEM FACTORING REVEALED THE MULTIDIMENSIONALITY OF MF. CERTAIN ITEM FACTORS WERE UNCORRELATED WITH SEX STATUS…

  18. Item Response Methods for Educational Assessment.

    ERIC Educational Resources Information Center

    Mislevy, Robert J.; Rieser, Mark R.

    Multiple matrix sampling (MMS) theory indicates how data may be gathered to most efficiently convey information about levels of attainment in a population, but standard analyses of these data require random sampling of items from a fixed pool of items. This assumption proscribes the retirement of flawed or obsolete items from the pool as well as…

  19. Ramsay-Curve Differential Item Functioning

    ERIC Educational Resources Information Center

    Woods, Carol M.

    2011-01-01

    Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…

  20. The Multidimensionality of Verbal Analogy Items

    ERIC Educational Resources Information Center

    Ullstadius, Eva; Carlstedt, Berit; Gustafsson, Jan-Eric

    2008-01-01

    The influence of general and verbal ability on each of 72 verbal analogy test items were investigated with new factor analytical techniques. The analogy items together with the Computerized Swedish Enlistment Battery (CAT-SEB) were given randomly to two samples of 18-year-old male conscripts (n = 8566 and n = 5289). Thirty-two of the 72 items had…

  1. Identification of a Semiparametric Item Response Model

    ERIC Educational Resources Information Center

    Peress, Michael

    2012-01-01

    We consider the identification of a semiparametric multidimensional fixed effects item response model. Item response models are typically estimated under parametric assumptions about the shape of the item characteristic curves (ICCs), and existing results suggest difficulties in recovering the distribution of individual characteristics under…

  2. The Multidimensionality of Verbal Analogy Items

    ERIC Educational Resources Information Center

    Ullstadius, Eva; Carlstedt, Berit; Gustafsson, Jan-Eric

    2008-01-01

    The influence of general and verbal ability on each of 72 verbal analogy test items were investigated with new factor analytical techniques. The analogy items together with the Computerized Swedish Enlistment Battery (CAT-SEB) were given randomly to two samples of 18-year-old male conscripts (n = 8566 and n = 5289). Thirty-two of the 72 items had…

  3. Linking Item Parameters to a Base Scale

    ERIC Educational Resources Information Center

    Kang, Taehoon; Petersen, Nancy S.

    2012-01-01

    This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord in "Appl Psychol Measure"…

  4. Item Response Methods for Educational Assessment.

    ERIC Educational Resources Information Center

    Mislevy, Robert J.; Rieser, Mark R.

    Multiple matrix sampling (MMS) theory indicates how data may be gathered to most efficiently convey information about levels of attainment in a population, but standard analyses of these data require random sampling of items from a fixed pool of items. This assumption proscribes the retirement of flawed or obsolete items from the pool as well as…

  5. On the Relationship between Differential Item Functioning and Item Difficulty: An Issue of Methods? Item Response Theory Approach to Differential Item Functioning

    ERIC Educational Resources Information Center

    Santelices, Maria Veronica; Wilson, Mark

    2012-01-01

    The relationship between differential item functioning (DIF) and item difficulty on the SAT is such that more difficult items tended to exhibit DIF in favor of the focal group (usually minority groups). These results were reported by Kulick and Hu, and Freedle and have been enthusiastically discussed by more recent literature. Examining the…

  6. Real and Artificial Differential Item Functioning

    ERIC Educational Resources Information Center

    Andrich, David; Hagquist, Curt

    2012-01-01

    The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…

  7. Item Banking Using Sample Free Calibration.

    ERIC Educational Resources Information Center

    Choppin, Bruce

    1985-01-01

    The potential value of developing item banks that would both guarantee the quality of teacher-made examinations and allow for national comparison of student achievement is described. Use of Rasch model methods, rather than classical item analysis techniques, to calibrate the items for these banks is illustrated. (BS)

  8. An Iterative Item Bias Detection Method.

    ERIC Educational Resources Information Center

    Van Der Flier, Henk; And Others

    1984-01-01

    Two strategies for assessing item bias are discussed: methods comparing item difficulties unconditional on ability and methods comparing probabilities of response conditional on ability. Results suggest that the iterative logit method is an improvement on the noniterative one and is efficient in detecting biased and unbiased items. (Author/DWH)

  9. Generalized Full-Information Item Bifactor Analysis

    ERIC Educational Resources Information Center

    Cai, Li; Yang, Ji Seung; Hansen, Mark

    2011-01-01

    Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single-group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of…

  10. Effects of Assigning Raters to Items

    ERIC Educational Resources Information Center

    Sykes, Robert C.; Ito, Kyoko; Wang, Zhen

    2008-01-01

    Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…

  11. Linking Item Parameters to a Base Scale

    ERIC Educational Resources Information Center

    Kang, Taehoon; Petersen, Nancy S.

    2012-01-01

    This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord in "Appl Psychol Measure"…

  12. Chemistry Sets Face Uncertain Future.

    ERIC Educational Resources Information Center

    Stinson, Stephen C.

    1979-01-01

    Chemistry sets, often a child's first contact with chemistry, are becoming less attractive to manufacturers as the market for these items decreases. There is a tendency for recently manufactured chemistry sets to be less adequate than those selling in the same price range in past years. Manuals vary in quality among manufacturers. (RE)

  13. Emergency Power For Critical Items

    NASA Astrophysics Data System (ADS)

    Young, William R.

    2009-07-01

    Natural disasters, such as hurricanes, floods, tornados, and tsunami, are becoming a greater problem as climate change impacts our environment. Disasters, whether natural or man made, destroy lives, homes, businesses and the natural environment. Such disasters can happen with little or no warning, leaving hundreds or even thousands of people without medical services, potable water, sanitation, communications and electrical services for up to several weeks. In our modern world, the need for electricity has become a necessity. Modern building codes and new disaster resistant building practices are reducing the damage to homes and businesses. Emergency gasoline and diesel generators are becoming common place for power outages. Generators need fuel, which may not be available after a disaster, but Photovoltaic (solar-electric) systems supply electricity without petroleum fuel as they are powered by the sun. Photovoltaic (PV) systems can provide electrical power for a home or business. PV systems can operate as utility interactive or stand-alone with battery backup. Determining your critical load items and sizing the photovoltaic system for those critical items, guarantees their operation in a disaster.

  14. Intact Transition Epitope Mapping (ITEM)

    NASA Astrophysics Data System (ADS)

    Yefremova, Yelena; Opuni, Kwabena F. M.; Danquah, Bright D.; Thiesen, Hans-Juergen; Glocker, Michael O.

    2017-08-01

    Intact transition epitope mapping (ITEM) enables rapid and accurate determination of protein antigen-derived epitopes by either epitope extraction or epitope excision. Upon formation of the antigen peptide-containing immune complex in solution, the entire mixture is electrosprayed to translate all constituents as protonated ions into the gas phase. There, ions from antibody-peptide complexes are separated from unbound peptide ions according to their masses, charges, and shapes either by ion mobility drift or by quadrupole ion filtering. Subsequently, immune complexes are dissociated by collision induced fragmentation and the ion signals of the "complex-released peptides," which in effect are the epitope peptides, are recorded in the time-of-flight analyzer of the mass spectrometer. Mixing of an antibody solution with a solution in which antigens or antigen-derived peptides are dissolved is, together with antigen proteolysis, the only required in-solution handling step. Simplicity of sample handling and speed of analysis together with very low sample consumption makes ITEM faster and easier to perform than other experimental epitope mapping methods.

  15. An incremental high-utility mining algorithm with transaction insertion.

    PubMed

    Lin, Jerry Chun-Wei; Gan, Wensheng; Hong, Tzung-Pei; Zhang, Binbin

    2015-01-01

    Association-rule mining is commonly used to discover useful and meaningful patterns from a very large database. It only considers the occurrence frequencies of items to reveal the relationships among itemsets. Traditional association-rule mining is, however, not suitable in real-world applications since the purchased items from a customer may have various factors, such as profit or quantity. High-utility mining was designed to solve the limitations of association-rule mining by considering both the quantity and profit measures. Most algorithms of high-utility mining are designed to handle the static database. Fewer researches handle the dynamic high-utility mining with transaction insertion, thus requiring the computations of database rescan and combination explosion of pattern-growth mechanism. In this paper, an efficient incremental algorithm with transaction insertion is designed to reduce computations without candidate generation based on the utility-list structures. The enumeration tree and the relationships between 2-itemsets are also adopted in the proposed algorithm to speed up the computations. Several experiments are conducted to show the performance of the proposed algorithm in terms of runtime, memory consumption, and number of generated patterns.

  16. Development and community-based validation of eight item banks to assess mental health.

    PubMed

    Batterham, Philip J; Sunderland, Matthew; Carragher, Natacha; Calear, Alison L

    2016-09-30

    There is a need for precise but brief screening of mental health problems in a range of settings. The development of item banks to assess depression and anxiety has resulted in new adaptive and static screeners that accurately assess severity of symptoms. However, expansion to a wider array of mental health problems is required. The current study developed item banks for eight mental health problems: social anxiety disorder, panic disorder, post-traumatic stress disorder, obsessive-compulsive disorder, adult attention-deficit hyperactivity disorder, drug use, psychosis and suicidality. The item banks were calibrated in a population-based Australian adult sample (N=3175) by administering large item pools (45-75 items) and excluding items on the basis of local dependence or measurement non-invariance. Item Response Theory parameters were estimated for each item bank using a two-parameter graded response model. Each bank consisted of 19-47 items, demonstrating excellent fit and precision across a range of -1 to 3 standard deviations from the mean. No previous study has developed such a broad range of mental health item banks. The calibrated item banks will form the basis of a new system of static and adaptive measures to screen for a broad array of mental health problems in the community.

  17. Caution warranted in extrapolating from Boston Naming Test item gradation construct.

    PubMed

    Beattey, Robert A; Murphy, Hilary; Cornwell, Melinda; Braun, Thomas; Stein, Victoria; Goldstein, Martin; Bender, Heidi Allison

    2017-01-01

    The Boston Naming Test (BNT) was designed to present items in order of difficulty based on word frequency. Changes in word frequencies over time, however, would frustrate extrapolation in clinical and research settings based on the theoretical construct because performance on the BNT might reflect changes in ecological frequency of the test items, rather than performance across items of increasing difficulty. This study identifies the ecological frequency of BNT items at the time of publication using the American Heritage Word Frequency Book and determines changes in frequency over time based on the frequency distribution of BNT items across a current corpus, the Corpus of Contemporary American English. Findings reveal an uneven distribution of BNT items across 2 corpora and instances of negligible differentiation in relative word frequency across test items. As BNT items are not presented in order from least to most frequent, clinicians and researchers should exercise caution in relying on the BNT as presenting items in increasing order of difficulty. A method is proposed for distributing confrontation-naming items to be explicitly measured against test items that are normally distributed across the corpus of a given language.

  18. Text mining for the biocuration workflow.

    PubMed

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  19. Remediation of acid mine drainage with sulfate reducing bacteria

    SciTech Connect

    Hauri, J.F.; Schaider, L.A.

    2009-02-15

    Sulfate reducing bacteria have been shown to be effective at treating acid mine drainage through sulfide production and subsequent precipitation of metal sulfides. In this laboratory experiment for undergraduate environmental chemistry courses, students design and implement a set of bioreactors to remediate acid mine drainage and explain observed changes in dissolved metal concentrations and pH. Using synthetic acid mine drainage and combinations of inputs, students monitor their bioreactors for decreases in dissolved copper and iron concentrations.

  20. Equal Area Logistic Estimation for Item Response Theory

    NASA Astrophysics Data System (ADS)

    Lo, Shih-Ching; Wang, Kuo-Chang; Chang, Hsin-Li

    2009-08-01

    Item response theory (IRT) models use logistic functions exclusively as item response functions (IRFs). Applications of IRT models require obtaining the set of values for logistic function parameters that best fit an empirical data set. However, success in obtaining such set of values does not guarantee that the constructs they represent actually exist, for the adequacy of a model is not sustained by the possibility of estimating parameters. In this study, an equal area based two-parameter logistic model estimation algorithm is proposed. Two theorems are given to prove that the results of the algorithm are equivalent to the results of fitting data by logistic model. Numerical results are presented to show the stability and accuracy of the algorithm.