Science.gov

Sample records for item set mining

  1. Statistical evaluation of synchronous spike patterns extracted by frequent item set mining

    PubMed Central

    Torre, Emiliano; Picado-Muiño, David; Denker, Michael; Borgelt, Christian; Grün, Sonja

    2013-01-01

    We recently proposed frequent itemset mining (FIM) as a method to perform an optimized search for patterns of synchronous spikes (item sets) in massively parallel spike trains. This search outputs the occurrence count (support) of individual patterns that are not trivially explained by the counts of any superset (closed frequent item sets). The number of patterns found by FIM makes direct statistical tests infeasible due to severe multiple testing. To overcome this issue, we proposed to test the significance not of individual patterns, but instead of their signatures, defined as the pairs of pattern size z and support c. Here, we derive in detail a statistical test for the significance of the signatures under the null hypothesis of full independence (pattern spectrum filtering, PSF) by means of surrogate data. As a result, injected spike patterns that mimic assembly activity are well detected, yielding a low false negative rate. However, this approach is prone to additionally classify patterns resulting from chance overlap of real assembly activity and background spiking as significant. These patterns represent false positives with respect to the null hypothesis of having one assembly of given signature embedded in otherwise independent spiking activity. We propose the additional method of pattern set reduction (PSR) to remove these false positives by conditional filtering. By employing stochastic simulations of parallel spike trains with correlated activity in form of injected spike synchrony in subsets of the neurons, we demonstrate for a range of parameter settings that the analysis scheme composed of FIM, PSF and PSR allows to reliably detect active assemblies in massively parallel spike trains. PMID:24167487

  2. Using Frequent Item Set Mining and Feature Selection Methods to Identify Interacted Risk Factors - The Atrial Fibrillation Case Study.

    PubMed

    Li, Xiang; Liu, Haifeng; Du, Xin; Hu, Gang; Xie, Guotong; Zhang, Ping

    2016-01-01

    Disease risk prediction is highly important for early intervention and treatment, and identification of predictive risk factors is the key point to achieve accurate prediction. In addition to original independent features in a dataset, some interacted features, such as comorbidities and combination therapies, may have non-additive influence on the disease outcome and can also be used in risk prediction to improve the prediction performance. However, it is usually difficult to manually identify the possible interacted risk factors due to the combination explosion of features. In this paper, we propose an automatic approach to identify predictive risk factors with interactions using frequent item set mining and feature selection methods. The proposed approach was applied in the real world case study of predicting ischemic stroke and thromboembolism for atrial fibrillation patients on the Chinese atrial fibrillation registry dataset, and the results show that our approach can not only improve the prediction performance, but also identify the comorbidities and combination therapies that have potential influences on TE occurrence for AF. PMID:27577446

  3. Optimal Assembly of Tests with Item Sets.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2000-01-01

    Presents six computational methods based on mixed-integer programming for assembling tests from a bank with an item-set structure and evaluated these methods using mathematical programming feasibiity and expected solution times. Illustrates these methods with two data sets from the Law School Admission Test. Discusses the best approximations to…

  4. Efficient Algorithms for Segmentation of Item-Set Time Series

    NASA Astrophysics Data System (ADS)

    Chundi, Parvathi; Rosenkrantz, Daniel J.

    We propose a special type of time series, which we call an item-set time series, to facilitate the temporal analysis of software version histories, email logs, stock market data, etc. In an item-set time series, each observed data value is a set of discrete items. We formalize the concept of an item-set time series and present efficient algorithms for segmenting a given item-set time series. Segmentation of a time series partitions the time series into a sequence of segments where each segment is constructed by combining consecutive time points of the time series. Each segment is associated with an item set that is computed from the item sets of the time points in that segment, using a function which we call a measure function. We then define a concept called the segment difference, which measures the difference between the item set of a segment and the item sets of the time points in that segment. The segment difference values are required to construct an optimal segmentation of the time series. We describe novel and efficient algorithms to compute segment difference values for each of the measure functions described in the paper. We outline a dynamic programming based scheme to construct an optimal segmentation of the given item-set time series. We use the item-set time series segmentation techniques to analyze the temporal content of three different data sets—Enron email, stock market data, and a synthetic data set. The experimental results show that an optimal segmentation of item-set time series data captures much more temporal content than a segmentation constructed based on the number of time points in each segment, without examining the item set data at the time points, and can be used to analyze different types of temporal data.

  5. Effects of Decreasing the Number of Common Items in Equating Link Item Sets.

    ERIC Educational Resources Information Center

    Hanick, Patricia L.; Huang, Chi-Yu

    The term "equating" refers to a statistical procedure that adjusts test scores on different forms of the same examination so that scores can be interpreted interchangeably. This study examines the impact of equating with fewer items than originally planned when items have been removed from the equating set for a variety of reasons. A real data set…

  6. Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets.

    ERIC Educational Resources Information Center

    Srinivasan, Padmini; Ruiz, Miguel E.; Kraft, Donald H.; Chen, Jianhua

    2001-01-01

    Explains vocabulary mining in information retrieval and describes a framework for vocabulary mining that allows the use of rough set-based approximations even when documents and queries are described using weighted, or fuzzy, representations. Examines coordination between multiple vocabulary views and applies the framework to the Unified Medical…

  7. Application of Item Response Theory in Standard Setting.

    ERIC Educational Resources Information Center

    Guan, Toh Poh; Gopal, Saminanthan

    1989-01-01

    A design for test equating based on Item Response Theory is proposed and illustrated through a simulation study of about 1,000 primary students taking two examinations. This test-equating method allows decision makers in a standard setting exercise access to high quality information. (SLD)

  8. Non-Derivable Item Set and Non-Derivable Literal Set Representations of Patterns Admitting Negation

    NASA Astrophysics Data System (ADS)

    Kryszkiewicz, Marzena

    The discovery of frequent patterns has attracted a lot of attention of the data mining community. While an extensive research has been carried out for discovering positive patterns, little has been offered for discovering patterns with negation. The main hindrance to the progress of such research is huge amount of frequent patterns with negation, which exceeds the number of frequent positive patterns by orders of magnitude. In this paper, we examine properties of derivable and non-derivable patterns, including those with negated items. In particular, we establish important relationships among patterns admitting negation that have the same canonical variant. By analogy to frequent non-derivable itemsets, which constitute a concise lossless representation NDR of frequent positive patterns, we introduce frequent non-derivable literal sets lossless representation NDRL of frequent positive patterns admitting negation. Then we use the derived properties of literal sets to offer a concise representation NDIR of frequent patterns admitting negation that is built only from positive non-derivable itemsets. The relationships between the three representations are identified. The transformation of the new representations into not less concise lossless closure representations is discussed.

  9. "Sample-Independent" Item Parameters? An Investigation of the Stability of IRT Item Parameters Estimated from Small Data Sets.

    ERIC Educational Resources Information Center

    Sireci, Stephen G.

    Whether item response theory (IRT) is useful to the small-scale testing practitioner is examined. The stability of IRT item parameters is evaluated with respect to the classical item parameters (i.e., p-values, biserials) obtained from the same data set. Previous research investigating the effect of sample size on IRT parameter estimation has…

  10. Confirmatory Factor Analysis of the M5-50: An Implementation of the International Personality Item Pool Item Set

    ERIC Educational Resources Information Center

    Socha, Alan; Cooper, Christopher A.; McCord, David M.

    2010-01-01

    Goldberg's International Personality Item Pool (IPIP; Goldberg, 1999) provides researchers with public-domain, free-access personality measurement scales that are proxies of well-established published scales. One of the more commonly used IPIP sets employs 50 items to measure the 5 broad domains of the 5-factor model, with 10 items per factor. The…

  11. Double Mine Building, general view in setting; view northeast ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    Double Mine Building, general view in setting; view northeast - Fort McKinley, Double Mine Building, East side of East Side Drive, approximately 125 feet south of Weymouth Way, Great Diamond Island, Portland, Cumberland County, ME

  12. Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

    DOEpatents

    Meyers, Charles E.; Davidson, George S.; Johnson, David K.; Hendrickson, Bruce A.; Wylie, Brian N.

    1999-01-01

    A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.

  13. Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

    SciTech Connect

    Meyers, C.E.; Davidson, G.S.; Johnson, D.K.; Hendrickson, B.A.; Wylie, B.N.

    1999-11-16

    A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.

  14. Data Mining Using Extensions of the Rough Set Model.

    ERIC Educational Resources Information Center

    Lingras, P. J.; Yao, Y. Y.

    1998-01-01

    Examines basic issues of data mining using the theory of rough sets, a recent proposal for generalizing classical set theory. Demonstrates that a generalized rough set model can be used for generating rules from incomplete databases. Discusses the importance of rule extraction from incomplete databases in data mining. (AEF)

  15. Matching the Judgmental Task with Standard Setting Panelist Expertise: The Item-Descriptor (ID) Matching Method

    ERIC Educational Resources Information Center

    Ferrara, Steve; Perie, Marianne; Johnson, Eugene

    2008-01-01

    Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…

  16. Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

    ERIC Educational Resources Information Center

    Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S.

    2014-01-01

    This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

  17. Application of rough set for medical images data mining

    NASA Astrophysics Data System (ADS)

    Wang, Shuyan; Wang, Chunmei; Chen, Yan

    2010-08-01

    To study the application of Rough set algorithm for diagnosis breast cancer, attribute reduction strategies of rough set are applied to the data mining of the mammography classification, proposes a medical images classifier based on association rules. Attribute reduction strategies of rough set for medical image data mining are realized. The experiment results are given. The experimental results show that the system performs well in accuracy, verified the great potential of rough set in assistant medical treatment.

  18. Identifying an Efficient Set of Items Sensitive to Clinical-Range Externalizing Problems in Children

    PubMed Central

    Petersen, Isaac T.; Bates, John E.; Dodge, Kenneth A.; Lansford, Jennifer E.; Pettit, Gregory S.

    2015-01-01

    The present study applied item response theory to identify an efficient set of items of the Achenbach Externalizing scale from the Child Behavior Checklist (CBCL; 33 items) and Teacher's Report Form (TRF; 35 items) that were sensitive to clinical-range scores. Mothers and teachers rated children's externalizing problems annually from ages 5 to 13 years in two independent samples (Ns = 585 and 1,199). Item properties for each rater across ages 5–8 and 9–13 were examined with item response theory. We identified 10 mother- and teacher-reported items from both samples based on the items' measurement precision for sub-clinical and clinical levels of externalizing problems: externalizing problems that involve meanness to others, destroying others' things, fighting, lying and cheating, attacking people, screaming, swearing/obscene language, temper tantrums, threatening people, and being loud. Scores on the scales using these items had strong reliability and psychometric properties, capturing nearly as much information as the full Externalizing scale for classifying clinical levels of externalizing problems. Scores on the scale with the 10 CBCL items had moderate accuracy, equivalent to the full Externalizing scale, in classifying diagnoses of conduct disorder based on a research diagnostic interview. Of course, comprehensive clinical assessment would consider additional items, dimensions of behavior, and sources of information, too, but it appears that the behaviors tapped by this select set of items may be core to externalizing psychopathology in children. PMID:26322800

  19. Identifying an efficient set of items sensitive to clinical-range externalizing problems in children.

    PubMed

    Petersen, Isaac T; Bates, John E; Dodge, Kenneth A; Lansford, Jennifer E; Pettit, Gregory S

    2016-05-01

    The present study applied item response theory to identify an efficient set of items of the Achenbach Externalizing scale from the Child Behavior Checklist (CBCL; 33 items) and Teacher's Report Form (TRF; 35 items) that were sensitive to clinical-range scores. Mothers and teachers rated children's externalizing problems annually from ages 5 to 13 years in 2 independent samples (Ns = 585 and 1,199). Item properties for each rater across ages 5-8 and 9-13 were examined with item response theory. We identified 10 mother- and teacher-reported items from both samples based on the items' measurement precision for subclinical and clinical levels of externalizing problems: externalizing problems that involve meanness to others, destroying others' things, fighting, lying and cheating, attacking people, screaming, swearing/obscene language, temper tantrums, threatening people, and being loud. Scores on the scales using these items had strong reliability and psychometric properties, capturing nearly as much information as the full Externalizing scale for classifying clinical levels of externalizing problems. Scores on the scale with the 10 CBCL items had moderate accuracy, equivalent to the full Externalizing scale, in classifying diagnoses of conduct disorder based on a research diagnostic interview. Of course, comprehensive clinical assessment would consider additional items, dimensions of behavior, and sources of information, too, but it appears that the behaviors tapped by this select set of items may be core to externalizing psychopathology in children. (PsycINFO Database Record PMID:26322800

  20. Psychometric aspects of item mapping for criterion-referenced interpretation and bookmark standard setting.

    PubMed

    Huynh, Huynh

    2010-01-01

    Locating an item on an achievement continuum (item mapping) is well-established in technical work for educational/psychological assessment. Applications of item mapping may be found in criterion-referenced (CR) testing (or scale anchoring, Beaton and Allen, 1992; Huynh, 1994, 1998a, 2000a, 2000b, 2006), computer-assisted testing, test form assembly, and in standard setting methods based on ordered test booklets. These methods include the bookmark standard setting originally used for the CTB/TerraNova tests (Lewis, Mitzel, Green, and Patz, 1999), the item descriptor process (Ferrara, Perie, and Johnson, 2002) and a similar process described by Wang (2003) for multiple-choice licensure and certification examinations. While item response theory (IRT) models such as the Rasch and two-parameter logistic (2PL) models traditionally place a binary item at its location, Huynh has argued in the cited papers that such mapping may not be appropriate in selecting items for CR interpretation and scale anchoring. PMID:20351450

  1. Physics Mining of Multi-Source Data Sets

    NASA Technical Reports Server (NTRS)

    Helly, John; Karimabadi, Homa; Sipes, Tamara

    2012-01-01

    Powerful new parallel data mining algorithms can produce diagnostic and prognostic numerical models and analyses from observational data. These techniques yield higher-resolution measures than ever before of environmental parameters by fusing synoptic imagery and time-series measurements. These techniques are general and relevant to observational data, including raster, vector, and scalar, and can be applied in all Earth- and environmental science domains. Because they can be highly automated and are parallel, they scale to large spatial domains and are well suited to change and gap detection. This makes it possible to analyze spatial and temporal gaps in information, and facilitates within-mission replanning to optimize the allocation of observational resources. The basis of the innovation is the extension of a recently developed set of algorithms packaged into MineTool to multi-variate time-series data. MineTool is unique in that it automates the various steps of the data mining process, thus making it amenable to autonomous analysis of large data sets. Unlike techniques such as Artificial Neural Nets, which yield a blackbox solution, MineTool's outcome is always an analytical model in parametric form that expresses the output in terms of the input variables. This has the advantage that the derived equation can then be used to gain insight into the physical relevance and relative importance of the parameters and coefficients in the model. This is referred to as physics-mining of data. The capabilities of MineTool are extended to include both supervised and unsupervised algorithms, handle multi-type data sets, and parallelize it.

  2. Optimal Assembly of Tests with Item Sets. Research Report 98-12.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    Six methods for assembling tests from a pool with an item-set structure are presented. All methods are computational and based on the technique of mixed integer programming. The methods are evaluated using such criteria as the feasibility of their linear programming problems and their expected solution times. The methods are illustrated for two…

  3. Intelligent Archiving and Physics Mining of Large Data Sets (Invited)

    NASA Astrophysics Data System (ADS)

    Karimabadi, H.

    2009-12-01

    There are unique challenges in all aspects related to large data sets, from storage, search and access, to analysis and file sharing. With few exceptions, the adoption of the latest technologies to deal with the management and mining of large data sets has been slow in heliosciences. Web services such as CDAweb have been very successful and have been widely adopted by the community. There are also significant efforts going towards Virtual Observatories (VxOs). The main thrust of VxOs has so far been on data discovery, aggregation and uniform presentation. While work remains, many VxOs can now be used to access data. However data is not knowledge and the challenge of extracting physics from the large data sets remains. Here we review our efforts on (i) implementing advanced data mining techniques as part of the data-to-knowledge discovery pipeline, and (ii) use of social networking paradigm in the development of a science collaboratory environment that enables sharing of large files, creation of projects, among others. We will present new data mining software that works on a variety of data formats and demonstrate its capability through several examples of analysis of spacecraft data. The use of such techniques in intelligent archiving will be discussed. Finally, the use of our science collaboratory service and its unique sharing features such as universal accessibility of staged files will be illustrated.

  4. Conversion of Proportion-Correct Standard-Setting Judgments to Cutoff Scores on the Item Response Theory Theta Scale

    ERIC Educational Resources Information Center

    Hurtz, Gregory M.; Jones, J. Patrick; Jones, Christian N.

    2008-01-01

    This study compares the efficacy of different strategies for translating item-level, proportion-correct standard-setting judgments into a theta-metric test cutoff score for use with item response theory (IRT) scoring, using Monte Carlo methods. Simulated Angoff-type ratings, consisting of 1,000 independent 75 Item x13 Rater matrices, were…

  5. Using Data Mining to Predict K-12 Students' Performance on Large-Scale Assessment Items Related to Energy

    ERIC Educational Resources Information Center

    Liu, Xiufeng; Ruiz, Miguel E.

    2008-01-01

    This article reports a study on using data mining to predict K-12 students' competence levels on test items related to energy. Data sources are the 1995 Third International Mathematics and Science Study (TIMSS), 1999 TIMSS-Repeat, 2003 Trend in International Mathematics and Science Study (TIMSS), and the National Assessment of Educational…

  6. Method using a density field for locating related items for data mining

    DOEpatents

    Wylie, Brian N.

    2002-01-01

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method makes use of numeric values as a measure of similarity between each pairing of items. The items are given initial coordinates in the space. An energy is then determined for each item from the item's distance and similarity to other items, and from the density of items assigned coordinates near the item. The distance and similarity component can act to draw items with high similarities close together, while the density component can act to force all items apart. If a terminal condition is not yet reached, then new coordinates can be determined for one or more items, and the energy determination repeated. The iteration can terminate, for example, when the total energy reaches a threshold, when each item's energy is below a threshold, after a certain amount of time or iterations.

  7. A Clarification on the Response Probability Criterion RP67 for Standard Settings Based on Bookmark and Item Mapping

    ERIC Educational Resources Information Center

    Huynh, Huynh

    2006-01-01

    By analyzing the Fisher information allotted to the correct response of a Rasch binary item, Huynh (1994) established the response probability criterion 0.67 (RP67) for standard settings based on bookmarks and item mapping. The purpose of this note is to help clarify the conceptual and psychometric framework of the RP criterion.

  8. Discovering Psychological Principles by Mining Naturally Occurring Data Sets.

    PubMed

    Goldstone, Robert L; Lupyan, Gary

    2016-07-01

    The very expertise with which psychologists wield their tools for achieving laboratory control may have had the unwelcome effect of blinding psychologists to the possibilities of discovering principles of behavior without conducting experiments. When creatively interrogated, a diverse range of large, real-world data sets provides powerful diagnostic tools for revealing principles of human judgment, perception, categorization, decision-making, language use, inference, problem solving, and representation. Examples of these data sets include patterns of website links, dictionaries, logs of group interactions, collections of images and image tags, text corpora, history of financial transactions, trends in twitter tag usage and propagation, patents, consumer product sales, performance in high-stakes sporting events, dialect maps, and scientific citations. The goal of this issue is to present some exemplary case studies of mining naturally existing data sets to reveal important principles and phenomena in cognitive science, and to discuss some of the underlying issues involved with conducting traditional experiments, analyses of naturally occurring data, computational modeling, and the synthesis of all three methods. PMID:27404718

  9. Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests

    ERIC Educational Resources Information Center

    van der Linden, Wim J.; Ariel, Adelaide; Veldkamp, Bernard P.

    2006-01-01

    Test-item writing efforts typically results in item pools with an undesirable correlational structure between the content attributes of the items and their statistical information. If such pools are used in computerized adaptive testing (CAT), the algorithm may be forced to select items with less than optimal information, that violate the content…

  10. Method of locating related items in a geometric space for data mining

    DOEpatents

    Hendrickson, B.A.

    1999-07-27

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity. 12 figs.

  11. Method of locating related items in a geometric space for data mining

    DOEpatents

    Hendrickson, Bruce A.

    1999-01-01

    A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity.

  12. Identification of Differential Item Functioning in Multiple-Group Settings: A Multivariate Outlier Detection Approach

    ERIC Educational Resources Information Center

    Magis, David; De Boeck, Paul

    2011-01-01

    We focus on the identification of differential item functioning (DIF) when more than two groups of examinees are considered. We propose to consider items as elements of a multivariate space, where DIF items are outlying elements. Following this approach, the situation of multiple groups is a quite natural case. A robust statistics technique is…

  13. Large-Scale Constraint-Based Pattern Mining

    ERIC Educational Resources Information Center

    Zhu, Feida

    2009-01-01

    We studied the problem of constraint-based pattern mining for three different data formats, item-set, sequence and graph, and focused on mining patterns of large sizes. Colossal patterns in each data formats are studied to discover pruning properties that are useful for direct mining of these patterns. For item-set data, we observed robustness of…

  14. Data mining in big data sets of spectroscopic surveys

    NASA Astrophysics Data System (ADS)

    Luo, A.-Li

    2015-08-01

    More and more spectroscopic surveys provide many big datasets, in which various rare objects could be found. In this presentation, we suggested two different data mining approaches based on spectral feature by detection and machine learning way respectively. The most difficult thing of data mining is try to understand the result, and most of found outliers were finally identified as bad data. Candidates need follow up observations or multi waveband data to be confirmed. Some successful examples of newly discovered rare objects in SDSS and LAMOST data release are presented in this talk.

  15. Set of Criteria for Efficiency of the Process Forming the Answers to Multiple-Choice Test Items

    ERIC Educational Resources Information Center

    Rybanov, Alexander Aleksandrovich

    2013-01-01

    Is offered the set of criteria for assessing efficiency of the process forming the answers to multiple-choice test items. To increase accuracy of computer-assisted testing results, it is suggested to assess dynamics of the process of forming the final answer using the following factors: loss of time factor and correct choice factor. The model…

  16. Evidence for a Global Sampling Process in Extraction of Summary Statistics of Item Sizes in a Set

    PubMed Central

    Tokita, Midori; Ueda, Sachiyo; Ishiguchi, Akira

    2016-01-01

    Several studies have shown that our visual system may construct a “summary statistical representation” over groups of visual objects. Although there is a general understanding that human observers can accurately represent sets of a variety of features, many questions on how summary statistics, such as an average, are computed remain unanswered. This study investigated sampling properties of visual information used by human observers to extract two types of summary statistics of item sets, average and variance. We presented three models of ideal observers to extract the summary statistics: a global sampling model without sampling noise, global sampling model with sampling noise, and limited sampling model. We compared the performance of an ideal observer of each model with that of human observers using statistical efficiency analysis. Results suggest that summary statistics of items in a set may be computed without representing individual items, which makes it possible to discard the limited sampling account. Moreover, the extraction of summary statistics may not necessarily require the representation of individual objects with focused attention when the sets of items are larger than 4. PMID:27242622

  17. Metals and metalloids in nestling tree swallows and their dietary items near oilsands mine operations in Northern Alberta.

    PubMed

    Godwin, Christine M; Smits, Judit E G; Barclay, Robert M R

    2016-08-15

    Tree swallows (Tachycineta bicolor) nesting near oilsands development in northern Alberta are potentially exposed to elevated levels of metals. The objective of this study was to determine whether levels of metals and metalloid elements in dietary items and tissues of nestling tree swallows inhabiting areas near oilsands mine operations were higher compared to those of reference sites. We hypothesized that if there was increased, industry-related exposure to metals, it would be via the diet. We identified the invertebrate prey in the stomach contents of nestlings. We also collected invertebrates using Malaise traps near nest boxes, and analyzed those taxa found in the nestling diet to understand potential variability in metal exposure. For most elements, we found no significant differences in concentrations in the liver, kidney, or stomach contents between sites near to and far from oilsands operations. Concentrations of five elements were positively correlated among tissues and stomach contents. For invertebrates collected from Malaise traps, location differences occurred in some absolute elemental concentrations, which were most often highest at reference sites away from mining operations. We found no evidence that nestling tree swallows accumulated metals approaching toxic levels. Tree swallows consumed relatively high quantities of terrestrial insects, possibly limiting exposure to water borne, food-web-related contaminants. We suggest that annual variability associated with elemental exposure and dietary levels of elements be considered when interpreting concentrations in bird tissues. PMID:27110982

  18. Improving the Scalability of an Exact Approach for Frequent Item Set Hiding

    ERIC Educational Resources Information Center

    LaMacchia, Carolyn

    2013-01-01

    Technological advances have led to the generation of large databases of organizational data recognized as an information-rich, strategic asset for internal analysis and sharing with trading partners. Data mining techniques can discover patterns in large databases including relationships considered strategically relevant to the owner of the data.…

  19. Mining knowledge in One Night Stands data sets

    NASA Astrophysics Data System (ADS)

    Sansaturio, M. E.; Arratia, O.

    2012-02-01

    In this paper, we set up a procedure that aims at improving the orbits of the known near-Earth asteroids (NEAs) by making use of the One Night Stands (ONS) file distributed by the Minor Planet Center. The identification algorithm employed is of the observation attribution type and takes into account the higher apparent motion exhibited by this kind of asteroids. The application of this algorithm together with the ONS file is not straightforward, as in practice such a file presents several problems that must be managed prior to the massive treatment of the information contained in it. We include a three-step protocol to handle these drawbacks. Finally, we present a detailed analysis of the results obtained with this procedure, the main conclusion being that more than 80 NEA orbits have been improved.

  20. Mining temporal data sets: hypoplastic left heart syndrome case study

    NASA Astrophysics Data System (ADS)

    Kusiak, Andrew; Caldarone, Christopher A.; Kelleher, Michael D.; Lamb, Fred S.; Persoon, Thomas J.; Gan, Yuan; Burns, Alex

    2003-03-01

    Hypoplastic left heart syndrome (HLHS) affects infants and is uniformly fatal without surgery. Post-surgery mortality rates are highly variable and dependent on postoperative management. The high mortality after the first stage surgery usually occurs within the first few days after procedure. Typically, the deaths are attributed to the unstable balance between the pulmonary and systemic circulations. An experienced team of physicians, nurses, and therapists is required to successfully manage the infant. However, even the most experienced teams report significant mortality due to the extremely complex relationships among physiologic parameters in a given patient. A data acquisition system was developed for the simultaneous collection of 73 physiologic, laboratory, and nurse-assessed variables. Data records were created at intervals of 30 seconds. An expert-validated wellness score was computed for each data record. A training data set consisting of over 5000 data records from multiple patients was collected. Preliminary results demonstratd that the knowledge discovery approach was over 94.57% accurate in predicting the "wellness score" of an infant. The discovered knowledge can improve care of complex patients by development of an intelligent simulator that can be used to support decisions.

  1. Development and Validation of the Standard Chinese Version of the CARE Item Set (CARE-C) for Stroke Patients

    PubMed Central

    Chang, Ke-Vin; Hung, Chen-Yu; Kao, Chien-Wei; Tan, Fuk-Tang; Gage, Barbara; Hsieh, Ching-Lin; Wang, Tyng-Guey; Han, Der-Sheng

    2015-01-01

    Abstract The Continuity Assessment Record and Evaluation (CARE) item set is a standardized, integrative scale for evaluation of functional status across acute and postacute care (PAC) providers. The aim of this study was to develop a Chinese version of the CARE (CARE-C) item set and to examine its reliability and validity for assessment of functional outcomes among stroke patients. The CARE-C was administered in two samples. Sample 1 included 30 stroke patients in the outpatient clinic setting for the purpose of examining interrater and test–retest reliabilities and internal consistency. Sample 2 included 138 stroke patients admitted to rehabilitation units for the purpose of investigating criterion-related validity with the Barthel index, Lawton Instrumental Activities of Daily Living (IADL) scale, EuroQOL five dimensions questionnaire (EQ-5D), and Mini-Mental State Examination (MMSE). The CARE-C was categorized into 11 subscales, 52 items of which were analyzed. At the subscale level, the interrater reliability and test–retest reliability expressed by intraclass correlation coefficient (ICC) ranged from 0.72 to 0.99 and 0.60 to 1.00, respectively. Six of the 11 subscales met acceptable levels of internal consistency (Cronbach alpha > 0.7). The criterion-related validity of the CARE-C showed moderate to high correlations of its subscales of cognition and basic and instrumental activities of daily living with the Barthel index, IADL scale, and MMSE. The CARE-C is a useful instrument for evaluating functional quality metrics in the Chinese stroke population. The development of the CARE-C also facilitates the assessment of the PAC program in Taiwan and future research is warranted for validating the capability of CARE-C to identify patients’ functional change over time and its generalizability for nonstroke populations. PMID:26496322

  2. Testing whether the DSM-5 personality disorder trait model can be measured with a reduced set of items: An item response theory investigation of the Personality Inventory for DSM-5.

    PubMed

    Maples, Jessica L; Carter, Nathan T; Few, Lauren R; Crego, Cristina; Gore, Whitney L; Samuel, Douglas B; Williamson, Rachel L; Lynam, Donald R; Widiger, Thomas A; Markon, Kristian E; Krueger, Robert F; Miller, Joshua D

    2015-12-01

    The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) includes an alternative model of personality disorders (PDs) in Section III, consisting in part of a pathological personality trait model. To date, the 220-item Personality Inventory for DSM-5 (PID-5; Krueger, Derringer, Markon, Watson, & Skodol, 2012) is the only extant self-report instrument explicitly developed to measure this pathological trait model. The present study used item response theory-based analyses in a large sample (n = 1,417) to investigate whether a reduced set of 100 items could be identified from the PID-5 that could measure the 25 traits and 5 domains. This reduced set of PID-5 items was then tested in a community sample of adults currently receiving psychological treatment (n = 109). Across a wide range of criterion variables including NEO PI-R domains and facets, DSM-5 Section II PD scores, and externalizing and internalizing outcomes, the correlational profiles of the original and reduced versions of the PID-5 were nearly identical (rICC = .995). These results provide strong support for the hypothesis that an abbreviated set of PID-5 items can be used to reliably, validly, and efficiently assess these personality disorder traits. The ability to assess the DSM-5 Section III traits using only 100 items has important implications in that it suggests these traits could still be measured in settings in which assessment-related resources (e.g., time, compensation) are limited. PMID:25844534

  3. Keeping it simple: studying grammatical encoding with lexically reduced item sets.

    PubMed

    Veenstra, Alma; Acheson, Daniel J; Meyer, Antje S

    2014-01-01

    Compared to the large body of work on lexical access, little research has been done on grammatical encoding in language production. An exception is the generation of subject-verb agreement. Here, two key findings have been reported: (1) speakers make more agreement errors when the head and local noun of a phrase mismatch in number than when they match [e.g., the key to the cabinet(s)]; and (2) this attraction effect is asymmetric, with stronger attraction for singular than for plural head nouns. Although these findings are robust, the cognitive processes leading to agreement errors and their significance for the generation of correct agreement are not fully understood. We propose that future studies of agreement, and grammatical encoding in general, may benefit from using paradigms that tightly control the variability of the lexical content of the material. We report two experiments illustrating this approach. In both of them, the experimental items featured combinations of four nouns, four color adjectives, and two prepositions. In Experiment 1, native speakers of Dutch described pictures in sentences such as the circle next to the stars is blue. In Experiment 2, they carried out a forced-choice task, where they read subject noun phrases (e.g., the circle next to the stars) and selected the correct verb-phrase (is blue or are blue) with a button press. Both experiments showed an attraction effect, with more errors after subject phrases with mismatching, compared to matching head and local nouns. This effect was stronger for singular than plural heads, replicating the attraction asymmetry. In contrast, the response times recorded in Experiment 2 showed similar attraction effects for singular and plural head nouns. These results demonstrate that critical agreement phenomena can be elicited reliably in lexically reduced contexts. We discuss the theoretical implications of the findings and the potential and limitations of studies using lexically simple materials. PMID

  4. Evaluating the Bookmark Standard Setting Method: The Impact of Random Item Ordering

    ERIC Educational Resources Information Center

    Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack

    2011-01-01

    Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…

  5. Diesel exhaust and coal mine dust: lung cancer risk in occupational settings

    SciTech Connect

    Hoffmann, B.; Jockel, K.H.

    2006-09-15

    Conflicting evidence on the carcinogenicity of diesel exhaust (DE) and coal mine dust in occupational settings exist. Exposure measurement in most studies is inferred on the basis of job classifications and may lead to misclassification. Confounding behavioral factors (i.e., smoking) and occupational risk factors (exposure to asbestos, arsenic, radon) need to be considered. We evaluated the epidemiological evidence and current findings of the carcinogenicity of DE and coal mine dust in occupational settings. Pertaining literature was identified through Medline search and recent review articles. Strengths and limitations of recent approaches are discussed. Many epidemiological studies have addressed the question of carcinogenicity in workers exposed to DE, and most showed a low-to-medium increase in the risk of bronchial carcinoma. The pooled relative risk (RR) estimates lie between 1.33 and 1.47, and a consistent rise in risk across various job categories and study designs point to a causal relationship. Data on the carcinogenicity of coal mine dust are less consistent and the potential for confounding by unmeasured risk factors (arsenic, radon, DE) are higher. While silica as one of its components has been evaluated as carcinogenic, there is inadequate evidence for the carcinogenicity of pure coal dust according to the International Agency for Research on Cancer (IARC). There is sufficient evidence for a causal relationship between DE and lung cancer in occupational settings. The evidence for coal mine dust is less convincing, but individual studies show an increase in risk of lung cancer in exposed workers. 4 refs.

  6. Knowledge Mining from Clinical Datasets Using Rough Sets and Backpropagation Neural Network

    PubMed Central

    Nahato, Kindie Biredagn; Harichandran, Khanna Nehemiah; Arputharaj, Kannan

    2015-01-01

    The availability of clinical datasets and knowledge mining methodologies encourages the researchers to pursue research in extracting knowledge from clinical datasets. Different data mining techniques have been used for mining rules, and mathematical models have been developed to assist the clinician in decision making. The objective of this research is to build a classifier that will predict the presence or absence of a disease by learning from the minimal set of attributes that has been extracted from the clinical dataset. In this work rough set indiscernibility relation method with backpropagation neural network (RS-BPNN) is used. This work has two stages. The first stage is handling of missing values to obtain a smooth data set and selection of appropriate attributes from the clinical dataset by indiscernibility relation method. The second stage is classification using backpropagation neural network on the selected reducts of the dataset. The classifier has been tested with hepatitis, Wisconsin breast cancer, and Statlog heart disease datasets obtained from the University of California at Irvine (UCI) machine learning repository. The accuracy obtained from the proposed method is 97.3%, 98.6%, and 90.4% for hepatitis, breast cancer, and heart disease, respectively. The proposed system provides an effective classification model for clinical datasets. PMID:25821508

  7. Acknowledging How Older Australian Women Experience Life After Stroke: How Does the WHO 18-Item Brief ICF Core Set for Stroke Compare?

    PubMed

    Tavener, Meredith; Thijsen, Amanda; Hubbard, Isobel J; Francis, J Lynn; Grennall, Claire; Levi, Christopher; Byles, Julie

    2015-01-01

    We examined older women's qualitative experiences of stroke with the World Health Organization's 18-item Brief International Classification of Functioning, Disability and Health Core Set for Stroke. Women were participants of the Australian Longitudinal Study on Women's Health, born between 1921 and 1926, who had experienced a stroke in the previous 3 years. An inductive thematic analysis was conducted of women's qualitative experiences of stroke, which were then examined with the 18-item Brief Core Set for Stroke for congruency. Our analysis showed that for older Australian women, their concerns of poststroke living were not adequately classified, potentially impeding a full recovery. PMID:26042797

  8. Setting standards for severity of common symptoms in oncology using the PROMIS item banks and expert judgment

    PubMed Central

    Choi, Seung; Garcia, Sofia; Cook, Karon F.; Rosenbloom, Sarah; Lai, Jin-Shei; Tatum, Donna Surges; Gershon, Richard

    2016-01-01

    Background Although the use of patient-reported outcome measures (PROs) has increased markedly, clinical interpretation of scores remains lacking. We developed a method to identify clinical severity thresholds for pain, fatigue, depression, and anxiety in people with cancer. Methods Using available Patient-Reported Outcomes Measurement Information System (PROMIS) item bank response data collected on 840 cancer patients, symptom vignettes across a range of symptom severity were developed and placed on index cards. Cards represented symptom severity at five-point intervals differences on the T score metric [mean = 50; standard deviation (SD) = 10]. Symptom vignettes for each symptom were anchored on these standardized scores at 0.5 SD increments across the full range of severity. Clinical experts, blind to the PROMIS score associated with each vignette, rank-ordered the vignettes by severity, then arrived at consensus regarding which two vignettes were at the upper and lower boundaries of normal and mildly symptomatic for each symptom. The procedure was repeated to identify cut scores separating mildly from moderately symptomatic, and moderately from severely symptomatic scores. Clinician severity rankings were then compared to the T scores upon which the vignettes were based. Results For each of the targeted PROs, the severity rankings reached by clinician consensus perfectly matched the numerical rankings of their associated T scores. Across all symptoms, the thresholds (cut scores) identified to differentiate normal from mildly symptomatic were near a T score of 50. Cut scores differentiating mildly from moderately symptomatic were at or near 60, and those separating moderately from severely symptomatic were at or near 70. Conclusions The study results provide empirically generated PROMIS T score thresholds that differentiate levels of symptom severity for pain interference, fatigue, anxiety, and depression. The convergence of clinical judgment with self

  9. The flexibility of context-specific control: evidence for context-driven generalization of item-specific control settings.

    PubMed

    Crump, Matthew J C; Milliken, Bruce

    2009-08-01

    In two experiments we address an ongoing debate concerning the processes driving context-driven modulations to the Stroop effect (Crump, Gong, & Milliken, 2006). In particular, we demonstrate that context-driven processes can modulate the size of the Stroop effect for frequency-unbiased item types. We also clarify the role of item frequency in producing context-driven modulations to the Stroop effect. Taken together, our results provide unambiguous support for the claim that contextual processing can impart fast and flexible control over the operation of selective attention processes during online performance. PMID:19370487

  10. Data mining and visualization techniques

    DOEpatents

    Wong, Pak Chung; Whitney, Paul; Thomas, Jim

    2004-03-23

    Disclosed are association rule identification and visualization methods, systems, and apparatus. An association rule in data mining is an implication of the form X.fwdarw.Y where X is a set of antecedent items and Y is the consequent item. A unique visualization technique that provides multiple antecedent, consequent, confidence, and support information is disclosed to facilitate better presentation of large quantities of complex association rules.

  11. Characterizing Ground-Water Flow Paths in High-Altitude Fractured Rock Settings Impacted by Mining Activities

    NASA Astrophysics Data System (ADS)

    Wireman, M.; Williams, D.

    2003-12-01

    The Rocky Mountains of the western USA have tens of thousands of abandoned, inactive and active precious-metal(gold,silver,copper)mine sites. Most of these sites occur in fractured rock hydrogeologic settings. Mining activities often resulted in mobilization and transport of associated heavy metals (zinc,cadmium,lead) which pose a significant threat to aquatic communities in mountain streams.Transport of heavy metals from mine related sources (waste rock piles,tailings impoudments,underground workings, mine pits)can occur along numerous hydrological pathways including complex fracture controlled ground-water pathways. Since 1991, the United States Environmental Protection Agency, the Colorado Division of Minerals and Geology and the University of Colorado (INSTAAR)have been conducting applied hydrologic research at the Mary Murphy underground mine. The mine is in the Chalk Creek mining district which is located on the southwestern flanks of the Mount Princeton Batholith, a Tertiary age intrusive comprised primarily of quartz monzonite.The Mount Princeton batholith comprises a large portion of the southern part of the Collegiate Range west of Buena Vista in Chaffee County, CO. Chalk Creek and its 14 tributaries drain about 24,900 hectares of the eastern slopes of the Range including the mining district. Within the mining district, ground-water flow is controlled by the distribution, orientation and permeability of discontinuities within the bedrock. Important discontinuities include faults, joints and weathered zones. Local and intermediate flow systems are perturbed by extensive underground excavations associated with mining (adits, shafts, stopes, drifts,, etc.). During the past 12 years numerous hydrological investigations have been completed. The investigations have been focused on developing tools for characterizing ground-water flow and contaminant transport in the vicinity of hard-rock mines in fractured-rock settings. In addition, the results from these

  12. The Underlying Student Reasons for Rating SET Questionnaire Items: How Student Solve the Problem of Filling Out Questionnaires

    ERIC Educational Resources Information Center

    Kalayci, Nurdan

    2009-01-01

    In order to enhance the quality of teaching in higher education establishments, as "stakeholders" of the institution, the opinions of students related to the teaching they receive are given precedence. This is achieved by using mandatory SET questionnaires on a routine basis in colleges and universities throughout the world. This study reports the…

  13. Item Response Models for Examinee-Selected Items

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Jin, Kuan-Yu; Qiu, Xue-Lan; Wang, Lei

    2012-01-01

    In some tests, examinees are required to choose a fixed number of items from a set of given items to answer. This practice creates a challenge to standard item response models, because more capable examinees may have an advantage by making wiser choices. In this study, we developed a new class of item response models to account for the choice…

  14. Item Banking with Embedded Standards

    ERIC Educational Resources Information Center

    MacCann, Robert G.; Stanley, Gordon

    2009-01-01

    An item banking method that does not use Item Response Theory (IRT) is described. This method provides a comparable grading system across schools that would be suitable for low-stakes testing. It uses the Angoff standard-setting method to obtain item ratings that are stored with each item. An example of such a grading system is given, showing how…

  15. Handling Dynamic Weights in Weighted Frequent Pattern Mining

    NASA Astrophysics Data System (ADS)

    Ahmed, Chowdhury Farhan; Tanbeer, Syed Khairuzzaman; Jeong, Byeong-Soo; Lee, Young-Koo

    Even though weighted frequent pattern (WFP) mining is more effective than traditional frequent pattern mining because it can consider different semantic significances (weights) of items, existing WFP algorithms assume that each item has a fixed weight. But in real world scenarios, the weight (price or significance) of an item can vary with time. Reflecting these changes in item weight is necessary in several mining applications, such as retail market data analysis and web click stream analysis. In this paper, we introduce the concept of a dynamic weight for each item, and propose an algorithm, DWFPM (dynamic weighted frequent pattern mining), that makes use of this concept. Our algorithm can address situations where the weight (price or significance) of an item varies dynamically. It exploits a pattern growth mining technique to avoid the level-wise candidate set generation-and-test methodology. Furthermore, it requires only one database scan, so it is eligible for use in stream data mining. An extensive performance analysis shows that our algorithm is efficient and scalable for WFP mining using dynamic weights.

  16. DEVELOPING AND EXPLOITING A UNIQUE SEISMIC DATA SET FROM SOUTH AFRICAN GOLD MINES FOR SOURCE CHARACTERIZATION AND WAVE PROPAGATION

    SciTech Connect

    Julia, J; Nyblade, A A; Gok, R; Walter, W R; Linzer, L; Durrheim, R

    2008-07-08

    In this project, we are developing and exploiting a unique seismic data set to address the characteristics of small seismic events and the associated seismic signals observed at local (< 200 km) and regional (< 2000 km) distances. The dataset is being developed using mining-induced events from 3 deep gold mines in South Africa recorded on inmine networks (< 1 km) comprised of tens of high-frequency sensors, a network of 4 broadband stations installed as part of this project at the surface around the mines (1-10 km), and a network of existing broadband seismic stations at local/regional distances (50-1000 km) from the mines. After 1 year of seismic monitoring of mine activity (2007), over 10,000 events in the range -3.4 < ML < 4.4 have been catalogued and recorded by the in-mine networks. Events with positive magnitudes are generally well recorded by the surface-mine stations, while magnitudes 3.0 and larger are seen at regional distances (up to {approx}600 km) in high-pass filtered recordings. We have analyzed in-mine recordings in detail at one of the South African mines (Savuka) to (i) improve on reported hypocentral locations, (ii) verify sensor orientations, and (iii) determine full moment tensor solutions. Hypocentral relocations on all catalogued events have been obtained from P- and S-wave travel-times reported by the mine network operator through an automated procedure that selects travel-times falling on Wadati lines with slopes in the 0.6-0.7 range; sensor orientations have been verified and, when possible, corrected by correlating P-, SV-, and SH-waveforms obtained from theoretical and empirical (polarization filter) rotation angles; full moment tensor solutions have been obtained by inverting P-, SV-, and SH- spectral amplitudes measured on the theoretically rotated waveforms with visually assigned polarities. The relocation procedure has revealed that origin times often necessitate a negative correction of a few tenths of second and that hypocentral

  17. Chimpanzees remember the results of one-by-one addition of food items to sets over extended time periods.

    PubMed

    Beran, Michael J; Beran, Mary M

    2004-02-01

    Four chimpanzees were highly accurate in selecting the larger of two concurrent accumulations of bananas in two opaque containers over a span of 20 min. One at a time, bananas were placed into the containers, which were outside the chimpanzees' cages. The chimpanzees never saw more than one banana at a time, and there were no cues indicating the locations of the bananas after they were placed into the containers. The performance of these animals matched that of human infants and young children in similar tests. The chimpanzees were successful even when the sets to be compared were sufficiently large (5 vs. 8, 5 vs. 10, and 6 vs. 10) to cast doubt on the possibility that the chimpanzees were using an object file mechanism. These chimpanzees are the first nonhuman animals to demonstrate extended memory for accumulated quantity. PMID:14738515

  18. A Bayesian method to mine spatial data sets to evaluate the vulnerability of human beings to catastrophic risk.

    PubMed

    Li, Lianfa; Wang, Jinfeng; Leung, Hareton; Zhao, Sisi

    2012-06-01

    Vulnerability of human beings exposed to a catastrophic disaster is affected by multiple factors that include hazard intensity, environment, and individual characteristics. The traditional approach to vulnerability assessment, based on the aggregate-area method and unsupervised learning, cannot incorporate spatial information; thus, vulnerability can be only roughly assessed. In this article, we propose Bayesian network (BN) and spatial analysis techniques to mine spatial data sets to evaluate the vulnerability of human beings. In our approach, spatial analysis is leveraged to preprocess the data; for example, kernel density analysis (KDA) and accumulative road cost surface modeling (ARCSM) are employed to quantify the influence of geofeatures on vulnerability and relate such influence to spatial distance. The knowledge- and data-based BN provides a consistent platform to integrate a variety of factors, including those extracted by KDA and ARCSM to model vulnerability uncertainty. We also consider the model's uncertainty and use the Bayesian model average and Occam's Window to average the multiple models obtained by our approach to robust prediction of the risk and vulnerability. We compare our approach with other probabilistic models in the case study of seismic risk and conclude that our approach is a good means to mining spatial data sets for evaluating vulnerability. PMID:22404550

  19. Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats

    SciTech Connect

    Mills, Richard T; Hoffman, Forrest M; Kumar, Jitendra; HargroveJr., William Walter

    2011-01-01

    We investigate methods for geospatiotemporal data mining of multi-year land surface phenology data (250 m2 Normalized Difference Vegetation Index (NDVI) values derived from the Moderate Resolution Imaging Spectrometer (MODIS) in this study) for the conterminous United States (CONUS) as part of an early warning system for detecting threats to forest ecosystems. The approaches explored here are based on k-means cluster analysis of this massive data set, which provides a basis for defining the bounds of the expected or normal phenological patterns that indicate healthy vegetation at a given geographic location. We briefly describe the computational approaches we have used to make cluster analysis of such massive data sets feasible, describe approaches we have explored for distinguishing between normal and abnormal phenology, and present some examples in which we have applied these approaches to identify various forest disturbances in the CONUS.

  20. A Study of the Homogeneity of Items Produced From Item Forms Across Different Taxonomic Levels.

    ERIC Educational Resources Information Center

    Weber, Margaret B.; Argo, Jana K.

    This study determined whether item forms ( rules for constructing items related to a domain or set of tasks) would enable naive item writers to generate multiple-choice items at three taxonomic levels--knowledge, comprehension, and application. Students wrote 120 multiple-choice items from 20 item forms, corresponding to educational objectives…

  1. Mining Hesitation Information by Vague Association Rules

    NASA Astrophysics Data System (ADS)

    Lu, An; Ng, Wilfred

    In many online shopping applications, such as Amazon and eBay, traditional Association Rule (AR) mining has limitations as it only deals with the items that are sold but ignores the items that are almost sold (for example, those items that are put into the basket but not checked out). We say that those almost sold items carry hesitation information, since customers are hesitating to buy them. The hesitation information of items is valuable knowledge for the design of good selling strategies. However, there is no conceptual model that is able to capture different statuses of hesitation information. Herein, we apply and extend vague set theory in the context of AR mining. We define the concepts of attractiveness and hesitation of an item, which represent the overall information of a customer's intent on an item. Based on the two concepts, we propose the notion of Vague Association Rules (VARs). We devise an efficient algorithm to mine the VARs. Our experiments show that our algorithm is efficient and the VARs capture more specific and richer information than do the traditional ARs.

  2. On Bayesian Rules for Selecting 3PL Binary Items for Criterion-Referenced Interpretations and Creating Booklets for Bookmark Standard Setting.

    ERIC Educational Resources Information Center

    Huynh, Huynh

    By noting that a Rasch or two parameter logistic (2PL) item belongs to the exponential family of random variables and that the probability density function (pdf) of the correct response (X=1) and the incorrect response (X=0) are symmetric with respect to the vertical line at the item location, it is shown that the conjugate prior for ability is…

  3. Unlocking Data for Statistical Analyses and Data Mining: Generic Case Extraction of Clinical Items from i2b2 and tranSMART.

    PubMed

    Firnkorn, Daniel; Merker, Sebastian; Ganzinger, Matthias; Muley, Thomas; Knaup, Petra

    2016-01-01

    In medical science, modern IT concepts are increasingly important to gather new findings out of complex diseases. Data Warehouses (DWH) as central data repository systems play a key role by providing standardized, high-quality and secure medical data for effective analyses. However, DWHs in medicine must fulfil various requirements concerning data privacy and the ability to describe the complexity of (rare) disease phenomena. Here, i2b2 and tranSMART are free alternatives representing DWH solutions especially developed for medical informatics purposes. But different functionalities are not yet provided in a sufficient way. In fact, data import and export is still a major problem because of the diversity of schemas, parameter definitions and data quality which are described variously in each single clinic. Further, statistical analyses inside i2b2 and tranSMART are possible, but restricted to the implemented functions. Thus, data export is needed to provide a data basis which can be directly included within statistics software like SPSS and SAS or data mining tools like Weka and RapidMiner. The standard export tools of i2b2 and tranSMART are more or less creating a database dump of key-value pairs which cannot be used immediately by the mentioned tools. They need an instance-based or a case-based representation of each patient. To overcome this lack, we developed a concept called Generic Case Extractor (GCE) which pivots the key-value pairs of each clinical fact into a row-oriented format for each patient sufficient to enable analyses in a broader context. Therefore, complex pivotisation routines where necessary to ensure temporal consistency especially in terms of different data sets and the occurrence of identical but repeated parameters like follow-up data. GCE is embedded inside a comprehensive software platform for systems medicine. PMID:27577447

  4. Hydrogeologic setting and simulation of groundwater flow near the Canterbury and Leadville Mine Drainage Tunnels, Leadville, Colorado

    USGS Publications Warehouse

    Wellman, Tristan P.; Paschke, Suzanne S.; Minsley, Burke; Dupree, Jean A.

    2011-01-01

    63 percent of the total water budget, respectively). The largest contributions to the water budget were groundwater entering from the upper reaches of the watershed and the hydrologic interaction of the groundwater with the East Fork Arkansas River. Potentiometric surface maps of the simulated model results were generated for depths of 50, 100, and 250 m. The surfaces revealed a positive trend in hydraulic head with land-surface altitude and evidence of increased control on fluid movement by the fault network structure at progressively greater depths in the aquifer. Results of advective particle-tracking simulations indicate that the sets of simulated flow paths for the Canterbury Tunnel and the Leadville Mine Drainage Tunnel were mutually exclusive of one another, which also suggested that no major hydraulic connection was present between the tunnels. Particle-tracking simulations also revealed that although the fault network generally restricted groundwater movement locally, hydrologic conditions were such that groundwater did cross the fault network at many locations. This cross-fault movement indicates that the fault network controls regional groundwater flow to some degree but is not a complete barrier to flow. The cumulative distributions of adjusted age results for the watershed indicate that approximately 30 percent of the flow pathways transmit groundwater that was younger than 68 years old (post-1941) and that about 70 percent of the flow pathways transmit old groundwater. The particle-tracking results are consistent with the apparent ages and mixing ratios developed from the chlorofluorocarbon and tritium results. The model simulations also indicate that approximately 50 percent of the groundwater flowing through the study area was less than 200 years old and about 50 percent of the groundwater flowing through the study area is old water stored in low-permeability geologic units and fault blocks. As a final examination of model response, the conductance

  5. A Condition-Enumeration Tree method for mining biclusters from DNA microarray data sets.

    PubMed

    Chen, Jiun-Rung; Chang, Ye-In

    2009-07-01

    Biclustering, which performs simultaneous clustering of rows (e.g., genes) and columns (e.g., conditions), has proved of great value for finding interesting patterns from microarray data. To find biclusters, a model called pCluster was proposed. A pCluster consists of a set of genes and a set of conditions, where the expression levels of these genes have a similar variation under these conditions. Based on this model, most of the previous methods need to compute MDSs (maximum dimension sets) for every two genes in the microarray data. Since the number of genes is far larger than the number of conditions, this step is inefficient. Another method called MicroCluster was proposed. This method does not compute MDSs for every two genes, and transforms the problem into a graph problem. However, it needs to solve the Maximal Clique problem, which is NP-Complete. To avoid the above disadvantages, in this paper, we propose a new method, CE-Tree (Condition-Enumeration Tree), for finding pClusters. Instead of generating MDSs for every two genes, we generate only MDSs for every two conditions. Then, based only on these MDSs, we expand the CE-Tree in a special local breadth-first within global depth-first manner to efficiently find all pClusters. We also utilize the idea of the traditional hash join approach to efficiently support the CE-Tree. From the simulation results, we show that the CE-Tree method could find pClusters more efficiently than those previous methods. PMID:19393714

  6. Data Mining on Large Data Set for Predicting Salmon Spawning Habitat

    SciTech Connect

    Xie, YuLong; Murray, Christopher J.; Hanrahan, Timothy P.; Geist, David R.

    2008-07-01

    Hydraulic properties related to river flow affect salmon spawning habitat. Accurate prediction of salmon spawning habitat and understanding the influential properties on the spawning behavior are of great interest for hydroelectric dam management. Previous research predicted salmon spawning habitat through deriving river specific spawning suitability indices and employing a function estimate method like logistic regression on several static river flow related properties and had some success. The objective of this study was two-fold. First dynamic river flow properties associated with upstream dam operation were successfully derived from a huge set of time series of both water velocity and water depth for about one fifth of a million habitat cells through principal component analysis (PCA) using nonlinear iterative partial least squares (NIPLAS). The inclusion of dynamic variables in the models greatly improved the model prediction. Secondly, nine machine learning methods were applied to the data and it was found that decision tree and rule induction methods were generally outperformed usually used logistic regression. Specifically random forest, an advanced decision tree algorithm, provided unanimous better results. Over-prediction problem in previous studies were greatly alleviated.

  7. Bibliomining for Automated Collection Development in a Digital Library Setting: Using Data Mining To Discover Web-Based Scholarly Research Works.

    ERIC Educational Resources Information Center

    Nicholson, Scott

    2003-01-01

    Discusses quality issues regarding Web sites and describes research that created an intelligent agent for automated collection development in a digital academic library setting, which uses a predictive model based on facets of each Web page to select scholarly works. Describes the use of bibliomining, or data mining for libraries. (Author/LRW)

  8. Empirical Mining of Large Data Sets Already Helps to Solve Practical Ecological Problems; A Panoply of Working Examples (Invited)

    NASA Astrophysics Data System (ADS)

    Hargrove, W. W.; Hoffman, F. M.; Kumar, J.; Spruce, J.; Norman, S. P.

    2013-12-01

    Here we present diverse examples where empirical mining and statistical analysis of large data sets have already been shown to be useful for a wide variety of practical decision-making problems within the realm of large-scale ecology. Because a full understanding and appreciation of particular ecological phenomena are possible only after hypothesis-directed research regarding the existence and nature of that process, some ecologists may feel that purely empirical data harvesting may represent a less-than-satisfactory approach. Restricting ourselves exclusively to process-driven approaches, however, may actually slow progress, particularly for more complex or subtle ecological processes. We may not be able to afford the delays caused by such directed approaches. Rather than attempting to formulate and ask every relevant question correctly, empirical methods allow trends, relationships and associations to emerge freely from the data themselves, unencumbered by a priori theories, ideas and prejudices that have been imposed upon them. Although they cannot directly demonstrate causality, empirical methods can be extremely efficient at uncovering strong correlations with intermediate "linking" variables. In practice, these correlative structures and linking variables, once identified, may provide sufficient predictive power to be useful themselves. Such correlation "shadows" of causation can be harnessed by, e.g., Bayesian Belief Nets, which bias ecological management decisions, made with incomplete information, toward favorable outcomes. Empirical data-harvesting also generates a myriad of testable hypotheses regarding processes, some of which may even be correct. Quantitative statistical regionalizations based on quantitative multivariate similarity have lended insights into carbon eddy-flux direction and magnitude, wildfire biophysical conditions, phenological ecoregions useful for vegetation type mapping and monitoring, forest disease risk maps (e.g., sudden oak

  9. Accommodations for English Language Learner Students: The Effect of Linguistic Modification of Math Test Item Sets. Final Report. NCEE 2009-4079

    ERIC Educational Resources Information Center

    Sato, Edynn; Rabinowitz, Stanley; Gallagher, Carole; Huang, Chun-Wei

    2010-01-01

    This study examined the effect of linguistic modification on middle school students' ability to show what they know and can do on math assessments. REL West's study on middle school math assessment accommodations found that simplifying the language--or linguistic modification--on standardized math test items made it easier for English Language…

  10. Data set of world phosphate mines, deposits, and occurrences: Part A. geologic data; Part B. location and mineral economic data

    USGS Publications Warehouse

    Chernoff, Carlotta B.; Orris, G.J.

    2002-01-01

    An inventory of more than 1,600 world phosphate mines, deposits, and occurrences was compiled from smaller data sets collected as part of multiple research efforts by Carlotta Chernoff, University of Arizona, and Greta Orris, U.S. Geological Survey. These data have been utilized during studies of black shale depositional environments and to construct phosphate deposit models. The compiled data have been edited for consistency and additional location information has been added where possible. The database of compiled phosphate information is being released in two sections; the geologic data in one section and the location and mineral economic data in the second. This report, U.S. Geological Survey Open-File Report 02–156–A, contains the geologic data and is best used with the complimentary data contained in Open-File Report 02–156–B. U.S. Geological Survey Open-File Report 02–156–B contains commodity data, location and analytical data, a variety of mineral economic data, reference information, and pointers to related records in the U.S. Geological Survey National mineral databases—MASMILS and MRDS.

  11. Item-Writing Guidelines for Physics

    ERIC Educational Resources Information Center

    Regan, Tom

    2015-01-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…

  12. Application of Optimal Designs to Item Calibration

    PubMed Central

    Lu, Hung-Yi

    2014-01-01

    In computerized adaptive testing (CAT), examinees are presented with various sets of items chosen from a precalibrated item pool. Consequently, the attrition speed of the items is extremely fast, and replenishing the item pool is essential. Therefore, item calibration has become a crucial concern in maintaining item banks. In this study, a two-parameter logistic model is used. We applied optimal designs and adaptive sequential analysis to solve this item calibration problem. The results indicated that the proposed optimal designs are cost effective and time efficient. PMID:25188318

  13. Using the Advanced Progressive Matrices (Set I) to Assess Fluid Ability in a Short Time Frame: An Item Response Theory-Based Analysis

    ERIC Educational Resources Information Center

    Chiesi, Francesca; Ciancaleoni, Matteo; Galli, Silvia; Primi, Caterina

    2012-01-01

    This article is aimed at evaluating the possibility that Set I of the Advanced Progressive Matrices (APM-Set I) can be employed to assess fluid ability in a short time frame. The APM-Set I was administered to a sample of 1,389 primary and secondary school students. Confirmatory factor analysis attested to the unidimensionality of the scale. Item…

  14. Transfer of terrestrial technology for lunar mining

    NASA Technical Reports Server (NTRS)

    Hall, Robert A.; Green, Patricia A.

    1992-01-01

    The functions, operational procedures, and major items of equipment that comprise the terrestrial mining process are characterized. These data are used to synthesize a similar activity on the lunar surface. Functions, operations, and types of equipment that can be suitably transferred to lunar operation are identified. Shortfalls, enhancements, and technology development needs are described. The lunar mining process and what is required to adapt terrestrial equipment are highlighted. It is concluded that translation of terrestrial mining equipment and operational processes to perform similar functions on the lunar surface is practical. Adequate attention must be given to the harsh environment and logistical constraints of the lunar setting. By using earth-based equipment as a forcing function, near- and long-term benefits are derived (i.e., improved terrestrial mining in the near term vis-a-vis commercial production of helium-3 in the long term.

  15. Accuracy Assessment of Geometrical Elements for Setting-Out in Horizontal Plane of Conveying Chambers at the Bauxite Mine "KOSTURI" Srebrenica

    NASA Astrophysics Data System (ADS)

    Milutinović, Aleksandar; Ganić, Aleksandar; Tokalić, Rade

    2014-03-01

    Setting-out of objects on the exploitation field of the mine, both in surface mining and in the underground mines, is determined by the specified setting-out accuracy of reference points, which are best to define spatial position of the object projected. For the purpose of achieving of the specified accuracy, it is necessary to perform a priori accuracy assessment of parameters, which are to be used when performing setting-out. Based on the a priori accuracy assessment, verification of the quality of geometrical setting- -out elements specified in the layout; definition of the accuracy for setting-out of geometrical elements; selection of setting-out method; selection at the type and class of instruments and tools that need to be applied in order to achieve predefined accuracy. The paper displays the accuracy assessment of geometrical elements for setting-out of the main haul gallery, haul downcast and helical conveying downcasts in shape of an inclined helix in horizontal plane, using the example of the underground bauxite mine »Kosturi«, Srebrenica. Wytyczanie obiektów na polu wydobywczym w kopalniach, zarówno podziemnych jak i odkrywkowych, zależy w dużej mierze od określonej dokładności wytyczania punktów referencyjnych, przy pomocy których określane jest następnie położenie przestrzenne pozostałych obiektów. W celu uzyskania założonej dokładności, należy przeprowadzić wstępną analizę dokładności oszacowania parametrów które następnie wykorzystane będą w procesie wytyczania. W oparciu o wyniki wstępnej analizy dokładności dokonuje się weryfikacji jakości geometrycznego wytyczenia elementów zaznaczonych na szkicu, uwzględniając te wyniki dobrać należy odpowiednią metodę wytyczania i rodzaj oraz klasę wykorzystywanych narzędzi i instrumentów, tak by osiągnąć założony poziom dokładności. W pracy przedstawiono oszacowanie dokładności wytyczania elementów geometrycznych dla głównego chodnika transportowego

  16. Item Difficulty Modeling of Paragraph Comprehension Items

    ERIC Educational Resources Information Center

    Gorin, Joanna S.; Embretson, Susan E.

    2006-01-01

    Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

  17. Perspectives on Knowledge Discovery Algorithms Recently Introduced in Chemoinformatics: Rough Set Theory, Association Rule Mining, Emerging Patterns, and Formal Concept Analysis.

    PubMed

    Gardiner, Eleanor J; Gillet, Valerie J

    2015-09-28

    Knowledge Discovery in Databases (KDD) refers to the use of methodologies from machine learning, pattern recognition, statistics, and other fields to extract knowledge from large collections of data, where the knowledge is not explicitly available as part of the database structure. In this paper, we describe four modern data mining techniques, Rough Set Theory (RST), Association Rule Mining (ARM), Emerging Pattern Mining (EP), and Formal Concept Analysis (FCA), and we have attempted to give an exhaustive list of their chemoinformatics applications. One of the main strengths of these methods is their descriptive ability. When used to derive rules, for example, in structure-activity relationships, the rules have clear physical meaning. This review has shown that there are close relationships between the methods. Often apparent differences lie in the way in which the problem under investigation has been formulated which can lead to the natural adoption of one or other method. For example, the idea of a structural alert, as a structure which is present in toxic and absent in nontoxic compounds, leads to the natural formulation of an Emerging Pattern search. Despite the similarities between the methods, each has its strengths. RST is useful for dealing with uncertain and noisy data. Its main chemoinformatics applications so far have been in feature extraction and feature reduction, the latter often as input to another data mining method, such as an Support Vector Machine (SVM). ARM has mostly been used for frequent subgraph mining. EP and FCA have both been used to mine both structural and nonstructural patterns for classification of both active and inactive molecules. Since their introduction in the 1980s and 1990s, RST, ARM, EP, and FCA have found wide-ranging applications, with many thousands of citations in Web of Science, but their adoption by the chemoinformatics community has been relatively slow. Advances, both in computer power and in algorithm development

  18. How Important Are Items on a Student Evaluation? A Study of Item Salience

    ERIC Educational Resources Information Center

    Hills, Stacey Barlow; Naegle, Natali; Bartkus, Kenneth R.

    2009-01-01

    Although student evaluations of teaching (SETs) have been the subject of numerous research studies, the salience of SET items to students has not been examined. In the present study, the authors surveyed 484 students from a large public university. The authors suggest that not all items are viewed equally and that measures of item salience can…

  19. Automatic Item Generation of Probability Word Problems

    ERIC Educational Resources Information Center

    Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina

    2009-01-01

    Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

  20. Computerized Numerical Control Test Item Bank.

    ERIC Educational Resources Information Center

    Reneau, Fred; And Others

    This guide contains 285 test items for use in teaching a course in computerized numerical control. All test items were reviewed, revised, and validated by incumbent workers and subject matter instructors. Items are provided for assessing student achievement in such aspects of programming and planning, setting up, and operating machines with…

  1. WWC Review of the Report "Accommodations for English Language Learner Students: The Effect of Linguistic Modification of Math Test Item Sets"

    ERIC Educational Resources Information Center

    What Works Clearinghouse, 2012

    2012-01-01

    The research described in this report is a randomized controlled trial in which seventh- and eighth-grade students were randomly assigned to complete a set of 25 math questions delivered with either standard language or language that had undergone "linguistic modification" by the research team. The purpose of the study was to assess the effects of…

  2. Mining machine

    SciTech Connect

    Parrott, G.A.

    1985-05-07

    A haulage system for a mining machine comprises a mining machine mounted on and/or guided by a conveyor and reciprocable with respect thereto, the conveyor being provided with a rack having plural rows of teeth of identical pitch, with the teeth of one row staggered with respect to an adjacent row(s), and the machine being provided with at least one power driven haulage sprocket comprising plural sets of peripherally arranged teeth of identical pitch, one set being angularly staggered with respect to an adjacent set(s), whereby one set is engageable with each row of teeth of the rack. The invention also includes a mining machine provided with such a power driven haulage sprocket, and a rack as above described and provided with end fittings for securing in articulated manner to an adjacent rack.

  3. SiBIC: a web server for generating gene set networks based on biclusters obtained by maximal frequent itemset mining.

    PubMed

    Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2013-01-01

    Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp. PMID:24386124

  4. SiBIC: A Web Server for Generating Gene Set Networks Based on Biclusters Obtained by Maximal Frequent Itemset Mining

    PubMed Central

    Takahashi, Kei-ichiro; Takigawa, Ichigaku; Mamitsuka, Hiroshi

    2013-01-01

    Detecting biclusters from expression data is useful, since biclusters are coexpressed genes under only part of all given experimental conditions. We present a software called SiBIC, which from a given expression dataset, first exhaustively enumerates biclusters, which are then merged into rather independent biclusters, which finally are used to generate gene set networks, in which a gene set assigned to one node has coexpressed genes. We evaluated each step of this procedure: 1) significance of the generated biclusters biologically and statistically, 2) biological quality of merged biclusters, and 3) biological significance of gene set networks. We emphasize that gene set networks, in which nodes are not genes but gene sets, can be more compact than usual gene networks, meaning that gene set networks are more comprehensible. SiBIC is available at http://utrecht.kuicr.kyoto-u.ac.jp:8080/miami/faces/index.jsp. PMID:24386124

  5. Item-Writing Guidelines for Physics

    NASA Astrophysics Data System (ADS)

    Regan, Tom

    2015-11-01

    A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines—lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly discussed here. The teacher may also encounter some guidelines that are not applicable in all assessment settings, or that can be applied to physics only with care. Two such guidelines, and their limitations, will be discussed as well.

  6. The Licensing of Negative Sensitive Items in Jordanian Arabic

    ERIC Educational Resources Information Center

    Alsarayreh, Atef

    2012-01-01

    This study investigates the licensing conditions on Negative Sensitive Items (NSIs) in Jordanian Arabic (JA). JA exhibits both types of NSIs that are discussed in the literature: Negative Polarity Items (NPIs) and Negative Concord Items (NCIs). Although these two sets of items seem to form a natural class in the sense that they show certain…

  7. Defining Deficient Items by IRT Analysis of Calibration Data.

    ERIC Educational Resources Information Center

    Krass, Iosif A.; Thomasson, Gary L.

    New items are being calibrated for the next generation of the computerized adaptive (CAT) version of the Armed Services Vocational Aptitude Battery (ASVAB) (Forms 5 and 6). The requirements that the items be "good" three-parameter logistic (3-PL) model items and typically "like" items in the previous CAT-ASVAB tests have resulted in a set of…

  8. Utilizing Response Time Distributions for Item Selection in CAT

    ERIC Educational Resources Information Center

    Fan, Zhewen; Wang, Chun; Chang, Hua-Hua; Douglas, Jeffrey

    2012-01-01

    Traditional methods for item selection in computerized adaptive testing only focus on item information without taking into consideration the time required to answer an item. As a result, some examinees may receive a set of items that take a very long time to finish, and information is not accrued as efficiently as possible. The authors propose two…

  9. Estimation of general and set steps of roof caving in rock mass with excavations at mining. Numerical modelling.

    NASA Astrophysics Data System (ADS)

    Eremin, M.; Makarov, P.

    2016-04-01

    The results of 2D modelling of rock mass elements fracture are shown in the article. The results of modelling are in a good agreement with empirical and theoretical estimations of roof caving steps for the flat-dipping coal seams when the horizons are not so deep (less than 600 m). The estimations of the general and set steps of roof caving are given for the lava conditions at different lengths of the main roof containing sandstone.

  10. Item Purification Does Not Always Improve DIF Detection: A Counterexample with Angoff's Delta Plot

    ERIC Educational Resources Information Center

    Magis, David; Facon, Bruno

    2013-01-01

    Item purification is an iterative process that is often advocated as improving the identification of items affected by differential item functioning (DIF). With test-score-based DIF detection methods, item purification iteratively removes the items currently flagged as DIF from the test scores to get purified sets of items, unaffected by DIF. The…

  11. Automated Item Selection Using Item Response Theory.

    ERIC Educational Resources Information Center

    Stocking, Martha L.; And Others

    This paper presents a new heuristic approach to interactive test assembly that is called the successive item replacement algorithm. This approach builds on the work of W. J. van der Linden (1987) and W. J. van der Linden and E. Boekkooi-Timminga (1989) in which methods of mathematical optimization are combined with item response theory to…

  12. 48 CFR 852.214-72 - Alternate item(s).

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 48 Federal Acquisition Regulations System 5 2014-10-01 2014-10-01 false Alternate item(s). 852.214... Alternate item(s). As prescribed in 814.201-6(b)(2), insert the following provision: Alternate Item(s) (JAN... .** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer...

  13. 48 CFR 852.214-72 - Alternate item(s).

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 48 Federal Acquisition Regulations System 5 2010-10-01 2010-10-01 false Alternate item(s). 852.214... Alternate item(s). As prescribed in 814.201-6(b)(2), insert the following provision: Alternate Item(s) (JAN... .** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer...

  14. 48 CFR 852.214-72 - Alternate item(s).

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 48 Federal Acquisition Regulations System 5 2012-10-01 2012-10-01 false Alternate item(s). 852.214... Alternate item(s). As prescribed in 814.201-6(b)(2), insert the following provision: Alternate Item(s) (JAN... .** * Contracting officer will insert an alternate item that is considered acceptable. ** Contracting officer...

  15. Item versus System Learning: Explaining Free Variation.

    ERIC Educational Resources Information Center

    Ellis, Rod

    1999-01-01

    Provides an explanation for the existence of free variation in learner language. Argues that interlanguage is best conceptualized as sets of loose lexical networks that are gradually reorganized into a system or systems. Free variation arises when learners add items to those they have already acquired and before they analyze these items and…

  16. Simulated Equating Using Several Item Response Curves.

    ERIC Educational Resources Information Center

    Boldt, R. F.

    The comparison of item response theory models for the Test of English as a Foreign Language (TOEFL) was extended to an equating context as simulation trials were used to "equate the test to itself." Equating sample data were generated from administration of identical item sets. Equatings that used procedures based on each model (simple item…

  17. Random Item IRT Models

    ERIC Educational Resources Information Center

    De Boeck, Paul

    2008-01-01

    It is common practice in IRT to consider items as fixed and persons as random. Both, continuous and categorical person parameters are most often random variables, whereas for items only continuous parameters are used and they are commonly of the fixed type, although exceptions occur. It is shown in the present article that random item parameters…

  18. A Novel Computational Tool for Mining Real-Life Data: Application in the Metastatic Colorectal Cancer Care Setting

    PubMed Central

    Siegelmann-Danieli, Nava; Farkash, Ariel; Katzir, Itzhak; Vesterman Landes, Janet; Rotem Rabinovich, Hadas; Lomnicky, Yossef; Carmeli, Boaz; Parush-Shear-Yashuv, Naama

    2016-01-01

    Background Randomized clinical trials constitute the gold-standard for evaluating new anti-cancer therapies; however, real-life data are key in complementing clinically useful information. We developed a computational tool for real-life data analysis and applied it to the metastatic colorectal cancer (mCRC) setting. This tool addressed the impact of oncology/non-oncology parameters on treatment patterns and clinical outcomes. Methods The developed tool enables extraction of any computerized information including comorbidities and use of drugs (oncological/non-oncological) per individual HMO member. The study in which we evaluated this tool was a retrospective cohort study that included Maccabi Healthcare Services members with mCRC receiving bevacizumab with fluoropyrimidines (FP), FP plus oxaliplatin (FP-O), or FP plus irinotecan (FP-I) in the first-line between 9/2006 and 12/2013. Results The analysis included 753 patients of whom 15.4% underwent subsequent metastasectomy (the Surgery group). For the entire cohort, median overall survival (OS) was 20.5 months; in the Surgery group, median duration of bevacizumab-containing therapy (DOT) pre-surgery was 6.1 months; median OS was not reached. In the Non-surgery group, median OS and DOT were 18.7 and 11.4 months, respectively; no significant OS differences were noted between FP-O and FP-I, whereas FP use was associated with shorter OS (12.3 month; p <0.002; notably, these patients were older). Patients who received both FP-O- and FP-I-based regimens achieved numerically longer OS vs. those who received only one of these regimens (22.1 [19.9–24.0] vs. 18.9 [15.5–21.9] months). Among patients assessed for wild-type KRAS and treated with subsequent anti-EGFR agent, OS was 25.4 months and 18.7 months for 124 treated vs. 37 non-treated patients (non-significant). Cox analysis (controlling for age and gender) identified several non-oncology parameters associated with poorer clinical outcomes including concurrent use of

  19. Linking pain items from two studies onto a common scale using item response theory.

    PubMed

    Chen, Wen-Hung; Revicki, Dennis A; Lai, Jin-Shei; Cook, Karon F; Amtmann, Dagmar

    2009-10-01

    This study examined two approaches to linking items from two pain surveys to form a single item bank with a common measurement scale. Secondary analysis of two independent surveys: Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials Survey with Main Survey (959 chronic pain patients; 42 pain items) and Pain Module (n=148; 36 pain items), and Center on Outcomes, Research and Education Survey (400 cancer patients; 43 pain items). There were common items included among the three data sets. In the first approach, all items were calibrated to an item response theory (IRT) model simultaneously, and in the second approach, items were calibrated separately and then the scales were transformed to a common metric. The two approaches produced similar linking results across the two sets of pain interference items because there was sufficient number of common items and large enough sample size. For pain intensity, simultaneous calibration yielded more stable results. Separated calibration yielded an unsatisfactory linking result for pain intensity because of a single common item with small sample size. The results suggested that a simultaneous IRT calibration method produces the more stable item parameters across independent samples, and hence, should be recommended for developing comprehensive item banks. Patient-reported health outcome surveys are often limited in sample sizes and the number of items owing to the difficulty of recruitment and the burden to the patients. As a result, the surveys either lack statistical power or are limited in scope. Using IRT methodology, survey data can be pooled to lend strength to each other to expand the scope and to increase the sample sizes. PMID:19577422

  20. An Analysis of Sex-Related Differential Item Functioning in Attitude Assessment.

    ERIC Educational Resources Information Center

    Dodeen, Hamzeh; Johanson, George A.

    2003-01-01

    Analyzed and classified items that display sex-related differential item functioning (DIF) in attitude assessment. Analyzed 982 items, from 23 real data sets, that measure attitudes. Found that sex DIF is common in attitude scales: more than 27 percent of items showed DIF related to sex, 15 percent of the items exhibited moderate to large DIF, and…

  1. Commercial Data Mining Software

    NASA Astrophysics Data System (ADS)

    Zhang, Qingyu; Segall, Richard S.

    This chapter discusses selected commercial software for data mining, supercomputing data mining, text mining, and web mining. The selected software are compared with their features and also applied to available data sets. The software for data mining are SAS Enterprise Miner, Megaputer PolyAnalyst 5.0, PASW (formerly SPSS Clementine), IBM Intelligent Miner, and BioDiscovery GeneSight. The software for supercomputing are Avizo by Visualization Science Group and JMP Genomics from SAS Institute. The software for text mining are SAS Text Miner and Megaputer PolyAnalyst 5.0. The software for web mining are Megaputer PolyAnalyst and SPSS Clementine . Background on related literature and software are presented. Screen shots of each of the selected software are presented, as are conclusions and future directions.

  2. The MMPI Critical Items: How Well Do They Work?

    ERIC Educational Resources Information Center

    Koss, Mary P.; And Others

    1976-01-01

    The theoretical foundation and empirical performance of two sets of "critical items" that are a common feature of most automated Minnesota Multiphasic Personality Inventory interpretations are examined. These items purportedly alert the clinician to serious pathology and is a scale of general maladjustment. The critical items resulted in numerous…

  3. Evaluating Common Item Block Options When Faced with Practical Constraints

    ERIC Educational Resources Information Center

    Wolkowitz, Amanda; Davis-Becker, Susan

    2015-01-01

    This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…

  4. 17 CFR 210.4-02 - Items not material.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 17 Commodity and Securities Exchanges 2 2013-04-01 2013-04-01 false Items not material. 210.4-02... 1975 Rules of General Application § 210.4-02 Items not material. If the amount which would otherwise be required to be shown with respect to any item is not material, it need not be separately set forth....

  5. 17 CFR 210.4-02 - Items not material.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 17 Commodity and Securities Exchanges 3 2014-04-01 2014-04-01 false Items not material. 210.4-02... 1975 Rules of General Application § 210.4-02 Items not material. If the amount which would otherwise be required to be shown with respect to any item is not material, it need not be separately set forth....

  6. 17 CFR 210.4-02 - Items not material.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 17 Commodity and Securities Exchanges 2 2012-04-01 2012-04-01 false Items not material. 210.4-02... 1975 Rules of General Application § 210.4-02 Items not material. If the amount which would otherwise be required to be shown with respect to any item is not material, it need not be separately set forth....

  7. The Effects of Testlets on Reliability and Differential Item Functioning

    ERIC Educational Resources Information Center

    Teker, Gulsen Tasdelen; Dogan, Nuri

    2015-01-01

    Reliability and differential item functioning (DIF) analyses were conducted on testlets displaying local item dependence in this study. The data set employed in the research was obtained from the answers given by 1,500 students to the 20 items included in six testlets given in English Proficiency Exam by the School of Foreign Languages of a state…

  8. A Lognormal Model for Response Times on Test Items

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2006-01-01

    A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the two-parameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity and discriminating power of each item. It is…

  9. Minimum Redundancy Item Analysis

    ERIC Educational Resources Information Center

    Neill, John A.; Jackson, Douglas N.

    1976-01-01

    Illustrates a multivariate approach to item analysis. Previous formulation is extended by investigating techniques simultaneously taking into account scale variance with the goal of reducing the average correlation between scales. Study examines problems in determining optimum values for combinations of item parameters selected for personality…

  10. The Use of Unidimensional Item Parameter Estimates of Multidimensional Items in Adaptive Testing.

    ERIC Educational Resources Information Center

    Ackerman, Terry A.

    The purpose of this study was to investigate the effect of using multidimensional items in a computer adaptive test (CAT) setting which assumes a unidimensional item response theory (IRT) framework. Previous research has suggested that the composite of multidimensional abilities being estimated by a unidimensional IRT model is not constant…

  11. Collaborative Data Mining

    NASA Astrophysics Data System (ADS)

    Moyle, Steve

    Collaborative Data Mining is a setting where the Data Mining effort is distributed to multiple collaborating agents - human or software. The objective of the collaborative Data Mining effort is to produce solutions to the tackled Data Mining problem which are considered better by some metric, with respect to those solutions that would have been achieved by individual, non-collaborating agents. The solutions require evaluation, comparison, and approaches for combination. Collaboration requires communication, and implies some form of community. The human form of collaboration is a social task. Organizing communities in an effective manner is non-trivial and often requires well defined roles and processes. Data Mining, too, benefits from a standard process. This chapter explores the standard Data Mining process CRISP-DM utilized in a collaborative setting.

  12. Increasing Item Complexity: A Possible Cause of Scale Shrinkage for Unidimensional Item Response Theory.

    ERIC Educational Resources Information Center

    Yen, Wendy M.

    1985-01-01

    An approximate relationship is devised between the unidimensional model used in data analysis and a multidimensional model hypothesized to be generating the item responses. Scale shrinkage is successfully predicted for several sets of simulated data. (Author/LMO)

  13. Item Writer Judgments of Item Difficulty versus Actual Item Difficulty: A Case Study

    ERIC Educational Resources Information Center

    Sydorenko, Tetyana

    2011-01-01

    This study investigates how accurate one item writer can be on item difficulty estimates and whether factors affecting item writer judgments correspond to predictors of actual item difficulty. The items were based on conversational dialogs (presented as videos online) that focus on pragmatic functions. Thirty-five 2nd-, 3rd-, and 4th-year learners…

  14. Classification accuracy across multiple tests following item method directed forgetting.

    PubMed

    Goernert, Phillip N; Widner, Robert L; Otani, Hajime

    2007-09-01

    We investigated recall of line-drawing pictures paired at study with an instruction either to remember (TBR items) or to forget (TBF items). Across three 7-minute tests, net recall (items reported independent of accuracy in instructional designation) and correctly classified recall (recall conditional on correct instructional designation) showed directed forgetting. That is, for both measures, recall of TBR items always exceeded recall of TBF items. Net recall for both item types increased across tests at comparable levels showing hypermnesia. However, across tests, correct classification of both item types decreased at comparable levels. Collectively, hypermnesia as measured by net recall is possible for items from multiple sets, but at the cost of accurate source information. PMID:17676551

  15. Development of the PROMIS® Coping Expectancies of Smoking Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Stucky, Brian D.; Hansen, Mark; Cai, Li

    2014-01-01

    Objective: Smoking is a coping strategy for many smokers who then have difficulty finding new ways to cope with negative affect when they quit. This paper describes analyses conducted to develop and evaluate item banks for assessing the coping expectancies of smoking for daily and nondaily smokers. Methods: Using data from a large sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning (DIF) analyses (according to gender, age, and ethnicity) to arrive at a unidimensional set of items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) for assessing coping expectancies of smoking. Results: For both daily and nondaily smokers, the unidimensional Coping Expectancies item banks (21 items) are relatively DIF free and are highly reliable (0.96 and 0.97, respectively). A common 4-item SF for daily and nondaily smokers also showed good reliability (0.85). Adaptive tests required an average of 4.3 and 3.7 items for simulated daily and nondaily respondents, respectively, and achieved reliabilities of 0.91 for both when the maximum test length was 10 items. Conclusions: This research provides a new set of items that can be used to reliably assess coping expectancies of smoking, through a SF, CAT, or a tailored set selected for a specific research purpose. PMID:25118227

  16. The Role of Item Models in Automatic Item Generation

    ERIC Educational Resources Information Center

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  17. Item Overexposure in Computerized Classification Tests Using Sequential Item Selection

    ERIC Educational Resources Information Center

    Huebner, Alan

    2012-01-01

    Computerized classification tests (CCTs) often use sequential item selection which administers items according to maximizing psychometric information at a cut point demarcating passing and failing scores. This paper illustrates why this method of item selection leads to the overexposure of a significant number of items, and the performances of…

  18. Extending item response theory to online homework

    NASA Astrophysics Data System (ADS)

    Kortemeyer, Gerd

    2014-06-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination is robust over a wide range with respect to model assumptions and introduced noise. Item difficulty is also robust, but over a narrower range.

  19. EnvMine: A text-mining system for the automatic extraction of contextual information

    PubMed Central

    2010-01-01

    Background For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would be difficult to do otherwise. Also the characterization must include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles). So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieve contextual information (physicochemical variables and geographical locations) from textual sources of any kind. Results EnvMine is capable of retrieving the physicochemical variables cited in the text, by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved) of 92% with less than 1% error. Also a Bayesian classifier was tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings. Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location includes also the determination of its exact coordinates (latitude and longitude), thus allowing the calculation of distance between the individual locations. Conclusion EnvMine is a very efficient method for extracting contextual information from different text sources, like published articles or web pages. This tool can help in determining the precise location and physicochemical variables of sampling sites, thus

  20. Limited life item management

    NASA Technical Reports Server (NTRS)

    Eaglen, R. L.

    1971-01-01

    Plans are available for age-sensitive hardware management. Control plan identifies shelf life or age control requirements for materials considered age sensitive, use sensitive, or time service or shelf life controlled items, and describes methods of arriving at age controls through adherence to detailed specifications.

  1. Reading Current Events Items.

    ERIC Educational Resources Information Center

    Ediger, Marlow

    People who live in a democracy should be well informed of local, state, national, and international happenings. Students should become curious about news items and relate current happenings to the personal self. They must possess skills in word recognition and in diverse kinds of comprehension since reading is an important way to glean current…

  2. Selecting Lower Priced Items.

    ERIC Educational Resources Information Center

    Kleinert, Harold L.; And Others

    1988-01-01

    A program used to teach moderately to severely mentally handicapped students to select the lower priced items in actual shopping activities is described. Through a five-phase process, students are taught to compare prices themselves as well as take into consideration variations in the sizes of containers and varying product weights. (VW)

  3. ICCS 2009 User Guide for the International Database. Supplement 5: ICCS 2009 Released Items

    ERIC Educational Resources Information Center

    Brese, Falk; Jung, Michael; Mirazchiyski, Plamen; Schulz, Wolfram; Zuehlke, Olaf

    2011-01-01

    This document contains released items used in the ICCS 2009 main survey. Seven clusters of items were used in the study in a rotated booklet design with three clusters per test booklet. Clusters 1 and 7 comprise the released item set. Cluster 1 comprises items that were newly developed for ICCS 2009 and Cluster 7 is drawn from previously secure…

  4. Longwall mining

    SciTech Connect

    1995-03-14

    As part of EIA`s program to provide information on coal, this report, Longwall-Mining, describes longwall mining and compares it with other underground mining methods. Using data from EIA and private sector surveys, the report describes major changes in the geologic, technological, and operating characteristics of longwall mining over the past decade. Most important, the report shows how these changes led to dramatic improvements in longwall mining productivity. For readers interested in the history of longwall mining and greater detail on recent developments affecting longwall mining, the report includes a bibliography.

  5. Mining fuzzy association rules in spatio-temporal databases

    NASA Astrophysics Data System (ADS)

    Shu, Hong; Dong, Lin; Zhu, Xinyan

    2008-12-01

    A huge amount of geospatial and temporal data have been collected through various networks of environment monitoring stations. For instance, daily precipitation and temperature are observed at hundreds of meteorological stations in Northeastern China. However, these massive raw data from the stations are not fully utilized for meeting the requirements of human decision-making. In nature, the discovery of geographical data mining is the computation of multivariate spatio-temporal correlations through the stages of data mining. In this paper, a procedure of mining association rules in regional climate-changing databases is introduced. The methods of Kriging interpolation, fuzzy cmeans clustering, and Apriori-based logical rules extraction are employed subsequently. Formally, we define geographical spatio-temporal transactions and fuzzy association rules. Innovatively, we make fuzzy data conceptualization by means of fuzzy c-means clustering, and transform fuzzy data items with membership grades into Boolean data items with weights by means ofλ-cut sets. When the algorithm Apriori is executed on Boolean transactions with weights, fuzzy association rules are derived. Fuzzy association rules are more nature than crisp association rules for human cognition about the reality.

  6. Extending Item Response Theory to Online Homework

    ERIC Educational Resources Information Center

    Kortemeyer, Gerd

    2014-01-01

    Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for…

  7. Development of the PROMIS® Nicotine Dependence Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Stucky, Brian D.; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. Results: A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Conclusions: Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. PMID:25118226

  8. Development of the PROMIS® Health Expectancies of Smoking Item Banks

    PubMed Central

    Tucker, Joan S.; Shadel, William G.; Stucky, Brian D.; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Smokers’ health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. Results: A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5–6 items adaptively selected from the item banks. Conclusions: Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. PMID:25118229

  9. Research on parallel algorithm for sequential pattern mining

    NASA Astrophysics Data System (ADS)

    Zhou, Lijuan; Qin, Bai; Wang, Yu; Hao, Zhongxiao

    2008-03-01

    Sequential pattern mining is the mining of frequent sequences related to time or other orders from the sequence database. Its initial motivation is to discover the laws of customer purchasing in a time section by finding the frequent sequences. In recent years, sequential pattern mining has become an important direction of data mining, and its application field has not been confined to the business database and has extended to new data sources such as Web and advanced science fields such as DNA analysis. The data of sequential pattern mining has characteristics as follows: mass data amount and distributed storage. Most existing sequential pattern mining algorithms haven't considered the above-mentioned characteristics synthetically. According to the traits mentioned above and combining the parallel theory, this paper puts forward a new distributed parallel algorithm SPP(Sequential Pattern Parallel). The algorithm abides by the principal of pattern reduction and utilizes the divide-and-conquer strategy for parallelization. The first parallel task is to construct frequent item sets applying frequent concept and search space partition theory and the second task is to structure frequent sequences using the depth-first search method at each processor. The algorithm only needs to access the database twice and doesn't generate the candidated sequences, which abates the access time and improves the mining efficiency. Based on the random data generation procedure and different information structure designed, this paper simulated the SPP algorithm in a concrete parallel environment and implemented the AprioriAll algorithm. The experiments demonstrate that compared with AprioriAll, the SPP algorithm had excellent speedup factor and efficiency.

  10. Item-focussed Trees for the Identification of Items in Differential Item Functioning.

    PubMed

    Tutz, Gerhard; Berger, Moritz

    2016-09-01

    A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method. PMID:26596721

  11. Faculty Development on Item Writing Substantially Improves Item Quality

    ERIC Educational Resources Information Center

    Naeem, Naghma; van der Vleuten, Cees; Alfaris, Eiad Abdelmohsen

    2012-01-01

    The quality of items written for in-house examinations in medical schools remains a cause of concern. Several faculty development programs are aimed at improving faculty's item writing skills. The purpose of this study was to evaluate the effectiveness of a faculty development program in item development. An objective method was developed and used…

  12. Item Sampling: Optimum Number of People and Items.

    ERIC Educational Resources Information Center

    Moy, Mabel L. Y.; Barcikowski, Robert S.

    Using a computer-based Monte Carlo approach to generate item responses, the results of this study indicate that, when item discrimination indices are considered, item-examinee sampling procedures having the same number of observations have different standard errors in estimating both test mean and test variance. With certain types of tests, a…

  13. Real and Artificial Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Andrich, David; Hagquist, Curt

    2015-01-01

    Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…

  14. Component Identification and Item Difficulty of Raven's Matrices Items.

    ERIC Educational Resources Information Center

    Green, Kathy E.; Kluever, Raymond C.

    Item components that might contribute to the difficulty of items on the Raven Colored Progressive Matrices (CPM) and the Standard Progressive Matrices (SPM) were studied. Subjects providing responses to CPM items were 269 children aged 2 years 9 months to 11 years 8 months, most of whom were referred for testing as potentially gifted. A second…

  15. The Communicative Participation Item Bank (CPIB): Item bank calibration and development of a disorder-generic short form

    PubMed Central

    Baylor, Carolyn; Yorkston, Kathryn; Eadie, Tanya; Kim, Jiseon; Chung, Hyewon; Amtmann, Dagmar

    2015-01-01

    Purpose The purpose of this study was to calibrate the items for the Communicative Participation Item Bank (CPIB) using Item Response Theory (IRT). One overriding objective was to examine if the IRT item parameters would be consistent across different diagnostic groups, thereby allowing creation of a disorder-generic instrument. The intended outcomes were the final item bank and a short form ready for clinical and research applications. Methods Self-report data were collected from 701 individuals representing four diagnoses: multiple sclerosis, Parkinson’s disease, amyotrophic lateral sclerosis and head and neck cancer. Participants completed the CPIB and additional self-report questionnaires. CPIB data were analyzed using the IRT Graded Response Model (GRM). Results The initial set of 94 candidate CPIB items were reduced to an item bank of 46 items demonstrating unidimensionality, local independence, good item fit, and good measurement precision. Differential item function (DIF) analyses detected no meaningful differences across diagnostic groups. A 10-item, disorder-generic short form was generated. Conclusions The CPIB provides speech-language pathologists with a unidimensional, self-report outcomes measurement instrument dedicated to the construct of communicative participation. This instrument may be useful to clinicians and researchers wanting to implement measures of communicative participation in their work. PMID:23816661

  16. Detecting Local Item Dependence in Polytomous Adaptive Data

    ERIC Educational Resources Information Center

    Mislevy, Jessica L.; Rupp, Andre A.; Harring, Jeffrey R.

    2012-01-01

    A rapidly expanding arena for item response theory (IRT) is in attitudinal and health-outcomes survey applications, often with polytomous items. In particular, there is interest in computer adaptive testing (CAT). Meeting model assumptions is necessary to realize the benefits of IRT in this setting, however. Although initial investigations of…

  17. The Effect of Item Sequence on Bar Examination Scores.

    ERIC Educational Resources Information Center

    Klein, Stephen P.; Bolus, Roger

    A solution to reduce the likelihood of one examinee copying another's answers on large scale tests that require all examinees to answer the same set of questions is to use multiple test forms that differ in terms of item ordering. This study was conducted to determine whether varying the sequence in which blocks of items were presented to…

  18. Tree versus Geometric Representation of Tests and Items.

    ERIC Educational Resources Information Center

    Beller, Michael

    1990-01-01

    Geometric approaches to representing interrelations among tests and items are compared with an additive tree model (ATM), using 2,644 examinees and 2 other data sets. The ATM's close fit to the data and its coherence of presentation indicate that it is the best means of representing tests and items. (TJH)

  19. Comparison of Bookmark Difficulty Locations Under Different Item Response Models

    ERIC Educational Resources Information Center

    Beretvas, S. Natasha

    2004-01-01

    In the bookmark standard-setting procedure, judges place "bookmarks" in a reordered test booklet containing items presented in order of increasing difficulty. Traditionally, the bookmark difficulty location (BDL) is on the trait continuum where, for dichotomous items, there is a two-thirds probability of a correct response and, for a score of "k"…

  20. Technical evaluation process for specifying replacement items

    SciTech Connect

    Craig, W.E. ); Mulford, T.J. )

    1989-01-01

    As nuclear power plants continue in the transition to a strictly operating and maintenance mode, the engineering involvement required to specify and procure replacement parts and components continues to increase. The evaluation process for a replacement item has one simple goal: to specify an effective set of technical and quality requirements that will result in the procurement of an item meeting the plant design basis. The evaluation process is depicted in a simplified block diagram showing the elements of engineering evaluation from a functional perspective. These elements are the following: (1) need for a technical evaluation; (2) components and part classification; (3) failure mode and effect analysis (FMEA); (4) critical characteristics for design determination; (5) like-for-like or alternate item evaluation; and (6) technical and quality requirements determination.

  1. Methodological Issues in Using the Rasch Model To Select Cross Culturally Equivalent Items in Order To Develop a Quality of Life Index: The Analysis of Four WHOQOL-100 Data Sets (Argentina, France, Hong Kong, United Kingdom).

    ERIC Educational Resources Information Center

    Leplege, Alain; Ecosse, Emmanuel

    2000-01-01

    Describes the source instrument, the database, and the experimental methodology that have been used in developing the World Health Organization Quality of Life (WHOQOL) profiles to allow the comparison of quality of life findings from a variety of cultural settings. Illustrates the precautions necessary in interpreting scores from different…

  2. Web Mining

    NASA Astrophysics Data System (ADS)

    Fürnkranz, Johannes

    The World-Wide Web provides every internet citizen with access to an abundance of information, but it becomes increasingly difficult to identify the relevant pieces of information. Research in web mining tries to address this problem by applying techniques from data mining and machine learning to Web data and documents. This chapter provides a brief overview of web mining techniques and research areas, most notably hypertext classification, wrapper induction, recommender systems and web usage mining.

  3. Data Mining.

    ERIC Educational Resources Information Center

    Benoit, Gerald

    2002-01-01

    Discusses data mining (DM) and knowledge discovery in databases (KDD), taking the view that KDD is the larger view of the entire process, with DM emphasizing the cleaning, warehousing, mining, and visualization of knowledge discovery in databases. Highlights include algorithms; users; the Internet; text mining; and information extraction.…

  4. Text Mining.

    ERIC Educational Resources Information Center

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  5. Effects of Ignoring Item Interaction on Item Parameter Estimation and Detection of Interacting Items

    ERIC Educational Resources Information Center

    Chen, Cheng-Te; Wang, Wen-Chung

    2007-01-01

    This study explores the effects of ignoring item interaction on item parameter estimation and the efficiency of using the local dependence index Q[subscript 3] and the SAS NLMIXED procedure to detect item interaction under the three-parameter logistic model and the generalized partial credit model. Through simulations, it was found that ignoring…

  6. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    PubMed

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. PMID:25462882

  7. The DIF-Free-Then-DIF Strategy for the Assessment of Differential Item Functioning

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Shih, Ching-Lin; Sun, Guo-Wei

    2012-01-01

    The DIF-free-then-DIF (DFTD) strategy consists of two steps: (a) select a set of items that are the most likely to be DIF-free and (b) assess the other items for DIF (differential item functioning) using the designated items as anchors. The rank-based method together with the computer software IRTLRDIF can select a set of DIF-free polytomous items…

  8. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  9. 10 CFR 74.55 - Item monitoring.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...

  10. Linking Item Response Model Parameters.

    PubMed

    van der Linden, Wim J; Barrett, Michelle D

    2016-09-01

    With a few exceptions, the problem of linking item response model parameters from different item calibrations has been conceptualized as an instance of the problem of test equating scores on different test forms. This paper argues, however, that the use of item response models does not require any test score equating. Instead, it involves the necessity of parameter linking due to a fundamental problem inherent in the formal nature of these models-their general lack of identifiability. More specifically, item response model parameters need to be linked to adjust for the different effects of the identifiability restrictions used in separate item calibrations. Our main theorems characterize the formal nature of these linking functions for monotone, continuous response models, derive their specific shapes for different parameterizations of the 3PL model, and show how to identify them from the parameter values of the common items or persons in different linking designs. PMID:26155754

  11. Item-properties may influence item-item associations in serial recall.

    PubMed

    Caplan, Jeremy B; Madan, Christopher R; Bedwell, Darren J

    2015-04-01

    Attributes of words, such as frequency and imageability, can influence memory for order. In serial recall, Hulme, Stuart, Brown, and Morin (Journal of Memory and Language, 49(4), 500-518, 2003) found that high-frequency words were recalled worse, and low-frequency words better, when embedded in alternating lists than pure lists. This is predicted by associative chaining, wherein each recalled list-item becomes a recall-cue for the next item. However, Hulme, Stuart, Brown, and Morin (Journal of Memory and Language, 49(4), 500-518, 2003) argued their findings supported positional-coding models, wherein items are linked to a representation of position, with no direct associations between items. They suggested their serial-position effects were due to pre-experimental semantic similarity between pairs of items, which depended on frequency, or a complex tradeoff between item- and order-coding (Morin, Poirier, Fortin, & Hulme, Psychonomic Bulletin Review, 13(4), 724-729, 2006). We replicated the smooth serial-position effects, but accounts based on pre-existing similarity or item-order tradeoffs were untenable. Alternative accounts based, on imageability, phonological and lexical neighbourhood sizes were also ruled out. The standard chaining account predicts that if accuracy is conditionalized on whether the prior item was correct, the word-frequency effect should reappear in alternating lists; however, this prediction was not borne out, challenging this retrieval-based chaining account. We describe a new account, whereby frequency influences the strengths of item-item associations, symmetrically, during study. A manipulation of word-imageability also produced a pattern consistent with item-item cueing at study, but left room for effects of imageability at the final stage of recall. These findings provide further support for the contribution of associative chaining to serial-recall behaviour and show that item-properties may influence serial-recall in multiple ways. PMID

  12. Three controversies over item disclosure in medical licensure examinations

    PubMed Central

    Park, Yoon Soo; Yang, Eunbae B.

    2015-01-01

    In response to views on public's right to know, there is growing attention to item disclosure – release of items, answer keys, and performance data to the public – in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations – 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure – by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers’ right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration. PMID:26374693

  13. Three controversies over item disclosure in medical licensure examinations.

    PubMed

    Park, Yoon Soo; Yang, Eunbae B

    2015-01-01

    In response to views on public's right to know, there is growing attention to item disclosure - release of items, answer keys, and performance data to the public - in medical licensure examinations and their potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including South Korea, with prior discussions among North American countries dating over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations - 1) fairness and validity, 2) impact on passing levels, and 3) utility of item disclosure - by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers' right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences on setting passing levels. To date, there has been limited research on the utility of item disclosure for large scale testing. These issues requires ongoing and careful consideration. PMID:26374693

  14. Selecting Items for Criterion-Referenced Tests.

    ERIC Educational Resources Information Center

    Mellenbergh, Gideon J.; van der Linden, Wim J.

    1982-01-01

    Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)

  15. A Mixed Effects Randomized Item Response Model

    ERIC Educational Resources Information Center

    Fox, J.-P.; Wyrick, Cheryl

    2008-01-01

    The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…

  16. Mine system

    SciTech Connect

    Stoppani, B.R.

    1983-10-04

    A mine system comprises at least one mining machine adapted to haul itself, in a reciprocating manner, along a mineral face, and a control box housing means to control the various electrical elements of the machine(s), the box being located in a mine roadway at one end of the mineral face along which the machine(s) is reciprocating, and the box being electrically connected to a terminal box housed in a body of the machine(s).

  17. Evaluating Item Fit for Multidimensional Item Response Models

    ERIC Educational Resources Information Center

    Zhang, Bo; Stone, Clement A.

    2008-01-01

    This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…

  18. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    ERIC Educational Resources Information Center

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  19. Differential Item Functioning Analysis Using Rasch Item Information Functions

    ERIC Educational Resources Information Center

    Wyse, Adam E.; Mapuranga, Raymond

    2009-01-01

    Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

  20. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... and orders referenced in Subpart B of 29 CFR part 2700; 2. Contests of proposed penalties referenced in Subpart C of 29 CFR part 2700; 3. Complaints for compensation referenced in Subpart D of 29 CFR part 2700; 4. Complaints of discharge, discrimination or interference referenced in Subpart E of 29...

  1. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... and orders referenced in Subpart B of 29 CFR part 2700; 2. Contests of proposed penalties referenced in Subpart C of 29 CFR part 2700; 3. Complaints for compensation referenced in Subpart D of 29 CFR part 2700; 4. Complaints of discharge, discrimination or interference referenced in Subpart E of 29...

  2. 17 CFR 229.104 - (Item 104) Mine safety disclosure.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... and orders referenced in Subpart B of 29 CFR part 2700; 2. Contests of proposed penalties referenced in Subpart C of 29 CFR part 2700; 3. Complaints for compensation referenced in Subpart D of 29 CFR part 2700; 4. Complaints of discharge, discrimination or interference referenced in Subpart E of 29...

  3. Deep mine cooling system

    SciTech Connect

    Conan, J.

    1984-11-06

    A deep mine cooling system comprising a compressor supplied with air and rotatively driven by a motor and an expansion turbine supplied with compressed air from said compressor and driving an actuating unit, wherein the compressed air, after leaving the compressor but prior to reaching the expansion turbine, passes through a steam generator whose output provides the energy required to operate an absorption refrigeration machine used to cool utility water for mining, said compressed air on leaving the steam generator going to a first heat exchanger in which it yields calories to a water circuit comprising a second heat exchanger, said second heat exchanger giving off the calories absorbed by the water in the first heat exchanger to the air fed by the second heat exchanger to a drying cell that is regenerated by said air from the second heat exchanger, said drying cell being part of a set of two cells working in alternation, the other cell in the set receiving the compressed air from the first heat exchanger, such that the compressed air is fed to said expansion turbine after leaving said drying unit, and wherein the air exhausted from said expansion turbine is sent to a third heat exchanger after which it is distributed according to the needs of the mine, said third exchanger being traversed by the water collected in the mine, cooled in said exchanger and circulated upon leaving said exchanger to meet the cool water requirements of the mine.

  4. An Efficiency Balanced Information Criterion for Item Selection in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Han, Kyung T.

    2012-01-01

    Successful administration of computerized adaptive testing (CAT) programs in educational settings requires that test security and item exposure control issues be taken seriously. Developing an item selection algorithm that strikes the right balance between test precision and level of item pool utilization is the key to successful implementation…

  5. Diverse Food Items Are Similarly Categorized by 8- to 13-Year-Old Children

    ERIC Educational Resources Information Center

    Beltran, Alicia; Knight Sepulveda, Karina; Watson, Kathy; Baranowski, Tom; Baranowski, Janice; Islam, Noemi; Missaghian, Mariam

    2008-01-01

    Objective: Assess how 8- to 13-year-old children categorized and labeled food items for possible use as part of a food search strategy in a computerized 24-hour dietary recall. Design: A set of 62 cards with pictures and names of food items from 18 professionally defined food groups was sorted by each child into piles of similar food items.…

  6. Item Difficulty in the Evaluation of Computer-Based Instruction: An Example from Neuroanatomy

    ERIC Educational Resources Information Center

    Chariker, Julia H.; Naaz, Farah; Pani, John R.

    2012-01-01

    This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of…

  7. Assessing Acquiescence in Binary Responses: IRT-Related Item-Factor-Analytic Procedures

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Condon, Lorena

    2006-01-01

    This article proposes procedures for assessing acquiescence in a balanced set of binary personality items. These procedures are based on the bidimensional item-factor analysis model, which is an alternative parameterization of the bidimensional 2-parameter normal-ogive item response theory model. First the rationale and general approach are…

  8. The Asymptotic Distribution of Ability Estimates: Beyond Dichotomous Items and Unidimensional IRT Models

    ERIC Educational Resources Information Center

    Sinharay, Sandip

    2015-01-01

    The maximum likelihood estimate (MLE) of the ability parameter of an item response theory model with known item parameters was proved to be asymptotically normally distributed under a set of regularity conditions for tests involving dichotomous items and a unidimensional ability parameter (Klauer, 1990; Lord, 1983). This article first considers…

  9. Career and Occupational Development Items.

    ERIC Educational Resources Information Center

    Education Commission of the States, Denver, CO. National Assessment of Educational Progress.

    The career and occupational development items contained in this document are part of a kit consisting of four documents which bring together different types of items that measure a number of career and occupational development (COD) objectives developed by the National Assessment of Educational Progress (NAEP). (NAEP--which completed a national…

  10. Generalizability in Item Response Modeling

    ERIC Educational Resources Information Center

    Briggs, Derek C.; Wilson, Mark

    2007-01-01

    An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…

  11. Agriculture Library of Test Items.

    ERIC Educational Resources Information Center

    Sutherland, Duncan, Ed.

    As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items of value from past tests are made available to teachers for the construction of unit tests, term examinations or as a basis for class discussion. Each collection is reviewed for content validity and reliability. The test…

  12. Automatic Association of News Items.

    ERIC Educational Resources Information Center

    Carrick, Christina; Watters, Carolyn

    1997-01-01

    Discussion of electronic news delivery systems and the automatic generation of electronic editions focuses on the association of related items of different media type, specifically photos and stories. The goal is to be able to determine to what degree any two news items refer to the same news event. (Author/LRW)

  13. African mining

    SciTech Connect

    Not Available

    1987-01-01

    This book contains papers presented at a conference addressing the development of the minerals industry in Africa. Topics covered include: A review - past, present and future - of Zimbabwe's mining industry; Geomorphological processes and related mineralization in Tanzania; and Rock mechanics investigations at Mufulira mine, Zambia.

  14. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    PubMed

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models. PMID:25106393

  15. Integrating Entropy and Closed Frequent Pattern Mining for Social Network Modelling and Analysis

    NASA Astrophysics Data System (ADS)

    Adnan, Muhaimenul; Alhajj, Reda; Rokne, Jon

    The recent increase in the explicitly available social networks has attracted the attention of the research community to investigate how it would be possible to benefit from such a powerful model in producing effective solutions for problems in other domains where the social network is implicit; we argue that social networks do exist around us but the key issue is how to realize and analyze them. This chapter presents a novel approach for constructing a social network model by an integrated framework that first preparing the data to be analyzed and then applies entropy and frequent closed patterns mining for network construction. For a given problem, we first prepare the data by identifying items and transactions, which arc the basic ingredients for frequent closed patterns mining. Items arc main objects in the problem and a transaction is a set of items that could exist together at one time (e.g., items purchased in one visit to the supermarket). Transactions could be analyzed to discover frequent closed patterns using any of the well-known techniques. Frequent closed patterns have the advantage that they successfully grab the inherent information content of the dataset and is applicable to a broader set of domains. Entropies of the frequent closed patterns arc used to keep the dimensionality of the feature vectors to a reasonable size; it is a kind of feature reduction process. Finally, we analyze the dynamic behavior of the constructed social network. Experiments were conducted on a synthetic dataset and on the Enron corpus email dataset. The results presented in the chapter show that social networks extracted from a feature set as frequent closed patterns successfully carry the community structure information. Moreover, for the Enron email dataset, we present an analysis to dynamically indicate the deviations from each user's individual and community profile. These indications of deviations can be very useful to identify unusual events.

  16. Bureau takes lead in 1983 R and D. [US Bureau of Mines

    SciTech Connect

    Chironis, N.P.

    1983-02-01

    Items from the US Bureau of Mines research programme are described. Those relating to underground mining include SAMS, a semi-autonomous mining system; Kloswall, a more productive longwall system; automated roof bolting and miner-bolters; new types of haulage and roof support; and airborne dust monitors.

  17. 30 CFR 57.19106 - Shaft sets.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Shaft sets. 57.19106 Section 57.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND....19106 Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  18. 30 CFR 56.19106 - Shaft sets.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Shaft sets. 56.19106 Section 56.19106 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND... Shaft sets. Shaft sets shall be kept in good repair and clean of hazardous material....

  19. Real-time intelligent decision making with data mining

    NASA Astrophysics Data System (ADS)

    Gupta, Deepak P.; Gopalakrishnan, Bhaskaran

    2004-03-01

    Database mining, widely known as knowledge discovery and data mining (KDD), has attracted lot of attention in recent years. With the rapid growth of databases in commercial, industrial, administrative and other applications, it is necessary and interesting to extract knowledge automatically from huge amount of data. Almost all the organizations are generating data and information at an unprecedented rate and they need to get some useful information from this data. Data mining is the extraction of non-trivial, previously unknown and potentially useful patterns, trends, dependence and correlation known as association rules among data values in large databases. In last ten to fifteen years, data mining spread out from one company to the other to help them understand more about customers' aspect of quality and response and also distinguish the customers they want from those they do not. A credit-card company found that customers who complete their applications in pencil rather than pen are more likely to default. There is a program that identifies callers by purchase history. The bigger the spender, the quicker the call will be answered. If you feel your call is being answered in the order in which it was received, think again. Many algorithms assume that data is static in nature and mine the rules and relations in that data. But for a dynamic database e.g. in most of the manufacturing industries, the rules and relations thus developed among the variables/items no longer hold true. A simple approach may be to mine the associations among the variables after every fixed period of time. But again, how much the length of this period should be, is a question to be answered. The next problem with the static data mining is that some of the relationships that might be of interest from one period to the other may be lost after a new set of data is used. To reflect the effect of new data set and current status of the association rules where some of the strong rules might become

  20. Determining the Significance of Item Order in Randomized Problem Sets

    ERIC Educational Resources Information Center

    Pardos, Zachary A.; Heffernan, Neil T.

    2009-01-01

    Researchers who make tutoring systems would like to know which sequences of educational content lead to the most effective learning by their students. The majority of data collected in many ITS systems consist of answers to a group of questions of a given skill often presented in a random sequence. Following work that identifies which items…

  1. Item Disordinality with the Bookmark Standard Setting Procedure.

    ERIC Educational Resources Information Center

    Skaggs, Gary; Tessema, Aster

    This paper presents an application of the bookmark procedure to a test comprised of increasing text difficulty levels. The Test of English Proficiency for Adults (TEPA) was used for this study. Three forms of the TEPA were field tested in 1999 with approximately 1,000 non-native English speaking students enrolled in English-as-a-Second-Language…

  2. The Effects of Various Item Selection Methods on the Classification Accuracy and Classification Consistency of Criterion-Referenced Instruments.

    ERIC Educational Resources Information Center

    Smith, Douglas U.

    This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…

  3. Computerized Adaptive Testing with Item Cloning.

    ERIC Educational Resources Information Center

    Glas, Cees A. W.; van der Linden, Wim J.

    2003-01-01

    Developed a multilevel item response (IRT) model that allows for differences between the distributions of item parameters of families of item clones. Results from simulation studies based on an item pool from the Law School Admission Test illustrate the accuracy of the item pool calibration and adaptive testing procedures based on the model. (SLD)

  4. 7 CFR 3201.5 - Item designation.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 7 Agriculture 15 2012-01-01 2012-01-01 false Item designation. 3201.5 Section 3201.5 Agriculture....5 Item designation. (a) Procedure. Designated items are listed in subpart B. In designating items, USDA will designate items composed of generic groupings of specific products and will identify...

  5. 7 CFR 3201.5 - Item designation.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 7 Agriculture 15 2014-01-01 2014-01-01 false Item designation. 3201.5 Section 3201.5 Agriculture....5 Item designation. (a) Procedure. Designated items are listed in subpart B. In designating items, USDA will designate items composed of generic groupings of specific products and will identify...

  6. 7 CFR 3201.5 - Item designation.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 7 Agriculture 15 2013-01-01 2013-01-01 false Item designation. 3201.5 Section 3201.5 Agriculture....5 Item designation. (a) Procedure. Designated items are listed in subpart B. In designating items, USDA will designate items composed of generic groupings of specific products and will identify...

  7. Item Banking. ERIC/AE Digest.

    ERIC Educational Resources Information Center

    Rudner, Lawrence

    This digest discusses the advantages and disadvantages of using item banks, and it provides useful information for those who are considering implementing an item banking project in their school districts. The primary advantage of item banking is in test development. Using an item response theory method, such as the Rasch model, items from multiple…

  8. Multilevel Modeling of Item Position Effects

    ERIC Educational Resources Information Center

    Albano, Anthony D.

    2013-01-01

    In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

  9. How Item Banks and Their Application Can Influence Measurement Practice in Rehabilitation Medicine: A PROMIS Fatigue Item Bank Example

    PubMed Central

    Lai, Jin-Shei; Cella, David; Choi, Seung; Junghaenel, Doerte U.; Christodoulou, Christopher; Gershon, Richard; Stone, Arthur

    2013-01-01

    Objective To illustrate how measurement practices can be advanced using as an example the fatigue item bank (FIB) and its applications (short-forms and computerized adaptive test) that were developed via the NIH Patient Reported Outcomes Measurement Information System (PROMIS) Cooperative Group. Design Psychometric analysis of data collected by an internet survey company using Item Response Theory (IRT) related techniques. Setting A United States general population representative sample collected via internet. Participants 803 respondents used for dimensionality evaluation of the PROMIS FIB and 14,931 respondents used for item calibrations Interventions Not applicable. Main Outcome Measures 112 fatigue items developed by the PROMIS fatigue domain working group, 13-item Functional Assessment of Chronic Illness Therapy-Fatigue, and 4-item SF-36 Vitality scale. Results The PROMIS FIB version 1 which consists of 95 items demonstrated acceptable psychometric properties. Computerized Adaptive Testing (CAT) showed consistently better precision than short-forms. However, all three short-forms showed good precision for the majority of participants, in that more than 95% of sample could be precisely measured with a reliability greater than 0.9. Conclusions Measurement practice can be advanced by using a psychometrically sound measurement tool and its applications. This example shows that CAT and short-forms derived from the PROMIS FIB can reliably estimate fatigue reported by the US general population. Evaluation in clinical populations is warranted before the item bank can be used for clinical trials. PMID:21958919

  10. Mining apparatus

    SciTech Connect

    Ingle, J.E.; Lane, A.J.; Mcgee, D.A.

    1981-03-10

    An improved mining apparatus for excavating material, such as coal, for example, from an earth formation, such as a coal seam, for example, wherein a miner, having a forward and a rearward cutter, is guided through the coal seam and excavates a borehole therein, the borehole being filled with a working fluid during the operation of the miner, the working fluid facilitating the operation of the miner and providing a vehicle for removing the mined material. Substantially all of the operations of the miner are controlled from the earth's surface thereby eliminating the necessity and accompanying hazards and costs involved in utilizing personnel underground during the mining operations.

  11. Penn State researches acid mine drainage

    SciTech Connect

    Not Available

    1984-08-27

    A brief news item reports that work is being carried out at Penn State University on the effectiveness of sphagnum moss and other marsh-type plants in removing iron from acid mine water. A sphagnum moss bog has been established in a greenhouse at the University and field work is also being undertaken in a natural environment in Clearfield County.

  12. The Identification of Neurologically Relevant Items in the MMPI-2.

    ERIC Educational Resources Information Center

    Van Balen, H. G. G.; Van Limbeek, J.; De Mey, H. R. A.

    1997-01-01

    Forty neuropsychologists, neurologists, psychiatrists, and physiatrists identified neurologically relevant items (NRIs) in the Minnesota Multiphasic Personality Inventory-2 (MMPI-2). Raters identified four sets of NRIs: one for brain damage in general and three partially overlapping sets for stroke, traumatic brain damage, and whiplash.…

  13. Optimizing data collection for public health decisions: a data mining approach

    PubMed Central

    2014-01-01

    Background Collecting data can be cumbersome and expensive. Lack of relevant, accurate and timely data for research to inform policy may negatively impact public health. The aim of this study was to test if the careful removal of items from two community nutrition surveys guided by a data mining technique called feature selection, can (a) identify a reduced dataset, while (b) not damaging the signal inside that data. Methods The Nutrition Environment Measures Surveys for stores (NEMS-S) and restaurants (NEMS-R) were completed on 885 retail food outlets in two counties in West Virginia between May and November of 2011. A reduced dataset was identified for each outlet type using feature selection. Coefficients from linear regression modeling were used to weight items in the reduced datasets. Weighted item values were summed with the error term to compute reduced item survey scores. Scores produced by the full survey were compared to the reduced item scores using a Wilcoxon rank-sum test. Results Feature selection identified 9 store and 16 restaurant survey items as significant predictors of the score produced from the full survey. The linear regression models built from the reduced feature sets had R2 values of 92% and 94% for restaurant and grocery store data, respectively. Conclusions While there are many potentially important variables in any domain, the most useful set may only be a small subset. The use of feature selection in the initial phase of data collection to identify the most influential variables may be a useful tool to greatly reduce the amount of data needed thereby reducing cost. PMID:24919484

  14. Mining wavelet transformed boiler data sets

    NASA Astrophysics Data System (ADS)

    Letsche, Terry Lee

    Accurate combustion models provide information that allows increased boiler efficiency optimization, saving money and resources while reducing waste. Boiler combustion processes are noted for being complex, nonstationary and nonlinear. While numerous methods have been used to model boiler processes, data driven approaches reflect actual operating conditions within a particular boiler and do not depend on idealized, complex, or expensive empirical models. Boiler and combustion processes vary in time, requiring a denoising technique that preserves the temporal and frequency nature of the data. Moving average, a common technique, smoothes data---low frequency noise is not removed. This dissertation examines models built with wavelet denoising techniques that remove low and high frequency noise in both time and frequency domains. The denoising process has a number of parameters, including choice of wavelet, threshold value, level of wavelet decomposition, and disposition of attributes that appear to be significant at multiple thresholds. A process is developed to experimentally evaluate the predictive accuracy of these models and compares this result against two benchmarks. The first research hypothesis compares the performance of these wavelet denoised models to the model generated from the original data. The second research hypothesis compares the performance of the models generated with this denoising approach to the most effective model generated from a moving average process. In both experiments it was determined that the Daubechies 4 wavelet was a better choice than the more typically chosen Haar wavelet, wavelet packet decomposition outperforms other levels of wavelet decomposition, and discarding all but the lowest threshold repeating attributes produces superior results. The third research hypothesis examined using a two-dimensional wavelet transform on the data. Another parameter for handling the boundary condition was introduced. In the two-dimensional case, the CDF(4,6) wavelet and zero symmetric padded boundary produced the best results. The two-dimensional transform produces models six percent more accurate than the one-dimensional transform, nearly eight percent more accurate than non-transformed data, and four percent more accurate than moving average transformed model.

  15. Metacognition influences item-method directed forgetting.

    PubMed

    Foster, Nathaniel L; Sahakyan, Lili

    2012-09-01

    In 4 experiments, we examined whether metacognitive beliefs about item memorability influence item-method directed forgetting. In Experiment 1, participants studied loud and quiet items, which were subsequently cued as to-be-remembered (TBR) or to-be-forgotten (TBF). Typically, the volume of stimuli does not influence recall, although loud items are judged as more memorable than quiet items (Rhodes & Castel, 2009). In contrast, we found a recall advantage for loud items in directed forgetting, although this was observed for TBR items but not TBF items. The loud item advantage disappeared in Experiment 2, when we eliminated all TBF trials and instead inserted additional trials during which participants could engage in extra rehearsal of earlier presented items. In Experiments 3 and 4, a recall advantage for loud items was observed again when items were assigned a mixture of positive and negative values, but it did not emerge when items were assigned graded positive values. Overall, the results showed that the recall advantage for loud items emerges only in response to the need to forget some items. We propose 2 mechanisms to account for these results-either participants select to rehearse loud items as a controlled strategy that allows them to forget some items, or they have an unconscious preference for loud items that emerges only in response to the need to forget. PMID:22468801

  16. Coastal mining

    NASA Astrophysics Data System (ADS)

    Bell, Peter M.

    The Exclusive Economic Zone (EEZ) declared by President Reagan in March 1983 has met with a mixed response from those who would benefit from a guaranteed, 200-nautical-mile (370-km) protected underwater mining zone off the coasts of the United States and its possessions. On the one hand, the U.S. Department of the Interior is looking ahead and has been very successful in safeguarding important natural resources that will be needed in the coming decades. On the other hand, the mining industry is faced with a depressed metals and mining market.A report of the Exclusive Economic Zone Symposium held in November 1983 by the U.S. Geological Survey, the Mineral Management Service, and the Bureau of Mines described the mixed response as: “ … The Department of Interior … raring to go into promotion of deep-seal mining but industrial consortia being very pessimistic about the program, at least for the next 30 or so years.” (Chemical & Engineering News, February 5, 1983).

  17. Social Interest for User Selecting Items in Recommender Systems

    NASA Astrophysics Data System (ADS)

    Nie, Da-Cheng; Ding, Ming-Jing; Fu, Yan; Zhou, Jun-Lin; Zhang, Zi-Ke

    2013-04-01

    Recommender systems have developed rapidly and successfully. The system aims to help users find relevant items from a potentially overwhelming set of choices. However, most of the existing recommender algorithms focused on the traditional user-item similarity computation, other than incorporating the social interests into the recommender systems. As we know, each user has their own preference field, they may influence their friends' preference in their expert field when considering the social interest on their friends' item collecting. In order to model this social interest, in this paper, we proposed a simple method to compute users' social interest on the specific items in the recommender systems, and then integrate this social interest with similarity preference. The experimental results on two real-world datasets Epinions and Friendfeed show that this method can significantly improve not only the algorithmic precision-accuracy but also the diversity-accuracy.

  18. Multisource causal data mining

    NASA Astrophysics Data System (ADS)

    Woodley, Robert; Gosnell, Michael; Shallenberger, Kevin

    2012-06-01

    Analysts are faced with mountains of data, and finding that relevant piece of information is the proverbial needle in a haystack, only with dozens of haystacks. Analysis tools that facilitate identifying causal relationships across multiple data sets are sorely needed. 21st Century Systems, Inc. (21CSi) has initiated research called Causal-View, a causal datamining visualization tool, to address this challenge. Causal-View is built on an agent-enabled framework. Much of the processing that Causal-View will do is in the background. When a user requests information, Data Extraction Agents launch to gather information. This initial search is a raw, Monte Carlo type search designed to gather everything available that may have relevance to an individual, location, associations, and more. This data is then processed by Data- Mining Agents. The Data-Mining Agents are driven by user supplied feature parameters. If the analyst is looking to see if the individual frequents a known haven for insurgents he may request information on his last known locations. Or, if the analyst is trying to see if there is a pattern in the individual's contacts, the mining agent can be instructed with the type and relevance of the information fields to look at. The same data is extracted from the database, but the Data Mining Agents customize the feature set to determine causal relationships the user is interested in. At this point, a Hypothesis Generation and Data Reasoning Agents take over to form conditional hypotheses about the data and pare the data, respectively. The newly formed information is then published to the agent communication backbone of Causal- View to be displayed. Causal-View provides causal analysis tools to fill the gaps in the causal chain. We present here the Causal-View concept, the initial research into data mining tools that assist in forming the causal relationships, and our initial findings.

  19. Text Mining for Neuroscience

    NASA Astrophysics Data System (ADS)

    Tirupattur, Naveen; Lapish, Christopher C.; Mukhopadhyay, Snehasis

    2011-06-01

    Text mining, sometimes alternately referred to as text analytics, refers to the process of extracting high-quality knowledge from the analysis of textual data. Text mining has wide variety of applications in areas such as biomedical science, news analysis, and homeland security. In this paper, we describe an approach and some relatively small-scale experiments which apply text mining to neuroscience research literature to find novel associations among a diverse set of entities. Neuroscience is a discipline which encompasses an exceptionally wide range of experimental approaches and rapidly growing interest. This combination results in an overwhelmingly large and often diffuse literature which makes a comprehensive synthesis difficult. Understanding the relations or associations among the entities appearing in the literature not only improves the researchers current understanding of recent advances in their field, but also provides an important computational tool to formulate novel hypotheses and thereby assist in scientific discoveries. We describe a methodology to automatically mine the literature and form novel associations through direct analysis of published texts. The method first retrieves a set of documents from databases such as PubMed using a set of relevant domain terms. In the current study these terms yielded a set of documents ranging from 160,909 to 367,214 documents. Each document is then represented in a numerical vector form from which an Association Graph is computed which represents relationships between all pairs of domain terms, based on co-occurrence. Association graphs can then be subjected to various graph theoretic algorithms such as transitive closure and cycle (circuit) detection to derive additional information, and can also be visually presented to a human researcher for understanding. In this paper, we present three relatively small-scale problem-specific case studies to demonstrate that such an approach is very successful in

  20. Australian Item Bank Program: Science Item Bank. Book 3: Biology.

    ERIC Educational Resources Information Center

    Australian Council for Educational Research, Hawthorn.

    The Australian Science Item Bank consists of three volumes of multiple-choice questions. Book 3 contains questions on the biological sciences. The questions are designed to be suitable for high school students (year 8 to year 12 in Australian schools). The questions are classified by the subject content of the question, the cognitive skills…

  1. Evaluating the Magnitude of Differential Item Functioning in Polytomous Items.

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Thayer, Dorothy T.

    1996-01-01

    Two possible standard error formulas for the polytomous differential item functioning index proposed by N. J. Dorans and A. P. Schmitt (1991) were derived. These standard errors, and associated hypothesis-testing procedures, were evaluated through simulated data. The standard error that performed better is based on N. Mantel's (1963)…

  2. Asteroid mining

    NASA Technical Reports Server (NTRS)

    Gertsch, Richard E.

    1992-01-01

    The earliest studies of asteroid mining proposed retrieving a main belt asteroid. Because of the very long travel times to the main asteroid belt, attention has shifted to the asteroids whose orbits bring them fairly close to the Earth. In these schemes, the asteroids would be bagged and then processed during the return trip, with the asteroid itself providing the reaction mass to propel the mission homeward. A mission to one of these near-Earth asteroids would be shorter, involve less weight, and require a somewhat lower change in velocity. Since these asteroids apparently contain a wide range of potentially useful materials, our study group considered only them. The topics covered include asteroid materials and properties, asteroid mission selection, manned versus automated missions, mining in zero gravity, and a conceptual mining method.

  3. Mining equipment

    SciTech Connect

    Goff, J.R.; Spence, A.M.

    1981-06-23

    In a system for the supply of fluid under pressure to machinery of an underground mine working, lengths of fixed conduit are secured to parts of the conveyor assembly, prior to the assembly of said parts at the underground mine working. When the conveyor assembly has been assembled, the length of fixed conduits are interconnected, either by straight lengths of flexible conduit, or by branched lengths of flexible conduit, where take off for fluid under pressure is required for the machinery, for example a roof support unit.

  4. 18 CFR 367.8 - Extraordinary items.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 18 Conservation of Power and Water Resources 1 2011-04-01 2011-04-01 false Extraordinary items... Instructions § 367.8 Extraordinary items. Extraordinary items are to be recognized according to the rules which are considered generally accepted accounting principles. These items are related to the effects...

  5. 7 CFR 1767.24 - Extraordinary items.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 7 Agriculture 12 2011-01-01 2011-01-01 false Extraordinary items. 1767.24 Section 1767.24....24 Extraordinary items. The extraordinary items accounts identified in this section shall be used by all RUS borrowers. Extraordinary Items 434Extraordinary Income 435Extraordinary Deductions...

  6. 18 CFR 367.8 - Extraordinary items.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 18 Conservation of Power and Water Resources 1 2014-04-01 2014-04-01 false Extraordinary items... Instructions § 367.8 Extraordinary items. Extraordinary items are to be recognized according to the rules which are considered generally accepted accounting principles. These items are related to the effects...

  7. 18 CFR 367.8 - Extraordinary items.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 18 Conservation of Power and Water Resources 1 2012-04-01 2012-04-01 false Extraordinary items... Instructions § 367.8 Extraordinary items. Extraordinary items are to be recognized according to the rules which are considered generally accepted accounting principles. These items are related to the effects...

  8. 7 CFR 2902.5 - Item designation.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 7 Agriculture 15 2011-01-01 2011-01-01 false Item designation. 2902.5 Section 2902.5 Agriculture... AGRICULTURE GUIDELINES FOR DESIGNATING BIOBASED PRODUCTS FOR FEDERAL PROCUREMENT General § 2902.5 Item designation. (a) Procedure. Designated items are listed in subpart B. In designating items, USDA...

  9. 12 CFR 210.13 - Unpaid items.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 12 Banks and Banking 2 2010-01-01 2010-01-01 false Unpaid items. 210.13 Section 210.13 Banks and... OTHER ITEMS BY FEDERAL RESERVE BANKS AND FUNDS TRANSFERS THROUGH FEDWIRE (REGULATION J) Collection of Checks and Other Items By Federal Reserve Banks § 210.13 Unpaid items. (a) Right of recovery. If...

  10. 7 CFR 1767.24 - Extraordinary items.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 7 Agriculture 12 2013-01-01 2013-01-01 false Extraordinary items. 1767.24 Section 1767.24....24 Extraordinary items. The extraordinary items accounts identified in this section shall be used by all RUS borrowers. Extraordinary Items 434Extraordinary Income 435Extraordinary Deductions...

  11. 18 CFR 367.8 - Extraordinary items.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... 18 Conservation of Power and Water Resources 1 2010-04-01 2010-04-01 false Extraordinary items... Instructions § 367.8 Extraordinary items. Extraordinary items are to be recognized according to the rules which are considered generally accepted accounting principles. These items are related to the effects...

  12. 12 CFR 210.13 - Unpaid items.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 12 Banks and Banking 2 2012-01-01 2012-01-01 false Unpaid items. 210.13 Section 210.13 Banks and... OTHER ITEMS BY FEDERAL RESERVE BANKS AND FUNDS TRANSFERS THROUGH FEDWIRE (REGULATION J) Collection of Checks and Other Items By Federal Reserve Banks § 210.13 Unpaid items. (a) Right of recovery. If...

  13. 7 CFR 1767.24 - Extraordinary items.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 7 Agriculture 12 2014-01-01 2013-01-01 true Extraordinary items. 1767.24 Section 1767.24....24 Extraordinary items. The extraordinary items accounts identified in this section shall be used by all RUS borrowers. Extraordinary Items 434Extraordinary Income 435Extraordinary Deductions...

  14. 42 CFR 414.224 - Customized items.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 42 Public Health 3 2012-10-01 2012-10-01 false Customized items. 414.224 Section 414.224 Public... Medical Equipment and Prosthetic and Orthotic Devices § 414.224 Customized items. (a) Criteria for a customized item. To be considered a customized item for payment purposes under paragraph (b) of this...

  15. 7 CFR 1767.24 - Extraordinary items.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 7 Agriculture 12 2012-01-01 2012-01-01 false Extraordinary items. 1767.24 Section 1767.24....24 Extraordinary items. The extraordinary items accounts identified in this section shall be used by all RUS borrowers. Extraordinary Items 434Extraordinary Income 435Extraordinary Deductions...

  16. 42 CFR 414.224 - Customized items.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 42 Public Health 3 2014-10-01 2014-10-01 false Customized items. 414.224 Section 414.224 Public... Medical Equipment, Prosthetic and Orthotic Devices, and Surgical Dressings § 414.224 Customized items. (a) Criteria for a customized item. To be considered a customized item for payment purposes under paragraph...

  17. 18 CFR 367.8 - Extraordinary items.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 18 Conservation of Power and Water Resources 1 2013-04-01 2013-04-01 false Extraordinary items... Instructions § 367.8 Extraordinary items. Extraordinary items are to be recognized according to the rules which are considered generally accepted accounting principles. These items are related to the effects...

  18. 42 CFR 414.224 - Customized items.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 42 Public Health 3 2011-10-01 2011-10-01 false Customized items. 414.224 Section 414.224 Public... and Prosthetic and Orthotic Devices § 414.224 Customized items. (a) Criteria for a customized item. To be considered a customized item for payment purposes under paragraph (b) of this section, a...

  19. 12 CFR 210.13 - Unpaid items.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 12 Banks and Banking 2 2013-01-01 2013-01-01 false Unpaid items. 210.13 Section 210.13 Banks and... OTHER ITEMS BY FEDERAL RESERVE BANKS AND FUNDS TRANSFERS THROUGH FEDWIRE (REGULATION J) Collection of Checks and Other Items By Federal Reserve Banks § 210.13 Unpaid items. (a) Right of recovery. If...

  20. 12 CFR 210.13 - Unpaid items.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 12 Banks and Banking 2 2011-01-01 2011-01-01 false Unpaid items. 210.13 Section 210.13 Banks and... OTHER ITEMS BY FEDERAL RESERVE BANKS AND FUNDS TRANSFERS THROUGH FEDWIRE (REGULATION J) Collection of Checks and Other Items By Federal Reserve Banks § 210.13 Unpaid items. (a) Right of recovery. If...

  1. 42 CFR 414.224 - Customized items.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 42 Public Health 3 2013-10-01 2013-10-01 false Customized items. 414.224 Section 414.224 Public... Medical Equipment and Prosthetic and Orthotic Devices § 414.224 Customized items. (a) Criteria for a customized item. To be considered a customized item for payment purposes under paragraph (b) of this...

  2. 12 CFR 210.13 - Unpaid items.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 12 Banks and Banking 2 2014-01-01 2014-01-01 false Unpaid items. 210.13 Section 210.13 Banks and... OTHER ITEMS BY FEDERAL RESERVE BANKS AND FUNDS TRANSFERS THROUGH FEDWIRE (REGULATION J) Collection of Checks and Other Items By Federal Reserve Banks § 210.13 Unpaid items. (a) Right of recovery. If...

  3. Adaptable Learning Assistant for Item Bank Management

    ERIC Educational Resources Information Center

    Nuntiyagul, Atorn; Naruedomkul, Kanlaya; Cercone, Nick; Wongsawang, Damras

    2008-01-01

    We present PKIP, an adaptable learning assistant tool for managing question items in item banks. PKIP is not only able to automatically assist educational users to categorize the question items into predefined categories by their contents but also to correctly retrieve the items by specifying the category and/or the difficulty level. PKIP adapts…

  4. Unidimensional Interpretations for Multidimensional Test Items

    ERIC Educational Resources Information Center

    Kahraman, Nilufer

    2013-01-01

    This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…

  5. Kernel-Smoothing Estimation of Item Characteristic Functions for Continuous Personality Items: An Empirical Comparison with the Linear and the Continuous-Response Models

    ERIC Educational Resources Information Center

    Ferrando, Pere J.

    2004-01-01

    This study used kernel-smoothing procedures to estimate the item characteristic functions (ICFs) of a set of continuous personality items. The nonparametric ICFs were compared with the ICFs estimated (a) by the linear model and (b) by Samejima's continuous-response model. The study was based on a conditioned approach and used an error-in-variables…

  6. Converging evidence for control of color-word Stroop interference at the item level.

    PubMed

    Bugg, Julie M; Hutchison, Keith A

    2013-04-01

    Prior studies have shown that cognitive control is implemented at the list and context levels in the color-word Stroop task. At first blush, the finding that Stroop interference is reduced for mostly incongruent items as compared with mostly congruent items (i.e., the item-specific proportion congruence [ISPC] effect) appears to provide evidence for yet a third level of control, which modulates word reading at the item level. However, evidence to date favors the view that ISPC effects reflect the rapid prediction of high-contingency responses and not item-specific control. In Experiment 1, we first show that an ISPC effect is obtained when the relevant dimension (i.e., color) signals proportion congruency, a problematic pattern for theories based on differential response contingencies. In Experiment 2, we replicate and extend this pattern by showing that item-specific control settings transfer to new stimuli, ruling out alternative frequency-based accounts. In Experiment 3, we revert to the traditional design in which the irrelevant dimension (i.e., word) signals proportion congruency. Evidence for item-specific control, including transfer of the ISPC effect to new stimuli, is apparent when 4-item sets are employed but not when 2-item sets are employed. We attribute this pattern to the absence of high-contingency responses on incongruent trials in the 4-item set. These novel findings provide converging evidence for reactive control of color-word Stroop interference at the item level, reveal theoretically important factors that modulate reliance on item-specific control versus contingency learning, and suggest an update to the item-specific control account (Bugg, Jacoby, & Chanani, 2011). PMID:22845037

  7. Randomized Item Response Theory Models

    ERIC Educational Resources Information Center

    Fox, Jean-Paul

    2005-01-01

    The randomized response (RR) technique is often used to obtain answers on sensitive questions. A new method is developed to measure latent variables using the RR technique because direct questioning leads to biased results. Within the RR technique is the probability of the true response modeled by an item response theory (IRT) model. The RR…

  8. Paradoxical Results and Item Bundles

    ERIC Educational Resources Information Center

    Hooker, Giles; Finkelman, Matthew

    2010-01-01

    Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…

  9. 15 CFR 752.3 - Eligible items.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 15 Commerce and Foreign Trade 2 2010-01-01 2010-01-01 false Eligible items. 752.3 Section 752.3... LICENSE § 752.3 Eligible items. (a) All items subject to the EAR, including items eligible for License... 2E301 on the CCL controlled for CB reasons; (3) Items controlled by ECCNs 1C350, 1C995, 1D390,...

  10. 15 CFR 752.3 - Eligible items.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 15 Commerce and Foreign Trade 2 2011-01-01 2011-01-01 false Eligible items. 752.3 Section 752.3... LICENSE § 752.3 Eligible items. (a) All items subject to the EAR, including items eligible for License... 2E301 on the CCL controlled for CB reasons; (3) Items controlled by ECCNs 1C350, 1C995, 1D390,...

  11. Implementations of the Branch-and-Bound Method for Test Construction Problems. Project Psychometric Aspects of Item Banking No. 46. Research Report 89-6.

    ERIC Educational Resources Information Center

    Adema, Jos J.

    Item banks, large sets of test items, can be used for the construction of achievement tests. Mathematical programming models have been proposed for the selection of items from an item bank for a test. These models make automated test construction possible. However, to find an optimal or even an approximate optimal solution to a test construction…

  12. The effects of item familiarity on the neural correlates of successful associative memory encoding.

    PubMed

    Dennis, Nancy A; Turney, Indira C; Webb, Christina E; Overman, Amy A

    2015-12-01

    Associative memory is considered to be resource-demanding, requiring individuals to learn individual items and the specific relationships between those items. Previous research has shown that prior studying of items aids in associative memory for pairs composed of those same items, as compared to pairs of items that have not been prelearned (e.g., Kilb & Naveh-Benjamin, 2011). In the present study, we sought to elucidate the neural correlates mediating this memory facilitation. After being trained on individual items, participants were scanned while encoding item pairs composed of items from the pretrained phase (familiarized-item pairs) and pairs whose items had not been previously learned (unfamiliarized-item pairs). Consistent with previous findings, the overall subsequent recollection showed the engagement of bilateral parahippocampal gyrus (PHG) and hippocampus, when compared to subsequent forgetting. However, a direct comparison between familiarized- and unfamiliarized-item pairs showed that subsequently recollected familiarized-item pairs were associated with decreased activity across much of the encoding network, including bilateral PHG, hippocampus, prefrontal cortex, and regions associated with item-specific processing within occipital cortex. Increased activity for familiarized-item pairs was found in a more limited set of regions, including bilateral parietal cortex, which has been associated with the formation of novel associations. Additionally, activity in the right parietal cortex correlated with associative memory success in the familiarized condition. Taken together, these results suggest that prior exposure to items can reduce the demands incurred on neural processing throughout the associative encoding network and can enhance associative memory performance by focusing resources within regions supporting the formation of associative links. PMID:25939781

  13. Radioecological impacts of tin mining.

    PubMed

    Aliyu, Abubakar Sadiq; Mousseau, Timothy Alexander; Ramli, Ahmad Termizi; Bununu, Yakubu Aliyu

    2015-12-01

    The tin mining activities in the suburbs of Jos, Plateau State, Nigeria, have resulted in technical enhancement of the natural background radiation as well as higher activity concentrations of primordial radionuclides in the topsoil of mining sites and their environs. Several studies have considered the radiological human health risks of the mining activity; however, to our knowledge no documented study has investigated the radiological impacts on biota. Hence, an attempt is made to assess potential hazards using published data from the literature and the ERICA Tool. This paper considers the effects of mining and milling on terrestrial organisms like shrubs, large mammals, small burrowing mammals, birds (duck), arthropods (earth worm), grasses, and herbs. The dose rates and risk quotients to these organisms are computed using conservative values for activity concentrations of natural radionuclides reported in Bitsichi and Bukuru mining areas. The results suggest that grasses, herbs, lichens, bryophytes and shrubs receive total dose rates that are of potential concern. The effects of dose rates to specific indicator species of interest are highlighted and discussed. We conclude that further investigation and proper regulations should be set in place in order to reduce the risk posed by the tin mining activity on biota. This paper also presents a brief overview of the impact of mineral mining on biota based on documented literature for other countries. PMID:26093469

  14. [Introduction to medical data mining].

    PubMed

    Zhu, Lingyun; Wu, Baoming; Cao, Changxiu

    2003-09-01

    Modern medicine generates a great deal of information stored in the medical database. Extracting useful knowledge and providing scientific decision-making for the diagnosis and treatment of disease from the database increasingly becomes necessary. Data mining in medicine can deal with this problem. It can also improve the management level of hospital information and promote the development of telemedicine and community medicine. Because the medical information is characteristic of redundancy, multi-attribution, incompletion and closely related with time, medical data mining differs from other one. In this paper we have discussed the key techniques of medical data mining involving pretreatment of medical data, fusion of different pattern and resource, fast and robust mining algorithms and reliability of mining results. The methods and applications of medical data mining based on computation intelligence such as artificial neural network, fuzzy system, evolutionary algorithms, rough set, and support vector machine have been introduced. The features and problems in data mining are summarized in the last section. PMID:14565039

  15. Data Mining and Analysis

    NASA Technical Reports Server (NTRS)

    Samms, Kevin O.

    2015-01-01

    The Data Mining project seeks to bring the capability of data visualization to NASA anomaly and problem reporting systems for the purpose of improving data trending, evaluations, and analyses. Currently NASA systems are tailored to meet the specific needs of its organizations. This tailoring has led to a variety of nomenclatures and levels of annotation for procedures, parts, and anomalies making difficult the realization of the common causes for anomalies. Making significant observations and realizing the connection between these causes without a common way to view large data sets is difficult to impossible. In the first phase of the Data Mining project a portal was created to present a common visualization of normalized sensitive data to customers with the appropriate security access. The tool of the visualization itself was also developed and fine-tuned. In the second phase of the project we took on the difficult task of searching and analyzing the target data set for common causes between anomalies. In the final part of the second phase we have learned more about how much of the analysis work will be the job of the Data Mining team, how to perform that work, and how that work may be used by different customers in different ways. In this paper I detail how our perspective has changed after gaining more insight into how the customers wish to interact with the output and how that has changed the product.

  16. A survey of data mining techniques

    NASA Astrophysics Data System (ADS)

    Jorgensen, A. M.; Karimabadi, H.

    2005-12-01

    Data mining is the act of extracting useful knowledge from a data set. This knowledge can take many forms. It can be in the detection of features of interest. It can be in the form of statistical analysis of data sets. It can be in the form of predictive expressions describing the data or relationships between data. It can be in the form of anomaly detection, serendipitous events which require further investigation. The simplest form of data mining consists of manual inspection of the data sets. This has been the norm in space physics for decades. The most advanced forms of data mining make use of emerging computer science methods, including advances in artificial intelligence. In this presentation we will give a brief introduction to some of the resources that are available for performing advanced data mining. We will focus on giving an overview of some data mining techniques, and how they have been, or could be, applied to space physics problems.

  17. Design Patterns for Digital Item Types in Higher Education

    ERIC Educational Resources Information Center

    Draaijer, S.; Hartog, R. J. M.

    2007-01-01

    A set of design patterns for digital item types has been developed in response to challenges identified in various projects by teachers in higher education. The goal of the projects in question was to design and develop formative and summative tests, and to develop interactive learning material in the form of quizzes. The subject domains involved…

  18. Item Response Theory: Overview, Applications, and Promise for Institutional Research

    ERIC Educational Resources Information Center

    Bowman, Nicholas A.; Herzog, Serge; Sharkness, Jessica

    2014-01-01

    Item Response Theory (IRT) is a measurement theory that is ideal for scale and test development in institutional research, but it is not without its drawbacks. This chapter provides an overview of IRT, describes an example of its use, and highlights the pros and cons of using IRT in applied settings.

  19. Scores Based on Dangerous Responses to Multiple-Choice Items.

    ERIC Educational Resources Information Center

    Grosse, Martin E.

    1986-01-01

    Scores based on the number of correct answers were compared with scores based on dangerous responses to items in the same multiple choice test developed by American Board of Orthopaedic Surgery. Results showed construct validity for both sets of scores. However, both scores were redundant when evaluated by correlation coefficient. (Author/JAZ)

  20. Development of the PROMIS® Negative Psychosocial Expectancies of Smoking Item Banks

    PubMed Central

    Edelen, Maria Orlando; Tucker, Joan S.; Shadel, William G.; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li

    2014-01-01

    Introduction: Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Methods: Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. Results: A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Conclusions: Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. PMID:25118230

  1. The 4-Item Negative Symptom Assessment (NSA-4) Instrument

    PubMed Central

    Morlock, Robert; Coon, Cheryl; van Willigenburg, Arjen; Panagides, John

    2010-01-01

    Objective. To assess the ability of mental health professionals to use the 4-item Negative Symptom Assessment instrument, derived from the Negative Symptom Assessment-16, to rapidly determine the severity of negative symptoms of schizophrenia. Design. Open participation. Setting. Medical education conferences. Participants. Attendees at two international psychiatry conferences. Measurements. Participants read a brief set of the 4-item Negative Symptom Assessment instructions and viewed a videotape of a patient with schizophrenia. Using the 1 to 6 4-item Negative Symptom Assessment severity rating scale, they rated four negative symptom items and the overall global negative symptoms. These ratings were compared with a consensus rating determination using frequency distributions and Chi-square tests for the proportion of participant ratings that were within one point of the expert rating. Results. More than 400 medical professionals (293 physicians, 50% with a European practice, and 55% who reported past utilization of schizophrenia ratings scales) participated. Between 82.1 and 91.1 percent of the 4-items and the global rating determinations by the participants were within one rating point of the consensus expert ratings. The differences between the percentage of participant rating scores that were within one point versus the percentage that were greater than one point different from those by the consensus experts was significant (p<0.0001). Participants rating of negative symptoms using the 4-item Negative Symptom Assessment did not generally differ among the geographic regions of practice, the professional credentialing, or their familiarity with the use of schizophrenia symptom rating instruments. Conclusion. These findings suggest that clinicians from a variety of geographic practices can, after brief training, use the 4-item Negative Symptom Assessment effectively to rapidly assess negative symptoms in patients with schizophrenia. PMID:20805916

  2. Underground mining methods handbook

    SciTech Connect

    Hustrulid, W.A.

    1982-01-01

    Sections discuss: mine design considerations; stopes requiring minimum support (includes room-and-pillar mining and sublevel stoping); stopes requiring some additional support other than pillars (includes shrinkage stoping, cut-and-fill stoping, undercut-and-fill mining, timber-supported system, top-slice mining, longwall mining and shortwall mining); caving methods (sublevel and block caving); underground equipment; financial considerations; design; and mine ventilation.

  3. FISH HABITATS IMPACTED BY ACIDIC MINE DRAINAGE

    EPA Science Inventory

    This data set represents in-stream fish spawning and hatching areas that have been impacted by elevated acid content waters discharging from areas near mining activities. It is based on an EPA fisheries survey completed in 1995. Acid Mine Drainage, or AMD, occurs when water co...

  4. The Disgust Scale: Item Analysis, Factor Structure, and Suggestions for Refinement

    ERIC Educational Resources Information Center

    Olatunji, Bunmi O.; Williams, Nathan L.; Tolin, David F.; Abramowitz, Jonathan S.; Sawchuk, Craig N.; Lohr, Jeffrey M.; Elwood, Lisa S.

    2007-01-01

    In the 4 studies presented (N = 1,939), a converging set of analyses was conducted to evaluate the item adequacy, factor structure, reliability, and validity of the Disgust Scale (DS; J. Haidt, C. McCauley, & P. Rozin, 1994). The results suggest that 7 items (i.e., Items 2, 7, 8, 21, 23, 24, and 25) should be considered for removal from the DS.…

  5. Data mining

    SciTech Connect

    Lee, K.; Kargupta, H.; Stafford, B.G.; Buescher, K.L.; Ravindran, B.

    1998-12-31

    This is the final report of a one-year, Laboratory Directed Research and Development (LDRD) project at the Los Alamos National Laboratory (LANL). The objective of this project was to develop and implement data mining technology suited to the analysis of large collections of unstructured data. This has taken the form of a software tool, PADMA (Parallel Data Mining Agents), which incorporates parallel data accessing, parallel scalable hierarchical clustering algorithms, and a web-based user interface for submitting Structured Query Language (SQL) queries and interactive data visualization. The authors have demonstrated the viability and scalability of PADMA by applying it to an unstructured text database of 25,000 documents running on an IBM SP2 at Argonne National Laboratory. The utility of PADMA for discovering patterns in data has also been demonstrated by applying it to laboratory test data for Hepatitis C patients and autopsy reports in collaboration with the University of New Mexico School of Medicine.

  6. Acid mine drainage on public and private lands, the Walker Mine experience, Plumas County, California

    SciTech Connect

    Croyle, W.A.; Rosenbaum, S.E.

    1996-11-01

    A widespread environmental problem associated with abandoned mines and their tailings is acid mine drainage (AMD). AMID typically has low pH and elevated metal concentrations that are toxic to aquatic life. In Northern California, Iron Mountain and other mines in the Shasta mining districts are the largest sources of AMD. Additional sources lie to the south along a discontinuous belt of copper and zinc mineralization in the western Sierra foothills. Between these areas lies a remote group of copper mines in northeastern Plumas County including the Walker, Engels and Superior mines. Of this group, AMD from Walker Mine has caused the most severe water quality impairment. This paper describes the history and environmental setting of Walker Mine and the approaches used by the Central Valley Regional Water Quality Control Board, a state regulatory agency, to improve water quality at the site. Both the mine and its tailings contribute pollutants to the watershed. The mine has a portal discharge with depressed pH and high copper concentrations. The tailings add fine grained sediment to the creek and generate low but significant concentrations of dissolved copper. The mine is on private property and the tailings are on land managed by the U. S. Forest Service. Because of these differences in pollution problems and ownership, the methods employed by the Regional Board to improve conditions at the mine and tailings have been on different, but parallel tracks. Monitoring shows these efforts have significantly improved water quality in the watershed over the last 10 years.

  7. Development and community-based validation of eight item banks to assess mental health.

    PubMed

    Batterham, Philip J; Sunderland, Matthew; Carragher, Natacha; Calear, Alison L

    2016-09-30

    There is a need for precise but brief screening of mental health problems in a range of settings. The development of item banks to assess depression and anxiety has resulted in new adaptive and static screeners that accurately assess severity of symptoms. However, expansion to a wider array of mental health problems is required. The current study developed item banks for eight mental health problems: social anxiety disorder, panic disorder, post-traumatic stress disorder, obsessive-compulsive disorder, adult attention-deficit hyperactivity disorder, drug use, psychosis and suicidality. The item banks were calibrated in a population-based Australian adult sample (N=3175) by administering large item pools (45-75 items) and excluding items on the basis of local dependence or measurement non-invariance. Item Response Theory parameters were estimated for each item bank using a two-parameter graded response model. Each bank consisted of 19-47 items, demonstrating excellent fit and precision across a range of -1 to 3 standard deviations from the mean. No previous study has developed such a broad range of mental health item banks. The calibrated item banks will form the basis of a new system of static and adaptive measures to screen for a broad array of mental health problems in the community. PMID:27500552

  8. Toward improved branch prediction through data mining.

    SciTech Connect

    Hemmert, K. Scott; Johnson, D. Eric

    2009-09-01

    Data mining and machine learning techniques can be applied to computer system design to aid in optimizing design decisions, improving system runtime performance. Data mining techniques have been investigated in the context of branch prediction. Specifically, a comparison of traditional branch predictor performance has been made to data mining algorithms. Additionally, the possiblity of whether additional features available within the architectural state might serve to further improve branch prediction has been evaluated. Results show that data mining techniques indicate potential for improved branch prediction, especially when register file contents are included as a feature set.

  9. The Mechanization of Mining.

    ERIC Educational Resources Information Center

    Marovelli, Robert L.; Karhnak, John M.

    1982-01-01

    Mechanization of mining is explained in terms of its effect on the mining of coal, focusing on, among others, types of mining, productivity, machinery, benefits to retired miners, fatality rate in underground coal mines, and output of U.S. mining industry. (Author/JN)

  10. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    ERIC Educational Resources Information Center

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…