2006-10-01
Hierarchy of Pre-Processing Techniques 3. NLP (Natural Language Processing) Utilities 3.1 Named-Entity Recognition 3.1.1 Example for Named-Entity... Recognition 3.2 Symbol RemovalN-Gram Identification: Bi-Grams 4. Stemming 4.1 Stemming Example 5. Delete List 5.1 Open a Delete List 5.1.1 Small...iterative and involves several key processes: • Named-Entity Recognition Named-Entity Recognition is an Automap feature that allows you to
Wei, Qikang; Chen, Tao; Xu, Ruifeng; He, Yulan; Gui, Lin
2016-01-01
The recognition of disease and chemical named entities in scientific articles is a very important subtask in information extraction in the biomedical domain. Due to the diversity and complexity of disease names, the recognition of named entities of diseases is rather tougher than those of chemical names. Although there are some remarkable chemical named entity recognition systems available online such as ChemSpot and tmChem, the publicly available recognition systems of disease named entities are rare. This article presents a system for disease named entity recognition (DNER) and normalization. First, two separate DNER models are developed. One is based on conditional random fields model with a rule-based post-processing module. The other one is based on the bidirectional recurrent neural networks. Then the named entities recognized by each of the DNER model are fed into a support vector machine classifier for combining results. Finally, each recognized disease named entity is normalized to a medical subject heading disease name by using a vector space model based method. Experimental results show that using 1000 PubMed abstracts for training, our proposed system achieves an F1-measure of 0.8428 at the mention level and 0.7804 at the concept level, respectively, on the testing data of the chemical-disease relation task in BioCreative V.Database URL: http://219.223.252.210:8080/SS/cdr.html. © The Author(s) 2016. Published by Oxford University Press.
Medical Named Entity Recognition for Indonesian Language Using Word Representations
NASA Astrophysics Data System (ADS)
Rahman, Arief
2018-03-01
Nowadays, Named Entity Recognition (NER) system is used in medical texts to obtain important medical information, like diseases, symptoms, and drugs. While most NER systems are applied to formal medical texts, informal ones like those from social media (also called semi-formal texts) are starting to get recognition as a gold mine for medical information. We propose a theoretical Named Entity Recognition (NER) model for semi-formal medical texts in our medical knowledge management system by comparing two kinds of word representations: cluster-based word representation and distributed representation.
A Novel Approach towards Medical Entity Recognition in Chinese Clinical Text
Yu, Jian
2017-01-01
Medical entity recognition, a basic task in the language processing of clinical data, has been extensively studied in analyzing admission notes in alphabetic languages such as English. However, much less work has been done on nonstructural texts that are written in Chinese, or in the setting of differentiation of Chinese drug names between traditional Chinese medicine and Western medicine. Here, we propose a novel cascade-type Chinese medication entity recognition approach that aims at integrating the sentence category classifier from a support vector machine and the conditional random field-based medication entity recognition. We hypothesized that this approach could avoid the side effects of abundant negative samples and improve the performance of the named entity recognition from admission notes written in Chinese. Therefore, we applied this approach to a test set of 324 Chinese-written admission notes with manual annotation by medical experts. Our data demonstrated that this approach had a score of 94.2% in precision, 92.8% in recall, and 93.5% in F-measure for the recognition of traditional Chinese medicine drug names and 91.2% in precision, 92.6% in recall, and 91.7% F-measure for the recognition of Western medicine drug names. The differences in F-measure were significant compared with those in the baseline systems. PMID:29065612
Mehta, Sonya; Inoue, Kayo; Rudrauf, David; Damasio, Hanna; Tranel, Daniel; Grabowski, Thomas
2015-01-01
Lesion-deficit studies support the hypothesis that the left anterior temporal lobe (ATL) plays a critical role in retrieving names of concrete entities. They further suggest that different regions of the left ATL process different conceptual categories. Here we test the specificity of these relationships and whether the anatomical segregation is related to the underlying organization of white matter connections. We reanalyzed data from a previous lesion study of naming and recognition across five categories of concrete entities. In voxelwise logistic regressions of lesion-deficit associations, we formally incorporated measures of disconnection of long-range association fiber tracts (FTs) and covaried for recognition and non-category specific naming deficits. We also performed fiber tractwise analyses to assess whether damage to specific FTs was preferentially associated with category-selective naming deficits. Damage to the basolateral ATL was associated with naming deficits for both unique (famous faces) and non-unique entities, whereas the damage to the temporal pole was associated with naming deficits for unique entities only. This segregation pattern remained after accounting for comorbid recognition deficits or naming deficits in other categories. The tractwise analyses showed that damage to the uncinate fasciculus was associated with naming impairments for unique entities, while damage to the inferior longitudinal fasciculus was associated with naming impairments for non-unique entities. Covarying for FT transection in voxelwise analyses rendered the cortical association for unique entities more focal. These results are consistent with the partial segregation of brain system support for name retrieval of unique and non-unique entities at both the level of cortical components and underlying white matter fiber bundles. Our study reconciles theoretic accounts of the functional organization of the left ATL by revealing both category-related processing and semantic hub sectors. PMID:26707082
Mehta, Sonya; Inoue, Kayo; Rudrauf, David; Damasio, Hanna; Tranel, Daniel; Grabowski, Thomas
2016-02-01
Lesion-deficit studies support the hypothesis that the left anterior temporal lobe (ATL) plays a critical role in retrieving names of concrete entities. They further suggest that different regions of the left ATL process different conceptual categories. Here we test the specificity of these relationships and whether the anatomical segregation is related to the underlying organization of white matter connections. We reanalyzed data from a previous lesion study of naming and recognition across five categories of concrete entities. In voxelwise logistic regressions of lesion-deficit associations, we formally incorporated measures of disconnection of long-range association fiber tracts (FTs) and covaried for recognition and non-category-specific naming deficits. We also performed fiber tractwise analyses to assess whether damage to specific FTs was preferentially associated with category-selective naming deficits. Damage to the basolateral ATL was associated with naming deficits for both unique (famous faces) and non-unique entities, whereas the damage to the temporal pole was associated with naming deficits for unique entities only. This segregation pattern remained after accounting for comorbid recognition deficits or naming deficits in other categories. The tractwise analyses showed that damage to the uncinate fasciculus (UNC) was associated with naming impairments for unique entities, while damage to the inferior longitudinal fasciculus (ILF) was associated with naming impairments for non-unique entities. Covarying for FT transection in voxelwise analyses rendered the cortical association for unique entities more focal. These results are consistent with the partial segregation of brain system support for name retrieval of unique and non-unique entities at both the level of cortical components and underlying white matter fiber bundles. Our study reconciles theoretic accounts of the functional organization of the left ATL by revealing both category-related processing and semantic hub sectors. Copyright © 2015 Elsevier Ltd. All rights reserved.
Han, Xu; Kim, Jung-jae; Kwoh, Chee Keong
2016-01-01
Biomedical text mining may target various kinds of valuable information embedded in the literature, but a critical obstacle to the extension of the mining targets is the cost of manual construction of labeled data, which are required for state-of-the-art supervised learning systems. Active learning is to choose the most informative documents for the supervised learning in order to reduce the amount of required manual annotations. Previous works of active learning, however, focused on the tasks of entity recognition and protein-protein interactions, but not on event extraction tasks for multiple event types. They also did not consider the evidence of event participants, which might be a clue for the presence of events in unlabeled documents. Moreover, the confidence scores of events produced by event extraction systems are not reliable for ranking documents in terms of informativity for supervised learning. We here propose a novel committee-based active learning method that supports multi-event extraction tasks and employs a new statistical method for informativity estimation instead of using the confidence scores from event extraction systems. Our method is based on a committee of two systems as follows: We first employ an event extraction system to filter potential false negatives among unlabeled documents, from which the system does not extract any event. We then develop a statistical method to rank the potential false negatives of unlabeled documents 1) by using a language model that measures the probabilities of the expression of multiple events in documents and 2) by using a named entity recognition system that locates the named entities that can be event arguments (e.g. proteins). The proposed method further deals with unknown words in test data by using word similarity measures. We also apply our active learning method for the task of named entity recognition. We evaluate the proposed method against the BioNLP Shared Tasks datasets, and show that our method can achieve better performance than such previous methods as entropy and Gibbs error based methods and a conventional committee-based method. We also show that the incorporation of named entity recognition into the active learning for event extraction and the unknown word handling further improve the active learning method. In addition, the adaptation of the active learning method into named entity recognition tasks also improves the document selection for manual annotation of named entities.
Entity-based Stochastic Analysis of Search Results for Query Expansion and Results Re-Ranking
2015-11-20
pages) and struc- tured data (e.g. Linked Open Data ( LOD ) [8]) coexist in var- ious forms. An important observation is that entity names (like names of...the top-L (e.g. L = 1, 000) results are retrieved. Then, Named Entity Recognition (NER) is applied in these results for identifying LOD entities. In...the next (optional) step, more semantic information about the identified entities is retrieved from the LOD (like properties and related entities). A
Chemical Entity Recognition and Resolution to ChEBI
Grego, Tiago; Pesquita, Catia; Bastos, Hugo P.; Couto, Francisco M.
2012-01-01
Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and protein named entity recognition systems, but with the release of ChEBI and the availability of an annotated corpus, this task can be addressed. We developed a machine-learning-based method for chemical entity recognition and a lexical-similarity-based method for chemical entity resolution and compared them with Whatizit, a popular-dictionary-based method. Our methods outperformed the dictionary-based method in all tasks, yielding an improvement in F-measure of 20% for the entity recognition task, 2–5% for the entity-resolution task, and 15% for combined entity recognition and resolution tasks. PMID:25937941
Chemical named entities recognition: a review on approaches and applications
2014-01-01
The rapid increase in the flow rate of published digital information in all disciplines has resulted in a pressing need for techniques that can simplify the use of this information. The chemistry literature is very rich with information about chemical entities. Extracting molecules and their related properties and activities from the scientific literature to “text mine” these extracted data and determine contextual relationships helps research scientists, particularly those in drug development. One of the most important challenges in chemical text mining is the recognition of chemical entities mentioned in the texts. In this review, the authors briefly introduce the fundamental concepts of chemical literature mining, the textual contents of chemical documents, and the methods of naming chemicals in documents. We sketch out dictionary-based, rule-based and machine learning, as well as hybrid chemical named entity recognition approaches with their applied solutions. We end with an outlook on the pros and cons of these approaches and the types of chemical entities extracted. PMID:24834132
Chemical named entities recognition: a review on approaches and applications.
Eltyeb, Safaa; Salim, Naomie
2014-01-01
The rapid increase in the flow rate of published digital information in all disciplines has resulted in a pressing need for techniques that can simplify the use of this information. The chemistry literature is very rich with information about chemical entities. Extracting molecules and their related properties and activities from the scientific literature to "text mine" these extracted data and determine contextual relationships helps research scientists, particularly those in drug development. One of the most important challenges in chemical text mining is the recognition of chemical entities mentioned in the texts. In this review, the authors briefly introduce the fundamental concepts of chemical literature mining, the textual contents of chemical documents, and the methods of naming chemicals in documents. We sketch out dictionary-based, rule-based and machine learning, as well as hybrid chemical named entity recognition approaches with their applied solutions. We end with an outlook on the pros and cons of these approaches and the types of chemical entities extracted.
Cross domains Arabic named entity recognition system
NASA Astrophysics Data System (ADS)
Al-Ahmari, S. Saad; Abdullatif Al-Johar, B.
2016-07-01
Named Entity Recognition (NER) plays an important role in many Natural Language Processing (NLP) applications such as; Information Extraction (IE), Question Answering (QA), Text Clustering, Text Summarization and Word Sense Disambiguation. This paper presents the development and implementation of domain independent system to recognize three types of Arabic named entities. The system works based on a set of domain independent grammar-rules along with Arabic part of speech tagger in addition to gazetteers and lists of trigger words. The experimental results shown, that the system performed as good as other systems with better results in some cases of cross-domains corpora.
Designing Rules for Accounting Transaction Identification based on Indonesian NLP
NASA Astrophysics Data System (ADS)
Iswandi, I.; Suwardi, I. S.; Maulidevi, N. U.
2017-03-01
Recording accounting transactions carried out by the evidence of the transactions. It can be invoices, receipts, letters of intent, electricity bill, telephone bill, etc. In this paper, we proposed design of rules to identify the entities located on the sales invoice. There are some entities identified in a sales invoice, namely : invoice date, company name, invoice number, product id, product name, quantity and total price. Identification this entities using named entity recognition method. The entities generated from the rules used as a basis for automation process of data input into the accounting system.
A transition-based joint model for disease named entity recognition and normalization.
Lou, Yinxia; Zhang, Yue; Qian, Tao; Li, Fei; Xiong, Shufeng; Ji, Donghong
2017-08-01
Disease named entities play a central role in many areas of biomedical research, and automatic recognition and normalization of such entities have received increasing attention in biomedical research communities. Existing methods typically used pipeline models with two independent phases: (i) a disease named entity recognition (DER) system is used to find the boundaries of mentions in text and (ii) a disease named entity normalization (DEN) system is used to connect the mentions recognized to concepts in a controlled vocabulary. The main problems of such models are: (i) there is error propagation from DER to DEN and (ii) DEN is useful for DER, but pipeline models cannot utilize this. We propose a transition-based model to jointly perform disease named entity recognition and normalization, casting the output construction process into an incremental state transition process, learning sequences of transition actions globally, which correspond to joint structural outputs. Beam search and online structured learning are used, with learning being designed to guide search. Compared with the only existing method for joint DEN and DER, our method allows non-local features to be used, which significantly improves the accuracies. We evaluate our model on two corpora: the BioCreative V Chemical Disease Relation (CDR) corpus and the NCBI disease corpus. Experiments show that our joint framework achieves significantly higher performances compared to competitive pipeline baselines. Our method compares favourably to other state-of-the-art approaches. Data and code are available at https://github.com/louyinxia/jointRN. dhji@whu.edu.cn. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
BANNER: an executable survey of advances in biomedical named entity recognition.
Leaman, Robert; Gonzalez, Graciela
2008-01-01
There has been an increasing amount of research on biomedical named entity recognition, the most basic text extraction problem, resulting in significant progress by different research teams around the world. This has created a need for a freely-available, open source system implementing the advances described in the literature. In this paper we present BANNER, an open-source, executable survey of advances in biomedical named entity recognition, intended to serve as a benchmark for the field. BANNER is implemented in Java as a machine-learning system based on conditional random fields and includes a wide survey of the best techniques recently described in the literature. It is designed to maximize domain independence by not employing brittle semantic features or rule-based processing steps, and achieves significantly better performance than existing baseline systems. It is therefore useful to developers as an extensible NER implementation, to researchers as a standard for comparing innovative techniques, and to biologists requiring the ability to find novel entities in large amounts of text.
Anatomical Entity Recognition with a Hierarchical Framework Augmented by External Resources
Xu, Yan; Hua, Ji; Ni, Zhaoheng; Chen, Qinlang; Fan, Yubo; Ananiadou, Sophia; Chang, Eric I-Chao; Tsujii, Junichi
2014-01-01
References to anatomical entities in medical records consist not only of explicit references to anatomical locations, but also other diverse types of expressions, such as specific diseases, clinical tests, clinical treatments, which constitute implicit references to anatomical entities. In order to identify these implicit anatomical entities, we propose a hierarchical framework, in which two layers of named entity recognizers (NERs) work in a cooperative manner. Each of the NERs is implemented using the Conditional Random Fields (CRF) model, which use a range of external resources to generate features. We constructed a dictionary of anatomical entity expressions by exploiting four existing resources, i.e., UMLS, MeSH, RadLex and BodyPart3D, and supplemented information from two external knowledge bases, i.e., Wikipedia and WordNet, to improve inference of anatomical entities from implicit expressions. Experiments conducted on 300 discharge summaries showed a micro-averaged performance of 0.8509 Precision, 0.7796 Recall and 0.8137 F1 for explicit anatomical entity recognition, and 0.8695 Precision, 0.6893 Recall and 0.7690 F1 for implicit anatomical entity recognition. The use of the hierarchical framework, which combines the recognition of named entities of various types (diseases, clinical tests, treatments) with information embedded in external knowledge bases, resulted in a 5.08% increment in F1. The resources constructed for this research will be made publicly available. PMID:25343498
Unsupervised Biomedical Named Entity Recognition: Experiments with Clinical and Biological Texts
Zhang, Shaodian; Elhadad, Nóemie
2013-01-01
Named entity recognition is a crucial component of biomedical natural language processing, enabling information extraction and ultimately reasoning over and knowledge discovery from text. Much progress has been made in the design of rule-based and supervised tools, but they are often genre and task dependent. As such, adapting them to different genres of text or identifying new types of entities requires major effort in re-annotation or rule development. In this paper, we propose an unsupervised approach to extracting named entities from biomedical text. We describe a stepwise solution to tackle the challenges of entity boundary detection and entity type classification without relying on any handcrafted rules, heuristics, or annotated data. A noun phrase chunker followed by a filter based on inverse document frequency extracts candidate entities from free text. Classification of candidate entities into categories of interest is carried out by leveraging principles from distributional semantics. Experiments show that our system, especially the entity classification step, yields competitive results on two popular biomedical datasets of clinical notes and biological literature, and outperforms a baseline dictionary match approach. Detailed error analysis provides a road map for future work. PMID:23954592
Using Workflows to Explore and Optimise Named Entity Recognition for Chemistry
Kolluru, BalaKrishna; Hawizy, Lezan; Murray-Rust, Peter; Tsujii, Junichi; Ananiadou, Sophia
2011-01-01
Chemistry text mining tools should be interoperable and adaptable regardless of system-level implementation, installation or even programming issues. We aim to abstract the functionality of these tools from the underlying implementation via reconfigurable workflows for automatically identifying chemical names. To achieve this, we refactored an established named entity recogniser (in the chemistry domain), OSCAR and studied the impact of each component on the net performance. We developed two reconfigurable workflows from OSCAR using an interoperable text mining framework, U-Compare. These workflows can be altered using the drag-&-drop mechanism of the graphical user interface of U-Compare. These workflows also provide a platform to study the relationship between text mining components such as tokenisation and named entity recognition (using maximum entropy Markov model (MEMM) and pattern recognition based classifiers). Results indicate that, for chemistry in particular, eliminating noise generated by tokenisation techniques lead to a slightly better performance than others, in terms of named entity recognition (NER) accuracy. Poor tokenisation translates into poorer input to the classifier components which in turn leads to an increase in Type I or Type II errors, thus, lowering the overall performance. On the Sciborg corpus, the workflow based system, which uses a new tokeniser whilst retaining the same MEMM component, increases the F-score from 82.35% to 84.44%. On the PubMed corpus, it recorded an F-score of 84.84% as against 84.23% by OSCAR. PMID:21633495
Using workflows to explore and optimise named entity recognition for chemistry.
Kolluru, Balakrishna; Hawizy, Lezan; Murray-Rust, Peter; Tsujii, Junichi; Ananiadou, Sophia
2011-01-01
Chemistry text mining tools should be interoperable and adaptable regardless of system-level implementation, installation or even programming issues. We aim to abstract the functionality of these tools from the underlying implementation via reconfigurable workflows for automatically identifying chemical names. To achieve this, we refactored an established named entity recogniser (in the chemistry domain), OSCAR and studied the impact of each component on the net performance. We developed two reconfigurable workflows from OSCAR using an interoperable text mining framework, U-Compare. These workflows can be altered using the drag-&-drop mechanism of the graphical user interface of U-Compare. These workflows also provide a platform to study the relationship between text mining components such as tokenisation and named entity recognition (using maximum entropy Markov model (MEMM) and pattern recognition based classifiers). Results indicate that, for chemistry in particular, eliminating noise generated by tokenisation techniques lead to a slightly better performance than others, in terms of named entity recognition (NER) accuracy. Poor tokenisation translates into poorer input to the classifier components which in turn leads to an increase in Type I or Type II errors, thus, lowering the overall performance. On the Sciborg corpus, the workflow based system, which uses a new tokeniser whilst retaining the same MEMM component, increases the F-score from 82.35% to 84.44%. On the PubMed corpus, it recorded an F-score of 84.84% as against 84.23% by OSCAR.
Character-level neural network for biomedical named entity recognition.
Gridach, Mourad
2017-06-01
Biomedical named entity recognition (BNER), which extracts important named entities such as genes and proteins, is a challenging task in automated systems that mine knowledge in biomedical texts. The previous state-of-the-art systems required large amounts of task-specific knowledge in the form of feature engineering, lexicons and data pre-processing to achieve high performance. In this paper, we introduce a novel neural network architecture that benefits from both word- and character-level representations automatically, by using a combination of bidirectional long short-term memory (LSTM) and conditional random field (CRF) eliminating the need for most feature engineering tasks. We evaluate our system on two datasets: JNLPBA corpus and the BioCreAtIvE II Gene Mention (GM) corpus. We obtained state-of-the-art performance by outperforming the previous systems. To the best of our knowledge, we are the first to investigate the combination of deep neural networks, CRF, word embeddings and character-level representation in recognizing biomedical named entities. Copyright © 2017 Elsevier Inc. All rights reserved.
Chun, Hong-Woo; Tsuruoka, Yoshimasa; Kim, Jin-Dong; Shiba, Rie; Nagata, Naoki; Hishiki, Teruyoshi; Tsujii, Jun'ichi
2006-01-01
Background Automatic recognition of relations between a specific disease term and its relevant genes or protein terms is an important practice of bioinformatics. Considering the utility of the results of this approach, we identified prostate cancer and gene terms with the ID tags of public biomedical databases. Moreover, considering that genetics experts will use our results, we classified them based on six topics that can be used to analyze the type of prostate cancers, genes, and their relations. Methods We developed a maximum entropy-based named entity recognizer and a relation recognizer and applied them to a corpus-based approach. We collected prostate cancer-related abstracts from MEDLINE, and constructed an annotated corpus of gene and prostate cancer relations based on six topics by biologists. We used it to train the maximum entropy-based named entity recognizer and relation recognizer. Results Topic-classified relation recognition achieved 92.1% precision for the relation (an increase of 11.0% from that obtained in a baseline experiment). For all topics, the precision was between 67.6 and 88.1%. Conclusion A series of experimental results revealed two important findings: a carefully designed relation recognition system using named entity recognition can improve the performance of relation recognition, and topic-classified relation recognition can be effectively addressed through a corpus-based approach using manual annotation and machine learning techniques. PMID:17134477
Chun, Hong-Woo; Tsuruoka, Yoshimasa; Kim, Jin-Dong; Shiba, Rie; Nagata, Naoki; Hishiki, Teruyoshi; Tsujii, Jun'ichi
2006-11-24
Automatic recognition of relations between a specific disease term and its relevant genes or protein terms is an important practice of bioinformatics. Considering the utility of the results of this approach, we identified prostate cancer and gene terms with the ID tags of public biomedical databases. Moreover, considering that genetics experts will use our results, we classified them based on six topics that can be used to analyze the type of prostate cancers, genes, and their relations. We developed a maximum entropy-based named entity recognizer and a relation recognizer and applied them to a corpus-based approach. We collected prostate cancer-related abstracts from MEDLINE, and constructed an annotated corpus of gene and prostate cancer relations based on six topics by biologists. We used it to train the maximum entropy-based named entity recognizer and relation recognizer. Topic-classified relation recognition achieved 92.1% precision for the relation (an increase of 11.0% from that obtained in a baseline experiment). For all topics, the precision was between 67.6 and 88.1%. A series of experimental results revealed two important findings: a carefully designed relation recognition system using named entity recognition can improve the performance of relation recognition, and topic-classified relation recognition can be effectively addressed through a corpus-based approach using manual annotation and machine learning techniques.
Dai, Hong-Jie; Lai, Po-Ting; Chang, Yung-Chun; Tsai, Richard Tzong-Han
2015-01-01
The functions of chemical compounds and drugs that affect biological processes and their particular effect on the onset and treatment of diseases have attracted increasing interest with the advancement of research in the life sciences. To extract knowledge from the extensive literatures on such compounds and drugs, the organizers of BioCreative IV administered the CHEMical Compound and Drug Named Entity Recognition (CHEMDNER) task to establish a standard dataset for evaluating state-of-the-art chemical entity recognition methods. This study introduces the approach of our CHEMDNER system. Instead of emphasizing the development of novel feature sets for machine learning, this study investigates the effect of various tag schemes on the recognition of the names of chemicals and drugs by using conditional random fields. Experiments were conducted using combinations of different tokenization strategies and tag schemes to investigate the effects of tag set selection and tokenization method on the CHEMDNER task. This study presents the performance of CHEMDNER of three more representative tag schemes-IOBE, IOBES, and IOB12E-when applied to a widely utilized IOB tag set and combined with the coarse-/fine-grained tokenization methods. The experimental results thus reveal that the fine-grained tokenization strategy performance best in terms of precision, recall and F-scores when the IOBES tag set was utilized. The IOBES model with fine-grained tokenization yielded the best-F-scores in the six chemical entity categories other than the "Multiple" entity category. Nonetheless, no significant improvement was observed when a more representative tag schemes was used with the coarse or fine-grained tokenization rules. The best F-scores that were achieved using the developed system on the test dataset of the CHEMDNER task were 0.833 and 0.815 for the chemical documents indexing and the chemical entity mention recognition tasks, respectively. The results herein highlight the importance of tag set selection and the use of different tokenization strategies. Fine-grained tokenization combined with the tag set IOBES most effectively recognizes chemical and drug names. To the best of the authors' knowledge, this investigation is the first comprehensive investigation use of various tag set schemes combined with different tokenization strategies for the recognition of chemical entities.
University of Glasgow at TREC 2009: Experiments with Terrier
2009-11-01
identify entities in the category B subset of the corpus, we resort to an efficient dictionary -based named en- tity recognition approach.4 In particular...we build a large dictio- nary of entity names using DBPedia,5 a structured representation of Wikipedia. Dictionary entries comprise all known...aliases for each unique entity, as obtained from DBPedia (e.g., ‘Barack Obama’ is represented by the dictionary entries ‘Barack Obama’ and ‘44th President
An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition.
Luo, Ling; Yang, Zhihao; Yang, Pei; Zhang, Yin; Wang, Lei; Lin, Hongfei; Wang, Jian
2018-04-15
In biomedical research, chemical is an important class of entities, and chemical named entity recognition (NER) is an important task in the field of biomedical information extraction. However, most popular chemical NER methods are based on traditional machine learning and their performances are heavily dependent on the feature engineering. Moreover, these methods are sentence-level ones which have the tagging inconsistency problem. In this paper, we propose a neural network approach, i.e. attention-based bidirectional Long Short-Term Memory with a conditional random field layer (Att-BiLSTM-CRF), to document-level chemical NER. The approach leverages document-level global information obtained by attention mechanism to enforce tagging consistency across multiple instances of the same token in a document. It achieves better performances with little feature engineering than other state-of-the-art methods on the BioCreative IV chemical compound and drug name recognition (CHEMDNER) corpus and the BioCreative V chemical-disease relation (CDR) task corpus (the F-scores of 91.14 and 92.57%, respectively). Data and code are available at https://github.com/lingluodlut/Att-ChemdNER. yangzh@dlut.edu.cn or wangleibihami@gmail.com. Supplementary data are available at Bioinformatics online.
LeadMine: a grammar and dictionary driven approach to entity recognition.
Lowe, Daniel M; Sayle, Roger A
2015-01-01
Chemical entity recognition has traditionally been performed by machine learning approaches. Here we describe an approach using grammars and dictionaries. This approach has the advantage that the entities found can be directly related to a given grammar or dictionary, which allows the type of an entity to be known and, if an entity is misannotated, indicates which resource should be corrected. As recognition is driven by what is expected, if spelling errors occur, they can be corrected. Correcting such errors is highly useful when attempting to lookup an entity in a database or, in the case of chemical names, converting them to structures. Our system uses a mixture of expertly curated grammars and dictionaries, as well as dictionaries automatically derived from public resources. We show that the heuristics developed to filter our dictionary of trivial chemical names (from PubChem) yields a better performing dictionary than the previously published Jochem dictionary. Our final system performs post-processing steps to modify the boundaries of entities and to detect abbreviations. These steps are shown to significantly improve performance (2.6% and 4.0% F1-score respectively). Our complete system, with incremental post-BioCreative workshop improvements, achieves 89.9% precision and 85.4% recall (87.6% F1-score) on the CHEMDNER test set. Grammar and dictionary approaches can produce results at least as good as the current state of the art in machine learning approaches. While machine learning approaches are commonly thought of as "black box" systems, our approach directly links the output entities to the input dictionaries and grammars. Our approach also allows correction of errors in detected entities, which can assist with entity resolution.
LeadMine: a grammar and dictionary driven approach to entity recognition
2015-01-01
Background Chemical entity recognition has traditionally been performed by machine learning approaches. Here we describe an approach using grammars and dictionaries. This approach has the advantage that the entities found can be directly related to a given grammar or dictionary, which allows the type of an entity to be known and, if an entity is misannotated, indicates which resource should be corrected. As recognition is driven by what is expected, if spelling errors occur, they can be corrected. Correcting such errors is highly useful when attempting to lookup an entity in a database or, in the case of chemical names, converting them to structures. Results Our system uses a mixture of expertly curated grammars and dictionaries, as well as dictionaries automatically derived from public resources. We show that the heuristics developed to filter our dictionary of trivial chemical names (from PubChem) yields a better performing dictionary than the previously published Jochem dictionary. Our final system performs post-processing steps to modify the boundaries of entities and to detect abbreviations. These steps are shown to significantly improve performance (2.6% and 4.0% F1-score respectively). Our complete system, with incremental post-BioCreative workshop improvements, achieves 89.9% precision and 85.4% recall (87.6% F1-score) on the CHEMDNER test set. Conclusions Grammar and dictionary approaches can produce results at least as good as the current state of the art in machine learning approaches. While machine learning approaches are commonly thought of as "black box" systems, our approach directly links the output entities to the input dictionaries and grammars. Our approach also allows correction of errors in detected entities, which can assist with entity resolution. PMID:25810776
Anaplastic sarcoma of the kidney.
Labanaris, Apostolos; Zugor, Vahudin; Smiszek, Robert; Nützel, Reinhold; Kühn, Reinhard
2009-02-15
Wilms tumor can appear with a wide spectrum of morphologic features and can sometimes cover or delay the recognition of other clinicopathologic entities of the kidney. We present a case of a new tumor entity of the kidney, namely the anaplastic sarcoma of the kidney, a tumor of high malignancy.
Skeppstedt, Maria; Kvist, Maria; Nilsson, Gunnar H; Dalianis, Hercules
2014-06-01
Automatic recognition of clinical entities in the narrative text of health records is useful for constructing applications for documentation of patient care, as well as for secondary usage in the form of medical knowledge extraction. There are a number of named entity recognition studies on English clinical text, but less work has been carried out on clinical text in other languages. This study was performed on Swedish health records, and focused on four entities that are highly relevant for constructing a patient overview and for medical hypothesis generation, namely the entities: Disorder, Finding, Pharmaceutical Drug and Body Structure. The study had two aims: to explore how well named entity recognition methods previously applied to English clinical text perform on similar texts written in Swedish; and to evaluate whether it is meaningful to divide the more general category Medical Problem, which has been used in a number of previous studies, into the two more granular entities, Disorder and Finding. Clinical notes from a Swedish internal medicine emergency unit were annotated for the four selected entity categories, and the inter-annotator agreement between two pairs of annotators was measured, resulting in an average F-score of 0.79 for Disorder, 0.66 for Finding, 0.90 for Pharmaceutical Drug and 0.80 for Body Structure. A subset of the developed corpus was thereafter used for finding suitable features for training a conditional random fields model. Finally, a new model was trained on this subset, using the best features and settings, and its ability to generalise to held-out data was evaluated. This final model obtained an F-score of 0.81 for Disorder, 0.69 for Finding, 0.88 for Pharmaceutical Drug, 0.85 for Body Structure and 0.78 for the combined category Disorder+Finding. The obtained results, which are in line with or slightly lower than those for similar studies on English clinical text, many of them conducted using a larger training data set, show that the approaches used for English are also suitable for Swedish clinical text. However, a small proportion of the errors made by the model are less likely to occur in English text, showing that results might be improved by further tailoring the system to clinical Swedish. The entity recognition results for the individual entities Disorder and Finding show that it is meaningful to separate the general category Medical Problem into these two more granular entity types, e.g. for knowledge mining of co-morbidity relations and disorder-finding relations. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Abstracts versus Full Texts and Patents: A Quantitative Analysis of Biomedical Entities
NASA Astrophysics Data System (ADS)
Müller, Bernd; Klinger, Roman; Gurulingappa, Harsha; Mevissen, Heinz-Theodor; Hofmann-Apitius, Martin; Fluck, Juliane; Friedrich, Christoph M.
In information retrieval, named entity recognition gives the opportunity to apply semantic search in domain specific corpora. Recently, more full text patents and journal articles became freely available. As the information distribution amongst the different sections is unknown, an analysis of the diversity is of interest.
ERIC Educational Resources Information Center
Wolk, D.A.; Coslett, H.B.; Glosser, G.
2005-01-01
The role of sensory-motor representations in object recognition was investigated in experiments involving AD, a patient with mild visual agnosia who was impaired in the recognition of visually presented living as compared to non-living entities. AD named visually presented items for which sensory-motor information was available significantly more…
Entity recognition in the biomedical domain using a hybrid approach.
Basaldella, Marco; Furrer, Lenz; Tasso, Carlo; Rinaldi, Fabio
2017-11-09
This article describes a high-recall, high-precision approach for the extraction of biomedical entities from scientific articles. The approach uses a two-stage pipeline, combining a dictionary-based entity recognizer with a machine-learning classifier. First, the OGER entity recognizer, which has a bias towards high recall, annotates the terms that appear in selected domain ontologies. Subsequently, the Distiller framework uses this information as a feature for a machine learning algorithm to select the relevant entities only. For this step, we compare two different supervised machine-learning algorithms: Conditional Random Fields and Neural Networks. In an in-domain evaluation using the CRAFT corpus, we test the performance of the combined systems when recognizing chemicals, cell types, cellular components, biological processes, molecular functions, organisms, proteins, and biological sequences. Our best system combines dictionary-based candidate generation with Neural-Network-based filtering. It achieves an overall precision of 86% at a recall of 60% on the named entity recognition task, and a precision of 51% at a recall of 49% on the concept recognition task. These results are to our knowledge the best reported so far in this particular task.
Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.
Wu, Yonghui; Jiang, Min; Lei, Jianbo; Xu, Hua
2015-01-01
Rapid growth in electronic health records (EHRs) use has led to an unprecedented expansion of available clinical data in electronic formats. However, much of the important healthcare information is locked in the narrative documents. Therefore Natural Language Processing (NLP) technologies, e.g., Named Entity Recognition that identifies boundaries and types of entities, has been extensively studied to unlock important clinical information in free text. In this study, we investigated a novel deep learning method to recognize clinical entities in Chinese clinical documents using the minimal feature engineering approach. We developed a deep neural network (DNN) to generate word embeddings from a large unlabeled corpus through unsupervised learning and another DNN for the NER task. The experiment results showed that the DNN with word embeddings trained from the large unlabeled corpus outperformed the state-of-the-art CRF's model in the minimal feature engineering setting, achieving the highest F1-score of 0.9280. Further analysis showed that word embeddings derived through unsupervised learning from large unlabeled corpus remarkably improved the DNN with randomized embedding, denoting the usefulness of unsupervised feature learning.
Dingare, Shipra; Nissim, Malvina; Finkel, Jenny; Grover, Claire
2005-01-01
We present a maximum entropy-based system for identifying named entities (NEs) in biomedical abstracts and present its performance in the only two biomedical named entity recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained an exact match F-score of 83.2% in the BioCreative evaluation and 70.1% in the BioNLP evaluation. We discuss our system in detail, including its rich use of local features, attention to correct boundary identification, innovative use of external knowledge resources, including parsing and web searches, and rapid adaptation to new NE sets. We also discuss in depth problems with data annotation in the evaluations which caused the final performance to be lower than optimal. PMID:18629295
Named Entity Recognition in a Hungarian NL Based QA System
NASA Astrophysics Data System (ADS)
Tikkl, Domonkos; Szidarovszky, P. Ferenc; Kardkovacs, Zsolt T.; Magyar, Gábor
In WoW project our purpose is to create a complex search interface with the following features: search in the deep web content of contracted partners' databases, processing Hungarian natural language (NL) questions and transforming them to SQL queries for database access, image search supported by a visual thesaurus that describes in a structural form the visual content of images (also in Hungarian). This paper primarily focuses on a particular problem of question processing task: the entity recognition. Before going into details we give a short overview of the project's aims.
Deep learning with word embeddings improves biomedical named entity recognition.
Habibi, Maryam; Weber, Leon; Neves, Mariana; Wiegandt, David Luis; Leser, Ulf
2017-07-15
Text mining has become an important tool for biomedical research. The most fundamental text-mining task is the recognition of biomedical named entities (NER), such as genes, chemicals and diseases. Current NER methods rely on pre-defined features which try to capture the specific surface properties of entity types, properties of the typical local context, background knowledge, and linguistic information. State-of-the-art tools are entity-specific, as dictionaries and empirically optimal feature sets differ between entity types, which makes their development costly. Furthermore, features are often optimized for a specific gold standard corpus, which makes extrapolation of quality measures difficult. We show that a completely generic method based on deep learning and statistical word embeddings [called long short-term memory network-conditional random field (LSTM-CRF)] outperforms state-of-the-art entity-specific NER tools, and often by a large margin. To this end, we compared the performance of LSTM-CRF on 33 data sets covering five different entity classes with that of best-of-class NER tools and an entity-agnostic CRF implementation. On average, F1-score of LSTM-CRF is 5% above that of the baselines, mostly due to a sharp increase in recall. The source code for LSTM-CRF is available at https://github.com/glample/tagger and the links to the corpora are available at https://corposaurus.github.io/corpora/ . habibima@informatik.hu-berlin.de. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Deep learning with word embeddings improves biomedical named entity recognition
Habibi, Maryam; Weber, Leon; Neves, Mariana; Wiegandt, David Luis; Leser, Ulf
2017-01-01
Abstract Motivation: Text mining has become an important tool for biomedical research. The most fundamental text-mining task is the recognition of biomedical named entities (NER), such as genes, chemicals and diseases. Current NER methods rely on pre-defined features which try to capture the specific surface properties of entity types, properties of the typical local context, background knowledge, and linguistic information. State-of-the-art tools are entity-specific, as dictionaries and empirically optimal feature sets differ between entity types, which makes their development costly. Furthermore, features are often optimized for a specific gold standard corpus, which makes extrapolation of quality measures difficult. Results: We show that a completely generic method based on deep learning and statistical word embeddings [called long short-term memory network-conditional random field (LSTM-CRF)] outperforms state-of-the-art entity-specific NER tools, and often by a large margin. To this end, we compared the performance of LSTM-CRF on 33 data sets covering five different entity classes with that of best-of-class NER tools and an entity-agnostic CRF implementation. On average, F1-score of LSTM-CRF is 5% above that of the baselines, mostly due to a sharp increase in recall. Availability and implementation: The source code for LSTM-CRF is available at https://github.com/glample/tagger and the links to the corpora are available at https://corposaurus.github.io/corpora/. Contact: habibima@informatik.hu-berlin.de PMID:28881963
Jonnagaddala, Jitendra; Jue, Toni Rose; Chang, Nai-Wen; Dai, Hong-Jie
2016-01-01
The rapidly increasing biomedical literature calls for the need of an automatic approach in the recognition and normalization of disease mentions in order to increase the precision and effectivity of disease based information retrieval. A variety of methods have been proposed to deal with the problem of disease named entity recognition and normalization. Among all the proposed methods, conditional random fields (CRFs) and dictionary lookup method are widely used for named entity recognition and normalization respectively. We herein developed a CRF-based model to allow automated recognition of disease mentions, and studied the effect of various techniques in improving the normalization results based on the dictionary lookup approach. The dataset from the BioCreative V CDR track was used to report the performance of the developed normalization methods and compare with other existing dictionary lookup based normalization methods. The best configuration achieved an F-measure of 0.77 for the disease normalization, which outperformed the best dictionary lookup based baseline method studied in this work by an F-measure of 0.13. Database URL: https://github.com/TCRNBioinformatics/DiseaseExtract PMID:27504009
A modular framework for biomedical concept recognition
2013-01-01
Background Concept recognition is an essential task in biomedical information extraction, presenting several complex and unsolved challenges. The development of such solutions is typically performed in an ad-hoc manner or using general information extraction frameworks, which are not optimized for the biomedical domain and normally require the integration of complex external libraries and/or the development of custom tools. Results This article presents Neji, an open source framework optimized for biomedical concept recognition built around four key characteristics: modularity, scalability, speed, and usability. It integrates modules for biomedical natural language processing, such as sentence splitting, tokenization, lemmatization, part-of-speech tagging, chunking and dependency parsing. Concept recognition is provided through dictionary matching and machine learning with normalization methods. Neji also integrates an innovative concept tree implementation, supporting overlapped concept names and respective disambiguation techniques. The most popular input and output formats, namely Pubmed XML, IeXML, CoNLL and A1, are also supported. On top of the built-in functionalities, developers and researchers can implement new processing modules or pipelines, or use the provided command-line interface tool to build their own solutions, applying the most appropriate techniques to identify heterogeneous biomedical concepts. Neji was evaluated against three gold standard corpora with heterogeneous biomedical concepts (CRAFT, AnEM and NCBI disease corpus), achieving high performance results on named entity recognition (F1-measure for overlap matching: species 95%, cell 92%, cellular components 83%, gene and proteins 76%, chemicals 65%, biological processes and molecular functions 63%, disorders 85%, and anatomical entities 82%) and on entity normalization (F1-measure for overlap name matching and correct identifier included in the returned list of identifiers: species 88%, cell 71%, cellular components 72%, gene and proteins 64%, chemicals 53%, and biological processes and molecular functions 40%). Neji provides fast and multi-threaded data processing, annotating up to 1200 sentences/second when using dictionary-based concept identification. Conclusions Considering the provided features and underlying characteristics, we believe that Neji is an important contribution to the biomedical community, streamlining the development of complex concept recognition solutions. Neji is freely available at http://bioinformatics.ua.pt/neji. PMID:24063607
Gimli: open source and high-performance biomedical name recognition
2013-01-01
Background Automatic recognition of biomedical names is an essential task in biomedical information extraction, presenting several complex and unsolved challenges. In recent years, various solutions have been implemented to tackle this problem. However, limitations regarding system characteristics, customization and usability still hinder their wider application outside text mining research. Results We present Gimli, an open-source, state-of-the-art tool for automatic recognition of biomedical names. Gimli includes an extended set of implemented and user-selectable features, such as orthographic, morphological, linguistic-based, conjunctions and dictionary-based. A simple and fast method to combine different trained models is also provided. Gimli achieves an F-measure of 87.17% on GENETAG and 72.23% on JNLPBA corpus, significantly outperforming existing open-source solutions. Conclusions Gimli is an off-the-shelf, ready to use tool for named-entity recognition, providing trained and optimized models for recognition of biomedical entities from scientific text. It can be used as a command line tool, offering full functionality, including training of new models and customization of the feature set and model parameters through a configuration file. Advanced users can integrate Gimli in their text mining workflows through the provided library, and extend or adapt its functionalities. Based on the underlying system characteristics and functionality, both for final users and developers, and on the reported performance results, we believe that Gimli is a state-of-the-art solution for biomedical NER, contributing to faster and better research in the field. Gimli is freely available at http://bioinformatics.ua.pt/gimli. PMID:23413997
Munkhdalai, Tsendsuren; Li, Meijing; Batsuren, Khuyagbaatar; Park, Hyeon Ah; Choi, Nak Hyeon; Ryu, Keun Ho
2015-01-01
Chemical and biomedical Named Entity Recognition (NER) is an essential prerequisite task before effective text mining can begin for biochemical-text data. Exploiting unlabeled text data to leverage system performance has been an active and challenging research topic in text mining due to the recent growth in the amount of biomedical literature. We present a semi-supervised learning method that efficiently exploits unlabeled data in order to incorporate domain knowledge into a named entity recognition model and to leverage system performance. The proposed method includes Natural Language Processing (NLP) tasks for text preprocessing, learning word representation features from a large amount of text data for feature extraction, and conditional random fields for token classification. Other than the free text in the domain, the proposed method does not rely on any lexicon nor any dictionary in order to keep the system applicable to other NER tasks in bio-text data. We extended BANNER, a biomedical NER system, with the proposed method. This yields an integrated system that can be applied to chemical and drug NER or biomedical NER. We call our branch of the BANNER system BANNER-CHEMDNER, which is scalable over millions of documents, processing about 530 documents per minute, is configurable via XML, and can be plugged into other systems by using the BANNER Unstructured Information Management Architecture (UIMA) interface. BANNER-CHEMDNER achieved an 85.68% and an 86.47% F-measure on the testing sets of CHEMDNER Chemical Entity Mention (CEM) and Chemical Document Indexing (CDI) subtasks, respectively, and achieved an 87.04% F-measure on the official testing set of the BioCreative II gene mention task, showing remarkable performance in both chemical and biomedical NER. BANNER-CHEMDNER system is available at: https://bitbucket.org/tsendeemts/banner-chemdner.
2017-01-01
Evidence-based dietary information represented as unstructured text is a crucial information that needs to be accessed in order to help dietitians follow the new knowledge arrives daily with newly published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They are focused on, for example extracting gene mentions, proteins mentions, relationships between genes and proteins, chemical concepts and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge this is the first attempt at extracting dietary concepts. DrNER is a rule-based NER that consists of two phases. The first one involves the detection and determination of the entities mention, and the second one involves the selection and extraction of the entities. We evaluate the method by using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. Evaluation of the method showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations. PMID:28644863
Recognition of chemical entities: combining dictionary-based and grammar-based approaches.
Akhondi, Saber A; Hettne, Kristina M; van der Horst, Eelke; van Mulligen, Erik M; Kors, Jan A
2015-01-01
The past decade has seen an upsurge in the number of publications in chemistry. The ever-swelling volume of available documents makes it increasingly hard to extract relevant new information from such unstructured texts. The BioCreative CHEMDNER challenge invites the development of systems for the automatic recognition of chemicals in text (CEM task) and for ranking the recognized compounds at the document level (CDI task). We investigated an ensemble approach where dictionary-based named entity recognition is used along with grammar-based recognizers to extract compounds from text. We assessed the performance of ten different commercial and publicly available lexical resources using an open source indexing system (Peregrine), in combination with three different chemical compound recognizers and a set of regular expressions to recognize chemical database identifiers. The effect of different stop-word lists, case-sensitivity matching, and use of chunking information was also investigated. We focused on lexical resources that provide chemical structure information. To rank the different compounds found in a text, we used a term confidence score based on the normalized ratio of the term frequencies in chemical and non-chemical journals. The use of stop-word lists greatly improved the performance of the dictionary-based recognition, but there was no additional benefit from using chunking information. A combination of ChEBI and HMDB as lexical resources, the LeadMine tool for grammar-based recognition, and the regular expressions, outperformed any of the individual systems. On the test set, the F-scores were 77.8% (recall 71.2%, precision 85.8%) for the CEM task and 77.6% (recall 71.7%, precision 84.6%) for the CDI task. Missed terms were mainly due to tokenization issues, poor recognition of formulas, and term conjunctions. We developed an ensemble system that combines dictionary-based and grammar-based approaches for chemical named entity recognition, outperforming any of the individual systems that we considered. The system is able to provide structure information for most of the compounds that are found. Improved tokenization and better recognition of specific entity types is likely to further improve system performance.
Recognition of chemical entities: combining dictionary-based and grammar-based approaches
2015-01-01
Background The past decade has seen an upsurge in the number of publications in chemistry. The ever-swelling volume of available documents makes it increasingly hard to extract relevant new information from such unstructured texts. The BioCreative CHEMDNER challenge invites the development of systems for the automatic recognition of chemicals in text (CEM task) and for ranking the recognized compounds at the document level (CDI task). We investigated an ensemble approach where dictionary-based named entity recognition is used along with grammar-based recognizers to extract compounds from text. We assessed the performance of ten different commercial and publicly available lexical resources using an open source indexing system (Peregrine), in combination with three different chemical compound recognizers and a set of regular expressions to recognize chemical database identifiers. The effect of different stop-word lists, case-sensitivity matching, and use of chunking information was also investigated. We focused on lexical resources that provide chemical structure information. To rank the different compounds found in a text, we used a term confidence score based on the normalized ratio of the term frequencies in chemical and non-chemical journals. Results The use of stop-word lists greatly improved the performance of the dictionary-based recognition, but there was no additional benefit from using chunking information. A combination of ChEBI and HMDB as lexical resources, the LeadMine tool for grammar-based recognition, and the regular expressions, outperformed any of the individual systems. On the test set, the F-scores were 77.8% (recall 71.2%, precision 85.8%) for the CEM task and 77.6% (recall 71.7%, precision 84.6%) for the CDI task. Missed terms were mainly due to tokenization issues, poor recognition of formulas, and term conjunctions. Conclusions We developed an ensemble system that combines dictionary-based and grammar-based approaches for chemical named entity recognition, outperforming any of the individual systems that we considered. The system is able to provide structure information for most of the compounds that are found. Improved tokenization and better recognition of specific entity types is likely to further improve system performance. PMID:25810767
Developing a hybrid dictionary-based bio-entity recognition technique.
Song, Min; Yu, Hwanjo; Han, Wook-Shin
2015-01-01
Bio-entity extraction is a pivotal component for information extraction from biomedical literature. The dictionary-based bio-entity extraction is the first generation of Named Entity Recognition (NER) techniques. This paper presents a hybrid dictionary-based bio-entity extraction technique. The approach expands the bio-entity dictionary by combining different data sources and improves the recall rate through the shortest path edit distance algorithm. In addition, the proposed technique adopts text mining techniques in the merging stage of similar entities such as Part of Speech (POS) expansion, stemming, and the exploitation of the contextual cues to further improve the performance. The experimental results show that the proposed technique achieves the best or at least equivalent performance among compared techniques, GENIA, MESH, UMLS, and combinations of these three resources in F-measure. The results imply that the performance of dictionary-based extraction techniques is largely influenced by information resources used to build the dictionary. In addition, the edit distance algorithm shows steady performance with three different dictionaries in precision whereas the context-only technique achieves a high-end performance with three difference dictionaries in recall.
Developing a hybrid dictionary-based bio-entity recognition technique
2015-01-01
Background Bio-entity extraction is a pivotal component for information extraction from biomedical literature. The dictionary-based bio-entity extraction is the first generation of Named Entity Recognition (NER) techniques. Methods This paper presents a hybrid dictionary-based bio-entity extraction technique. The approach expands the bio-entity dictionary by combining different data sources and improves the recall rate through the shortest path edit distance algorithm. In addition, the proposed technique adopts text mining techniques in the merging stage of similar entities such as Part of Speech (POS) expansion, stemming, and the exploitation of the contextual cues to further improve the performance. Results The experimental results show that the proposed technique achieves the best or at least equivalent performance among compared techniques, GENIA, MESH, UMLS, and combinations of these three resources in F-measure. Conclusions The results imply that the performance of dictionary-based extraction techniques is largely influenced by information resources used to build the dictionary. In addition, the edit distance algorithm shows steady performance with three different dictionaries in precision whereas the context-only technique achieves a high-end performance with three difference dictionaries in recall. PMID:26043907
Naming and recognizing famous faces in temporal lobe epilepsy.
Glosser, G; Salvucci, A E; Chiaravalloti, N D
2003-07-08
To assess naming and recognition of faces of familiar famous people in patients with epilepsy before and after anterior temporal lobectomy (ATL). Color photographs of famous people were presented for naming and description to 63 patients with temporal lobe epilepsy (TLE) either before or after ATL and to 10 healthy age- and education-matched controls. Spontaneous naming of photographed famous people was impaired in all patient groups, but was most abnormal in patients who had undergone left ATL. When allowed to demonstrate knowledge of the famous faces through verbal descriptions, rather than naming, patients with left TLE, left ATL, and right TLE improved to normal levels, but patients with right ATL were still impaired, suggesting a new deficit in identifying famous faces. Naming of famous people was related to naming of other common objects, verbal memory, and perceptual discrimination of faces. Recognition of the identity of pictured famous people was more related to visuospatial perception and memory. Lesions in anterior regions of the right temporal lobe impair recognition of the identities of familiar faces, as well as the learning of new faces. Lesions in the left temporal lobe, especially in anterior regions, disrupt access to the names of known people, but do not affect recognition of the identities of famous faces. Results are consistent with the hypothesized role of lateralized anterior temporal lobe structures in facial recognition and naming of unique entities.
Identifying non-elliptical entity mentions in a coordinated NP with ellipses.
Chae, Jeongmin; Jung, Younghee; Lee, Taemin; Jung, Soonyoung; Huh, Chan; Kim, Gilhan; Kim, Hyeoncheol; Oh, Heungbum
2014-02-01
Named entities in the biomedical domain are often written using a Noun Phrase (NP) along with a coordinating conjunction such as 'and' and 'or'. In addition, repeated words among named entity mentions are frequently omitted. It is often difficult to identify named entities. Although various Named Entity Recognition (NER) methods have tried to solve this problem, these methods can only deal with relatively simple elliptical patterns in coordinated NPs. We propose a new NER method for identifying non-elliptical entity mentions with simple or complex ellipses using linguistic rules and an entity mention dictionary. The GENIA and CRAFT corpora were used to evaluate the performance of the proposed system. The GENIA corpus was used to evaluate the performance of the system according to the quality of the dictionary. The GENIA corpus comprises 3434 non-elliptical entity mentions in 1585 coordinated NPs with ellipses. The system achieves 92.11% precision, 95.20% recall, and 93.63% F-score in identification of non-elliptical entity mentions in coordinated NPs. The accuracy of the system in resolving simple and complex ellipses is 94.54% and 91.95%, respectively. The CRAFT corpus was used to evaluate the performance of the system under realistic conditions. The system achieved 78.47% precision, 67.10% recall, and 72.34% F-score in coordinated NPs. The performance evaluations of the system show that it efficiently solves the problem caused by ellipses, and improves NER performance. The algorithm is implemented in PHP and the code can be downloaded from https://code.google.com/p/medtextmining/. Copyright © 2013. Published by Elsevier Inc.
Jonnagaddala, Jitendra; Jue, Toni Rose; Chang, Nai-Wen; Dai, Hong-Jie
2016-01-01
The rapidly increasing biomedical literature calls for the need of an automatic approach in the recognition and normalization of disease mentions in order to increase the precision and effectivity of disease based information retrieval. A variety of methods have been proposed to deal with the problem of disease named entity recognition and normalization. Among all the proposed methods, conditional random fields (CRFs) and dictionary lookup method are widely used for named entity recognition and normalization respectively. We herein developed a CRF-based model to allow automated recognition of disease mentions, and studied the effect of various techniques in improving the normalization results based on the dictionary lookup approach. The dataset from the BioCreative V CDR track was used to report the performance of the developed normalization methods and compare with other existing dictionary lookup based normalization methods. The best configuration achieved an F-measure of 0.77 for the disease normalization, which outperformed the best dictionary lookup based baseline method studied in this work by an F-measure of 0.13.Database URL: https://github.com/TCRNBioinformatics/DiseaseExtract. © The Author(s) 2016. Published by Oxford University Press.
Clinical Named Entity Recognition Using Deep Learning Models.
Wu, Yonghui; Jiang, Min; Xu, Jun; Zhi, Degui; Xu, Hua
2017-01-01
Clinical Named Entity Recognition (NER) is a critical natural language processing (NLP) task to extract important concepts (named entities) from clinical narratives. Researchers have extensively investigated machine learning models for clinical NER. Recently, there have been increasing efforts to apply deep learning models to improve the performance of current clinical NER systems. This study examined two popular deep learning architectures, the Convolutional Neural Network (CNN) and the Recurrent Neural Network (RNN), to extract concepts from clinical texts. We compared the two deep neural network architectures with three baseline Conditional Random Fields (CRFs) models and two state-of-the-art clinical NER systems using the i2b2 2010 clinical concept extraction corpus. The evaluation results showed that the RNN model trained with the word embeddings achieved a new state-of-the- art performance (a strict F1 score of 85.94%) for the defined clinical NER task, outperforming the best-reported system that used both manually defined and unsupervised learning features. This study demonstrates the advantage of using deep neural network architectures for clinical concept extraction, including distributed feature representation, automatic feature learning, and long-term dependencies capture. This is one of the first studies to compare the two widely used deep learning models and demonstrate the superior performance of the RNN model for clinical NER.
Clinical Named Entity Recognition Using Deep Learning Models
Wu, Yonghui; Jiang, Min; Xu, Jun; Zhi, Degui; Xu, Hua
2017-01-01
Clinical Named Entity Recognition (NER) is a critical natural language processing (NLP) task to extract important concepts (named entities) from clinical narratives. Researchers have extensively investigated machine learning models for clinical NER. Recently, there have been increasing efforts to apply deep learning models to improve the performance of current clinical NER systems. This study examined two popular deep learning architectures, the Convolutional Neural Network (CNN) and the Recurrent Neural Network (RNN), to extract concepts from clinical texts. We compared the two deep neural network architectures with three baseline Conditional Random Fields (CRFs) models and two state-of-the-art clinical NER systems using the i2b2 2010 clinical concept extraction corpus. The evaluation results showed that the RNN model trained with the word embeddings achieved a new state-of-the- art performance (a strict F1 score of 85.94%) for the defined clinical NER task, outperforming the best-reported system that used both manually defined and unsupervised learning features. This study demonstrates the advantage of using deep neural network architectures for clinical concept extraction, including distributed feature representation, automatic feature learning, and long-term dependencies capture. This is one of the first studies to compare the two widely used deep learning models and demonstrate the superior performance of the RNN model for clinical NER. PMID:29854252
Bachman, John A; Gyori, Benjamin M; Sorger, Peter K
2018-06-28
For automated reading of scientific publications to extract useful information about molecular mechanisms it is critical that genes, proteins and other entities be correctly associated with uniform identifiers, a process known as named entity linking or "grounding." Correct grounding is essential for resolving relationships among mined information, curated interaction databases, and biological datasets. The accuracy of this process is largely dependent on the availability of machine-readable resources associating synonyms and abbreviations commonly found in biomedical literature with uniform identifiers. In a task involving automated reading of ∼215,000 articles using the REACH event extraction software we found that grounding was disproportionately inaccurate for multi-protein families (e.g., "AKT") and complexes with multiple subunits (e.g."NF- κB"). To address this problem we constructed FamPlex, a manually curated resource defining protein families and complexes as they are commonly encountered in biomedical text. In FamPlex the gene-level constituents of families and complexes are defined in a flexible format allowing for multi-level, hierarchical membership. To create FamPlex, text strings corresponding to entities were identified empirically from literature and linked manually to uniform identifiers; these identifiers were also mapped to equivalent entries in multiple related databases. FamPlex also includes curated prefix and suffix patterns that improve named entity recognition and event extraction. Evaluation of REACH extractions on a test corpus of ∼54,000 articles showed that FamPlex significantly increased grounding accuracy for families and complexes (from 15 to 71%). The hierarchical organization of entities in FamPlex also made it possible to integrate otherwise unconnected mechanistic information across families, subfamilies, and individual proteins. Applications of FamPlex to the TRIPS/DRUM reading system and the Biocreative VI Bioentity Normalization Task dataset demonstrated the utility of FamPlex in other settings. FamPlex is an effective resource for improving named entity recognition, grounding, and relationship resolution in automated reading of biomedical text. The content in FamPlex is available in both tabular and Open Biomedical Ontology formats at https://github.com/sorgerlab/famplex under the Creative Commons CC0 license and has been integrated into the TRIPS/DRUM and REACH reading systems.
Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions.
Mrabet, Yassine; Kilicoglu, Halil; Roberts, Kirk; Demner-Fushman, Dina
2016-01-01
Determining the main topics in consumer health questions is a crucial step in their processing as it allows narrowing the search space to a specific semantic context. In this paper we propose a topic recognition approach based on biomedical and open-domain knowledge bases. In the first step of our method, we recognize named entities in consumer health questions using an unsupervised method that relies on a biomedical knowledge base, UMLS, and an open-domain knowledge base, DBpedia. In the next step, we cast topic recognition as a binary classification problem of deciding whether a named entity is the question topic or not. We evaluated our approach on a dataset from the National Library of Medicine (NLM), introduced in this paper, and another from the Genetic and Rare Disease Information Center (GARD). The combination of knowledge bases outperformed the results obtained by individual knowledge bases by up to 16.5% F1 and achieved state-of-the-art performance. Our results demonstrate that combining open-domain knowledge bases with biomedical knowledge bases can lead to a substantial improvement in understanding user-generated health content.
Building a protein name dictionary from full text: a machine learning term extraction approach.
Shi, Lei; Campagne, Fabien
2005-04-07
The majority of information in the biological literature resides in full text articles, instead of abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants which are used to refer to biological entities in the literature. We present an approach to recognize named entities in full text. The approach collects high frequency terms in an article, and uses support vector machines (SVM) to identify biological entity names. It is also computationally efficient and robust to noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text. This dictionary term lookup method compares favourably to other published methods, supporting the significance of our direct extraction approach. The method is strong in recognizing name variants not found in SwissProt.
Building a protein name dictionary from full text: a machine learning term extraction approach
Shi, Lei; Campagne, Fabien
2005-01-01
Background The majority of information in the biological literature resides in full text articles, instead of abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants which are used to refer to biological entities in the literature. Results We present an approach to recognize named entities in full text. The approach collects high frequency terms in an article, and uses support vector machines (SVM) to identify biological entity names. It is also computationally efficient and robust to noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text. Conclusion This dictionary term lookup method compares favourably to other published methods, supporting the significance of our direct extraction approach. The method is strong in recognizing name variants not found in SwissProt. PMID:15817129
Identification of related gene/protein names based on an HMM of name variations.
Yeganova, L; Smith, L; Wilbur, W J
2004-04-01
Gene and protein names follow few, if any, true naming conventions and are subject to great variation in different occurrences of the same name. This gives rise to two important problems in natural language processing. First, can one locate the names of genes or proteins in free text, and second, can one determine when two names denote the same gene or protein? The first of these problems is a special case of the problem of named entity recognition, while the second is a special case of the problem of automatic term recognition (ATR). We study the second problem, that of gene or protein name variation. Here we describe a system which, given a query gene or protein name, identifies related gene or protein names in a large list. The system is based on a dynamic programming algorithm for sequence alignment in which the mutation matrix is allowed to vary under the control of a fully trainable hidden Markov model.
Usié, Anabel; Cruz, Joaquim; Comas, Jorge; Solsona, Francesc; Alves, Rui
2015-01-01
Small chemical molecules regulate biological processes at the molecular level. Those molecules are often involved in causing or treating pathological states. Automatically identifying such molecules in biomedical text is difficult due to both, the diverse morphology of chemical names and the alternative types of nomenclature that are simultaneously used to describe them. To address these issues, the last BioCreAtIvE challenge proposed a CHEMDNER task, which is a Named Entity Recognition (NER) challenge that aims at labelling different types of chemical names in biomedical text. To address this challenge we tested various approaches to recognizing chemical entities in biomedical documents. These approaches range from linear Conditional Random Fields (CRFs) to a combination of CRFs with regular expression and dictionary matching, followed by a post-processing step to tag those chemical names in a corpus of Medline abstracts. We named our best performing systems CheNER. We evaluate the performance of the various approaches using the F-score statistics. Higher F-scores indicate better performance. The highest F-score we obtain in identifying unique chemical entities is 72.88%. The highest F-score we obtain in identifying all chemical entities is 73.07%. We also evaluate the F-Score of combining our system with ChemSpot, and find an increase from 72.88% to 73.83%. CheNER presents a valid alternative for automated annotation of chemical entities in biomedical documents. In addition, CheNER may be used to derive new features to train newer methods for tagging chemical entities. CheNER can be downloaded from http://metres.udl.cat and included in text annotation pipelines.
A method for named entity normalization in biomedical articles: application to diseases and plants.
Cho, Hyejin; Choi, Wonjun; Lee, Hyunju
2017-10-13
In biomedical articles, a named entity recognition (NER) technique that identifies entity names from texts is an important element for extracting biological knowledge from articles. After NER is applied to articles, the next step is to normalize the identified names into standard concepts (i.e., disease names are mapped to the National Library of Medicine's Medical Subject Headings disease terms). In biomedical articles, many entity normalization methods rely on domain-specific dictionaries for resolving synonyms and abbreviations. However, the dictionaries are not comprehensive except for some entities such as genes. In recent years, biomedical articles have accumulated rapidly, and neural network-based algorithms that incorporate a large amount of unlabeled data have shown considerable success in several natural language processing problems. In this study, we propose an approach for normalizing biological entities, such as disease names and plant names, by using word embeddings to represent semantic spaces. For diseases, training data from the National Center for Biotechnology Information (NCBI) disease corpus and unlabeled data from PubMed abstracts were used to construct word representations. For plants, a training corpus that we manually constructed and unlabeled PubMed abstracts were used to represent word vectors. We showed that the proposed approach performed better than the use of only the training corpus or only the unlabeled data and showed that the normalization accuracy was improved by using our model even when the dictionaries were not comprehensive. We obtained F-scores of 0.808 and 0.690 for normalizing the NCBI disease corpus and manually constructed plant corpus, respectively. We further evaluated our approach using a data set in the disease normalization task of the BioCreative V challenge. When only the disease corpus was used as a dictionary, our approach significantly outperformed the best system of the task. The proposed approach shows robust performance for normalizing biological entities. The manually constructed plant corpus and the proposed model are available at http://gcancer.org/plant and http://gcancer.org/normalization , respectively.
Assessment of disease named entity recognition on a corpus of annotated sentences.
Jimeno, Antonio; Jimenez-Ruiz, Ernesto; Lee, Vivian; Gaudan, Sylvain; Berlanga, Rafael; Rebholz-Schuhmann, Dietrich
2008-04-11
In recent years, the recognition of semantic types from the biomedical scientific literature has been focused on named entities like protein and gene names (PGNs) and gene ontology terms (GO terms). Other semantic types like diseases have not received the same level of attention. Different solutions have been proposed to identify disease named entities in the scientific literature. While matching the terminology with language patterns suffers from low recall (e.g., Whatizit) other solutions make use of morpho-syntactic features to better cover the full scope of terminological variability (e.g., MetaMap). Currently, MetaMap that is provided from the National Library of Medicine (NLM) is the state of the art solution for the annotation of concepts from UMLS (Unified Medical Language System) in the literature. Nonetheless, its performance has not yet been assessed on an annotated corpus. In addition, little effort has been invested so far to generate an annotated dataset that links disease entities in text to disease entries in a database, thesaurus or ontology and that could serve as a gold standard to benchmark text mining solutions. As part of our research work, we have taken a corpus that has been delivered in the past for the identification of associations of genes to diseases based on the UMLS Metathesaurus and we have reprocessed and re-annotated the corpus. We have gathered annotations for disease entities from two curators, analyzed their disagreement (0.51 in the kappa-statistic) and composed a single annotated corpus for public use. Thereafter, three solutions for disease named entity recognition including MetaMap have been applied to the corpus to automatically annotate it with UMLS Metathesaurus concepts. The resulting annotations have been benchmarked to compare their performance. The annotated corpus is publicly available at ftp://ftp.ebi.ac.uk/pub/software/textmining/corpora/diseases and can serve as a benchmark to other systems. In addition, we found that dictionary look-up already provides competitive results indicating that the use of disease terminology is highly standardized throughout the terminologies and the literature. MetaMap generates precise results at the expense of insufficient recall while our statistical method obtains better recall at a lower precision rate. Even better results in terms of precision are achieved by combining at least two of the three methods leading, but this approach again lowers recall. Altogether, our analysis gives a better understanding of the complexity of disease annotations in the literature. MetaMap and the dictionary based approach are available through the Whatizit web service infrastructure (Rebholz-Schuhmann D, Arregui M, Gaudan S, Kirsch H, Jimeno A: Text processing through Web services: Calling Whatizit. Bioinformatics 2008, 24:296-298).
Jiang, Min; Chen, Yukun; Liu, Mei; Rosenbloom, S Trent; Mani, Subramani; Denny, Joshua C; Xu, Hua
2011-01-01
The authors' goal was to develop and evaluate machine-learning-based approaches to extracting clinical entities-including medical problems, tests, and treatments, as well as their asserted status-from hospital discharge summaries written using natural language. This project was part of the 2010 Center of Informatics for Integrating Biology and the Bedside/Veterans Affairs (VA) natural-language-processing challenge. The authors implemented a machine-learning-based named entity recognition system for clinical text and systematically evaluated the contributions of different types of features and ML algorithms, using a training corpus of 349 annotated notes. Based on the results from training data, the authors developed a novel hybrid clinical entity extraction system, which integrated heuristic rule-based modules with the ML-base named entity recognition module. The authors applied the hybrid system to the concept extraction and assertion classification tasks in the challenge and evaluated its performance using a test data set with 477 annotated notes. Standard measures including precision, recall, and F-measure were calculated using the evaluation script provided by the Center of Informatics for Integrating Biology and the Bedside/VA challenge organizers. The overall performance for all three types of clinical entities and all six types of assertions across 477 annotated notes were considered as the primary metric in the challenge. Systematic evaluation on the training set showed that Conditional Random Fields outperformed Support Vector Machines, and semantic information from existing natural-language-processing systems largely improved performance, although contributions from different types of features varied. The authors' hybrid entity extraction system achieved a maximum overall F-score of 0.8391 for concept extraction (ranked second) and 0.9313 for assertion classification (ranked fourth, but not statistically different than the first three systems) on the test data set in the challenge.
Urbain, Jay
2015-12-01
We present the design, and analyze the performance of a multi-stage natural language processing system employing named entity recognition, Bayesian statistics, and rule logic to identify and characterize heart disease risk factor events in diabetic patients over time. The system was originally developed for the 2014 i2b2 Challenges in Natural Language in Clinical Data. The system's strengths included a high level of accuracy for identifying named entities associated with heart disease risk factor events. The system's primary weakness was due to inaccuracies when characterizing the attributes of some events. For example, determining the relative time of an event with respect to the record date, whether an event is attributable to the patient's history or the patient's family history, and differentiating between current and prior smoking status. We believe these inaccuracies were due in large part to the lack of an effective approach for integrating context into our event detection model. To address these inaccuracies, we explore the addition of a distributional semantic model for characterizing contextual evidence of heart disease risk factor events. Using this semantic model, we raise our initial 2014 i2b2 Challenges in Natural Language of Clinical data F1 score of 0.838 to 0.890 and increased precision by 10.3% without use of any lexicons that might bias our results. Copyright © 2015 Elsevier Inc. All rights reserved.
Information Tailoring Enhancements for Large-Scale Social Data
2016-06-15
Intelligent Automation Incorporated Information Tailoring Enhancements for Large-Scale... Automation Incorporated Progress Report No. 3 Information Tailoring Enhancements for Large-Scale Social Data Submitted in accordance with...1 Work Performed within This Reporting Period .................................................... 2 1.1 Enhanced Named Entity Recognition (NER
The CHEMDNER corpus of chemicals and drugs and its annotation principles.
Krallinger, Martin; Rabal, Obdulia; Leitner, Florian; Vazquez, Miguel; Salgado, David; Lu, Zhiyong; Leaman, Robert; Lu, Yanan; Ji, Donghong; Lowe, Daniel M; Sayle, Roger A; Batista-Navarro, Riza Theresa; Rak, Rafal; Huber, Torsten; Rocktäschel, Tim; Matos, Sérgio; Campos, David; Tang, Buzhou; Xu, Hua; Munkhdalai, Tsendsuren; Ryu, Keun Ho; Ramanan, S V; Nathan, Senthil; Žitnik, Slavko; Bajec, Marko; Weber, Lutz; Irmer, Matthias; Akhondi, Saber A; Kors, Jan A; Xu, Shuo; An, Xin; Sikdar, Utpal Kumar; Ekbal, Asif; Yoshioka, Masaharu; Dieb, Thaer M; Choi, Miji; Verspoor, Karin; Khabsa, Madian; Giles, C Lee; Liu, Hongfang; Ravikumar, Komandur Elayavilli; Lamurias, Andre; Couto, Francisco M; Dai, Hong-Jie; Tsai, Richard Tzong-Han; Ata, Caglar; Can, Tolga; Usié, Anabel; Alves, Rui; Segura-Bedmar, Isabel; Martínez, Paloma; Oyarzabal, Julen; Valencia, Alfonso
2015-01-01
The automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of different approaches that detect chemicals in documents. We present the CHEMDNER corpus, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task. The abstracts of the CHEMDNER corpus were selected to be representative for all major chemical disciplines. Each of the chemical entity mentions was manually labeled according to its structure-associated chemical entity mention (SACEM) class: abbreviation, family, formula, identifier, multiple, systematic and trivial. The difficulty and consistency of tagging chemicals in text was measured using an agreement study between annotators, obtaining a percentage agreement of 91. For a subset of the CHEMDNER corpus (the test set of 3,000 abstracts) we provide not only the Gold Standard manual annotations, but also mentions automatically detected by the 26 teams that participated in the BioCreative IV CHEMDNER chemical mention recognition task. In addition, we release the CHEMDNER silver standard corpus of automatically extracted mentions from 17,000 randomly selected PubMed abstracts. A version of the CHEMDNER corpus in the BioC format has been generated as well. We propose a standard for required minimum information about entity annotations for the construction of domain specific corpora on chemical and drug entities. The CHEMDNER corpus and annotation guidelines are available at: http://www.biocreative.org/resources/biocreative-iv/chemdner-corpus/.
The CHEMDNER corpus of chemicals and drugs and its annotation principles
2015-01-01
The automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of different approaches that detect chemicals in documents. We present the CHEMDNER corpus, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task. The abstracts of the CHEMDNER corpus were selected to be representative for all major chemical disciplines. Each of the chemical entity mentions was manually labeled according to its structure-associated chemical entity mention (SACEM) class: abbreviation, family, formula, identifier, multiple, systematic and trivial. The difficulty and consistency of tagging chemicals in text was measured using an agreement study between annotators, obtaining a percentage agreement of 91. For a subset of the CHEMDNER corpus (the test set of 3,000 abstracts) we provide not only the Gold Standard manual annotations, but also mentions automatically detected by the 26 teams that participated in the BioCreative IV CHEMDNER chemical mention recognition task. In addition, we release the CHEMDNER silver standard corpus of automatically extracted mentions from 17,000 randomly selected PubMed abstracts. A version of the CHEMDNER corpus in the BioC format has been generated as well. We propose a standard for required minimum information about entity annotations for the construction of domain specific corpora on chemical and drug entities. The CHEMDNER corpus and annotation guidelines are available at: http://www.biocreative.org/resources/biocreative-iv/chemdner-corpus/ PMID:25810773
Transfer learning for biomedical named entity recognition with neural networks.
Giorgi, John M; Bader, Gary D
2018-06-01
The explosive increase of biomedical literature has made information extraction an increasingly important tool for biomedical research. A fundamental task is the recognition of biomedical named entities in text (BNER) such as genes/proteins, diseases, and species. Recently, a domain-independent method based on deep learning and statistical word embeddings, called long short-term memory network-conditional random field (LSTM-CRF), has been shown to outperform state-of-the-art entity-specific BNER tools. However, this method is dependent on gold-standard corpora (GSCs) consisting of hand-labeled entities, which tend to be small but highly reliable. An alternative to GSCs are silver-standard corpora (SSCs), which are generated by harmonizing the annotations made by several automatic annotation systems. SSCs typically contain more noise than GSCs but have the advantage of containing many more training examples. Ideally, these corpora could be combined to achieve the benefits of both, which is an opportunity for transfer learning. In this work, we analyze to what extent transfer learning improves upon state-of-the-art results for BNER. We demonstrate that transferring a deep neural network (DNN) trained on a large, noisy SSC to a smaller, but more reliable GSC significantly improves upon state-of-the-art results for BNER. Compared to a state-of-the-art baseline evaluated on 23 GSCs covering four different entity classes, transfer learning results in an average reduction in error of approximately 11%. We found transfer learning to be especially beneficial for target data sets with a small number of labels (approximately 6000 or less). Source code for the LSTM-CRF is available athttps://github.com/Franck-Dernoncourt/NeuroNER/ and links to the corpora are available athttps://github.com/BaderLab/Transfer-Learning-BNER-Bioinformatics-2018/. john.giorgi@utoronto.ca. Supplementary data are available at Bioinformatics online.
Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M
2012-10-01
In biological research, establishing the prior art by searching and collecting information already present in the domain has equal importance as the experiments done. To obtain a complete overview about the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured and condensed. The information content in scientific literature is vastly unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics. The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from cattle and pigs. For this purpose, the rather noncomprehensive resources of pig and cattle gene and protein terminologies were enriched with orthologue synonyms, integrated in the NER platform, ProMiner, which is successfully used in human genomics domain. Based on the performance tests done, the present system achieved a fair performance with precision 0.64, recall 0.74, and F(1) measure of 0.69 in a test scenario based on cattle literature.
TaggerOne: joint named entity recognition and normalization with semi-Markov Models
Leaman, Robert; Lu, Zhiyong
2016-01-01
Motivation: Text mining is increasingly used to manage the accelerating pace of the biomedical literature. Many text mining applications depend on accurate named entity recognition (NER) and normalization (grounding). While high performing machine learning methods trainable for many entity types exist for NER, normalization methods are usually specialized to a single entity type. NER and normalization systems are also typically used in a serial pipeline, causing cascading errors and limiting the ability of the NER system to directly exploit the lexical information provided by the normalization. Methods: We propose the first machine learning model for joint NER and normalization during both training and prediction. The model is trainable for arbitrary entity types and consists of a semi-Markov structured linear classifier, with a rich feature approach for NER and supervised semantic indexing for normalization. We also introduce TaggerOne, a Java implementation of our model as a general toolkit for joint NER and normalization. TaggerOne is not specific to any entity type, requiring only annotated training data and a corresponding lexicon, and has been optimized for high throughput. Results: We validated TaggerOne with multiple gold-standard corpora containing both mention- and concept-level annotations. Benchmarking results show that TaggerOne achieves high performance on diseases (NCBI Disease corpus, NER f-score: 0.829, normalization f-score: 0.807) and chemicals (BioCreative 5 CDR corpus, NER f-score: 0.914, normalization f-score 0.895). These results compare favorably to the previous state of the art, notwithstanding the greater flexibility of the model. We conclude that jointly modeling NER and normalization greatly improves performance. Availability and Implementation: The TaggerOne source code and an online demonstration are available at: http://www.ncbi.nlm.nih.gov/bionlp/taggerone Contact: zhiyong.lu@nih.gov Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27283952
TaggerOne: joint named entity recognition and normalization with semi-Markov Models.
Leaman, Robert; Lu, Zhiyong
2016-09-15
Text mining is increasingly used to manage the accelerating pace of the biomedical literature. Many text mining applications depend on accurate named entity recognition (NER) and normalization (grounding). While high performing machine learning methods trainable for many entity types exist for NER, normalization methods are usually specialized to a single entity type. NER and normalization systems are also typically used in a serial pipeline, causing cascading errors and limiting the ability of the NER system to directly exploit the lexical information provided by the normalization. We propose the first machine learning model for joint NER and normalization during both training and prediction. The model is trainable for arbitrary entity types and consists of a semi-Markov structured linear classifier, with a rich feature approach for NER and supervised semantic indexing for normalization. We also introduce TaggerOne, a Java implementation of our model as a general toolkit for joint NER and normalization. TaggerOne is not specific to any entity type, requiring only annotated training data and a corresponding lexicon, and has been optimized for high throughput. We validated TaggerOne with multiple gold-standard corpora containing both mention- and concept-level annotations. Benchmarking results show that TaggerOne achieves high performance on diseases (NCBI Disease corpus, NER f-score: 0.829, normalization f-score: 0.807) and chemicals (BioCreative 5 CDR corpus, NER f-score: 0.914, normalization f-score 0.895). These results compare favorably to the previous state of the art, notwithstanding the greater flexibility of the model. We conclude that jointly modeling NER and normalization greatly improves performance. The TaggerOne source code and an online demonstration are available at: http://www.ncbi.nlm.nih.gov/bionlp/taggerone zhiyong.lu@nih.gov Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.
Weegar, Rebecka; Kvist, Maria; Sundström, Karin; Brunak, Søren; Dalianis, Hercules
2015-01-01
Detection of early symptoms in cervical cancer is crucial for early treatment and survival. To find symptoms of cervical cancer in clinical text, Named Entity Recognition is needed. In this paper the Clinical Entity Finder, a machine-learning tool trained on annotated clinical text from a Swedish internal medicine emergency unit, is evaluated on cervical cancer records. The Clinical Entity Finder identifies entities of the types body part, finding and disorder and is extended with negation detection using the rule-based tool NegEx, to distinguish between negated and non-negated entities. To measure the performance of the tools on this new domain, two physicians annotated a set of clinical notes from the health records of cervical cancer patients. The inter-annotator agreement for finding, disorder and body part obtained an average F-score of 0.677 and the Clinical Entity Finder extended with NegEx had an average F-score of 0.667. PMID:26958270
Weegar, Rebecka; Kvist, Maria; Sundström, Karin; Brunak, Søren; Dalianis, Hercules
2015-01-01
Detection of early symptoms in cervical cancer is crucial for early treatment and survival. To find symptoms of cervical cancer in clinical text, Named Entity Recognition is needed. In this paper the Clinical Entity Finder, a machine-learning tool trained on annotated clinical text from a Swedish internal medicine emergency unit, is evaluated on cervical cancer records. The Clinical Entity Finder identifies entities of the types body part, finding and disorder and is extended with negation detection using the rule-based tool NegEx, to distinguish between negated and non-negated entities. To measure the performance of the tools on this new domain, two physicians annotated a set of clinical notes from the health records of cervical cancer patients. The inter-annotator agreement for finding, disorder and body part obtained an average F-score of 0.677 and the Clinical Entity Finder extended with NegEx had an average F-score of 0.667.
Identifying interactions between chemical entities in biomedical text.
Lamurias, Andre; Ferreira, João D; Couto, Francisco M
2014-10-23
Interactions between chemical compounds described in biomedical text can be of great importance to drug discovery and design, as well as pharmacovigilance. We developed a novel system, \\"Identifying Interactions between Chemical Entities\\" (IICE), to identify chemical interactions described in text. Kernel-based Support Vector Machines first identify the interactions and then an ensemble classifier validates and classifies the type of each interaction. This relation extraction module was evaluated with the corpus released for the DDI Extraction task of SemEval 2013, obtaining results comparable to state-of-the-art methods for this type of task. We integrated this module with our chemical named entity recognition module and made the whole system available as a web tool at www.lasige.di.fc.ul.pt/webtools/iice.
Identifying interactions between chemical entities in biomedical text.
Lamurias, Andre; Ferreira, João D; Couto, Francisco M
2014-12-01
Interactions between chemical compounds described in biomedical text can be of great importance to drug discovery and design, as well as pharmacovigilance. We developed a novel system, "Identifying Interactions between Chemical Entities" (IICE), to identify chemical interactions described in text. Kernel-based Support Vector Machines first identify the interactions and then an ensemble classifier validates and classifies the type of each interaction. This relation extraction module was evaluated with the corpus released for the DDI Extraction task of SemEval 2013, obtaining results comparable to stateof- the-art methods for this type of task. We integrated this module with our chemical named entity recognition module and made the whole system available as a web tool at www.lasige.di.fc.ul.pt/webtools/iice.
Structured prediction models for RNN based sequence labeling in clinical text.
Jagannatha, Abhyuday N; Yu, Hong
2016-11-01
Sequence labeling is a widely used method for named entity recognition and information extraction from unstructured natural language data. In clinical domain one major application of sequence labeling involves extraction of medical entities such as medication, indication, and side-effects from Electronic Health Record narratives. Sequence labeling in this domain, presents its own set of challenges and objectives. In this work we experimented with various CRF based structured learning models with Recurrent Neural Networks. We extend the previously studied LSTM-CRF models with explicit modeling of pairwise potentials. We also propose an approximate version of skip-chain CRF inference with RNN potentials. We use these methodologies for structured prediction in order to improve the exact phrase detection of various medical entities.
Structured prediction models for RNN based sequence labeling in clinical text
Jagannatha, Abhyuday N; Yu, Hong
2016-01-01
Sequence labeling is a widely used method for named entity recognition and information extraction from unstructured natural language data. In clinical domain one major application of sequence labeling involves extraction of medical entities such as medication, indication, and side-effects from Electronic Health Record narratives. Sequence labeling in this domain, presents its own set of challenges and objectives. In this work we experimented with various CRF based structured learning models with Recurrent Neural Networks. We extend the previously studied LSTM-CRF models with explicit modeling of pairwise potentials. We also propose an approximate version of skip-chain CRF inference with RNN potentials. We use these methodologies1 for structured prediction in order to improve the exact phrase detection of various medical entities. PMID:28004040
Open-Source Data Collection Techniques for Weapons Transfer Information
2012-03-01
IR Infrared ISO International Organization for Standardization ITAR International Traffic in Arms Regulations NER Named Entity Recognition NLP ...Control Protocol UAE United Arab Emirates URI Uniform Resource Identifier URL Uniform Resource Locator USSR Union of Soviet Socialist Republics UTF...KOREA, DEMOCRATIC PEOPLE’S REPUBLIC OF North Korea KOREA, REPUBLIC OF South Korea LIBYAN ARAB JAMAHIRIYA Libya RUSSIAN FEDERATION Russia Table 3
A system for de-identifying medical message board text.
Benton, Adrian; Hill, Shawndra; Ungar, Lyle; Chung, Annie; Leonard, Charles; Freeman, Cristin; Holmes, John H
2011-06-09
There are millions of public posts to medical message boards by users seeking support and information on a wide range of medical conditions. It has been shown that these posts can be used to gain a greater understanding of patients' experiences and concerns. As investigators continue to explore large corpora of medical discussion board data for research purposes, protecting the privacy of the members of these online communities becomes an important challenge that needs to be met. Extant entity recognition methods used for more structured text are not sufficient because message posts present additional challenges: the posts contain many typographical errors, larger variety of possible names, terms and abbreviations specific to Internet posts or a particular message board, and mentions of the authors' personal lives. The main contribution of this paper is a system to de-identify the authors of message board posts automatically, taking into account the aforementioned challenges. We demonstrate our system on two different message board corpora, one on breast cancer and another on arthritis. We show that our approach significantly outperforms other publicly available named entity recognition and de-identification systems, which have been tuned for more structured text like operative reports, pathology reports, discharge summaries, or newswire.
Applying Suffix Rules to Organization Name Recognition
NASA Astrophysics Data System (ADS)
Inui, Takashi; Murakami, Koji; Hashimoto, Taiichi; Utsumi, Kazuo; Ishikawa, Masamichi
This paper presents a method for boosting the performance of the organization name recognition, which is a part of named entity recognition (NER). Although gazetteers (lists of the NEs) have been known as one of the effective features for supervised machine learning approaches on the NER task, the previous methods which have applied the gazetteers to the NER were very simple. The gazetteers have been used just for searching the exact matches between input text and NEs included in them. The proposed method generates regular expression rules from gazetteers, and, with these rules, it can realize a high-coverage searches based on looser matches between input text and NEs. To generate these rules, we focus on the two well-known characteristics of NE expressions; 1) most of NE expressions can be divided into two parts, class-reference part and instance-reference part, 2) for most of NE expressions the class-reference parts are located at the suffix position of them. A pattern mining algorithm runs on the set of NEs in the gazetteers, and some frequent word sequences from which NEs are constructed are found. Then, we employ only word sequences which have the class-reference part at the suffix position as suffix rules. Experimental results showed that our proposed method improved the performance of the organization name recognition, and achieved the 84.58 F-value for evaluation data.
Gene/protein name recognition based on support vector machine using dictionary as features.
Mitsumori, Tomohiro; Fation, Sevrani; Murata, Masaki; Doi, Kouichi; Doi, Hirohumi
2005-01-01
Automated information extraction from biomedical literature is important because a vast amount of biomedical literature has been published. Recognition of the biomedical named entities is the first step in information extraction. We developed an automated recognition system based on the SVM algorithm and evaluated it in Task 1.A of BioCreAtIvE, a competition for automated gene/protein name recognition. In the work presented here, our recognition system uses the feature set of the word, the part-of-speech (POS), the orthography, the prefix, the suffix, and the preceding class. We call these features "internal resource features", i.e., features that can be found in the training data. Additionally, we consider the features of matching against dictionaries to be external resource features. We investigated and evaluated the effect of these features as well as the effect of tuning the parameters of the SVM algorithm. We found that the dictionary matching features contributed slightly to the improvement in the performance of the f-score. We attribute this to the possibility that the dictionary matching features might overlap with other features in the current multiple feature setting. During SVM learning, each feature alone had a marginally positive effect on system performance. This supports the fact that the SVM algorithm is robust on the high dimensionality of the feature vector space and means that feature selection is not required.
Boosting drug named entity recognition using an aggregate classifier.
Korkontzelos, Ioannis; Piliouras, Dimitrios; Dowsey, Andrew W; Ananiadou, Sophia
2015-10-01
Drug named entity recognition (NER) is a critical step for complex biomedical NLP tasks such as the extraction of pharmacogenomic, pharmacodynamic and pharmacokinetic parameters. Large quantities of high quality training data are almost always a prerequisite for employing supervised machine-learning techniques to achieve high classification performance. However, the human labour needed to produce and maintain such resources is a significant limitation. In this study, we improve the performance of drug NER without relying exclusively on manual annotations. We perform drug NER using either a small gold-standard corpus (120 abstracts) or no corpus at all. In our approach, we develop a voting system to combine a number of heterogeneous models, based on dictionary knowledge, gold-standard corpora and silver annotations, to enhance performance. To improve recall, we employed genetic programming to evolve 11 regular-expression patterns that capture common drug suffixes and used them as an extra means for recognition. Our approach uses a dictionary of drug names, i.e. DrugBank, a small manually annotated corpus, i.e. the pharmacokinetic corpus, and a part of the UKPMC database, as raw biomedical text. Gold-standard and silver annotated data are used to train maximum entropy and multinomial logistic regression classifiers. Aggregating drug NER methods, based on gold-standard annotations, dictionary knowledge and patterns, improved the performance on models trained on gold-standard annotations, only, achieving a maximum F-score of 95%. In addition, combining models trained on silver annotations, dictionary knowledge and patterns are shown to achieve comparable performance to models trained exclusively on gold-standard data. The main reason appears to be the morphological similarities shared among drug names. We conclude that gold-standard data are not a hard requirement for drug NER. Combining heterogeneous models build on dictionary knowledge can achieve similar or comparable classification performance with that of the best performing model trained on gold-standard annotations. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Network analysis of named entity co-occurrences in written texts
NASA Astrophysics Data System (ADS)
Amancio, Diego Raphael
2016-06-01
The use of methods borrowed from statistics and physics to analyze written texts has allowed the discovery of unprecedent patterns of human behavior and cognition by establishing links between models features and language structure. While current models have been useful to unveil patterns via analysis of syntactical and semantical networks, only a few works have probed the relevance of investigating the structure arising from the relationship between relevant entities such as characters, locations and organizations. In this study, we represent entities appearing in the same context as a co-occurrence network, where links are established according to a null model based on random, shuffled texts. Computational simulations performed in novels revealed that the proposed model displays interesting topological features, such as the small world feature, characterized by high values of clustering coefficient. The effectiveness of our model was verified in a practical pattern recognition task in real networks. When compared with traditional word adjacency networks, our model displayed optimized results in identifying unknown references in texts. Because the proposed representation plays a complementary role in characterizing unstructured documents via topological analysis of named entities, we believe that it could be useful to improve the characterization of written texts (and related systems), specially if combined with traditional approaches based on statistical and deeper paradigms.
Le, Hoang-Quynh; Tran, Mai-Vu; Dang, Thanh Hai; Ha, Quang-Thuy; Collier, Nigel
2016-07-01
The BioCreative V chemical-disease relation (CDR) track was proposed to accelerate the progress of text mining in facilitating integrative understanding of chemicals, diseases and their relations. In this article, we describe an extension of our system (namely UET-CAM) that participated in the BioCreative V CDR. The original UET-CAM system's performance was ranked fourth among 18 participating systems by the BioCreative CDR track committee. In the Disease Named Entity Recognition and Normalization (DNER) phase, our system employed joint inference (decoding) with a perceptron-based named entity recognizer (NER) and a back-off model with Semantic Supervised Indexing and Skip-gram for named entity normalization. In the chemical-induced disease (CID) relation extraction phase, we proposed a pipeline that includes a coreference resolution module and a Support Vector Machine relation extraction model. The former module utilized a multi-pass sieve to extend entity recall. In this article, the UET-CAM system was improved by adding a 'silver' CID corpus to train the prediction model. This silver standard corpus of more than 50 thousand sentences was automatically built based on the Comparative Toxicogenomics Database (CTD) database. We evaluated our method on the CDR test set. Results showed that our system could reach the state of the art performance with F1 of 82.44 for the DNER task and 58.90 for the CID task. Analysis demonstrated substantial benefits of both the multi-pass sieve coreference resolution method (F1 + 4.13%) and the silver CID corpus (F1 +7.3%).Database URL: SilverCID-The silver-standard corpus for CID relation extraction is freely online available at: https://zenodo.org/record/34530 (doi:10.5281/zenodo.34530). © The Author(s) 2016. Published by Oxford University Press.
NASA Astrophysics Data System (ADS)
Zhang, Xiaoli; Zou, Jie; Le, Daniel X.; Thoma, George
2010-01-01
"Investigator Names" is a newly required field in MEDLINE citations. It consists of personal names listed as members of corporate organizations in an article. Extracting investigator names automatically is necessary because of the increasing volume of articles reporting collaborative biomedical research in which a large number of investigators participate. In this paper, we present an SVM-based stacked sequential learning method in a novel application - recognizing named entities such as the first and last names of investigators from online medical journal articles. Stacked sequential learning is a meta-learning algorithm which can boost any base learner. It exploits contextual information by adding the predicted labels of the surrounding tokens as features. We apply this method to tag words in text paragraphs containing investigator names, and demonstrate that stacked sequential learning improves the performance of a nonsequential base learner such as an SVM classifier.
Zhao, Zhehuan; Yang, Zhihao; Luo, Ling; Wang, Lei; Zhang, Yin; Lin, Hongfei; Wang, Jian
2017-12-28
Automatic disease named entity recognition (DNER) is of utmost importance for development of more sophisticated BioNLP tools. However, most conventional CRF based DNER systems rely on well-designed features whose selection is labor intensive and time-consuming. Though most deep learning methods can solve NER problems with little feature engineering, they employ additional CRF layer to capture the correlation information between labels in neighborhoods which makes them much complicated. In this paper, we propose a novel multiple label convolutional neural network (MCNN) based disease NER approach. In this approach, instead of the CRF layer, a multiple label strategy (MLS) first introduced by us, is employed. First, the character-level embedding, word-level embedding and lexicon feature embedding are concatenated. Then several convolutional layers are stacked over the concatenated embedding. Finally, MLS strategy is applied to the output layer to capture the correlation information between neighboring labels. As shown by the experimental results, MCNN can achieve the state-of-the-art performance on both NCBI and CDR corpora. The proposed MCNN based disease NER method achieves the state-of-the-art performance with little feature engineering. And the experimental results show the MLS strategy's effectiveness of capturing the correlation information between labels in the neighborhood.
Basaruddin, T.
2016-01-01
One essential task in information extraction from the medical corpus is drug name recognition. Compared with text sources come from other domains, the medical text mining poses more challenges, for example, more unstructured text, the fast growing of new terms addition, a wide range of name variation for the same drug, the lack of labeled dataset sources and external knowledge, and the multiple token representations for a single drug name. Although many approaches have been proposed to overwhelm the task, some problems remained with poor F-score performance (less than 0.75). This paper presents a new treatment in data representation techniques to overcome some of those challenges. We propose three data representation techniques based on the characteristics of word distribution and word similarities as a result of word embedding training. The first technique is evaluated with the standard NN model, that is, MLP. The second technique involves two deep network classifiers, that is, DBN and SAE. The third technique represents the sentence as a sequence that is evaluated with a recurrent NN model, that is, LSTM. In extracting the drug name entities, the third technique gives the best F-score performance compared to the state of the art, with its average F-score being 0.8645. PMID:27843447
Recognition of a person named entity from the text written in a natural language
NASA Astrophysics Data System (ADS)
Dolbin, A. V.; Rozaliev, V. L.; Orlova, Y. A.
2017-01-01
This work is devoted to the semantic analysis of texts, which were written in a natural language. The main goal of the research was to compare latent Dirichlet allocation and latent semantic analysis to identify elements of the human appearance in the text. The completeness of information retrieval was chosen as the efficiency criteria for methods comparison. However, it was insufficient to choose only one method for achieving high recognition rates. Thus, additional methods were used for finding references to the personality in the text. All these methods are based on the created information model, which represents person’s appearance.
BioTextQuest: a web-based biomedical text mining suite for concept discovery.
Papanikolaou, Nikolas; Pafilis, Evangelos; Nikolaou, Stavros; Ouzounis, Christos A; Iliopoulos, Ioannis; Promponas, Vasilis J
2011-12-01
BioTextQuest combines automated discovery of significant terms in article clusters with structured knowledge annotation, via Named Entity Recognition services, offering interactive user-friendly visualization. A tag-cloud-based illustration of terms labeling each document cluster are semantically annotated according to the biological entity, and a list of document titles enable users to simultaneously compare terms and documents of each cluster, facilitating concept association and hypothesis generation. BioTextQuest allows customization of analysis parameters, e.g. clustering/stemming algorithms, exclusion of documents/significant terms, to better match the biological question addressed. http://biotextquest.biol.ucy.ac.cy vprobon@ucy.ac.cy; iliopj@med.uoc.gr Supplementary data are available at Bioinformatics online.
PKDE4J: Entity and relation extraction for public knowledge discovery.
Song, Min; Kim, Won Chul; Lee, Dahee; Heo, Go Eun; Kang, Keun Young
2015-10-01
Due to an enormous number of scientific publications that cannot be handled manually, there is a rising interest in text-mining techniques for automated information extraction, especially in the biomedical field. Such techniques provide effective means of information search, knowledge discovery, and hypothesis generation. Most previous studies have primarily focused on the design and performance improvement of either named entity recognition or relation extraction. In this paper, we present PKDE4J, a comprehensive text-mining system that integrates dictionary-based entity extraction and rule-based relation extraction in a highly flexible and extensible framework. Starting with the Stanford CoreNLP, we developed the system to cope with multiple types of entities and relations. The system also has fairly good performance in terms of accuracy as well as the ability to configure text-processing components. We demonstrate its competitive performance by evaluating it on many corpora and found that it surpasses existing systems with average F-measures of 85% for entity extraction and 81% for relation extraction. Copyright © 2015 Elsevier Inc. All rights reserved.
77 FR 70163 - Recognition of Entities for the Accreditation of Qualified Health Plans
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-23
... DEPARTMENT OF HEALTH AND HUMAN SERVICES [CMS-9961-N] Recognition of Entities for the Accreditation... as recognized accrediting entities for the purposes of fulfilling the accreditation requirement as... a recognized accrediting entity on a uniform timeline established by the applicable Exchange. On...
Handwritten-word spotting using biologically inspired features.
van der Zant, Tijn; Schomaker, Lambert; Haak, Koen
2008-11-01
For quick access to new handwritten collections, current handwriting recognition methods are too cumbersome. They cannot deal with the lack of labeled data and would require extensive laboratory training for each individual script, style, language and collection. We propose a biologically inspired whole-word recognition method which is used to incrementally elicit word labels in a live, web-based annotation system, named Monk. Since human labor should be minimized given the massive amount of image data, it becomes important to rely on robust perceptual mechanisms in the machine. Recent computational models of the neuro-physiology of vision are applied to isolated word classification. A primate cortex-like mechanism allows to classify text-images that have a low frequency of occurrence. Typically these images are the most difficult to retrieve and often contain named entities and are regarded as the most important to people. Usually standard pattern-recognition technology cannot deal with these text-images if there are not enough labeled instances. The results of this retrieval system are compared to normalized word-image matching and appear to be very promising.
Zhu, Qile; Li, Xiaolin; Conesa, Ana; Pereira, Cécile
2018-05-01
Best performing named entity recognition (NER) methods for biomedical literature are based on hand-crafted features or task-specific rules, which are costly to produce and difficult to generalize to other corpora. End-to-end neural networks achieve state-of-the-art performance without hand-crafted features and task-specific knowledge in non-biomedical NER tasks. However, in the biomedical domain, using the same architecture does not yield competitive performance compared with conventional machine learning models. We propose a novel end-to-end deep learning approach for biomedical NER tasks that leverages the local contexts based on n-gram character and word embeddings via Convolutional Neural Network (CNN). We call this approach GRAM-CNN. To automatically label a word, this method uses the local information around a word. Therefore, the GRAM-CNN method does not require any specific knowledge or feature engineering and can be theoretically applied to a wide range of existing NER problems. The GRAM-CNN approach was evaluated on three well-known biomedical datasets containing different BioNER entities. It obtained an F1-score of 87.26% on the Biocreative II dataset, 87.26% on the NCBI dataset and 72.57% on the JNLPBA dataset. Those results put GRAM-CNN in the lead of the biological NER methods. To the best of our knowledge, we are the first to apply CNN based structures to BioNER problems. The GRAM-CNN source code, datasets and pre-trained model are available online at: https://github.com/valdersoul/GRAM-CNN. andyli@ece.ufl.edu or aconesa@ufl.edu. Supplementary data are available at Bioinformatics online.
Zhu, Qile; Li, Xiaolin; Conesa, Ana; Pereira, Cécile
2018-01-01
Abstract Motivation Best performing named entity recognition (NER) methods for biomedical literature are based on hand-crafted features or task-specific rules, which are costly to produce and difficult to generalize to other corpora. End-to-end neural networks achieve state-of-the-art performance without hand-crafted features and task-specific knowledge in non-biomedical NER tasks. However, in the biomedical domain, using the same architecture does not yield competitive performance compared with conventional machine learning models. Results We propose a novel end-to-end deep learning approach for biomedical NER tasks that leverages the local contexts based on n-gram character and word embeddings via Convolutional Neural Network (CNN). We call this approach GRAM-CNN. To automatically label a word, this method uses the local information around a word. Therefore, the GRAM-CNN method does not require any specific knowledge or feature engineering and can be theoretically applied to a wide range of existing NER problems. The GRAM-CNN approach was evaluated on three well-known biomedical datasets containing different BioNER entities. It obtained an F1-score of 87.26% on the Biocreative II dataset, 87.26% on the NCBI dataset and 72.57% on the JNLPBA dataset. Those results put GRAM-CNN in the lead of the biological NER methods. To the best of our knowledge, we are the first to apply CNN based structures to BioNER problems. Availability and implementation The GRAM-CNN source code, datasets and pre-trained model are available online at: https://github.com/valdersoul/GRAM-CNN. Contact andyli@ece.ufl.edu or aconesa@ufl.edu Supplementary information Supplementary data are available at Bioinformatics online. PMID:29272325
Tsai, Richard Tzong-Han; Sung, Cheng-Lung; Dai, Hong-Jie; Hung, Hsieh-Chuan; Sung, Ting-Yi; Hsu, Wen-Lian
2006-12-18
Biomedical named entity recognition (Bio-NER) is a challenging problem because, in general, biomedical named entities of the same category (e.g., proteins and genes) do not follow one standard nomenclature. They have many irregularities and sometimes appear in ambiguous contexts. In recent years, machine-learning (ML) approaches have become increasingly common and now represent the cutting edge of Bio-NER technology. This paper addresses three problems faced by ML-based Bio-NER systems. First, most ML approaches usually employ singleton features that comprise one linguistic property (e.g., the current word is capitalized) and at least one class tag (e.g., B-protein, the beginning of a protein name). However, such features may be insufficient in cases where multiple properties must be considered. Adding conjunction features that contain multiple properties can be beneficial, but it would be infeasible to include all conjunction features in an NER model since memory resources are limited and some features are ineffective. To resolve the problem, we use a sequential forward search algorithm to select an effective set of features. Second, variations in the numerical parts of biomedical terms (e.g., "2" in the biomedical term IL2) cause data sparseness and generate many redundant features. In this case, we apply numerical normalization, which solves the problem by replacing all numerals in a term with one representative numeral to help classify named entities. Third, the assignment of NE tags does not depend solely on the target word's closest neighbors, but may depend on words outside the context window (e.g., a context window of five consists of the current word plus two preceding and two subsequent words). We use global patterns generated by the Smith-Waterman local alignment algorithm to identify such structures and modify the results of our ML-based tagger. This is called pattern-based post-processing. To develop our ML-based Bio-NER system, we employ conditional random fields, which have performed effectively in several well-known tasks, as our underlying ML model. Adding selected conjunction features, applying numerical normalization, and employing pattern-based post-processing improve the F-scores by 1.67%, 1.04%, and 0.57%, respectively. The combined increase of 3.28% yields a total score of 72.98%, which is better than the baseline system that only uses singleton features. We demonstrate the benefits of using the sequential forward search algorithm to select effective conjunction feature groups. In addition, we show that numerical normalization can effectively reduce the number of redundant and unseen features. Furthermore, the Smith-Waterman local alignment algorithm can help ML-based Bio-NER deal with difficult cases that need longer context windows.
Assessment of Orthographic Similarity of Drugs Names between Iran and Overseas Using the Solar Model
ABOLHASSANI, Nazanin; AKBARI SARI, Ali; RASHIDIAN, Arash; RASTEGARPANAH, Mansoor
2017-01-01
Background: The recognition of patient safety is now occupying a prominent place on the health policy agenda since medical errors can result in adverse events. The existence of confusing drug names is one of the most common causes of medication errors. In Iran, the General Office of Trademarks Registry (GOTR), for four years (2010–2014) was responsible for approving drug proprietary names. This study aimed to investigate the performance of the GOTR in terms of drug names orthographic similarity using the SOLAR model. Methods: First, 100 names were randomly selected from the GOTR’s database. Then, each name was searched through pharmaceutical websites including Martindale (the Complete Drug Reference published by Pharmaceutical Press), Drugs.com and Medicines Complete. Pair of drugs whose names look orthographically similar with different indications were identified. Then, the SOLAR model was utilized to determine orthographic similarity between all pair of drug names. Results: The mean of match values of these 100 pairs of drug was 77% indicating the high risk of similarity. The match value for most of the reviewed pairs (92%) was high (≥66%). This value was medium (≥ 33% and <66%) just for 8% of the pairs of drug. These results indicate high risk of confusion due to similarity of drug names. Conclusion: The stewardship of the GOTR in patient safety considerations is fundamentally problematic. Thus, as a best practice, we recommend that proprietary names of drugs be evaluated by an entity within the health system. While an entity within the health system should address patient safety considerations, the GOTR is responsible for intellectual property rights. PMID:29259940
Platz, T
1996-10-01
Somaesthetic, motor and cognitive functions were studied in a man with impaired tactile object-recognition (TOR) in his left hand due to a right parietal convexity meningeoma which had been surgically removed. Primary motor and somatosensory functions were not impaired, and discriminative abilities for various tactile aspects and cognitive skills were preserved. Nevertheless, the patient could often not appreciate the object's nature or significance when it was placed in his left hand and was unable to name or to describe or demonstrate the use of these objects. Therefore, he can be regarded as an example of associative tactile agnosia. The view is taken and elaborated that defective modality-specific meaning representations account for associative tactile agnosia. These meaning representations are conceptualized as learned unimodal feature-entity relationships which are thought to be defective in tactile agnosia. In line with this hypothesis, tactile feature analysis and cross-modal matching of features were largely preserved in the investigated patient, while combining features to form entities was defective in the tactile domain. The alternative hypothesis of agnosia as deficit of cross-modal association of features was not supported. The presumed distributed functional network responsible for TOR is thought to involve perception of features, object recognition and related tactile motor behaviour interactively. A deficit leading primarily to impaired combining features to form entities can therefore be expected to result in additional minor impairment of related perceptual-motor processes. Unilaterality of the gnostic deficit can be explained by a lateralized organization of the functional network responsible for tactile recognition of objects.
Towards an Obesity-Cancer Knowledge Base: Biomedical Entity Identification and Relation Detection
Lossio-Ventura, Juan Antonio; Hogan, William; Modave, François; Hicks, Amanda; Hanna, Josh; Guo, Yi; He, Zhe; Bian, Jiang
2017-01-01
Obesity is associated with increased risks of various types of cancer, as well as a wide range of other chronic diseases. On the other hand, access to health information activates patient participation, and improve their health outcomes. However, existing online information on obesity and its relationship to cancer is heterogeneous ranging from pre-clinical models and case studies to mere hypothesis-based scientific arguments. A formal knowledge representation (i.e., a semantic knowledge base) would help better organizing and delivering quality health information related to obesity and cancer that consumers need. Nevertheless, current ontologies describing obesity, cancer and related entities are not designed to guide automatic knowledge base construction from heterogeneous information sources. Thus, in this paper, we present methods for named-entity recognition (NER) to extract biomedical entities from scholarly articles and for detecting if two biomedical entities are related, with the long term goal of building a obesity-cancer knowledge base. We leverage both linguistic and statistical approaches in the NER task, which supersedes the state-of-the-art results. Further, based on statistical features extracted from the sentences, our method for relation detection obtains an accuracy of 99.3% and a f-measure of 0.993. PMID:28503356
García-Remesal, Miguel; García-Ruiz, Alejandro; Pérez-Rey, David; de la Iglesia, Diana; Maojo, Víctor
2013-01-01
Nanoinformatics is an emerging research field that uses informatics techniques to collect, process, store, and retrieve data, information, and knowledge on nanoparticles, nanomaterials, and nanodevices and their potential applications in health care. In this paper, we have focused on the solutions that nanoinformatics can provide to facilitate nanotoxicology research. For this, we have taken a computational approach to automatically recognize and extract nanotoxicology-related entities from the scientific literature. The desired entities belong to four different categories: nanoparticles, routes of exposure, toxic effects, and targets. The entity recognizer was trained using a corpus that we specifically created for this purpose and was validated by two nanomedicine/nanotoxicology experts. We evaluated the performance of our entity recognizer using 10-fold cross-validation. The precisions range from 87.6% (targets) to 93.0% (routes of exposure), while recall values range from 82.6% (routes of exposure) to 87.4% (toxic effects). These results prove the feasibility of using computational approaches to reliably perform different named entity recognition (NER)-dependent tasks, such as for instance augmented reading or semantic searches. This research is a "proof of concept" that can be expanded to stimulate further developments that could assist researchers in managing data, information, and knowledge at the nanolevel, thus accelerating research in nanomedicine.
Mining Adverse Drug Reactions in Social Media with Named Entity Recognition and Semantic Methods.
Chen, Xiaoyi; Deldossi, Myrtille; Aboukhamis, Rim; Faviez, Carole; Dahamna, Badisse; Karapetiantz, Pierre; Guenegou-Arnoux, Armelle; Girardeau, Yannick; Guillemin-Lanne, Sylvie; Lillo-Le-Louët, Agnès; Texier, Nathalie; Burgun, Anita; Katsahian, Sandrine
2017-01-01
Suspected adverse drug reactions (ADR) reported by patients through social media can be a complementary source to current pharmacovigilance systems. However, the performance of text mining tools applied to social media text data to discover ADRs needs to be evaluated. In this paper, we introduce the approach developed to mine ADR from French social media. A protocol of evaluation is highlighted, which includes a detailed sample size determination and evaluation corpus constitution. Our text mining approach provided very encouraging preliminary results with F-measures of 0.94 and 0.81 for recognition of drugs and symptoms respectively, and with F-measure of 0.70 for ADR detection. Therefore, this approach is promising for downstream pharmacovigilance analysis.
Durairaj, Vijayasarathi; Punnaivanam, Sankar
2015-09-01
Fundamental chemical entities are identified in the context of organic reactivity and classified as appropriate concept classes namely ElectronEntity, AtomEntity, AtomGroupEntity, FunctionalGroupEntity and MolecularEntity. The entity classes and their subclasses are organized into a chemical ontology named "ChemEnt" for the purpose of assertion, restriction and modification of properties through entity relations. Individual instances of entity classes are defined and encoded as a library of chemical entities in XML. The instances of entity classes are distinguished with a unique notation and identification values in order to map them with the ontology definitions. A model GUI named Entity Table is created to view graphical representations of all the entity instances. The detection of chemical entities in chemical structures is achieved through suitable algorithms. The possibility of asserting properties to the entities at different levels and the mechanism of property flow within the hierarchical entity levels is outlined. Copyright © 2015 Elsevier Inc. All rights reserved.
OSCAR4: a flexible architecture for chemical text-mining.
Jessop, David M; Adams, Sam E; Willighagen, Egon L; Hawizy, Lezan; Murray-Rust, Peter
2011-10-14
The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.
Code of Federal Regulations, 2013 CFR
2013-10-01
..., fiscal agents, and managed care entities provide the following disclosures: (1)(i) The name and address... entity, fiscal agent, or managed care entity. The address for corporate entities must include as... disclosing entity as a spouse, parent, child, or sibling. (3) The name of any other disclosing entity (or...
Seethala, Raja R; Stenman, Göran
2017-03-01
The salivary gland section in the 4th edition of the World Health Organization classification of head and neck tumors features the description and inclusion of several entities, the most significant of which is represented by (mammary analogue) secretory carcinoma. This entity was extracted mainly from acinic cell carcinoma based on recapitulation of breast secretory carcinoma and a shared ETV6-NTRK3 gene fusion. Also new is the subsection of "Other epithelial lesions," for which key entities include sclerosing polycystic adenosis and intercalated duct hyperplasia. Many entities have been compressed into their broader categories given clinical and morphologic similarities, or transitioned to a different grouping as was the case with low-grade cribriform cystadenocarcinoma reclassified as intraductal carcinoma (with the applied qualifier of low-grade). Specific grade has been removed from the names of the salivary gland entities such as polymorphous adenocarcinoma, providing pathologists flexibility in assigning grade and allowing for recognition of a broader spectrum within an entity. Cribriform adenocarcinoma of (minor) salivary gland origin continues to be divisive in terms of whether it should be recognized as a distinct category. This chapter also features new key concepts such as high-grade transformation. The new paradigm of translocations and gene fusions being common in salivary gland tumors is featured heavily in this chapter.
Montembeault, M; Brambati, S M; Joubert, S; Boukadi, M; Chapleau, M; Laforce, R Jr; Wilson, M A; Macoir, J; Rouleau, I
2017-01-27
While the semantic variant of primary progressive aphasia (svPPA) is characterized by a predominant semantic memory impairment, episodic memory impairments are the clinical hallmark of Alzheimer's disease (AD). However, AD patients also present with semantic deficits, which are more severe for semantically unique entities (e.g. a famous person) than for common concepts (e.g. a beaver). Previous studies in these patient populations have largely focused on famous-person naming. Therefore, we aimed to evaluate if these impairments also extend to other semantically unique entities such as famous places and famous logos. In this study, 13 AD patients, 9 svPPA patients, and 12 cognitively unimpaired elderly subjects (CTRL) were tested with a picture-naming test of non-unique entities (Boston Naming Test) and three experimental tests of semantically unique entities assessing naming of famous persons, places, and logos. Both clinical groups were overall more impaired at naming semantically unique entities than non-unique entities. Naming impairments in AD and svPPA extended to the other types of semantically unique entities, since a CTRL>AD>svPPA pattern was found on the performance of all naming tests. Naming famous places and famous persons appeared to be most impaired in svPPA, and both specific and general semantic knowledge for these entities were affected in these patients. Although AD patients were most significantly impaired on famous-person naming, only their specific semantic knowledge was impaired, while general knowledge was preserved. Post-hoc neuroimaging analyses also showed that famous-person naming impairments in AD correlated with atrophy in the temporo-parietal junction, a region functionally associated with lexical access. In line with previous studies, svPPA patients' impairment in both naming and semantic knowledge suggest a more profound semantic impairment, while naming impairments in AD may arise to a greater extent from impaired lexical access, even though semantic impairment for specific knowledge is also present. These results highlight the critical importance of developing and using a variety of semantically-unique-entity naming tests in neuropsychological assessments of patients with neurodegenerative diseases, which may unveil different patterns of lexical-semantic deficits. Copyright © 2016 Elsevier Ltd. All rights reserved.
Unsupervised Medical Entity Recognition and Linking in Chinese Online Medical Text
Gan, Liang; Cheng, Mian; Wu, Quanyuan
2018-01-01
Online medical text is full of references to medical entities (MEs), which are valuable in many applications, including medical knowledge-based (KB) construction, decision support systems, and the treatment of diseases. However, the diverse and ambiguous nature of the surface forms gives rise to a great difficulty for ME identification. Many existing solutions have focused on supervised approaches, which are often task-dependent. In other words, applying them to different kinds of corpora or identifying new entity categories requires major effort in data annotation and feature definition. In this paper, we propose unMERL, an unsupervised framework for recognizing and linking medical entities mentioned in Chinese online medical text. For ME recognition, unMERL first exploits a knowledge-driven approach to extract candidate entities from free text. Then, the categories of the candidate entities are determined using a distributed semantic-based approach. For ME linking, we propose a collaborative inference approach which takes full advantage of heterogenous entity knowledge and unstructured information in KB. Experimental results on real corpora demonstrate significant benefits compared to recent approaches with respect to both ME recognition and linking. PMID:29849994
LispSEI: The Programmer’s Manual
1988-01-01
defun print-entities ( str entities etype) (format t str ) (dolist (entity entities) (format t " -A" (entity-name entity *type)))) (detun entity-name...fields are munged only after the filters are executed. This makes things much easier. ;:Algorithm: (1) get initial list. (2) take out those entitles which...don’t meet all the constraints. 1, 3) pass the entities list through all the filters.(4) munge the appropriate fields (5)u return the result. (defn s
NELasso: Group-Sparse Modeling for Characterizing Relations Among Named Entities in News Articles.
Tariq, Amara; Karim, Asim; Foroosh, Hassan
2017-10-01
Named entities such as people, locations, and organizations play a vital role in characterizing online content. They often reflect information of interest and are frequently used in search queries. Although named entities can be detected reliably from textual content, extracting relations among them is more challenging, yet useful in various applications (e.g., news recommending systems). In this paper, we present a novel model and system for learning semantic relations among named entities from collections of news articles. We model each named entity occurrence with sparse structured logistic regression, and consider the words (predictors) to be grouped based on background semantics. This sparse group LASSO approach forces the weights of word groups that do not influence the prediction towards zero. The resulting sparse structure is utilized for defining the type and strength of relations. Our unsupervised system yields a named entities' network where each relation is typed, quantified, and characterized in context. These relations are the key to understanding news material over time and customizing newsfeeds for readers. Extensive evaluation of our system on articles from TIME magazine and BBC News shows that the learned relations correlate with static semantic relatedness measures like WLM, and capture the evolving relationships among named entities over time.
Department of Defense Data Model, Version 1, Fy 1998, Volume 6.
1998-05-31
Definition: A REQUIREMENT TO WITHHOLD PAYMENT ON A SPECIFIC CONTRACT. (5104) (1) (A) 138 Entity Report DOD Data Model VI FY98 Attribute Names...424 Entity Report DOD Data Model VI FY98 Entity Name: PAYMENT -MEANS-FINANCIAL-INSTITUTION-ACCOUNT Definition: THE ASSOCIATION OF A FINANCIAL...A) 453 Entity Report DOD Data Model VI FY98 Definition: PETITION FOR PAYMENT PRIOR TO PERFORMANCE BY A PERSONNEL-RESOURCE. Attribute Names
10 CFR 300.3 - Guidance for defining and naming the reporting entity.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 10 Energy 3 2013-01-01 2013-01-01 false Guidance for defining and naming the reporting entity. 300.3 Section 300.3 Energy DEPARTMENT OF ENERGY CLIMATE CHANGE VOLUNTARY GREENHOUSE GAS REPORTING PROGRAM: GENERAL GUIDELINES § 300.3 Guidance for defining and naming the reporting entity. (a) A reporting...
10 CFR 300.3 - Guidance for defining and naming the reporting entity.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 10 Energy 3 2012-01-01 2012-01-01 false Guidance for defining and naming the reporting entity. 300.3 Section 300.3 Energy DEPARTMENT OF ENERGY CLIMATE CHANGE VOLUNTARY GREENHOUSE GAS REPORTING PROGRAM: GENERAL GUIDELINES § 300.3 Guidance for defining and naming the reporting entity. (a) A reporting...
10 CFR 300.3 - Guidance for defining and naming the reporting entity.
Code of Federal Regulations, 2011 CFR
2011-01-01
... 10 Energy 3 2011-01-01 2011-01-01 false Guidance for defining and naming the reporting entity. 300.3 Section 300.3 Energy DEPARTMENT OF ENERGY CLIMATE CHANGE VOLUNTARY GREENHOUSE GAS REPORTING PROGRAM: GENERAL GUIDELINES § 300.3 Guidance for defining and naming the reporting entity. (a) A reporting...
10 CFR 300.3 - Guidance for defining and naming the reporting entity.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 10 Energy 3 2014-01-01 2014-01-01 false Guidance for defining and naming the reporting entity. 300.3 Section 300.3 Energy DEPARTMENT OF ENERGY CLIMATE CHANGE VOLUNTARY GREENHOUSE GAS REPORTING PROGRAM: GENERAL GUIDELINES § 300.3 Guidance for defining and naming the reporting entity. (a) A reporting...
10 CFR 300.3 - Guidance for defining and naming the reporting entity.
Code of Federal Regulations, 2010 CFR
2010-01-01
... 10 Energy 3 2010-01-01 2010-01-01 false Guidance for defining and naming the reporting entity. 300.3 Section 300.3 Energy DEPARTMENT OF ENERGY CLIMATE CHANGE VOLUNTARY GREENHOUSE GAS REPORTING PROGRAM: GENERAL GUIDELINES § 300.3 Guidance for defining and naming the reporting entity. (a) A reporting...
77 FR 61658 - Designation of Two Entities Pursuant to Executive Orders
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-10
... DEPARTMENT OF THE TREASURY Office of Foreign Assets Control Designation of Two Entities Pursuant... Treasury Department's Office of Foreign Assets Control (``OFAC'') is publishing the names of two entities....'' DATES: The designation by the Director of OFAC of the two entities named in this notice, pursuant to...
OSCAR4: a flexible architecture for chemical text-mining
2011-01-01
The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed. PMID:21999457
TwiMed: Twitter and PubMed Comparable Corpus of Drugs, Diseases, Symptoms, and Their Relations
Miyao, Yusuke; Collier, Nigel
2017-01-01
Background Work on pharmacovigilance systems using texts from PubMed and Twitter typically target at different elements and use different annotation guidelines resulting in a scenario where there is no comparable set of documents from both Twitter and PubMed annotated in the same manner. Objective This study aimed to provide a comparable corpus of texts from PubMed and Twitter that can be used to study drug reports from these two sources of information, allowing researchers in the area of pharmacovigilance using natural language processing (NLP) to perform experiments to better understand the similarities and differences between drug reports in Twitter and PubMed. Methods We produced a corpus comprising 1000 tweets and 1000 PubMed sentences selected using the same strategy and annotated at entity level by the same experts (pharmacists) using the same set of guidelines. Results The resulting corpus, annotated by two pharmacists, comprises semantically correct annotations for a set of drugs, diseases, and symptoms. This corpus contains the annotations for 3144 entities, 2749 relations, and 5003 attributes. Conclusions We present a corpus that is unique in its characteristics as this is the first corpus for pharmacovigilance curated from Twitter messages and PubMed sentences using the same data selection and annotation strategies. We believe this corpus will be of particular interest for researchers willing to compare results from pharmacovigilance systems (eg, classifiers and named entity recognition systems) when using data from Twitter and from PubMed. We hope that given the comprehensive set of drug names and the annotated entities and relations, this corpus becomes a standard resource to compare results from different pharmacovigilance studies in the area of NLP. PMID:28468748
TwiMed: Twitter and PubMed Comparable Corpus of Drugs, Diseases, Symptoms, and Their Relations.
Alvaro, Nestor; Miyao, Yusuke; Collier, Nigel
2017-05-03
Work on pharmacovigilance systems using texts from PubMed and Twitter typically target at different elements and use different annotation guidelines resulting in a scenario where there is no comparable set of documents from both Twitter and PubMed annotated in the same manner. This study aimed to provide a comparable corpus of texts from PubMed and Twitter that can be used to study drug reports from these two sources of information, allowing researchers in the area of pharmacovigilance using natural language processing (NLP) to perform experiments to better understand the similarities and differences between drug reports in Twitter and PubMed. We produced a corpus comprising 1000 tweets and 1000 PubMed sentences selected using the same strategy and annotated at entity level by the same experts (pharmacists) using the same set of guidelines. The resulting corpus, annotated by two pharmacists, comprises semantically correct annotations for a set of drugs, diseases, and symptoms. This corpus contains the annotations for 3144 entities, 2749 relations, and 5003 attributes. We present a corpus that is unique in its characteristics as this is the first corpus for pharmacovigilance curated from Twitter messages and PubMed sentences using the same data selection and annotation strategies. We believe this corpus will be of particular interest for researchers willing to compare results from pharmacovigilance systems (eg, classifiers and named entity recognition systems) when using data from Twitter and from PubMed. We hope that given the comprehensive set of drug names and the annotated entities and relations, this corpus becomes a standard resource to compare results from different pharmacovigilance studies in the area of NLP. ©Nestor Alvaro, Yusuke Miyao, Nigel Collier. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 03.05.2017.
Bruffaerts, Rose; De Weer, An-Sofie; De Grauwe, Sophie; Thys, Miek; Dries, Eva; Thijs, Vincent; Sunaert, Stefan; Vandenbulcke, Mathieu; De Deyne, Simon; Storms, Gerrit; Vandenberghe, Rik
2014-09-01
We investigated the critical contribution of right ventral occipitotemporal cortex to knowledge of visual and functional-associative attributes of biological and non-biological entities and how this relates to category-specificity during confrontation naming. In a consecutive series of 7 patients with lesions confined to right ventral occipitotemporal cortex, we conducted an extensive assessment of oral generation of visual-sensory and functional-associative features in response to the names of biological and nonbiological entities. Subjects also performed a confrontation naming task for these categories. Our main novel finding related to a unique case with a small lesion confined to right medial fusiform gyrus who showed disproportionate naming impairment for nonbiological versus biological entities, specifically for tools. Generation of visual and functional-associative features was preserved for biological and non-biological entities. In two other cases, who had a relatively small posterior lesion restricted to primary visual and posterior fusiform cortex, retrieval of visual attributes was disproportionately impaired compared to functional-associative attributes, in particular for biological entities. However, these cases did not show a category-specific naming deficit. Two final cases with the largest lesions showed a classical dissociation between biological versus nonbiological entities during naming, with normal feature generation performance. This is the first lesion-based evidence of a critical contribution of the right medial fusiform cortex to tool naming. Second, dissociations along the dimension of attribute type during feature generation do not co-occur with category-specificity during naming in the current patient sample. Copyright © 2014 Elsevier Ltd. All rights reserved.
Chen, Xiaoyi; Faviez, Carole; Schuck, Stéphane; Lillo-Le-Louët, Agnès; Texier, Nathalie; Dahamna, Badisse; Huot, Charles; Foulquié, Pierre; Pereira, Suzanne; Leroux, Vincent; Karapetiantz, Pierre; Guenegou-Arnoux, Armelle; Katsahian, Sandrine; Bousquet, Cédric; Burgun, Anita
2018-01-01
Background: The Food and Drug Administration (FDA) in the United States and the European Medicines Agency (EMA) have recognized social media as a new data source to strengthen their activities regarding drug safety. Objective: Our objective in the ADR-PRISM project was to provide text mining and visualization tools to explore a corpus of posts extracted from social media. We evaluated this approach on a corpus of 21 million posts from five patient forums, and conducted a qualitative analysis of the data available on methylphenidate in this corpus. Methods: We applied text mining methods based on named entity recognition and relation extraction in the corpus, followed by signal detection using proportional reporting ratio (PRR). We also used topic modeling based on the Correlated Topic Model to obtain the list of the matics in the corpus and classify the messages based on their topics. Results: We automatically identified 3443 posts about methylphenidate published between 2007 and 2016, among which 61 adverse drug reactions (ADR) were automatically detected. Two pharmacovigilance experts evaluated manually the quality of automatic identification, and a f-measure of 0.57 was reached. Patient's reports were mainly neuro-psychiatric effects. Applying PRR, 67% of the ADRs were signals, including most of the neuro-psychiatric symptoms but also palpitations. Topic modeling showed that the most represented topics were related to Childhood and Treatment initiation , but also Side effects . Cases of misuse were also identified in this corpus, including recreational use and abuse. Conclusion: Named entity recognition combined with signal detection and topic modeling have demonstrated their complementarity in mining social media data. An in-depth analysis focused on methylphenidate showed that this approach was able to detect potential signals and to provide better understanding of patients' behaviors regarding drugs, including misuse.
σ-Hole Bond vs π-Hole Bond: A Comparison Based on Halogen Bond.
Wang, Hui; Wang, Weizhou; Jin, Wei Jun
2016-05-11
The σ-hole and π-hole are the regions with positive surface electrostatic potential on the molecule entity; the former specifically refers to the positive region of a molecular entity along extension of the Y-Ge/P/Se/X covalent σ-bond (Y = electron-rich group; Ge/P/Se/X = Groups IV-VII), while the latter refers to the positive region in the direction perpendicular to the σ-framework of the molecular entity. The directional noncovalent interactions between the σ-hole or π-hole and the negative or electron-rich sites are named σ-hole bond or π-hole bond, respectively. The contributions from electrostatic, charge transfer, and other terms or Coulombic interaction to the σ-hole bond and π-hole bond were reviewed first followed by a brief discussion on the interplay between the σ-hole bond and the π-hole bond as well as application of the two types of noncovalent interactions in the field of anion recognition. It is expected that this review could stimulate further development of the σ-hole bond and π-hole bond in theoretical exploration and practical application in the future.
Cañada, Andres; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso
2017-01-01
Abstract A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes—CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es PMID:28531339
Context and Domain Knowledge Enhanced Entity Spotting in Informal Text
NASA Astrophysics Data System (ADS)
Gruhl, Daniel; Nagarajan, Meena; Pieper, Jan; Robson, Christine; Sheth, Amit
This paper explores the application of restricted relationship graphs (RDF) and statistical NLP techniques to improve named entity annotation in challenging Informal English domains. We validate our approach using on-line forums discussing popular music. Named entity annotation is particularly difficult in this domain because it is characterized by a large number of ambiguous entities, such as the Madonna album "Music" or Lilly Allen's pop hit "Smile".
Lindemann, Elizabeth A.; Chen, Elizabeth S.; Rajamani, Sripriya; Manohar, Nivedha; Wang, Yan; Melton, Genevieve B.
2017-01-01
There has been increasing recognition of the key role of social determinants like occupation on health. Given the relatively poor understanding of occupation information in electronic health records (EHRs), we sought to characterize occupation information within free-text clinical document sources. From six distinct clinical sources, 868 total occupation-related sentences were identified for the study corpus. Building off approaches from previous studies, refined annotation guidelines were created using the National Institute for Occupational Safety and Health Occupational Data for Health data model with elements added to increase granularity. Our corpus generated 2,005 total annotations representing 39 of 41 entity types from the enhanced data model. Highest frequency entities were: Occupation Description (17.7%); Employment Status – Not Specified (12.5%); Employer Name (11.0%); Subject (9.8%); Industry Description (6.2%). Our findings support the value for standardizing entry of EHR occupation information to improve data quality for improved patient care and secondary uses of this information. PMID:29295142
High-recall protein entity recognition using a dictionary
Kou, Zhenzhen; Cohen, William W.; Murphy, Robert F.
2010-01-01
Protein name extraction is an important step in mining biological literature. We describe two new methods for this task: semiCRFs and dictionary HMMs. SemiCRFs are a recently-proposed extension to conditional random fields that enables more effective use of dictionary information as features. Dictionary HMMs are a technique in which a dictionary is converted to a large HMM that recognizes phrases from the dictionary, as well as variations of these phrases. Standard training methods for HMMs can be used to learn which variants should be recognized. We compared the performance of our new approaches to that of Maximum Entropy (Max-Ent) and normal CRFs on three datasets, and improvement was obtained for all four methods over the best published results for two of the datasets. CRFs and semiCRFs achieved the highest overall performance according to the widely-used F-measure, while the dictionary HMMs performed the best at finding entities that actually appear in the dictionary—the measure of most interest in our intended application. PMID:15961466
12 CFR 1010.208 - General information.
Code of Federal Regulations, 2012 CFR
2012-01-01
... owner or developer are corporate entities, name the parent and/or corporate entity and state the... registration or prohibited sales, name the state involved and give the reasons cited by the state for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
12 CFR 1010.208 - General information.
Code of Federal Regulations, 2013 CFR
2013-01-01
... owner or developer are corporate entities, name the parent and/or corporate entity and state the... registration or prohibited sales, name the state involved and give the reasons cited by the state for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
2 CFR 170.110 - Types of entities to which this part applies.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 2 Grants and Agreements 1 2014-01-01 2014-01-01 false Types of entities to which this part applies... or receive agency awards; or (2) Receive subawards under those awards. (b) Exceptions. (1) None of... her name). (2) None of the requirements regarding reporting names and total compensation of an entity...
2 CFR 170.110 - Types of entities to which this part applies.
Code of Federal Regulations, 2011 CFR
2011-01-01
... 2 Grants and Agreements 1 2011-01-01 2011-01-01 false Types of entities to which this part applies... or receive agency awards; or (2) Receive subawards under those awards. (b) Exceptions. (1) None of... her name). (2) None of the requirements regarding reporting names and total compensation of an entity...
Detection of IUPAC and IUPAC-like chemical names.
Klinger, Roman; Kolárik, Corinna; Fluck, Juliane; Hofmann-Apitius, Martin; Friedrich, Christoph M
2008-07-01
Chemical compounds like small signal molecules or other biological active chemical substances are an important entity class in life science publications and patents. Several representations and nomenclatures for chemicals like SMILES, InChI, IUPAC or trivial names exist. Only SMILES and InChI names allow a direct structure search, but in biomedical texts trivial names and Iupac like names are used more frequent. While trivial names can be found with a dictionary-based approach and in such a way mapped to their corresponding structures, it is not possible to enumerate all IUPAC names. In this work, we present a new machine learning approach based on conditional random fields (CRF) to find mentions of IUPAC and IUPAC-like names in scientific text as well as its evaluation and the conversion rate with available name-to-structure tools. We present an IUPAC name recognizer with an F(1) measure of 85.6% on a MEDLINE corpus. The evaluation of different CRF orders and offset conjunction orders demonstrates the importance of these parameters. An evaluation of hand-selected patent sections containing large enumerations and terms with mixed nomenclature shows a good performance on these cases (F(1) measure 81.5%). Remaining recognition problems are to detect correct borders of the typically long terms, especially when occurring in parentheses or enumerations. We demonstrate the scalability of our implementation by providing results from a full MEDLINE run. We plan to publish the corpora, annotation guideline as well as the conditional random field model as a UIMA component.
17 CFR 229.1107 - (Item 1107) Issuing entities.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 17 Commodity and Securities Exchanges 2 2011-04-01 2011-04-01 false (Item 1107) Issuing entities....1107 (Item 1107) Issuing entities. Provide the following information about the issuing entity: (a) State the issuing entity's name and describe the issuing entity's form of organization, including the...
17 CFR 229.1107 - (Item 1107) Issuing entities.
Code of Federal Regulations, 2010 CFR
2010-04-01
... 17 Commodity and Securities Exchanges 2 2010-04-01 2010-04-01 false (Item 1107) Issuing entities....1107 (Item 1107) Issuing entities. Provide the following information about the issuing entity: (a) State the issuing entity's name and describe the issuing entity's form of organization, including the...
Use of Co-occurrences for Temporal Expressions Annotation
NASA Astrophysics Data System (ADS)
Craveiro, Olga; Macedo, Joaquim; Madeira, Henrique
The annotation or extraction of temporal information from text documents is becoming increasingly important in many natural language processing applications such as text summarization, information retrieval, question answering, etc.. This paper presents an original method for easy recognition of temporal expressions in text documents. The method creates semantically classified temporal patterns, using word co-occurrences obtained from training corpora and a pre-defined seed keywords set, derived from the used language temporal references. A participation on a Portuguese named entity evaluation contest showed promising effectiveness and efficiency results. This approach can be adapted to recognize other type of expressions or languages, within other contexts, by defining the suitable word sets and training corpora.
Maebayashi, Toshiya; Abe, Katsumi; Aizawa, Takuya; Sakaguchi, Masakuni; Ishibashi, Naoya; Abe, Osamu; Takayama, Tadatoshi; Nakayama, Hisashi; Matsuoka, Shunichi; Nirei, Kazushige; Nakamura, Hitomi; Ogawa, Masahiro; Sugitani, Masahiko
2015-05-07
A 58-year-old man presented with the chief complaint of abdominal bloating and was incidentally found to have a liver tumor. As diagnostic imaging studies could not rule out malignancy, the patient underwent partial resection of segment 3 of the liver. The lesion pathologically showed eosinophilic proliferation, in addition to immunohistochemical positivity for human melanoma black 45 and Melan-A, thereby leading to the diagnosis of a hepatic perivascular epithelioid cell tumor (PEComa). A PEComa arising from the liver is relatively rare. Moreover, the name 'PEComa' has not yet been widely recognized, and the same disease entity has been called epithelioid angiomyolipoma (EAML), further diminishing the recognition of PEComa. In addition, PEComa imaging findings mimic those of malignant liver tumors, and clinically, this tumor tends to enlarge. Therefore, a PEComa is difficult to diagnose. We conducted a systematic review of PEComa and EAML cases and discuss the results, including findings useful for differentiating perivascular epithelioid cell tumors from malignant liver tumors.
Maebayashi, Toshiya; Abe, Katsumi; Aizawa, Takuya; Sakaguchi, Masakuni; Ishibashi, Naoya; Abe, Osamu; Takayama, Tadatoshi; Nakayama, Hisashi; Matsuoka, Shunichi; Nirei, Kazushige; Nakamura, Hitomi; Ogawa, Masahiro; Sugitani, Masahiko
2015-01-01
A 58-year-old man presented with the chief complaint of abdominal bloating and was incidentally found to have a liver tumor. As diagnostic imaging studies could not rule out malignancy, the patient underwent partial resection of segment 3 of the liver. The lesion pathologically showed eosinophilic proliferation, in addition to immunohistochemical positivity for human melanoma black 45 and Melan-A, thereby leading to the diagnosis of a hepatic perivascular epithelioid cell tumor (PEComa). A PEComa arising from the liver is relatively rare. Moreover, the name ‘PEComa’ has not yet been widely recognized, and the same disease entity has been called epithelioid angiomyolipoma (EAML), further diminishing the recognition of PEComa. In addition, PEComa imaging findings mimic those of malignant liver tumors, and clinically, this tumor tends to enlarge. Therefore, a PEComa is difficult to diagnose. We conducted a systematic review of PEComa and EAML cases and discuss the results, including findings useful for differentiating perivascular epithelioid cell tumors from malignant liver tumors. PMID:25954119
Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin
2017-07-03
A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Impact of translation on named-entity recognition in radiology texts
Pedro, Vasco
2017-01-01
Abstract Radiology reports describe the results of radiography procedures and have the potential of being a useful source of information which can bring benefits to health care systems around the world. One way to automatically extract information from the reports is by using Text Mining tools. The problem is that these tools are mostly developed for English and reports are usually written in the native language of the radiologist, which is not necessarily English. This creates an obstacle to the sharing of Radiology information between different communities. This work explores the solution of translating the reports to English before applying the Text Mining tools, probing the question of what translation approach should be used. We created MRRAD (Multilingual Radiology Research Articles Dataset), a parallel corpus of Portuguese research articles related to Radiology and a number of alternative translations (human, automatic and semi-automatic) to English. This is a novel corpus which can be used to move forward the research on this topic. Using MRRAD we studied which kind of automatic or semi-automatic translation approach is more effective on the Named-entity recognition task of finding RadLex terms in the English version of the articles. Considering the terms extracted from human translations as our gold standard, we calculated how similar to this standard were the terms extracted using other translations. We found that a completely automatic translation approach using Google leads to F-scores (between 0.861 and 0.868, depending on the extraction approach) similar to the ones obtained through a more expensive semi-automatic translation approach using Unbabel (between 0.862 and 0.870). To better understand the results we also performed a qualitative analysis of the type of errors found in the automatic and semi-automatic translations. Database URL: https://github.com/lasigeBioTM/MRRAD PMID:29220455
Huang, Chung-Chi; Lu, Zhiyong
2016-01-01
Identifying relevant papers from the literature is a common task in biocuration. Most current biomedical literature search systems primarily rely on matching user keywords. Semantic search, on the other hand, seeks to improve search accuracy by understanding the entities and contextual relations in user keywords. However, past research has mostly focused on semantically identifying biological entities (e.g. chemicals, diseases and genes) with little effort on discovering semantic relations. In this work, we aim to discover biomedical semantic relations in PubMed queries in an automated and unsupervised fashion. Specifically, we focus on extracting and understanding the contextual information (or context patterns) that is used by PubMed users to represent semantic relations between entities such as ‘CHEMICAL-1 compared to CHEMICAL-2.’ With the advances in automatic named entity recognition, we first tag entities in PubMed queries and then use tagged entities as knowledge to recognize pattern semantics. More specifically, we transform PubMed queries into context patterns involving participating entities, which are subsequently projected to latent topics via latent semantic analysis (LSA) to avoid the data sparseness and specificity issues. Finally, we mine semantically similar contextual patterns or semantic relations based on LSA topic distributions. Our two separate evaluation experiments of chemical-chemical (CC) and chemical–disease (CD) relations show that the proposed approach significantly outperforms a baseline method, which simply measures pattern semantics by similarity in participating entities. The highest performance achieved by our approach is nearly 0.9 and 0.85 respectively for the CC and CD task when compared against the ground truth in terms of normalized discounted cumulative gain (nDCG), a standard measure of ranking quality. These results suggest that our approach can effectively identify and return related semantic patterns in a ranked order covering diverse bio-entity relations. To assess the potential utility of our automated top-ranked patterns of a given relation in semantic search, we performed a pilot study on frequently sought semantic relations in PubMed and observed improved literature retrieval effectiveness based on post-hoc human relevance evaluation. Further investigation in larger tests and in real-world scenarios is warranted. PMID:27016698
Detection of IUPAC and IUPAC-like chemical names
Klinger, Roman; Kolářik, Corinna; Fluck, Juliane; Hofmann-Apitius, Martin; Friedrich, Christoph M.
2008-01-01
Motivation: Chemical compounds like small signal molecules or other biological active chemical substances are an important entity class in life science publications and patents. Several representations and nomenclatures for chemicals like SMILES, InChI, IUPAC or trivial names exist. Only SMILES and InChI names allow a direct structure search, but in biomedical texts trivial names and Iupac like names are used more frequent. While trivial names can be found with a dictionary-based approach and in such a way mapped to their corresponding structures, it is not possible to enumerate all IUPAC names. In this work, we present a new machine learning approach based on conditional random fields (CRF) to find mentions of IUPAC and IUPAC-like names in scientific text as well as its evaluation and the conversion rate with available name-to-structure tools. Results: We present an IUPAC name recognizer with an F1 measure of 85.6% on a MEDLINE corpus. The evaluation of different CRF orders and offset conjunction orders demonstrates the importance of these parameters. An evaluation of hand-selected patent sections containing large enumerations and terms with mixed nomenclature shows a good performance on these cases (F1 measure 81.5%). Remaining recognition problems are to detect correct borders of the typically long terms, especially when occurring in parentheses or enumerations. We demonstrate the scalability of our implementation by providing results from a full MEDLINE run. Availability: We plan to publish the corpora, annotation guideline as well as the conditional random field model as a UIMA component. Contact: roman.klinger@scai.fraunhofer.de PMID:18586724
CD-REST: a system for extracting chemical-induced disease relation in literature.
Xu, Jun; Wu, Yonghui; Zhang, Yaoyun; Wang, Jingqi; Lee, Hee-Jin; Xu, Hua
2016-01-01
Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In this article, we present our participation system Chemical Disease Relation Extraction SysTem (CD-REST), an end-to-end system for extracting chemical-induced disease relations in biomedical literature. CD-REST consists of two main components: (1) a chemical and disease named entity recognition and normalization module, which employs the Conditional Random Fields algorithm for entity recognition and a Vector Space Model-based approach for normalization; and (2) a relation extraction module that classifies both sentence-level and document-level candidate drug-disease pairs by support vector machines. Our system achieved the best performance on the chemical-induced disease relation extraction subtask in the BioCreative V CDR Track, demonstrating the effectiveness of our proposed machine learning-based approaches for automatic extraction of chemical-induced disease relations in biomedical literature. The CD-REST system provides web services using HTTP POST request. The web services can be accessed fromhttp://clinicalnlptool.com/cdr The online CD-REST demonstration system is available athttp://clinicalnlptool.com/cdr/cdr.html. Database URL:http://clinicalnlptool.com/cdr;http://clinicalnlptool.com/cdr/cdr.html. © The Author(s) 2016. Published by Oxford University Press.
SWARMs Ontology: A Common Information Model for the Cooperation of Underwater Robots.
Li, Xin; Bilbao, Sonia; Martín-Wanton, Tamara; Bastos, Joaquim; Rodriguez, Jonathan
2017-03-11
In order to facilitate cooperation between underwater robots, it is a must for robots to exchange information with unambiguous meaning. However, heterogeneity, existing in information pertaining to different robots, is a major obstruction. Therefore, this paper presents a networked ontology, named the Smart and Networking Underwater Robots in Cooperation Meshes (SWARMs) ontology, to address information heterogeneity and enable robots to have the same understanding of exchanged information. The SWARMs ontology uses a core ontology to interrelate a set of domain-specific ontologies, including the mission and planning, the robotic vehicle, the communication and networking, and the environment recognition and sensing ontology. In addition, the SWARMs ontology utilizes ontology constructs defined in the PR-OWL ontology to annotate context uncertainty based on the Multi-Entity Bayesian Network (MEBN) theory. Thus, the SWARMs ontology can provide both a formal specification for information that is necessarily exchanged between robots and a command and control entity, and also support for uncertainty reasoning. A scenario on chemical pollution monitoring is described and used to showcase how the SWARMs ontology can be instantiated, be extended, represent context uncertainty, and support uncertainty reasoning.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-05-17
... DEPARTMENT OF THE TREASURY Office of Foreign Assets Control Identification of Three Entities as Government of Libya Entities Pursuant to Executive Order 13566 AGENCY: Department of the Treasury. ACTION... names of three entities identified on May 5, 2011 as persons whose property and interests in property...
Jauregi Unanue, Iñigo; Zare Borzeshi, Ehsan; Piccardi, Massimo
2017-12-01
Previous state-of-the-art systems on Drug Name Recognition (DNR) and Clinical Concept Extraction (CCE) have focused on a combination of text "feature engineering" and conventional machine learning algorithms such as conditional random fields and support vector machines. However, developing good features is inherently heavily time-consuming. Conversely, more modern machine learning approaches such as recurrent neural networks (RNNs) have proved capable of automatically learning effective features from either random assignments or automated word "embeddings". (i) To create a highly accurate DNR and CCE system that avoids conventional, time-consuming feature engineering. (ii) To create richer, more specialized word embeddings by using health domain datasets such as MIMIC-III. (iii) To evaluate our systems over three contemporary datasets. Two deep learning methods, namely the Bidirectional LSTM and the Bidirectional LSTM-CRF, are evaluated. A CRF model is set as the baseline to compare the deep learning systems to a traditional machine learning approach. The same features are used for all the models. We have obtained the best results with the Bidirectional LSTM-CRF model, which has outperformed all previously proposed systems. The specialized embeddings have helped to cover unusual words in DrugBank and MedLine, but not in the i2b2/VA dataset. We present a state-of-the-art system for DNR and CCE. Automated word embeddings has allowed us to avoid costly feature engineering and achieve higher accuracy. Nevertheless, the embeddings need to be retrained over datasets that are adequate for the domain, in order to adequately cover the domain-specific vocabulary. Copyright © 2017 Elsevier Inc. All rights reserved.
50 CFR 679.81 - Rockfish Program annual harvester and processor privileges.
Code of Federal Regulations, 2010 CFR
2010-10-01
... legal name; the type of business entity under which the rockfish cooperative is organized; the state in which the rockfish cooperative is legally registered as a business entity; Tax ID number, date of incorporation, the printed name of the rockfish cooperative's designated representative; the permanent business...
The left temporal pole is a heteromodal hub for retrieving proper names
Waldron, Eric J.; Manzel, Kenneth; Tranel, Daniel
2015-01-01
The left temporal pole (LTP) has been posited to be a heteromodal hub for retrieving proper names for semantically unique entities. Previous investigations have demonstrated that LTP is important for retrieving names for famous faces and unique landmarks. However, whether such a relationship would hold for unique entities apprehended through stimulus modalities other than vision has not been well established, and such evidence is critical to adjudicate claims about the “heteromodal” nature of the LTP. Here, we tested the hypothesis that the LTP would be important for naming famous voices. Individuals with LTP lesions were asked to recognize and name famous persons speaking in audio clips. Relative to neurologically normal and brain damaged comparison participants, patients with LTP lesions were able to recognize famous persons from their voices normally, but were selectively impaired in naming famous persons from their voices. The current results extend previous research and provide further support for the notion that the LTP is a convergence region serving as a heteromodal hub for retrieving the names of semantically unique entities. PMID:24389260
Leung, Tiffany I; Dumontier, Michel
2016-06-08
Clinical practice guidelines (CPGs) recommend pharmacologic treatments for clinical conditions, and drug structured product labels (SPLs) summarize approved treatment indications. Both resources are intended to promote evidence-based medical practices and guide clinicians' prescribing decisions. However, it is unclear how well CPG recommendations about pharmacologic therapies match SPL indications for recommended drugs. In this study, we perform text mining of CPG summaries to examine drug-disease associations in CPG recommendations and in SPL treatment indications for 15 common chronic conditions. We constructed an initial text corpus of guideline summaries from the National Guideline Clearinghouse (NGC) from a set of manually selected ICD-9 codes for each of the 15 conditions. We obtained 377 relevant guideline summaries and their Major Recommendations section, which excludes guidelines for pediatric patients, pregnant or breastfeeding women, or for medical diagnoses not meeting inclusion criteria. A vocabulary of drug terms was derived from five medical taxonomies. We used named entity recognition, in combination with dictionary-based and ontology-based methods, to identify drug term occurrences in the text corpus and construct drug-disease associations. The ATC (Anatomical Therapeutic Chemical Classification) was utilized to perform drug name and drug class matching to construct the drug-disease associations from CPGs. We then obtained drug-disease associations from SPLs using conditions mentioned in their Indications section in SIDER. The primary outcomes were the frequency of drug-disease associations in CPGs and SPLs, and the frequency of overlap between the two sets of drug-disease associations, with and without using taxonomic information from ATC. Without taxonomic information, we identified 1444 drug-disease associations across CPGs and SPLs for 15 common chronic conditions. Of these, 195 drug-disease associations overlapped between CPGs and SPLs, 917 associations occurred in CPGs only and 332 associations occurred in SPLs only. With taxonomic information, 859 unique drug-disease associations were identified, of which 152 of these drug-disease associations overlapped between CPGs and SPLs, 541 associations occurred in CPGs only, and 166 associations occurred in SPLs only. Our results suggest that CPG-recommended pharmacologic therapies and SPL indications do not overlap frequently when identifying drug-disease associations using named entity recognition, although incorporating taxonomic relationships between drug names and drug classes into the approach improves the overlap. This has important implications in practice because conflicting or inconsistent evidence may complicate clinical decision making and implementation or measurement of best practices.
76 FR 52384 - Designation of Additional Entities Pursuant to Executive Order 13405
Federal Register 2010, 2011, 2012, 2013, 2014
2011-08-22
... DEPARTMENT OF THE TREASURY Office of Foreign Assets Control Designation of Additional Entities... Assets Control (``OFAC'') is publishing the names of four newly-designated entities whose property and... the Director of OFAC of the four entities identified in this notice, pursuant to Executive [[Page...
31 CFR 306.88 - Political entities and public corporations.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 31 Money and Finance:Treasury 2 2011-07-01 2011-07-01 false Political entities and public... entities and public corporations. Securities registered in the name of, or assigned to, a State, county, city, town, village, school district or other political entity, public body or corporation, may be...
31 CFR 306.88 - Political entities and public corporations.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 31 Money and Finance: Treasury 2 2010-07-01 2010-07-01 false Political entities and public... entities and public corporations. Securities registered in the name of, or assigned to, a State, county, city, town, village, school district or other political entity, public body or corporation, may be...
Cloud Computing in Higher Education Sector for Sustainable Development
ERIC Educational Resources Information Center
Duan, Yuchao
2016-01-01
Cloud computing is considered a new frontier in the field of computing, as this technology comprises three major entities namely: software, hardware and network. The collective nature of all these entities is known as the Cloud. This research aims to examine the impacts of various aspects namely: cloud computing, sustainability, performance…
Training the max-margin sequence model with the relaxed slack variables.
Niu, Lingfeng; Wu, Jianmin; Shi, Yong
2012-09-01
Sequence models are widely used in many applications such as natural language processing, information extraction and optical character recognition, etc. We propose a new approach to train the max-margin based sequence model by relaxing the slack variables in this paper. With the canonical feature mapping definition, the relaxed problem is solved by training a multiclass Support Vector Machine (SVM). Compared with the state-of-the-art solutions for the sequence learning, the new method has the following advantages: firstly, the sequence training problem is transformed into a multiclassification problem, which is more widely studied and already has quite a few off-the-shelf training packages; secondly, this new approach reduces the complexity of training significantly and achieves comparable prediction performance compared with the existing sequence models; thirdly, when the size of training data is limited, by assigning different slack variables to different microlabel pairs, the new method can use the discriminative information more frugally and produces more reliable model; last but not least, by employing kernels in the intermediate multiclass SVM, nonlinear feature space can be easily explored. Experimental results on the task of named entity recognition, information extraction and handwritten letter recognition with the public datasets illustrate the efficiency and effectiveness of our method. Copyright © 2012 Elsevier Ltd. All rights reserved.
77 FR 31806 - Changes to Implement Micro Entity Status for Paying Patent Fees
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-30
... legislative history of 35 U.S.C. 123 is clear that it is directed to a subset of small entities, namely... history do not, for example, contemplate a for-profit, large entity applicant becoming a ``micro entity... across government agencies and identified goals designed to promote innovation; (8) considered approaches...
Effects of Minority Status on Facial Recognition and Naming Performance.
ERIC Educational Resources Information Center
Roberts, Richard J.; Hamsher, Kerry
1984-01-01
Examined the differential effects of minority status in Blacks (N=94) on a facial recognition test and a naming test. Results showed that performance on the facial recognition test was relatively free of racial bias, but this was not the case for visual naming. (LLL)
2 CFR 170.110 - Types of entities to which this part applies.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 2 Grants and Agreements 1 2013-01-01 2013-01-01 false Types of entities to which this part applies...) Apply for or receive agency awards; or (2) Receive subawards under those awards. (b) Exceptions. (1... his or her name). (2) None of the requirements regarding reporting names and total compensation of an...
2 CFR 170.110 - Types of entities to which this part applies.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 2 Grants and Agreements 1 2012-01-01 2012-01-01 false Types of entities to which this part applies...) Apply for or receive agency awards; or (2) Receive subawards under those awards. (b) Exceptions. (1... his or her name). (2) None of the requirements regarding reporting names and total compensation of an...
Improving Information Extraction and Translation Using Component Interactions
2008-01-01
74 7. CASE STUDY ON MONOLINGUAL INTERACTION.....................................................................76 7.1 IMPROVING NAME TAGGING BY...interactions described above focused on the monolingual analysis pipeline. (Huang and Vogel, 2002) presented a cross-lingual joint inference example to...improve the extracted named entity translation dictionary and the entity annotation in a bilingual 22 training corpus. They used a more
Discovering latent commercial networks from online financial news articles
NASA Astrophysics Data System (ADS)
Xia, Yunqing; Su, Weifeng; Lau, Raymond Y. K.; Liu, Yi
2013-08-01
Unlike most online social networks where explicit links among individual users are defined, the relations among commercial entities (e.g. firms) may not be explicitly declared in commercial Web sites. One main contribution of this article is the development of a novel computational model for the discovery of the latent relations among commercial entities from online financial news. More specifically, a CRF model which can exploit both structural and contextual features is applied to commercial entity recognition. In addition, a point-wise mutual information (PMI)-based unsupervised learning method is developed for commercial relation identification. To evaluate the effectiveness of the proposed computational methods, a prototype system called CoNet has been developed. Based on the financial news articles crawled from Google finance, the CoNet system achieves average F-scores of 0.681 and 0.754 in commercial entity recognition and commercial relation identification, respectively. Our experimental results confirm that the proposed shallow natural language processing methods are effective for the discovery of latent commercial networks from online financial news.
Aggregating and Predicting Sequence Labels from Crowd Annotations
Nguyen, An T.; Wallace, Byron C.; Li, Junyi Jessy; Nenkova, Ani; Lease, Matthew
2017-01-01
Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unannotated text, we propose a neural approach using Long Short Term Memory. We evaluate a suite of methods across two different applications and text genres: Named-Entity Recognition in news articles and Information Extraction from biomedical abstracts. Results show improvement over strong baselines. Our source code and data are available online1. PMID:29093611
Finding Related Entities by Retrieving Relations: UIUC at TREC 2009 Entity Track
2009-11-01
classes, depending on the categories they belong to. A music album could have any generic name, whereas a laptop model has a more generalizable name. A...names of music albums are simply plain text often capitalized, and so on. Thus, we feel that a better ap- proach would be to first identify the...origin domain of the text to be tagged (e.g., pharmaceutical, music , journal, etc.), and then apply tagging rules that are specific to that domain
Anatomical entity mention recognition at literature scale
Pyysalo, Sampo; Ananiadou, Sophia
2014-01-01
Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced. Results: We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of Unified Medical Language System (UMLS)- and Open Biomedical Ontologies (OBO)-based lexical resources, word representations induced from unlabeled text, statistical truecasing and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire open access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PubMed Central literature database. Availability and implementation: All tools and resources introduced in this work are available from http://nactem.ac.uk/anatomytagger. Contact: sophia.ananiadou@manchester.ac.uk Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:24162468
Huang, Chung-Chi; Lu, Zhiyong
2016-01-01
Identifying relevant papers from the literature is a common task in biocuration. Most current biomedical literature search systems primarily rely on matching user keywords. Semantic search, on the other hand, seeks to improve search accuracy by understanding the entities and contextual relations in user keywords. However, past research has mostly focused on semantically identifying biological entities (e.g. chemicals, diseases and genes) with little effort on discovering semantic relations. In this work, we aim to discover biomedical semantic relations in PubMed queries in an automated and unsupervised fashion. Specifically, we focus on extracting and understanding the contextual information (or context patterns) that is used by PubMed users to represent semantic relations between entities such as 'CHEMICAL-1 compared to CHEMICAL-2' With the advances in automatic named entity recognition, we first tag entities in PubMed queries and then use tagged entities as knowledge to recognize pattern semantics. More specifically, we transform PubMed queries into context patterns involving participating entities, which are subsequently projected to latent topics via latent semantic analysis (LSA) to avoid the data sparseness and specificity issues. Finally, we mine semantically similar contextual patterns or semantic relations based on LSA topic distributions. Our two separate evaluation experiments of chemical-chemical (CC) and chemical-disease (CD) relations show that the proposed approach significantly outperforms a baseline method, which simply measures pattern semantics by similarity in participating entities. The highest performance achieved by our approach is nearly 0.9 and 0.85 respectively for the CC and CD task when compared against the ground truth in terms of normalized discounted cumulative gain (nDCG), a standard measure of ranking quality. These results suggest that our approach can effectively identify and return related semantic patterns in a ranked order covering diverse bio-entity relations. To assess the potential utility of our automated top-ranked patterns of a given relation in semantic search, we performed a pilot study on frequently sought semantic relations in PubMed and observed improved literature retrieval effectiveness based on post-hoc human relevance evaluation. Further investigation in larger tests and in real-world scenarios is warranted. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.
Schedl, Markus
2012-01-01
Different term weighting techniques such as [Formula: see text] or BM25 have been used intensely for manifold text-based information retrieval tasks. Their use for modeling term profiles for named entities and subsequent calculation of similarities between these named entities have been studied to a much smaller extent. The recent trend of microblogging made available massive amounts of information about almost every topic around the world. Therefore, microblogs represent a valuable source for text-based named entity modeling. In this paper, we present a systematic and comprehensive evaluation of different term weighting measures , normalization techniques , query schemes , index term sets , and similarity functions for the task of inferring similarities between named entities, based on data extracted from microblog posts . We analyze several thousand combinations of choices for the above mentioned dimensions, which influence the similarity calculation process, and we investigate in which way they impact the quality of the similarity estimates. Evaluation is performed using three real-world data sets: two collections of microblogs related to music artists and one related to movies. For the music collections, we present results of genre classification experiments using as benchmark genre information from allmusic.com. For the movie collection, we present results of multi-class classification experiments using as benchmark categories from IMDb. We show that microblogs can indeed be exploited to model named entity similarity with remarkable accuracy, provided the correct settings for the analyzed aspects are used. We further compare the results to those obtained when using Web pages as data source.
Recognition of Famous Names in Psychology by Students and Staff.
ERIC Educational Resources Information Center
Bunnell, Julie K.
1992-01-01
Presents results of a name recognition questionnaire testing the historical awareness of psychology majors and faculty members. Reports that students showed a low level of name recognition prior to taking a course in the history of psychology. Concludes that explicit instruction is required to impart knowledge of the history of the discipline. (DK)
SWARMs Ontology: A Common Information Model for the Cooperation of Underwater Robots
Li, Xin; Bilbao, Sonia; Martín-Wanton, Tamara; Bastos, Joaquim; Rodriguez, Jonathan
2017-01-01
In order to facilitate cooperation between underwater robots, it is a must for robots to exchange information with unambiguous meaning. However, heterogeneity, existing in information pertaining to different robots, is a major obstruction. Therefore, this paper presents a networked ontology, named the Smart and Networking Underwater Robots in Cooperation Meshes (SWARMs) ontology, to address information heterogeneity and enable robots to have the same understanding of exchanged information. The SWARMs ontology uses a core ontology to interrelate a set of domain-specific ontologies, including the mission and planning, the robotic vehicle, the communication and networking, and the environment recognition and sensing ontology. In addition, the SWARMs ontology utilizes ontology constructs defined in the PR-OWL ontology to annotate context uncertainty based on the Multi-Entity Bayesian Network (MEBN) theory. Thus, the SWARMs ontology can provide both a formal specification for information that is necessarily exchanged between robots and a command and control entity, and also support for uncertainty reasoning. A scenario on chemical pollution monitoring is described and used to showcase how the SWARMs ontology can be instantiated, be extended, represent context uncertainty, and support uncertainty reasoning. PMID:28287468
2012-07-20
This final rule establishes data collection standards necessary to implement aspects of section 1302 of the Patient Protection and Affordable Care Act (Affordable Care Act), which directs the Secretary of Health and Human Services to define essential health benefits. This final rule outlines the data on applicable plans to be collected from certain issuers to support the definition of essential health benefits. This final rule also establishes a process for the recognition of accrediting entities for purposes of certification of qualified health plans.
Code of Federal Regulations, 2010 CFR
2010-07-01
... of capital; and contracts evidencing the sale of the entity to its new owners. (b) Any continuing... narcotics trafficker could lead to designation of the purchaser. Mere change in name of an entity will not...
On the history of lacunes, etat criblé, and the white matter lesions of vascular dementia.
Román, Gustavo C
2002-01-01
The history of lesions associated with vascular dementia (17th to 19th century) is reviewed. Recognition of ischemic and hemorrhagic stroke types dates back to the 17th century; however, at that time a third type ('cerebral congestion') emerged as the most common form of apoplexy. This entity vanished as arterial hypertension became established with the introduction of the sphygmomanometer (1905). Before the 19th century, apoplexy was considered a uniformly fatal disease, although Willis first recognized post-stroke dementia in 1672. Dechambre (1838) first reported 'lacunes' in stroke survivors with small cerebral softenings. Durand-Fardel (1842) described interstitial atrophy of the brain (leukoaraiosis) and état criblé (cribriform state) reflecting chronic cerebral congestion. In 1894, Alzheimer and Binswanger identified 'arteriosclerotic brain atrophy,' a form of vascular dementia characterized by 'miliary apoplexies' (lacunes). Also in 1894, Binswanger described the disease that now bears his name. In 1901, Pierre Marie coined the name état lacunaire (lacunar state) for the clinical syndrome of elderly patients with multiple lacunes. Copyright 2002 S. Karger AG, Basel
Binding Affinity of Glycoconjugates to BACILLUS Spores and Toxins
NASA Astrophysics Data System (ADS)
Rasol, Aveen; Eassa, Souzan; Tarasenko, Olga
2010-04-01
Early recognition of Bacillus cereus group species is important since they can cause food-borne illnesses and deadly diseases in humans. Glycoconjugates (GCs) are carbohydrates covalently linked to non-sugar moieties including lipids, proteins or other entities. GCs are involved in recognition and signaling processes intrinsic to biochemical functions in cells. They also stimulate cell-cell adhesion and subsequent recognition and activation of receptors. We have demonstrated that GCs are involved in Bacillus cereus spore recognition. In the present study, we have investigated whether GCs possess the ability to bind and recognize B. cereus spores and Bacillus anthracis recombinant single toxins (sTX) and complex toxins (cTX). The affinity of GCs to spores + sTX and spores + cTX toxins was studied in the binding essay. Our results demonstrated that GC9 and GC10 were able to selectively bind to B. cereus spores and B. anthracis toxins. Different binding affinities for GCs were found toward Bacillus cereus spores + sTX and spores + cTX. Dilution of GCs does not impede the recognition and binding. Developed method provides a tool for simultaneous recognition and targeting of spores, bacteria toxins, and/or other entities.
BioCreative V CDR task corpus: a resource for chemical disease relation extraction.
Li, Jiao; Sun, Yueping; Johnson, Robin J; Sciaky, Daniela; Wei, Chih-Hsuan; Leaman, Robert; Davis, Allan Peter; Mattingly, Carolyn J; Wiegers, Thomas C; Lu, Zhiyong
2016-01-01
Community-run, formal evaluations and manually annotated text corpora are critically important for advancing biomedical text-mining research. Recently in BioCreative V, a new challenge was organized for the tasks of disease named entity recognition (DNER) and chemical-induced disease (CID) relation extraction. Given the nature of both tasks, a test collection is required to contain both disease/chemical annotations and relation annotations in the same set of articles. Despite previous efforts in biomedical corpus construction, none was found to be sufficient for the task. Thus, we developed our own corpus called BC5CDR during the challenge by inviting a team of Medical Subject Headings (MeSH) indexers for disease/chemical entity annotation and Comparative Toxicogenomics Database (CTD) curators for CID relation annotation. To ensure high annotation quality and productivity, detailed annotation guidelines and automatic annotation tools were provided. The resulting BC5CDR corpus consists of 1500 PubMed articles with 4409 annotated chemicals, 5818 diseases and 3116 chemical-disease interactions. Each entity annotation includes both the mention text spans and normalized concept identifiers, using MeSH as the controlled vocabulary. To ensure accuracy, the entities were first captured independently by two annotators followed by a consensus annotation: The average inter-annotator agreement (IAA) scores were 87.49% and 96.05% for the disease and chemicals, respectively, in the test set according to the Jaccard similarity coefficient. Our corpus was successfully used for the BioCreative V challenge tasks and should serve as a valuable resource for the text-mining research community.Database URL: http://www.biocreative.org/tasks/biocreative-v/track-3-cdr/. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the United States.
Discovery of Predicate-Oriented Relations among Named Entities Extracted from Thai Texts
NASA Astrophysics Data System (ADS)
Tongtep, Nattapong; Theeramunkong, Thanaruk
Extracting named entities (NEs) and their relations is more difficult in Thai than in other languages due to several Thai specific characteristics, including no explicit boundaries for words, phrases and sentences; few case markers and modifier clues; high ambiguity in compound words and serial verbs; and flexible word orders. Unlike most previous works which focused on NE relations of specific actions, such as work_for, live_in, located_in, and kill, this paper proposes more general types of NE relations, called predicate-oriented relation (PoR), where an extracted action part (verb) is used as a core component to associate related named entities extracted from Thai Texts. Lacking a practical parser for the Thai language, we present three types of surface features, i.e. punctuation marks (such as token spaces), entity types and the number of entities and then apply five alternative commonly used learning schemes to investigate their performance on predicate-oriented relation extraction. The experimental results show that our approach achieves the F-measure of 97.76%, 99.19%, 95.00% and 93.50% on four different types of predicate-oriented relation (action-location, location-action, action-person and person-action) in crime-related news documents using a data set of 1,736 entity pairs. The effects of NE extraction techniques, feature sets and class unbalance on the performance of relation extraction are explored.
31 CFR 306.88 - Political entities and public corporations.
Code of Federal Regulations, 2013 CFR
2013-07-01
... corporations. 306.88 Section 306.88 Money and Finance: Treasury Regulations Relating to Money and Finance... entities and public corporations. Securities registered in the name of, or assigned to, a State, county, city, town, village, school district or other political entity, public body or corporation, may be...
31 CFR 306.88 - Political entities and public corporations.
Code of Federal Regulations, 2014 CFR
2014-07-01
... corporations. 306.88 Section 306.88 Money and Finance: Treasury Regulations Relating to Money and Finance... entities and public corporations. Securities registered in the name of, or assigned to, a State, county, city, town, village, school district or other political entity, public body or corporation, may be...
31 CFR 306.88 - Political entities and public corporations.
Code of Federal Regulations, 2012 CFR
2012-07-01
... corporations. 306.88 Section 306.88 Money and Finance: Treasury Regulations Relating to Money and Finance... entities and public corporations. Securities registered in the name of, or assigned to, a State, county, city, town, village, school district or other political entity, public body or corporation, may be...
ERIC Educational Resources Information Center
Setti, Annalisa; Borghi, Anna M.; Tessari, Alessia
2009-01-01
In this study we investigated with a priming paradigm whether uni and bimanual actions presented as primes differently affected language processing. Animals' (self-moving entities) and plants' (not self-moving entities) names were used as targets. As prime we used grasping hands, presented both as static images and videos. The results showed an…
Does humor in radio advertising affect recognition of novel product brand names?
Berg, E M; Lippman, L G
2001-04-01
The authors proposed that item selection during shopping is based on brand name recognition rather than recall. College students rated advertisements and news stories of a simulated radio program for level of amusement (orienting activity) before participating in a surprise recognition test. Humor level of the advertisements was varied systematically, and content was controlled. According to signal detection analysis, humor did not affect the strength of recognition memory for brand names (nonsense units). However, brand names and product types were significantly more likely to be associated when appearing in humorous advertisements than in nonhumorous advertisements. The results are compared with prior findings concerning humor and recall.
Todd, Derrick J; Kay, Jonathan
2016-01-01
Gadolinium-based contrast agents (GBCAs), once believed to be safe for patients with renal disease, have been strongly associated with nephrogenic systemic fibrosis (NSF), a severe systemic fibrosing disorder that predominantly afflicts individuals with advanced renal dysfunction. We provide a historical perspective on the appearance and disappearance of NSF, including its initial recognition as a discrete clinical entity, its association with GBCA exposure, and the data supporting a causative relationship between GBCA exposure and NSF. On the basis of this body of evidence, we propose that the name gadolinium-induced fibrosis (GIF) more accurately reflects the totality of knowledge regarding this disease. Use of high-risk GBCAs, such as formulated gadodiamide, should be avoided in patients with renal disease. Restriction of GBCA use in this population has almost completely eradicated new cases of this debilitating condition. Emerging antifibrotic therapies may be useful for patients who suffer from GIF.
Hellrich, Johannes; Hahn, Udo
2014-01-01
We here report on efforts to computationally support the maintenance and extension of multilingual biomedical terminology resources. Our main idea is to treat term acquisition as a classification problem guided by term alignment in parallel multilingual corpora, using termhood information coming from of a named entity recognition system as a novel feature. We report on experiments for Spanish, French, German and Dutch parts of a multilingual UMLS-derived biomedical terminology. These efforts yielded 19k, 18k, 23k and 12k new terms and synonyms, respectively, from which about half relate to concepts without a previously available term label for these non-English languages. Based on expert assessment of a novel German terminology sample, 80% of the newly acquired terms were judged as reasonable additions to the terminology. PMID:25954371
Federal Register 2010, 2011, 2012, 2013, 2014
2012-07-27
...] Medicare Program; Application by the American Association of Diabetes Educators (AADE) for Continued Recognition as a National Accreditation Organization for Accrediting Entities To Furnish Outpatient Diabetes... of Diabetes Educators for continued recognition as a national accreditation program for accrediting...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-09-08
... removing the names of ten entities and three individuals from the list of Specially Designated Nationals... Commit, Threaten To Commit, or Support Terrorism. DATES: The removal of ten entities and three... Foreign Assets Control has determined that these ten entities and three individuals no longer meet the...
78 FR 59880 - Enhanced Consumer Protections for Charter Air Transportation
Federal Register 2010, 2011, 2012, 2013, 2014
2013-09-30
...) The name of the company in operational control of the aircraft during flight; (2) any other ``doing... disclosure of the entity in operational control of the aircraft during the flight and seven of those comments... different from the entity in operational control of the aircraft, primarily on the basis that these entities...
Moving beyond the Name: Defining Corporate Entities to Support Provenance-Based Access
ERIC Educational Resources Information Center
Light, Michelle
2007-01-01
The second edition of the "International Standard Archival Authority Records for Corporate Bodies, Persons, and Families (ISAAR(CPF)2)" focuses on describing entities as they exist in reality, rather than on establishing authorized terms. This change allows authority records to include multiple authorized terms representing an entity as it changed…
De Winter, François-Laurent; Timmers, Dorien; de Gelder, Beatrice; Van Orshoven, Marc; Vieren, Marleen; Bouckaert, Miriam; Cypers, Gert; Caekebeke, Jo; Van de Vliet, Laura; Goffin, Karolien; Van Laere, Koen; Sunaert, Stefan; Vandenberghe, Rik; Vandenbulcke, Mathieu; Van den Stock, Jan
2016-01-01
Deficits in face processing have been described in the behavioral variant of fronto-temporal dementia (bvFTD), primarily regarding the recognition of facial expressions. Less is known about face shape and face identity processing. Here we used a hierarchical strategy targeting face shape and face identity recognition in bvFTD and matched healthy controls. Participants performed 3 psychophysical experiments targeting face shape detection (Experiment 1), unfamiliar face identity matching (Experiment 2), familiarity categorization and famous face-name matching (Experiment 3). The results revealed group differences only in Experiment 3, with a deficit in the bvFTD group for both familiarity categorization and famous face-name matching. Voxel-based morphometry regression analyses in the bvFTD group revealed an association between grey matter volume of the left ventral anterior temporal lobe and familiarity recognition, while face-name matching correlated with grey matter volume of the bilateral ventral anterior temporal lobes. Subsequently, we quantified familiarity-specific and name-specific recognition deficits as the sum of the celebrities of which respectively only the name or only the familiarity was accurately recognized. Both indices were associated with grey matter volume of the bilateral anterior temporal cortices. These findings extent previous results by documenting the involvement of the left anterior temporal lobe (ATL) in familiarity detection and the right ATL in name recognition deficits in fronto-temporal lobar degeneration.
Reading handprinted addresses on IRS tax forms
NASA Astrophysics Data System (ADS)
Ramanaprasad, Vemulapati; Shin, Yong-Chul; Srihari, Sargur N.
1996-03-01
The hand-printed address recognition system described in this paper is a part of the Name and Address Block Reader (NABR) system developed by the Center of Excellence for Document Analysis and Recognition (CEDAR). NABR is currently being used by the IRS to read address blocks (hand-print as well as machine-print) on fifteen different tax forms. Although machine- print address reading was relatively straightforward, hand-print address recognition has posed some special challenges due to demands on processing speed (with an expected throughput of 8450 forms/hour) and recognition accuracy. We discuss various subsystems involved in hand- printed address recognition, including word segmentation, word recognition, digit segmentation, and digit recognition. We also describe control strategies used to make effective use of these subsystems to maximize recognition accuracy. We present system performance on 931 address blocks in recognizing various fields, such as city, state, ZIP Code, street number and name, and personal names.
2014-11-01
possible future directions that build on the KBA experience. Data Assets In addition to the three hundred run submissions from diverse systems...form name of an entity and assigning a confidence score based on the number of matches of tokens in the name. See code in github [6]. macro-P...131 64 GENDER 4 2 FoundedBy 56 30 NAME 2 2 DateOfDeath 54 12 TOP_MEMBERS_EMPLOYEES 2 1 EmployeeOf 44 19 WON_AWARD 1 1
Polepalli Ramesh, Balaji; Belknap, Steven M; Li, Zuofeng; Frid, Nadya; West, Dennis P
2014-01-01
Background The Food and Drug Administration’s (FDA) Adverse Event Reporting System (FAERS) is a repository of spontaneously-reported adverse drug events (ADEs) for FDA-approved prescription drugs. FAERS reports include both structured reports and unstructured narratives. The narratives often include essential information for evaluation of the severity, causality, and description of ADEs that are not present in the structured data. The timely identification of unknown toxicities of prescription drugs is an important, unsolved problem. Objective The objective of this study was to develop an annotated corpus of FAERS narratives and biomedical named entity tagger to automatically identify ADE related information in the FAERS narratives. Methods We developed an annotation guideline and annotate medication information and adverse event related entities on 122 FAERS narratives comprising approximately 23,000 word tokens. A named entity tagger using supervised machine learning approaches was built for detecting medication information and adverse event entities using various categories of features. Results The annotated corpus had an agreement of over .9 Cohen’s kappa for medication and adverse event entities. The best performing tagger achieves an overall performance of 0.73 F1 score for detection of medication, adverse event and other named entities. Conclusions In this study, we developed an annotated corpus of FAERS narratives and machine learning based models for automatically extracting medication and adverse event information from the FAERS narratives. Our study is an important step towards enriching the FAERS data for postmarketing pharmacovigilance. PMID:25600332
Illuminate Knowledge Elements in Geoscience Literature
NASA Astrophysics Data System (ADS)
Ma, X.; Zheng, J. G.; Wang, H.; Fox, P. A.
2015-12-01
There are numerous dark data hidden in geoscience literature. Efficient retrieval and reuse of those data will greatly benefit geoscience researches of nowadays. Among the works of data rescue, a topic of interest is illuminating the knowledge framework, i.e. entities and relationships, embedded in documents. Entity recognition and linking have received extensive attention in news and social media analysis, as well as in bioinformatics. In the domain of geoscience, however, such works are limited. We will present our work on how to use knowledge bases on the Web, such as ontologies and vocabularies, to facilitate entity recognition and linking in geoscience literature. The work deploys an un-supervised collective inference approach [1] to link entity mentions in unstructured texts to a knowledge base, which leverages the meaningful information and structures in ontologies and vocabularies for similarity computation and entity ranking. Our work is still in the initial stage towards the detection of knowledge frameworks in literature, and we have been collecting geoscience ontologies and vocabularies in order to build a comprehensive geoscience knowledge base [2]. We hope the work will initiate new ideas and collaborations on dark data rescue, as well as on the synthesis of data and knowledge from geoscience literature. References: 1. Zheng, J., Howsmon, D., Zhang, B., Hahn, J., McGuinness, D.L., Hendler, J., and Ji, H. 2014. Entity linking for biomedical literature. In Proceedings of ACM 8th International Workshop on Data and Text Mining in Bioinformatics, Shanghai, China. 2. Ma, X. Zheng, J., 2015. Linking geoscience entity mentions to the Web of Data. ESIP 2015 Summer Meeting, Pacific Grove, CA.
41 CFR 102-173.50 - What is the naming convention for States?
Code of Federal Regulations, 2014 CFR
2014-01-01
...-INTERNET GOV DOMAIN Registration § 102-173.50 What is the naming convention for States? (a) To register any second-level domain within dot-gov, State government entities must register the full State name or clearly indicate the State postal code within the name. Examples of acceptable names include virginia.gov...
41 CFR 102-173.50 - What is the naming convention for States?
Code of Federal Regulations, 2011 CFR
2011-01-01
...-INTERNET GOV DOMAIN Registration § 102-173.50 What is the naming convention for States? (a) To register any second-level domain within dot-gov, State government entities must register the full State name or clearly indicate the State postal code within the name. Examples of acceptable names include virginia.gov...
41 CFR 102-173.50 - What is the naming convention for States?
Code of Federal Regulations, 2010 CFR
2010-07-01
...-INTERNET GOV DOMAIN Registration § 102-173.50 What is the naming convention for States? (a) To register any second-level domain within dot-gov, State government entities must register the full State name or clearly indicate the State postal code within the name. Examples of acceptable names include virginia.gov...
41 CFR 102-173.50 - What is the naming convention for States?
Code of Federal Regulations, 2013 CFR
2013-07-01
...-INTERNET GOV DOMAIN Registration § 102-173.50 What is the naming convention for States? (a) To register any second-level domain within dot-gov, State government entities must register the full State name or clearly indicate the State postal code within the name. Examples of acceptable names include virginia.gov...
41 CFR 102-173.50 - What is the naming convention for States?
Code of Federal Regulations, 2012 CFR
2012-01-01
...-INTERNET GOV DOMAIN Registration § 102-173.50 What is the naming convention for States? (a) To register any second-level domain within dot-gov, State government entities must register the full State name or clearly indicate the State postal code within the name. Examples of acceptable names include virginia.gov...
Is it about the self or the significance? An fMRI study of self-name recognition.
Tacikowski, P; Brechmann, A; Marchewka, A; Jednoróg, K; Dobrowolny, M; Nowicka, A
2011-01-01
Our own name, due to its high social relevance, is supposed to have a unique status in our information processing. However, demonstrating this phenomenon empirically proves difficult as famous and unknown names, to which self-name is often compared in the studies, may differ from self-name not only in terms of the 'me vs. not-me' distinction, but also as regards their emotional content and frequency of occurrence in everyday life. In this fMRI study, apart from famous and unknown names we used the names of the most important persons in our subjects' lives. When compared to famous or unknown names recognition, self-name recognition was associated with robust activations in widely distributed bilateral network including fronto-temporal, limbic and subcortical structures, however, when compared to significant other's name, the activations were present specifically in the right inferior frontal gyrus. In addition, the significant other's name produced a similar pattern of activations to the one activated by self-name. These results suggest that the differences between own and other's name processing may rather be quantitative than qualitative in nature.
Synthetic Approaches to the New Drugs Approved During 2015.
Flick, Andrew C; Ding, Hong X; Leverett, Carolyn A; Kyne, Robert E; Liu, Kevin K-C; Fink, Sarah J; O'Donnell, Christopher J
2017-08-10
New drugs introduced to the market every year represent privileged structures for particular biological targets. These new chemical entities (NCEs) provide insight into molecular recognition while serving as leads for designing future new drugs. This annual review describes the most likely process-scale synthetic approaches to 29 new chemical entities (NCEs) that were approved for the first time in 2015.
41 CFR 102-173.25 - What definitions apply to this part?
Code of Federal Regulations, 2014 CFR
2014-01-01
... Management Regulations System (Continued) FEDERAL MANAGEMENT REGULATION TELECOMMUNICATIONS 173-INTERNET GOV... Administration (GSA) is responsible for registrations in the dot-gov domain. Domain name is a name assigned to an... domain name server. A domain name locates the organization or other entity on the Internet. The dot gov...
41 CFR 102-173.25 - What definitions apply to this part?
Code of Federal Regulations, 2013 CFR
2013-07-01
... Management Regulations System (Continued) FEDERAL MANAGEMENT REGULATION TELECOMMUNICATIONS 173-INTERNET GOV... Administration (GSA) is responsible for registrations in the dot-gov domain. Domain name is a name assigned to an... domain name server. A domain name locates the organization or other entity on the Internet. The dot gov...
41 CFR 102-173.25 - What definitions apply to this part?
Code of Federal Regulations, 2012 CFR
2012-01-01
... Management Regulations System (Continued) FEDERAL MANAGEMENT REGULATION TELECOMMUNICATIONS 173-INTERNET GOV... Administration (GSA) is responsible for registrations in the dot-gov domain. Domain name is a name assigned to an... domain name server. A domain name locates the organization or other entity on the Internet. The dot gov...
Chemical entity recognition in patents by combining dictionary-based and statistical approaches
Akhondi, Saber A.; Pons, Ewoud; Afzal, Zubair; van Haagen, Herman; Becker, Benedikt F.H.; Hettne, Kristina M.; van Mulligen, Erik M.; Kors, Jan A.
2016-01-01
We describe the development of a chemical entity recognition system and its application in the CHEMDNER-patent track of BioCreative 2015. This community challenge includes a Chemical Entity Mention in Patents (CEMP) recognition task and a Chemical Passage Detection (CPD) classification task. We addressed both tasks by an ensemble system that combines a dictionary-based approach with a statistical one. For this purpose the performance of several lexical resources was assessed using Peregrine, our open-source indexing engine. We combined our dictionary-based results on the patent corpus with the results of tmChem, a chemical recognizer using a conditional random field classifier. To improve the performance of tmChem, we utilized three additional features, viz. part-of-speech tags, lemmas and word-vector clusters. When evaluated on the training data, our final system obtained an F-score of 85.21% for the CEMP task, and an accuracy of 91.53% for the CPD task. On the test set, the best system ranked sixth among 21 teams for CEMP with an F-score of 86.82%, and second among nine teams for CPD with an accuracy of 94.23%. The differences in performance between the best ensemble system and the statistical system separately were small. Database URL: http://biosemantics.org/chemdner-patents PMID:27141091
Biomedical named entity extraction: some issues of corpus compatibilities.
Ekbal, Asif; Saha, Sriparna; Sikdar, Utpal Kumar
2013-01-01
Named Entity (NE) extraction is one of the most fundamental and important tasks in biomedical information extraction. It involves identification of certain entities from text and their classification into some predefined categories. In the biomedical community, there is yet no general consensus regarding named entity (NE) annotation; thus, it is very difficult to compare the existing systems due to corpus incompatibilities. Due to this problem we can not also exploit the advantages of using different corpora together. In our present work we address the issues of corpus compatibilities, and use a single objective optimization (SOO) based classifier ensemble technique that uses the search capability of genetic algorithm (GA) for NE extraction in biomedicine. We hypothesize that the reliability of predictions of each classifier differs among the various output classes. We use Conditional Random Field (CRF) and Support Vector Machine (SVM) frameworks to build a number of models depending upon the various representations of the set of features and/or feature templates. It is to be noted that we tried to extract the features without using any deep domain knowledge and/or resources. In order to assess the challenges of corpus compatibilities, we experiment with the different benchmark datasets and their various combinations. Comparison results with the existing approaches prove the efficacy of the used technique. GA based ensemble achieves around 2% performance improvements over the individual classifiers. Degradation in performance on the integrated corpus clearly shows the difficulties of the task. In summary, our used ensemble based approach attains the state-of-the-art performance levels for entity extraction in three different kinds of biomedical datasets. The possible reasons behind the better performance in our used approach are the (i). use of variety and rich features as described in Subsection "Features for named entity extraction"; (ii) use of GA based classifier ensemble technique to combine the outputs of multiple classifiers.
12 CFR 602.24 - Responses to demands served on non-FCA employees or entities.
Code of Federal Regulations, 2010 CFR
2010-01-01
... 12 Banks and Banking 6 2010-01-01 2010-01-01 false Responses to demands served on non-FCA employees or entities. 602.24 Section 602.24 Banks and Banking FARM CREDIT ADMINISTRATION ADMINISTRATIVE... Not a Named Party § 602.24 Responses to demands served on non-FCA employees or entities. If you are...
Evaluation and Cross-Comparison of Lexical Entities of Biological Interest (LexEBI)
Rebholz-Schuhmann, Dietrich; Kim, Jee-Hyub; Yan, Ying; Dixit, Abhishek; Friteyre, Caroline; Hoehndorf, Robert; Backofen, Rolf; Lewin, Ian
2013-01-01
Motivation Biomedical entities, their identifiers and names, are essential in the representation of biomedical facts and knowledge. In the same way, the complete set of biomedical and chemical terms, i.e. the biomedical “term space” (the “Lexeome”), forms a key resource to achieve the full integration of the scientific literature with biomedical data resources: any identified named entity can immediately be normalized to the correct database entry. This goal does not only require that we are aware of all existing terms, but would also profit from knowing all their senses and their semantic interpretation (ambiguities, nestedness). Result This study compiles a resource for lexical terms of biomedical interest in a standard format (called “LexEBI”), determines the overall number of terms, their reuse in different resources and the nestedness of terms. LexEBI comprises references for protein and gene entries and their term variants and chemical entities amongst other terms. In addition, disease terms have been identified from Medline and PubmedCentral and added to LexEBI. Our analysis demonstrates that the baseforms of terms from the different semantic types show only little polysemous use. Nonetheless, the term variants of protein and gene names (PGNs) frequently contain species mentions, which should have been avoided according to protein annotation guidelines. Furthermore, the protein and gene entities as well as the chemical entities, both do comprise enzymes leading to hierarchical polysemy, and a large portion of PGNs make reference to a chemical entity. Altogether, according to our analysis based on the Medline distribution, 401,869 unique PGNs in the documents contain a reference to 25,022 chemical entities, 3,125 disease terms or 1,576 species mentions. Conclusion LexEBI delivers the complete biomedical and chemical Lexeome in a standardized representation (http://www.ebi.ac.uk/Rebholz-srv/LexEBI/). The resource provides the disease terms as open source content, and fully interlinks terms across resources. PMID:24124474
ParaBTM: A Parallel Processing Framework for Biomedical Text Mining on Supercomputers.
Xing, Yuting; Wu, Chengkun; Yang, Xi; Wang, Wei; Zhu, En; Yin, Jianping
2018-04-27
A prevailing way of extracting valuable information from biomedical literature is to apply text mining methods on unstructured texts. However, the massive amount of literature that needs to be analyzed poses a big data challenge to the processing efficiency of text mining. In this paper, we address this challenge by introducing parallel processing on a supercomputer. We developed paraBTM, a runnable framework that enables parallel text mining on the Tianhe-2 supercomputer. It employs a low-cost yet effective load balancing strategy to maximize the efficiency of parallel processing. We evaluated the performance of paraBTM on several datasets, utilizing three types of named entity recognition tasks as demonstration. Results show that, in most cases, the processing efficiency can be greatly improved with parallel processing, and the proposed load balancing strategy is simple and effective. In addition, our framework can be readily applied to other tasks of biomedical text mining besides NER.
Westergaard, David; Stærfeldt, Hans-Henrik; Tønsberg, Christian; Jensen, Lars Juhl; Brunak, Søren
2018-02-01
Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only.
Westergaard, David; Stærfeldt, Hans-Henrik
2018-01-01
Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823–2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein–protein, disease–gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only. PMID:29447159
Face-Name Association Learning and Brain Structural Substrates in Alcoholism
Pitel, Anne-Lise; Chanraud, Sandra; Rohlfing, Torsten; Pfefferbaum, Adolf; Sullivan, Edith V.
2011-01-01
Background Associative learning is required for face-name association and is impaired in alcoholism, but the cognitive processes and brain structural components underlying this deficit remain unclear. It is also unknown whether prompting alcoholics to implement a deep level of processing during face-name encoding would enhance performance. Methods Abstinent alcoholics and controls performed a levels-of-processing face-name learning task. Participants indicated whether the face was that of an honest person (deep encoding) or that of a man (shallow encoding). Retrieval was examined using an associative (face-name) recognition task and a single-item (face or name only) recognition task. Participants also underwent a 3T structural MRI. Results Compared with controls, alcoholics had poorer associative and single-item recognition, each impaired to the same extent. Level of processing at encoding had little effect on recognition performance but affected reaction time. Correlations with brain volumes were generally modest and based primarily on reaction time in alcoholics, where the deeper the processing at encoding, the more restricted the correlations with brain volumes. In alcoholics, longer control task reaction times correlated modestly with volumes across several anterior to posterior brain regions; shallow encoding correlated with calcarine and striatal volumes; deep encoding correlated with precuneus and parietal volumes; associative recognition RT correlated with cerebellar volumes. In controls, poorer associative recognition with deep encoding correlated significantly with smaller volumes of frontal and striatal structures. Conclusions Despite prompting, alcoholics did not take advantage of encoding memoranda at a deep level to enhance face-name recognition accuracy. Nonetheless, conditions of deeper encoding resulted in faster reaction times and more specific relations with regional brain volumes than did shallow encoding. The normal relation between associative recognition and corticostriatal volumes was not present in alcoholics. Rather, their speeded reaction time occurred at the expense of accuracy and was related most robustly to cerebellar volumes. PMID:22509954
Rapid Naming Speed and Chinese Character Recognition
ERIC Educational Resources Information Center
Liao, Chen-Huei; Georgiou, George K.; Parrila, Rauno
2008-01-01
We examined the relationship between rapid naming speed (RAN) and Chinese character recognition accuracy and fluency. Sixty-three grade 2 and 54 grade 4 Taiwanese children were administered four RAN tasks (colors, digits, Zhu-Yin-Fu-Hao, characters), and two character recognition tasks. RAN tasks accounted for more reading variance in grade 4 than…
neXtA5: accelerating annotation of articles via automated approaches in neXtProt.
Mottin, Luc; Gobeill, Julien; Pasche, Emilie; Michel, Pierre-André; Cusin, Isabelle; Gaudet, Pascale; Ruch, Patrick
2016-01-01
The rapid increase in the number of published articles poses a challenge for curated databases to remain up-to-date. To help the scientific community and database curators deal with this issue, we have developed an application, neXtA5, which prioritizes the literature for specific curation requirements. Our system, neXtA5, is a curation service composed of three main elements. The first component is a named-entity recognition module, which annotates MEDLINE over some predefined axes. This report focuses on three axes: Diseases, the Molecular Function and Biological Process sub-ontologies of the Gene Ontology (GO). The automatic annotations are then stored in a local database, BioMed, for each annotation axis. Additional entities such as species and chemical compounds are also identified. The second component is an existing search engine, which retrieves the most relevant MEDLINE records for any given query. The third component uses the content of BioMed to generate an axis-specific ranking, which takes into account the density of named-entities as stored in the Biomed database. The two ranked lists are ultimately merged using a linear combination, which has been specifically tuned to support the annotation of each axis. The fine-tuning of the coefficients is formally reported for each axis-driven search. Compared with PubMed, which is the system used by most curators, the improvement is the following: +231% for Diseases, +236% for Molecular Functions and +3153% for Biological Process when measuring the precision of the top-returned PMID (P0 or mean reciprocal rank). The current search methods significantly improve the search effectiveness of curators for three important curation axes. Further experiments are being performed to extend the curation types, in particular protein-protein interactions, which require specific relationship extraction capabilities. In parallel, user-friendly interfaces powered with a set of JSON web services are currently being implemented into the neXtProt annotation pipeline.Available on: http://babar.unige.ch:8082/neXtA5Database URL: http://babar.unige.ch:8082/neXtA5/fetcher.jsp. © The Author(s) 2016. Published by Oxford University Press.
neXtA5: accelerating annotation of articles via automated approaches in neXtProt
Mottin, Luc; Gobeill, Julien; Pasche, Emilie; Michel, Pierre-André; Cusin, Isabelle; Gaudet, Pascale; Ruch, Patrick
2016-01-01
The rapid increase in the number of published articles poses a challenge for curated databases to remain up-to-date. To help the scientific community and database curators deal with this issue, we have developed an application, neXtA5, which prioritizes the literature for specific curation requirements. Our system, neXtA5, is a curation service composed of three main elements. The first component is a named-entity recognition module, which annotates MEDLINE over some predefined axes. This report focuses on three axes: Diseases, the Molecular Function and Biological Process sub-ontologies of the Gene Ontology (GO). The automatic annotations are then stored in a local database, BioMed, for each annotation axis. Additional entities such as species and chemical compounds are also identified. The second component is an existing search engine, which retrieves the most relevant MEDLINE records for any given query. The third component uses the content of BioMed to generate an axis-specific ranking, which takes into account the density of named-entities as stored in the Biomed database. The two ranked lists are ultimately merged using a linear combination, which has been specifically tuned to support the annotation of each axis. The fine-tuning of the coefficients is formally reported for each axis-driven search. Compared with PubMed, which is the system used by most curators, the improvement is the following: +231% for Diseases, +236% for Molecular Functions and +3153% for Biological Process when measuring the precision of the top-returned PMID (P0 or mean reciprocal rank). The current search methods significantly improve the search effectiveness of curators for three important curation axes. Further experiments are being performed to extend the curation types, in particular protein–protein interactions, which require specific relationship extraction capabilities. In parallel, user-friendly interfaces powered with a set of JSON web services are currently being implemented into the neXtProt annotation pipeline. Available on: http://babar.unige.ch:8082/neXtA5 Database URL: http://babar.unige.ch:8082/neXtA5/fetcher.jsp PMID:27374119
78 FR 26244 - Updating of Employer Identification Numbers
Federal Register 2010, 2011, 2012, 2013, 2014
2013-05-06
... (including updated application information regarding the name and taxpayer identifying number of the... require these persons to update application information regarding the name and taxpayer identifying number..., Application for Employer Identification Number, requires entities to disclose the name of the EIN applicant's...
PathNER: a tool for systematic identification of biological pathway mentions in the literature
2013-01-01
Background Biological pathways are central to many biomedical studies and are frequently discussed in the literature. Several curated databases have been established to collate the knowledge of molecular processes constituting pathways. Yet, there has been little focus on enabling systematic detection of pathway mentions in the literature. Results We developed a tool, named PathNER (Pathway Named Entity Recognition), for the systematic identification of pathway mentions in the literature. PathNER is based on soft dictionary matching and rules, with the dictionary generated from public pathway databases. The rules utilise general pathway-specific keywords, syntactic information and gene/protein mentions. Detection results from both components are merged. On a gold-standard corpus, PathNER achieved an F1-score of 84%. To illustrate its potential, we applied PathNER on a collection of articles related to Alzheimer's disease to identify associated pathways, highlighting cases that can complement an existing manually curated knowledgebase. Conclusions In contrast to existing text-mining efforts that target the automatic reconstruction of pathway details from molecular interactions mentioned in the literature, PathNER focuses on identifying specific named pathway mentions. These mentions can be used to support large-scale curation and pathway-related systems biology applications, as demonstrated in the example of Alzheimer's disease. PathNER is implemented in Java and made freely available online at http://sourceforge.net/projects/pathner/. PMID:24555844
Urinary bladder: normal appearance and mimics of malignancy at CT urography
Sadow, Cheryl A.; Anik Sahni, V.; Silverman, Stuart G.
2011-01-01
Abstract The objective of this review article is to learn how to recognize anatomic variants and benign entities that mimic bladder cancer at computed tomography (CT) urography. Building on recent data that suggest that CT urography can be used to diagnose bladder cancer, recognition of anatomic variants and benign entities will help improve radiologists’ ability to diagnose bladder cancer. PMID:21771710
ERIC Educational Resources Information Center
Acres, K.; Taylor, K. I.; Moss, H. E.; Stamatakis, E. A.; Tyler, L. K.
2009-01-01
Cognitive neuroscientific research proposes complementary hemispheric asymmetries in naming and recognising visual objects, with a left temporal lobe advantage for object naming and a right temporal lobe advantage for object recognition. Specifically, it has been proposed that the left inferior temporal lobe plays a mediational role linking…
Learning of Letter Names and Sounds and Their Contribution to Word Recognition
ERIC Educational Resources Information Center
Levin, Iris; Shatil-Carmon, Sivan; Asif-Rave, Ornit
2006-01-01
This study investigated knowledge of letter names and letter sounds, their learning, and their contributions to word recognition. Of 123 preschoolers examined on letter knowledge, 65 underwent training on both letter names and letter sounds in a counterbalanced order. Prior to training, children were more advanced in associating letters with their…
Taxonomic indexing--extending the role of taxonomy.
Patterson, David J; Remsen, David; Marino, William A; Norton, Cathy
2006-06-01
Taxonomic indexing refers to a new array of taxonomically intelligent network services that use nomenclatural principles and elements of expert taxonomic knowledge to manage information about organisms. Taxonomic indexing was introduced to help manage the increasing amounts of digital information about biology. It has been designed to form a near basal layer in a layered cyberinfrastructure that deals with biological information. Taxonomic Indexing accommodates the special problems of using names of organisms to index biological material. It links alternative names for the same entity (reconciliation), and distinguishes between uses of the same name for different entities (disambiguation), and names are placed within an indefinite number of hierarchical schemes. In order to access all information on all organisms, Taxonomic indexing must be able to call on a registry of all names in all forms for all organisms. NameBank has been developed to meet that need. Taxonomic indexing is an area of informatics that overlaps with taxonomy, is dependent on the expert input of taxonomists, and reveals the relevance of the discipline to a wide audience.
A study of active learning methods for named entity recognition in clinical text.
Chen, Yukun; Lasko, Thomas A; Mei, Qiaozhu; Denny, Joshua C; Xu, Hua
2015-12-01
Named entity recognition (NER), a sequential labeling task, is one of the fundamental tasks for building clinical natural language processing (NLP) systems. Machine learning (ML) based approaches can achieve good performance, but they often require large amounts of annotated samples, which are expensive to build due to the requirement of domain experts in annotation. Active learning (AL), a sample selection approach integrated with supervised ML, aims to minimize the annotation cost while maximizing the performance of ML-based models. In this study, our goal was to develop and evaluate both existing and new AL methods for a clinical NER task to identify concepts of medical problems, treatments, and lab tests from the clinical notes. Using the annotated NER corpus from the 2010 i2b2/VA NLP challenge that contained 349 clinical documents with 20,423 unique sentences, we simulated AL experiments using a number of existing and novel algorithms in three different categories including uncertainty-based, diversity-based, and baseline sampling strategies. They were compared with the passive learning that uses random sampling. Learning curves that plot performance of the NER model against the estimated annotation cost (based on number of sentences or words in the training set) were generated to evaluate different active learning and the passive learning methods and the area under the learning curve (ALC) score was computed. Based on the learning curves of F-measure vs. number of sentences, uncertainty sampling algorithms outperformed all other methods in ALC. Most diversity-based methods also performed better than random sampling in ALC. To achieve an F-measure of 0.80, the best method based on uncertainty sampling could save 66% annotations in sentences, as compared to random sampling. For the learning curves of F-measure vs. number of words, uncertainty sampling methods again outperformed all other methods in ALC. To achieve 0.80 in F-measure, in comparison to random sampling, the best uncertainty based method saved 42% annotations in words. But the best diversity based method reduced only 7% annotation effort. In the simulated setting, AL methods, particularly uncertainty-sampling based approaches, seemed to significantly save annotation cost for the clinical NER task. The actual benefit of active learning in clinical NER should be further evaluated in a real-time setting. Copyright © 2015 Elsevier Inc. All rights reserved.
Letter Names: Effect on Letter Saying, Spelling, and Word Recognition in Hebrew.
ERIC Educational Resources Information Center
Levin, Iris; Patel, Sigal; Margalit, Tamar; Barad, Noa
2002-01-01
Examined whether letter names, which bridge the gap between oral and written language among English speaking children, have a similar function in Hebrew. In findings from studies of Israeli kindergartners and first graders, children were found to rely on letter names in performing a number of letter saying, spelling, and word recognition tasks.…
Recognizing the Emotional Valence of Names: An ERP Study
ERIC Educational Resources Information Center
Wang, Lin; Zhu, Zude; Bastiaansen, Marcel; Hagoort, Peter; Yang, Yufang
2013-01-01
Unlike common nouns, person names refer to unique entities and generally have a referring function. We used event-related potentials to investigate the time course of identifying the emotional meaning of nouns and names. The emotional valence of names and nouns were manipulated separately. The results show early N1 effects in response to emotional…
Face-name association learning and brain structural substrates in alcoholism.
Pitel, Anne-Lise; Chanraud, Sandra; Rohlfing, Torsten; Pfefferbaum, Adolf; Sullivan, Edith V
2012-07-01
Associative learning is required for face-name association and is impaired in alcoholism, but the cognitive processes and brain structural components underlying this deficit remain unclear. It is also unknown whether prompting alcoholics to implement a deep level of processing during face-name encoding would enhance performance. Abstinent alcoholics and controls performed a levels-of-processing face-name learning task. Participants indicated whether the face was that of an honest person (deep encoding) or that of a man (shallow encoding). Retrieval was examined using an associative (face-name) recognition task and a single-item (face or name only) recognition task. Participants also underwent 3T structural MRI. Compared with controls, alcoholics had poorer associative and single-item learning and performed at similar levels. Level of processing at encoding had little effect on recognition performance but affected reaction time (RT). Correlations with brain volumes were generally modest and based primarily on RT in alcoholics, where the deeper the processing at encoding, the more restricted the correlations with brain volumes. In alcoholics, longer control task RTs correlated modestly with smaller tissue volumes across several anterior to posterior brain regions; shallow encoding correlated with calcarine and striatal volumes; deep encoding correlated with precuneus and parietal volumes; and associative recognition RT correlated with cerebellar volumes. In controls, poorer associative recognition with deep encoding correlated significantly with smaller volumes of frontal and striatal structures. Despite prompting, alcoholics did not take advantage of encoding memoranda at a deep level to enhance face-name recognition accuracy. Nonetheless, conditions of deeper encoding resulted in faster RTs and more specific relations with regional brain volumes than did shallow encoding. The normal relation between associative recognition and corticostriatal volumes was not present in alcoholics. Rather, their speeded RTs occurred at the expense of accuracy and were related most robustly to cerebellar volumes. Copyright © 2012 by the Research Society on Alcoholism.
Building Scalable Knowledge Graphs for Earth Science
NASA Astrophysics Data System (ADS)
Ramachandran, R.; Maskey, M.; Gatlin, P. N.; Zhang, J.; Duan, X.; Bugbee, K.; Christopher, S. A.; Miller, J. J.
2017-12-01
Estimates indicate that the world's information will grow by 800% in the next five years. In any given field, a single researcher or a team of researchers cannot keep up with this rate of knowledge expansion without the help of cognitive systems. Cognitive computing, defined as the use of information technology to augment human cognition, can help tackle large systemic problems. Knowledge graphs, one of the foundational components of cognitive systems, link key entities in a specific domain with other entities via relationships. Researchers could mine these graphs to make probabilistic recommendations and to infer new knowledge. At this point, however, there is a dearth of tools to generate scalable Knowledge graphs using existing corpus of scientific literature for Earth science research. Our project is currently developing an end-to-end automated methodology for incrementally constructing Knowledge graphs for Earth Science. Semantic Entity Recognition (SER) is one of the key steps in this methodology. SER for Earth Science uses external resources (including metadata catalogs and controlled vocabulary) as references to guide entity extraction and recognition (i.e., labeling) from unstructured text, in order to build a large training set to seed the subsequent auto-learning component in our algorithm. Results from several SER experiments will be presented as well as lessons learned.
Chemical entity recognition in patents by combining dictionary-based and statistical approaches.
Akhondi, Saber A; Pons, Ewoud; Afzal, Zubair; van Haagen, Herman; Becker, Benedikt F H; Hettne, Kristina M; van Mulligen, Erik M; Kors, Jan A
2016-01-01
We describe the development of a chemical entity recognition system and its application in the CHEMDNER-patent track of BioCreative 2015. This community challenge includes a Chemical Entity Mention in Patents (CEMP) recognition task and a Chemical Passage Detection (CPD) classification task. We addressed both tasks by an ensemble system that combines a dictionary-based approach with a statistical one. For this purpose the performance of several lexical resources was assessed using Peregrine, our open-source indexing engine. We combined our dictionary-based results on the patent corpus with the results of tmChem, a chemical recognizer using a conditional random field classifier. To improve the performance of tmChem, we utilized three additional features, viz. part-of-speech tags, lemmas and word-vector clusters. When evaluated on the training data, our final system obtained an F-score of 85.21% for the CEMP task, and an accuracy of 91.53% for the CPD task. On the test set, the best system ranked sixth among 21 teams for CEMP with an F-score of 86.82%, and second among nine teams for CPD with an accuracy of 94.23%. The differences in performance between the best ensemble system and the statistical system separately were small.Database URL: http://biosemantics.org/chemdner-patents. © The Author(s) 2016. Published by Oxford University Press.
24 CFR 1710.208 - General information.
Code of Federal Regulations, 2010 CFR
2010-04-01
... any of the principals of the owner or developer are corporate entities, name the parent and/or... registration or prohibited sales, name the State involved and give the reasons cited by the State for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
24 CFR 1710.208 - General information.
Code of Federal Regulations, 2014 CFR
2014-04-01
... any of the principals of the owner or developer are corporate entities, name the parent and/or... registration or prohibited sales, name the State involved and give the reasons cited by the State for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
24 CFR 1710.208 - General information.
Code of Federal Regulations, 2013 CFR
2013-04-01
... any of the principals of the owner or developer are corporate entities, name the parent and/or... registration or prohibited sales, name the State involved and give the reasons cited by the State for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
24 CFR 1710.208 - General information.
Code of Federal Regulations, 2011 CFR
2011-04-01
... any of the principals of the owner or developer are corporate entities, name the parent and/or... registration or prohibited sales, name the State involved and give the reasons cited by the State for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
24 CFR 1710.208 - General information.
Code of Federal Regulations, 2012 CFR
2012-04-01
... any of the principals of the owner or developer are corporate entities, name the parent and/or... registration or prohibited sales, name the State involved and give the reasons cited by the State for their... made with the SEC, give the SEC identification number; identify the prospectus by name; date of filing...
Deadlock Detection in Computer Networks
1977-09-01
it entity class name (ndm-procownerref) = -:"node tab5le" I procnode_name z res-rnode-name call then return; nc ll c eck -for-deadlock(p_obplref...demo12 ~-exlusive sae con Caobridg Fina Sttonaa con0 Official Distribution List Defense Documentation Center New York Area Office Cameron Station 715
Recognition of cigarette brand names and logos by primary schoolchildren in Ankara, Turkey
Emri, S.; Bagci, T.; Karakoca, Y.; Baris, E.
1998-01-01
OBJECTIVE—To assess the smoking behaviour of primary schoolchildren and their ability to recognise brand names and logos of widely advertised cigarettes, compared with other commercial products intended for children. DESIGN—Cross-sectional survey in classroom settings using a questionnaire designed to measure attitudes towards smoking and the recognition of brand names and logos for 16 food, beverage, cigarette, and toothpaste products. SETTING—Ankara, Turkey. SUBJECTS—1093 children (54.6% boys, 44.4% girls) aged 7-13 years (mean = 10, SD = 1), from grades 2-5. The student sample was taken from three primary schools—one school in each of three residential districts representing high, middle, and low income populations. MAIN OUTCOME MEASURES—Prevalence of ever-smoking, recognition of brand names and logos. RESULTS—Prevalence of ever-smoking was 11.7% overall (13.9% among boys and 9.1% among girls; p<0.05). Children aged eight years or less had a higher prevalence of ever-smoking (19.6%) than older children (p<0.002). Ever-smoking prevalence did not differ significantly across the three school districts. Ever-smoking prevalence was higher among children with at least one parent who smoked (15.3%) than among those whose parents did not (4.8%) (p<0.001). Brand recognition rates ranged from 58.1% for Chee-tos (a food product) to 95.2% for Samsun (a Turkish cigarette brand). Recognition rates for cigarette brand names and logos were 95.2% and 80.8%, respectively, for Samsun; 84.0% and 90.5%, respectively, for Camel; and 92.1% and 69.5%, respectively, for Marlboro. The Camel logo and the Samsun and Marlboro brand names were the most highly recognised of all product logos and brand names tested. CONCLUSIONS—The high recognition of cigarette brand names and logos is most likely the result of tobacco advertising and promotion. Our results indicate the need to implement comprehensive tobacco control measures in Turkey. Keywords: advertising; brand recognition; children; Turkey PMID:10093173
Drane, Daniel L.; Loring, David W.; Voets, Natalie L.; Price, Michele; Ojemann, Jeffrey G.; Willie, Jon T.; Saindane, Amit M.; Phatak, Vaishali; Ivanisevic, Mirjana; Millis, Scott; Helmers, Sandra L.; Miller, John W.; Meador, Kimford J.; Gross, Robert E.
2015-01-01
SUMMARY OBJECTIVES Temporal lobe epilepsy (TLE) patients experience significant deficits in category-related object recognition and naming following standard surgical approaches. These deficits may result from a decoupling of core processing modules (e.g., language, visual processing, semantic memory), due to “collateral damage” to temporal regions outside the hippocampus following open surgical approaches. We predicted stereotactic laser amygdalohippocampotomy (SLAH) would minimize such deficits because it preserves white matter pathways and neocortical regions critical for these cognitive processes. METHODS Tests of naming and recognition of common nouns (Boston Naming Test) and famous persons were compared with nonparametric analyses using exact tests between a group of nineteen patients with medically-intractable mesial TLE undergoing SLAH (10 dominant, 9 nondominant), and a comparable series of TLE patients undergoing standard surgical approaches (n=39) using a prospective, non-randomized, non-blinded, parallel group design. RESULTS Performance declines were significantly greater for the dominant TLE patients undergoing open resection versus SLAH for naming famous faces and common nouns (F=24.3, p<.0001, η2=.57, & F=11.2, p<.001, η2=.39, respectively), and for the nondominant TLE patients undergoing open resection versus SLAH for recognizing famous faces (F=3.9, p<.02, η2=.19). When examined on an individual subject basis, no SLAH patients experienced any performance declines on these measures. In contrast, 32 of the 39 undergoing standard surgical approaches declined on one or more measures for both object types (p<.001, Fisher’s exact test). Twenty-one of 22 left (dominant) TLE patients declined on one or both naming tasks after open resection, while 11 of 17 right (non-dominant) TLE patients declined on face recognition. SIGNIFICANCE Preliminary results suggest 1) naming and recognition functions can be spared in TLE patients undergoing SLAH, and 2) the hippocampus does not appear to be an essential component of neural networks underlying name retrieval or recognition of common objects or famous faces. PMID:25489630
Recognition of cigarette brand names and logos by primary schoolchildren in Ankara, Turkey.
Emri, S; Bağci, T; Karakoca, Y; Bariş, E
1998-01-01
To assess the smoking behaviour of primary schoolchildren and their ability to recognise brand names and logos of widely advertised cigarettes, compared with other commercial products intended for children. Cross-sectional survey in classroom settings using a questionnaire designed to measure attitudes towards smoking and the recognition of brand names and logos for 16 food, beverage, cigarette, and toothpaste products. Ankara, Turkey. 1093 children (54.6% boys, 44.4% girls) aged 7-13 years (mean = 10, SD = 1), from grades 2-5. The student sample was taken from three primary schools--one school in each of three residential districts representing high, middle, and low income populations. Prevalence of ever-smoking, recognition of brand names and logos. Prevalence of ever-smoking was 11.7% overall (13.9% among boys and 9.1% among girls; p < 0.05). Children aged eight years or less had a higher prevalence of ever-smoking (19.6%) than older children (p < 0.002). Ever-smoking prevalence did not differ significantly across the three school districts. Ever-smoking prevalence was higher among children with at least one parent who smoked (15.3%) than among those whose parents did not (4.8%) (p < 0.001). Brand recognition rates ranged from 58.1% for Chee-tos (a food product) to 95.2% for Samsun (a Turkish cigarette brand). Recognition rates for cigarette brand names and logos were 95.2% and 80.8%, respectively, for Samsun; 84.0% and 90.5%, respectively, for Camel; and 92.1% and 69.5%, respectively, for Marlboro. The Camel logo and the Samsun and Marlboro brand names were the most highly recognised of all product logos and brand names tested. The high recognition of cigarette brand names and logos is most likely the result of tobacco advertising and promotion. Our results indicate the need to implement comprehensive tobacco control measures in Turkey.
ERIC Educational Resources Information Center
Blair, Rebecca; Savage, Robert
2006-01-01
This paper reports a study exploring the associations between measures of two levels of phonological representation: recognition (epi-linguistic) and production (meta-linguistic) tasks, and very early reading and writing skills. Thirty-eight pre-reading Ottawa-area children, aged 4-5 years, named environmental print (EP), wrote their own name,…
Eakin, Deborah K.; Hertzog, Christopher; Harris, William
2013-01-01
Age differences in feeling-of-knowing (FOK) accuracy were examined for both episodic memory and semantic memory. Younger and older adults either viewed pictures of famous faces (semantic memory) or associated nonfamous faces and names (episodic memory) and were tested on their memory for the name of the presented face. Participants viewed the faces again and made a FOK prediction about future recognition of the name associated with the presented face. Finally, four-alternative forced-choice recognition memory for the name, cued by the face, was tested and confidence judgments (CJs) were collected for each recognition response. Age differences were not obtained in semantic memory or the resolution of semantic FOKs, defined by within-person correlations of FOKs with recognition memory performance. Although age differences were obtained in level of episodic memory, there were no age differences in the resolution of episodic FOKs. FOKs for correctly recognized items correlated reliably with CJs for both types of materials, and did not differ by age group. The results indicate age invariance in monitoring of retrieval processes for name-face associations. PMID:23537379
Eakin, Deborah K; Hertzog, Christopher; Harris, William
2014-01-01
Age differences in feeling-of-knowing (FOK) accuracy were examined for both episodic memory and semantic memory. Younger and older adults either viewed pictures of famous faces (semantic memory) or associated non-famous faces and names (episodic memory) and were tested on their memory for the name of the presented face. Participants viewed the faces again and made a FOK prediction about future recognition of the name associated with the presented face. Finally, four-alternative forced-choice recognition memory for the name, cued by the face, was tested and confidence judgments (CJs) were collected for each recognition response. Age differences were not obtained in semantic memory or the resolution of semantic FOKs, defined by within-person correlations of FOKs with recognition memory performance. Although age differences were obtained in level of episodic memory, there were no age differences in the resolution of episodic FOKs. FOKs for correctly recognized items correlated reliably with CJs for both types of materials, and did not differ by age group. The results indicate age invariance in monitoring of retrieval processes for name-face associations.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-10-01
... letter with the following information: Name; Other Name(s) Used and Date(s) (required for FBI check); Date of Birth (required for FBI check); City and State of Birth (required for FBI Check); Current..., among other things, pre-appointment and annual tax checks, and an FBI criminal and subversive name check...
Recognition and identification of famous faces in patients with unilateral temporal lobe epilepsy.
Seidenberg, Michael; Griffith, Randall; Sabsevitz, David; Moran, Maria; Haltiner, Alan; Bell, Brian; Swanson, Sara; Hammeke, Thomas; Hermann, Bruce
2002-01-01
We examined the performance of 21 patients with unilateral temporal lobe epilepsy (TLE) and hippocampal damage (10 lefts, and 11 rights) and 10 age-matched controls on the recognition and identification (name and occupation) of well-known faces. Famous face stimuli were selected from four time periods; 1970s, 1980s, 1990-1994, and 1995-1996. Differential patterns of performance were observed for the left and right TLE group across distinct face processing components. The left TLE group showed a selective impairment in naming famous faces while they performed similar to the controls in face recognition and semantic identification (i.e. occupation). In contrast, the right TLE group was impaired across all components of face memory; face recognition, semantic identification, and face naming. Face naming impairment in the left TLE group was characterized by a temporal gradient with better naming performance for famous faces from more distant time periods. Findings are discussed in terms of the role of the temporal lobe system for the acquisition, retention, and retrieval of face semantic networks, and the differential effects of lateralized temporal lobe lesions in this process.
New FASB standard addresses revenue recognition considerations.
McKee, Thomas E
2015-12-01
Healthcare organizations are expected to apply the following steps in revenue recognition under the new standard issued in May 2014 by the Financial Accounting Standards Board: Identify the customer contract. Identify the performance obligations in the contract. Determine the transaction price. Allocate the transaction price to the performance obligations in the contract. Recognize revenue when--or in some circumstances, as--the entity satisfies the performance obligation.
Discussion: Imagining the Languaged Worker's Language
ERIC Educational Resources Information Center
Urciuoli, Bonnie
2016-01-01
What people perceive as "a language"--a named entity--is abstracted from practices and notions about those practices. People take for granted that language is somehow a "thing," an objectively distinct and bounded entity. How languages come to be thus imagined indexes the conditions under which they are imagined. The articles…
24 CFR 202.5 - General approval standards.
Code of Federal Regulations, 2011 CFR
2011-04-01
... years from the date that the materials are circulated or used to advertise. (3) Non-FHA-approved entities. A lender or mortgagee that accepts a loan application from a non-FHA-approved entity must confirm..., including, but not limited to, mergers, terminations, name, location, control of ownership, and character of...
24 CFR 202.5 - General approval standards.
Code of Federal Regulations, 2013 CFR
2013-04-01
... years from the date that the materials are circulated or used to advertise. (3) Non-FHA-approved entities. A lender or mortgagee that accepts a loan application from a non-FHA-approved entity must confirm..., including, but not limited to, mergers, terminations, name, location, control of ownership, and character of...
24 CFR 202.5 - General approval standards.
Code of Federal Regulations, 2012 CFR
2012-04-01
... years from the date that the materials are circulated or used to advertise. (3) Non-FHA-approved entities. A lender or mortgagee that accepts a loan application from a non-FHA-approved entity must confirm..., including, but not limited to, mergers, terminations, name, location, control of ownership, and character of...
49 CFR Appendix C to Part 37 - Certifications
Code of Federal Regulations, 2010 CFR
2010-10-01
..., including individuals who use wheelchairs, is equivalent to the level and quality of service offered to... (name of public entity (ies)) has conducted a survey of existing paratransit services as required by 49... is to certify that service provided by other entities but included in the ADA paratransit plan...
77 FR 48609 - Additional Designations, Foreign Narcotics Kingpin Designation Act
Federal Register 2010, 2011, 2012, 2013, 2014
2012-08-14
... the names of three individuals and five entities whose property and interests in property have been... designation by the Director of OFAC of the three individuals and five entities identified in this notice... transactions involving U.S. companies and individuals. The Kingpin Act blocks all property and interests in...
Activity Recognition in Social Media
2015-12-29
AFRL-AFOSR-JP-TR-2016-0044 Activity Recognition in Social Media Subhasis Chaudhuri INDIAN INSTITUTE OF TECHNOLOGY BOMBAY Final Report 05/09/2016...DATES COVERED (From - To) 12 Aug 2013 to 30 Sep 2015 4. TITLE AND SUBTITLE Activity Recognition in Social Media 5a. CONTRACT NUMBER 5b. GRANT NUMBER...PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) INDIAN INSTITUTE OF TECHNOLOGY BOMBAY POWAI MUMBAI, 400076 IN 8. PERFORMING ORGANIZATION REPORT NUMBER
22 CFR 96.32 - Internal structure and oversight.
Code of Federal Regulations, 2011 CFR
2011-04-01
... known, under either its current or any former form of organization, and the addresses and phone numbers used when such names were used; (2) The name, address, and phone number of each current director... number of such other provider; and (3) The name, address, and phone number of any entity it uses or...
22 CFR 96.32 - Internal structure and oversight.
Code of Federal Regulations, 2012 CFR
2012-04-01
... known, under either its current or any former form of organization, and the addresses and phone numbers used when such names were used; (2) The name, address, and phone number of each current director... number of such other provider; and (3) The name, address, and phone number of any entity it uses or...
22 CFR 96.32 - Internal structure and oversight.
Code of Federal Regulations, 2014 CFR
2014-04-01
... known, under either its current or any former form of organization, and the addresses and phone numbers used when such names were used; (2) The name, address, and phone number of each current director... number of such other provider; and (3) The name, address, and phone number of any entity it uses or...
22 CFR 96.32 - Internal structure and oversight.
Code of Federal Regulations, 2013 CFR
2013-04-01
... known, under either its current or any former form of organization, and the addresses and phone numbers used when such names were used; (2) The name, address, and phone number of each current director... number of such other provider; and (3) The name, address, and phone number of any entity it uses or...
30 CFR 1218.540 - How does ONRR serve official correspondence?
Code of Federal Regulations, 2014 CFR
2014-07-01
... reporting entity is responsible for notifying ONRR of any name or address changes on Form ONRR-4444. The... name and address, position title, or department name and address in our database, based on previous... registered agent; (ii) Any corporate officer; or (iii) The addressee of record shown in the files of any...
A Concept Hierarchy Based Ontology Mapping Approach
NASA Astrophysics Data System (ADS)
Wang, Ying; Liu, Weiru; Bell, David
Ontology mapping is one of the most important tasks for ontology interoperability and its main aim is to find semantic relationships between entities (i.e. concept, attribute, and relation) of two ontologies. However, most of the current methods only consider one to one (1:1) mappings. In this paper we propose a new approach (CHM: Concept Hierarchy based Mapping approach) which can find simple (1:1) mappings and complex (m:1 or 1:m) mappings simultaneously. First, we propose a new method to represent the concept names of entities. This method is based on the hierarchical structure of an ontology such that each concept name of entity in the ontology is included in a set. The parent-child relationship in the hierarchical structure of an ontology is then extended as a set-inclusion relationship between the sets for the parent and the child. Second, we compute the similarities between entities based on the new representation of entities in ontologies. Third, after generating the mapping candidates, we select the best mapping result for each source entity. We design a new algorithm based on the Apriori algorithm for selecting the mapping results. Finally, we obtain simple (1:1) and complex (m:1 or 1:m) mappings. Our experimental results and comparisons with related work indicate that utilizing this method in dealing with ontology mapping is a promising way to improve the overall mapping results.
SPECTRa-T: machine-based data extraction and semantic searching of chemistry e-theses.
Downing, Jim; Harvey, Matt J; Morgan, Peter B; Murray-Rust, Peter; Rzepa, Henry S; Stewart, Diana C; Tonge, Alan P; Townsend, Joe A
2010-02-22
The SPECTRa-T project has developed text-mining tools to extract named chemical entities (NCEs), such as chemical names and terms, and chemical objects (COs), e.g., experimental spectral assignments and physical chemistry properties, from electronic theses (e-theses). Although NCEs were readily identified within the two major document formats studied, only the use of structured documents enabled identification of chemical objects and their association with the relevant chemical entity (e.g., systematic chemical name). A corpus of theses was analyzed and it is shown that a high degree of semantic information can be extracted from structured documents. This integrated information has been deposited in a persistent Resource Description Framework (RDF) triple-store that allows users to conduct semantic searches. The strength and weaknesses of several document formats are reviewed.
Music Recognition in Frontotemporal Lobar Degeneration and Alzheimer Disease
Johnson, Julene K; Chang, Chiung-Chih; Brambati, Simona M; Migliaccio, Raffaella; Gorno-Tempini, Maria Luisa; Miller, Bruce L; Janata, Petr
2013-01-01
Objective To compare music recognition in patients with frontotemporal dementia, semantic dementia, Alzheimer disease, and controls and to evaluate the relationship between music recognition and brain volume. Background Recognition of familiar music depends on several levels of processing. There are few studies about how patients with dementia recognize familiar music. Methods Subjects were administered tasks that assess pitch and melody discrimination, detection of pitch errors in familiar melodies, and naming of familiar melodies. Results There were no group differences on pitch and melody discrimination tasks. However, patients with semantic dementia had considerable difficulty naming familiar melodies and also scored the lowest when asked to identify pitch errors in the same melodies. Naming familiar melodies, but not other music tasks, was strongly related to measures of semantic memory. Voxel-based morphometry analysis of brain MRI showed that difficulty in naming songs was associated with the bilateral temporal lobes and inferior frontal gyrus, whereas difficulty in identifying pitch errors in familiar melodies correlated with primarily the right temporal lobe. Conclusions The results support a view that the anterior temporal lobes play a role in familiar melody recognition, and that musical functions are affected differentially across forms of dementia. PMID:21617528
PROGRESS IN ACUTE MYELOID LEUKEMIA
Kadia, Tapan M.; Ravandi, Farhad; O’Brien, Susan; Cortes, Jorge; Kantarjian, Hagop M.
2014-01-01
Significant progress has been made in the treatment of acute myeloid leukemia (AML). Steady gains in clinical research and a renaissance of genomics in leukemia have led to improved outcomes. The recognition of tremendous heterogeneity in AML has allowed individualized treatments of specific disease entities within the context of patient age, cytogenetics, and mutational analysis. The following is a comprehensive review of the current state of AML therapy and a roadmap of our approach to these distinct disease entities. PMID:25441110
ERIC Educational Resources Information Center
Lovrencic, Michael; Vena, Laurie
2014-01-01
A kinesthetic technique for learning to recognize elements and compounds is presented in this article. The current common pedagogy appears to merge recognition and implementation into one naming method. A separate recognition skill is critical to students being able to correctly name and write the formulas of compounds. This article focuses on…
Federal Register 2010, 2011, 2012, 2013, 2014
2012-07-30
... Foreign Assets Control (``OFAC'') is publishing the names of ten individuals and nine entities whose... ten individuals and nine entities identified in this notice whose property and interests in property... international narcotics trafficking. On July 24, 2012, the Director of OFAC removed from the SDN List the ten...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-21
... Control (``OFAC'') is publishing the names of 38 entities identified as the Government of Iran under the... Government of Iran and Iranian Financial Institutions'' (the ``Order''). Section 1(a) of the Order blocks, with certain exceptions, all property and interests in property of the Government of Iran, including...
78 FR 78514 - Designation of One Individual and Three Entities Pursuant to Executive Order
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-26
... DEPARTMENT OF THE TREASURY Office of Foreign Assets Control Designation of One Individual and... publishing the name of one individual and three entities whose property and interests in property are blocked..., Security, or Stability of Burma.'' DATES: The designation by the Director of OFAC of the one individual and...
18 CFR 131.31 - FERC Form No. 561, Annual report of interlocking positions.
Code of Federal Regulations, 2010 CFR
2010-04-01
... supplies electric equipment (ELEQ) named in Column (3) enter the aggregate amount of revenues from... utility ELEQ Entity which produces/supplies electric equipment for the use of any public utility FUEL Entity which produces/supplies coal, natural gas, nuclear fuel, or other fuel for the use of any public...
The roles of perceptual and conceptual information in face recognition.
Schwartz, Linoy; Yovel, Galit
2016-11-01
The representation of familiar objects is comprised of perceptual information about their visual properties as well as the conceptual knowledge that we have about them. What is the relative contribution of perceptual and conceptual information to object recognition? Here, we examined this question by designing a face familiarization protocol during which participants were either exposed to rich perceptual information (viewing each face in different angles and illuminations) or with conceptual information (associating each face with a different name). Both conditions were compared with single-view faces presented with no labels. Recognition was tested on new images of the same identities to assess whether learning generated a view-invariant representation. Results showed better recognition of novel images of the learned identities following association of a face with a name label, but no enhancement following exposure to multiple face views. Whereas these findings may be consistent with the role of category learning in object recognition, face recognition was better for labeled faces only when faces were associated with person-related labels (name, occupation), but not with person-unrelated labels (object names or symbols). These findings suggest that association of meaningful conceptual information with an image shifts its representation from an image-based percept to a view-invariant concept. They further indicate that the role of conceptual information should be considered to account for the superior recognition that we have for familiar faces and objects. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
[The effects of normal aging on face naming and recognition of famous people: battery 75].
Pluchon, C; Simonnet, E; Toullat, G; Gil, R
2002-07-01
The difficulty to recall proper nouns is often something elderly people complain about. Thus, we tried to build and standardize a tool that could allow a quantified estimation of the naming and recognition abilities about famous people faces, specifying the part of gender, age and cultural level for each kind of test. The performances of 542 subjects divided in 3 age brackets and 3 academic knowledge levels were analysed. To carry out the test material, the artistic team of the Grevin Museum (Paris) was called upon. Their work offers a homogeneous way to shape famous people faces. One same person thus photographed 75 characters from different social categories with the same conditions of light, during only one day. The results of the study show that men perform better than women as concerns naming task, but that there's no difference between genders as concerns recognition task. Recognition performances are significantly better whatever the age, the gender and the cultural level may be. Generally, performances are all the more better since subjects are younger and have a higher cultural level. Our study then confirms the fact that normal aging goes hand in hand with rising difficulties to name faces. Moreover, results tend to show that recognition of faces remains better preserved and that the greater disability to recall a name is linked to difficulties in lexical accessing.
Classifying Web Pages by Using Knowledge Bases for Entity Retrieval
NASA Astrophysics Data System (ADS)
Kiritani, Yusuke; Ma, Qiang; Yoshikawa, Masatoshi
In this paper, we propose a novel method to classify Web pages by using knowledge bases for entity search, which is a kind of typical Web search for information related to a person, location or organization. First, we map a Web page to entities according to the similarities between the page and the entities. Various methods for computing such similarity are applied. For example, we can compute the similarity between a given page and a Wikipedia article describing a certain entity. The frequency of an entity appearing in the page is another factor used in computing the similarity. Second, we construct a directed acyclic graph, named PEC graph, based on the relations among Web pages, entities, and categories, by referring to YAGO, a knowledge base built on Wikipedia and WordNet. Finally, by analyzing the PEC graph, we classify Web pages into categories. The results of some preliminary experiments validate the methods proposed in this paper.
12 CFR 204.130 - Eligibility for NOW accounts.
Code of Federal Regulations, 2010 CFR
2010-01-01
... clarify the types of entities that may maintain NOW accounts at member banks. (b) Individuals. (1) Any individual may maintain a NOW account regardless of the purposes that the funds will serve. Thus, deposits of... under a trade name is eligible to maintain a NOW account in the individual's name or in the “DBA” name...
30 CFR 1218.540 - How does ONRR serve official correspondence?
Code of Federal Regulations, 2012 CFR
2012-07-01
... correspondence at issue. The company or reporting entity is responsible for notifying ONRR of any name or address changes on Form MMS-4444. The addressee of record in a part 1290, appeal will be the person or...-4444, we may use the individual name and address, position title, or department name and address in our...
30 CFR 1218.540 - How does ONRR serve official correspondence?
Code of Federal Regulations, 2013 CFR
2013-07-01
... correspondence at issue. The company or reporting entity is responsible for notifying ONRR of any name or address changes on Form ONRR-4444. The addressee of record in a part 1290, appeal will be the person or...-4444, we may use the individual name and address, position title, or department name and address in our...
Perceptual Plasticity for Auditory Object Recognition
Heald, Shannon L. M.; Van Hedger, Stephen C.; Nusbaum, Howard C.
2017-01-01
In our auditory environment, we rarely experience the exact acoustic waveform twice. This is especially true for communicative signals that have meaning for listeners. In speech and music, the acoustic signal changes as a function of the talker (or instrument), speaking (or playing) rate, and room acoustics, to name a few factors. Yet, despite this acoustic variability, we are able to recognize a sentence or melody as the same across various kinds of acoustic inputs and determine meaning based on listening goals, expectations, context, and experience. The recognition process relates acoustic signals to prior experience despite variability in signal-relevant and signal-irrelevant acoustic properties, some of which could be considered as “noise” in service of a recognition goal. However, some acoustic variability, if systematic, is lawful and can be exploited by listeners to aid in recognition. Perceivable changes in systematic variability can herald a need for listeners to reorganize perception and reorient their attention to more immediately signal-relevant cues. This view is not incorporated currently in many extant theories of auditory perception, which traditionally reduce psychological or neural representations of perceptual objects and the processes that act on them to static entities. While this reduction is likely done for the sake of empirical tractability, such a reduction may seriously distort the perceptual process to be modeled. We argue that perceptual representations, as well as the processes underlying perception, are dynamically determined by an interaction between the uncertainty of the auditory signal and constraints of context. This suggests that the process of auditory recognition is highly context-dependent in that the identity of a given auditory object may be intrinsically tied to its preceding context. To argue for the flexible neural and psychological updating of sound-to-meaning mappings across speech and music, we draw upon examples of perceptual categories that are thought to be highly stable. This framework suggests that the process of auditory recognition cannot be divorced from the short-term context in which an auditory object is presented. Implications for auditory category acquisition and extant models of auditory perception, both cognitive and neural, are discussed. PMID:28588524
Drane, Daniel L; Loring, David W; Voets, Natalie L; Price, Michele; Ojemann, Jeffrey G; Willie, Jon T; Saindane, Amit M; Phatak, Vaishali; Ivanisevic, Mirjana; Millis, Scott; Helmers, Sandra L; Miller, John W; Meador, Kimford J; Gross, Robert E
2015-01-01
Patients with temporal lobe epilepsy (TLE) experience significant deficits in category-related object recognition and naming following standard surgical approaches. These deficits may result from a decoupling of core processing modules (e.g., language, visual processing, and semantic memory), due to "collateral damage" to temporal regions outside the hippocampus following open surgical approaches. We predicted that stereotactic laser amygdalohippocampotomy (SLAH) would minimize such deficits because it preserves white matter pathways and neocortical regions that are critical for these cognitive processes. Tests of naming and recognition of common nouns (Boston Naming Test) and famous persons were compared with nonparametric analyses using exact tests between a group of 19 patients with medically intractable mesial TLE undergoing SLAH (10 dominant, 9 nondominant), and a comparable series of TLE patients undergoing standard surgical approaches (n=39) using a prospective, nonrandomized, nonblinded, parallel-group design. Performance declines were significantly greater for the patients with dominant TLE who were undergoing open resection versus SLAH for naming famous faces and common nouns (F=24.3, p<0.0001, η2=0.57, and F=11.2, p<0.001, η2=0.39, respectively), and for the patients with nondominant TLE undergoing open resection versus SLAH for recognizing famous faces (F=3.9, p<0.02, η2=0.19). When examined on an individual subject basis, no SLAH patients experienced any performance declines on these measures. In contrast, 32 of the 39 patients undergoing standard surgical approaches declined on one or more measures for both object types (p<0.001, Fisher's exact test). Twenty-one of 22 left (dominant) TLE patients declined on one or both naming tasks after open resection, while 11 of 17 right (nondominant) TLE patients declined on face recognition. Preliminary results suggest (1) naming and recognition functions can be spared in TLE patients undergoing SLAH, and (2) the hippocampus does not appear to be an essential component of neural networks underlying name retrieval or recognition of common objects or famous faces. Wiley Periodicals, Inc. © 2014 International League Against Epilepsy.
Fracture Mechanics Method for Word Embedding Generation of Neural Probabilistic Linguistic Model.
Bi, Size; Liang, Xiao; Huang, Ting-Lei
2016-01-01
Word embedding, a lexical vector representation generated via the neural linguistic model (NLM), is empirically demonstrated to be appropriate for improvement of the performance of traditional language model. However, the supreme dimensionality that is inherent in NLM contributes to the problems of hyperparameters and long-time training in modeling. Here, we propose a force-directed method to improve such problems for simplifying the generation of word embedding. In this framework, each word is assumed as a point in the real world; thus it can approximately simulate the physical movement following certain mechanics. To simulate the variation of meaning in phrases, we use the fracture mechanics to do the formation and breakdown of meaning combined by a 2-gram word group. With the experiments on the natural linguistic tasks of part-of-speech tagging, named entity recognition and semantic role labeling, the result demonstrated that the 2-dimensional word embedding can rival the word embeddings generated by classic NLMs, in terms of accuracy, recall, and text visualization.
De Luca, Daniele; van Kaam, Anton H; Tingay, David G; Courtney, Sherry E; Danhaive, Olivier; Carnielli, Virgilio P; Zimmermann, Luc J; Kneyber, Martin C J; Tissieres, Pierre; Brierley, Joe; Conti, Giorgio; Pillow, Jane J; Rimensberger, Peter C
2017-08-01
Acute respiratory distress syndrome (ARDS) is undefined in neonates, despite the long-standing existing formal recognition of ARDS syndrome in later life. We describe the Neonatal ARDS Project: an international, collaborative, multicentre, and multidisciplinary project which aimed to produce an ARDS consensus definition for neonates that is applicable from the perinatal period. The definition was created through discussions between five expert members of the European Society for Paediatric and Neonatal Intensive Care; four experts of the European Society for Paediatric Research; two independent experts from the USA and two from Australia. This Position Paper provides the first consensus definition for neonatal ARDS (called the Montreux definition). We also provide expert consensus that mechanisms causing ARDS in adults and older children-namely complex surfactant dysfunction, lung tissue inflammation, loss of lung volume, increased shunt, and diffuse alveolar damage-are also present in several critical neonatal respiratory disorders. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra; Pereira, Emiliano; Schnetzer, Julia; Arvanitidis, Christos; Jensen, Lars Juhl
2016-01-01
The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15-25% and helps curators to detect terms that would otherwise have been missed. Database URL: https://extract.hcmr.gr/. © The Author(s) 2016. Published by Oxford University Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra
The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, wellmore » documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Here the comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed.« less
Jiao, Dazhi; Wild, David J
2009-02-01
This paper proposes a system that automatically extracts CYP protein and chemical interactions from journal article abstracts, using natural language processing (NLP) and text mining methods. In our system, we employ a maximum entropy based learning method, using results from syntactic, semantic, and lexical analysis of texts. We first present our system architecture and then discuss the data set for training our machine learning based models and the methods in building components in our system, such as part of speech (POS) tagging, Named Entity Recognition (NER), dependency parsing, and relation extraction. An evaluation of the system is conducted at the end, yielding very promising results: The POS, dependency parsing, and NER components in our system have achieved a very high level of accuracy as measured by precision, ranging from 85.9% to 98.5%, and the precision and the recall of the interaction extraction component are 76.0% and 82.6%, and for the overall system are 68.4% and 72.2%, respectively.
Graph Learning in Knowledge Bases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goldberg, Sean; Wang, Daisy Zhe
The amount of text data has been growing exponentially in recent years, giving rise to automatic information extraction methods that store text annotations in a database. The current state-of-theart structured prediction methods, however, are likely to contain errors and it’s important to be able to manage the overall uncertainty of the database. On the other hand, the advent of crowdsourcing has enabled humans to aid machine algorithms at scale. As part of this project we introduced pi-CASTLE , a system that optimizes and integrates human and machine computing as applied to a complex structured prediction problem involving conditional random fieldsmore » (CRFs). We proposed strategies grounded in information theory to select a token subset, formulate questions for the crowd to label, and integrate these labelings back into the database using a method of constrained inference. On both a text segmentation task over academic citations and a named entity recognition task over tweets we showed an order of magnitude improvement in accuracy gain over baseline methods.« less
Discovering Peripheral Arterial Disease Cases from Radiology Notes Using Natural Language Processing
Savova, Guergana K.; Fan, Jin; Ye, Zi; Murphy, Sean P.; Zheng, Jiaping; Chute, Christopher G.; Kullo, Iftikhar J.
2010-01-01
As part of the Electronic Medical Records and Genomics Network, we applied, extended and evaluated an open source clinical Natural Language Processing system, Mayo’s Clinical Text Analysis and Knowledge Extraction System, for the discovery of peripheral arterial disease cases from radiology reports. The manually created gold standard consisted of 223 positive, 19 negative, 63 probable and 150 unknown cases. Overall accuracy agreement between the system and the gold standard was 0.93 as compared to a named entity recognition baseline of 0.46. Sensitivity for the positive, probable and unknown cases was 0.93–0.96, and for the negative cases was 0.72. Specificity and negative predictive value for all categories were in the 90’s. The positive predictive value for the positive and unknown categories was in the high 90’s, for the negative category was 0.84, and for the probable category was 0.63. We outline the main sources of errors and suggest improvements. PMID:21347073
Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra; ...
2016-01-01
The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, wellmore » documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Here the comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed.« less
Lexical Competition in Non-Native Spoken-Word Recognition
ERIC Educational Resources Information Center
Weber, Andrea; Cutler, Anne
2004-01-01
Four eye-tracking experiments examined lexical competition in non-native spoken-word recognition. Dutch listeners hearing English fixated longer on distractor pictures with names containing vowels that Dutch listeners are likely to confuse with vowels in a target picture name ("pencil," given target "panda") than on less confusable distractors…
Crowded and Sparse Domains in Object Recognition: Consequences for Categorization and Naming
ERIC Educational Resources Information Center
Gale, Tim M.; Laws, Keith R.; Foley, Kerry
2006-01-01
Some models of object recognition propose that items from structurally crowded categories (e.g., living things) permit faster access to superordinate semantic information than structurally dissimilar categories (e.g., nonliving things), but slower access to individual object information when naming items. We present four experiments that utilize…
NASA Astrophysics Data System (ADS)
Zhang, S.; Wang, Y.; Ju, H.
2017-12-01
The interprovincial terrestrial physical geographical entities are the key areas of regional integrated management. Based on toponomy dictionaries and different thematic maps, the attributes and the spatial extent of the interprovincial terrestrial physical geographical names (ITPGN, including terrain ITPGN and water ITPGN) were extracted. The coefficient of variation and Moran's I were combined together to measure the spatial variation and spatial association of ITPGN. The influencing factors of the distribution of ITPGN and the implications for the regional management were further discussed. The results showed that 11325 ITPGN were extracted, including 7082 terrain ITPGN and 4243 water ITPGN. Hunan Province had the largest number of ITPGN in China, and Shanghai had the smallest number. The spatial variance of the terrain ITPGN was larger than that of the water ITPGN, and the ITPGN showed a significant agglomeration phenomenon in the southern part of China. Further analysis showed that the number of ITPGN was positively related with the relative elevation and the population where the relative elevation was lower than 2000m and the population was less than 50 million. But the number of ITPGN showed a negative relationship with the two factors when their values became larger, indicating a large number of unnamed entities existed in complex terrain areas and a decreasing number of terrestrial physical geographical entities in densely populated area. Based on these analysis, we suggest the government take the ITPGN as management units to realize a balance development between different parts of the entities and strengthen the geographical names census and the nomination of unnamed interprovincial physical geographical entities. This study also demonstrated that the methods of literature survey, coefficient of variation and Moran's I can be combined to enhance the understanding of the spatial pattern of ITPGN.
EPA New England Opens Nomination Period for Annual Environmental Merit Awards
The U.S. Environmental Protection Agency office in New England is accepting nominations for New England people, organizations, government entities or businesses whose environmental achievements during the past year deserve recognition.
Unstable solar lentigo: A defined separate entity.
Byrom, Lisa; Barksdale, Sarah; Weedon, David; Muir, Jim
2016-08-01
An unstable solar lentigo is a solar lentigo with areas of melanocytic hyperplasia not extending past the margin of the lesion. They are discrete, macular, pigmented lesions arising on sun-damaged skin and a subset of typical solar lentigos. Clinically they differ from usual solar lentigines in often being solitary or larger and darker than adjacent solar lentigines. These lesions are of clinical importance as they can arise in close proximity to lentigo maligna and in a single lesion there can be demonstrated changes of solar lentigo, unstable solar lentigo and lentigo maligna. These observations led us to conjecture that unstable solar lentigos could be a precursor lesion to lentigo maligna. In this article we examine the possibility that lentigo maligna can arise within a solar lentigo through an intermediate lesion, the unstable solar lentigo. We propose that the histopathological recognition of this entity will allow for future research into its behaviour and thus management. We review difficulties in the diagnosis of single cell predominant melanocytic proliferations and the concept of unstable lentigo in view of the literature and clinical experience supporting the proposal of its recognition as a separate entity. © 2016 The Australasian College of Dermatologists.
Sketching for Military Courses of Action Diagrams
2003-01-01
the glyph bar and (optionally) spoken input2. Avoiding the need for recognition in glyphs Glyphs in nuSketch systems have two parts. The ink is the...time-stamped collection of ink strokes that comprise the base- level visual representation of the glyph. The content of the glyph is an entity in...preferred having a neat symbol drawn where they wanted it. Those who had tried ink recognition systems particularly appreciated never having to
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-28
... requests or appeals on behalf of other persons or entities; individuals who are the subjects of FOIA or PA... number) information, and proof of identification; names and other information about persons who are the... oversight function. E. To appropriate agencies, entities, and persons when: 1. The Board suspects or has...
21 CFR 203.30 - Sample distribution by mail or common carrier.
Code of Federal Regulations, 2011 CFR
2011-04-01
... pharmacy of a hospital or other health care entity, by mail or common carrier, provided that: (1) The... to the pharmacy of a hospital or other health care entity is required to contain, in addition to all of the information in paragraph (b)(l) of this section, the name and address of the pharmacy of the...
21 CFR 203.30 - Sample distribution by mail or common carrier.
Code of Federal Regulations, 2013 CFR
2013-04-01
... pharmacy of a hospital or other health care entity, by mail or common carrier, provided that: (1) The... to the pharmacy of a hospital or other health care entity is required to contain, in addition to all of the information in paragraph (b)(l) of this section, the name and address of the pharmacy of the...
21 CFR 203.30 - Sample distribution by mail or common carrier.
Code of Federal Regulations, 2012 CFR
2012-04-01
... pharmacy of a hospital or other health care entity, by mail or common carrier, provided that: (1) The... to the pharmacy of a hospital or other health care entity is required to contain, in addition to all of the information in paragraph (b)(l) of this section, the name and address of the pharmacy of the...
21 CFR 203.30 - Sample distribution by mail or common carrier.
Code of Federal Regulations, 2014 CFR
2014-04-01
... pharmacy of a hospital or other health care entity, by mail or common carrier, provided that: (1) The... to the pharmacy of a hospital or other health care entity is required to contain, in addition to all of the information in paragraph (b)(l) of this section, the name and address of the pharmacy of the...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-26
... Entity Compliance Guide. SUMMARY: This document is issued under the joint authority of DOD, GSA, and NASA... whose name appears in the table below. Please cite FAC 2005-68 and the FAR case number. For information... Listed in FAC 2005-68 Subject FAR Case Analyst *Expansion of Applicability of the Senior Executive...
Morais, António; Moura, M Conceição Souto; Cruz, M Rosa; Gomes, Isabel
2004-01-01
Nonspecific interstitial pneumonitis (NSIP) initially described by Katzenstein and Fiorelli in 1994, seems to be a distinct clinicopathologic entity among idiopathic interstitial pneumonitis (IIP). Besides different histologic features from other IIP, NSIP is characterized by a better long-term outcome, associated with a better steroids responsiveness than idiopathic pulmonar fibrosis (IPF), where usually were included. Thus, differentiating NSIP from other IIP, namely IPF is very significant, since it has important therapeutic and prognostic implications. NSIP encloses different pathologies, namely those with inflammatory predominance (cellular subtype) or fibrous predominance (fibrosing subtype). NSIP is reviewed and discussed by the authors, after two clinical cases description.
Luzzi, Simona; Baldinelli, Sara; Ranaldi, Valentina; Fabi, Katia; Cafazzo, Viviana; Fringuelli, Fabio; Silvestrini, Mauro; Provinciali, Leandro; Reverberi, Carlo; Gainotti, Guido
2017-01-08
Famous face and voice recognition is reported to be impaired both in semantic dementia (SD) and in Alzheimer's Disease (AD), although more severely in the former. In AD a coexistence of perceptual impairment in face and voice processing has also been reported and this could contribute to the altered performance in complex semantic tasks. On the other hand, in SD both face and voice recognition disorders could be related to the prevalence of atrophy in the right temporal lobe (RTL). The aim of the present study was twofold: (1) to investigate famous faces and voices recognition in SD and AD to verify if the two diseases show a differential pattern of impairment, resulting from disruption of different cognitive mechanisms; (2) to check if face and voice recognition disorders prevail in patients with atrophy mainly affecting the RTL. To avoid the potential influence of primary perceptual problems in face and voice recognition, a pool of patients suffering from early SD and AD were administered a detailed set of tests exploring face and voice perception. Thirteen SD (8 with prevalence of right and 5 with prevalence of left temporal atrophy) and 25 CE patients, who did not show visual and auditory perceptual impairment, were finally selected and were administered an experimental battery exploring famous face and voice recognition and naming. Twelve SD patients underwent cerebral PET imaging and were classified in right and left SD according to the onset modality and to the prevalent decrease in FDG uptake in right or left temporal lobe respectively. Correlation of PET imaging and famous face and voice recognition was performed. Results showed a differential performance profile in the two diseases, because AD patients were significantly impaired in the naming tests, but showed preserved recognition, whereas SD patients were profoundly impaired both in naming and in recognition of famous faces and voices. Furthermore, face and voice recognition disorders prevailed in SD patients with RTL atrophy, who also showed a conceptual impairment on the Pyramids and Palm Trees test more important in the pictorial than in the verbal modality. Finally, in 12SD patients in whom PET was available, a strong correlation between FDG uptake and face-to-name and voice-to-name matching data was found in the right but not in the left temporal lobe. The data support the hypothesis of a different cognitive basis for impairment of face and voice recognition in the two dementias and suggest that the pattern of impairment in SD may be due to a loss of semantic representations, while a defect of semantic control, with impaired naming and preserved recognition might be hypothesized in AD. Furthermore, the correlation between face and voice recognition disorders and RTL damage are consistent with the hypothesis assuming that in the RTL person-specific knowledge may be mainly based upon non-verbal representations. Copyright © 2016 Elsevier Ltd. All rights reserved.
Extraction of Pharmacokinetic Evidence of Drug–Drug Interactions from the Literature
Kolchinsky, Artemy; Lourenço, Anália; Wu, Heng-Yi; Li, Lang; Rocha, Luis M.
2015-01-01
Drug-drug interaction (DDI) is a major cause of morbidity and mortality and a subject of intense scientific interest. Biomedical literature mining can aid DDI research by extracting evidence for large numbers of potential interactions from published literature and clinical databases. Though DDI is investigated in domains ranging in scale from intracellular biochemistry to human populations, literature mining has not been used to extract specific types of experimental evidence, which are reported differently for distinct experimental goals. We focus on pharmacokinetic evidence for DDI, essential for identifying causal mechanisms of putative interactions and as input for further pharmacological and pharmacoepidemiology investigations. We used manually curated corpora of PubMed abstracts and annotated sentences to evaluate the efficacy of literature mining on two tasks: first, identifying PubMed abstracts containing pharmacokinetic evidence of DDIs; second, extracting sentences containing such evidence from abstracts. We implemented a text mining pipeline and evaluated it using several linear classifiers and a variety of feature transforms. The most important textual features in the abstract and sentence classification tasks were analyzed. We also investigated the performance benefits of using features derived from PubMed metadata fields, various publicly available named entity recognizers, and pharmacokinetic dictionaries. Several classifiers performed very well in distinguishing relevant and irrelevant abstracts (reaching F1≈0.93, MCC≈0.74, iAUC≈0.99) and sentences (F1≈0.76, MCC≈0.65, iAUC≈0.83). We found that word bigram features were important for achieving optimal classifier performance and that features derived from Medical Subject Headings (MeSH) terms significantly improved abstract classification. We also found that some drug-related named entity recognition tools and dictionaries led to slight but significant improvements, especially in classification of evidence sentences. Based on our thorough analysis of classifiers and feature transforms and the high classification performance achieved, we demonstrate that literature mining can aid DDI discovery by supporting automatic extraction of specific types of experimental evidence. PMID:25961290
77 FR 63217 - Use of Additional Portable Oxygen Concentrators on Board Aircraft
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-16
..., organizations, and governmental jurisdictions subject to regulation. To achieve this principle, agencies are... small entities, including small businesses, not-for-profit organizations, and small governmental... manufacturer's names. In this final rule, the FAA will add those previous manufacturer's names (International...
49 CFR 26.73 - What are other rules affecting certification?
Code of Federal Regulations, 2011 CFR
2011-10-01
... decisions, whether a firm has exhibited a pattern of conduct indicating its involvement in attempts to evade... cannot be certified. (f) Recognition of a business as a separate entity for tax or corporate purposes is...
49 CFR 26.73 - What are other rules affecting certification?
Code of Federal Regulations, 2010 CFR
2010-10-01
... decisions, whether a firm has exhibited a pattern of conduct indicating its involvement in attempts to evade... cannot be certified. (f) Recognition of a business as a separate entity for tax or corporate purposes is...
Recognition of names of eminent psychologists.
Duncan, C P
1976-10-01
Faculty members, graduate students, undergraduate majors, and introductory psychology students checked those names they recognized in the list of 228 deceased psychologists, rated for eminence, provided by Annin, Boring, and Watson. Mean percentage recognition was less than 50% for the 128 American psychologists, and less than 25% for the 100 foreign psychologists, by the faculty subjects. The other three groups of subjects gave even lower recognition scores. Recognition was probably also influenced by recency; median year of death of the American psychologists was 1955, of the foreign psychologists, 1943. High recognition (defined as recognition by 80% or more of the faculty group) was achieved by only 34 psychologists, almost all of them American. These highly recognized psychologists also had high eminence ratings, but there was an equal number of psychologists with high eminence ratings that were poorly recognized.
Eye movements during object recognition in visual agnosia.
Charles Leek, E; Patterson, Candy; Paul, Matthew A; Rafal, Robert; Cristino, Filipe
2012-07-01
This paper reports the first ever detailed study about eye movement patterns during single object recognition in visual agnosia. Eye movements were recorded in a patient with an integrative agnosic deficit during two recognition tasks: common object naming and novel object recognition memory. The patient showed normal directional biases in saccades and fixation dwell times in both tasks and was as likely as controls to fixate within object bounding contour regardless of recognition accuracy. In contrast, following initial saccades of similar amplitude to controls, the patient showed a bias for short saccades. In object naming, but not in recognition memory, the similarity of the spatial distributions of patient and control fixations was modulated by recognition accuracy. The study provides new evidence about how eye movements can be used to elucidate the functional impairments underlying object recognition deficits. We argue that the results reflect a breakdown in normal functional processes involved in the integration of shape information across object structure during the visual perception of shape. Copyright © 2012 Elsevier Ltd. All rights reserved.
46 CFR 520.3 - Publication responsibilities.
Code of Federal Regulations, 2013 CFR
2013-10-01
... tariff, of its organization name, organization number, home office address, name and telephone number of... tariffs, by electronically submitting Form FMC-1 via the Commission's website at www.fmc.gov. Any changes... unique organization number to new entities operating as common carriers or conferences in the U.S...
46 CFR 520.3 - Publication responsibilities.
Code of Federal Regulations, 2014 CFR
2014-10-01
... tariff, of its organization name, organization number, home office address, name and telephone number of... tariffs, by electronically submitting Form FMC-1 via the Commission's website at www.fmc.gov. Any changes... unique organization number to new entities operating as common carriers or conferences in the U.S...
46 CFR 520.3 - Publication responsibilities.
Code of Federal Regulations, 2011 CFR
2011-10-01
... tariff, of its organization name, organization number, home office address, name and telephone number of... tariffs, by electronically submitting Form FMC-1 via the Commission's website at www.fmc.gov. Any changes... unique organization number to new entities operating as common carriers or conferences in the U.S...
46 CFR 520.3 - Publication responsibilities.
Code of Federal Regulations, 2012 CFR
2012-10-01
... tariff, of its organization name, organization number, home office address, name and telephone number of... tariffs, by electronically submitting Form FMC-1 via the Commission's website at www.fmc.gov. Any changes... unique organization number to new entities operating as common carriers or conferences in the U.S...
75 FR 18887 - FBI Criminal Justice Information Services Division User Fees
Federal Register 2010, 2011, 2012, 2013, 2014
2010-04-13
.... SUMMARY: This notice establishes the user fee schedule for fingerprint- based and name-based criminal... fingerprint-based and other identification services as authorized by federal law. These fees apply to federal, state and any other authorized entities requesting fingerprint identification records and name checks...
State-of-the-art Anonymization of Medical Records Using an Iterative Machine Learning Framework
Szarvas, György; Farkas, Richárd; Busa-Fekete, Róbert
2007-01-01
Objective The anonymization of medical records is of great importance in the human life sciences because a de-identified text can be made publicly available for non-hospital researchers as well, to facilitate research on human diseases. Here the authors have developed a de-identification model that can successfully remove personal health information (PHI) from discharge records to make them conform to the guidelines of the Health Information Portability and Accountability Act. Design We introduce here a novel, machine learning-based iterative Named Entity Recognition approach intended for use on semi-structured documents like discharge records. Our method identifies PHI in several steps. First, it labels all entities whose tags can be inferred from the structure of the text and it then utilizes this information to find further PHI phrases in the flow text parts of the document. Measurements Following the standard evaluation method of the first Workshop on Challenges in Natural Language Processing for Clinical Data, we used token-level Precision, Recall and Fβ=1 measure metrics for evaluation. Results Our system achieved outstanding accuracy on the standard evaluation dataset of the de-identification challenge, with an F measure of 99.7534% for the best submitted model. Conclusion We can say that our system is competitive with the current state-of-the-art solutions, while we describe here several techniques that can be beneficial in other tasks that need to handle structured documents such as clinical records. PMID:17823086
31 CFR 535.508 - Payments to blocked accounts in domestic banks.
Code of Federal Regulations, 2010 CFR
2010-07-01
... Iran or any Iranian entity is hereby authorized: Provided, Such payment or transfer shall not be made... the interest of Iran or an Iranian entity to any other country or person. (b) This section does not authorize: (1) Any payment or transfer to any blocked account held in a name other than that of Iran or the...
31 CFR 535.508 - Payments to blocked accounts in domestic banks.
Code of Federal Regulations, 2011 CFR
2011-07-01
... Iran or any Iranian entity is hereby authorized: Provided, Such payment or transfer shall not be made... the interest of Iran or an Iranian entity to any other country or person. (b) This section does not authorize: (1) Any payment or transfer to any blocked account held in a name other than that of Iran or the...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-31
...: Small Entity Compliance Guide. SUMMARY: This document is issued under the joint authority of DOD, GSA..., contact the analyst whose name appears in the table below. Please cite FAC 2005-72 and the FAR case number... 202- 501-4755. Rules Listed in FAC 2005-72 Item Subject FAR Case Analyst *I Service 2010-010 Loeb...
Social Media: More Than Just a Communications Medium
2012-03-14
video-hosting web services with the recognition that “Internet-based capabilities are integral to operations across the Department of Defense.”10...as DoD and the government as a whole, the U.S. Army’s recognition of social media’s unique relationship to time and speed is a step forward toward...populated size of social media entities, Alexa , the leader in free global web analytics, provides an updated list of the top 500 websites on the Internet
Eye Movements to Pictures Reveal Transient Semantic Activation during Spoken Word Recognition
ERIC Educational Resources Information Center
Yee, Eiling; Sedivy, Julie C.
2006-01-01
Two experiments explore the activation of semantic information during spoken word recognition. Experiment 1 shows that as the name of an object unfolds (e.g., lock), eye movements are drawn to pictorial representations of both the named object and semantically related objects (e.g., key). Experiment 2 shows that objects semantically related to an…
I undervalue you but I need you: the dissociation of attitude and memory toward in-group members.
Zhao, Ke; Wu, Qi; Shen, Xunbing; Xuan, Yuming; Fu, Xiaolan
2012-01-01
In the present study, the in-group bias or in-group derogation among Mainland Chinese was investigated through a rating task and a recognition test. In two experiments,participants from two universities with similar ranks rated novel faces or names and then had a recognition test. Half of the faces or names were labeled as participants' own university and the other half were labeled as their counterpart. Results showed that, for either faces or names, rating scores for out-group members were consistently higher than those for in-group members, whereas the recognition accuracy showed just the opposite. These results indicated that the attitude and memory for group-relevant information might be dissociated among Mainland Chinese.
I Undervalue You but I Need You: The Dissociation of Attitude and Memory Toward In-Group Members
Zhao, Ke; Wu, Qi; Shen, Xunbing; Xuan, Yuming; Fu, Xiaolan
2012-01-01
In the present study, the in-group bias or in-group derogation among mainland Chinese was investigated through a rating task and a recognition test. In two experiments,participants from two universities with similar ranks rated novel faces or names and then had a recognition test. Half of the faces or names were labeled as participants' own university and the other half were labeled as their counterpart. Results showed that, for either faces or names, rating scores for out-group members were consistently higher than those for in-group members, whereas the recognition accuracy showed just the opposite. These results indicated that the attitude and memory for group-relevant information might be dissociated among Mainland Chinese. PMID:22412955
Nomination Period Extended for EPA’s New England Annual Environmental Merit Awards
The U.S. Environmental Protection Agency’s New England office has extended the deadline to submit nominations for New England people, organizations, government entities or businesses whose environmental achievements during the past year deserve recognition
NASA Astrophysics Data System (ADS)
Levchuk, Georgiy; Shabarekh, Charlotte; Furjanic, Caitlin
2011-06-01
In this paper, we present results of adversarial activity recognition using data collected in the Empire Challenge (EC 09) exercise. The EC09 experiment provided an opportunity to evaluate our probabilistic spatiotemporal mission recognition algorithms using the data from live air-born and ground sensors. Using ambiguous and noisy data about locations of entities and motion events on the ground, the algorithms inferred the types and locations of OPFOR activities, including reconnaissance, cache runs, IED emplacements, logistics, and planning meetings. In this paper, we present detailed summary of the validation study and recognition accuracy results. Our algorithms were able to detect locations and types of over 75% of hostile activities in EC09 while producing 25% false alarms.
The "Decorative" Female Model: Sexual Stimuli and the Recognition of Advertisements
ERIC Educational Resources Information Center
LaChance, Charles C.; And Others
1977-01-01
Examines the impact of the decorative or functionless female models in print advertising and indicates that models facilitate recognition of model/related information but do little to increase the recognition of brand names.
Semantic Memory in the Clinical Progression of Alzheimer Disease.
Tchakoute, Christophe T; Sainani, Kristin L; Henderson, Victor W
2017-09-01
Semantic memory measures may be useful in tracking and predicting progression of Alzheimer disease. We investigated relationships among semantic memory tasks and their 1-year predictive value in women with Alzheimer disease. We conducted secondary analyses of a randomized clinical trial of raloxifene in 42 women with late-onset mild-to-moderate Alzheimer disease. We assessed semantic memory with tests of oral confrontation naming, category fluency, semantic recognition and semantic naming, and semantic density in written narrative discourse. We measured global cognition (Alzheimer Disease Assessment Scale, cognitive subscale), dementia severity (Clinical Dementia Rating sum of boxes), and daily function (Activities of Daily Living Inventory) at baseline and 1 year. At baseline and 1 year, most semantic memory scores correlated highly or moderately with each other and with global cognition, dementia severity, and daily function. Semantic memory task performance at 1 year had worsened one-third to one-half standard deviation. Factor analysis of baseline test scores distinguished processes in semantic and lexical retrieval (semantic recognition, semantic naming, confrontation naming) from processes in lexical search (semantic density, category fluency). The semantic-lexical retrieval factor predicted global cognition at 1 year. Considered separately, baseline confrontation naming and category fluency predicted dementia severity, while semantic recognition and a composite of semantic recognition and semantic naming predicted global cognition. No individual semantic memory test predicted daily function. Semantic-lexical retrieval and lexical search may represent distinct aspects of semantic memory. Semantic memory processes are sensitive to cognitive decline and dementia severity in Alzheimer disease.
Recognition of Famous Names Predicts Episodic Memory Decline in Cognitively Intact Elders
Seidenberg, Michael; Kay, Christina; Woodard, John L.; Nielson, Kristy A.; Smith, J. Carson; Kandah, Cassandra; Guidotti Breting, Leslie M.; Novitski, Julia; Lancaster, Melissa; Matthews, Monica; Hantke, Nathan; Butts, Alissa; Rao, Stephen M.
2013-01-01
Objective: Semantic memory impairment is common in both Mild Cognitive Impairment (MCI) and early Alzheimer’s disease (AD), and the ability to recognize familiar people is particularly vulnerable. A time-limited temporal gradient (TG) in which well known people from decades earlier are better recalled than those learned recently is also reported in both AD and MCI. In this study, we hypothesized that the TG pattern on a famous name recognition task (FNRT) administered to cognitively intact elders would predict future episodic memory decline, and would also show a significant correlation with hippocampal volume. Methods: 78 healthy elders (ages 65-90) with normal cognition and episodic memory at baseline were administered a FNRT. Follow-up episodic memory testing 18 months later produced two groups: Declining (≥ 1 SD reduction in episodic memory) and Stable (< 1 SD). Results: The Declining group (N=27) recognized fewer recent famous names than the Stable group (N=51), while recognition for remote names was comparable. Baseline MRI volumes for both the left and right hippocampus was significantly smaller in the Declining group than the Stable group. Smaller baseline hippocampal volume was also significantly correlated with poorer performance for recent, but not remote famous names. Logistic regression analyses indicated that baseline TG performance was a significant predictor of group status (Declining versus Stable) independent of chronological age and APOE ε4 inheritance. Conclusions: Famous name recognition may serve as an early pre-clinical cognitive marker of episodic memory decline in older individuals. PMID:23688215
GASB Achieves Standardization, Recognition.
ERIC Educational Resources Information Center
Bissell, George E.
1986-01-01
In 1984 the Governmental Accounting Standards Board, created to solidify accounting principles for government entities, enumerated Generally Accepted Accounting Principles endorsed by the American Institute of Certified Public Accountants and the National Council on Governmental Accounting. These principles have recently been approved for school…
Combination of Evidence for Effective Web Search
2010-11-01
SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S) 11. SPONSOR/MONITOR’S REPORT NUMBER(S) 12. DISTRIBUTION /AVAILABILITY...STATEMENT Approved for public release; distribution unlimited 13. SUPPLEMENTARY NOTES Presented at the Nineteenth Text REtrieval Conference (TREC...use that page to expand. This happens often with named entity queries (such as ‘the secret garden’ or ‘ starbucks ’). However, when the query is
Elucidation of metabolic pathways from enzyme classification data.
McDonald, Andrew G; Tipton, Keith F
2014-01-01
The IUBMB Enzyme List is widely used by other databases as a source for avoiding ambiguity in the recognition of enzymes as catalytic entities. However, it was not designed for metabolic pathway tracing, which has become increasingly important in systems biology. A Reactions Database has been created from the material in the Enzyme List to allow reactions to be searched by substrate/product, and pathways to be traced from any selected starting/seed substrate. An extensive synonym glossary allows searches by many of the alternative names, including accepted abbreviations, by which a chemical compound may be known. This database was necessary for the development of the application Reaction Explorer ( http://www.reaction-explorer.org ), which was written in Real Studio ( http://www.realsoftware.com/realstudio/ ) to search the Reactions Database and draw metabolic pathways from reactions selected by the user. Having input the name of the starting compound (the "seed"), the user is presented with a list of all reactions containing that compound and then selects the product of interest as the next point on the ensuing graph. The pathway diagram is then generated as the process iterates. A contextual menu is provided, which allows the user: (1) to remove a compound from the graph, along with all associated links; (2) to search the reactions database again for additional reactions involving the compound; (3) to search for the compound within the Enzyme List.
A neural joint model for entity and relation extraction from biomedical text.
Li, Fei; Zhang, Meishan; Fu, Guohong; Ji, Donghong
2017-03-31
Extracting biomedical entities and their relations from text has important applications on biomedical research. Previous work primarily utilized feature-based pipeline models to process this task. Many efforts need to be made on feature engineering when feature-based models are employed. Moreover, pipeline models may suffer error propagation and are not able to utilize the interactions between subtasks. Therefore, we propose a neural joint model to extract biomedical entities as well as their relations simultaneously, and it can alleviate the problems above. Our model was evaluated on two tasks, i.e., the task of extracting adverse drug events between drug and disease entities, and the task of extracting resident relations between bacteria and location entities. Compared with the state-of-the-art systems in these tasks, our model improved the F1 scores of the first task by 5.1% in entity recognition and 8.0% in relation extraction, and that of the second task by 9.2% in relation extraction. The proposed model achieves competitive performances with less work on feature engineering. We demonstrate that the model based on neural networks is effective for biomedical entity and relation extraction. In addition, parameter sharing is an alternative method for neural models to jointly process this task. Our work can facilitate the research on biomedical text mining.
Maurer, Leonie; Zitting, Kirsi-Marja; Elliott, Kieran; Czeisler, Charles A.; Ronda, Joseph M.; Duffy, Jeanne F.
2015-01-01
Sleep has been demonstrated to improve consolidation of many types of new memories. However, few prior studies have examined how sleep impacts learning of face-name associations. The recognition of a new face along with the associated name is an important human cognitive skill. Here we investigated whether post-presentation sleep impacts recognition memory of new face-name associations in healthy adults. Fourteen participants were tested twice. Each time, they were presented 20 photos of faces with a corresponding name. Twelve hours later, they were shown each face twice, once with the correct and once with an incorrect name, and asked if each face-name combination was correct and to rate their confidence. In one condition the 12-hour interval between presentation and recall included an 8-hour nighttime sleep opportunity (“Sleep”), while in the other condition they remained awake (“Wake”). There were more correct and highly confident correct responses when the interval between presentation and recall included a sleep opportunity, although improvement between the “Wake” and “Sleep” conditions was not related to duration of sleep or any sleep stage. These data suggest that a nighttime sleep opportunity improves the ability to correctly recognize face-name associations. Further studies investigating the mechanism of this improvement are important, as this finding has implications for individuals with sleep disturbances and/or memory impairments. PMID:26549626
Maurer, Leonie; Zitting, Kirsi-Marja; Elliott, Kieran; Czeisler, Charles A; Ronda, Joseph M; Duffy, Jeanne F
2015-12-01
Sleep has been demonstrated to improve consolidation of many types of new memories. However, few prior studies have examined how sleep impacts learning of face-name associations. The recognition of a new face along with the associated name is an important human cognitive skill. Here we investigated whether post-presentation sleep impacts recognition memory of new face-name associations in healthy adults. Fourteen participants were tested twice. Each time, they were presented 20 photos of faces with a corresponding name. Twelve hours later, they were shown each face twice, once with the correct and once with an incorrect name, and asked if each face-name combination was correct and to rate their confidence. In one condition the 12-h interval between presentation and recall included an 8-h nighttime sleep opportunity ("Sleep"), while in the other condition they remained awake ("Wake"). There were more correct and highly confident correct responses when the interval between presentation and recall included a sleep opportunity, although improvement between the "Wake" and "Sleep" conditions was not related to duration of sleep or any sleep stage. These data suggest that a nighttime sleep opportunity improves the ability to correctly recognize face-name associations. Further studies investigating the mechanism of this improvement are important, as this finding has implications for individuals with sleep disturbances and/or memory impairments. Copyright © 2015 Elsevier Inc. All rights reserved.
Renoult, Louis; Davidson, Patrick S R; Schmitz, Erika; Park, Lillian; Campbell, Kenneth; Moscovitch, Morris; Levine, Brian
2015-01-01
A common assertion is that semantic memory emerges from episodic memory, shedding the distinctive contexts associated with episodes over time and/or repeated instances. Some semantic concepts, however, may retain their episodic origins or acquire episodic information during life experiences. The current study examined this hypothesis by investigating the ERP correlates of autobiographically significant (AS) concepts, that is, semantic concepts that are associated with vivid episodic memories. We inferred the contribution of semantic and episodic memory to AS concepts using the amplitudes of the N400 and late positive component, respectively. We compared famous names that easily brought to mind episodic memories (high AS names) against equally famous names that did not bring such recollections to mind (low AS names) on a semantic task (fame judgment) and an episodic task (recognition memory). Compared with low AS names, high AS names were associated with increased amplitude of the late positive component in both tasks. Moreover, in the recognition task, this effect of AS was highly correlated with recognition confidence. In contrast, the N400 component did not differentiate the high versus low AS names but, instead, was related to the amount of general knowledge participants had regarding each name. These results suggest that semantic concepts high in AS, such as famous names, have an episodic component and are associated with similar brain processes to those that are engaged by episodic memory. Studying AS concepts may provide unique insights into how episodic and semantic memory interact.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-27
... Antonio, Texas; Kmart Corporation of Hoffman Estates, Illinois; Sears Brands Management Corporation, Sears... Specialty Brands, LLC, (2) change the name of Respondent Rankam Group to Rankam Metal Products Manufactory... Kamado Joe Company is a trade name for the legal entity Premier Specialty Brands, LLC; Rankam Metal...
49 CFR Appendix E to Part 512 - Consumer Assistance to Recycle and Save (CARS) Class Determinations
Code of Federal Regulations, 2013 CFR
2013-10-01
... of the new vehicle owner's name, home address, telephone number, state identification number and last... harm to the competitive position of the entity submitting the information: (1) Vehicle Manufacturer Issued Dealer Identification Code; (2) Dealer Bank Name, ABA Routing Number and Bank Account Number; and...
77 FR 31356 - Pesticide Products; Receipt of Applications To Register New Uses
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-25
... Number: EPA-HQ-OPP-2012- 0241. Company name and address: Bayer CropScience LP, 2 T. W. Alexander Drive.... Registration Number: 264-825. Docket Number: EPA-HQ-OPP-2012- 0325. Company name and address: Bayer CropScience... pesticide manufacturer. Potentially affected entities may include, but are not limited to: Crop production...
Category-Specific Naming and Recognition Deficits in Temporal Lobe Epilepsy Surgical Patients
Drane, Daniel L.; Ojemann, George A.; Aylward, Elizabeth; Ojemann, Jeffrey G.; Johnson, L. Clark; Silbergeld, Daniel L.; Miller, John W.; Tranel, Daniel
2008-01-01
Objective Based upon Damasio's “Convergence Zone” model of semantic memory, we predicted that epilepsy surgical patients with anterior temporal lobe (TL) seizure onset would exhibit a pattern of category-specific naming and recognition deficits not observed in patients with seizures arising elsewhere. Methods We assessed epilepsy patients with unilateral seizure onset of anterior TL or other origin (n = 22), pre- or postoperatively, using a set of category-specific items and a conventional measure of visual naming (Boston Naming Test: BNT). Results Category-specific naming deficits were exhibited by patients with dominant anterior TL seizure onset/resection for famous faces and animals, while category-specific recognition deficits for these same categories were exhibited by patients with nondominant anterior TL onset/resection. Patients with other seizure onset did not exhibit category-specific deficits. Naming and recognition deficits were frequently not detected by the BNT, which samples only a limited range of stimuli. Interpretation Consistent with the “convergence zone” framework, results suggest that the nondominant anterior TL plays a major role in binding sensory information into conceptual percepts for certain stimuli, while dominant TL regions function to provide a link to verbal labels for these percepts. Although observed category-specific deficits were striking, they were often missed by the BNT, suggesting that they are more prevalent than recognized in both pre- and postsurgical epilepsy patients. Systematic investigation of these deficits could lead to more refined models of semantic memory, aid in the localization of seizures, and contribute to modifications in surgical technique and patient selection in epilepsy surgery to improve neurocognitive outcome. PMID:18206185
Kaewphan, Suwisa; Van Landeghem, Sofie; Ohta, Tomoko; Van de Peer, Yves; Ginter, Filip; Pyysalo, Sampo
2016-01-01
Motivation: The recognition and normalization of cell line names in text is an important task in biomedical text mining research, facilitating for instance the identification of synthetically lethal genes from the literature. While several tools have previously been developed to address cell line recognition, it is unclear whether available systems can perform sufficiently well in realistic and broad-coverage applications such as extracting synthetically lethal genes from the cancer literature. In this study, we revisit the cell line name recognition task, evaluating both available systems and newly introduced methods on various resources to obtain a reliable tagger not tied to any specific subdomain. In support of this task, we introduce two text collections manually annotated for cell line names: the broad-coverage corpus Gellus and CLL, a focused target domain corpus. Results: We find that the best performance is achieved using NERsuite, a machine learning system based on Conditional Random Fields, trained on the Gellus corpus and supported with a dictionary of cell line names. The system achieves an F-score of 88.46% on the test set of Gellus and 85.98% on the independently annotated CLL corpus. It was further applied at large scale to 24 302 102 unannotated articles, resulting in the identification of 5 181 342 cell line mentions, normalized to 11 755 unique cell line database identifiers. Availability and implementation: The manually annotated datasets, the cell line dictionary, derived corpora, NERsuite models and the results of the large-scale run on unannotated texts are available under open licenses at http://turkunlp.github.io/Cell-line-recognition/. Contact: sukaew@utu.fi PMID:26428294
Bonin, Patrick; Guillemard-Tsaparina, Diana; Méot, Alain
2013-09-01
We report object-naming and object recognition times collected from Russian native speakers for the colorized version of the Snodgrass and Vanderwart (Journal of Experimental Psychology: Human Learning and Memory 6:174-215, 1980) pictures (Rossion & Pourtois, Perception 33:217-236, 2004). New norms for image variability, body-object interaction [BOI], and subjective frequency collected in Russian, as well as new name agreement scores for the colorized pictures in French, are also reported. In both object-naming and object comprehension times, the name agreement, image agreement, and age-of-acquisition variables made significant independent contributions. Objective word frequency was reliable in object-naming latencies only. The variables of image variability, BOI, and subjective frequency were not significant in either object naming or object comprehension. Finally, imageability was reliable in both tasks. The new norms and object-naming and object recognition times are provided as supplemental materials.
Spaced-retrieval effects on name-face recognition in older adults with probable Alzheimer's disease.
Hawley, Karri S; Cherry, Katie E
2004-03-01
Six older adults with probable Alzheimer's disease (AD) were trained to recall a name-face association using the spaced-retrieval method. We administered six training sessions over a 2-week period. On each trial, participants selected a target photograph and stated the target name, from eight other photographs, at increasingly longer retention intervals. Results yielded a positive effect of spaced-retrieval training for name-face recognition. All participants were able to select the target photograph and state the target's name for longer periods of time within and across training sessions. A live-person transfer task was administered to determine whether the name-face association, trained by spaced-retrieval, would transfer to a live person. Half of the participants were able to call the live person by the correct name. These data provide initial evidence that spaced-retrieval training can aid older adults with probable AD in recall of a name-face association and in transfer of that association to an actual person.
A Statistical Model for Multilingual Entity Detection and Tracking
2004-01-01
tomatic Content Extraction ( ACE ) evaluation achieved top-tier results in all three evaluation languages. 1 Introduction Detecting entities, whether named...of com- bining the detected mentions into groups of references to the same object. The work presented here is motivated by the ACE eval- uation...Entropy (MaxEnt henceforth) (Berger et al., 1996) and Robust Risk Minimization (RRM henceforth) 1For a description of the ACE program see http
Improving the Accuracy of Attribute Extraction using the Relatedness between Attribute Values
NASA Astrophysics Data System (ADS)
Bollegala, Danushka; Tani, Naoki; Ishizuka, Mitsuru
Extracting attribute-values related to entities from web texts is an important step in numerous web related tasks such as information retrieval, information extraction, and entity disambiguation (namesake disambiguation). For example, for a search query that contains a personal name, we can not only return documents that contain that personal name, but if we have attribute-values such as the organization for which that person works, we can also suggest documents that contain information related to that organization, thereby improving the user's search experience. Despite numerous potential applications of attribute extraction, it remains a challenging task due to the inherent noise in web data -- often a single web page contains multiple entities and attributes. We propose a graph-based approach to select the correct attribute-values from a set of candidate attribute-values extracted for a particular entity. First, we build an undirected weighted graph in which, attribute-values are represented by nodes, and the edge that connects two nodes in the graph represents the degree of relatedness between the corresponding attribute-values. Next, we find the maximum spanning tree of this graph that connects exactly one attribute-value for each attribute-type. The proposed method outperforms previously proposed attribute extraction methods on a dataset that contains 5000 web pages.
Fielding, R; Chee, Y Y; Choi, K M; Chu, T K; Kato, K; Lam, S K; Sin, K L; Tang, K T; Wong, H M; Wong, K M
2004-03-01
We compared the recognition of tobacco brands and ever-smoking rates in young children before (1991) and after (2001) the implementation of cigarette advertising restrictions in Hong Kong and identified continuing sources of tobacco promotion exposure. A cross-sectional survey of 824 primary school children aged from 8 to 11 (Primary classes 3-4) living in two Hong Kong districts was carried out using self-completed questionnaires examining smoking behaviour and recognition of names and logos from 18 tobacco, food, drink and other brands common in Hong Kong. Ever-smoking prevalence in 2001 was 3.8 per cent (1991, 7.8 per cent). Tobacco brand recognition rates ranged from 5.3 per cent (Viceroy name) to 72.8 per cent (Viceroy logo). Compared with 1991, in 2001 never-smoker children recognized fewer tobacco brand names and logos: Marlboro logo recognition rate fell by 55.3 per cent. Similar declines were also seen in ever-smoker children, with recognition of the Marlboro logo decreasing 48 per cent. Recognition rates declined amongst both boys and girls. Children from non-smoking families constituted 51 per cent (426) of the sample, whereas 34.5 per cent (284), 8.5 per cent (70), 1.7 per cent (14) and 4.4 per cent (36) of the children had one, two, three or more than three smoking family members at home, respectively. Tobacco brand recognition rates and ever-smoking prevalence were significantly higher among children with smoking family members compared with those without. Among 12 possible sources of exposure to cigarette brand names and logos, retail stalls (75.5 per cent; 622), indirect advertisements (71.5 per cent; 589) and magazines (65.3 per cent; 538) were ranked the most common. Advertising restrictions in Hong Kong have effectively decreased primary-age children's recognition of tobacco branding. However, these children remain vulnerable to branding, mostly through exposure from family smokers, point-of-sale tobacco advertisement and occasional promotions. Action to curb these is now required.
Developing Connectivist Schemas for Geological and Geomorphological Education
NASA Astrophysics Data System (ADS)
Whalley, B.
2012-12-01
Teaching geology is difficult; students need to grasp changes in time over three dimensions. Furthermore, the scales and rates of change in four dimensions may vary over several orders of magnitude. Geological explanations incorporate ideas from physics, chemistry, biology and engineering, lectures and textbooks provide a basic framework but they need to be amplified by laboratories and fieldwork involving active student participation and engagement. Being shown named 'things' is only a start to being able to being able to inculcate geological thinking that requires a wide and focused viewpoints. Kastens and Ishikawa (2006) suggested five aspects of thinking geologically, summarised as: 1. Observing, describing, recording, communicating geologically entities (ie basic cognitive skills) 2. (mentally) manipulating these entities 3. interpreting them via causal relationships 4. predicting other aspects using the basic knowledge (to create new knowledge) 5. using cognitive strategies to develop new ways of interpreting gained knowledge. These steps can be used follow the sequence from 'known' through 'need to know' to using knowledge to gain better geologic explanation, taken as enquiry-based or problem solving modes of education. These follow ideas from Dewey though Sternberg's 'thinking styles' and Siemens' connectivist approaches. Implementation of this basic schema needs to be structured for students in a complex geological world in line with Edelson's (2006) 'learning for' framework. In a geomorphological setting, this has been done by showing students how to interpret a landscape (landform, section etc) practice their skills and thus gain confidence with a tutor at hand. A web-based device, 'Virtorial' provides scenarios for students to practice interpretation (or even be assessed with). A cognitive tool is provided for landscape interpretation by division into the recognition of 'Materials' (rock, sediments etc), Processes (slope, glacial processes etc) and 'Geometry' (what it looks like). These components provide basic metadata for any landform in a landscape. Thus, the recognition of a landform means much more than a feature; the metadata provide contexts that can be used for interpretation in the field or laboratory, individually or in discussion groups, distance or field learning environments.
NeuroNames: an ontology for the BrainInfo portal to neuroscience on the web.
Bowden, Douglas M; Song, Evan; Kosheleva, Julia; Dubach, Mark F
2012-01-01
BrainInfo ( http://braininfo.org ) is a growing portal to neuroscientific information on the Web. It is indexed by NeuroNames, an ontology designed to compensate for ambiguities in neuroanatomical nomenclature. The 20-year old ontology continues to evolve toward the ideal of recognizing all names of neuroanatomical entities and accommodating all structural concepts about which neuroscientists communicate, including multiple concepts of entities for which neuroanatomists have yet to determine the best or 'true' conceptualization. To make the definitions of structural concepts unambiguous and terminologically consistent we created a 'default vocabulary' of unique structure names selected from existing terminology. We selected standard names by criteria designed to maximize practicality for use in verbal communication as well as computerized knowledge management. The ontology of NeuroNames accommodates synonyms and homonyms of the standard terms in many languages. It defines complex structures as models composed of primary structures, which are defined in unambiguous operational terms. NeuroNames currently relates more than 16,000 names in eight languages to some 2,500 neuroanatomical concepts. The ontology is maintained in a relational database with three core tables: Names, Concepts and Models. BrainInfo uses NeuroNames to index information by structure, to interpret users' queries and to clarify terminology on remote web pages. NeuroNames is a resource vocabulary of the NLM's Unified Medical Language System (UMLS, 2011) and the basis for the brain regions component of NIFSTD (NeuroLex, 2011). The current version has been downloaded to hundreds of laboratories for indexing data and linking to BrainInfo, which attracts some 400 visitors/day, downloading 2,000 pages/day.
Artificial Neural Networks for Processing Graphs with Application to Image Understanding: A Survey
NASA Astrophysics Data System (ADS)
Bianchini, Monica; Scarselli, Franco
In graphical pattern recognition, each data is represented as an arrangement of elements, that encodes both the properties of each element and the relations among them. Hence, patterns are modelled as labelled graphs where, in general, labels can be attached to both nodes and edges. Artificial neural networks able to process graphs are a powerful tool for addressing a great variety of real-world problems, where the information is naturally organized in entities and relationships among entities and, in fact, they have been widely used in computer vision, f.i. in logo recognition, in similarity retrieval, and for object detection. In this chapter, we propose a survey of neural network models able to process structured information, with a particular focus on those architectures tailored to address image understanding applications. Starting from the original recursive model (RNNs), we subsequently present different ways to represent images - by trees, forests of trees, multiresolution trees, directed acyclic graphs with labelled edges, general graphs - and, correspondingly, neural network architectures appropriate to process such structures.
DISEASES: text mining and data integration of disease-gene associations.
Pletscher-Frankild, Sune; Pallejà, Albert; Tsafou, Kalliopi; Binder, Janos X; Jensen, Lars Juhl
2015-03-01
Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource, which integrates the results from text mining with manually curated disease-gene associations, cancer mutation data, and genome-wide association studies from existing databases. The DISEASES resource is accessible through a web interface at http://diseases.jensenlab.org/, where the text-mining software and all associations are also freely available for download. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
Sun, Wencheng; Li, Yangyang; Liu, Fang; Fang, Shengqun; Wang, Guoyan
2018-01-01
Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work. PMID:29849998
Proper name retrieval in temporal lobe epilepsy: naming of famous faces and landmarks.
Benke, Thomas; Kuen, Eva; Schwarz, Michael; Walser, Gerald
2013-05-01
The objective of this study was to further explore proper name (PN) retrieval and conceptual knowledge in patients with left and right temporal lobe epilepsy (69 patients with LTLE and 62 patients with RTLE) using a refined assessment procedure. Based on the performance of a large group of age- and education-matched normals, a new test of famous faces and famous landmarks was designed. Recognition, naming, and semantic knowledge were assessed consecutively, allowing for a better characterization of deficient levels in the naming system. Impairment in PN retrieval was common in the cohort with TLE. Furthermore, side of seizure onset impaired stages of name retrieval differently: LTLE impaired the lexico-phonological processing, whereas RTLE mainly impaired the perceptual-semantic stage of object recognition. In addition to deficient PN retrieval, patients with TLE had reduced conceptual knowledge regarding famous persons and landmarks. Copyright © 2013 Elsevier Inc. All rights reserved.
False recall and recognition of brand names increases over time.
Sherman, Susan M
2013-01-01
Using the Deese-Roediger-McDermott (DRM) paradigm, participants are presented with lists of associated words (e.g., bed, awake, night). Subsequently, they reliably have false memories for related but nonpresented words (e.g., SLEEP). Previous research has found that false memories can be created for brand names (e.g., Morrisons, Sainsbury's, Waitrose, and TESCO). The present study investigates the effect of a week's delay on false memories for brand names. Participants were presented with lists of brand names followed by a distractor task. In two between-subjects experiments, participants completed a free recall task or a recognition task either immediately or a week later. In two within-subjects experiments, participants completed a free recall task or a recognition task both immediately and a week later. Correct recall for presented list items decreased over time, whereas false recall for nonpresented lure items increased. For recognition, raw scores revealed an increase in false memory across time reflected in an increase in Remember responses. Analysis of Pr scores revealed that false memory for lures stayed constant over a week, but with an increase in Remember responses in the between-subjects experiment and a trend in the same direction in the within-subjects experiment. Implications for theories of false memory are discussed.
Exploring the Repeated Name Penalty and the Overt Pronoun Penalty in Spanish
ERIC Educational Resources Information Center
Gelormini-Lezama, Carlos
2018-01-01
Anaphoric expressions such as repeated names, overt pronouns, and null pronouns serve a major role in the creation and maintenance of discourse coherence. The felicitous use of an anaphoric expression is highly dependent on the discourse salience of the entity introduced by the antecedent. Gordon et al. ("Cogn Sci" 17:311-347, 1993)…
Development of the use of conversational cues to assess reality status
Woolley, Jacqueline D.; Ma, Lili; Lopez-Mobilia, Gabriel
2011-01-01
In this study we assessed children’s ability to use information overheard in other people’s conversations to judge the reality status of a novel entity. Three- to 9-year-old children (N = 101) watched video clips in which two adults conversed casually about a novel being. Videos contained statements that either explicitly denied, explicitly affirmed, or implicitly acknowledged the entity’s existence. Results indicated that children of all ages used statements of denial to discount the reality status of the novel entity, but that this ability improved with age. By age 5, children used implicit existence cues to judge a novel entity as being real. Not until age 9, however, did children begin to doubt the existence of entities whose reality status was explicitly affirmed in conversation. Overall, results indicate that the ability to use conversational cues to determine reality status is present in some children as early as age 3, but recognition of the nuanced language of belief continues to develop during the elementary-school years. PMID:22241965
Idiopathic Noncirrhotic Portal Hypertension: An Appraisal
Lee, Hwajeong; Rehman, Aseeb Ur; Fiel, M. Isabel
2016-01-01
Idiopathic noncirrhotic portal hypertension is a poorly defined clinical condition of unknown etiology. Patients present with signs and symptoms of portal hypertension without evidence of cirrhosis. The disease course appears to be indolent and benign with an overall better outcome than cirrhosis, as long as the complications of portal hypertension are properly managed. This condition has been recognized in different parts of the world in diverse ethnic groups with variable risk factors, resulting in numerous terminologies and lack of standardized diagnostic criteria. Therefore, although the diagnosis of idiopathic noncirrhotic portal hypertension requires clinical exclusion of other conditions that can cause portal hypertension and histopathologic confirmation, this entity is under-recognized clinically as well as pathologically. Recent studies have demonstrated that variable histopathologic entities with different terms likely represent a histologic spectrum of a single entity of which obliterative portal venopathy might be an underlying pathogenesis. This perception calls for standardization of the nomenclature and formulation of widely accepted diagnostic criteria, which will facilitate easier recognition of this disorder and will highlight awareness of this entity. PMID:26563701
2016-01-01
The halogen bond occurs when there is evidence of a net attractive interaction between an electrophilic region associated with a halogen atom in a molecular entity and a nucleophilic region in another, or the same, molecular entity. In this fairly extensive review, after a brief history of the interaction, we will provide the reader with a snapshot of where the research on the halogen bond is now, and, perhaps, where it is going. The specific advantages brought up by a design based on the use of the halogen bond will be demonstrated in quite different fields spanning from material sciences to biomolecular recognition and drug design. PMID:26812185
Mispronunciation Detection for Language Learning and Speech Recognition Adaptation
ERIC Educational Resources Information Center
Ge, Zhenhao
2013-01-01
The areas of "mispronunciation detection" (or "accent detection" more specifically) within the speech recognition community are receiving increased attention now. Two application areas, namely language learning and speech recognition adaptation, are largely driving this research interest and are the focal points of this work.…
PubMedPortable: A Framework for Supporting the Development of Text Mining Applications.
Döring, Kersten; Grüning, Björn A; Telukunta, Kiran K; Thomas, Philippe; Günther, Stefan
2016-01-01
Information extraction from biomedical literature is continuously growing in scope and importance. Many tools exist that perform named entity recognition, e.g. of proteins, chemical compounds, and diseases. Furthermore, several approaches deal with the extraction of relations between identified entities. The BioCreative community supports these developments with yearly open challenges, which led to a standardised XML text annotation format called BioC. PubMed provides access to the largest open biomedical literature repository, but there is no unified way of connecting its data to natural language processing tools. Therefore, an appropriate data environment is needed as a basis to combine different software solutions and to develop customised text mining applications. PubMedPortable builds a relational database and a full text index on PubMed citations. It can be applied either to the complete PubMed data set or an arbitrary subset of downloaded PubMed XML files. The software provides the infrastructure to combine stand-alone applications by exporting different data formats, e.g. BioC. The presented workflows show how to use PubMedPortable to retrieve, store, and analyse a disease-specific data set. The provided use cases are well documented in the PubMedPortable wiki. The open-source software library is small, easy to use, and scalable to the user's system requirements. It is freely available for Linux on the web at https://github.com/KerstenDoering/PubMedPortable and for other operating systems as a virtual container. The approach was tested extensively and applied successfully in several projects.
PubMedPortable: A Framework for Supporting the Development of Text Mining Applications
Döring, Kersten; Grüning, Björn A.; Telukunta, Kiran K.; Thomas, Philippe; Günther, Stefan
2016-01-01
Information extraction from biomedical literature is continuously growing in scope and importance. Many tools exist that perform named entity recognition, e.g. of proteins, chemical compounds, and diseases. Furthermore, several approaches deal with the extraction of relations between identified entities. The BioCreative community supports these developments with yearly open challenges, which led to a standardised XML text annotation format called BioC. PubMed provides access to the largest open biomedical literature repository, but there is no unified way of connecting its data to natural language processing tools. Therefore, an appropriate data environment is needed as a basis to combine different software solutions and to develop customised text mining applications. PubMedPortable builds a relational database and a full text index on PubMed citations. It can be applied either to the complete PubMed data set or an arbitrary subset of downloaded PubMed XML files. The software provides the infrastructure to combine stand-alone applications by exporting different data formats, e.g. BioC. The presented workflows show how to use PubMedPortable to retrieve, store, and analyse a disease-specific data set. The provided use cases are well documented in the PubMedPortable wiki. The open-source software library is small, easy to use, and scalable to the user’s system requirements. It is freely available for Linux on the web at https://github.com/KerstenDoering/PubMedPortable and for other operating systems as a virtual container. The approach was tested extensively and applied successfully in several projects. PMID:27706202
Proceedings of Conference on Variable-Resolution Modeling, Washington, DC, 5-6 May 1992
1992-05-01
of powerful new computer architectures for supporting object-oriented computing. Objects, as self -contained data-code packages with orderly...another entity structure. For example, (copy-entstr e:sys- tcm ’ new -system) creates an entity structure named c:new-system that has the same structure...324 Parry, S-H. (1984): A Self -contained Hierarchical Model Construct. In: Systems Analysis and Modeling in Defense (R.K. Huber, Ed.), New York
Amplifying Electrochemical Indicators
NASA Technical Reports Server (NTRS)
Fan, Wenhong; Li, Jun; Han, Jie
2004-01-01
Dendrimeric reporter compounds have been invented for use in sensing and amplifying electrochemical signals from molecular recognition events that involve many chemical and biological entities. These reporter compounds can be formulated to target specific molecules or molecular recognition events. They can also be formulated to be, variously, hydrophilic or amphiphilic so that they are suitable for use at interfaces between (1) aqueous solutions and (2) electrodes connected to external signal-processing electronic circuits. The invention of these reporter compounds is expected to enable the development of highly miniaturized, low-power-consumption, relatively inexpensive, mass-producible sensor units for diverse applications.
Verifying visual properties in sentence verification facilitates picture recognition memory.
Pecher, Diane; Zanolie, Kiki; Zeelenberg, René
2007-01-01
According to the perceptual symbols theory (Barsalou, 1999), sensorimotor simulations underlie the representation of concepts. We investigated whether recognition memory for pictures of concepts was facilitated by earlier representation of visual properties of those concepts. During study, concept names (e.g., apple) were presented in a property verification task with a visual property (e.g., shiny) or with a nonvisual property (e.g., tart). Delayed picture recognition memory was better if the concept name had been presented with a visual property than if it had been presented with a nonvisual property. These results indicate that modality-specific simulations are used for concept representation.
Chiou, Rocco; Sowman, Paul F; Etchell, Andrew C; Rich, Anina N
2014-05-01
Object recognition benefits greatly from our knowledge of typical color (e.g., a lemon is usually yellow). Most research on object color knowledge focuses on whether both knowledge and perception of object color recruit the well-established neural substrates of color vision (the V4 complex). Compared with the intensive investigation of the V4 complex, we know little about where and how neural mechanisms beyond V4 contribute to color knowledge. The anterior temporal lobe (ATL) is thought to act as a "hub" that supports semantic memory by integrating different modality-specific contents into a meaningful entity at a supramodal conceptual level, making it a good candidate zone for mediating the mappings between object attributes. Here, we explore whether the ATL is critical for integrating typical color with other object attributes (object shape and name), akin to its role in combining nonperceptual semantic representations. In separate experimental sessions, we applied TMS to disrupt neural processing in the left ATL and a control site (the occipital pole). Participants performed an object naming task that probes color knowledge and elicits a reliable color congruency effect as well as a control quantity naming task that also elicits a cognitive congruency effect but involves no conceptual integration. Critically, ATL stimulation eliminated the otherwise robust color congruency effect but had no impact on the numerical congruency effect, indicating a selective disruption of object color knowledge. Neither color nor numerical congruency effects were affected by stimulation at the control occipital site, ruling out nonspecific effects of cortical stimulation. Our findings suggest that the ATL is involved in the representation of object concepts that include their canonical colors.
Influences on Facial Emotion Recognition in Deaf Children
ERIC Educational Resources Information Center
Sidera, Francesc; Amadó, Anna; Martínez, Laura
2017-01-01
This exploratory research is aimed at studying facial emotion recognition abilities in deaf children and how they relate to linguistic skills and the characteristics of deafness. A total of 166 participants (75 deaf) aged 3-8 years were administered the following tasks: facial emotion recognition, naming vocabulary and cognitive ability. The…
2018-05-18
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
Code of Federal Regulations, 2012 CFR
2012-01-01
...) means an individual, private-sector entity, or public agency certified by NRCS to provide technical...,” “land conservation committee,” “natural resource district,” or similar name. Conservation Innovation...
Code of Federal Regulations, 2013 CFR
2013-01-01
...) means an individual, private-sector entity, or public agency certified by NRCS to provide technical...,” “land conservation committee,” “natural resource district,” or similar name. Conservation Innovation...
Code of Federal Regulations, 2014 CFR
2014-01-01
...) means an individual, private-sector entity, or public agency certified by NRCS to provide technical...,” “land conservation committee,” “natural resource district,” or similar name. Conservation Innovation...
Reading component skills in dyslexia: word recognition, comprehension and processing speed.
de Oliveira, Darlene G; da Silva, Patrícia B; Dias, Natália M; Seabra, Alessandra G; Macedo, Elizeu C
2014-01-01
The cognitive model of reading comprehension (RC) posits that RC is a result of the interaction between decoding and linguistic comprehension. Recently, the notion of decoding skill was expanded to include word recognition. In addition, some studies suggest that other skills could be integrated into this model, like processing speed, and have consistently indicated that this skill influences and is an important predictor of the main components of the model, such as vocabulary for comprehension and phonological awareness of word recognition. The following study evaluated the components of the RC model and predictive skills in children and adolescents with dyslexia. 40 children and adolescents (8-13 years) were divided in a Dyslexic Group (DG; 18 children, MA = 10.78, SD = 1.66) and control group (CG 22 children, MA = 10.59, SD = 1.86). All were students from the 2nd to 8th grade of elementary school and groups were equivalent in school grade, age, gender, and IQ. Oral and RC, word recognition, processing speed, picture naming, receptive vocabulary, and phonological awareness were assessed. There were no group differences regarding the accuracy in oral and RC, phonological awareness, naming, and vocabulary scores. DG performed worse than the CG in word recognition (general score and orthographic confusion items) and were slower in naming. Results corroborated the literature regarding word recognition and processing speed deficits in dyslexia. However, dyslexics can achieve normal scores on RC test. Data supports the importance of delimitation of different reading strategies embedded in the word recognition component. The role of processing speed in reading problems remain unclear.
Estes, Zachary; Adelman, James S
2008-08-01
An automatic vigilance hypothesis states that humans preferentially attend to negative stimuli, and this attention to negative valence disrupts the processing of other stimulus properties. Thus, negative words typically elicit slower color naming, word naming, and lexical decisions than neutral or positive words. Larsen, Mercer, and Balota analyzed the stimuli from 32 published studies, and they found that word valence was confounded with several lexical factors known to affect word recognition. Indeed, with these lexical factors covaried out, Larsen et al. found no evidence of automatic vigilance. The authors report a more sensitive analysis of 1011 words. Results revealed a small but reliable valence effect, such that negative words (e.g., "shark") elicit slower lexical decisions and naming than positive words (e.g., "beach"). Moreover, the relation between valence and recognition was categorical rather than linear; the extremity of a word's valence did not affect its recognition. This valence effect was not attributable to word length, frequency, orthographic neighborhood size, contextual diversity, first phoneme, or arousal. Thus, the present analysis provides the most powerful demonstration of automatic vigilance to date.
How Does Using Object Names Influence Visual Recognition Memory?
ERIC Educational Resources Information Center
Richler, Jennifer J.; Palmeri, Thomas J.; Gauthier, Isabel
2013-01-01
Two recent lines of research suggest that explicitly naming objects at study influences subsequent memory for those objects at test. Lupyan (2008) suggested that naming "impairs" memory by a representational shift of stored representations of named objects toward the prototype (labeling effect). MacLeod, Gopie, Hourihan, Neary, and Ozubko (2010)…
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
A database of natural products and chemical entities from marine habitat
Babu, Padavala Ajay; Puppala, Suma Sree; Aswini, Satyavarapu Lakshmi; Vani, Metta Ramya; Kumar, Chinta Narasimha; Prasanna, Tallapragada
2008-01-01
Marine compound database consists of marine natural products and chemical entities, collected from various literature sources, which are known to possess bioactivity against human diseases. The database is constructed using html code. The 12 categories of 182 compounds are provided with the source, compound name, 2-dimensional structure, bioactivity and clinical trial information. The database is freely available online and can be accessed at http://www.progenebio.in/mcdb/index.htm PMID:19238254
Name recognition in autism: EEG evidence of altered patterns of brain activity and connectivity.
Nowicka, Anna; Cygan, Hanna B; Tacikowski, Paweł; Ostaszewski, Paweł; Kuś, Rafał
2016-01-01
Impaired orienting to social stimuli is one of the core early symptoms of autism spectrum disorder (ASD). However, in contrast to faces, name processing has rarely been studied in individuals with ASD. Here, we investigated brain activity and functional connectivity associated with recognition of names in the high-functioning ASD group and in the control group. EEG was recorded in 15 young males with ASD and 15 matched one-to-one control individuals. EEG data were analyzed with the event-related potential (ERP), event-related desynchronization and event-related synchronization (ERD/S), as well as coherence and direct transfer function (DTF) methods. Four categories of names were presented visually: one's own, close-other's, famous, and unknown. Differences between the ASD and control groups were found for ERP, coherence, and DTF. In individuals with ASD, P300 (a positive ERP component) to own-name and to a close-other's name were similar whereas in control participants, P300 to own-name was enhanced when compared to all other names. Analysis of coherence and DTF revealed disruption of fronto-posterior task-related connectivity in individuals with ASD within the beta range frequencies. Moreover, DTF indicated the directionality of those impaired connections-they were going from parieto-occipital to frontal regions. DTF also showed inter-group differences in short-range connectivity: weaker connections within the frontal region and stronger connections within the occipital region in the ASD group in comparison to the control group. Our findings suggest a lack of the self-preference effect and impaired functioning of the attentional network during recognition of visually presented names in individuals with ASD.
Mangels, Jennifer A; Manzi, Alberto; Summerfield, Christopher
2010-03-01
In social interactions, it is often necessary to rapidly encode the association between visually presented faces and auditorily presented names. The present study used event-related potentials to examine the neural correlates of associative encoding for multimodal face-name pairs. We assessed study-phase processes leading to high-confidence recognition of correct pairs (and consistent rejection of recombined foils) as compared to lower-confidence recognition of correct pairs (with inconsistent rejection of recombined foils) and recognition failures (misses). Both high- and low-confidence retrieval of face-name pairs were associated with study-phase activity suggestive of item-specific processing of the face (posterior inferior temporal negativity) and name (fronto-central negativity). However, only those pairs later retrieved with high confidence recruited a sustained centro-parietal positivity that an ancillary localizer task suggested may index an association-unique process. Additionally, we examined how these processes were influenced by massed repetition, a mnemonic strategy commonly employed in everyday situations to improve face-name memory. Differences in subsequent memory effects across repetitions suggested that associative encoding was strongest at the initial presentation, and thus, that the initial presentation has the greatest impact on memory formation. Yet, exploratory analyses suggested that the third presentation may have benefited later memory by providing an opportunity for extended processing of the name. Thus, although encoding of the initial presentation was critical for establishing a strong association, the extent to which processing was sustained across subsequent immediate (massed) presentations may provide additional encoding support that serves to differentiate face-name pairs from similar (recombined) pairs by providing additional encoding opportunities for the less dominant stimulus dimension (i.e., name).
Recognition is Used as One Cue Among Others in Judgment and Decision Making
ERIC Educational Resources Information Center
Richter, Tobias; Spath, Pamela
2006-01-01
Three experiments with paired comparisons were conducted to test the noncompensatory character of the recognition heuristic (D. G. Goldstein & G. Gigerenzer, 2002) in judgment and decision making. Recognition and knowledge about the recognized alternative were manipulated. In Experiment 1, participants were presented pairs of animal names where…
Familiarity or Conceptual Priming: Event-Related Potentials in Name Recognition
ERIC Educational Resources Information Center
Stenberg, Georg; Hellman, Johan; Johansson, Mikael; Rosen, Ingmar
2009-01-01
Recent interest has been drawn to the separate components of recognition memory, as studied by event-related potentials (ERPs). In ERPs, recollection is usually accompanied by a late, parietal positive deflection. An earlier, frontal component has been suggested to be a counterpart, accompanying recognition by familiarity. However, this component,…
Semantic and visual determinants of face recognition in a prosopagnosic patient.
Dixon, M J; Bub, D N; Arguin, M
1998-05-01
Prosopagnosia is the neuropathological inability to recognize familiar people by their faces. It can occur in isolation or can coincide with recognition deficits for other nonface objects. Often, patients whose prosopagnosia is accompanied by object recognition difficulties have more trouble identifying certain categories of objects relative to others. In previous research, we demonstrated that objects that shared multiple visual features and were semantically close posed severe recognition difficulties for a patient with temporal lobe damage. We now demonstrate that this patient's face recognition is constrained by these same parameters. The prosopagnosic patient ELM had difficulties pairing faces to names when the faces shared visual features and the names were semantically related (e.g., Tonya Harding, Nancy Kerrigan, and Josee Chouinard -three ice skaters). He made tenfold fewer errors when the exact same faces were associated with semantically unrelated people (e.g., singer Celine Dion, actress Betty Grable, and First Lady Hillary Clinton). We conclude that prosopagnosia and co-occurring category-specific recognition problems both stem from difficulties disambiguating the stored representations of objects that share multiple visual features and refer to semantically close identities or concepts.
Peigneux, P; Salmon, E; van der Linden, M; Garraux, G; Aerts, J; Delfiore, G; Degueldre, C; Luxen, A; Orban, G; Franck, G
2000-06-01
Humans, like numerous other species, strongly rely on the observation of gestures of other individuals in their everyday life. It is hypothesized that the visual processing of human gestures is sustained by a specific functional architecture, even at an early prelexical cognitive stage, different from that required for the processing of other visual entities. In the present PET study, the neural basis of visual gesture analysis was investigated with functional neuroimaging of brain activity during naming and orientation tasks performed on pictures of either static gestures (upper-limb postures) or tridimensional objects. To prevent automatic object-related cerebral activation during the visual processing of postures, only intransitive postures were selected, i. e., symbolic or meaningless postures which do not imply the handling of objects. Conversely, only intransitive objects which cannot be handled were selected to prevent gesture-related activation during their visual processing. Results clearly demonstrate a significant functional segregation between the processing of static intransitive postures and the processing of intransitive tridimensional objects. Visual processing of objects elicited mainly occipital and fusiform gyrus activity, while visual processing of postures strongly activated the lateral occipitotemporal junction, encroaching upon area MT/V5, involved in motion analysis. These findings suggest that the lateral occipitotemporal junction, working in association with area MT/V5, plays a prominent role in the high-level perceptual analysis of gesture, namely the construction of its visual representation, available for subsequent recognition or imitation. Copyright 2000 Academic Press.
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
Does the generation effect occur for pictures?
Kinjo, H; Snodgrass, J G
2000-01-01
The generation effect is the finding that self-generated stimuli are recalled and recognized better than read stimuli. The effect has been demonstrated primarily with words. This article examines the effect for pictures in two experiments: Subjects named complete pictures (name condition) and fragmented pictures (generation condition). In Experiment 1, memory was tested in 3 explicit tasks: free recall, yes/no recognition, and a source-monitoring task on whether each picture was complete or fragmented (the complete/incomplete task). The generation effect was found for all 3 tasks. However, in the recognition and source-monitoring tasks, the generation effect was observed only in the generation condition. We hypothesized that absence of the effect in the name condition was due to the sensory or process match effect between study and test pictures and the superior identification of pictures in the name condition. Therefore, stimuli were changed from pictures to their names in Experiment 2. Memory was tested in the recognition task, complete/incomplete task, and second source-monitoring task (success/failure) on whether each picture had been identified successfully. The generation effect was observed for all 3 tasks. These results suggest that memory of structural and semantic characteristics and of success in identification of generated pictures may contribute to the generation effect.
IGMS Construction Grants Overview
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
A Capital case for common names of species of fishes--a white crappie or a White Crappie
Joseph S. Nelson; Wayne C. Stames; Melvin L. Warren
2002-01-01
Common names of fishes are an important and often primary means of fish biologists communicating with each other and with the public. Although common names will never replace scientific names, they are indispensable in many areas such as fisheries science, management, administration, and education. In recognition of the important role common names play in communicating...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-02-15
..., address, and taxpayer identifying number (TIN) of each account holder who is a specified U.S. person (or, in the case of an account holder that is a U.S. owned foreign entity, the name, address, and TIN of... that such beneficial owner does not have any substantial U.S. owners, or the name, address, and TIN of...
ERIC Educational Resources Information Center
Documentation Research and Training Centre, Bangalore (India).
The four sections of the report cover the topics of cataloging, subject analysis, documentation systems for industry and the Documentation Research and Training Centre (DRTC) research report for 1970. The cataloging section covers the conflicts of cataloging, recall, corporate bodies, titles, publishers series and the entity name. The subject…
Famous face recognition and naming test: a normative study.
Rizzo, S; Venneri, A; Papagno, C
2002-10-01
Tests of famous face recognition and naming, and tasks assessing semantic knowledge about famous people after presentation either of their faces or their names are often used in the neuropsychological examination of aphasic, amnesic and demented patients. A total of 187 normal subjects took part in this study. The aim was to collect normative data for a newly devised test including five subtests: famous face naming, fame judgement after face presentation and after name presentation, semantic knowledge about famous people after face presentation and after name presentation. Norms were calculated taking into account demographic variables such as age, sex and education and adjusted scores were used to determine inferential cut-off scores and to compute equivalent scores. Multiple regression analyses showed that age and education influenced significantly the performance on most subtests, but sex had no effect on any of them. Scores of the subtest evaluating fame judgements after name presentation were significantly influenced only by education. The only subtest whose scores were not influenced by any demographic variable was fame judgement after face presentation.
46 CFR 515.34 - Regulated Persons Index.
Code of Federal Regulations, 2010 CFR
2010-10-01
... Commission § 515.34 Regulated Persons Index. The Regulated Persons Index is a database containing the names...-regulated entities. The database may be purchased for $108 by contacting the Bureau of Certification and...
46 CFR 515.34 - Regulated Persons Index.
Code of Federal Regulations, 2013 CFR
2013-10-01
... Commission § 515.34 Regulated Persons Index. The Regulated Persons Index is a database containing the names...-regulated entities. The database may be purchased for $108 by contacting the Bureau of Certification and...
46 CFR 515.34 - Regulated Persons Index.
Code of Federal Regulations, 2014 CFR
2014-10-01
... Commission § 515.34 Regulated Persons Index. The Regulated Persons Index is a database containing the names...-regulated entities. The database may be purchased for $108 by contacting the Bureau of Certification and...
46 CFR 515.34 - Regulated Persons Index.
Code of Federal Regulations, 2012 CFR
2012-10-01
... Commission § 515.34 Regulated Persons Index. The Regulated Persons Index is a database containing the names...-regulated entities. The database may be purchased for $108 by contacting the Bureau of Certification and...
46 CFR 515.34 - Regulated Persons Index.
Code of Federal Regulations, 2011 CFR
2011-10-01
... Commission § 515.34 Regulated Persons Index. The Regulated Persons Index is a database containing the names...-regulated entities. The database may be purchased for $108 by contacting the Bureau of Certification and...
Search | IGMS | Envirofacts | US EPA
2016-02-23
The Integrated Grants Management System (IGMS) is a web-based system that contains information on the recipient of the grant, fellowship, cooperative agreement and interagency agreement, including the name of the entity accepting the award.
48 CFR 919.7005 - Eligibility to be a Mentor.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 48 Federal Acquisition Regulations System 5 2010-10-01 2010-10-01 false Eligibility to be a Mentor... PROGRAMS SMALL BUSINESS PROGRAMS The Department of Energy Mentor-Protege Program 919.7005 Eligibility to be a Mentor. To be eligible for recognition by DOE as a Mentor, an entity must be performing at least...
Pathologic childhood aerophagia: a recognizable clinical entity.
Gauderer, M W; Halpin, T C; Izant, R J
1981-06-01
Pathologic childhood aerophagia is a rarely recognized, often poorly treated entity that has remained almost undescribed in either the surgical or pediatric literature. In only 1 of 9 children the condition was recognized at presentation. The initial diagnosis of the others was Hirschsprung's disease (2), malabsorption syndrome (3), gastric outlet syndrome (1), constipation (1), and esophagitis (1). Five were hospitalized and two underwent surgical procedures. History disclosed a remarkably constant triad: previous normal stooling pattern, visible and often audible air swallowing and excessive flatus. Physical examination often demonstrated a markedly or intermittently distended and tympanitic abdomen. Abdominal musculature was thinned in children with chronic aerophagia. Roentgenographic evaluation showed massively distended loops of intestine throughout without associated air-fluid levels. There was marked compression of the diaphragm with limited excursion in some. Laboratory and malabsorption testing was normal. Treatment is limited to recognition of the problem, nasogastric decompression in severe cases and psychologic counseling when symptoms persist in the older child. The recognition of this condition may lead to a better understanding of its pathophysiology and will reduce the number of unnecessary admissions or surgical procedures.
Searching for the elusive neural substrates of body part terms: a neuropsychological study.
Kemmerer, David; Tranel, Daniel
2008-06-01
Previous neuropsychological studies suggest that, compared to other categories of concrete entities, lexical and conceptual aspects of body part knowledge are frequently spared in brain-damaged patients. To further investigate this issue, we administered a battery of 12 tests assessing lexical and conceptual aspects of body part knowledge to 104 brain-damaged patients with lesions distributed throughout the telencephalon. There were two main outcomes. First, impaired oral naming of body parts, attributable to a disturbance of the mapping between lexical-semantic and lexical-phonological structures, was most reliably and specifically associated with lesions in the left frontal opercular and anterior/inferior parietal opercular cortices and in the white matter underlying these regions (8 patients). Also, 1 patient with body part anomia had a left occipital lesion that included the "extrastriate body area" (EBA). Second, knowledge of the meanings of body part terms was remarkably resistant to impairment, regardless of lesion site; in fact, we did not uncover a single patient who exhibited significantly impaired understanding of the meanings of these terms. In the 9 patients with body part anomia, oral naming of concrete entities was evaluated, and this revealed that 4 patients had disproportionately worse naming of body parts relative to other types of concrete entities. Taken together, these findings extend previous neuropsychological and functional neuroimaging studies of body part knowledge and add to our growing understanding of the nuances of how different linguistic and conceptual categories are operated by left frontal and parietal structures.
Searching for the Elusive Neural Substrates of Body Part Terms: A Neuropsychological Study
Kemmerer, David; Tranel, Daniel
2010-01-01
Previous neuropsychological studies suggest that, compared to other categories of concrete entities, lexical and conceptual aspects of body part knowledge are frequently spared in brain-damaged patients. To further investigate this issue, we administered a battery of 12 tests assessing lexical and conceptual aspects of body part knowledge to 104 brain-damaged patients with lesions distributed throughout the telencephalon. There were two main outcomes. First, impaired oral naming of body parts, attributable to a disturbance of the mapping between lexical-semantic and lexical-phonological structures, was most reliably and specifically associated with lesions in the left frontal opercular and anterior/inferior parietal opercular cortices, and in the white matter underlying these regions (8 patients). Also, one patient with body part anomia had a left occipital lesion that included the “extrastriate body area” (EBA). Second, knowledge of the meanings of body part terms was remarkably resistant to impairment, regardless of lesion site; in fact, we did not uncover a single patient who exhibited significantly impaired understanding of the meanings of these terms. In the 9 patients with body part anomia, oral naming of concrete entities was evaluated, and this revealed that 4 patients had disproportionately worse naming of body parts relative to other types of concrete entities. Taken together, these findings extend previous neuropsychological and functional neuroimaging studies of body part knowledge, and add to our growing understanding of the nuances of how different linguistic and conceptual categories are operated by left frontal and parietal structures. PMID:18608319
Transfer between Pose and Illumination Training in Face Recognition
ERIC Educational Resources Information Center
Liu, Chang Hong; Bhuiyan, Md. Al-Amin; Ward, James; Sui, Jie
2009-01-01
The relationship between pose and illumination learning in face recognition was examined in a yes-no recognition paradigm. The authors assessed whether pose training can transfer to a new illumination or vice versa. Results show that an extensive level of pose training through a face-name association task was able to generalize to a new…
Learning and Forgetting New Names and Objects in MCI and AD
ERIC Educational Resources Information Center
Gronholm-Nyman, Petra; Rinne, Juha O.; Laine, Matti
2010-01-01
We studied how subjects with mild cognitive impairment (MCI), early Alzheimer's disease (AD) and age-matched controls learned and maintained the names of unfamiliar objects that were trained with or without semantic support (object definitions). Naming performance, phonological cueing, incidental learning of the definitions and recognition of the…
Brand name changes help health care providers win market recognition.
Keesling, G
1993-01-01
As the healthcare industry continues to recognize the strategic implications of branding, more providers will undertake an identity change to better position themselves in competitive markets. The paper examines specific healthcare branding decisions, the reasons prompting brand name decisions and the marketing implications for a change in brand name.
Influences of spoken word planning on speech recognition.
Roelofs, Ardi; Ozdemir, Rebecca; Levelt, Willem J M
2007-09-01
In 4 chronometric experiments, influences of spoken word planning on speech recognition were examined. Participants were shown pictures while hearing a tone or a spoken word presented shortly after picture onset. When a spoken word was presented, participants indicated whether it contained a prespecified phoneme. When the tone was presented, they indicated whether the picture name contained the phoneme (Experiment 1) or they named the picture (Experiment 2). Phoneme monitoring latencies for the spoken words were shorter when the picture name contained the prespecified phoneme compared with when it did not. Priming of phoneme monitoring was also obtained when the phoneme was part of spoken nonwords (Experiment 3). However, no priming of phoneme monitoring was obtained when the pictures required no response in the experiment, regardless of monitoring latency (Experiment 4). These results provide evidence that an internal phonological pathway runs from spoken word planning to speech recognition and that active phonological encoding is a precondition for engaging the pathway. 2007 APA
It's all connected: Pathways in visual object recognition and early noun learning.
Smith, Linda B
2013-11-01
A developmental pathway may be defined as the route, or chain of events, through which a new structure or function forms. For many human behaviors, including object name learning and visual object recognition, these pathways are often complex and multicausal and include unexpected dependencies. This article presents three principles of development that suggest the value of a developmental psychology that explicitly seeks to trace these pathways and uses empirical evidence on developmental dependencies among motor development, action on objects, visual object recognition, and object name learning in 12- to 24-month-old infants to make the case. The article concludes with a consideration of the theoretical implications of this approach. (PsycINFO Database Record (c) 2013 APA, all rights reserved).
NASA Astrophysics Data System (ADS)
Garfinkle, Noah W.; Selig, Lucas; Perkins, Timothy K.; Calfas, George W.
2017-05-01
Increasing worldwide internet connectivity and access to sources of print and open social media has increased near realtime availability of textual information. Capabilities to structure and integrate textual data streams can contribute to more meaningful representations of operational environment factors (i.e., Political, Military, Economic, Social, Infrastructure, Information, Physical Environment, and Time [PMESII-PT]) and tactical civil considerations (i.e., Areas, Structures, Capabilities, Organizations, People and Events [ASCOPE]). However, relying upon human analysts to encode this information as it arrives quickly proves intractable. While human analysts possess an ability to comprehend context in unstructured text far beyond that of computers, automated geoparsing (the extraction of locations from unstructured text) can empower analysts to automate sifting through datasets for areas of interest. This research evaluates existing approaches to geoprocessing as well as initiating the research and development of locally-improved methods of tagging parts of text as possible locations, resolving possible locations into coordinates, and interfacing such results with human analysts. The objective of this ongoing research is to develop a more contextually-complete picture of an area of interest (AOI) including human-geographic context for events. In particular, our research is working to make improvements to geoparsing (i.e., the extraction of spatial context from documents), which requires development, integration, and validation of named-entity recognition (NER) tools, gazetteers, and entity-attribution. This paper provides an overview of NER models and methodologies as applied to geoparsing, explores several challenges encountered, presents preliminary results from the creation of a flexible geoparsing research pipeline, and introduces ongoing and future work with the intention of contributing to the efficient geocoding of information containing valuable insights into human activities in space.
Neural systems underlying lexical retrieval for sign language.
Emmorey, Karen; Grabowski, Thomas; McCullough, Stephen; Damasio, Hanna; Ponto, Laura L B; Hichwa, Richard D; Bellugi, Ursula
2003-01-01
Positron emission tomography was used to investigate whether signed languages exhibit the same neural organization for lexical retrieval within classical and non-classical language areas as has been described for spoken English. Ten deaf native American sign language (ASL) signers were shown pictures of unique entities (famous persons) and non-unique entities (animals) and were asked to name each stimulus with an overt signed response. Proper name signed responses to famous people were fingerspelled, and common noun responses to animals were both fingerspelled and signed with native ASL signs. In general, retrieving ASL signs activated neural sites similar to those activated by hearing subjects retrieving English words. Naming famous persons activated the left temporal pole (TP), whereas naming animals (whether fingerspelled or signed) activated left inferotemporal (IT) cortex. The retrieval of fingerspelled and native signs generally engaged the same cortical regions, but fingerspelled signs in addition activated a premotor region, perhaps due to the increased motor planning and sequencing demanded by fingerspelling. Native signs activated portions of the left supramarginal gyrus (SMG), an area previously implicated in the retrieval of phonological features of ASL signs. Overall, the findings indicate that similar neuroanatomical areas are involved in lexical retrieval for both signs and words. Copyright 2003 Elsevier Science Ltd.
ERIC Educational Resources Information Center
Kostic, Bogdan; Cleary, Anne M.
2009-01-01
Recognition without identification (RWI) is a common day-to-day experience (as when recognizing a face or a tune as familiar without being able to identify the person or the song). It is also a well-established laboratory-based empirical phenomenon: When identification of recognition test items is prevented, participants can discriminate between…
What's in a Name: The Place of Recognition in a Hospitable Classroom
ERIC Educational Resources Information Center
Stratman, Jacob
2015-01-01
In this brief article, I argue that recognition is the key virtue of a hospitable classroom. Whether we are discussing the relationship between the teacher and the student, the student and other students, the student and the subject of study, or the teacher and the subject of study, recognition is the building block to a classroom that welcomes…
Speech Processing and Recognition (SPaRe)
2011-01-01
results in the areas of automatic speech recognition (ASR), speech processing, machine translation (MT), natural language processing ( NLP ), and...Processing ( NLP ), Information Retrieval (IR) 16. SECURITY CLASSIFICATION OF: UNCLASSIFED 17. LIMITATION OF ABSTRACT 18. NUMBER OF PAGES 19a. NAME...Figure 9, the IOC was only expected to provide document submission and search; automatic speech recognition (ASR) for English, Spanish, Arabic , and
Disambiguating the species of biomedical named entities using natural language parsers
Wang, Xinglong; Tsujii, Jun'ichi; Ananiadou, Sophia
2010-01-01
Motivation: Text mining technologies have been shown to reduce the laborious work involved in organizing the vast amount of information hidden in the literature. One challenge in text mining is linking ambiguous word forms to unambiguous biological concepts. This article reports on a comprehensive study on resolving the ambiguity in mentions of biomedical named entities with respect to model organisms and presents an array of approaches, with focus on methods utilizing natural language parsers. Results: We build a corpus for organism disambiguation where every occurrence of protein/gene entity is manually tagged with a species ID, and evaluate a number of methods on it. Promising results are obtained by training a machine learning model on syntactic parse trees, which is then used to decide whether an entity belongs to the model organism denoted by a neighbouring species-indicating word (e.g. yeast). The parser-based approaches are also compared with a supervised classification method and results indicate that the former are a more favorable choice when domain portability is of concern. The best overall performance is obtained by combining the strengths of syntactic features and supervised classification. Availability: The corpus and demo are available at http://www.nactem.ac.uk/deca_details/start.cgi, and the software is freely available as U-Compare components (Kano et al., 2009): NaCTeM Species Word Detector and NaCTeM Species Disambiguator. U-Compare is available at http://-compare.org/ Contact: xinglong.wang@manchester.ac.uk PMID:20053840
Alesi, Marianna; Rappo, Gaetano; Pepi, Annamaria
2016-01-01
One of the most significant current discussions has led to the hypothesis that domain-specific training programs alone are not enough to improve reading achievement or working memory abilities. Incremental or Entity personal conceptions of intelligence may be assumed to be an important prognostic factor to overcome domain-specific deficits. Specifically, incremental students tend to be more oriented toward change and autonomy and are able to adopt more efficacious strategies. This study aims at examining the effect of personal conceptions of intelligence to strengthen the efficacy of a multidimensional intervention program in order to improve decoding abilities and working memory. Participants included two children (M age = 10 years) with developmental dyslexia and different conceptions of intelligence. The children were tested on a whole battery of reading and spelling tests commonly used in the assessment of reading disabilities in Italy. Afterwards, they were given a multimedia test to measure motivational factors such as conceptions of intelligence and achievement goals. The children took part in the T.I.R.D. Multimedia Training for the Rehabilitation of Dyslexia (Rappo and Pepi, 2010) reinforced by specific units to improve verbal working memory for 3 months. This training consisted of specific tasks to rehabilitate both visual and phonological strategies (sound blending, word segmentation, alliteration test and rhyme test, letter recognition, digraph recognition, trigraph recognition, and word recognition as samples of visual tasks) and verbal working memory (rapid words and non-words recognition). Posttest evaluations showed that the child holding the incremental theory of intelligence improved more than the child holding a static representation. On the whole this study highlights the importance of treatment programs in which both specificity of deficits and motivational factors are both taken into account. There is a need to plan multifaceted intervention programs based on a transverse approach, considering both cognitive and motivational factors. PMID:26779069
Crowded and sparse domains in object recognition: consequences for categorization and naming.
Gale, Tim M; Laws, Keith R; Foley, Kerry
2006-03-01
Some models of object recognition propose that items from structurally crowded categories (e.g., living things) permit faster access to superordinate semantic information than structurally dissimilar categories (e.g., nonliving things), but slower access to individual object information when naming items. We present four experiments that utilize the same matched stimuli: two examine superordinate categorization and two examine picture naming. Experiments 1 and 2 required participants to sort pictures into their appropriate superordinate categories and both revealed faster categorization for living than nonliving things. Nonetheless, the living thing superiority disappeared when the atypical categories of body parts and musical instruments were excluded. Experiment 3 examined naming latency and found no difference between living and nonliving things. This finding was replicated in Experiment 4 where the same items were presented in different formats (e.g., color and line-drawn versions). Taken as a whole, these experiments show that the ease with which people categorize items maps strongly onto the ease with which they name them.
Information Retrieval and Text Mining Technologies for Chemistry.
Krallinger, Martin; Rabal, Obdulia; Lourenço, Anália; Oyarzabal, Julen; Valencia, Alfonso
2017-06-28
Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation together with text mining applications for linking chemistry with biological information are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field.
Natural Language Processing in aid of FlyBase curators
Karamanis, Nikiforos; Seal, Ruth; Lewin, Ian; McQuilton, Peter; Vlachos, Andreas; Gasperin, Caroline; Drysdale, Rachel; Briscoe, Ted
2008-01-01
Background Despite increasing interest in applying Natural Language Processing (NLP) to biomedical text, whether this technology can facilitate tasks such as database curation remains unclear. Results PaperBrowser is the first NLP-powered interface that was developed under a user-centered approach to improve the way in which FlyBase curators navigate an article. In this paper, we first discuss how observing curators at work informed the design and evaluation of PaperBrowser. Then, we present how we appraise PaperBrowser's navigational functionalities in a user-based study using a text highlighting task and evaluation criteria of Human-Computer Interaction. Our results show that PaperBrowser reduces the amount of interactions between two highlighting events and therefore improves navigational efficiency by about 58% compared to the navigational mechanism that was previously available to the curators. Moreover, PaperBrowser is shown to provide curators with enhanced navigational utility by over 74% irrespective of the different ways in which they highlight text in the article. Conclusion We show that state-of-the-art performance in certain NLP tasks such as Named Entity Recognition and Anaphora Resolution can be combined with the navigational functionalities of PaperBrowser to support curation quite successfully. PMID:18410678
Soysal, Ergin; Wang, Jingqi; Jiang, Min; Wu, Yonghui; Pakhomov, Serguei; Liu, Hongfang; Xu, Hua
2017-11-24
Existing general clinical natural language processing (NLP) systems such as MetaMap and Clinical Text Analysis and Knowledge Extraction System have been successfully applied to information extraction from clinical text. However, end users often have to customize existing systems for their individual tasks, which can require substantial NLP skills. Here we present CLAMP (Clinical Language Annotation, Modeling, and Processing), a newly developed clinical NLP toolkit that provides not only state-of-the-art NLP components, but also a user-friendly graphic user interface that can help users quickly build customized NLP pipelines for their individual applications. Our evaluation shows that the CLAMP default pipeline achieved good performance on named entity recognition and concept encoding. We also demonstrate the efficiency of the CLAMP graphic user interface in building customized, high-performance NLP pipelines with 2 use cases, extracting smoking status and lab test values. CLAMP is publicly available for research use, and we believe it is a unique asset for the clinical NLP community. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
2012-01-01
Background We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Results Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. Conclusions The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications. PMID:22901054
nala: text mining natural language mutation mentions
Cejuela, Juan Miguel; Bojchevski, Aleksandar; Uhlig, Carsten; Bekmukhametov, Rustem; Kumar Karn, Sanjeev; Mahmuti, Shpend; Baghudana, Ashish; Dubey, Ankit; Satagopam, Venkata P.; Rost, Burkhard
2017-01-01
Abstract Motivation: The extraction of sequence variants from the literature remains an important task. Existing methods primarily target standard (ST) mutation mentions (e.g. ‘E6V’), leaving relevant mentions natural language (NL) largely untapped (e.g. ‘glutamic acid was substituted by valine at residue 6’). Results: We introduced three new corpora suggesting named-entity recognition (NER) to be more challenging than anticipated: 28–77% of all articles contained mentions only available in NL. Our new method nala captured NL and ST by combining conditional random fields with word embedding features learned unsupervised from the entire PubMed. In our hands, nala substantially outperformed the state-of-the-art. For instance, we compared all unique mentions in new discoveries correctly detected by any of three methods (SETH, tmVar, or nala). Neither SETH nor tmVar discovered anything missed by nala, while nala uniquely tagged 33% mentions. For NL mentions the corresponding value shot up to 100% nala-only. Availability and Implementation: Source code, API and corpora freely available at: http://tagtog.net/-corpora/IDP4+. Contact: nala@rostlab.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28200120
Synthetic approaches to the 2009 new drugs.
Liu, Kevin K-C; Sakya, Subas M; O'Donnell, Christopher J; Flick, Andrew C; Li, Jin
2011-02-01
New drugs are introduced to the market every year and each individual drug represents a privileged structure for its biological target. These new chemical entities (NCEs) provide insights into molecular recognition and also serve as leads for designing future new drugs. This review covers the syntheses of 21 NCEs marketed in 2009. Copyright © 2011 Elsevier Ltd. All rights reserved.
Lemmond, Tracy D; Hanley, William G; Guensche, Joseph Wendell; Perry, Nathan C; Nitao, John J; Kidwell, Paul Brandon; Boakye, Kofi Agyeman; Glaser, Ron E; Prenger, Ryan James
2014-05-13
An information extraction system and methods of operating the system are provided. In particular, an information extraction system for performing meta-extraction of named entities of people, organizations, and locations as well as relationships and events from text documents are described herein.
A user-friendly tool for medical-related patent retrieval.
Pasche, Emilie; Gobeill, Julien; Teodoro, Douglas; Gaudinat, Arnaud; Vishnyakova, Dina; Lovis, Christian; Ruch, Patrick
2012-01-01
Health-related information retrieval is complicated by the variety of nomenclatures available to name entities, since different communities of users will use different ways to name a same entity. We present in this report the development and evaluation of a user-friendly interactive Web application aiming at facilitating health-related patent search. Our tool, called TWINC, relies on a search engine tuned during several patent retrieval competitions, enhanced with intelligent interaction modules, such as chemical query, normalization and expansion. While the functionality of related article search showed promising performances, the ad hoc search results in fairly contrasted results. Nonetheless, TWINC performed well during the PatOlympics competition and was appreciated by intellectual property experts. This result should be balanced by the limited evaluation sample. We can also assume that it can be customized to be applied in corporate search environments to process domain and company-specific vocabularies, including non-English literature and patents reports.
Author name recognition in degraded journal images
NASA Astrophysics Data System (ADS)
de Bodard de la Jacopière, Aliette; Likforman-Sulem, Laurence
2006-01-01
A method for extracting names in degraded documents is presented in this article. The documents targeted are images of photocopied scientific journals from various scientific domains. Due to the degradation, there is poor OCR recognition, and pieces of other articles appear on the sides of the image. The proposed approach relies on the combination of a low-level textual analysis and an image-based analysis. The textual analysis extracts robust typographic features, while the image analysis selects image regions of interest through anchor components. We report results on the University of Washington benchmark database.
How brand names are special: brands, words, and hemispheres.
Gontijo, Possidonia F D; Rayman, Janice; Zhang, Shi; Zaidel, Eran
2002-09-01
Previous research has consistently shown differences between the processing of proper names and of common nouns, leading to the belief that proper names possess a special neuropsychological status. We investigate the category of brand names and suggest that brand names also have a special neuropsychological status, but one which is different from proper names. The findings suggest that the hemispheric lexical status of the brand names is mixed--they behave like words in some respects and like nonwords in others. Our study used familiar upper case brand names, common nouns, and two different types of nonwords ("weird" and "normal") differing in length, as stimuli in a lateralized lexical decision task (LDT). Common nouns, brand names, weird nonwords, and normal nonwords were recognized in that decreasing order of speed and accuracy. A right visual field (RVF) advantage was found for all four lexical types. Interestingly, brand names, similar to nonwords, were found to be less lateralized than common nouns, consistent with theories of category-specific lexical processing. Further, brand names were the only type of lexical items to show a capitalization effect: brand names were recognized faster when they were presented in upper case than in lower case. In addition, while string length affected the recognition of common nouns only in the left visual field (LVF) and the recognition of nonwords only in the RVF, brand names behaved like common nouns in exhibiting length effects only in the LVF. Copyright 2002 Elsevier Science (USA)
Layered recognition networks that pre-process, classify, and describe
NASA Technical Reports Server (NTRS)
Uhr, L.
1971-01-01
A brief overview is presented of six types of pattern recognition programs that: (1) preprocess, then characterize; (2) preprocess and characterize together; (3) preprocess and characterize into a recognition cone; (4) describe as well as name; (5) compose interrelated descriptions; and (6) converse. A computer program (of types 3 through 6) is presented that transforms and characterizes the input scene through the successive layers of a recognition cone, and then engages in a stylized conversation to describe the scene.
Modeling Interval Temporal Dependencies for Complex Activities Understanding
2013-10-11
ORGANIZATION NAMES AND ADDRESSES U.S. Army Research Office P.O. Box 12211 Research Triangle Park, NC 27709-2211 15. SUBJECT TERMS Human activity modeling...computer vision applications: human activity recognition and facial activity recognition. The results demonstrate the superior performance of the
Image Classification for Web Genre Identification
2012-01-01
recognition and landscape detection using the computer vision toolkit OpenCV1. For facial recognition , we researched the possibilities of using the...method for connecting these names with a face/personal photo and logo respectively. [2] METHODOLOGY For this project, we focused primarily on facial
NASA Astrophysics Data System (ADS)
Esparza, Javier
In many areas of computer science entities can “reproduce”, “replicate”, or “create new instances”. Paramount examples are threads in multithreaded programs, processes in operating systems, and computer viruses, but many others exist: procedure calls create new incarnations of the callees, web crawlers discover new pages to be explored (and so “create” new tasks), divide-and-conquer procedures split a problem into subproblems, and leaves of tree-based data structures become internal nodes with children. For lack of a better name, I use the generic term systems with process creation to refer to all these entities.
Exploring Contextual Models in Chemical Patent Search
NASA Astrophysics Data System (ADS)
Urbain, Jay; Frieder, Ophir
We explore the development of probabilistic retrieval models for integrating term statistics with entity search using multiple levels of document context to improve the performance of chemical patent search. A distributed indexing model was developed to enable efficient named entity search and aggregation of term statistics at multiple levels of patent structure including individual words, sentences, claims, descriptions, abstracts, and titles. The system can be scaled to an arbitrary number of compute instances in a cloud computing environment to support concurrent indexing and query processing operations on large patent collections.
A Three Dimensional Electronic Retina Architecture.
1987-12-01
not guarantee that a biological entity is in fact the best design because of the unique constraining factors of a biological organism and the associated...4. PERFORMING ORGANIZATION REPORT NUMBER(S) 5. MONITORING ORGANIZATION REPORT NUMBER(S) AFIT/GCS/ENG/87D-23 6a. NAME OF PERFORMING ORGANIZATION 6b...OFFICE SYMBOL 7a. NAME OF MONITORING ORGANIZATION (If applicable) School of Engineering AFIT/ENG 6c. ADDRESS (City, State, and ZIP Code) 7b. ADDRESS
2016-09-01
PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) Naval Postgraduate School Monterey, CA 93943-5000 8. PERFORMING ORGANIZATION REPORT NUMBER 9...state- and local-level computer networks fertile ground for the cyber adversary. This research focuses on the threat to SLTT computer networks and how...institutions, and banking systems. The array of responsibilities and the cybersecurity threat landscape make state- and local-level computer networks fertile
Recognition of student names past: a longitudinal study with N = 1.
Huang, I N
1997-01-01
Recognition of names of former students taught at different times by a middle-aged college professor was tested, to investigate recognition memory over a time span ranging from 6 months to 26.5 years. The relationship between the d', a measure of strength of memory, and the retention interval can be best described by a logarithmic function characterized by a rapid initial drop followed by a slow forgetting rate. The correct responses (hits and rejections) had higher confidence and shorter response time than did the incorrect responses (false alarms and misses). The results show that an ecologically realistic longitudinal study with N = 1 can provide a valuable means in the study of human memory with very long retention intervals, which have not yet been investigated in the laboratory.
Zand, Ladan; Muriithi, Angela; Nelsen, Eric; Franco, Pablo M; Greene, Eddie L; Qian, Qi; El-Zoghby, Ziad M
2012-12-01
Anion gap metabolic acidosis (AGMA) is commonly encountered in medical practice. Acetaminophen-induced AGMA is, however, not widely recognized. We report 2 cases of high anion gap metabolic acidosis secondary to 5-oxoproline accumulation resulting from acetaminophen consumption: the first case caused by acute one-time ingestion of large quantities of acetaminophen and the second case caused by chronic repeated ingestion in a patient with chronic liver disease. Recognition of this entity facilitated timely diagnosis and effective treatment. Given acetaminophen is commonly used over the counter medication, increased recognition of this adverse effect is of important clinical significance.
Fast cat-eye effect target recognition based on saliency extraction
NASA Astrophysics Data System (ADS)
Li, Li; Ren, Jianlin; Wang, Xingbin
2015-09-01
Background complexity is a main reason that results in false detection in cat-eye target recognition. Human vision has selective attention property which can help search the salient target from complex unknown scenes quickly and precisely. In the paper, we propose a novel cat-eye effect target recognition method named Multi-channel Saliency Processing before Fusion (MSPF). This method combines traditional cat-eye target recognition with the selective characters of visual attention. Furthermore, parallel processing enables it to achieve fast recognition. Experimental results show that the proposed method performs better in accuracy, robustness and speed compared to other methods.
Color Makes a Difference: Two-Dimensional Object Naming in Literate and Illiterate Subjects
ERIC Educational Resources Information Center
Reis, Alexandra; Faisca, Luis; Ingvar, Martin; Petersson, Karl Magnus
2006-01-01
Previous work has shown that illiterate subjects are better at naming two-dimensional representations of real objects when presented as colored photos as compared to black and white drawings. This raises the question if color or textural details selectively improve object recognition and naming in illiterate compared to literate subjects. In this…
From Chair to "Chair": A Representational Shift Account of Object Labeling Effects on Memory
ERIC Educational Resources Information Center
Lupyan, Gary
2008-01-01
What are the consequences of calling things by their names? Six experiments investigated how classifying familiar objects with basic-level names (chairs, tables, and lamps) affected recognition memory. Memory was found to be worse for items that were overtly classified with the category name--as reflected by lower hit rates--compared with items…
ERIC Educational Resources Information Center
Li, Hong; Shu, Hua; McBride-Chang, Catherine; Liu, Hongyun; Peng, Hong
2012-01-01
Tasks tapping visual skills, orthographic knowledge, phonological awareness, speeded naming, morphological awareness and Chinese character recognition were administered to 184 kindergarteners and 273 primary school students from Beijing. Regression analyses indicated that only syllable deletion, morphological construction and speeded number naming…
Human-machine interaction to disambiguate entities in unstructured text and structured datasets
NASA Astrophysics Data System (ADS)
Ward, Kevin; Davenport, Jack
2017-05-01
Creating entity network graphs is a manual, time consuming process for an intelligence analyst. Beyond the traditional big data problems of information overload, individuals are often referred to by multiple names and shifting titles as they advance in their organizations over time which quickly makes simple string or phonetic alignment methods for entities insufficient. Conversely, automated methods for relationship extraction and entity disambiguation typically produce questionable results with no way for users to vet results, correct mistakes or influence the algorithm's future results. We present an entity disambiguation tool, DRADIS, which aims to bridge the gap between human-centric and machinecentric methods. DRADIS automatically extracts entities from multi-source datasets and models them as a complex set of attributes and relationships. Entities are disambiguated across the corpus using a hierarchical model executed in Spark allowing it to scale to operational sized data. Resolution results are presented to the analyst complete with sourcing information for each mention and relationship allowing analysts to quickly vet the correctness of results as well as correct mistakes. Corrected results are used by the system to refine the underlying model allowing analysts to optimize the general model to better deal with their operational data. Providing analysts with the ability to validate and correct the model to produce a system they can trust enables them to better focus their time on producing higher quality analysis products.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tautges, Timothy J.
MOAB is a component for representing and evaluating mesh data. MOAB can store stuctured and unstructured mesh, consisting of elements in the finite element "zoo". The functional interface to MOAB is simple yet powerful, allowing the representation of many types of metadata commonly found on the mesh. MOAB is optimized for efficiency in space and time, based on access to mesh in chunks rather than through individual entities, while also versatile enough to support individual entity access. The MOAB data model consists of a mesh interface instance, mesh entities (vertices and elements), sets, and tags. Entities are addressed through handlesmore » rather than pointers, to allow the underlying representation of an entity to change without changing the handle to that entity. Sets are arbitrary groupings of mesh entities and other sets. Sets also support parent/child relationships as a relation distinct from sets containing other sets. The directed-graph provided by set parent/child relationships is useful for modeling topological relations from a geometric model or other metadata. Tags are named data which can be assigned to the mesh as a whole, individual entities, or sets. Tags are a mechanism for attaching data to individual entities and sets are a mechanism for describing relations between entities; the combination of these two mechanisms isa powerful yet simple interface for representing metadata or application-specific data. For example, sets and tags can be used together to describe geometric topology, boundary condition, and inter-processor interface groupings in a mesh. MOAB is used in several ways in various applications. MOAB serves as the underlying mesh data representation in the VERDE mesh verification code. MOAB can also be used as a mesh input mechanism, using mesh readers induded with MOAB, or as a tanslator between mesh formats, using readers and writers included with MOAB.« less
Wheat, Katherine L; Cornelissen, Piers L; Sack, Alexander T; Schuhmann, Teresa; Goebel, Rainer; Blomert, Leo
2013-05-01
Magnetoencephalography (MEG) has shown pseudohomophone priming effects at Broca's area (specifically pars opercularis of left inferior frontal gyrus and precentral gyrus; LIFGpo/PCG) within ∼100ms of viewing a word. This is consistent with Broca's area involvement in fast phonological access during visual word recognition. Here we used online transcranial magnetic stimulation (TMS) to investigate whether LIFGpo/PCG is necessary for (not just correlated with) visual word recognition by ∼100ms. Pulses were delivered to individually fMRI-defined LIFGpo/PCG in Dutch speakers 75-500ms after stimulus onset during reading and picture naming. Reading and picture naming reactions times were significantly slower following pulses at 225-300ms. Contrary to predictions, there was no disruption to reading for pulses before 225ms. This does not provide evidence in favour of a functional role for LIFGpo/PCG in reading before 225ms in this case, but does extend previous findings in picture stimuli to written Dutch words. Copyright © 2012 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Ting, Yu-Liang; Tai, Yaming; Chen, Jun-Horng
2017-01-01
Telepresence has been playing an important role in a mediated learning environment. However, the current design of telepresence seems to be dominated by the emulation of physical human presence. With reference to social constructivism learning and the recognition of individuals as intelligent entities, this study explored the transformation of…
Hereditary neuropathy with liability to pressure palsies occurring during military training.
Delacour, H; Bompaire, F; Biale, L; Sallansonnet-Froment, M; Ceppa, F; Burnat, P
2012-03-01
Hereditary neuropathy with liability to pressure palsies (HNPP) is an autosomal-dominant peripheral neuropathy characterized by recurrent isolated nerve palsies, which are precipitated by trivial compression and trauma. Although HNPP has been well-described in literature, it often goes unrecognized. We report a case of HNPP occurring during military training to promote recognition and proper management of this entity.
Recognition of human activity characteristics based on state transitions modeling technique
NASA Astrophysics Data System (ADS)
Elangovan, Vinayak; Shirkhodaie, Amir
2012-06-01
Human Activity Discovery & Recognition (HADR) is a complex, diverse and challenging task but yet an active area of ongoing research in the Department of Defense. By detecting, tracking, and characterizing cohesive Human interactional activity patterns, potential threats can be identified which can significantly improve situation awareness, particularly, in Persistent Surveillance Systems (PSS). Understanding the nature of such dynamic activities, inevitably involves interpretation of a collection of spatiotemporally correlated activities with respect to a known context. In this paper, we present a State Transition model for recognizing the characteristics of human activities with a link to a prior contextbased ontology. Modeling the state transitions between successive evidential events determines the activities' temperament. The proposed state transition model poses six categories of state transitions including: Human state transitions of Object handling, Visibility, Entity-entity relation, Human Postures, Human Kinematics and Distance to Target. The proposed state transition model generates semantic annotations describing the human interactional activities via a technique called Casual Event State Inference (CESI). The proposed approach uses a low cost kinect depth camera for indoor and normal optical camera for outdoor monitoring activities. Experimental results are presented here to demonstrate the effectiveness and efficiency of the proposed technique.
Development of an acoustic wave based biosensor for vapor phase detection of small molecules
NASA Astrophysics Data System (ADS)
Stubbs, Desmond
For centuries scientific ingenuity and innovation have been influenced by Mother Nature's perfect design. One of her more elusive designs is that of the sensory olfactory system, an array of highly sensitive receptors responsible for chemical vapor recognition. In the animal kingdom this ability is magnified among canines where ppt (parts per trillion) sensitivity values have been reported. Today, detection dogs are considered an essential part of the US drug and explosives detection schemes. However, growing concerns about their susceptibility to extraneous odors have inspired the development of highly sensitive analytical detection tools or biosensors known as "electronic noses". In general, biosensors are distinguished from chemical sensors in that they use an entity of biological origin (e.g. antibody, cell, enzyme) immobilized onto a surface as the chemically-sensitive film on the device. The colloquial view is that the term "biosensors" refers to devices which detect the presence of entities of biological origin, such as proteins or single-stranded DNA and that this detection must take place in a liquid. Our biosensor utilizes biomolecules, specifically IgG monoclonal antibodies, to achieve molecular recognition of relatively small molecules in the vapor phase.
Taking on Nationalism in the Name of Intercultural Competence
ERIC Educational Resources Information Center
Meadows, Bryan
2010-01-01
Nationalism presents significant challenges to intercultural competence instruction. On the one hand, nationalism promotes the compartmentalization of communities into mutually-exclusive and discretely-defined nationalist entities. In complementary fashion, nationalism also advocates the homogenization of cultural and linguistic practices within…
Code of Federal Regulations, 2010 CFR
2010-04-01
... the original location. (b) For 120 days after the commencement or the expansion of commercial... original location. (c) For the purposes of this section, relocating establishment means a business entity... review should include names under which the establishment does business, including successors-in-interest...
Evaluation of Fly Ash Quality Control Tools
DOT National Transportation Integrated Search
2010-06-30
Many entities currently use fly ash in portland cement concrete (PCC) pavements and structures. Although the body of knowledge is : great concerning the use of fly ash, several projects per year are subject to poor performance where fly ash is named ...
Evaluation of fly ash quality control tools.
DOT National Transportation Integrated Search
2010-06-30
Many entities currently use fly ash in portland cement concrete (PCC) pavements and structures. Although the body of knowledge is : great concerning the use of fly ash, several projects per year are subject to poor performance where fly ash is named ...
Obligatory and facultative brain regions for voice-identity recognition
Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina
2018-01-01
Abstract Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is only a facultative component of voice-identity recognition in situations where additional face-identity processing is required. PMID:29228111
Obligatory and facultative brain regions for voice-identity recognition.
Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina
2018-01-01
Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is only a facultative component of voice-identity recognition in situations where additional face-identity processing is required. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain.
Keyes, Helen; Dlugokencka, Aleksandra
2014-01-01
We respond more quickly to our own face than to other faces, but there is debate over whether this is connected to attention-grabbing properties of the self-face. In two experiments, we investigate whether the self-face selectively captures attention, and the attentional conditions under which this might occur. In both experiments, we examined whether different types of face (self, friend, stranger) provide differential levels of distraction when processing self, friend and stranger names. In Experiment 1, an image of a distractor face appeared centrally – inside the focus of attention – behind a target name, with the faces either upright or inverted. In Experiment 2, distractor faces appeared peripherally – outside the focus of attention – in the left or right visual field, or bilaterally. In both experiments, self-name recognition was faster than other name recognition, suggesting a self-referential processing advantage. The presence of the self-face did not cause more distraction in the naming task compared to other types of face, either when presented inside (Experiment 1) or outside (Experiment 2) the focus of attention. Distractor faces had different effects across the two experiments: when presented inside the focus of attention (Experiment 1), self and friend images facilitated self and friend naming, respectively. This was not true for stranger stimuli, suggesting that faces must be robustly represented to facilitate name recognition. When presented outside the focus of attention (Experiment 2), no facilitation occurred. Instead, we report an interesting distraction effect caused by friend faces when processing strangers’ names. We interpret this as a “social importance” effect, whereby we may be tuned to pick out and pay attention to familiar friend faces in a crowd. We conclude that any speed of processing advantages observed in the self-face processing literature are not driven by automatic attention capture. PMID:25338170
Keyes, Helen; Dlugokencka, Aleksandra
2014-01-01
We respond more quickly to our own face than to other faces, but there is debate over whether this is connected to attention-grabbing properties of the self-face. In two experiments, we investigate whether the self-face selectively captures attention, and the attentional conditions under which this might occur. In both experiments, we examined whether different types of face (self, friend, stranger) provide differential levels of distraction when processing self, friend and stranger names. In Experiment 1, an image of a distractor face appeared centrally - inside the focus of attention - behind a target name, with the faces either upright or inverted. In Experiment 2, distractor faces appeared peripherally - outside the focus of attention - in the left or right visual field, or bilaterally. In both experiments, self-name recognition was faster than other name recognition, suggesting a self-referential processing advantage. The presence of the self-face did not cause more distraction in the naming task compared to other types of face, either when presented inside (Experiment 1) or outside (Experiment 2) the focus of attention. Distractor faces had different effects across the two experiments: when presented inside the focus of attention (Experiment 1), self and friend images facilitated self and friend naming, respectively. This was not true for stranger stimuli, suggesting that faces must be robustly represented to facilitate name recognition. When presented outside the focus of attention (Experiment 2), no facilitation occurred. Instead, we report an interesting distraction effect caused by friend faces when processing strangers' names. We interpret this as a "social importance" effect, whereby we may be tuned to pick out and pay attention to familiar friend faces in a crowd. We conclude that any speed of processing advantages observed in the self-face processing literature are not driven by automatic attention capture.
The J-Staff System, Network Synchronisation and Noise
2014-06-01
GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S... work . A key challenge of such structures is their tendency to fall into extreme dynamical modes. One is a ‘two-speed’ mode, where units interacting with...longer term planning, led by the J5 Planning Branch, fall into a slow cycle of work , while those entities interacting predominately with operations
Scorolli, Claudia; Borghi, Anna M.
2015-01-01
The present study investigates the role that shape and color play in the representation of animate (i.e., animals) and inanimate manipulable entities (i.e., fruits), and how the importance of these features is modulated by different tasks. Across three experiments participants were shown either images of entities (e.g., a sheep or a pineapple) or images of the same entities modified in color (e.g., a blue pineapple) or in shape (e.g., an elongated pineapple). In Experiment 1 we asked participants to categorize the entities as fruit or animal. Results showed that with animals color does not matter, while shape modifications determined a deterioration of the performance – stronger for fruit than for animals. To better understand our findings, in Experiments 2 we asked participants to judge if entities were graspable (manipulation evaluation task). Participants were faster with manipulable entities (fruit) than with animals; moreover alterations in shape affected the response latencies more for animals than for fruit. In Experiment 3 (motion evaluation task), we replicated the disadvantage for shape-altered animals, while with fruits shape and color modifications produced no effect. By contrasting shape- and color- alterations the present findings provide information on shape/color relative weight, suggesting that the action based property of shape is more crucial than color for fruit categorization, while with animals it is critical for both manipulation and motion tasks. This contextual dependency is further revealed by explicit judgments on similarity – between the altered entities and the prototypical ones – provided after the different tasks. These results extend current literature on affordances and biofunctionally embodied understanding, revealing the relative robustness of biofunctional activity compared to intellectual one. PMID:26500593
When a Picasso is a "Picasso": the entry point in the identification of visual art.
Belke, B; Leder, H; Harsanyi, G; Carbon, C C
2010-02-01
We investigated whether art is distinguished from other real world objects in human cognition, in that art allows for a special memorial representation and identification based on artists' specific stylistic appearances. Testing art-experienced viewers, converging empirical evidence from three experiments, which have proved sensitive to addressing the question of initial object recognition, suggest that identification of visual art is at the subordinate level of the producing artist. Specifically, in a free naming task it was found that art-objects as opposed to non-art-objects were most frequently named with subordinate level categories, with the artist's name as the most frequent category (Experiment 1). In a category-verification task (Experiment 2), art-objects were recognized faster than non-art-objects on the subordinate level with the artist's name. In a conceptual priming task, subordinate primes of artists' names facilitated matching responses to art-objects but subordinate primes did not facilitate responses to non-art-objects (Experiment 3). Collectively, these results suggest that the artist's name has a special status in the memorial representation of visual art and serves as a predominant entry point in recognition in art perception. Copyright 2009 Elsevier B.V. All rights reserved.
Development of an information retrieval tool for biomedical patents.
Alves, Tiago; Rodrigues, Rúben; Costa, Hugo; Rocha, Miguel
2018-06-01
The volume of biomedical literature has been increasing in the last years. Patent documents have also followed this trend, being important sources of biomedical knowledge, technical details and curated data, which are put together along the granting process. The field of Biomedical text mining (BioTM) has been creating solutions for the problems posed by the unstructured nature of natural language, which makes the search of information a challenging task. Several BioTM techniques can be applied to patents. From those, Information Retrieval (IR) includes processes where relevant data are obtained from collections of documents. In this work, the main goal was to build a patent pipeline addressing IR tasks over patent repositories to make these documents amenable to BioTM tasks. The pipeline was developed within @Note2, an open-source computational framework for BioTM, adding a number of modules to the core libraries, including patent metadata and full text retrieval, PDF to text conversion and optical character recognition. Also, user interfaces were developed for the main operations materialized in a new @Note2 plug-in. The integration of these tools in @Note2 opens opportunities to run BioTM tools over patent texts, including tasks from Information Extraction, such as Named Entity Recognition or Relation Extraction. We demonstrated the pipeline's main functions with a case study, using an available benchmark dataset from BioCreative challenges. Also, we show the use of the plug-in with a user query related to the production of vanillin. This work makes available all the relevant content from patents to the scientific community, decreasing drastically the time required for this task, and provides graphical interfaces to ease the use of these tools. Copyright © 2018 Elsevier B.V. All rights reserved.
Context recognition for a hyperintensional inference machine
NASA Astrophysics Data System (ADS)
Duží, Marie; Fait, Michal; Menšík, Marek
2017-07-01
The goal of this paper is to introduce the algorithm of context recognition in the functional programming language TIL-Script, which is a necessary condition for the implementation of the TIL-Script inference machine. The TIL-Script language is an operationally isomorphic syntactic variant of Tichý's Transparent Intensional Logic (TIL). From the formal point of view, TIL is a hyperintensional, partial, typed λ-calculus with procedural semantics. Hyperintensional, because TIL λ-terms denote procedures (defined as TIL constructions) producing set-theoretic functions rather than the functions themselves; partial, because TIL is a logic of partial functions; and typed, because all the entities of TIL ontology, including constructions, receive a type within a ramified hierarchy of types. These features make it possible to distinguish three levels of abstraction at which TIL constructions operate. At the highest hyperintensional level the object to operate on is a construction (though a higher-order construction is needed to present this lower-order construction as an object of predication). At the middle intensional level the object to operate on is the function presented, or constructed, by a construction, while at the lowest extensional level the object to operate on is the value (if any) of the presented function. Thus a necessary condition for the development of an inference machine for the TIL-Script language is recognizing a context in which a construction occurs, namely extensional, intensional and hyperintensional context, in order to determine the type of an argument at which a given inference rule can be properly applied. As a result, our logic does not flout logical rules of extensional logic, which makes it possible to develop a hyperintensional inference machine for the TIL-Script language.
Morphological Influences on the Recognition of Monosyllabic Monomorphemic Words
ERIC Educational Resources Information Center
Baayen, R. H.; Feldman, L. B.; Schreuder, R.
2006-01-01
Balota et al. [Balota, D., Cortese, M., Sergent-Marshall, S., Spieler, D., & Yap, M. (2004). Visual word recognition for single-syllable words. "Journal of Experimental Psychology: General, 133," 283-316] studied lexical processing in word naming and lexical decision using hierarchical multiple regression techniques for a large data set of…
ERIC Educational Resources Information Center
Richler, Jennifer J.; Gauthier, Isabel; Palmeri, Thomas J.
2011-01-01
Are there consequences of calling objects by their names? Lupyan (2008) suggested that overtly labeling objects impairs subsequent recognition memory because labeling shifts stored memory representations of objects toward the category prototype (representational shift hypothesis). In Experiment 1, we show that processing objects at the basic…
Pattern Perception and Pictures for the Blind
ERIC Educational Resources Information Center
Heller, Morton A.; McCarthy, Melissa; Clark, Ashley
2005-01-01
This article reviews recent research on perception of tangible pictures in sighted and blind people. Haptic picture naming accuracy is dependent upon familiarity and access to semantic memory, just as in visual recognition. Performance is high when haptic picture recognition tasks do not depend upon semantic memory. Viewpoint matters for the ease…
Separating Speed from Accuracy in Beginning Reading Development
ERIC Educational Resources Information Center
Juul, Holger; Poulsen, Mads; Elbro, Carsten
2014-01-01
Phoneme awareness, letter knowledge, and rapid automatized naming (RAN) are well-known kindergarten predictors of later word recognition skills, but it is not clear whether they predict developments in accuracy or speed, or both. The present longitudinal study of 172 Danish beginning readers found that speed of word recognition mainly developed…
Parts and Relations in Young Children's Shape-Based Object Recognition
ERIC Educational Resources Information Center
Augustine, Elaine; Smith, Linda B.; Jones, Susan S.
2011-01-01
The ability to recognize common objects from sparse information about geometric shape emerges during the same period in which children learn object names and object categories. Hummel and Biederman's (1992) theory of object recognition proposes that the geometric shapes of objects have two components--geometric volumes representing major object…
45 CFR 164.508 - Uses and disclosures for which an authorization is required.
Code of Federal Regulations, 2011 CFR
2011-10-01
... is in the form of: (A) A face-to-face communication made by a covered entity to an individual; or (B... meaningful fashion. (ii) The name or other specific identification of the person(s), or class of persons...
45 CFR 164.508 - Uses and disclosures for which an authorization is required.
Code of Federal Regulations, 2010 CFR
2010-10-01
... is in the form of: (A) A face-to-face communication made by a covered entity to an individual; or (B... meaningful fashion. (ii) The name or other specific identification of the person(s), or class of persons...
2004-05-01
Army Soldier System Command: http://www.natick.armv.mil Role Name Facial Recognition Program Manager, Army Technical Lead Mark Chandler...security force with a facial recognition system. Mike Holloran, technology officer with the 6 Fleet, directed LCDR Hoa Ho and CAPT(s) Todd Morgan to...USN 6th Fleet was accomplished with the admiral expressing his support for continuing the evaluation of the a facial recognition system. This went
Single-Molecule View of Small RNA-Guided Target Search and Recognition.
Globyte, Viktorija; Kim, Sung Hyun; Joo, Chirlmin
2018-05-20
Most everyday processes in life involve a necessity for an entity to locate its target. On a cellular level, many proteins have to find their target to perform their function. From gene-expression regulation to DNA repair to host defense, numerous nucleic acid-interacting proteins use distinct target search mechanisms. Several proteins achieve that with the help of short RNA strands known as guides. This review focuses on single-molecule advances studying the target search and recognition mechanism of Argonaute and CRISPR (clustered regularly interspaced short palindromic repeats) systems. We discuss different steps involved in search and recognition, from the initial complex prearrangement into the target-search competent state to the final proofreading steps. We focus on target search mechanisms that range from weak interactions, to one- and three-dimensional diffusion, to conformational proofreading. We compare the mechanisms of Argonaute and CRISPR with a well-studied target search system, RecA.
Door recognition in cluttered building interiors using imagery and lidar data
NASA Astrophysics Data System (ADS)
Díaz-Vilariño, L.; Martínez-Sánchez, J.; Lagüela, S.; Armesto, J.; Khoshelham, K.
2014-06-01
Building indoors reconstruction is an active research topic due to the importance of the wide range of applications to which they can be subjected, from architecture and furniture design, to movies and video games editing, or even crime scene investigation. Among the constructive elements defining the inside of a building, doors are important entities in applications like routing and navigation, and their automated recognition is advantageous e.g. in case of large multi-storey buildings with many office rooms. The inherent complexity of the automation of the recognition process is increased by the presence of clutter and occlusions, difficult to avoid in indoor scenes. In this work, we present a pipeline of techniques used for the reconstruction and interpretation of building interiors using information acquired in the form of point clouds and images. The methodology goes in depth with door detection and labelling as either opened, closed or furniture (false positive)
Super-hydrophobicity fundamentals: implications to biofouling prevention.
Marmur, Abraham
2006-01-01
The theory of wetting on super-hydrophobic surfaces is presented and discussed, within the general framework of equilibrium wetting and contact angles. Emphasis is put on the implications of super-hydrophobicity to the prevention of biofouling. Two main lines of thought are discussed, viz. i) "mirror imaging" of the Lotus effect, namely designing a surface that repels biological entities by being super-hydrophilic, and ii) designing a surface that minimises the water-wetted area when submerged in water (by keeping an air film between the water and the surface), so that the suspended biological entities have a low probability of encountering the solid surface.
Taking the fifth amendment in Turing's imitation game
NASA Astrophysics Data System (ADS)
Warwick, Kevin; Shah, Huma
2017-03-01
In this paper, we look at a specific issue with practical Turing tests, namely the right of the machine to remain silent during interrogation. In particular, we consider the possibility of a machine passing the Turing test simply by not saying anything. We include a number of transcripts from practical Turing tests in which silence has actually occurred on the part of a hidden entity. Each of the transcripts considered here resulted in a judge being unable to make the 'right identification', i.e., they could not say for certain which hidden entity was the machine.
Mentovich, Avital; Huq, Aziz; Cerf, Moran
2016-04-01
The U.S. Supreme Court has increasingly expanded the scope of constitutional rights granted to corporations and other collective entities. Although this tendency receives widespread public and media attention, little empirical research examines how people ascribe rights, commonly thought to belong to natural persons, to corporations. This article explores this issue in 3 studies focusing on different rights (religious liberty, privacy, and free speech). We examined participants' willingness to grant a given right while manipulating the type of entity at stake (from small businesses, to larger corporations, to for-profit and nonprofit companies), and the identity of the right holder (from employees, to owners, to the company itself as a separate entity). We further examined the role of political ideology in perceptions of rights. Results indicated a significant decline in the degree of recognition of entities' rights (the company itself) in comparison to natural persons' rights (owners and employees). Results also demonstrated an effect of the type of entity at stake: Larger, for-profit businesses were less likely to be viewed as rights holders compared with nonprofit entities. Although both tendencies persisted across the ideological spectrum, ideological differences emerged in the relations between corporate and individual rights: these were positively related among conservatives but negatively related among liberals. Finally, we found that the desire to protect citizens (compared with businesses) underlies individuals' willingness to grant rights to companies. These findings show that people (rather than corporations) are more appropriate recipients of rights, and can explain public backlash to judicial expansions of corporate rights. (c) 2016 APA, all rights reserved).
Motivation and Organizational Principles for Anatomical Knowledge Representation
Rosse, Cornelius; Mejino, José L.; Modayur, Bharath R.; Jakobovits, Rex; Hinshaw, Kevin P.; Brinkley, James F.
1998-01-01
Abstract Objective: Conceptualization of the physical objects and spaces that constitute the human body at the macroscopic level of organization, specified as a machine-parseable ontology that, in its human-readable form, is comprehensible to both expert and novice users of anatomical information. Design: Conceived as an anatomical enhancement of the UMLS Semantic Network and Metathesaurus, the anatomical ontology was formulated by specifying defining attributes and differentia for classes and subclasses of physical anatomical entities based on their partitive and spatial relationships. The validity of the classification was assessed by instantiating the ontology for the thorax. Several transitive relationships were used for symbolically modeling aspects of the physical organization of the thorax. Results: By declaring Organ as the macroscopic organizational unit of the body, and defining the entities that constitute organs and higher level entities constituted by organs, all anatomical entities could be assigned to one of three top level classes (Anatomical structure, Anatomical spatial entity and Body substance). The ontology accommodates both the systemic and regional (topographical) views of anatomy, as well as diverse clinical naming conventions of anatomical entities. Conclusions: The ontology formulated for the thorax is extendible to microscopic and cellular levels, as well as to other body parts, in that its classes subsume essentially all anatomical entities that constitute the body. Explicit definitions of these entities and their relationships provide the first requirement for standards in anatomical concept representation. Conceived from an anatomical viewpoint, the ontology can be generalized and mapped to other biomedical domains and problem solving tasks that require anatomical knowledge. PMID:9452983
ERIC Educational Resources Information Center
Clarke, John Henrik
1989-01-01
The term "African" has gone through several phases of acceptability in the course of United States history. Changes in the applicability of the name reflect developments in African-American consciousness in the context of national and world history. Recognition of African identity is influencing Black definition and direction worldwide.…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Katsumi Marukawa; Kazuki Nakashima; Masashi Koga
1994-12-31
This paper presents a paper form processing system with an error correcting function for reading handwritten kanji strings. In the paper form processing system, names and addresses are important key data, and especially this paper takes up an error correcting method for name and address recognition. The method automatically corrects errors of the kanji OCR (Optical Character Reader) with the help of word dictionaries and other knowledge. Moreover, it allows names and addresses to be written in any style. The method consists of word matching {open_quotes}furigana{close_quotes} verification for name strings, and address approval for address strings. For word matching, kanjimore » name candidates are extracted by automaton-type word matching. In {open_quotes}furigana{close_quotes} verification, kana candidate characters recognized by the kana OCR are compared with kana`s searched from the name dictionary based on kanji name candidates, given by the word matching. The correct name is selected from the results of word matching and furigana verification. Also, the address approval efficiently searches for the right address based on a bottom-up procedure which follows hierarchical relations from a lower placename to a upper one by using the positional condition among the placenames. We ascertained that the error correcting method substantially improves the recognition rate and processing speed in experiments on 5,032 forms.« less
Letter-case information and the identification of brand names.
Perea, Manuel; Jiménez, María; Talero, Fernanda; López-Cañada, Soraya
2015-02-01
A central tenet of most current models of visual-word recognition is that lexical units are activated on the basis of case-invariant abstract letter representations. Here, we examined this assumption by using a unique type of words: brand names. The rationale of the experiments is that brand names are archetypically printed either in lowercase (e.g., adidas) or uppercase (e.g., IKEA). This allows us to present the brand names in their standard or non-standard case configuration (e.g., adidas, IKEA vs. ADIDAS, ikea, respectively). We conducted two experiments with a brand-decision task ('is it a brand name?'): a single-presentation experiment and a masked priming experiment. Results in the single-presentation experiment revealed faster identification times of brand names in their standard case configuration than in their non-standard case configuration (i.e., adidas faster than ADIDAS; IKEA faster than ikea). In the masked priming experiment, we found faster identification times of brand names when they were preceded by an identity prime that matched its standard case configuration than when it did not (i.e., faster response times to adidas-adidas than to ADIDAS-adidas). Taken together, the present findings strongly suggest that letter-case information forms part of a brand name's graphemic information, thus posing some limits to current models of visual-word recognition. © 2014 The British Psychological Society.
Type specimens and basic principles of avian taxonomy
Banks, Richard C.; Goodman, Steven M.; Lanyon, Scott M.; Schulenberg, Thomas S.
1993-01-01
"Ornithology" may be defined as the scientific study of birds. No aspect of avian biology, including management and conservation, can be carried out without reference by name to birds at some taxonomic level. Thus, the names of species of birds, and of groups of species, can fairly be considered to be of primary importance in ornithology. To be useful, these names themselves must be defined and related to biological entities. The definition of a name is accomplished by the designation of a "type." The International Code of Zoological Nomenclature, in paragraph (C) of Article 72 (third edition, 1985), establishes criteria for eligibility of a name-bearing type. The type of a species or sub-species name is the biological specimen defined by the name, and later use of the name implies specific or subspecific identity with the type. It is imperative, therefore, that a type be available for study and comparison so that the identity of other material with it can be established.
Atypical presentations of methemoglobinemia from benzocaine spray.
Tantisattamo, Ekamol; Suwantarat, Nuntra; Vierra, Joseph R; Evans, Samuel J
2011-06-01
Widely used for local anesthesia, especially prior to endoscopic procedures, benzocaine spray is one of the most common causes of iatrogenic methemoglobinemia. The authors report an atypical case of methemoglobinemia in a woman presenting with pale skin and severe hypoxemia, after a delayed repeat exposure to benzocaine spray. Early recognition and prompt management of methemoglobinemia is needed in order to lessen morbidity and mortality from this entity.
2013-07-01
ELEMENT NUMBER 6. AUTHOR (S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) U.S...persons for whom DOD must account. A committee report accompanying the National Defense Authorization Act for Fiscal Year 2013 mandated GAO to...many organizations and each reports through a different line of authority . Thus, no single entity is responsible for communitywide personnel and
Meeting Canadian Forces Expansion Goals through Retention
2010-05-01
performed well) for the recognition they would receive, while at the same time supporting satisfaction of intrinsic needs, by reinforcing the good feeling...Expansion Goals Through Expansion 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS LCol M.A. Nixon 6. AUTHOR(S) 7. PERFORMING ORGANIZATION NAME(S...AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING / MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSORING
To the Question about the Quality of Economic Education
ERIC Educational Resources Information Center
Dyshaeva, Lyudmila
2015-01-01
The article discusses the shortcomings of the methodology of neoclassical theory as a basic theory determining the content of contemporary economic theory course at Russian educational institutions namely unrealistic conditions of perfect competition, rationality of economic behavior of business entities, completeness and authenticity of…
47 CFR 27.1170 - Payment Issues.
Code of Federal Regulations, 2010 CFR
2010-10-01
... is required to file a notice containing site-specific data with the clearinghouse. The notice... name of the transmitting base station, the geographic coordinates corresponding to that base station... the site-data filing requirement by submitting a copy of their PCN to the clearinghouse. AWS entities...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-23
... data elements: Full Name; Alias(es); Gender; Date of Birth; Country of Birth; Country of Citizenship... locked drawer behind a locked door. The records may be stored on magnetic disc, tape, or digital media...
Hu, Jinming; Liu, Shiyong
2014-07-15
CONSPECTUS: All living organisms and soft matter are intrinsically responsive and adaptive to external stimuli. Inspired by this fact, tremendous effort aiming to emulate subtle responsive features exhibited by nature has spurred the invention of a diverse range of responsive polymeric materials. Conventional stimuli-responsive polymers are constructed via covalent bonds and can undergo reversible or irreversible changes in chemical structures, physicochemical properties, or both in response to a variety of external stimuli. They have been imparted with a variety of emerging applications including drug and gene delivery, optical sensing and imaging, diagnostics and therapies, smart coatings and textiles, and tissue engineering. On the other hand, in comparison with molecular chemistry held by covalent bonds, supramolecular chemistry built on weak and reversible noncovalent interactions has emerged as a powerful and versatile strategy for materials fabrication due to its facile accessibility, extraordinary reversibility and adaptivity, and potent applications in diverse fields. Typically involving more than one type of noncovalent interactions (e.g., hydrogen bonding, metal coordination, hydrophobic association, electrostatic interactions, van der Waals forces, and π-π stacking), host-guest recognition refers to the formation of supramolecular inclusion complexes between two or more entities connected together in a highly controlled and cooperative manner. The inherently reversible and adaptive nature of host-guest molecular recognition chemistry, stemming from multiple noncovalent interactions, has opened up a new platform to construct novel types of stimuli-responsive materials. The introduction of host-guest chemistry not only enriches the realm of responsive materials but also confers them with promising new applications. Most intriguingly, the integration of responsive polymer building blocks with host-guest recognition motifs will endow the former with further broadened responsiveness to external stimuli and accordingly more sophisticated functions. In this Account, we summarize recent progress in the field of responsive polymeric materials containing host-guest recognition motifs with selected examples and highlight their versatile functional applications, whereas small molecule-oriented host-guest supramolecular systems are excluded. We demonstrate how the introduction of host-guest chemistry into conventional polymer systems can modulate their responsive modes to external stimuli. Moreover, the responsive specificity and selectivity of polymeric systems can also be inherited from the host-guest recognition motifs, and these features provide extra advantages in terms of function integration. The following discussions are categorized in terms of design and functions, namely, host-guest chemistry toward the fabrication of responsive polymers and assemblies, optical sensing and imaging, drug and gene delivery, and self-healing materials. A concluding remark on future developments is also presented. We wish this prosperous field would incur more original and evolutionary ideas and benefit fundamental research and our daily life in a more convenient way.
Bonin, Patrick; Méot, Alain; Ferrand, Ludovic; Bugaïska, Aurélia
2015-09-01
We collected sensory experience ratings (SERs) for 1,659 French words in adults. Sensory experience for words is a recently introduced variable that corresponds to the degree to which words elicit sensory and perceptual experiences (Juhasz & Yap Behavior Research Methods, 45, 160-168, 2013; Juhasz, Yap, Dicke, Taylor, & Gullick Quarterly Journal of Experimental Psychology, 64, 1683-1691, 2011). The relationships of the sensory experience norms with other psycholinguistic variables (e.g., imageability and age of acquisition) were analyzed. We also investigated the degree to which SER predicted performance in visual word recognition tasks (lexical decision, word naming, and progressive demasking). The analyses indicated that SER reliably predicted response times in lexical decision, but not in word naming or progressive demasking. The findings are discussed in relation to the status of SER, the role of semantic code activation in visual word recognition, and the embodied view of cognition.
Neighborhood Frequency Effect in Chinese Word Recognition: Evidence from Naming and Lexical Decision
ERIC Educational Resources Information Center
Li, Meng-Feng; Gao, Xin-Yu; Chou, Tai-Li; Wu, Jei-Tun
2017-01-01
Neighborhood frequency is a crucial variable to know the nature of word recognition. Different from alphabetic scripts, neighborhood frequency in Chinese is usually confounded by component character frequency and neighborhood size. Three experiments were designed to explore the role of the neighborhood frequency effect in Chinese and the stimuli…
Prediction of Word Recognition in the First Half of Grade 1
ERIC Educational Resources Information Center
Snel, M. J.; Aarnoutse, C. A. J.; Terwel, J.; van Leeuwe, J. F. J.; van der Veld, W. M.
2016-01-01
Early detection of reading problems is important to prevent an enduring lag in reading skills. We studied the relationship between speed of word recognition (after six months of grade 1 education) and four kindergarten pre-literacy skills: letter knowledge, phonological awareness and naming speed for both digits and letters. Our sample consisted…
Morphological Structures in Visual Word Recognition: The Case of Arabic
ERIC Educational Resources Information Center
Abu-Rabia, Salim; Awwad, Jasmin (Shalhoub)
2004-01-01
This research examined the function within lexical access of the main morphemic units from which most Arabic words are assembled, namely roots and word patterns. The present study focused on the derivation of nouns, in particular, whether the lexical representation of Arabic words reflects their morphological structure and whether recognition of a…
Gasperini, Filippo; Brizzolara, Daniela; Cristofani, Paola; Casalini, Claudia; Chilosi, Anna Maria
2014-01-01
Children with Developmental Dyslexia (DD) are impaired in Rapid Automatized Naming (RAN) tasks, where subjects are asked to name arrays of high frequency items as quickly as possible. However the reasons why RAN speed discriminates DD from typical readers are not yet fully understood. Our study was aimed to identify some of the cognitive mechanisms underlying RAN-reading relationship by comparing one group of 32 children with DD with an age-matched control group of typical readers on a naming and a visual recognition task both using a discrete-trial methodology, in addition to a serial RAN task, all using the same stimuli (digits and colors). Results showed a significant slowness of DD children in both serial and discrete-trial naming (DN) tasks regardless of type of stimulus, but no difference between the two groups on the discrete-trial recognition task. Significant differences between DD and control participants in the RAN task disappeared when performance in the DN task was partialled out by covariance analysis for colors, but not for digits. The same pattern held in a subgroup of DD subjects with a history of early language delay (LD). By contrast, in a subsample of DD children without LD the RAN deficit was specific for digits and disappeared after slowness in DN was partialled out. Slowness in DN was more evident for LD than for noLD DD children. Overall, our results confirm previous evidence indicating a name-retrieval deficit as a cognitive impairment underlying RAN slowness in DD children. This deficit seems to be more marked in DD children with previous LD. Moreover, additional cognitive deficits specifically associated with serial RAN tasks have to be taken into account when explaining deficient RAN speed of these latter children. We suggest that partially different cognitive dysfunctions underpin superficially similar RAN impairments in different subgroups of DD subjects. PMID:25237301
2015-12-01
IPSFRP search request. The candidate list will contain the agency’s requested number (minimum of2) of candidates or a default number of 20 candidates if...INTERSTATE IDENTIFICATION SYSTEM FOR WANTED SUBJECTS 5. FUNDING NUMBERS 6. AUTHOR(S) Michael J. Thomas 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES...Naval Postgraduate School Monterey, CA 93943-5000 8. PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING /MONITORING AGENCY NAME(S) AND
5 CFR 581.203 - Information minimally required to accompany legal process.
Code of Federal Regulations, 2014 CFR
2014-01-01
... accompany legal process. 581.203 Section 581.203 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... Process § 581.203 Information minimally required to accompany legal process. (a) Sufficient identifying information must accompany the legal process in order to enable processing by the governmental entity named...
5 CFR 581.203 - Information minimally required to accompany legal process.
Code of Federal Regulations, 2011 CFR
2011-01-01
... accompany legal process. 581.203 Section 581.203 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... Process § 581.203 Information minimally required to accompany legal process. (a) Sufficient identifying information must accompany the legal process in order to enable processing by the governmental entity named...
5 CFR 581.203 - Information minimally required to accompany legal process.
Code of Federal Regulations, 2013 CFR
2013-01-01
... accompany legal process. 581.203 Section 581.203 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... Process § 581.203 Information minimally required to accompany legal process. (a) Sufficient identifying information must accompany the legal process in order to enable processing by the governmental entity named...
5 CFR 581.203 - Information minimally required to accompany legal process.
Code of Federal Regulations, 2012 CFR
2012-01-01
... accompany legal process. 581.203 Section 581.203 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... Process § 581.203 Information minimally required to accompany legal process. (a) Sufficient identifying information must accompany the legal process in order to enable processing by the governmental entity named...
5 CFR 581.203 - Information minimally required to accompany legal process.
Code of Federal Regulations, 2010 CFR
2010-01-01
... accompany legal process. 581.203 Section 581.203 Administrative Personnel OFFICE OF PERSONNEL MANAGEMENT... Process § 581.203 Information minimally required to accompany legal process. (a) Sufficient identifying information must accompany the legal process in order to enable processing by the governmental entity named...
Linked Data for Software Security Concepts and Vulnerability Descriptions
2013-07-01
named entity (NE) extractors such as DBpedia Spotlight, Alchemy API1, Extractiv2, OpenCalais3 and Zemanta were compared for their overall performance...presents substantial agreement for URI dis- ambiguation. Alchemy API, although preserving good performance in NE extraction and 1http://www.alchemyapi.com
25 CFR 141.23 - Posted statement of ownership.
Code of Federal Regulations, 2010 CFR
2010-04-01
... legible to customers stating the form of the business entity, the names and addresses of all other... Indians BUREAU OF INDIAN AFFAIRS, DEPARTMENT OF THE INTERIOR FINANCIAL ACTIVITIES BUSINESS PRACTICES ON THE NAVAJO, HOPI AND ZUNI RESERVATIONS General Business Practices § 141.23 Posted statement of...
Rapid Training of Information Extraction with Local and Global Data Views
2012-05-01
relation type extension system based on active learning a relation type extension system based on semi-supervised learning, and a crossdomain...bootstrapping system for domain adaptive named entity extraction. The active learning procedure adopts features extracted at the sentence level as the local
78 FR 50396 - Common Format for Federal Entity Transition Plans
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-19
..., Associate Administrator, Office of Spectrum Management. [FR Doc. 2013-20149 Filed 8-16-13; 8:45 am] BILLING..., Office of Spectrum Management. Each commenter should include the name of the person or organization... Spectrum Management, National Telecommunications and Information Administration, U.S. Department of...
BVDV: Detection, Risk Management and Control
USDA-ARS?s Scientific Manuscript database
The terms bovine viral diarrhea (BVD) and bovine viral diarrhea viruses (BVDV) are difficult to define in simple straightforward statements because both are umbrella terms covering a wide range of observations and entities. While diarrhea is in the name, BVD, it is used in reference to a number of ...
Albrecht, Markus
2007-12-01
This review gives an introduction into supramolecular chemistry describing in the first part general principles, focusing on terms like noncovalent interaction, molecular recognition, self-assembly, and supramolecular function. In the second part those will be illustrated by simple examples from our laboratories. Supramolecular chemistry is the science that bridges the gap between the world of molecules and nanotechnology. In supramolecular chemistry noncovalent interactions occur between molecular building blocks, which by molecular recognition and self-assembly form (functional) supramolecular entities. It is also termed the "chemistry of the noncovalent bond." Molecular recognition is based on geometrical complementarity based on the "key-and-lock" principle with nonshape-dependent effects, e.g., solvatization, being also highly influential. Self-assembly leads to the formation of well-defined aggregates. Hereby the overall structure of the target ensemble is controlled by the symmetry features of the certain building blocks. Finally, the aggregates can possess special properties or supramolecular functions, which are only found in the ensemble but not in the participating molecules. This review gives an introduction on supramolecular chemistry and illustrates the fundamental principles by recent examples from our group.
37 CFR 2.17 - Recognition for representation.
Code of Federal Regulations, 2014 CFR
2014-07-01
... attorney. A power of attorney must: (1) Designate by name at least one practitioner meeting the..., registrant, or party (e.g., a corporate officer or general partner of a partnership). In the case of joint... the identical owner name and attorney through TEAS. (2) The owner of an application or registration...
37 CFR 2.17 - Recognition for representation.
Code of Federal Regulations, 2012 CFR
2012-07-01
... attorney. A power of attorney must: (1) Designate by name at least one practitioner meeting the..., registrant, or party (e.g., a corporate officer or general partner of a partnership). In the case of joint... the identical owner name and attorney through TEAS. (2) The owner of an application or registration...
Polanco F, A; Acero P, A; Betancur-R, R
2016-08-01
Trachinocephalus, a formerly monotypic and nearly circumtropical genus of lizardfishes, is split into three valid species. Trachinocephalus gauguini n. sp. is described from the Marquesas Islands and is distinguished from the two other species in the genus by having a shorter snout, a narrower interorbital space, larger eye and modally fewer anal-fin and pectoral-fin rays. The distribution of Trachinocephalus myops (type species) is restricted to the Atlantic Ocean and the name Trachinocephalus trachinus is resurrected for populations from the Indo-West Pacific Ocean. Principal component analyses and bivariate plots based on the morphometric data differentiated T. gauguini from the other two species, but a substantial overlap between T. myops and T. trachinus exists. Phylogenetic evidence based on mtDNA COI sequences unambiguously supports the recognition of at least three species in Trachinocephalus, revealing deep divergences between the Atlantic Ocean, Indo-West Pacific Ocean and Marquesas entities. Additional analyses of species delimitations using the generalized mixed Yule coalescent model and the Poisson tree processes model provide a more liberal assessment of species in Trachinocephalus, indicating that many more cryptic species may exist. Finally, a taxonomic key to identify the three species recognized here is provided. © 2016 The Fisheries Society of the British Isles.
Madhumitha, Haridoss
2016-01-01
Globally, noncommunicable chronic diseases such as Type-2 Diabetes Mellitus (T2DM) and Coronary Artery Disease (CAD) are posing a major threat to the world. T2DM is known to potentiate CAD which had led to the coining of a new clinical entity named diabetic CAD (DM-CAD), leading to excessive morbidity and mortality. The synergistic interaction between these two comorbidities is through sterile inflammation which is now being addressed as metabolic inflammation or metainflammation, which plays a pivotal role during both early and late stages of T2DM and also serves as a link between T2DM and CAD. This review summarises the current concepts on the role played by both innate and adaptive immune responses in setting up metainflammation in DM-CAD. More specifically, the role played by innate pattern recognition receptors (PRRs) like Toll-like receptors (TLRs), NOD1-like receptors (NLRs), Rig-1-like receptors (RLRs), and C-type lectin like receptors (CLRs) and metabolic endotoxemia in fuelling metainflammation in DM-CAD would be discussed. Further, the role played by adaptive immune cells (Th1, Th2, Th17, and Th9 cells) in fuelling metainflammation in DM-CAD will also be discussed. PMID:27610390
Recognizing chemicals in patents: a comparative analysis.
Habibi, Maryam; Wiegandt, David Luis; Schmedding, Florian; Leser, Ulf
2016-01-01
Recently, methods for Chemical Named Entity Recognition (NER) have gained substantial interest, driven by the need for automatically analyzing todays ever growing collections of biomedical text. Chemical NER for patents is particularly essential due to the high economic importance of pharmaceutical findings. However, NER on patents has essentially been neglected by the research community for long, mostly because of the lack of enough annotated corpora. A recent international competition specifically targeted this task, but evaluated tools only on gold standard patent abstracts instead of full patents; furthermore, results from such competitions are often difficult to extrapolate to real-life settings due to the relatively high homogeneity of training and test data. Here, we evaluate the two state-of-the-art chemical NER tools, tmChem and ChemSpot, on four different annotated patent corpora, two of which consist of full texts. We study the overall performance of the tools, compare their results at the instance level, report on high-recall and high-precision ensembles, and perform cross-corpus and intra-corpus evaluations. Our findings indicate that full patents are considerably harder to analyze than patent abstracts and clearly confirm the common wisdom that using the same text genre (patent vs. scientific) and text type (abstract vs. full text) for training and testing is a pre-requisite for achieving high quality text mining results.
Face recognition using slow feature analysis and contourlet transform
NASA Astrophysics Data System (ADS)
Wang, Yuehao; Peng, Lingling; Zhe, Fuchuan
2018-04-01
In this paper we propose a novel face recognition approach based on slow feature analysis (SFA) in contourlet transform domain. This method firstly use contourlet transform to decompose the face image into low frequency and high frequency part, and then takes technological advantages of slow feature analysis for facial feature extraction. We named the new method combining the slow feature analysis and contourlet transform as CT-SFA. The experimental results on international standard face database demonstrate that the new face recognition method is effective and competitive.
Atypical Presentations of Methemoglobinemia from Benzocaine Spray
Suwantarat, Nuntra; Vierra, Joseph R; Evans, Samuel J
2011-01-01
Widely used for local anesthesia, especially prior to endoscopic procedures, benzocaine spray is one of the most common causes of iatrogenic methemoglobinemia. The authors report an atypical case of methemoglobinemia in a woman presenting with pale skin and severe hypoxemia, after a delayed repeat exposure to benzocaine spray. Early recognition and prompt management of methemoglobinemia is needed in order to lessen morbidity and mortality from this entity. PMID:22162610
Diagnosis of B-Cell Non-Hodgkin Lymphomas with Small-/Intermediate-Sized Cells in Cytopathology
Schwock, Joerg; Geddie, William R.
2012-01-01
Fine needle sampling is a fast, safe, and potentially cost-effective method of obtaining tissue for cytomorphologic assessment aimed at both initial triage and, in some cases, complete diagnosis of patients that present clinically with lymphadenopathy. The cytologic diagnosis of B-cell non-Hodgkin lymphomas composed of small-/intermediate-sized cells, however, has been seen as an area of great difficulty even for experienced observers due to the morphologic overlap between lymphoma and reactive lymphadenopathies as well as between the lymphoma entities themselves. Although ancillary testing has improved diagnostic accuracy, the results from these tests must be interpreted within the morphological and clinical context to avoid misinterpretation. Importantly, the recognition of specific cytologic features is crucial in guiding the appropriate selection of ancillary tests which will either confirm or refute a tentative diagnosis. For these reasons, we here review the cytologic characteristics particular to five common B-cell non-Hodgkin lymphomas which typically cause the most diagnostic confusion based on cytological assessment alone: marginal zone lymphoma, follicular lymphoma, mantle cell lymphoma, chronic lymphocytic leukemia/small lymphocytic lymphoma, and lymphoplasmacytic lymphoma. We summarize the most pertinent cytomorphologic features for each entity as well as for reactive lymphoid hyperplasia, contrast them with each other to facilitate their recognition, and highlight common diagnostic pitfalls. PMID:22693682
Scientific names of organisms: attribution, rights, and licensing
2014-01-01
Background As biological disciplines extend into the ‘big data’ world, they will need a names-based infrastructure to index and interconnect distributed data. The infrastructure must have access to all names of all organisms if it is to manage all information. Those who compile lists of species hold different views as to the intellectual property rights that apply to the lists. This creates uncertainty that impedes the development of a much-needed infrastructure for sharing biological data in the digital world. Findings The laws in the United States of America and European Union are consistent with the position that scientific names of organisms and their compilation in checklists, classifications or taxonomic revisions are not subject to copyright. Compilations of names, such as classifications or checklists, are not creative in the sense of copyright law. Many content providers desire credit for their efforts. Conclusions A ‘blue list’ identifies elements of checklists, classifications and monographs to which intellectual property rights do not apply. To promote sharing, authors of taxonomic content, compilers, intermediaries, and aggregators should receive citable recognition for their contributions, with the greatest recognition being given to the originating authors. Mechanisms for achieving this are discussed. PMID:24495358
Nielson, Kristy A.; Seidenberg, Michael; Woodard, John L.; Durgerian, Sally; Zhang, Qi; Gross, William L.; Gander, Amelia; Guidotti, Leslie M.; Antuono, Piero; Rao, Stephen M.
2010-01-01
Person recognition can be accomplished through several modalities (face, name, voice). Lesion, neurophysiology and neuroimaging studies have been conducted in an attempt to determine the similarities and differences in the neural networks associated with person identity via different modality inputs. The current study used event-related functional-MRI in 17 healthy participants to directly compare activation in response to randomly presented famous and non-famous names and faces (25 stimuli in each of the four categories). Findings indicated distinct areas of activation that differed for faces and names in regions typically associated with pre-semantic perceptual processes. In contrast, overlapping brain regions were activated in areas associated with the retrieval of biographical knowledge and associated social affective features. Specifically, activation for famous faces was primarily right lateralized and famous names were left lateralized. However, for both stimuli, similar areas of bilateral activity were observed in the early phases of perceptual processing. Activation for fame, irrespective of stimulus modality, activated an extensive left hemisphere network, with bilateral activity observed in the hippocampi, posterior cingulate, and middle temporal gyri. Findings are discussed within the framework of recent proposals concerning the neural network of person identification. PMID:20167415
Rapid induction of false memory for pictures.
Weinstein, Yana; Shanks, David R
2010-07-01
Recognition of pictures is typically extremely accurate, and it is thus unclear whether the reconstructive nature of memory can yield substantial false recognition of highly individuated stimuli. A procedure for the rapid induction of false memories for distinctive colour photographs is proposed. Participants studied a set of object pictures followed by a list of words naming those objects, but embedded in the list were names of unseen objects. When subsequently shown full colour pictures of these unseen objects, participants consistently claimed that they had seen them, while discriminating with high accuracy between studied pictures and new pictures whose names did not appear in the misleading word list. These false memories can be reported with high confidence as well as the feeling of recollection. This new procedure allows the investigation of factors that influence false memory reports with ecologically valid stimuli and of the similarities and differences between true and false memories.
Using Ontology Fingerprints to disambiguate gene name entities in the biomedical literature
Chen, Guocai; Zhao, Jieyi; Cohen, Trevor; Tao, Cui; Sun, Jingchun; Xu, Hua; Bernstam, Elmer V.; Lawson, Andrew; Zeng, Jia; Johnson, Amber M.; Holla, Vijaykumar; Bailey, Ann M.; Lara-Guerra, Humberto; Litzenburger, Beate; Meric-Bernstam, Funda; Jim Zheng, W.
2015-01-01
Ambiguous gene names in the biomedical literature are a barrier to accurate information extraction. To overcome this hurdle, we generated Ontology Fingerprints for selected genes that are relevant for personalized cancer therapy. These Ontology Fingerprints were used to evaluate the association between genes and biomedical literature to disambiguate gene names. We obtained 93.6% precision for the test gene set and 80.4% for the area under a receiver-operating characteristics curve for gene and article association. The core algorithm was implemented using a graphics processing unit-based MapReduce framework to handle big data and to improve performance. We conclude that Ontology Fingerprints can help disambiguate gene names mentioned in text and analyse the association between genes and articles. Database URL: http://www.ontologyfingerprint.org PMID:25858285
HWDA: A coherence recognition and resolution algorithm for hybrid web data aggregation
NASA Astrophysics Data System (ADS)
Guo, Shuhang; Wang, Jian; Wang, Tong
2017-09-01
Aiming at the object confliction recognition and resolution problem for hybrid distributed data stream aggregation, a distributed data stream object coherence solution technology is proposed. Firstly, the framework was defined for the object coherence conflict recognition and resolution, named HWDA. Secondly, an object coherence recognition technology was proposed based on formal language description logic and hierarchical dependency relationship between logic rules. Thirdly, a conflict traversal recognition algorithm was proposed based on the defined dependency graph. Next, the conflict resolution technology was prompted based on resolution pattern matching including the definition of the three types of conflict, conflict resolution matching pattern and arbitration resolution method. At last, the experiment use two kinds of web test data sets to validate the effect of application utilizing the conflict recognition and resolution technology of HWDA.
ERIC Educational Resources Information Center
Mayer, Andreas; Motsch, Hans-Joachim
2015-01-01
This study analysed the effects of a classroom intervention focusing on phonological awareness and/or automatized word recognition in children with a deficit in the domains of phonological awareness and rapid automatized naming ("double deficit"). According to the double-deficit hypothesis (Wolf & Bowers, 1999), these children belong…
Competition in prescription drug markets: the roles of trademarks, advertising, and generic names.
Feldman, Roger; Lobo, Félix
2013-08-01
We take on two subjects of controversy among economists-advertising and trademarks-in the context of the market for generic drugs. We outline a model in which trademarks for drug names reduce search costs but increase product differentiation. In this particular framework, trademarks may not benefit consumers. In contrast, the generic names of drugs or "International Nonproprietary Names" (INN) have unquestionable benefits in both economic theory and empirical studies. We offer a second model where advertising of a brand-name drug creates recognition for the generic name. The monopoly patent-holder advertises less than in the absence of a competitive spillover.
Printable, scannable biometric templates for secure documents and materials
NASA Astrophysics Data System (ADS)
Cambier, James L.; Musgrave, Clyde
2000-04-01
Biometric technology has been widely acknowledged as an effective means for enhancing private and public security through applications in physical access control, computer and computer network access control, medical records protection, banking security, public identification programs, and others. Nearly all of these applications involve use of a biometric token to control access to a physical entity or private information. There are also unique benefits to be derived from attaching a biometric template to a physical entity such as a document, package, laboratory sample, etc. Such an association allows fast, reliable, and highly accurate association of an individual person's identity to the physical entity, and can be used to enhance security, convenience, and privacy in many types of transactions. Examples include authentication of documents, tracking of laboratory samples in a testing environment, monitoring the movement of physical evidence within the criminal justice system, and authenticating the identity of both sending and receiving parties in shipment of high value parcels. A system is described which combines a biometric technology based on iris recognition with a printing and scanning technology for high-density bar codes.
12 CFR 612.2145 - Director reporting.
Code of Federal Regulations, 2010 CFR
2010-01-01
...) The name and the nature of the business of any entity in which the director has a material financial... activity that is required to be reported under this section or could constitute a conflict of interest... determination of whether the relationship, transaction, or activity is, in fact, a conflict of interest. (d...
Proposal to conserve Tamarix ramosissima against T. pentandra Tamaricaceae)
USDA-ARS?s Scientific Manuscript database
Ledebour described Tamarix ramosissima in 1829 from plants collected in Kazakhstan (Lake Noor Zaisan). In the protologue he overlooked T. pentandra Pall. (l.c.) and T. pallasii Desv. (l.c.), two earlier names which apply to the same biological entity, also widespread through Central and Western Asia...
78 FR 23194 - Federal Acquisition Regulation; Commercial and Government Entity Code
Federal Register 2010, 2011, 2012, 2013, 2014
2013-04-18
... Award Management Name Change, Phase 1 Implementation) which will make a global update to all of the... outside the United States; and Support supply chain traceability and integrity efforts. II. Discussion and.... For Contractors registered in the System for Award Management (SAM), the DLA Logistics Information...
ALDOL REACTION VIA IN SITU OLEFIN MIGRATION IN WATER. (R828129)
Department of Chemistry, Tulane University, Ne...
The Role of Instruments in Three Chemical Revolutions
ERIC Educational Resources Information Center
Chamizo, José Antonio
2014-01-01
This paper attempts to show one of the ways history of chemistry can be teachable for chemistry teachers, it means something more than an undifferentiated mass of names and dates, establishing a temporal framework based on chemical entities that all students use. Represents a difficult equilibrium between over-simplification versus…
47 CFR 52.15 - Central office code administration.
Code of Federal Regulations, 2010 CFR
2010-10-01
... forecast data to the NANPA. (ii) Reporting shall be by separate legal entity and must include company name, company headquarters address, Operating Company Number (OCN), parent company OCN, and the primary type of... headquarters address, OCN, parent company's OCN(s), and the primary type of business in which the numbering...
40 CFR 59.501 - Am I subject to this subpart?
Code of Federal Regulations, 2010 CFR
2010-07-01
... (CONTINUED) NATIONAL VOLATILE ORGANIC COMPOUND EMISSION STANDARDS FOR CONSUMER AND COMMERCIAL PRODUCTS... subpart? (a) The regulated entities for an aerosol coating product are the manufacturer or importer of an aerosol coating product and a distributor of an aerosol coating product if it is named on the label or if...
2016-12-14
The Architectural and Transportation Barriers Compliance Board (Access Board or Board) is issuing a final rule that revises its existing accessibility guidelines for non-rail vehicles--namely, buses, over-the-road buses, and vans--acquired or remanufactured by entities covered by the Americans with Disabilities Act. The revised guidelines ensure that such vehicles are readily accessible to, and usable by, individuals with disabilities. The U.S. Department of Transportation (DOT) is required to revise its accessibility standards for transportation vehicles acquired or remanufactured by entities covered by the Americans with Disabilities Act (ADA) to be consistent with the final rule.
Deleger, Louise; Li, Qi; Kaiser, Megan; Stoutenborough, Laura
2013-01-01
Background A high-quality gold standard is vital for supervised, machine learning-based, clinical natural language processing (NLP) systems. In clinical NLP projects, expert annotators traditionally create the gold standard. However, traditional annotation is expensive and time-consuming. To reduce the cost of annotation, general NLP projects have turned to crowdsourcing based on Web 2.0 technology, which involves submitting smaller subtasks to a coordinated marketplace of workers on the Internet. Many studies have been conducted in the area of crowdsourcing, but only a few have focused on tasks in the general NLP field and only a handful in the biomedical domain, usually based upon very small pilot sample sizes. In addition, the quality of the crowdsourced biomedical NLP corpora were never exceptional when compared to traditionally-developed gold standards. The previously reported results on medical named entity annotation task showed a 0.68 F-measure based agreement between crowdsourced and traditionally-developed corpora. Objective Building upon previous work from the general crowdsourcing research, this study investigated the usability of crowdsourcing in the clinical NLP domain with special emphasis on achieving high agreement between crowdsourced and traditionally-developed corpora. Methods To build the gold standard for evaluating the crowdsourcing workers’ performance, 1042 clinical trial announcements (CTAs) from the ClinicalTrials.gov website were randomly selected and double annotated for medication names, medication types, and linked attributes. For the experiments, we used CrowdFlower, an Amazon Mechanical Turk-based crowdsourcing platform. We calculated sensitivity, precision, and F-measure to evaluate the quality of the crowd’s work and tested the statistical significance (P<.001, chi-square test) to detect differences between the crowdsourced and traditionally-developed annotations. Results The agreement between the crowd’s annotations and the traditionally-generated corpora was high for: (1) annotations (0.87, F-measure for medication names; 0.73, medication types), (2) correction of previous annotations (0.90, medication names; 0.76, medication types), and excellent for (3) linking medications with their attributes (0.96). Simple voting provided the best judgment aggregation approach. There was no statistically significant difference between the crowd and traditionally-generated corpora. Our results showed a 27.9% improvement over previously reported results on medication named entity annotation task. Conclusions This study offers three contributions. First, we proved that crowdsourcing is a feasible, inexpensive, fast, and practical approach to collect high-quality annotations for clinical text (when protected health information was excluded). We believe that well-designed user interfaces and rigorous quality control strategy for entity annotation and linking were critical to the success of this work. Second, as a further contribution to the Internet-based crowdsourcing field, we will publicly release the JavaScript and CrowdFlower Markup Language infrastructure code that is necessary to utilize CrowdFlower’s quality control and crowdsourcing interfaces for named entity annotations. Finally, to spur future research, we will release the CTA annotations that were generated by traditional and crowdsourced approaches. PMID:23548263
Integrated Bio-Entity Network: A System for Biological Knowledge Discovery
Bell, Lindsey; Chowdhary, Rajesh; Liu, Jun S.; Niu, Xufeng; Zhang, Jinfeng
2011-01-01
A significant part of our biological knowledge is centered on relationships between biological entities (bio-entities) such as proteins, genes, small molecules, pathways, gene ontology (GO) terms and diseases. Accumulated at an increasing speed, the information on bio-entity relationships is archived in different forms at scattered places. Most of such information is buried in scientific literature as unstructured text. Organizing heterogeneous information in a structured form not only facilitates study of biological systems using integrative approaches, but also allows discovery of new knowledge in an automatic and systematic way. In this study, we performed a large scale integration of bio-entity relationship information from both databases containing manually annotated, structured information and automatic information extraction of unstructured text in scientific literature. The relationship information we integrated in this study includes protein–protein interactions, protein/gene regulations, protein–small molecule interactions, protein–GO relationships, protein–pathway relationships, and pathway–disease relationships. The relationship information is organized in a graph data structure, named integrated bio-entity network (IBN), where the vertices are the bio-entities and edges represent their relationships. Under this framework, graph theoretic algorithms can be designed to perform various knowledge discovery tasks. We designed breadth-first search with pruning (BFSP) and most probable path (MPP) algorithms to automatically generate hypotheses—the indirect relationships with high probabilities in the network. We show that IBN can be used to generate plausible hypotheses, which not only help to better understand the complex interactions in biological systems, but also provide guidance for experimental designs. PMID:21738677