Knowledge-Driven Event Extraction in Russian: Corpus-Based Linguistic Resources
Solovyev, Valery; Ivanov, Vladimir
2016-01-01
Automatic event extraction from text is an important step in knowledge acquisition and knowledge base population. Manual work in the development of an extraction system is indispensable, either in corpus annotation or in the creation of vocabularies and patterns for a knowledge-based system. Recent works have focused on adapting existing systems (for extraction from English texts) to new domains. Event extraction in other languages has not been studied, owing to the lack of resources and algorithms necessary for natural language processing. In this paper we define a set of linguistic resources that are necessary in the development of a knowledge-based event extraction system in Russian: a vocabulary of subordination models, a vocabulary of event triggers, and a vocabulary of Frame Elements that are basic building blocks for semantic patterns. We propose a set of methods for creating such vocabularies in Russian and other languages using the Google Books NGram Corpus. The methods are evaluated in the development of an event extraction system for Russian. PMID:26955386
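To make the role of a trigger vocabulary concrete, the following is a minimal sketch, not the authors' system. It assumes the text has already been lemmatized, and the `TRIGGERS` vocabulary, the event type names, and the `find_events` helper are all hypothetical illustrations; the abstract does not specify the vocabulary format or the matching procedure.

```python
from typing import List, NamedTuple


class EventMention(NamedTuple):
    event_type: str   # hypothetical event type label
    trigger: str      # lemma that signalled the event
    position: int     # index of the trigger in the lemma sequence


# Hypothetical trigger vocabulary (lemma -> event type); illustrative only.
TRIGGERS = {
    "купить": "Acquisition",      # "to buy"
    "приобрести": "Acquisition",  # "to acquire"
    "назначить": "Appointment",   # "to appoint"
}


def find_events(lemmas: List[str]) -> List[EventMention]:
    """Return one EventMention for every lemma found in the trigger vocabulary."""
    return [
        EventMention(TRIGGERS[lemma], lemma, index)
        for index, lemma in enumerate(lemmas)
        if lemma in TRIGGERS
    ]


# Pre-lemmatized toy sentence: "company acquire factory".
lemmas = ["компания", "приобрести", "завод"]
events = find_events(lemmas)
print(events)
```

A full knowledge-based system would go on to fill Frame Elements (for example, buyer and purchased object) using subordination models; this sketch covers only the trigger lookup step.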
Knowledge Based Text Generation
1989-08-01
Number 4, October-December, 1985, pp. 219-242. de Joia, A. and Stenton, A., Terms in Linguistics: A Guide to Halliday, London: Batsford Academic and...extraction of text schemata and their corresponding rhetorical predicates; design of a system motivated by the desire for domain and language independence...semantics and semantics effects syntax. Functional Linguistic Framework Page 19 The design of GENNY was guided by the functional paradigm. Provided a
A knowledge-base generating hierarchical fuzzy-neural controller.
Kandadai, R M; Tien, J M
1997-01-01
We present an innovative fuzzy-neural architecture that is able to automatically generate a knowledge base, in an extractable form, for use in hierarchical knowledge-based controllers. The knowledge base is in the form of a linguistic rule base appropriate for a fuzzy inference system. First, we modify Berenji and Khedkar's (1992) GARIC architecture to enable it to automatically generate a knowledge base; a pseudosupervised learning scheme using reinforcement learning and error backpropagation is employed. Next, we further extend this architecture to a hierarchical controller that is able to generate its own knowledge base. Example applications are provided to underscore its viability.
Automated Extraction of Substance Use Information from Clinical Texts.
Wang, Yan; Chen, Elizabeth S; Pakhomov, Serguei; Arsoniadis, Elliot; Carter, Elizabeth W; Lindemann, Elizabeth; Sarkar, Indra Neil; Melton, Genevieve B
2015-01-01
Within clinical discourse, social history (SH) includes important information about substance use (alcohol, drug, and nicotine use) as key risk factors for disease, disability, and mortality. In this study, we developed and evaluated a natural language processing (NLP) system for automated detection of substance use statements and extraction of substance use attributes (e.g., temporal and status) based on Stanford Typed Dependencies. The developed NLP system leveraged linguistic resources and domain knowledge from a multi-site social history study, Propbank and the MiPACQ corpus. The system attained F-scores of 89.8, 84.6 and 89.4 respectively for alcohol, drug, and nicotine use statement detection, as well as average F-scores of 82.1, 90.3, 80.8, 88.7, 96.6, and 74.5 respectively for extraction of attributes. Our results suggest that NLP systems can achieve good performance when augmented with linguistic resources and domain knowledge when applied to a wide breadth of substance use free text clinical notes.
Incorporating linguistic knowledge for learning distributed word representations.
Wang, Yan; Liu, Zhiyuan; Sun, Maosong
2015-01-01
Combined with neural language models, distributed word representations achieve significant advantages in computational linguistics and text mining. Most existing models estimate distributed word vectors from large-scale data in an unsupervised fashion and, as a result, do not take rich linguistic knowledge into consideration. Linguistic knowledge can be represented as either link-based knowledge or preference-based knowledge, and we propose knowledge regularized word representation models (KRWR) to incorporate this prior knowledge for learning distributed word representations. Experimental results demonstrate that our estimated word representation achieves better performance in the task of semantic relatedness ranking. This indicates that our methods can efficiently encode both prior knowledge from knowledge bases and statistical knowledge from large-scale text corpora into a unified word representation model, which will benefit many tasks in text mining.
Incorporating Linguistic Knowledge for Learning Distributed Word Representations
Wang, Yan; Liu, Zhiyuan; Sun, Maosong
2015-01-01
Combined with neural language models, distributed word representations achieve significant advantages in computational linguistics and text mining. Most existing models estimate distributed word vectors from large-scale data in an unsupervised fashion and, as a result, do not take rich linguistic knowledge into consideration. Linguistic knowledge can be represented as either link-based knowledge or preference-based knowledge, and we propose knowledge regularized word representation models (KRWR) to incorporate this prior knowledge for learning distributed word representations. Experimental results demonstrate that our estimated word representation achieves better performance in the task of semantic relatedness ranking. This indicates that our methods can efficiently encode both prior knowledge from knowledge bases and statistical knowledge from large-scale text corpora into a unified word representation model, which will benefit many tasks in text mining. PMID:25874581
A novel probabilistic framework for event-based speech recognition
NASA Astrophysics Data System (ADS)
Juneja, Amit; Espy-Wilson, Carol
2003-10-01
One of the reasons for unsatisfactory performance of the state-of-the-art automatic speech recognition (ASR) systems is the inferior acoustic modeling of low-level acoustic-phonetic information in the speech signal. An acoustic-phonetic approach to ASR, on the other hand, explicitly targets linguistic information in the speech signal, but such a system for continuous speech recognition (CSR) is not known to exist. A probabilistic and statistical framework for CSR based on the idea of the representation of speech sounds by bundles of binary valued articulatory phonetic features is proposed. Multiple probabilistic sequences of linguistically motivated landmarks are obtained using binary classifiers of manner phonetic features-syllabic, sonorant and continuant-and the knowledge-based acoustic parameters (APs) that are acoustic correlates of those features. The landmarks are then used for the extraction of knowledge-based APs for source and place phonetic features and their binary classification. Probabilistic landmark sequences are constrained using manner class language models for isolated or connected word recognition. The proposed method could overcome the disadvantages encountered by the early acoustic-phonetic knowledge-based systems that led the ASR community to switch to systems highly dependent on statistical pattern analysis methods and probabilistic language or grammar models.
Towards an Obesity-Cancer Knowledge Base: Biomedical Entity Identification and Relation Detection
Lossio-Ventura, Juan Antonio; Hogan, William; Modave, François; Hicks, Amanda; Hanna, Josh; Guo, Yi; He, Zhe; Bian, Jiang
2017-01-01
Obesity is associated with increased risks of various types of cancer, as well as a wide range of other chronic diseases. On the other hand, access to health information activates patient participation and improves health outcomes. However, existing online information on obesity and its relationship to cancer is heterogeneous, ranging from pre-clinical models and case studies to mere hypothesis-based scientific arguments. A formal knowledge representation (i.e., a semantic knowledge base) would help to better organize and deliver the quality health information related to obesity and cancer that consumers need. Nevertheless, current ontologies describing obesity, cancer and related entities are not designed to guide automatic knowledge base construction from heterogeneous information sources. Thus, in this paper, we present methods for named-entity recognition (NER) to extract biomedical entities from scholarly articles and for detecting whether two biomedical entities are related, with the long-term goal of building an obesity-cancer knowledge base. We leverage both linguistic and statistical approaches in the NER task, which supersedes the state-of-the-art results. Further, based on statistical features extracted from the sentences, our method for relation detection obtains an accuracy of 99.3% and an F-measure of 0.993. PMID:28503356
ERIC Educational Resources Information Center
Yasuda, Sachiko
2011-01-01
This study examines how novice foreign language (FL) writers develop their genre awareness, linguistic knowledge, and writing competence in a genre-based writing course that incorporates email-writing tasks. To define genre, the study draws on systemic functional linguistics (SFL) that sees language as a resource for making meaning in a particular…
A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents.
Segura-Bedmar, Isabel; Martínez, Paloma; de Pablo-Sánchez, César
2011-03-29
A drug-drug interaction (DDI) occurs when one drug influences the level or activity of another drug. The increasing volume of the scientific literature overwhelms health care professionals trying to be kept up-to-date with all published studies on DDI. This paper describes a hybrid linguistic approach to DDI extraction that combines shallow parsing and syntactic simplification with pattern matching. Appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. A pharmacist defined a set of domain-specific lexical patterns to capture the most common expressions of DDI in texts. These lexical patterns are matched with the generated sentences in order to extract DDIs. We have performed different experiments to analyze the performance of the different processes. The lexical patterns achieve a reasonable precision (67.30%), but very low recall (14.07%). The inclusion of appositions and coordinate structures helps to improve the recall (25.70%), however, precision is lower (48.69%). The detection of clauses does not improve the performance. Information Extraction (IE) techniques can provide an interesting way of reducing the time spent by health care professionals on reviewing the literature. Nevertheless, no approach has been carried out to extract DDI from texts. To the best of our knowledge, this work proposes the first integral solution for the automatic extraction of DDI from biomedical texts.
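The precision/recall trade-off reported above can be summarized with the F1 score, the harmonic mean of precision and recall. The abstract does not report F1 itself, so the values computed below are derived here from its precision and recall figures, not taken from the paper; the function and variable names are illustrative.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision/recall pairs reported in the abstract, converted from percentages.
reported = {
    "lexical patterns only": (0.6730, 0.1407),
    "with appositions and coordinate structures": (0.4869, 0.2570),
}

f1_by_setting = {name: f1_score(p, r) for name, (p, r) in reported.items()}

for name, f1 in f1_by_setting.items():
    print(f"{name}: F1 = {f1:.4f}")
```

Under this measure the second configuration scores higher (about 0.34 versus about 0.23), because the gain in recall outweighs the loss in precision.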
ERIC Educational Resources Information Center
Liu, Yongcan; Fisher, Linda; Forbes, Karen; Evans, Michael
2017-01-01
This paper aims to define the knowledge base of teaching in linguistically diverse secondary schools in England. Based on extensive interviews with the teachers across two schools, the paper identifies a range of good practices centred on flexibility and differentiation. These include diversifying teaching resources by using bilingual materials…
Phonetics Information Base and Lexicon
ERIC Educational Resources Information Center
Moran, Steven Paul
2012-01-01
In this dissertation, I investigate the linguistic and technological challenges involved in creating a cross-linguistic data set to undertake phonological typology. I then address the question of whether more sophisticated, knowledge-based approaches to data modeling, coupled with a broad cross-linguistic data set, can extend previous typological…
A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents
2011-01-01
Background A drug-drug interaction (DDI) occurs when one drug influences the level or activity of another drug. The increasing volume of the scientific literature overwhelms health care professionals trying to be kept up-to-date with all published studies on DDI. Methods This paper describes a hybrid linguistic approach to DDI extraction that combines shallow parsing and syntactic simplification with pattern matching. Appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. A pharmacist defined a set of domain-specific lexical patterns to capture the most common expressions of DDI in texts. These lexical patterns are matched with the generated sentences in order to extract DDIs. Results We have performed different experiments to analyze the performance of the different processes. The lexical patterns achieve a reasonable precision (67.30%), but very low recall (14.07%). The inclusion of appositions and coordinate structures helps to improve the recall (25.70%), however, precision is lower (48.69%). The detection of clauses does not improve the performance. Conclusions Information Extraction (IE) techniques can provide an interesting way of reducing the time spent by health care professionals on reviewing the literature. Nevertheless, no approach has been carried out to extract DDI from texts. To the best of our knowledge, this work proposes the first integral solution for the automatic extraction of DDI from biomedical texts. PMID:21489220
Knowledge representation and management: transforming textual information into useful knowledge.
Rassinoux, A-M
2010-01-01
To summarize current outstanding research in the field of knowledge representation and management. Synopsis of the articles selected for the IMIA Yearbook 2010. Four interesting papers, dealing with structured knowledge, have been selected for the section knowledge representation and management. Combining the newest techniques in computational linguistics and natural language processing with the latest methods in statistical data analysis, machine learning and text mining has proved to be efficient for turning unstructured textual information into meaningful knowledge. Three of the four selected papers for the section knowledge representation and management corroborate this approach and depict various experiments conducted to extract meaningful knowledge from unstructured free texts, such as extracting cancer disease characteristics from pathology reports, extracting protein-protein interactions from biomedical papers, and extracting knowledge for the support of hypothesis generation in molecular biology from the Medline literature. Finally, the last paper addresses the level of formally representing and structuring information within clinical terminologies in order to render such information easily available and shareable among the health informatics community. Delivering common powerful tools able to automatically extract meaningful information from the huge amount of electronically unstructured free texts is an essential step towards promoting sharing and reusability across applications, domains, and institutions, thus contributing to building capacities worldwide.
ERIC Educational Resources Information Center
Schmid, Hans-Jorg, Ed.
2017-01-01
In recent years, linguists have increasingly turned to the cognitive sciences to broaden their investigation into the roots and development of language. With the advent of cognitive-linguistic, usage-based and complex-adaptive models of language, linguists today are utilizing approaches and insights from cognitive psychology, neuropsychology,…
R, Elakkiya; K, Selvamani
2017-09-22
Subunit segmenting and modelling in medical sign language is one of the important studies in linguistic-oriented and vision-based Sign Language Recognition (SLR). Many efforts were made in the precedent to focus the functional subunits from the view of linguistic syllables but the problem is implementing such subunit extraction using syllables is not feasible in real-world computer vision techniques. And also, the present recognition systems are designed in such a way that it can detect the signer dependent actions under restricted and laboratory conditions. This research paper aims at solving these two important issues (1) Subunit extraction and (2) Signer independent action on visual sign language recognition. Subunit extraction involved in the sequential and parallel breakdown of sign gestures without any prior knowledge on syllables and number of subunits. A novel Bayesian Parallel Hidden Markov Model (BPaHMM) is introduced for subunit extraction to combine the features of manual and non-manual parameters to yield better results in classification and recognition of signs. Signer independent action aims in using a single web camera for different signer behaviour patterns and for cross-signer validation. Experimental results have proved that the proposed signer independent subunit level modelling for sign language classification and recognition has shown improvement and variations when compared with other existing works.
ERIC Educational Resources Information Center
CHAO, YUEN REN
THE AUTHOR OF THIS GRAMMAR STATES THAT THIS IS A "DISCUSSION BOOK" AND NOT AN INSTRUCTION BOOK FOR LEARNING CHINESE. HIS ANALYSIS OF CHINESE GRAMMAR IS BASED ON CURRENT LINGUISTIC METHODS AND ASSUMES THE READER HAS SOME KNOWLEDGE OF LINGUISTICS. THIS BOOK CONSTITUTES A REFERENCE WORK FOR LINGUISTS AND STUDENTS OF THE CHINESE LANGUAGE. MAJOR…
On Empirical Evidence for the Existence of Rules Governing Speech-Using Behavior.
ERIC Educational Resources Information Center
Sanders, Robert E.; Schneider, Michael
Departing from Baconian science which focuses on explanation of the occurrence of events, Chomsky's linguistics involves a different orientation--namely the explanation of form to account for linguistic behavior. The "knowledge" upon which linguistic judgements are based involves the premise of innate mechanisms. The assumption that speakers and…
Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks.
Wang, Chenguang; Song, Yangqiu; El-Kishky, Ahmed; Roth, Dan; Zhang, Ming; Han, Jiawei
2015-08-01
One of the key obstacles in making learning protocols realistic in applications is the need to supervise them, a costly process that often requires hiring domain experts. We consider the framework to use the world knowledge as indirect supervision. World knowledge is general-purpose knowledge, which is not designed for any specific domain. Then the key challenges are how to adapt the world knowledge to domains and how to represent it for learning. In this paper, we provide an example of using world knowledge for domain dependent document clustering. We provide three ways to specify the world knowledge to domains by resolving the ambiguity of the entities and their types, and represent the data with world knowledge as a heterogeneous information network. Then we propose a clustering algorithm that can cluster multiple types and incorporate the sub-type information as constraints. In the experiments, we use two existing knowledge bases as our sources of world knowledge. One is Freebase, which is collaboratively collected knowledge about entities and their organizations. The other is YAGO2, a knowledge base automatically extracted from Wikipedia and maps knowledge to the linguistic knowledge base, Word-Net. Experimental results on two text benchmark datasets (20newsgroups and RCV1) show that incorporating world knowledge as indirect supervision can significantly outperform the state-of-the-art clustering algorithms as well as clustering algorithms enhanced with world knowledge features.
Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks
Wang, Chenguang; Song, Yangqiu; El-Kishky, Ahmed; Roth, Dan; Zhang, Ming; Han, Jiawei
2015-01-01
One of the key obstacles in making learning protocols realistic in applications is the need to supervise them, a costly process that often requires hiring domain experts. We consider the framework to use the world knowledge as indirect supervision. World knowledge is general-purpose knowledge, which is not designed for any specific domain. Then the key challenges are how to adapt the world knowledge to domains and how to represent it for learning. In this paper, we provide an example of using world knowledge for domain dependent document clustering. We provide three ways to specify the world knowledge to domains by resolving the ambiguity of the entities and their types, and represent the data with world knowledge as a heterogeneous information network. Then we propose a clustering algorithm that can cluster multiple types and incorporate the sub-type information as constraints. In the experiments, we use two existing knowledge bases as our sources of world knowledge. One is Freebase, which is collaboratively collected knowledge about entities and their organizations. The other is YAGO2, a knowledge base automatically extracted from Wikipedia and maps knowledge to the linguistic knowledge base, Word-Net. Experimental results on two text benchmark datasets (20newsgroups and RCV1) show that incorporating world knowledge as indirect supervision can significantly outperform the state-of-the-art clustering algorithms as well as clustering algorithms enhanced with world knowledge features. PMID:26705504
Knowledge Acquisition Using Linguistic-Based Knowledge Analysis
Daniel L. Schmoldt
1998-01-01
Most knowledge-based system development efforts include acquiring knowledge from one or more sources. Difficulties associated with this knowledge acquisition task are readily acknowledged by most researchers. While a variety of knowledge acquisition methods have been reported, little has been done to organize those different methods and to suggest how to apply them...
Identification of threats using linguistics-based knowledge extraction.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chew, Peter A.
One of the challenges increasingly facing intelligence analysts, along with professionals in many other fields, is the vast amount of data which needs to be reviewed and converted into meaningful information, and ultimately into rational, wise decisions by policy makers. The advent of the world wide web (WWW) has magnified this challenge. A key hypothesis which has guided us is that threats come from ideas (or ideology), and ideas are almost always put into writing before the threats materialize. While in the past the 'writing' might have taken the form of pamphlets or books, today's medium of choice is the WWW, precisely because it is a decentralized, flexible, and low-cost method of reaching a wide audience. However, a factor which complicates matters for the analyst is that material published on the WWW may be in any of a large number of languages. In 'Identification of Threats Using Linguistics-Based Knowledge Extraction', we have sought to use Latent Semantic Analysis (LSA) and other similar text analysis techniques to map documents from the WWW, in whatever language they were originally written, to a common language-independent vector-based representation. This then opens up a number of possibilities. First, similar documents can be found across language boundaries. Secondly, a set of documents in multiple languages can be visualized in a graphical representation. These alone offer potentially useful tools and capabilities to the intelligence analyst whose knowledge of foreign languages may be limited. Finally, we can test the over-arching hypothesis--that ideology, and more specifically ideology which represents a threat, can be detected solely from the words which express the ideology--by using the vector-based representation of documents to predict additional features (such as the ideology) within a framework based on supervised learning. In this report, we present the results of a three-year project of the same name. We believe these results clearly demonstrate the general feasibility of an approach such as that outlined above. Nevertheless, there are obstacles which must still be overcome, relating primarily to how 'ideology' should be defined. We discuss these and point to possible solutions.
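Once documents are mapped into a shared, language-independent vector space, finding similar documents across language boundaries reduces to a nearest-neighbour search. The sketch below illustrates only that retrieval step with cosine similarity. The three-dimensional vectors and document names are invented for illustration, and the LSA projection that would produce such vectors in the report's approach is assumed to have been applied already.

```python
import math
from typing import Dict, List


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(query: str, vectors: Dict[str, List[float]]) -> str:
    """Name of the document (other than the query) closest to the query vector."""
    others = (name for name in vectors if name != query)
    return max(others, key=lambda name: cosine(vectors[query], vectors[name]))


# Hypothetical document vectors in a shared space; the values are made up.
doc_vectors = {
    "english_doc": [0.9, 0.1, 0.0],
    "russian_doc": [0.8, 0.2, 0.1],
    "arabic_doc": [0.0, 0.1, 0.9],
}

nearest = most_similar("english_doc", doc_vectors)
print(nearest)
```

The same vectors could also serve as input features for the supervised prediction step the abstract mentions, although that step is not shown here.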
The linguistically aware teacher and the teacher-aware linguist.
McCartney, Elspeth; Ellis, Sue
2013-07-01
This review evaluates issues of teacher linguistic knowledge relating to their work with children with speech, language and communication difficulties (SLCD). Information is from Ellis and McCartney [(2011a). Applied linguistics and primary school teaching. Cambridge: Cambridge University Press], a state-of-the-art text deriving from a British Association of Applied Linguistics/Cambridge University Press expert seminar series that details: linguistic research underpinning primary school curricula and pedagogy; the form of linguistic knowledge useful for teachers supporting children with SLCD in partnership with speech and language therapists; and how and when teachers acquire and learn to apply such knowledge. Critical analysis of the options presented for teacher learning indicate that policy enjoinders now include linguistic application as an expected part of teachers' professional knowledge, for all children including those with SLCD, but there is a large unmet learning need. It is concluded that there is a role for clinical linguists to disseminate useable knowledge to teachers in an accessible format. Ways of achieving this are considered.
ERIC Educational Resources Information Center
Turkan, Sultan; De Oliveira, Luciana C.; Lee, Okhee; Phelps, Geoffrey
2014-01-01
Background/Context: The current research on teacher knowledge and teacher accountability falls short on information about what teacher knowledge base could guide preparation and accountability of the mainstream teachers for meeting the academic needs of ELLs. Most recently, research on specialized knowledge for teaching has offered ways to…
Crain, Stephen; Thornton, Rosalind
2012-03-01
Every normal child acquires a language in just a few years. By 3 or 4 years old, children have effectively become adults in their abilities to produce and understand endlessly many sentences in a variety of conversational contexts. There are two alternative accounts of the course of children's language development. These different perspectives can be traced back to the nature versus nurture debate about how knowledge is acquired in any cognitive domain. One perspective dates back to Plato's dialog 'The Meno'. In this dialog, the protagonist, Socrates, demonstrates to Meno, an aristocrat in Ancient Greece, that a young slave knows more about geometry than he could have learned from experience. By extension, Plato's Problem refers to any gap between experience and knowledge. How children fill in the gap in the case of language continues to be the subject of much controversy in cognitive science. Any model of language acquisition must address three factors, inter alia: 1. The knowledge children accrue; 2. The input children receive (often called the primary linguistic data); 3. The nonlinguistic capacities of children to form and test generalizations based on the input. According to the famous linguist Noam Chomsky, the main task of linguistics is to explain how children bridge the gap (Chomsky calls it a 'chasm') between what they come to know about language, and what they could have learned from experience, even given optimistic assumptions about their cognitive abilities. Proponents of the alternative 'nurture' approach accuse nativists like Chomsky of overestimating the complexity of what children learn, underestimating the data children have to work with, and manifesting undue pessimism about children's abilities to extract information based on the input. The modern 'nurture' approach is often referred to as the usage-based account. We discuss the usage-based account first, and then the nativist account. 
After that, we report and discuss the findings of several studies of child language that have been conducted with the goal of helping to adjudicate between the alternative approaches to language development. WIREs Cogn Sci 2012, 3:185-203. doi: 10.1002/wcs.1158 For further resources related to this article, please visit the WIREs website. Copyright © 2012 John Wiley & Sons, Ltd.
Linguistic Knowledge and Reasoning for Error Diagnosis and Feedback Generation.
ERIC Educational Resources Information Center
Delmonte, Rodolfo
2003-01-01
Presents four sets of natural language processing-based exercises for which error correction and feedback are produced by means of a rich database in which linguistic information is encoded either at the lexical or the grammatical level. (Author/VWL)
Integration of language and sensor information
NASA Astrophysics Data System (ADS)
Perlovsky, Leonid I.; Weijers, Bertus
2003-04-01
The talk describes the development of basic technologies of intelligent systems fusing data from multiple domains and leading to automated computational techniques for understanding data contents. Understanding involves inferring appropriate decisions and recommending proper actions, which in turn requires fusion of data and knowledge about objects, situations, and actions. Data might include sensory data, verbal reports, intelligence intercepts, or public records, whereas knowledge ought to encompass the whole range of objects, situations, people and their behavior, and knowledge of languages. In the past, a fundamental difficulty in combining knowledge with data was the combinatorial complexity of computations, too many combinations of data and knowledge pieces had to be evaluated. Recent progress in understanding of natural intelligent systems, including the human mind, leads to the development of neurophysiologically motivated architectures for solving these challenging problems, in particular the role of emotional neural signals in overcoming combinatorial complexity of old logic-based approaches. Whereas past approaches based on logic tended to identify logic with language and thinking, recent studies in cognitive linguistics have led to appreciation of more complicated nature of linguistic models. Little is known about the details of the brain mechanisms integrating language and thinking. Understanding and fusion of linguistic information with sensory data represent a novel challenging aspect of the development of integrated fusion systems. The presentation will describe a non-combinatorial approach to this problem and outline techniques that can be used for fusing diverse and uncertain knowledge with sensory and linguistic data.
ERIC Educational Resources Information Center
Yasuda, Sachiko
2017-01-01
This article attempts to apply some systemic functional linguistic (SFL) concepts to task-based language teaching (TBLT) as a means of enriching the fields of learning, teaching, and evaluating writing in an additional language. The purposes are twofold. First, this article presents a concrete example concerning SFL-initiated genre-based tasks,…
ERIC Educational Resources Information Center
Toro, Juan M.; Pons, Ferran; Bion, Ricardo A. H.; Sebastian-Galles, Nuria
2011-01-01
Much research has explored the extent to which statistical computations account for the extraction of linguistic information. However, it remains to be studied how language-specific constraints are imposed over these computations. In the present study we investigated if the violation of a word-forming rule in Catalan (the presence of more than one…
Combining Multiple Knowledge Sources for Continuous Speech Recognition
1989-08-01
derived by estimating probabilities from a training set, or a linguistically-based model that uses syntactic and semantic information explicitly. The...into a hierarchical set of rules that would cover a much larger percentage of new sentences than the original sentence patterns. We applied this tool...statistical grammars typically used by the use of linguistic knowledge. In particular, we group the different words in the vocabulary into classes, under the
BJUT at TREC 2015 Microblog Track: Real Time Filtering Using Knowledge Base
2015-11-20
learning to rank of tweets. In Proceedings of the 23rd International Conference on Computational Linguistics, pages 295–303. Association for Computational...Linguistics, 2010. Thorsten Joachims. Optimizing search engines using clickthrough data. In Proceedings of the eighth ACM SIGKDD international
Fuzzy Linguistic Knowledge Based Behavior Extraction for Building Energy Management Systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dumidu Wijayasekara; Milos Manic
2013-08-01
A significant portion of world energy production is consumed by building Heating, Ventilation and Air Conditioning (HVAC) units. Thus, along with occupant comfort, energy efficiency is an important factor in HVAC control. Modern buildings use advanced Multiple Input Multiple Output (MIMO) control schemes to realize these goals. However, since the performance of HVAC units depends on many criteria, including uncertainties in weather, number of occupants, and thermal state, the performance of current state-of-the-art systems is sub-optimal. Furthermore, because of the large number of sensors in buildings and the high frequency of data collection, a large amount of information is available. Therefore, important behavior of buildings that compromises energy efficiency or occupant comfort is difficult to identify. This paper presents an easy-to-use and understandable framework for identifying such behavior. The presented framework uses a human-understandable knowledge base to extract important behavior of buildings and present it to users via a graphical user interface. The framework was tested on a building in the Pacific Northwest and was shown to be able to identify important behavior that relates to energy efficiency and occupant comfort.
Rule-based Approach on Extraction of Malay Compound Nouns in Standard Malay Document
NASA Astrophysics Data System (ADS)
Abu Bakar, Zamri; Kamal Ismail, Normaly; Rawi, Mohd Izani Mohamed
2017-08-01
A Malay compound noun is a form that arises when two or more words are combined into a single syntactic unit with a specific meaning. A compound noun acts as one unit and is spelled as separate words unless it is an established compound, in which case the words are written together. A basic characteristic of compound nouns in Malay sentences is the frequency of the word combination in the text itself. Extraction of compound nouns is therefore significant for downstream research such as text summarization, grammar checking, sentiment analysis, machine translation, and word categorization. Many research efforts have proposed linguistic approaches to extracting Malay compound nouns. Most existing methods extract bi-gram noun+noun compounds, but their results still leave room for improvement. This paper explores a linguistic method for extracting compound nouns from a standard Malay corpus. A standard dataset is used to provide a common platform for evaluating research on the recognition of compound nouns in Malay sentences. Because extraction results can be compromised, the effectiveness of compound noun extraction needs to be improved. This study therefore proposes a modification of the linguistic approach to enhance compound noun extraction. Several pre-processing steps are involved, including normalization, tokenization, and tagging. The first step that uses the linguistic approach in this study is Part-of-Speech (POS) tagging. Finally, we describe several rule-based methods and modify the rules to capture the most relevant relation between the first word and the second word. The effectiveness of the relations used in our study is measured using recall, precision, and F1-score. Comparison with baseline values is essential because it shows whether the result has improved.
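The pipeline this abstract outlines (POS tagging, a bi-gram noun+noun rule, then precision, recall, and F1 against a reference set) can be illustrated with a minimal sketch. The tag names, the hand-tagged toy sentence, the gold set, and the function names below are all assumptions made for illustration; they are not the authors' rules or data.

```python
from typing import List, Set, Tuple

Tagged = List[Tuple[str, str]]  # (token, POS tag); the tag names are hypothetical


def extract_noun_noun(tagged: Tagged) -> Set[Tuple[str, str]]:
    """Return every adjacent noun+noun bigram as a compound-noun candidate."""
    return {
        (w1.lower(), w2.lower())
        for (w1, t1), (w2, t2) in zip(tagged, tagged[1:])
        if t1 == "NOUN" and t2 == "NOUN"
    }


def precision_recall_f1(predicted: Set, gold: Set) -> Tuple[float, float, float]:
    """Score predicted candidates against a gold set of compounds."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy sentence, tagged by hand for illustration only.
sentence = [
    ("Guru", "NOUN"), ("besar", "ADJ"), ("itu", "DET"),
    ("menaiki", "VERB"), ("kereta", "NOUN"), ("api", "NOUN"),
]
# Hypothetical gold annotation: "guru besar" is a noun+adjective compound,
# so a pure noun+noun rule cannot find it.
gold = {("guru", "besar"), ("kereta", "api")}

candidates = extract_noun_noun(sentence)
precision, recall, f1 = precision_recall_f1(candidates, gold)
print(candidates, precision, recall, round(f1, 3))
```

In this toy case the noun+noun rule is precise but misses the noun+adjective compound, which is the kind of gap that modified rules would be expected to close.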
Sleep facilitates learning a new linguistic rule
Batterink, Laura J.; Oudiette, Delphine; Reber, Paul J.; Paller, Ken A.
2014-01-01
Natural languages contain countless regularities. Extraction of these patterns is an essential component of language acquisition. Here we examined the hypothesis that memory processing during sleep contributes to this learning. We exposed participants to a hidden linguistic rule by presenting a large number of two-word phrases, each including a noun preceded by one of four novel words that functioned as an article (e.g., gi rhino). These novel words (ul, gi, ro and ne) were presented as obeying an explicit rule: two words signified that the noun referent was relatively near, and two that it was relatively far. Undisclosed to participants was the fact that the novel articles also predicted noun animacy, with two of the articles preceding animate referents and the other two preceding inanimate referents. Rule acquisition was tested implicitly using a task in which participants responded to each phrase according to whether the noun was animate or inanimate. Learning of the hidden rule was evident in slower responses to phrases that violated the rule. Responses were delayed regardless of whether rule-knowledge was consciously accessible. Brain potentials provided additional confirmation of implicit and explicit rule-knowledge. An afternoon nap was interposed between two 20-min learning sessions. Participants who obtained greater amounts of both slow-wave and rapid-eye-movement sleep showed increased sensitivity to the hidden linguistic rule in the second session. We conclude that during sleep, reactivation of linguistic information linked with the rule was instrumental for stabilizing learning. The combination of slow-wave and rapid-eye-movement sleep may synergistically facilitate the abstraction of complex patterns in linguistic input. PMID:25447376
Text-mining and information-retrieval services for molecular biology
Krallinger, Martin; Valencia, Alfonso
2005-01-01
Text-mining in molecular biology - defined as the automatic extraction of information about genes, proteins and their functional relationships from text documents - has emerged as a hybrid discipline on the edges of the fields of information science, bioinformatics and computational linguistics. A range of text-mining applications have been developed recently that will improve access to knowledge for biologists and database annotators. PMID:15998455
Tang, Ming; Liao, Huchang; Li, Zongmin; Xu, Zeshui
2018-04-13
Because the natural disaster system is a very comprehensive and large system, the disaster reduction scheme must rely on risk analysis. Experts' knowledge and experiences play a critical role in disaster risk assessment. The hesitant fuzzy linguistic preference relation is an effective tool to express experts' preference information when comparing pairwise alternatives. Owing to a lack of knowledge or a heavy workload, information may be missing from the hesitant fuzzy linguistic preference relation. Thus, an incomplete hesitant fuzzy linguistic preference relation is constructed. In this paper, we first discuss some properties of the additive consistent hesitant fuzzy linguistic preference relation. Next, the incomplete hesitant fuzzy linguistic preference relation, the normalized hesitant fuzzy linguistic preference relation, and the acceptable hesitant fuzzy linguistic preference relation are defined. Afterwards, three procedures to estimate the missing information are proposed. The first one deals with the situation in which there are only n-1 known judgments involving all the alternatives; the second one is used to estimate the missing information of the hesitant fuzzy linguistic preference relation with more known judgments; while the third procedure is used to deal with ignorance situations in which there is at least one alternative with totally missing information. Furthermore, an algorithm for group decision making with incomplete hesitant fuzzy linguistic preference relations is given. Finally, we illustrate our model with a case study about flood disaster risk evaluation. A comparative analysis is presented to demonstrate the advantage of our method.
Rinaldi, Fabio; Schneider, Gerold; Kaljurand, Kaarel; Hess, Michael; Andronis, Christos; Konstandi, Ourania; Persidis, Andreas
2007-02-01
The number of new discoveries (as published in the scientific literature) in the biomedical area is growing at an exponential rate. This growth makes it very difficult to filter the most relevant results, and thus the extraction of the core information becomes very expensive. Therefore, there is a growing interest in text processing approaches that can deliver selected information from scientific publications, which can limit the amount of human intervention normally needed to gather those results. This paper presents and evaluates an approach aimed at automating the process of extracting functional relations (e.g. interactions between genes and proteins) from scientific literature in the biomedical domain. The approach, using a novel dependency-based parser, is based on a complete syntactic analysis of the corpus. We have implemented a state-of-the-art text mining system for biomedical literature, based on a deep-linguistic, full-parsing approach. The results are validated on two different corpora: the manually annotated genomics information access (GENIA) corpus and the automatically annotated Arabidopsis thaliana circadian rhythms (ATCR) corpus. We show how a deep-linguistic approach (contrary to common belief) can be used in a real-world text mining application, offering high-precision relation extraction while at the same time retaining sufficient recall.
A Logical Framework for Service Migration Based Survivability
2016-06-24
platforms; Service Migration Strategy Fuzzy Inference System Knowledge Base Fuzzy rules representing domain expert knowledge about implications of...service migration strategy. Our approach uses expert knowledge as linguistic reasoning rules and takes service programs damage assessment, service...programs complexity, and available network capability as input. The fuzzy inference system includes four components as shown in Figure 5: (1) a knowledge
ERIC Educational Resources Information Center
Trapman, Mirjam; van Gelderen, Amos; van Schooten, Erik; Hulstijn, Jan
2018-01-01
In a longitudinal design, 51 low-achieving adolescents' development in writing proficiency from Grades 7 to 9 was measured. There were 25 native-Dutch and 26 language-minority students. In addition, the roles of (1) linguistic knowledge, (2) metacognitive knowledge, and (3) linguistic fluency in predicting both the level and development of writing…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Taylor, R.C.
This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese's group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be a substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attack the problem of insertion into an alignment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Taylor, Ronald C.
This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese's group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be a substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attack the problem of insertion into an alignment.
Exploring relation types for literature-based discovery.
Preiss, Judita; Stevenson, Mark; Gaizauskas, Robert
2015-09-01
Literature-based discovery (LBD) aims to identify "hidden knowledge" in the medical literature by: (1) analyzing documents to identify pairs of explicitly related concepts (terms), then (2) hypothesizing novel relations between pairs of unrelated concepts that are implicitly related via a shared concept to which both are explicitly related. Many LBD approaches use simple techniques to identify semantically weak relations between concepts, for example, document co-occurrence. These generate huge numbers of hypotheses, difficult for humans to assess. More complex techniques rely on linguistic analysis, for example, shallow parsing, to identify semantically stronger relations. Such approaches generate fewer hypotheses, but may miss hidden knowledge. The authors investigate this trade-off in detail, comparing techniques for identifying related concepts to discover which are most suitable for LBD. A generic LBD system that can utilize a range of relation types was developed. Experiments were carried out comparing a number of techniques for identifying relations. Two approaches were used for evaluation: replication of existing discoveries and the "time slicing" approach. Previous LBD discoveries could be replicated using relations based either on document co-occurrence or linguistic analysis. Using relations based on linguistic analysis generated many fewer hypotheses, but a significantly greater proportion of them were candidates for hidden knowledge. The use of linguistic analysis-based relations improves accuracy of LBD without overly damaging coverage. LBD systems often generate huge numbers of hypotheses, which are infeasible to manually review. Improving their accuracy has the potential to make these systems significantly more usable. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association.
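The two-step scheme in this abstract (find explicitly related pairs, then hypothesize links between unrelated concepts that share an intermediate concept) can be sketched with the simplest relation type it mentions, document co-occurrence. This is only an illustrative sketch, not the authors' system: the function names are hypothetical and the five toy "documents" are invented, loosely modelled on the well-known fish oil and Raynaud's disease example from the LBD literature.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, Iterable, Set


def cooccurrence_relations(documents: Iterable[Set[str]]) -> Dict[str, Set[str]]:
    """Step 1: treat two concepts as explicitly related if they share a document."""
    related: Dict[str, Set[str]] = defaultdict(set)
    for concepts in documents:
        for a, b in combinations(sorted(concepts), 2):
            related[a].add(b)
            related[b].add(a)
    return related


def hypothesize(related: Dict[str, Set[str]], source: str) -> Dict[str, Set[str]]:
    """Step 2: propose targets C not directly related to the source A,
    together with the linking concepts B that are related to both."""
    direct = related.get(source, set())
    hypotheses: Dict[str, Set[str]] = defaultdict(set)
    for b in direct:
        for c in related.get(b, set()):
            if c != source and c not in direct:
                hypotheses[c].add(b)
    return dict(hypotheses)


# Invented toy corpus: each document is reduced to the set of concepts it mentions.
docs = [
    {"fish oil", "blood viscosity"},
    {"blood viscosity", "raynaud disease"},
    {"fish oil", "platelet aggregation"},
    {"platelet aggregation", "raynaud disease"},
    {"aspirin", "platelet aggregation"},
]

related = cooccurrence_relations(docs)
result = hypothesize(related, "fish oil")
for target, links in sorted(result.items()):
    print(target, "via", sorted(links))
```

Even on five documents the co-occurrence relation proposes every concept two hops away, which hints at why weak relations produce hypothesis sets too large to review by hand; swapping `cooccurrence_relations` for a stricter relation extractor would shrink `related` and therefore the output.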
From spoken narratives to domain knowledge: mining linguistic data for medical image understanding.
Guo, Xuan; Yu, Qi; Alm, Cecilia Ovesdotter; Calvelli, Cara; Pelz, Jeff B; Shi, Pengcheng; Haake, Anne R
2014-10-01
Extracting useful visual clues from medical images allowing accurate diagnoses requires physicians' domain knowledge acquired through years of systematic study and clinical training. This is especially true in the dermatology domain, a medical specialty that requires physicians to have image inspection experience. Automating or at least aiding such efforts requires understanding physicians' reasoning processes and their use of domain knowledge. Mining physicians' references to medical concepts in narratives during image-based diagnosis of a disease is an interesting research topic that can help reveal experts' reasoning processes. It can also be a useful resource to assist with design of information technologies for image use and for image case-based medical education systems. We collected data for analyzing physicians' diagnostic reasoning processes by conducting an experiment that recorded their spoken descriptions during inspection of dermatology images. In this paper we focus on the benefit of physicians' spoken descriptions and provide a general workflow for mining medical domain knowledge based on linguistic data from these narratives. The challenge of a medical image case can influence the accuracy of the diagnosis as well as how physicians pursue the diagnostic process. Accordingly, we define two lexical metrics for physicians' narratives--lexical consensus score and top N relatedness score--and evaluate their usefulness by assessing the diagnostic challenge levels of corresponding medical images. We also report on clustering medical images based on anchor concepts obtained from physicians' medical term usage. These analyses are based on physicians' spoken narratives that have been preprocessed by incorporating the Unified Medical Language System for detecting medical concepts. 
The image rankings based on lexical consensus score and on top 1 relatedness score are well correlated with those based on challenge levels (Spearman correlation>0.5 and Kendall correlation>0.4). Clustering results are largely improved based on our anchor concept method (accuracy>70% and mutual information>80%). Physicians' spoken narratives are valuable for the purpose of mining the domain knowledge that physicians use in medical image inspections. We also show that the semantic metrics introduced in the paper can be successfully applied to medical image understanding and allow discussion of additional uses of these metrics. Copyright © 2014 Elsevier B.V. All rights reserved.
Sleep facilitates learning a new linguistic rule.
Batterink, Laura J; Oudiette, Delphine; Reber, Paul J; Paller, Ken A
2014-12-01
Natural languages contain countless regularities. Extraction of these patterns is an essential component of language acquisition. Here we examined the hypothesis that memory processing during sleep contributes to this learning. We exposed participants to a hidden linguistic rule by presenting a large number of two-word phrases, each including a noun preceded by one of four novel words that functioned as an article (e.g., gi rhino). These novel words (ul, gi, ro and ne) were presented as obeying an explicit rule: two words signified that the noun referent was relatively near, and two that it was relatively far. Undisclosed to participants was the fact that the novel articles also predicted noun animacy, with two of the articles preceding animate referents and the other two preceding inanimate referents. Rule acquisition was tested implicitly using a task in which participants responded to each phrase according to whether the noun was animate or inanimate. Learning of the hidden rule was evident in slower responses to phrases that violated the rule. Responses were delayed regardless of whether rule-knowledge was consciously accessible. Brain potentials provided additional confirmation of implicit and explicit rule-knowledge. An afternoon nap was interposed between two 20-min learning sessions. Participants who obtained greater amounts of both slow-wave and rapid-eye-movement sleep showed increased sensitivity to the hidden linguistic rule in the second session. We conclude that during sleep, reactivation of linguistic information linked with the rule was instrumental for stabilizing learning. The combination of slow-wave and rapid-eye-movement sleep may synergistically facilitate the abstraction of complex patterns in linguistic input. Copyright © 2014 Elsevier Ltd. All rights reserved.
Navigating a Mobile Robot Across Terrain Using Fuzzy Logic
NASA Technical Reports Server (NTRS)
Seraji, Homayoun; Howard, Ayanna; Bon, Bruce
2003-01-01
A strategy for autonomous navigation of a robotic vehicle across hazardous terrain involves the use of a measure of traversability of terrain within a fuzzy-logic conceptual framework. This navigation strategy requires no a priori information about the environment. Fuzzy logic was selected as a basic element of this strategy because it provides a formal methodology for representing and implementing a human driver's heuristic knowledge and operational experience. Within a fuzzy-logic framework, the attributes of human reasoning and decision-making can be formulated by simple IF (antecedent), THEN (consequent) rules coupled with easily understandable and natural linguistic representations. The linguistic values in the rule antecedents convey the imprecision associated with measurements taken by sensors onboard a mobile robot, while the linguistic values in the rule consequents represent the vagueness inherent in the reasoning processes to generate the control actions. The operational strategies of the human expert driver can be transferred, via fuzzy logic, to a robot-navigation strategy in the form of a set of simple conditional statements composed of linguistic variables. These linguistic variables are defined by fuzzy sets in accordance with user-defined membership functions. The main advantages of a fuzzy navigation strategy lie in the ability to extract heuristic rules from human experience and to obviate the need for an analytical model of the robot navigation process.
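The idea of IF (antecedent), THEN (consequent) rules over linguistic variables with user-defined membership functions can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration: the two inputs (roughness and slope, both scaled to [0, 1]), the linear membership functions, the two rules, and the singleton speed values. A weighted average of singleton consequents is used as a simple stand-in defuzzifier; the abstract does not say which inference or defuzzification method the actual system uses.

```python
def clamp01(value: float) -> float:
    """Limit a value to the interval [0, 1]."""
    return max(0.0, min(1.0, value))


def low(x: float) -> float:
    """Hypothetical membership in LOW: 1 at x = 0, falling linearly to 0 at x = 1."""
    return clamp01(1.0 - x)


def high(x: float) -> float:
    """Hypothetical membership in HIGH: 0 at x = 0, rising linearly to 1 at x = 1."""
    return clamp01(x)


# Assumed singleton consequents (fraction of maximum speed).
SPEED_FAST = 1.0
SPEED_SLOW = 0.2


def recommend_speed(roughness: float, slope: float) -> float:
    """Evaluate two illustrative rules and blend their consequents.

    Rule 1: IF roughness is LOW AND slope is LOW THEN speed is FAST
    Rule 2: IF roughness is HIGH OR slope is HIGH THEN speed is SLOW
    AND is modelled as min, OR as max.
    """
    fire_fast = min(low(roughness), low(slope))
    fire_slow = max(high(roughness), high(slope))
    total = fire_fast + fire_slow
    if total == 0.0:  # defensive guard; no rule fired
        return SPEED_SLOW
    return (fire_fast * SPEED_FAST + fire_slow * SPEED_SLOW) / total


for reading in [(0.0, 0.0), (0.5, 0.5), (1.0, 0.0)]:
    print(reading, round(recommend_speed(*reading), 3))
```

Note that no analytical model of the vehicle appears anywhere in the sketch: the behaviour comes entirely from the rules and the membership functions, which is the advantage the abstract points to.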
Teaching Advanced Literacy Skills: A Guide for Leaders in Linguistically Diverse Schools
ERIC Educational Resources Information Center
Lesaux, Nonie K.; Galloway, Emily Phillips; Marietta, Sky H.
2016-01-01
In our knowledge-based society, K-8 students need to develop increasingly sophisticated skills to read, write, and speak for a wide variety of purposes and audiences. Including an extended case example from a linguistically diverse school, this book guides school leaders to design and implement advanced literacy instruction through four key…
ERIC Educational Resources Information Center
Maillat, Didier; Serra, Cecilia
2009-01-01
This paper focusses on the teaching of non-linguistic subject matters in a second or third language through bilingual education. We investigate how this specific educational framework influences the development of linguistic competence as well as disciplinary knowledge. Based on a large-scale corpus of classroom interactions collected in bilingual…
Applying Linguistics in the Teaching of Reading and the Language Arts.
ERIC Educational Resources Information Center
Eisenhardt, Catheryn
The purpose of this book is to illustrate how the principles revealed by linguistic research can be translated into classroom practice. Emphasis is placed on: (1) a methodology which offers opportunities for children to create knowledge based on their observations of language tested against their intuitive speech, and (2) a content which is…
The politics and semiotics of sounds--Mayan linguistics and nation-building in Guatemala.
French, Brigittine M
2004-01-01
This paper discusses the development of Mayan linguistics as an authoritative field of knowledge in Guatemala. In particular, it links missionary linguists' and Maya linguists' activities with shifting nationalist agendas from the 1920s into the late 1980s. It is argued that during the historical and intellectual moment that linguistics becomes an authoritative epistemology, phonetic analysis functions as a creative index that constitutes "expert" knowledge for particular semiotic and ideological reasons tied to competing versions of the Guatemalan imagined community.
Knowledge representation for fuzzy inference aided medical image interpretation.
Gal, Norbert; Stoicu-Tivadar, Vasile
2012-01-01
Knowledge defines how an automated system transforms data into information. This paper suggests a representation method of medical imaging knowledge using fuzzy inference systems coded in XML files. The imaging knowledge incorporates features of the investigated objects in linguistic form and inference rules that can transform the linguistic data into information about a possible diagnosis. A fuzzy inference system is used to model the vagueness of the linguistic medical imaging terms. XML files are used to facilitate easy manipulation and deployment of the knowledge into the imaging software. Preliminary results are presented.
Computer Aided Program Synthesis.
1980-01-01
Representations; Refinements and Reductions; Dependencies; The Programming Knowledge Base; Linguistic Knowledge...strategy selection knowledge, i.e. knowledge representing a context sensitive discrimination among alternate methods; and knowledge of logical...program, each supplying his expertise. The client describes his task to the consultant and supplies answers and explanations to the consultant’s
NASA Astrophysics Data System (ADS)
Lu, Qian
2017-07-01
Exploring language universals is one of the major goals of linguistic research, which is largely devoted to answering the "Platonic questions" in linguistics: what knowledge of language is, and how this knowledge is acquired and used. However, if guided solely by linguistic intuition, syntactic studies find it very difficult to answer these questions or to achieve abstractions in the scientific sense. This suggests that linguistic analyses based on probability theory may provide effective ways to investigate language universals in terms of biological motivations or cognitive psychological mechanisms. With the view that "Language is a human-driven system", Liu, Xu & Liang's review [1] pointed out that dependency distance minimization (DDM), which has been corroborated by big-data corpus analysis, may be a language universal shaped by language evolution, a universal that has a profound effect on syntactic patterns.
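Dependency distance is commonly taken to be the number of word positions separating a word from its syntactic head, and a sentence's mean dependency distance (MDD) averages this over all non-root words. The sketch below computes MDD under that assumption for two orderings of the same words; the head annotations were written by hand for illustration and do not come from the reviewed work or from any treebank.

```python
from typing import List


def mean_dependency_distance(heads: List[int]) -> float:
    """Mean absolute distance between each word and its head.

    heads[i] is the 1-based position of the head of word i + 1;
    0 marks the root, which has no head and is skipped.
    """
    distances = [
        abs(position - head)
        for position, head in enumerate(heads, start=1)
        if head != 0
    ]
    return sum(distances) / len(distances) if distances else 0.0


# "John threw out the trash": John->threw, threw=root, out->threw,
# the->trash, trash->threw (hand-annotated, illustrative only).
particle_first = [2, 0, 2, 5, 2]

# "John threw the trash out": John->threw, threw=root, the->trash,
# trash->threw, out->threw.
particle_last = [2, 0, 4, 2, 2]

print(mean_dependency_distance(particle_first))
print(mean_dependency_distance(particle_last))
```

The same dependency tree yields a smaller MDD when the particle stays next to the verb. DDM is the claim that, across large corpora, attested word orders tend toward such shorter distances more than chance would predict; a single sentence pair like this one only illustrates the measure, not the claim.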
ERIC Educational Resources Information Center
Puliatte, Alison; Ehri, Linnea C.
2018-01-01
The relationship between 2nd and 3rd grade teachers' linguistic knowledge and spelling instructional practices and their students' spelling gains from fall to spring was examined. Second grade (N = 16) and 3rd grade (N = 16) teachers were administered an instructional practices survey and a linguistic knowledge test. Total scores on the two…
'Needs only' Analysis in Linguistic Ontogeny and Phylogeny
NASA Astrophysics Data System (ADS)
Wray, Alison
Recently, linguists from several quarters have begun to unpack some of the assumptions and claims made in linguistics over the last 40 years, opening up new possibilities for synergies between linguistic theory and the variety of fields that engage with it. A key point of exploration is the relationship between external manifestations of language and the underlying mental model that produces and understands them. To what extent does it remain reasonable to argue that all humans 'know' certain things about language, even if they never demonstrate that knowledge? What is the status of knowledge that is only stimulated into expression by particular cultural input? Many have asked whether the human's linguistic behaviour can be explained with recourse to less innate knowledge than Chomskian models traditionally assume.
Testing of a Natural Language Retrieval System for a Full Text Knowledge Base.
ERIC Educational Resources Information Center
Bernstein, Lionel M.; Williamson, Robert E.
1984-01-01
The Hepatitis Knowledge Base (text of prototype information system) was used for modifying and testing "A Navigator of Natural Language Organized (Textual) Data" (ANNOD), a retrieval system which combines probabilistic, linguistic, and empirical means to rank individual paragraphs of full text for similarity to natural language queries…
NASA Astrophysics Data System (ADS)
Hayes, Aneta L.; Mansour, Nasser
2017-04-01
Changes in the cultural and linguistic environments of learners are often associated with identity shifts. The aim of this study was to explore what identity shifts occur when science students from Bahraini national schools transition to an international university. The role of two aspects of learner identity—that is, English proficiency and science background knowledge, was examined in this study. Focus groups and semi-structured interviews were conducted with students and with university lecturers. The analysis suggested three conceptual themes of (1) reliance on science knowledge, (2) the auxiliary role of professional language and (3) adequacy of student learning strategies, demonstrating what subjective meanings the participants ascribe to the interplay between science knowledge and linguistic ability. The findings suggest that despite the lack of adequate linguistic attributes, the students are still able to successfully learn science in the context of language change. It is also implied that through strategically utilising their academic background in science, students preserve their identity as successful learners from school through to university. We conclude that agency plays a separate role in transition and is not a sole function of identity. We also contest the idea of language as a necessary attribute of one's identity as it was perceived by our participants to be an advantage and an auxiliary tool rather than a requirement.
New approach for cognitive analysis and understanding of medical patterns and visualizations
NASA Astrophysics Data System (ADS)
Ogiela, Marek R.; Tadeusiewicz, Ryszard
2003-11-01
This paper presents new opportunities for applying linguistic descriptions of the substantive content of pictures, together with AI methods, to the task of automatically understanding image semantics in intelligent medical information systems. Successfully obtaining the crucial semantic content of a medical image may contribute considerably to the creation of new intelligent multimedia cognitive medical systems. Thanks to the new idea of cognitive resonance between the stream of data extracted from the image using linguistic methods and the expectations taken from the representation of medical knowledge, it is possible to understand the substantive content of the image even if the form of the image is very different from any known pattern. This article shows that structural techniques of artificial intelligence may be applied to tasks related to automatic classification and machine perception based on semantic pattern content, in order to determine the semantic meaning of the patterns. The paper describes some examples of how such techniques can be applied in the creation of cognitive vision systems for selected classes of medical images. On the basis of the research described in the paper, we try to build new systems for collecting, storing, retrieving, and intelligently interpreting selected medical images, especially those obtained in radiological and MRI examinations.
Introduction to the special issue: parsimony and redundancy in models of language.
Wiechmann, Daniel; Kerz, Elma; Snider, Neal; Jaeger, T Florian
2013-09-01
One of the most fundamental goals in linguistic theory is to understand the nature of linguistic knowledge, that is, the representations and mechanisms that figure in a cognitively plausible model of human language-processing. The past 50 years have witnessed the development and refinement of various theories about what kind of 'stuff' human knowledge of language consists of, and technological advances now permit the development of increasingly sophisticated computational models implementing key assumptions of different theories from both rationalist and empiricist perspectives. The present special issue does not aim to present or discuss the arguments for and against the two epistemological stances or discuss evidence that supports either of them (cf. Bod, Hay, & Jannedy, 2003; Christiansen & Chater, 2008; Hauser, Chomsky, & Fitch, 2002; Oaksford & Chater, 2007; O'Donnell, Hauser, & Fitch, 2005). Rather, the research presented in this issue, which we label usage-based here, conceives of linguistic knowledge as being induced from experience. According to the strongest of such accounts, the acquisition and processing of language can be explained with reference to general cognitive mechanisms alone (rather than with reference to innate language-specific mechanisms). Defined in these terms, usage-based approaches encompass approaches referred to as experience-based, performance-based and/or emergentist approaches (Arnon & Snider, 2010; Bannard, Lieven, & Tomasello, 2009; Bannard & Matthews, 2008; Chater & Manning, 2006; Clark & Lappin, 2010; Gerken, Wilson, & Lewis, 2005; Gomez, 2002;
ERIC Educational Resources Information Center
Hayes, Aneta L.; Mansour, Nasser
2017-01-01
Changes in the cultural and linguistic environments of learners are often associated with identity shifts. The aim of this study was to explore what identity shifts occur when science students from Bahraini national schools transition to an international university. The role of two aspects of learner identity--that is, English proficiency and…
The Spelling Sensitivity Score: Noting Developmental Changes in Spelling Knowledge
ERIC Educational Resources Information Center
Masterson, Julie J.; Apel, Kenn
2010-01-01
Spelling is a language skill supported by several linguistic knowledge sources, including phonemic, orthographic, and morphological knowledge. Typically, however, spelling assessment procedures do not capture the development and use of these linguistic knowledge sources. The purpose of this article is to describe a new assessment system, the…
Identification of research hypotheses and new knowledge from scientific literature.
Shardlow, Matthew; Batista-Navarro, Riza; Thompson, Paul; Nawaz, Raheel; McNaught, John; Ananiadou, Sophia
2018-06-25
Text mining (TM) methods have been used extensively to extract relations and events from the literature. In addition, TM techniques have been used to extract various types or dimensions of interpretative information, known as Meta-Knowledge (MK), from the context of relations and events, e.g. negation, speculation, certainty and knowledge type. However, most existing methods have focussed on the extraction of individual dimensions of MK, without investigating how they can be combined to obtain even richer contextual information. In this paper, we describe a novel, supervised method to extract new MK dimensions that encode Research Hypotheses (an author's intended knowledge gain) and New Knowledge (an author's findings). The method incorporates various features, including a combination of simple MK dimensions. We identify previously explored dimensions and then use a random forest to combine these with linguistic features into a classification model. To facilitate evaluation of the model, we have enriched two existing corpora annotated with relations and events, i.e., a subset of the GENIA-MK corpus and the EU-ADR corpus, by adding attributes to encode whether each relation or event corresponds to Research Hypothesis or New Knowledge. In the GENIA-MK corpus, these new attributes complement simpler MK dimensions that had previously been annotated. We show that our approach is able to assign different types of MK dimensions to relations and events with a high degree of accuracy. Firstly, our method is able to improve upon the previously reported state of the art performance for an existing dimension, i.e., Knowledge Type. Secondly, we also demonstrate high F1-score in predicting the new dimensions of Research Hypothesis (GENIA: 0.914, EU-ADR 0.802) and New Knowledge (GENIA: 0.829, EU-ADR 0.836). We have presented a novel approach for predicting New Knowledge and Research Hypothesis, which combines simple MK dimensions to achieve high F1-scores. The extraction of such information is valuable for a number of practical TM applications.
Expert system training and control based on the fuzzy relation matrix
NASA Technical Reports Server (NTRS)
Ren, Jie; Sheridan, T. B.
1991-01-01
Fuzzy knowledge, that for which the terms of reference are not crisp but overlapped, seems to characterize human expertise. This can be shown from the fact that an experienced human operator can control some complex plants better than a computer can. Proposed here is the use of fuzzy theory to build a fuzzy expert relation matrix (FERM) from given rules and/or examples, expressed either in linguistic terms or in numerical values, to mimic human processes of perception and decision making. The knowledge base is codified in terms of many implicit fuzzy rules. Fuzzy knowledge thus codified may also be compared with explicit rules specified by a human expert. It can also provide a basis for modeling the human operator and allow comparison of what a human operator says to what he does in practice. Two experiments were performed. The first, control of liquid in a tank, demonstrates how the FERM knowledge base is elicited and trained. The second shows how to use a FERM, built up from linguistic rules, to control an inverted pendulum without a dynamic model.
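Illustrative sketch (not taken from the report): one common way to turn linguistic rules into a fuzzy relation matrix is the Mamdani-style construction, in which each rule contributes the element-wise minimum of its antecedent and consequent membership vectors, rules are merged with max, and inference uses max-min composition. Whether the FERM is built exactly this way is an assumption; the tank-level membership vectors and all function names below are hypothetical.

```python
def rule_relation(antecedent, consequent):
    """Relation matrix of one rule: R[i][j] = min(antecedent[i], consequent[j])."""
    return [[min(a, c) for c in consequent] for a in antecedent]


def combine(relations):
    """Merge several rule matrices with element-wise max."""
    rows, cols = len(relations[0]), len(relations[0][0])
    return [[max(r[i][j] for r in relations) for j in range(cols)]
            for i in range(rows)]


def infer(observation, relation):
    """Max-min composition of an observed fuzzy input with the relation matrix."""
    cols = len(relation[0])
    return [max(min(o, relation[i][j]) for i, o in enumerate(observation))
            for j in range(cols)]


# Hypothetical memberships sampled at three points of each universe.
LEVEL_LOW = [1.0, 0.5, 0.0]     # tank level: low
LEVEL_HIGH = [0.0, 0.5, 1.0]    # tank level: high
VALVE_OPEN = [0.0, 0.5, 1.0]    # inlet valve: open
VALVE_CLOSED = [1.0, 0.5, 0.0]  # inlet valve: closed

# Rules: "if level is low then valve is open", "if level is high then valve is closed".
relation_matrix = combine([
    rule_relation(LEVEL_LOW, VALVE_OPEN),
    rule_relation(LEVEL_HIGH, VALVE_CLOSED),
])

# A crisp observation at the lowest level point recovers the "open" fuzzy set.
valve = infer([1.0, 0.0, 0.0], relation_matrix)
```

The matrix stores the rules only implicitly, which matches the abstract's point that knowledge codified this way can later be compared against explicit rules stated by an expert.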
Cognition-Based Approaches for High-Precision Text Mining
ERIC Educational Resources Information Center
Shannon, George John
2017-01-01
This research improves the precision of information extraction from free-form text via the use of cognitive-based approaches to natural language processing (NLP). Cognitive-based approaches are an important, and relatively new, area of research in NLP and search, as well as linguistics. Cognitive approaches enable significant improvements in both…
What Does Corpus Linguistics Have to Offer to Language Assessment?
ERIC Educational Resources Information Center
Xi, Xiaoming
2017-01-01
In recent years, continuing advances in technology have increased the capacity to automate the extraction of a range of linguistic features of texts and thus have provided the impetus for the substantial growth of corpus linguistics. While corpus linguistic tools and methods have been used extensively in second language learning research, they…
Linguistic measures of chemical diversity and the "keywords" of molecular collections.
Woźniak, Michał; Wołos, Agnieszka; Modrzyk, Urszula; Górski, Rafał L; Winkowski, Jan; Bajczyk, Michał; Szymkuć, Sara; Grzybowski, Bartosz A; Eder, Maciej
2018-05-15
Computerized linguistic analyses have proven of immense value in comparing and searching through large text collections ("corpora"), including those deposited on the Internet - indeed, it would nowadays be hard to imagine browsing the Web without, for instance, search algorithms extracting most appropriate keywords from documents. This paper describes how such corpus-linguistic concepts can be extended to chemistry based on characteristic "chemical words" that span more than traditional functional groups and, instead, look at common structural fragments molecules share. Using these words, it is possible to quantify the diversity of chemical collections/databases in new ways and to define molecular "keywords" by which such collections are best characterized and annotated.
ERIC Educational Resources Information Center
Petropoulos, Constance
2012-01-01
Studies by Moats (1995), Mather, Bos, and Babur (2001), and McCutchen, et al (2002) have begun to identify the relationship between teachers' linguistic knowledge and what is known, scientifically, about how literacy is acquired by learners. Findings from these studies support the idea that linguistic knowledge--particularly knowledge of…
Event-Based Plausibility Immediately Influences On-Line Language Comprehension
ERIC Educational Resources Information Center
Matsuki, Kazunaga; Chow, Tracy; Hare, Mary; Elman, Jeffrey L.; Scheepers, Christoph; McRae, Ken
2011-01-01
In some theories of sentence comprehension, linguistically relevant lexical knowledge, such as selectional restrictions, is privileged in terms of the time-course of its access and influence. We examined whether event knowledge computed by combining multiple concepts can rapidly influence language understanding even in the absence of selectional…
Semantics vs. World Knowledge in Prefrontal Cortex
ERIC Educational Resources Information Center
Pylkkanen, Liina; Oliveri, Bridget; Smart, Andrew J.
2009-01-01
Humans have knowledge about the properties of their native language at various levels of representation; sound, structure, and meaning computation constitute the core components of any linguistic theory. Although the brain sciences have engaged with representational theories of sound and syntactic structure, the study of the neural bases of…
Children's Understanding of Speaker Reliability between Lexical and Syntactic Knowledge
ERIC Educational Resources Information Center
Sobel, David M.; Macris, Deanna M.
2013-01-01
Many studies suggest that preschoolers rely on individuals' histories of generating accurate lexical information when learning novel lexical information from them. The present study examined whether children used a speaker's accuracy about one kind of linguistic knowledge to make inferences about another kind of linguistic knowledge, focusing…
ERIC Educational Resources Information Center
Trapman, Mirjam; van Gelderen, Amos; van Steensel, Roel; van Schooten, Erik; Hulstijn, Jan
2014-01-01
In this study we investigate the role of linguistic knowledge, fluency and meta-cognitive knowledge in Dutch reading comprehension of monolingual and bilingual adolescent academic low achievers in the Netherlands. Results show that these components are substantially associated with reading comprehension. However, their role appears to be different…
The Availability of Conscious Knowledge: A Comment on Lindseth (2016)
ERIC Educational Resources Information Center
Krashen, Stephen
2016-01-01
Lindseth (2016) reported that direct instruction and practice using the German verb-inversion rule resulted in higher accuracy in an oral test for college students, supporting the hypothesis that explicit linguistic knowledge can become implicit linguistic knowledge. It is quite likely, however, that the conditions for the use of conscious…
What Is Linguistics? ERIC Digest. [Revised].
ERIC Educational Resources Information Center
ERIC Clearinghouse on Languages and Linguistics, Washington, DC.
Linguistics is the study of language, as contrasted with knowledge of a specific language. Formal linguistics is the study of the structures and processes of language, or how it works and is organized. Different approaches to formal linguistics include traditional or prescriptive, structural, and generative or transformational perspectives. Formal…
n-Gram-Based Indexing for Korean Text Retrieval.
ERIC Educational Resources Information Center
Lee, Joon Ho; Cho, Hyun Yang; Park, Hyouk Ro
1999-01-01
Discusses indexing methods in Korean text retrieval and proposes a new indexing method based on n-grams which can handle compound nouns effectively without dictionaries and complex linguistic knowledge. Experimental results show that n-gram-based indexing is considerably faster than morpheme-based indexing, and also provides better retrieval…
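Illustrative sketch (an assumption, not the indexing scheme evaluated in the article): character n-gram indexing can be pictured as splitting each whitespace-delimited word into overlapping n-grams and posting the document under every n-gram, so that a component of a compound noun is found without a dictionary or morphological analysis. The choice of bigrams, the sample documents, and the function names are hypothetical.

```python
from collections import defaultdict


def ngrams(token, n=2):
    """Overlapping character n-grams of a token; short tokens are kept whole."""
    if len(token) <= n:
        return [token]
    return [token[i:i + n] for i in range(len(token) - n + 1)]


def build_index(docs, n=2):
    """Inverted index mapping each n-gram to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.split():
            for gram in ngrams(token, n):
                index[gram].add(doc_id)
    return index


def search(index, query, n=2):
    """Documents that contain every n-gram of the query."""
    grams = [g for token in query.split() for g in ngrams(token, n)]
    if not grams:
        return set()
    result = set(index.get(grams[0], set()))
    for gram in grams[1:]:
        result &= index.get(gram, set())
    return result


docs = {
    1: "정보검색 시스템",  # "information-retrieval system"; the first word is a compound noun
    2: "도서관 정보",      # "library information"
}
index = build_index(docs)
hits_retrieval = search(index, "검색")  # "retrieval", found inside the compound
hits_information = search(index, "정보")  # "information"
```

Here the query for "retrieval" matches document 1 even though the word only occurs inside an unsegmented compound, which is the behaviour the abstract attributes to n-gram-based indexing.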
CRIE: An automated analyzer for Chinese texts.
Sung, Yao-Ting; Chang, Tao-Hsing; Lin, Wei-Chun; Hsieh, Kuan-Sheng; Chang, Kuo-En
2016-12-01
Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.
The Linguistically Aware Teacher and the Teacher-Aware Linguist
ERIC Educational Resources Information Center
McCartney, Elspeth; Ellis, Sue
2013-01-01
This review evaluates issues of teacher linguistic knowledge relating to their work with children with speech, language and communication difficulties (SLCD). Information is from Ellis and McCartney [(2011a). "Applied linguistics and primary school teaching." Cambridge: Cambridge University Press], a state-of-the-art text deriving from a British…
Robson, Barry; Boray, Srinidhi
2016-06-01
Extracting medical knowledge by structured data mining of many medical records and from unstructured data mining of natural language source text on the Internet will become increasingly important for clinical decision support. Output from these sources can be transformed into large numbers of elements of knowledge in a Knowledge Representation Store (KRS), here using the notation and to some extent the algebraic principles of the Q-UEL Web-based universal exchange and inference language described previously, rooted in Dirac notation from quantum mechanics and linguistic theory. In a KRS, semantic structures or statements about the world of interest to medicine are analogous to natural language sentences seen as formed from noun phrases separated by verbs, prepositions and other descriptions of relationships. A convenient method of testing and better curating these elements of knowledge is by having the computer use them to take the test of a multiple choice medical licensing examination. It is a venture which perhaps tells us almost as much about the reasoning of students and examiners as it does about the requirements for Artificial Intelligence as employed in clinical decision making. It emphasizes the role of context and of contextual probabilities as opposed to the more familiar intrinsic probabilities, and of a preliminary form of logic that we call presyllogistic reasoning.
Semantic Analysis of Email Using Domain Ontologies and WordNet
NASA Technical Reports Server (NTRS)
Berrios, Daniel C.; Keller, Richard M.
2005-01-01
The problem of capturing and accessing knowledge in paper form has been supplanted by a problem of providing structure to vast amounts of electronic information. Systems that can automatically construct semantic links for natural language documents like email messages will be a crucial element of semantic email tools. We have designed an information extraction process that can leverage the knowledge already contained in an existing semantic web, recognizing references in email to existing nodes in a network of ontology instances by using linguistic knowledge and knowledge of the structure of the semantic web. We developed a heuristic score that uses several forms of evidence to detect references in email to existing nodes in the SemanticOrganizer repository's network. While these scores cannot directly support automated probabilistic inference, they can be used to rank nodes by relevance and link those deemed most relevant to email messages.
Ellis, Rebecca J Bartlett; Connor, Ulla; Marshall, James
2014-01-01
Purpose: This study evaluated the feasibility of developing linguistically tailored educational messages designed to match the linguistic styles of patients segmented into types with the Descriptor™, and to determine patient preferences for tailored or standard messages based on their segments. Patients and methods: Twenty patients with type 2 diabetes (T2DM) were recruited from a diabetes health clinic. Participants were segmented using the Descriptor™, a language-based questionnaire, to identify patient types based on their control orientation (internal/external), agency (high/low), and affect (positive/negative), which are well studied constructs related to T2DM self-management. Two of the seven self-care behaviors described by the American Association of Diabetes Educators (healthy eating and taking medication) were used to develop standard messages and then linguistically tailored using features of the six different construct segment types of the Descriptor™. A subset of seven participants each provided feedback on their preference for standard or linguistically tailored messages; 12 comparisons between standard and tailored messages were made. Results: Overall, the tailored messages were preferred to the standard messages. When the messages were matched to specific construct segment types, the tailored messages were preferred over the standard messages, although this was not statistically significant. Conclusion: Linguistically tailoring messages based on construct segments is feasible. Furthermore, tailored messages were more often preferred over standard messages. This study provides some preliminary evidence for tailoring messages based on the linguistic features of control orientation, agency, and affect. The messages developed in this study should be tested in a larger more representative sample. The present study did not explore whether tailored messages were better understood. This research will serve as preliminary evidence to develop future studies with the ultimate goal to design intervention studies to investigate if linguistically tailoring communication within the context of patient education influences patient knowledge, motivation, and activation toward making healthy behavior changes in T2DM self-management. PMID:25336928
Neuro-Fuzzy Support of Knowledge Management in Social Regulation
NASA Astrophysics Data System (ADS)
Petrovic-Lazarevic, Sonja; Coghill, Ken; Abraham, Ajith
2002-09-01
The aim of the paper is to demonstrate the neuro-fuzzy support of knowledge management in social regulation. For social regulation purposes, knowledge can be understood as explicit or tacit. Explicit knowledge relates to the community culture, indicating how things work in the community based on social policies and procedures. Tacit knowledge comprises the ethics and norms of the community. The former can be codified, stored and transferred in order to support decision making, while the latter, being based on personal knowledge, experience and judgments, is difficult to codify and store. Tacit knowledge expressed through linguistic information can be stored and used to support knowledge management in social regulation through the application of fuzzy and neuro-fuzzy logic.
Knowledge and School Talk: Intellectual Accommodations to Literacy?
ERIC Educational Resources Information Center
Freebody, Peter
2013-01-01
This paper introduces the goals of the research project on which this special issue of "Linguistics and Education" is based. A case is made for considering contemporary education as saturated by and dependent on oral and written language, and on beliefs and practices that relate knowledge, talk, reading and writing. The project is directed at a…
Towards Automatic Treatment of Natural Language.
ERIC Educational Resources Information Center
Lonsdale, Deryle
1984-01-01
Because automated natural language processing relies heavily on the still developing fields of linguistics, knowledge representation, and computational linguistics, no system is capable of mimicking human linguistic capabilities. For the present, interactive systems may be used to augment today's technology. (MSE)
Ideologeme "Order" in Modern American Linguistic World Image
ERIC Educational Resources Information Center
Ibatova, Aygul Z.; Vdovichenko, Larisa V.; Ilyashenko, Lubov K.
2016-01-01
The paper studies the topic of modern American linguistic world image. It is known that any language is the most important instrument of cognition of the world by a person but there is also no doubt that any language is the way of perception and conceptualization of this knowledge about the world. In modern linguistics linguistic world image is…
Aryani, Arash; Jacobs, Arthur M.; Conrad, Markus
2013-01-01
A growing body of literature in psychology, linguistics, and the neurosciences has paid increasing attention to the understanding of the relationships between phonological representations of words and their meaning: a phenomenon also known as phonological iconicity. In this article, we investigate how a text's intended emotional meaning, particularly in literature and poetry, may be reflected at the level of sublexical phonological salience and the use of foregrounded elements. To extract such elements from a given text, we developed a probabilistic model to predict the exceeding of a confidence interval for specific sublexical units concerning their frequency of occurrence within a given text contrasted with a reference linguistic corpus for the German language. Implementing this model in a computational application, we provide a text analysis tool which automatically delivers information about sublexical phonological salience allowing researchers, inter alia, to investigate effects of the sublexical emotional tone of texts based on current findings on phonological iconicity. PMID:24101907
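Illustrative sketch (a simplification under stated assumptions, not the authors' model): the idea of flagging a sublexical unit whose frequency in a text exceeds a confidence interval derived from a reference corpus can be pictured with a binomial model and a normal approximation. The reference probabilities, the toy unit sequence, the critical value z = 1.96, and the function name are all hypothetical.

```python
import math
from collections import Counter


def salient_units(text_units, reference_probs, z=1.96):
    """Return units whose observed count exceeds the upper confidence bound.

    For a text of n units and a reference probability p, the expected count is
    n * p with standard deviation sqrt(n * p * (1 - p)) under a binomial model.
    """
    n = len(text_units)
    counts = Counter(text_units)
    salient = {}
    for unit, observed in counts.items():
        p = reference_probs.get(unit)
        if p is None:
            continue  # unit not covered by the reference corpus
        upper = n * p + z * math.sqrt(n * p * (1 - p))
        if observed > upper:
            salient[unit] = (observed, upper)
    return salient


# Toy sequence of phoneme-like units and made-up reference probabilities.
units = list("kkkkkkaaab")
reference = {"k": 0.1, "a": 0.3, "b": 0.1}
flagged = salient_units(units, reference)
```

In this toy input only "k" is flagged: it occurs six times where about one occurrence is expected, while "a" and "b" stay within their bounds.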
Generating structure from experience: A retrieval-based model of language processing.
Johns, Brendan T; Jones, Michael N
2015-09-01
Standard theories of language generally assume that some abstraction of linguistic input is necessary to create higher level representations of linguistic structures (e.g., a grammar). However, the importance of individual experiences with language has recently been emphasized by both usage-based theories (Tomasello, 2003) and grounded and situated theories (e.g., Zwaan & Madden, 2005). Following the usage-based approach, we present a formal exemplar model that stores instances of sentences across a natural language corpus, applying recent advances from models of semantic memory. In this model, an exemplar memory is used to generate expectations about the future structure of sentences, using a mechanism for prediction in language processing (Altmann & Mirković, 2009). The model successfully captures a broad range of behavioral effects: reduced relative clause processing (Reali & Christiansen, 2007), the role of contextual constraint (Rayner & Well, 1996), and event knowledge activation (Ferretti, Kutas, & McRae, 2007), among others. We further demonstrate how perceptual knowledge could be integrated into this exemplar-based framework, with the goal of grounding language processing in perception. Finally, we illustrate how an exemplar memory system could have been used in the cultural evolution of language. The model provides evidence that an impressive amount of language processing may be bottom-up in nature, built on the storage and retrieval of individual linguistic experiences.
An Overview of Biomolecular Event Extraction from Scientific Documents
Vanegas, Jorge A.; Matos, Sérgio; González, Fabio; Oliveira, José L.
2015-01-01
This paper presents a review of state-of-the-art approaches to automatic extraction of biomolecular events from scientific texts. Events involving biomolecules such as genes, transcription factors, or enzymes, for example, have a central role in biological processes and functions and provide valuable information for describing physiological and pathogenesis mechanisms. Event extraction from biomedical literature has a broad range of applications, including support for information retrieval, knowledge summarization, and information extraction and discovery. However, automatic event extraction is a challenging task due to the ambiguity and diversity of natural language and higher-level linguistic phenomena, such as speculations and negations, which occur in biological texts and can lead to misunderstanding or incorrect interpretation. Many strategies have been proposed in the last decade, originating from different research areas such as natural language processing, machine learning, and statistics. This review summarizes the most representative approaches in biomolecular event extraction and presents an analysis of the current state of the art and of commonly used methods, features, and tools. Finally, current research trends and future perspectives are also discussed. PMID:26587051
A generic method for the evaluation of interval type-2 fuzzy linguistic summaries.
Boran, Fatih Emre; Akay, Diyar
2014-09-01
Linguistic summarization has turned out to be an important knowledge discovery technique by providing the most relevant natural language-based sentences in a human consistent manner. While many studies on linguistic summarization have handled ordinary fuzzy sets [type-1 fuzzy set (T1FS)] for modeling words, only a few of them have dealt with interval type-2 fuzzy sets (IT2FS), even though IT2FS are better able to handle the uncertainties associated with words. Furthermore, the existing studies work with the scalar cardinality based degree of truth, which might lead to inconsistency in the evaluation of interval type-2 fuzzy (IT2F) linguistic summaries. In this paper, to overcome this shortcoming, we propose a novel probabilistic degree of truth for evaluating IT2F linguistic summaries in the forms of type-I and type-II quantified sentences. We also extend the properties that should be fulfilled by any degree of truth on linguistic summarization with T1FS to the IT2F environment. We not only prove that our probabilistic degree of truth satisfies the given properties, but also illustrate by examples that it provides more consistent results when compared to the existing degree of truth in the literature. Furthermore, we carry out an application on linguistic summarization of time series data of Europe Brent Spot Price, along with a comparison of the results achieved with our approach and that of the existing degree of truth in the literature.
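Illustrative sketch of the baseline the abstract refers to, not of the proposed probabilistic measure: with ordinary (type-1) fuzzy sets, the scalar cardinality based degree of truth of a type-I summary "Q of the records are S" is the quantifier membership of the mean summarizer membership, and for a type-II summary "Q of the R records are S" it is the quantifier membership of the ratio of the min-intersection cardinality to the qualifier cardinality. The quantifier shape, the membership values, and the function names are hypothetical.

```python
def most(proportion):
    """Hypothetical piecewise-linear membership function for the quantifier 'most'."""
    if proportion <= 0.3:
        return 0.0
    if proportion >= 0.8:
        return 1.0
    return (proportion - 0.3) / 0.5


def truth_type1(mu_summarizer, quantifier):
    """Type-I sentence: 'Q of the records are S'."""
    return quantifier(sum(mu_summarizer) / len(mu_summarizer))


def truth_type2(mu_qualifier, mu_summarizer, quantifier):
    """Type-II sentence: 'Q of the R records are S'."""
    cardinality_r = sum(mu_qualifier)
    if cardinality_r == 0:
        return 0.0  # no record satisfies the qualifier to any degree
    both = sum(min(r, s) for r, s in zip(mu_qualifier, mu_summarizer))
    return quantifier(both / cardinality_r)


# Made-up memberships of four daily prices in "high" and of four days in "recent".
mu_high = [1.0, 1.0, 1.0, 0.0]
mu_recent = [1.0, 1.0, 0.0, 0.0]
mu_high_alt = [1.0, 0.5, 1.0, 0.0]

t1 = truth_type1(mu_high, most)                 # "most prices are high"
t2 = truth_type2(mu_recent, mu_high_alt, most)  # "most recent prices are high"
```

Both sentences reduce to a proportion of 0.75, which the assumed quantifier maps to a degree of truth of about 0.9.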
ERIC Educational Resources Information Center
Trapman, Mirjam; van Gelderen, Amos; van Schooten, Erik; Hulstijn, Jan
2017-01-01
In a longitudinal design, we measured 50 low-achieving adolescents' reading comprehension development from Grades 7 to 9. There were 24 native Dutch and 26 language minority students. In addition, we assessed the roles of (a) linguistic knowledge, (b) metacognitive knowledge, and (c) reading fluency in predicting both the level and growth of…
"Heading Up the Street:" Localized Opportunities for Shared Constructions of Knowledge.
ERIC Educational Resources Information Center
Lee, Carol D.; Majors, Yolanda J.
2003-01-01
Compares linguistic and non-linguistic components of ways of speaking, being, performing, and reasoning within an urban African American secondary classroom and a midwestern African American hair salon, identifying culturally shared interactional norms that inform knowledge building across sites and analyzing how the discourse norms and structures…
Strekalova, Yulia A
2017-06-01
Significant barriers to participant recruitment for clinical research (CR) are related to effective communication, and nurse coordinators are entrusted with being knowledge brokers between investigators and prospective participants. This prospective cohort study sought to identify linguistic choices that could inform and facilitate recruitment efforts. Healthy adults (N = 204) were invited to join an online survey to assess the likelihood of participation in CR based on short and extended definitions of CR. The five short definitions were clinical trial, clinical study, health-related research study, community participatory study, and quality improvement study. The likelihood of participation in CR was the lowest for clinical trial and the highest for health-related research study. However, when only an extended definition was provided, those differences were not observed. A linguistic change from trial to study could lead to a positive attitude toward CR and improvements in recruitment. However, the ethical implications of linguistic choices should be considered.
REKRIATE: A Knowledge Representation System for Object Recognition and Scene Interpretation
NASA Astrophysics Data System (ADS)
Meystel, Alexander M.; Bhasin, Sanjay; Chen, X.
1990-02-01
What humans actually observe and how they comprehend this information is complex due to Gestalt processes and interaction of context in predicting the course of thinking and enforcing one idea while repressing another. How we extract the knowledge from the scene, what we get from the scene indeed and what we bring from our mechanisms of perception are areas separated by a thin, ill-defined line. The purpose of this paper is to present a system for Representing Knowledge and Recognizing and Interpreting Attention Trailed Entities, dubbed REKRIATE. It will be used as a tool for discovering the underlying principles involved in knowledge representation required for conceptual learning. REKRIATE has some inherited knowledge and is given a vocabulary which is used to form rules for identification of the object. It has various modalities of sensing and has the ability to measure the distance between the objects in the image as well as the similarity between different images of presumably the same object. All sensations received from a matrix of different sensors are put into an adequate form. The methodology proposed is applicable not only to the pictorial or visual world representation, but to any sensing modality. It is based upon two premises: (a) inseparability of all domains of the world representation, including the linguistic domain as well as those formed by various sensor modalities, and (b) representativity of the object at several levels of resolution simultaneously.
Netlang: A software for the linguistic analysis of corpora by means of complex networks
Serna Salazar, Diego; Isaza, Gustavo; Castillo Ossa, Luis F.; Bedia, Manuel G.
2017-01-01
To date there is no software that directly connects the linguistic analysis of a conversation to a network program. Network programs are able to extract statistical information from databases containing information about systems of interacting elements. Language has also been conceived and studied as a complex system. However, most proposals do not analyze language according to linguistic theory, but use instead computational systems that should save time at the price of leaving aside many crucial aspects for linguistic theory. Some approaches to network studies on language do apply precise linguistic analyses, made by a linguist. The problem until now has been the lack of interface between the analysis of a sentence and its integration into the network that could be managed by a linguist and that could save the analysis of any language. Previous works have used old software that was not created for these purposes and that often produced problems with some idiosyncrasies of the target language. The desired interface should be able to deal with the syntactic peculiarities of a particular language, the options of linguistic theory preferred by the user and the preservation of morpho-syntactic information (lexical categories and syntactic relations between items). Netlang is the first program able to do that. Recently, a new kind of linguistic analysis has been developed, which is able to extract a complexity pattern from the speaker's linguistic production which is depicted as a network where words are inside nodes, and these nodes connect to each other by means of edges or links (the information inside the edge can be syntactic, semantic, etc.). The Netlang software has become the bridge between raw linguistic data and the network program. Netlang has integrated and improved the functions of programs used in the past, namely the DGA annotator and two scripts (ToXML.pl and Xml2Pairs.py) used for transforming and pruning data. Netlang allows the researcher to make accurate linguistic analyses by means of syntactic dependency relations between words, while keeping a record of the nature of such syntactic relationships (subject, object, etc.). The Netlang software is presented as a new tool that solves many problems detected in the past. The most important improvement is that Netlang integrates three past applications into one program, and is able to produce a series of file formats that can be read by a network program. Through the Netlang software, the linguistic network analysis based on syntactic analyses, characterized by its low cost and completely non-invasive procedure, aims to evolve into a sufficiently fine grained tool for clinical diagnosis in potential cases of language disorders. PMID:28832598
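Illustrative sketch (not Netlang's actual data model or file format, which the abstract does not specify): the kind of network described above can be pictured as a directed graph whose nodes are words and whose edges carry the name of the syntactic dependency relation. The triple layout, the relation labels, and the function names are hypothetical.

```python
def build_network(dependencies):
    """Directed word network from (head, dependent, relation) triples.

    Returns a mapping head -> {dependent: relation}; every word gets a node,
    including words that never act as a head.
    """
    edges = {}
    for head, dependent, relation in dependencies:
        edges.setdefault(head, {})[dependent] = relation
        edges.setdefault(dependent, {})
    return edges


def degrees(edges):
    """Total degree (incoming plus outgoing edges) of every node."""
    degree = {word: len(out) for word, out in edges.items()}
    for out in edges.values():
        for dependent in out:
            degree[dependent] += 1
    return degree


# Hypothetical dependency analysis of "the dog chased a cat".
triples = [
    ("chased", "dog", "subject"),
    ("chased", "cat", "object"),
    ("dog", "the", "determiner"),
    ("cat", "a", "determiner"),
]
network = build_network(triples)
node_degrees = degrees(network)
```

Keeping the relation label on each edge preserves the distinction between, for example, subject and object links, which is the morpho-syntactic information the abstract says the interface should retain; statistics such as node degree can then be computed by a network program.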
Netlang: A software for the linguistic analysis of corpora by means of complex networks.
Barceló-Coblijn, Lluís; Serna Salazar, Diego; Isaza, Gustavo; Castillo Ossa, Luis F; Bedia, Manuel G
2017-01-01
Detection and categorization of bacteria habitats using shallow linguistic analysis
2015-01-01
Background: Information regarding bacteria biotopes is important for several research areas including health sciences, microbiology, and food processing and preservation. One of the challenges for scientists in these domains is the huge amount of information buried in the text of electronic resources. Developing methods to automatically extract bacteria habitat relations from the text of these electronic resources is crucial for facilitating research in these areas. Methods: We introduce a linguistically motivated rule-based approach for recognizing and normalizing names of bacteria habitats in biomedical text by using an ontology. Our approach is based on the shallow syntactic analysis of the text, which includes sentence segmentation, part-of-speech (POS) tagging, partial parsing, and lemmatization. In addition, we propose two methods for identifying bacteria habitat localization relations. The underlying assumption for the first method is that discourse changes with a new paragraph. Therefore, it operates on a paragraph basis. The second method performs a more fine-grained analysis of the text and operates on a sentence basis. We also develop a novel anaphora resolution method for bacteria coreferences and incorporate it with the sentence-based relation extraction approach. Results: We participated in the Bacteria Biotope (BB) Task of the BioNLP Shared Task 2013. Our system (Boun) achieved the second best performance with 68% Slot Error Rate (SER) in Sub-task 1 (Entity Detection and Categorization), and ranked third with an F-score of 27% in Sub-task 2 (Localization Event Extraction). This paper reports the system that is implemented for the shared task, including the novel methods developed and the improvements obtained after the official evaluation. The extensions include the expansion of the OntoBiotope ontology using the training set for Sub-task 1, and the novel sentence-based relation extraction method incorporated with anaphora resolution for Sub-task 2.
These extensions resulted in promising results for Sub-task 1 with a SER of 68%, and state-of-the-art performance for Sub-task 2 with an F-score of 53%. Conclusions: Our results show that a linguistically-oriented approach based on the shallow syntactic analysis of the text is as effective as machine learning approaches for the detection and ontology-based normalization of habitat entities. Furthermore, the newly developed sentence-based relation extraction system with the anaphora resolution module significantly outperforms the paragraph-based one, as well as the other systems that participated in the BB Shared Task 2013. PMID:26201262
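For readers unfamiliar with the two metrics reported in this abstract, the sketch below gives their textbook forms: Slot Error Rate as the sum of substitutions, deletions and insertions divided by the number of reference slots (lower is better), and F-score as the harmonic mean of precision and recall (higher is better). The shared task's official scorer may weight substitutions differently, so this is an assumption-laden illustration rather than the evaluation actually used; the function names and the example counts are hypothetical.

```python
def slot_error_rate(substitutions, deletions, insertions, reference_slots):
    """Textbook SER: all slot errors divided by the number of reference slots."""
    if reference_slots <= 0:
        raise ValueError("reference_slots must be positive")
    return (substitutions + deletions + insertions) / reference_slots


def f_score(precision, recall):
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical error counts, chosen only to show the arithmetic.
example_ser = slot_error_rate(substitutions=10, deletions=30,
                              insertions=28, reference_slots=100)
example_f = f_score(precision=0.5, recall=0.5)
```

Because insertions are counted against the reference size, SER can exceed 100% when a system proposes many spurious entities.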
Masanz, James J; Ogren, Philip V; Zheng, Jiaping; Sohn, Sunghwan; Kipper-Schuler, Karin C; Chute, Christopher G
2010-01-01
We aim to build and evaluate an open-source natural language processing system for information extraction from electronic medical record clinical free-text. We describe and evaluate our system, the clinical Text Analysis and Knowledge Extraction System (cTAKES), released open-source at http://www.ohnlp.org. The cTAKES builds on existing open-source technologies—the Unstructured Information Management Architecture framework and OpenNLP natural language processing toolkit. Its components, specifically trained for the clinical domain, create rich linguistic and semantic annotations. Performance of individual components: sentence boundary detector accuracy=0.949; tokenizer accuracy=0.949; part-of-speech tagger accuracy=0.936; shallow parser F-score=0.924; named entity recognizer and system-level evaluation F-score=0.715 for exact and 0.824 for overlapping spans, and accuracy for concept mapping, negation, and status attributes for exact and overlapping spans of 0.957, 0.943, 0.859, and 0.580, 0.939, and 0.839, respectively. Overall performance is discussed against five applications. The cTAKES annotations are the foundation for methods and modules for higher-level semantic processing of clinical free-text. PMID:20819853
Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge
ERIC Educational Resources Information Center
Lau, Jey Han; Clark, Alexander; Lappin, Shalom
2017-01-01
The question of whether humans represent grammatical knowledge as a binary condition on membership in a set of well-formed sentences, or as a probabilistic property has been the subject of debate among linguists, psychologists, and cognitive scientists for many decades. Acceptability judgments present a serious problem for both classical binary…
Linguistics and Information Science. LINCS Project Document Series.
ERIC Educational Resources Information Center
Montgomery, Christine A.
The relationship between the disciplines of linguistics and information science has not yet been studied in depth. We must assess the state of our knowledge of natural language and determine how this knowledge is applicable within the context of an information system. The concept of a natural language information system can be specified in terms…
Powell, Rachel K
2018-04-05
This lead article of the Clinical Forum focuses on the research that supports why speech-language pathologists (SLPs) are an integral part of the overarching curriculum for all students in schools. Focus on education has shifted to student performance in our global world, specifically in college and career readiness standards. This article reviews recommendations on best practice from the American Speech-Language-Hearing Association on SLPs' roles in schools, as well as data on school-based services. Implementation of these practices as it is applicable to school initiatives will be explored. Methods of interventions available in schools, from general education to special education, will be discussed based on national guidelines for a Response to Intervention and Multi-Tiered System of Support. Research regarding teacher knowledge of the linguistic principles of reading instruction will be explored, as well as correlation between teacher knowledge and student performance. The implications for how SLPs as the linguistic experts offer unique roles in curriculum and the evidence available to support this role will be explored. Implications for future research needs will be discussed. The demands of a highly rigorous curriculum allow SLPs a unique opportunity to apply their knowledge in linguistic principles to increase student performance and achievement. With the increased focus on student achievement, growth outcome measures, and value-added incentives, it is critical that SLPs become contributors to the curriculum for all students and that data to support this role are gathered through focused research initiatives.
Statistical Literacy among Applied Linguists and Second Language Acquisition Researchers
ERIC Educational Resources Information Center
Loewen, Shawn; Lavolette, Elizabeth; Spino, Le Anne; Papi, Mostafa; Schmidtke, Jens; Sterling, Scott; Wolff, Dominik
2014-01-01
The importance of statistical knowledge in applied linguistics and second language acquisition (SLA) research has been emphasized in recent publications. However, the last investigation of the statistical literacy of applied linguists occurred more than 25 years ago (Lazaraton, Riggenbach, & Ediger, 1987). The current study undertook a partial…
[Prosody, speech input and language acquisition].
Jungheim, M; Miller, S; Kühn, D; Ptok, M
2014-04-01
In order to acquire language, children require speech input. The prosody of the speech input plays an important role. In most cultures adults modify their code when communicating with children. Compared to normal speech this code differs especially with regard to prosody. For this review a selective literature search in PubMed and Scopus was performed. Prosodic characteristics are a key feature of spoken language. By analysing prosodic features, children gain knowledge about underlying grammatical structures. Child-directed speech (CDS) is modified in a way that meaningful sequences are highlighted acoustically so that important information can be extracted from the continuous speech flow more easily. CDS is said to enhance the representation of linguistic signs. Taking into consideration what has previously been described in the literature regarding the perception of suprasegmentals, CDS seems to be able to support language acquisition due to the correspondence of prosodic and syntactic units. However, no findings have been reported indicating that the linguistically reduced CDS could hinder first language acquisition.
ERIC Educational Resources Information Center
Roth McDuffie, Amy; Foote, Mary Q.; Bolson, Catherine; Turner, Erin E.; Aguirre, Julia M.; Bartell, Tonya Gau; Drake, Corey; Land, Tonia
2014-01-01
As part of a larger research project aimed at transforming preK-8 mathematics teacher preparation, the purpose of this study was to examine the extent to which prospective teachers notice children's competencies related to children's mathematical thinking, and children's community, cultural, and linguistic funds of knowledge or what…
Linguistic Model for Engine Power Loss
2011-11-27
Intelligent Vehicle Health Management System (IVHMS) for light trucks. In particular, this paper is focused on the system architecture for monitoring...developed for the cooling system of a diesel engine, integrating a priori, ‘expert’ knowledge, sensor data, and the adaptive network-based fuzzy...domain knowledge. However, in a nonlinear system in which not all possible causes of engine power loss are considered and measured, merely relying
Soto, Axel J; Zerva, Chrysoula; Batista-Navarro, Riza; Ananiadou, Sophia
2018-04-15
Pathway models are valuable resources that help us understand the various mechanisms underpinning complex biological processes. Their curation is typically carried out through manual inspection of published scientific literature to find information relevant to a model, which is a laborious and knowledge-intensive task. Furthermore, models curated manually cannot be easily updated and maintained with new evidence extracted from the literature without automated support. We have developed LitPathExplorer, a visual text analytics tool that integrates advanced text mining, semi-supervised learning and interactive visualization, to facilitate the exploration and analysis of pathway models using statements (i.e. events) extracted automatically from the literature and organized according to levels of confidence. LitPathExplorer supports pathway modellers and curators alike by: (i) extracting events from the literature that corroborate existing models with evidence; (ii) discovering new events which can update models; and (iii) providing a confidence value for each event that is automatically computed based on linguistic features and article metadata. Our evaluation of event extraction showed a precision of 89% and a recall of 71%. Evaluation of our confidence measure, when used for ranking sampled events, showed an average precision ranging between 61 and 73%, which can be improved to 95% when the user is involved in the semi-supervised learning process. Qualitative evaluation using pair analytics based on the feedback of three domain experts confirmed the utility of our tool within the context of pathway model exploration. LitPathExplorer is available at http://nactem.ac.uk/LitPathExplorer_BI/. sophia.ananiadou@manchester.ac.uk. Supplementary data are available at Bioinformatics online.
ERIC Educational Resources Information Center
Minagawa, Harumi
2017-01-01
This paper reports students' experiences of a coursework task in a Japanese linguistics course that embraces certain aspects of collaborative learning--aspects that are not practised widely in Japanese language learning situations. These involve the students looking at themselves as well as their fellow students as producers of knowledge and…
Knowledge-based processing for aircraft flight control
NASA Technical Reports Server (NTRS)
Painter, John H.
1991-01-01
The purpose is to develop algorithms and architectures for embedding artificial intelligence in aircraft guidance and control systems. With the approach adopted, AI-computing is used to create an outer guidance loop for driving the usual aircraft autopilot. That is, a symbolic processor monitors the operation and performance of the aircraft. Then, based on rules and other stored knowledge, commands are automatically formulated for driving the autopilot so as to accomplish desired flight operations. The focus is on developing a software system which can respond to linguistic instructions, input in a standard format, so as to formulate a sequence of simple commands to the autopilot. The instructions might be a fairly complex flight clearance, input either manually or by data-link. Emphasis is on a software system which responds much like a pilot would, employing not only precise computations, but, also, knowledge which is less precise, but more like common-sense. The approach is based on prior work to develop a generic 'shell' architecture for an AI-processor, which may be tailored to many applications by describing the application in appropriate processor data bases (libraries). Such descriptions include numerical models of the aircraft and flight control system, as well as symbolic (linguistic) descriptions of flight operations, rules, and tactics.
Generalized event knowledge activation during online sentence comprehension
Metusalem, Ross; Kutas, Marta; Urbach, Thomas P.; Hare, Mary; McRae, Ken; Elman, Jeffrey L.
2012-01-01
Recent research has demonstrated that knowledge of real-world events plays an important role in guiding online language comprehension. The present study addresses the scope of event knowledge activation during the course of comprehension, specifically investigating whether activation is limited to those knowledge elements that align with the local linguistic context. The present study addresses this issue by analyzing event-related brain potentials (ERPs) recorded as participants read brief scenarios describing typical real-world events. Experiment 1 demonstrates that a contextually anomalous word elicits a reduced N400 if it is generally related to the described event, even when controlling for the degree of association of this word with individual words in the preceding context and with the expected continuation. Experiment 2 shows that this effect disappears when the discourse context is removed. These findings demonstrate that during the course of incremental comprehension, comprehenders activate general knowledge about the described event, even at points at which this knowledge would constitute an anomalous continuation of the linguistic stream. Generalized event knowledge activation contributes to mental representations of described events, is immediately available to influence language processing, and likely drives linguistic expectancy generation. PMID:22711976
On the Application of Syntactic Methodologies in Automatic Text Analysis.
ERIC Educational Resources Information Center
Salton, Gerard; And Others
1990-01-01
Summarizes various linguistic approaches proposed for document analysis in information retrieval environments. Topics discussed include syntactic analysis; use of machine-readable dictionary information; knowledge base construction; the PLNLP English Grammar (PEG) system; phrase normalization; and statistical and syntactic phrase evaluation used…
ERIC Educational Resources Information Center
Fedorenko, Evelina; Nieto-Castanon, Alfonso; Kanwisher, Nancy
2012-01-01
Work in theoretical linguistics and psycholinguistics suggests that human linguistic knowledge forms a continuum between individual lexical items and abstract syntactic representations, with most linguistic representations falling between the two extremes and taking the form of lexical items stored together with the syntactic/semantic contexts in…
Research of MPPT for photovoltaic generation based on two-dimensional cloud model
NASA Astrophysics Data System (ADS)
Liu, Shuping; Fan, Wei
2013-03-01
The cloud model is a mathematical representation of the fuzziness and randomness in linguistic concepts. It represents a qualitative concept with three parameters, expected value Ex, entropy En and hyper entropy He, and integrates the fuzziness and randomness of a linguistic concept in a unified way. The model is a new method for transforming between qualitative and quantitative knowledge. This paper introduces a maximum power point tracking (MPPT) controller based on a two-dimensional cloud model, developed by analysing auto-optimization MPPT control of a photovoltaic power system and combining it with cloud model theory. Simulation results show that the cloud controller is simple, intuitive, and strongly robust, with better control performance.
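The role of the three parameters Ex, En and He can be made concrete with the forward normal cloud generator commonly given in the cloud model literature: each "drop" draws a perturbed entropy En' from a normal distribution with mean En and standard deviation He, then draws a value x from a normal distribution with mean Ex and standard deviation |En'|, and assigns it the membership degree exp(-(x - Ex)^2 / (2 En'^2)). The sketch below is one-dimensional and is not the two-dimensional MPPT controller of the paper; the function name, the parameter values and the seed are assumptions chosen for illustration.

```python
import math
import random


def normal_cloud_drops(ex, en, he, n, seed=None):
    """Generate n (x, membership) drops of a one-dimensional normal cloud.

    ex: expected value, en: entropy, he: hyper entropy (he >= 0).
    """
    rng = random.Random(seed)
    drops = []
    for _ in range(n):
        en_prime = rng.gauss(en, he)  # entropy perturbed by the hyper entropy
        if en_prime == 0:
            # Degenerate drop: no spread, so the value sits exactly at ex.
            drops.append((ex, 1.0))
            continue
        x = rng.gauss(ex, abs(en_prime))
        membership = math.exp(-((x - ex) ** 2) / (2 * en_prime ** 2))
        drops.append((x, membership))
    return drops


# Hypothetical concept centred on 0 with moderate fuzziness and randomness.
drops = normal_cloud_drops(ex=0.0, en=1.0, he=0.1, n=500, seed=1)
```

With He set to 0 the generator reduces to an ordinary Gaussian membership function; a larger He thickens the cloud, which is how the model expresses uncertainty about the concept's own fuzziness.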
NASA Astrophysics Data System (ADS)
Croft, William
2016-03-01
Arbib's computational comparative neuroprimatology [1] is a welcome model for cognitive linguists, that is, linguists who ground their models of language in human cognition and language use in social interaction. Arbib argues that language emerged via biological and cultural coevolution [1]; linguistic knowledge is represented by constructions, and semantic representations of linguistic constructions are grounded in embodied perceptual-motor schemas (the mirror system hypothesis). My comments offer some refinements from a linguistic point of view.
[An essay about science and linguistics].
Cugini, P
2011-01-01
Both the methodology and the epistemology of science provide the criteria by which scientific research can describe and interpret the data and results of its observational or experimental studies. When the scientist approaches the conclusive inference, it is mandatory to consider that both knowledge and truth imply the use of words that are semantically and etymologically (semiologically) appropriate, especially if neologisms are required. Lacking a vocabulary, there will be a need to popularize the inference in the language of the context to which the message is addressed. This could imply a discrepancy among science, knowledge, truth and linguistics that can be defined as "semiologic bias". To avoid this linguistic error, the scientist must feel the responsibility to provide the scientific community with new words that are semantically and etymologically coherent with what has been scientifically discovered.
Tilsen, Sam; Arvaniti, Amalia
2013-07-01
This study presents a method for analyzing speech rhythm using empirical mode decomposition of the speech amplitude envelope, which allows for extraction and quantification of syllabic- and supra-syllabic time-scale components of the envelope. The method of empirical mode decomposition of a vocalic energy amplitude envelope is illustrated in detail, and several types of rhythm metrics derived from this method are presented. Spontaneous speech extracted from the Buckeye Corpus is used to assess the effect of utterance length on metrics, and it is shown how metrics representing variability in the supra-syllabic time-scale components of the envelope can be used to identify stretches of speech with targeted rhythmic characteristics. Furthermore, the envelope-based metrics are used to characterize cross-linguistic differences in speech rhythm in the UC San Diego Speech Lab corpus of English, German, Greek, Italian, Korean, and Spanish speech elicited in read sentences, read passages, and spontaneous speech. The envelope-based metrics exhibit significant effects of language and elicitation method that argue for a nuanced view of cross-linguistic rhythm patterns.
Interpreting "I don't know" use by persons living with dementia in Mini-Mental State Examinations.
Hesson, Ashley M; Pichler, Heike
2016-09-01
We investigate dementia patients' use of "I don't know" (IDK) in Mini-Mental State Exams (MMSEs) using objective linguistic indicators to differentiate IDK signalling lack of knowledge (LOK) from IDK used to hedge responses, affect exam progression etc. We hypothesize that increased proportional use of LOK-IDK correlates with worsening dementia severity. 189 IDK tokens were extracted from 72 MMSE interactions and coded for linguistic/social characteristics. A data-driven, discourse position/relation-based functional taxonomy for IDK in MMSE was developed and the resulting functional distribution was subjected to multiple logistic regression. Use of LOK-IDK (vs. non-LOK-IDK) is significantly correlated (p=0.01) with clinicians' subjective ratings of patients' dementia as 'severe' vs. 'mild'/'moderate', indicating that objective sociolinguistic criteria approximate physician judgments. 92% of 'severe' patients' IDKs signalled LOK, compared to only 68% of 'mild' patients', suggesting that uncritical interpretation of IDK as signalling LOK would result in 8-32% of IDK responses being mis-scored. LOK and non-LOK uses distinguished on the basis of reliable, objective usage patterns are differentially distributed among dementia severity groups. LOK-IDK serves as a supplemental indicator of dementia severity. Correct interpretation may improve diagnostic accuracy and allow clinicians to respond supportively during cognitive assessment. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
ERIC Educational Resources Information Center
Mueller Gathercole, Virginia C.; Thomas, Enlli Mon; Jones, Leah; Guasch, Nestor Vinas; Young, Nia; Hughes, Emma K.
2010-01-01
This study explores the extent to which a bilingual advantage can be observed for executive function tasks in children of varying levels of language dominance, and examines the contributions of general cognitive knowledge, linguistic abilities, language use and socio-economic level to performance. Welsh-English bilingual and English monolingual…
ERIC Educational Resources Information Center
He, Hao
2008-01-01
Minority students' English learning is a special and indispensable component of the English education system in China. This article studies the linguistic knowledge of students who live in Northwestern China, in Gan Nan Autonomy State of Gan Su Province, which has a majority Tibetan population mixed with Chinese and some Muslims. An analogous analysis is…
Knowledge discovery by accuracy maximization
Cacciatore, Stefano; Luchinat, Claudio; Tenori, Leonardo
2014-01-01
Here we describe KODAMA (knowledge discovery by accuracy maximization), an unsupervised and semisupervised learning algorithm that performs feature extraction from noisy and high-dimensional data. Unlike other data mining methods, the peculiarity of KODAMA is that it is driven by an integrated procedure of cross-validation of the results. The discovery of a local manifold’s topology is led by a classifier through a Monte Carlo procedure of maximization of cross-validated predictive accuracy. Briefly, our approach differs from previous methods in that it has an integrated procedure of validation of the results. In this way, the method ensures the highest robustness of the obtained solution. This robustness is demonstrated on experimental datasets of gene expression and metabolomics, where KODAMA compares favorably with other existing feature extraction methods. KODAMA is then applied to an astronomical dataset, revealing unexpected features. Interesting and not easily predictable features are also found in the analysis of the State of the Union speeches by American presidents: KODAMA reveals an abrupt linguistic transition sharply separating all post-Reagan from all pre-Reagan speeches. The transition occurs during Reagan’s presidency and not from its beginning. PMID:24706821
Language Comprehension and the Acquisition of Knowledge.
ERIC Educational Resources Information Center
Freedle, Roy O., Ed.; Carroll, John B., Ed.
Thirteen papers given by language specialists are presented. These analyze special linguistic (semantic) problems that occur when interconnected strings of sentences constitute data base; they also analyze special psychological problems (of memory, inference, and motivation) that occur when human subjects are exposed to discourse materials in…
People Use their Knowledge of Common Events to Understand Language, and Do So as Quickly as Possible
McRae, Ken; Matsuki, Kazunaga
2011-01-01
People possess a great deal of knowledge about how the world works, and it is undoubtedly true that adults use this knowledge when understanding and producing language. However, psycholinguistic theories differ regarding whether this extra-linguistic pragmatic knowledge can be activated and used immediately, or only after a delay. The authors present research that investigates whether people immediately use their generalized knowledge of common events when understanding language. This research demonstrates that (i) individual isolated words immediately activate event-based knowledge; (ii) combinations of words in sentences immediately constrain people’s event-based expectations for concepts that are upcoming in language; (iii) syntax modulates people’s expectations for ensuing concepts; and (iv) event-based knowledge can produce expectations for ensuing syntactic structures. It is concluded that theories of sentence comprehension must allow for the rapid dynamic interplay among these sources of information. PMID:22125574
Non-linguistic learning and aphasia: Evidence from a paired associate and feedback-based task
Vallila-Rohter, Sofia; Kiran, Swathi
2013-01-01
Though aphasia is primarily characterized by impairments in the comprehension and/or expression of language, research has shown that patients with aphasia also show deficits in cognitive-linguistic domains such as attention, executive function, concept knowledge and memory (Helm-Estabrooks, 2002 for review). Research in aphasia suggests that cognitive impairments can impact the online construction of language, new verbal learning, and transactional success (Freedman & Martin, 2001; Hula & McNeil, 2008; Ramsberger, 2005). In our research, we extend this hypothesis to suggest that general cognitive deficits influence progress with therapy. The aim of our study is to explore learning, a cognitive process that is integral to relearning language, yet underexplored in the field of aphasia rehabilitation. We examine non-linguistic category learning in patients with aphasia (n=19) and in healthy controls (n=12), comparing feedback and non-feedback based instruction. Participants complete two computer-based learning tasks that require them to categorize novel animals based on the percentage of features shared with one of two prototypes. As hypothesized, healthy controls showed successful category learning following both methods of instruction. In contrast, only 60% of our patient population demonstrated successful non-linguistic category learning. Patient performance was not predictable by standardized measures of cognitive ability. Results suggest that general learning is affected in aphasia and is a unique, important factor to consider in the field of aphasia rehabilitation. PMID:23127795
NASA Astrophysics Data System (ADS)
Monterde Rey, Ana Maria
In the area of terminology, one can find very little literature about the relationships and dependencies between linguistic and non-linguistic forms of concept representation. Furthermore, a large gap exists in the studies of non-linguistic forms. All of this constitutes the central problem that we attempt to solve in our thesis. Following an onomasiologic process of creating a terminological database, we have analysed and related, using three levels of specialisation (expert, student, and general public), the various linguistic forms (term, definition, and explanation) and a non-linguistic form (illustration) of concept representation in the area of aeronautical fuel-system installations. Specifically, for the aforementioned forms of conceptual representation, we have studied how the level of knowledge of the material is adapted to those to whom the texts are addressed. Additionally, we have examined the formation, origin, etymology, foreign words, polysemy, synonymy, and typology of each term. We have also described in detail the following characteristics of each type of illustration isolated in our corpus: the relationship to the object or to the concept, the existence of text and terms (linguistic media) within the illustrations, the degree of abstraction, the a priori knowledge necessary to interpret the illustrations, and the existence of graphic symbols. Finally, we have related all linguistic and non-linguistic forms of conceptual representation.
An Individual Subjectivist Critique of the Use of Corpus Linguistics to Inform Pedagogical Materials
ERIC Educational Resources Information Center
Richards, Kendall; Pilcher, Nick
2016-01-01
Corpus linguistics, or the gathering together of language into a body for analysis and development of materials, is claimed to be an assured, established method (or field) that valuably informs pedagogical materials and knowledge of language (e.g. Ädel 2010; Gardner & Nesi, 2013). The fundamental validity of corpus linguistics is rarely, if…
Acquisition and Use of Linguistic Knowledge: Scrambling in Child Japanese as a Test Case
ERIC Educational Resources Information Center
Minai, Utako; Isobe, Miwa; Okabe, Reiko
2015-01-01
The current study investigates preschool-age children's comprehension of scrambled sentences in Japanese. While scrambling has been known to be challenging for children, biasing them to exhibit non-adult-like interpretations (e.g., Hayashibe in "Descr Appl Linguist" 8:1-18, 1975; Sano in "Descr Appl Linguist" 10:213-233, 1977;…
The Goals of Linguistic Theory Revisited.
ERIC Educational Resources Information Center
Schank, Roger C.; Wilks, Yorick
There is a need for a new kind of linguistic theory which, while being concerned with both generation and analysis, must include the roles of memory, non-linguistic knowledge, and inference. The role of logic is diminished according to such a theory because inference has no real logical content. Meaning must be studied with respect to the actual…
ERIC Educational Resources Information Center
Albashtawi, Abeer H.; Jaganathan, Paramaswari; Singh, Manjet
2016-01-01
This study aimed to investigate the linguistic knowledge aspect in academic reading, the challenges and the deployed strategies by English major undergraduates at a Jordanian institution of higher education. The importance of the study is attributed to the importance of the academic reading at university which is closely related to the academic…
Metusalem, Ross; Kutas, Marta; Urbach, Thomas P.; Elman, Jeffrey L.
2016-01-01
During incremental language comprehension, the brain activates knowledge of described events, including knowledge elements that constitute semantic anomalies in their linguistic context. The present study investigates hemispheric asymmetries in this process, with the aim of advancing our understanding of the neural basis and functional properties of event knowledge activation during incremental comprehension. In a visual half-field event-related brain potential (ERP) experiment, participants read brief discourses in which the third sentence contained a word that was either highly expected, semantically anomalous but related to the described event, or semantically anomalous but unrelated to the described event. For both visual fields of target word presentation, semantically anomalous words elicited N400 ERP components of greater amplitude than did expected words. Crucially, event-related anomalous words elicited a reduced N400 relative to event-unrelated anomalous words only with left visual field/right hemisphere presentation. This result suggests that right hemisphere processes are critical to the activation of event knowledge elements that violate the linguistic context, and in doing so informs existing theories of hemispheric asymmetries in semantic processing during language comprehension. Additionally, this finding coincides with past research suggesting a crucial role for the right hemisphere in elaborative inference generation, raises interesting questions regarding hemispheric coordination in generating event-specific linguistic expectancies, and more generally highlights the possibility of functional dissociation between event knowledge activation for the generation of elaborative inferences and for linguistic expectancies. PMID:26878980
Metusalem, Ross; Kutas, Marta; Urbach, Thomas P; Elman, Jeffrey L
2016-04-01
During incremental language comprehension, the brain activates knowledge of described events, including knowledge elements that constitute semantic anomalies in their linguistic context. The present study investigates hemispheric asymmetries in this process, with the aim of advancing our understanding of the neural basis and functional properties of event knowledge activation during incremental comprehension. In a visual half-field event-related brain potential (ERP) experiment, participants read brief discourses in which the third sentence contained a word that was either highly expected, semantically anomalous but related to the described event (Event-Related), or semantically anomalous but unrelated to the described event (Event-Unrelated). For both visual fields of target word presentation, semantically anomalous words elicited N400 ERP components of greater amplitude than did expected words. Crucially, Event-Related anomalous words elicited a reduced N400 relative to Event-Unrelated anomalous words only with left visual field/right hemisphere presentation. This result suggests that right hemisphere processes are critical to the activation of event knowledge elements that violate the linguistic context, and in doing so informs existing theories of hemispheric asymmetries in semantic processing during language comprehension. Additionally, this finding coincides with past research suggesting a crucial role for the right hemisphere in elaborative inference generation, raises interesting questions regarding hemispheric coordination in generating event-specific linguistic expectancies, and more generally highlights the possibility of functional dissociation of event knowledge activation for the generation of elaborative inferences and for linguistic expectancies. Copyright © 2016 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Giebler, Ralf
2012-01-01
It has been suggested recently that it may be useful for language teaching practitioners to have some knowledge of cognitive linguistics. Cognitive linguistics (CL) provides tools that may help the language-teaching practitioner to gain insight into the semantic potential of words and communicate the meaning of lexical chunks in greater detail…
Refining the quantitative pathway of the Pathways to Mathematics model.
Sowinski, Carla; LeFevre, Jo-Anne; Skwarchuk, Sheri-Lynn; Kamawar, Deepthi; Bisanz, Jeffrey; Smith-Chant, Brenda
2015-03-01
In the current study, we adopted the Pathways to Mathematics model of LeFevre et al. (2010). In this model, there are three cognitive domains--labeled as the quantitative, linguistic, and working memory pathways--that make unique contributions to children's mathematical development. We attempted to refine the quantitative pathway by combining children's (N=141 in Grades 2 and 3) subitizing, counting, and symbolic magnitude comparison skills using principal components analysis. The quantitative pathway was examined in relation to dependent numerical measures (backward counting, arithmetic fluency, calculation, and number system knowledge) and a dependent reading measure, while simultaneously accounting for linguistic and working memory skills. Analyses controlled for processing speed, parental education, and gender. We hypothesized that the quantitative, linguistic, and working memory pathways would account for unique variance in the numerical outcomes; this was the case for backward counting and arithmetic fluency. However, only the quantitative and linguistic pathways (not working memory) accounted for unique variance in calculation and number system knowledge. Not surprisingly, only the linguistic pathway accounted for unique variance in the reading measure. These findings suggest that the relative contributions of quantitative, linguistic, and working memory skills vary depending on the specific cognitive task. Copyright © 2014 Elsevier Inc. All rights reserved.
Talking to Students: Metadiscourse in Introductory Coursebooks.
ERIC Educational Resources Information Center
Hyland, Ken
1999-01-01
Explores role of college textbooks in students' acquisition of special disciplinary literacy, focusing on use of metadiscourse as manifestation of writer's linguistic and rhetorical presence in a text. Features are compared from 21 textbook extracts in microbiology, marketing, and applied linguistics with similar corpus of research articles,…
Validation and Comprehension of Text Information: Two Sides of the Same Coin
ERIC Educational Resources Information Center
Richter, Tobias
2015-01-01
In psychological research, the comprehension of linguistic information and the knowledge-based assessment of its validity are often regarded as two separate stages of information processing. Recent findings in psycholinguistics and text comprehension research call this two-stage model into question. In particular, validation can affect…
Linguistically Motivated Features for CCG Realization Ranking
ERIC Educational Resources Information Center
Rajkumar, Rajakrishnan
2012-01-01
Natural Language Generation (NLG) is the process of generating natural language text from an input, which is a communicative goal and a database or knowledge base. Informally, the architecture of a standard NLG system consists of the following modules (Reiter and Dale, 2000): content determination, sentence planning (or microplanning) and surface…
ERIC Educational Resources Information Center
Wolf, Maryanne; Gottwald, Stephanie
2016-01-01
Maryanne Wolf's early literacy knowledge is based on her research into deep reading and on periods of enlightened linguistic processes. In her own rich language of conveyance, she brought great inspiration to the Montessori teachers at the Columbia, South Carolina conference. Her presentation on the research of early reading, the acquisition of…
ERIC Educational Resources Information Center
Schissel, Jamie L.; Leung, Constant; López-Gopar, Mario; Davis, James R.
2018-01-01
The assessments designed for and analyzed in this study used a task-based language design template rooted in theories of language reflecting heteroglossic language practices and funds of knowledge learning theories, which were understood as transforming classroom teaching, learning, and assessment through continua of biliteracy lenses. Using a…
Webquests for English-Language Learners: Essential Elements for Design
ERIC Educational Resources Information Center
Sox, Amanda; Rubinstein-Avila, Eliane
2009-01-01
The authors of this article advocate for the adaptation and use of WebQuests (web-based interdisciplinary collaborative learning units) to integrate technological competencies and content area knowledge development at the secondary level and to support the linguistic needs of English-language learners (ELLs). After examining eight WebQuests, the…
Metaphor Analysis in the Educational Discourse: A Critical Review
ERIC Educational Resources Information Center
Zheng, Hong-bo; Song, Wen-juan
2010-01-01
Metaphor analysis is based on the belief that metaphor is a powerful linguistic device, because it extends and encapsulates knowledge about the familiarity and unfamiliarity. Metaphor analysis has been adopted in the educational discourse. The paper categorizes the previous relevant research into 3: interactions between learners and institutions,…
ERIC Educational Resources Information Center
Horgan, Dianne
A study was conducted to determine whether the child expresses linguistic knowledge during the single-word period. The order of mention in 65 sets of successive single-word utterances from five children at Stage 1, two to four years old, was analyzed. To elicit speech, the children were shown line drawings representing such situations as animate…
A Risk Assessment System with Automatic Extraction of Event Types
NASA Astrophysics Data System (ADS)
Capet, Philippe; Delavallade, Thomas; Nakamura, Takuya; Sandor, Agnes; Tarsitano, Cedric; Voyatzi, Stavroula
In this article we describe the joint effort of experts in linguistics, information extraction and risk assessment to integrate EventSpotter, an automatic event extraction engine, into ADAC, an automated early warning system. By detecting as early as possible weak signals of emerging risks ADAC provides a dynamic synthetic picture of situations involving risk. The ADAC system calculates risk on the basis of fuzzy logic rules operated on a template graph whose leaves are event types. EventSpotter is based on a general purpose natural language dependency parser, XIP, enhanced with domain-specific lexical resources (Lexicon-Grammar). Its role is to automatically feed the leaves with input data.
2017-01-01
Evidence-based dietary information represented as unstructured text is a crucial resource that dietitians need to access in order to keep up with the new knowledge that arrives daily in newly published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They focus on, for example, extracting gene mentions, protein mentions, relationships between genes and proteins, chemical concepts, and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge, this is the first attempt at extracting dietary concepts. DrNER is a rule-based NER that consists of two phases. The first involves the detection and determination of the entity mentions, and the second involves the selection and extraction of the entities. We evaluate the method by using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. Evaluation of the method showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations. PMID:28644863
ERIC Educational Resources Information Center
Marckwardt, Albert H., Ed.
Authors in Section 1 of this yearbook distinguish between the special knowledge and tools employed by the linguists, and the concepts and conclusions which may be passed on to teachers; while authors in Section 2 deal specifically with linguistics in the school context--both its content and its implications for teaching strategies. Papers and…
ERIC Educational Resources Information Center
Ament, Jennifer R.; Pérez-Vidal, Carmen
2015-01-01
Globalisation and international mobility in the 21st century has led to the internationalisation of the English language (Crystal, 2003). Research regarding linguistic gains at university levels is however extremely scarce. This study aims to address this gap of knowledge and provide some answers as to how much linguistic gain can be expected…
1989-08-01
Automatic Line Network Extraction from Aerial Imagery of Urban Areas through Knowledge Based Image Analysis. Final Technical Report, December...Automatic Line Network Extraction from Aerial Imagery of Urban Areas through Knowledge Based Image Analysis...pattern recognition, blackboard oriented symbolic processing, knowledge based image analysis, image understanding, aerial imagery, urban area
Linguistic Summarization of Video for Fall Detection Using Voxel Person and Fuzzy Logic
Anderson, Derek; Luke, Robert H.; Keller, James M.; Skubic, Marjorie; Rantz, Marilyn; Aud, Myra
2009-01-01
In this paper, we present a method for recognizing human activity from linguistic summarizations of temporal fuzzy inference curves representing the states of a three-dimensional object called voxel person. A hierarchy of fuzzy logic is used, where the output from each level is summarized and fed into the next level. We present a two level model for fall detection. The first level infers the states of the person at each image. The second level operates on linguistic summarizations of voxel person’s states and inference regarding activity is performed. The rules used for fall detection were designed under the supervision of nurses to ensure that they reflect the manner in which elders perform these activities. The proposed framework is extremely flexible. Rules can be modified, added, or removed, allowing for per-resident customization based on knowledge about their cognitive and physical ability. PMID:20046216
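The two-level idea in the abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' model: the single input feature (centroid height of the voxel person), the trapezoid breakpoints, the 0.5 threshold, and the names `state_memberships`, `summarize`, and `fall_confidence` are all hypothetical. Level 1 assigns fuzzy state memberships per frame; level 2 summarizes the longest "on the ground" episode and applies one rule to that summary.

```python
def trapezoid(x, a, b, c, d):
    """Trapezoidal membership: 0 outside (a, d), 1 on [b, c], linear in between."""
    if x <= a or x >= d:
        return 0.0
    if b <= x <= c:
        return 1.0
    if x < b:
        return (x - a) / (b - a)
    return (d - x) / (d - c)


def state_memberships(centroid_height_m):
    """Level 1: fuzzy state of the person in one frame (breakpoints are assumed)."""
    return {
        "upright": trapezoid(centroid_height_m, 0.6, 0.9, 3.0, 3.1),
        "on_the_ground": trapezoid(centroid_height_m, -0.1, 0.0, 0.2, 0.5),
    }


def summarize(memberships, fps, threshold=0.5):
    """Summarize the longest run above threshold as (duration in s, mean membership)."""
    best_len, best_mean, run = 0, 0.0, []
    for mu in list(memberships) + [0.0]:  # trailing 0.0 closes an open run
        if mu >= threshold:
            run.append(mu)
        else:
            if len(run) > best_len:
                best_len, best_mean = len(run), sum(run) / len(run)
            run = []
    return best_len / fps, best_mean


def fall_confidence(heights_m, fps):
    """Level 2 rule: IF on the ground AND for a long time THEN fall (min as AND)."""
    ground = [state_memberships(h)["on_the_ground"] for h in heights_m]
    duration_s, strength = summarize(ground, fps)
    long_time = trapezoid(duration_s, 2.0, 5.0, 1e9, 1e9 + 1)  # assumed: "long" starts at 2-5 s
    return min(strength, long_time)
```

Because the rule operates on the summary rather than on raw frames, changing a breakpoint or adding a rule for a particular resident only touches level 2, which is the kind of per-resident customization the abstract describes.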
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dumidu Wijayasekara; Ondrej Linda; Milos Manic
Building Energy Management Systems (BEMSs) are essential components of modern buildings that utilize digital control technologies to minimize energy consumption while maintaining high levels of occupant comfort. However, BEMSs can only achieve these energy savings when properly tuned and controlled. Since the indoor environment is dependent on uncertain criteria such as weather, occupancy, and thermal state, performance of BEMS can be sub-optimal at times. Unfortunately, the complexity of the BEMS control mechanism, the large amount of data available and inter-relations between the data can make identifying these sub-optimal behaviors difficult. This paper proposes a novel Fuzzy Anomaly Detection and Linguistic Description (Fuzzy-ADLD) based method for improving the understandability of BEMS behavior for improved state-awareness. The presented method is composed of two main parts: 1) detection of anomalous BEMS behavior and 2) linguistic representation of BEMS behavior. The first part utilizes a modified nearest neighbor clustering algorithm and a fuzzy logic rule extraction technique to build a model of normal BEMS behavior. The second part of the presented method computes the most relevant linguistic description of the identified anomalies. The presented Fuzzy-ADLD method was applied to a real-world BEMS and compared against a traditional alarm based BEMS. In six different scenarios, the Fuzzy-ADLD method identified anomalous behavior either as fast as or faster (by an hour or more) than the alarm based BEMS. In addition, the Fuzzy-ADLD method identified cases that were missed by the alarm based system, demonstrating potential for increased state-awareness of abnormal building behavior.
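A rough sketch of the two parts named in the abstract above, under loud assumptions: this is a plain sequential nearest-neighbour clustering with a fixed radius, not the authors' modified algorithm, and the fuzzy rule extraction step is replaced by a simple "which feature deviates most" description. The feature names, the radius, and the functions `nn_cluster` and `describe` are hypothetical, and the features are assumed to be pre-scaled to comparable ranges.

```python
import math


def nn_cluster(points, radius):
    """Part 1 (model of normal behavior): sequential nearest-neighbour clustering.

    Each point joins the nearest cluster if it lies within `radius` of its
    center (the center is updated as a running mean); otherwise it starts a
    new cluster. Returns a list of [center, count] pairs.
    """
    clusters = []
    for p in points:
        best, best_d = None, float("inf")
        for c in clusters:
            d = math.dist(p, c[0])
            if d < best_d:
                best, best_d = c, d
        if best is not None and best_d <= radius:
            n = best[1]
            best[0] = [(m * n + x) / (n + 1) for m, x in zip(best[0], p)]
            best[1] = n + 1
        else:
            clusters.append([list(p), 1])
    return clusters


def describe(point, clusters, radius, names):
    """Part 2 (linguistic description): name the feature that deviates most."""
    center = min((c[0] for c in clusters), key=lambda m: math.dist(point, m))
    if math.dist(point, center) <= radius:
        return "normal"
    i = max(range(len(point)), key=lambda k: abs(point[k] - center[k]))
    direction = "higher" if point[i] > center[i] else "lower"
    return f"{names[i]} is {direction} than normal"


# Hypothetical readings: (zone temperature in deg C, damper position 0..1).
normal_readings = [(21.0, 0.4), (21.5, 0.5), (20.8, 0.45), (22.0, 0.5)]
feature_names = ["zone temperature", "damper position"]
model = nn_cluster(normal_readings, radius=2.0)
print(describe((27.0, 0.5), model, 2.0, feature_names))
```

The point of the sketch is the division of labor: the clustering step only decides whether a reading is unusual, while the description step turns the deviation into a sentence an operator can read without inspecting raw sensor values.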
Semi Automatic Ontology Instantiation in the domain of Risk Management
NASA Astrophysics Data System (ADS)
Makki, Jawad; Alquier, Anne-Marie; Prince, Violaine
One of the challenging tasks in the context of Ontological Engineering is to automatically or semi-automatically support the process of Ontology Learning and Ontology Population from semi-structured documents (texts). In this paper we describe a Semi-Automatic Ontology Instantiation method from natural language text, in the domain of Risk Management. This method is composed of three steps: 1) annotation with part-of-speech tags, 2) extraction of semantic relation instances, and 3) the ontology instantiation process. It is based on combined NLP techniques, using human intervention between steps 2 and 3 for control and validation. Since it relies heavily on linguistic knowledge, it is not domain dependent, which is a good feature for portability between the different fields of risk management application. The proposed methodology uses the ontology of the PRIMA project (supported by the European community) as a Generic Domain Ontology and populates it via an available corpus. A first validation of the approach is done through an experiment with Chemical Fact Sheets from the Environmental Protection Agency.
Adlassnig, Klaus-Peter; Fehre, Karsten; Rappelsberger, Andrea
2015-01-01
This study's objective is to develop and use a scalable genuine technology platform for clinical decision support based on Arden Syntax, which was extended by fuzzy set theory and fuzzy logic. Arden Syntax is a widely recognized formal language for representing clinical and scientific knowledge in an executable format, and is maintained by Health Level Seven (HL7) International and approved by the American National Standards Institute (ANSI). Fuzzy set theory and logic permit the representation of knowledge and automated reasoning under linguistic and propositional uncertainty. These forms of uncertainty are a common feature of patients' medical data, the body of medical knowledge, and deductive clinical reasoning.
Knowledge-Based Replanning System.
1987-05-01
appeared to be making significant progress. Theoretical linguistics pulled itself together in the late 1970s and early 1980s, and more and more AI...Richard, "Achieving Several Goals Simultaneously," Artificial Intelligence Center, Technical Note 107, SRI Project 2245, July 1975.
Incorporating Popular Literature into the Curriculum for Diverse Learners.
ERIC Educational Resources Information Center
Jairrels, Veda; Brazil, Nettye; Patton, James R.
1999-01-01
Discusses how teachers can use magazines written for culturally and linguistically diverse groups to increase their own knowledge base and to use as a resource for multicultural-education lesson planning in order to provide students with an opportunity to learn about high achieving individuals who come from backgrounds similar to their own.…
Language Modernization vs. Linguistic Protectionism
ERIC Educational Resources Information Center
Prifti, Erida
2009-01-01
Since 1991, when the fiercest of all Communist isolations broke and the borders to the world were finally opened, the Albanian language has been undergoing significant changes in its lexicon and, at a certain measure, in its structure. Numerous concepts have found their way into the Albanian knowledge base before an Albanian word was ever found to…
Creativity in the Language Classroom: Towards a "Vichian" Approach in Second Language Teaching.
ERIC Educational Resources Information Center
Danesi, Marcel; D'Alfonso, Aldo
1989-01-01
Describes a "Vichian" approach (involving linguistic imagination and creativity) to the exploration of basic pedagogical matters in classroom language teaching. The approach is based on principles involving: (1) concrete language knowledge; (2) development from the concrete to the abstract; (3) the role of metaphor in verbal creativity;…
Developing Early Place-Value Understanding: A Framework for Tens Awareness
ERIC Educational Resources Information Center
Young-Loveridge, Jenny; Bicknell, Brenda
2016-01-01
This paper outlines a framework to explain the early development of place-value understanding based on an analysis of data from 84 five- to seven-year-old children from diverse cultural and linguistic backgrounds. The children were assessed individually on number knowledge tasks (recalled facts, subitizing, counting, place-value understanding) and…
Language Learners' Acculturation Attitudes
ERIC Educational Resources Information Center
Rafieyan, Vahid; Orang, Maryam; Bijami, Maryam; Nejad, Maryam Sharafi; Eng, Lin Siew
2014-01-01
Learning a language involves knowledge of both linguistic competence and cultural competence. Optimal development of linguistic competence and cultural competence, however, requires a high level of acculturation attitude toward the target language culture. To this end, the present study explored the acculturation attitudes of 70 Iranian…
Automatic Extraction of Destinations, Origins and Route Parts from Human Generated Route Directions
NASA Astrophysics Data System (ADS)
Zhang, Xiao; Mitra, Prasenjit; Klippel, Alexander; Maceachren, Alan
Researchers from the cognitive and spatial sciences are studying text descriptions of movement patterns in order to examine how humans communicate and understand spatial information. In particular, route directions offer a rich source of information on how cognitive systems conceptualize movement patterns by segmenting them into meaningful parts. Route directions are composed using a plethora of cognitive spatial organization principles: changing levels of granularity, hierarchical organization, incorporation of cognitively and perceptually salient elements, and so forth. Identifying such information in text documents automatically is crucial for enabling machine-understanding of human spatial language. The benefits are: a) creating opportunities for large-scale studies of human linguistic behavior; b) extracting and georeferencing salient entities (landmarks) that are used by human route direction providers; c) developing methods to translate route directions to sketches and maps; and d) enabling queries on large corpora of crawled/analyzed movement data. In this paper, we introduce our approach and implementations that bring us closer to the goal of automatically processing linguistic route directions. We report on research directed at one part of the larger problem, that is, extracting the three most critical parts of route directions and movement patterns in general: origin, destination, and route parts. We use machine-learning based algorithms to extract these parts of routes, including, for example, destination names and types. We prove the effectiveness of our approach in several experiments using hand-tagged corpora.
Linguistic feature analysis for protein interaction extraction
2009-01-01
Background The rapid growth of the amount of publicly available reports on biomedical experimental results has recently caused a boost of text mining approaches for protein interaction extraction. Most approaches rely implicitly or explicitly on linguistic, i.e., lexical and syntactic, data extracted from text. However, only few attempts have been made to evaluate the contribution of the different feature types. In this work, we contribute to this evaluation by studying the relative importance of deep syntactic features, i.e., grammatical relations, shallow syntactic features (part-of-speech information) and lexical features. For this purpose, we use a recently proposed approach that uses support vector machines with structured kernels. Results Our results reveal that the contribution of the different feature types varies for the different data sets on which the experiments were conducted. The smaller the training corpus compared to the test data, the more important the role of grammatical relations becomes. Moreover, deep syntactic information based classifiers prove to be more robust on heterogeneous texts where no or only limited common vocabulary is shared. Conclusion Our findings suggest that grammatical relations play an important role in the interaction extraction task. Moreover, the net advantage of adding lexical and shallow syntactic features is small related to the number of added features. This implies that efficient classifiers can be built by using only a small fraction of the features that are typically being used in recent approaches. PMID:19909518
Science knowledge and cognitive strategy use among culturally and linguistically diverse students
NASA Astrophysics Data System (ADS)
Lee, Okhee; Fradd, Sandra H.; Sutman, Frank X.
Science performance is determined, to a large extent, by what students already know about science (i.e., science knowledge) and what techniques or methods students use in performing science tasks (i.e., cognitive strategies). This study describes and compares science knowledge, science vocabulary, and cognitive strategy use among four diverse groups of elementary students: (a) monolingual English Caucasian, (b) African-American, (c) bilingual Spanish, and (d) bilingual Haitian Creole. To facilitate science performance in culturally and linguistically congruent settings, the study included student dyads and teachers of the same language, culture, and gender. Science performance was observed using three science tasks: weather phenomena, simple machines, and buoyancy. Data analysis involved a range of qualitative methods focusing on major themes and patterns, and quantitative methods using coding systems to summarize frequencies and total scores. The findings reveal distinct patterns of science knowledge, science vocabulary, and cognitive strategy use among the four language and culture groups. The findings also indicate relationships among science knowledge, science vocabulary, and cognitive strategy use. These findings raise important issues about science instruction for culturally and linguistically diverse groups of students. Received: 3 January 1995.
The Importance of Being a Complement: CED Effects Revisited
ERIC Educational Resources Information Center
Jurka, Johannes
2010-01-01
This dissertation revisits subject island effects (Ross 1967, Chomsky 1973) cross-linguistically. Controlled acceptability judgment studies in German, English, Japanese and Serbian show that extraction out of specifiers is consistently degraded compared to extraction out of complements, indicating that the Condition on Extraction Domains (CED,…
"This war for men's minds": the birth of a human science in Cold War America.
Martin-Nielsen, Janet
2010-01-01
The past decade has seen an explosion of work on the history of the human sciences during the Cold War. This work, however, does not engage with one of the leading human sciences of the period: linguistics. This article begins to rectify this knowledge gap by investigating the influence of linguistics and its concept of study, language, on American public, political and intellectual life during the postwar and early Cold War years. I show that language emerged in three frameworks in this period: language as tool, language as weapon, and language as knowledge. As America stepped onto the international stage, language and linguistics were at the forefront: the military poured millions of dollars into machine translation, American diplomats were required to master scores of foreign languages, and schoolchildren were exposed to language-learning on a scale never before seen in the United States. Together, I argue, language and linguistics formed a critical part of the rise of American leadership in the new world order - one that provided communities as dispersed as the military, the diplomatic corps, scientists and language teachers with a powerful way of tackling the problems they faced. To date, linguistics has not been integrated into the broader framework of Cold War human sciences. In this article, I aim to bring both language, as concept, and linguistics, as discipline, into this framework. In doing so, I pave the way for future work on the history of linguistics as a human science.
A knowledge-based system for prototypical reasoning
NASA Astrophysics Data System (ADS)
Lieto, Antonio; Minieri, Andrea; Piana, Alberto; Radicioni, Daniele P.
2015-04-01
In this work we present a knowledge-based system equipped with a hybrid, cognitively inspired architecture for the representation of conceptual information. The proposed system aims at extending the classical representational and reasoning capabilities of the ontology-based frameworks towards the realm of the prototype theory. It is based on a hybrid knowledge base, composed of a classical symbolic component (grounded on a formal ontology) with a typicality based one (grounded on the conceptual spaces framework). The resulting system attempts to reconcile the heterogeneous approach to the concepts in Cognitive Science with the dual process theories of reasoning and rationality. The system has been experimentally assessed in a conceptual categorisation task where common sense linguistic descriptions were given in input, and the corresponding target concepts had to be identified. The results show that the proposed solution substantially extends the representational and reasoning 'conceptual' capabilities of standard ontology-based systems.
Chen, Zhenyu; Li, Jianping; Wei, Liwei
2007-10-01
Recently, gene expression profiling using microarray techniques has been shown to be a promising tool for improving the diagnosis and treatment of cancer. Gene expression data contain a high level of noise, and the number of genes is overwhelming relative to the number of available samples. This poses a great challenge for machine learning and statistical techniques. Support vector machines (SVMs) have been successfully used to classify gene expression data of cancer tissue. In the medical field, it is crucial to deliver a transparent decision process to the user. How to explain the computed solutions and present the extracted knowledge becomes a main obstacle for SVM. A multiple kernel support vector machine (MK-SVM) scheme, consisting of feature selection, rule extraction and prediction modeling, is proposed to improve the explanation capacity of SVM. In this scheme, we show that the feature selection problem can be translated into an ordinary multiple-parameter learning problem, and a shrinkage approach, 1-norm based linear programming, is proposed to obtain the sparse parameters and the corresponding selected features. We propose a novel rule extraction approach using the information provided by the separating hyperplane and support vectors to improve the generalization capacity and comprehensibility of rules and reduce the computational complexity. Two public gene expression datasets, the leukemia dataset and the colon tumor dataset, are used to demonstrate the performance of this approach. Using a small number of selected genes, MK-SVM achieves encouraging classification accuracy: more than 90% for both datasets. Moreover, very simple rules with linguistic labels are extracted. The rule sets have high diagnostic power because of their good classification performance.
Functional Neuroanatomy of Contextual Acquisition of Concrete and Abstract Words
ERIC Educational Resources Information Center
Mestres-Misse, Anna; Munte, Thomas F.; Rodriguez-Fornells, Antoni
2009-01-01
The meaning of a novel word can be acquired by extracting it from linguistic context. Here we simulated word learning of new words associated to concrete and abstract concepts in a variant of the human simulation paradigm that provided linguistic context information in order to characterize the brain systems involved. Native speakers of Spanish…
Mainela-Arnold, Elina; Evans, Julia L.
2016-01-01
Reduced verbal working memory capacity has been proposed as a possible account of language impairments in specific language impairment (SLI). Studies have shown, however, that differences in strength of linguistic representations in the form of word frequency affect list recall and performance on verbal working memory tasks. This suggests that verbal memory capacity and long-term linguistic knowledge may not be distinct constructs. It has been suggested that linguistic representations in SLI are weak in ways that result in a breakdown in language processing on tasks that require manipulation of unfamiliar material. In this study, the effects of word frequency, long-term linguistic knowledge, and serial order position on recall performance in the competing language processing task (CLPT) were investigated in 10 children with SLI and 10 age-matched peers (age 8 years 6 months to 12 years 4 months). The children with SLI recalled significantly fewer target words on the CLPT as compared with their age-matched controls. The SLI group did not differ, however, in their ability to recall target words having high word frequency but were significantly poorer in their ability to recall words on the CLPT having low word frequency. Differences in receptive and expressive language abilities also appeared closely related to performance on the CLPT, suggesting that working memory capacity is not distinct from language knowledge and that degraded linguistic representations may have an effect on performance on verbal working memory span tasks in children with SLI. PMID:16378481
Listening Comprehension: A Cognitive Prerequisite for Communication.
ERIC Educational Resources Information Center
Fischer, Robert A.
Proponents of the cognitive approach to language teaching list linguistic competence as the primary instructional objective and attribute considerable importance to listening comprehension. For the student, linguistic competence would be knowledge of grammatical components of the language and its vocabulary. Understanding oral messages is an…
Chemical-induced disease relation extraction with various linguistic features.
Gu, Jinghang; Qian, Longhua; Zhou, Guodong
2016-01-01
Understanding the relations between chemicals and diseases is crucial in various biomedical tasks such as new drug discoveries and new therapy developments. While manually mining these relations from the biomedical literature is costly and time-consuming, such a procedure is often difficult to keep up-to-date. To address these issues, the BioCreative-V community proposed a challenging task of automatic extraction of chemical-induced disease (CID) relations in order to benefit biocuration. This article describes our work on the CID relation extraction task on the BioCreative-V tasks. We built a machine learning based system that utilized simple yet effective linguistic features to extract relations with maximum entropy models. In addition to leveraging various features, the hypernym relations between entity concepts derived from the Medical Subject Headings (MeSH)-controlled vocabulary were also employed during both training and testing stages to obtain more accurate classification models and better extraction performance, respectively. We demoted relation extraction between entities in documents to relation extraction between entity mentions. In our system, pairs of chemical and disease mentions at both intra- and inter-sentence levels were first constructed as relation instances for training and testing, then two classification models at both levels were trained from the training examples and applied to the testing examples. Finally, we merged the classification results from mention level to document level to acquire final relations between chemicals and diseases. Our system achieved promising F-scores of 60.4% on the development dataset and 58.3% on the test dataset using gold-standard entity annotations, respectively. Database URL: https://github.com/JHnlp/BC5CIDTask. © The Author(s) 2016. Published by Oxford University Press.
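The final merging and scoring steps described in this abstract can be illustrated with a minimal Python sketch. The abstract does not state the merging rule, so the rule used here (a document-level relation is kept if any of its mention pairs is classified positive) is an assumption, and the function names and toy identifiers are hypothetical.

```python
def merge_to_document_level(mention_predictions):
    """Collapse mention-pair predictions into document-level relations.

    mention_predictions: iterable of (doc_id, chemical_id, disease_id, is_positive).
    Assumed rule (not taken from the paper): a (chemical, disease) pair is
    kept for a document if ANY of its mention pairs was classified positive.
    """
    relations = set()
    for doc_id, chemical_id, disease_id, is_positive in mention_predictions:
        if is_positive:
            relations.add((doc_id, chemical_id, disease_id))
    return relations


def precision_recall_f1(predicted, gold):
    """Standard set-based precision, recall and F-score."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Toy mention-level output (hypothetical identifiers).
mentions = [
    ("doc1", "C1", "D1", False),
    ("doc1", "C1", "D1", True),   # one positive mention pair is enough
    ("doc1", "C2", "D1", False),
    ("doc2", "C3", "D2", True),
]
gold = {("doc1", "C1", "D1"), ("doc2", "C3", "D3")}

predicted = merge_to_document_level(mentions)
precision, recall, f1 = precision_recall_f1(predicted, gold)
```

Other merging rules (for example, majority voting over mention pairs) would fit the same interface; the any-positive rule is only the simplest choice.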
Semantic computing and language knowledge bases
NASA Astrophysics Data System (ADS)
Wang, Lei; Wang, Houfeng; Yu, Shiwen
2017-09-01
With the proposal of the next-generation Web, the Semantic Web, semantic computing has been drawing more and more attention in academia and industry. A lot of research has been conducted on the theory and methodology of the subject, and potential applications have also been investigated and proposed in many fields. The progress made so far in semantic computing cannot be separated from its supporting pivot: language resources such as language knowledge bases. This paper proposes three perspectives of semantic computing from a macro view and describes the current state of the construction of language knowledge bases and the related research and applications that have been carried out on the basis of these resources via a case study in the Institute of Computational Linguistics at Peking University.
Risk analysis with a fuzzy-logic approach of a complex installation
NASA Astrophysics Data System (ADS)
Peikert, Tim; Garbe, Heyno; Potthast, Stefan
2016-09-01
This paper introduces a procedural method based on fuzzy logic to systematically analyze the risk of an electronic system in an intentional electromagnetic environment (IEME). The method analyzes the susceptibility of a complex electronic installation with respect to intentional electromagnetic interference (IEMI). It combines the advantages of well-known techniques such as fault tree analysis (FTA), electromagnetic topology (EMT) and Bayesian networks (BN) and extends these techniques with an approach to handle uncertainty. This approach uses fuzzy sets, membership functions and fuzzy logic to handle the uncertainty with probability functions and linguistic terms. The linguistic terms add the knowledge of experts on the investigated system or environment to the risk analysis.
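How linguistic terms map onto fuzzy sets can be shown with a short Python sketch. The abstract does not give the membership functions used, so the triangular shapes, the three terms and their breakpoints on a 0-1 scale are all assumptions chosen for illustration.

```python
def triangular(x, a, b, c):
    """Triangular membership function with feet at a and c and peak at b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


# Hypothetical linguistic terms for a failure probability on a 0..1 scale.
RISK_TERMS = {
    "low": (-0.5, 0.0, 0.5),
    "medium": (0.0, 0.5, 1.0),
    "high": (0.5, 1.0, 1.5),
}


def fuzzify(x):
    """Degree to which a crisp value x belongs to each linguistic term."""
    return {term: triangular(x, *abc) for term, abc in RISK_TERMS.items()}


def fuzzy_and(a, b):
    """Minimum t-norm, a common choice for fuzzy conjunction."""
    return min(a, b)


def fuzzy_or(a, b):
    """Maximum s-norm, a common choice for fuzzy disjunction."""
    return max(a, b)


memberships = fuzzify(0.7)  # partly "medium", partly "high", not "low"
```

An expert statement such as "the coupling risk is medium or high" would then be evaluated as `fuzzy_or(memberships["medium"], memberships["high"])`.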
Expert Systems for Libraries at SCIL [Small Computers in Libraries]'88.
ERIC Educational Resources Information Center
Kochtanek, Thomas R.; And Others
1988-01-01
Six brief papers on expert systems for libraries cover (1) a knowledge-based approach to database design; (2) getting started in expert systems; (3) using public domain software to develop a business reference system; (4) a music cataloging inquiry system; (5) linguistic analysis of reference transactions; and (6) a model of a reference librarian.…
"Blame" Concept in Phraseology: Cognitive-Semantic Aspect (Based on the French Language)
ERIC Educational Resources Information Center
Zalavina, Tatyana Y.; Kisel, Olesya V.
2016-01-01
Phraseology is one of the basic and most important objects of study in cognitive linguistics. The article deals with verbal fixed phrases in their correlation with the cognitive structure of knowledge--a concept. The used definitional analysis method to identify the basic notions of the conceptual content of the concept of blame and basic…
ERIC Educational Resources Information Center
Soto Huerta, Mary Esther
2012-01-01
To examine the second-language reading development of 45 fourth-grade Latino bilinguals, a sequential mixed methods study was conducted in two phases (Creswell, 2009). The quantitative data collected in the first phase generated an index of the group's reading performance based on two grade-level assessments, a state-mandated standardized reading…
Building an automated SOAP classifier for emergency department reports.
Mowery, Danielle; Wiebe, Janyce; Visweswaran, Shyam; Harkema, Henk; Chapman, Wendy W
2012-02-01
Information extraction applications that extract structured event and entity information from unstructured text can leverage knowledge of clinical report structure to improve performance. The Subjective, Objective, Assessment, Plan (SOAP) framework, used to structure progress notes to facilitate problem-specific, clinical decision making by physicians, is one example of a well-known, canonical structure in the medical domain. Although its applicability to structuring data is understood, its contribution to information extraction tasks has not yet been determined. The first step to evaluating the SOAP framework's usefulness for clinical information extraction is to apply the model to clinical narratives and develop an automated SOAP classifier that classifies sentences from clinical reports. In this quantitative study, we applied the SOAP framework to sentences from emergency department reports, and trained and evaluated SOAP classifiers built with various linguistic features. We found the SOAP framework can be applied manually to emergency department reports with high agreement (Cohen's kappa coefficients over 0.70). Using a variety of features, we found classifiers for each SOAP class can be created with moderate to outstanding performance with F(1) scores of 93.9 (subjective), 94.5 (objective), 75.7 (assessment), and 77.0 (plan). We look forward to expanding the framework and applying the SOAP classification to clinical information extraction tasks. Copyright © 2011. Published by Elsevier Inc.
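The agreement figure reported above is Cohen's kappa, which corrects raw agreement for the agreement expected by chance. A minimal Python sketch of the standard two-annotator formula follows; the label sequences are invented toy data, not taken from the study.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the annotators' marginal label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)


# Toy SOAP labels for eight sentences (S, O, A, P), purely illustrative.
annotator_1 = ["S", "S", "O", "O", "A", "P", "S", "O"]
annotator_2 = ["S", "S", "O", "A", "A", "P", "S", "O"]
kappa = cohens_kappa(annotator_1, annotator_2)
```

With these toy labels the annotators agree on 7 of 8 sentences, and the chance correction lowers the score below that raw 0.875.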
FEX: A Knowledge-Based System For Planimetric Feature Extraction
NASA Astrophysics Data System (ADS)
Zelek, John S.
1988-10-01
Topographical planimetric features include natural surfaces (rivers, lakes) and man-made surfaces (roads, railways, bridges). In conventional planimetric feature extraction, a photointerpreter manually interprets and extracts features from imagery on a stereoplotter. Visual planimetric feature extraction is a very labour intensive operation. The advantages of automating feature extraction include: time and labour savings; accuracy improvements; and planimetric data consistency. FEX (Feature EXtraction) combines techniques from image processing, remote sensing and artificial intelligence for automatic feature extraction. The feature extraction process co-ordinates the information and knowledge in a hierarchical data structure. The system simulates the reasoning of a photointerpreter in determining the planimetric features. Present efforts have concentrated on the extraction of road-like features in SPOT imagery. Keywords: Remote Sensing, Artificial Intelligence (AI), SPOT, image understanding, knowledge base, apars.
ERIC Educational Resources Information Center
O'Keeffe, Lisa
2016-01-01
Language is frequently discussed as a barrier to mathematics word problems. Hence this paper presents the initial findings of a linguistic analysis of numeracy skills test sample items. The theoretical perspective of multi-modal text analysis underpinned this study, in which data was extracted from the ten sample numeracy test items released by the…
Antia, B E; Omotara, B A; Rabasa, A I; Addy, E O; Tomfafi, O A A; Anaso, C C
2003-06-01
The aim of this study was to propose an alternative approach to traditional knowledge, attitude and practice (KAP) studies to enhance the quality of data on which educational health programmes are based. The methodology proposed and illustrated involved a triangulation of approaches derived from linguistics, cognitive science, and medical laboratory sciences. Three diarrhoeal health talks (educational messages) as given to mothers in three primary-care facilities in Borno State (Northeast Nigeria) were subjected to a linguistic analysis. Relationships were then sought between the ontology of knowledge in the health talks as revealed by the text analysis and two other kinds of data, namely: (a) mothers' answers to a set of ecologically-sensitive reasoning questions that test how much relevant inferential knowledge the health talks allow for and (b) results of microbiological and biochemical analyses of salt-sugar rehydration solutions prepared by mothers participating in the study. The findings of the study show a relationship between contents/formatting of the health talks and the extent to which relevant inferential competence was supported or demonstrated by mothers. It was also evident that the laboratory analyses could be related either directly to the health talks or indirectly in terms of what the health talks need to emphasize. The conclusion shows how the methodology proposed addresses shortcomings of traditional KAP studies in respect of the gap between health knowledge and practice.
NASA Astrophysics Data System (ADS)
Irish, Tobias E. L.
This multiple case study explores issues of equity in science education through an examination of how teachers' reasoning patterns compare with students' reasoning patterns during inquiry-based lessons. It also examines the ways in which teachers utilize students' cultural and linguistic resources, or funds of knowledge, during inquiry-based lessons and the ways in which students utilize their funds of knowledge, during inquiry-based lessons. Three middle school teachers and a total of 57 middle school students participated in this study. The data collection involved classroom observations and multiple interviews with each of the teachers individually and with small groups of students. The findings indicate that the students are capable of far more complex reasoning than what was elicited by the lessons observed or what was modeled and expected by the teachers, but that during the inquiry-based lessons they conformed to the more simplistic reasoning patterns they perceived as the expected norm of classroom dialogue. The findings also indicate that the students possess funds of knowledge that are relevant to science topics, but very seldom use these funds in the context of their inquiry-based lessons. In addition, the teachers in this study very seldom worked to elicit students' use of their funds in these contexts. The few attempts they did make involved the use of analogies, examples, or questions. The findings from this study have implications for both teachers and teacher educators in that they highlight similarities and differences in reasoning that can help teachers establish instructional congruence and facilitate more equitable science instruction. They also provide insight into how students' cultural and linguistic resources are utilized during inquiry-based science lessons.
New Approaches to a Subject of Anthropocentric Linguistics
ERIC Educational Resources Information Center
Lee, Valentine S.; Tumanova, Ainakul B.; Salkhanova, Zhanat H.
2016-01-01
The article studies theoretical issues of modern anthropocentric paradigm of scientific knowledge from the history of anthropocentric linguistics development as a special field of language science. The purpose of this study is to answer the question about human influence on the semiotic system. The material result is the unification of specific…
Teaching Reading to English Language Learners: Insights from Linguistics
ERIC Educational Resources Information Center
Lems, Kristin; Miller, Leah D.; Soro, Tenena M.
2009-01-01
Written specifically for K-12 educators, this accessible book explains the processes involved in second-language acquisition and provides a wealth of practical strategies for helping English language learners (ELLs) succeed at reading. The authors integrate knowledge from two fields that often remain disconnected--linguistics and literacy--with a…
Linguistic Skills and Speaking Fluency in a Second Language
ERIC Educational Resources Information Center
De Jong, Nivja H.; Steinel, Margarita P.; Florijn, Arjen; Schoonen, Rob; Hulstijn, Jan H.
2013-01-01
This study investigated how individual differences in linguistic knowledge and processing skills relate to individual differences in speaking fluency. Speakers of Dutch as a second language ("N" = 179) performed eight speaking tasks, from which several measures of fluency were derived such as measures for pausing, repairing, and speed…
Preparing Bilingual Teachers for the Future: Developing Culture and Linguistic Global Competence
ERIC Educational Resources Information Center
Alfaro, Cristina
2008-01-01
Diversity and linguistic complexity are increasing in classrooms throughout the world. Bilingual teachers need to develop knowledge and skills to succeed in teaching diverse students. Demographic shifts are bringing increasing numbers of international students from diverse racial, ethnic, religious, class, and linguistic…
Bulletin suisse de linguistique appliquee, 2001 (Swiss Bulletin for Applied Linguistics, 2001).
ERIC Educational Resources Information Center
Gajo, Laurent, Ed; Mondada, Lorenza, Ed.
2001-01-01
This issue, primarily written in French, contains articles by researchers in the fields of linguistics and social sciences and by health care professionals. The articles include the following: "The Collective Working Out of Medical Knowledge" (Lorenza Mondada); "Involvement and Constraint in a Surgical Consultation Room"…
Worlds of Knowledge in Central Bhutan: Documentation of 'Olekha
ERIC Educational Resources Information Center
Hyslop, Gwendolyn
2016-01-01
A re-emergence in language documentation has brought with it a recent recognition of the potential contributions which collaboration with other disciplines has to offer linguistics. For example, ten chapters of the recently published Oxford Handbook of Linguistic Fieldwork (Thieberger 2012) were explicitly devoted to cross-discipline…
A Simple View of Linguistic Complexity
ERIC Educational Resources Information Center
Pallotti, Gabriele
2015-01-01
Although a growing number of second language acquisition (SLA) studies take linguistic complexity as a dependent variable, the term is still poorly defined and often used with different meanings, thus posing serious problems for research synthesis and knowledge accumulation. This article proposes a simple, coherent view of the construct, which is…
Neural bases of event knowledge and syntax integration in comprehension of complex sentences.
Malaia, Evie; Newman, Sharlene
2015-01-01
Comprehension of complex sentences is necessarily supported by both syntactic and semantic knowledge, but what linguistic factors trigger a reader's reliance on a specific system? This functional neuroimaging study orthogonally manipulated argument plausibility and verb event type to investigate cortical bases of the semantic effect on argument comprehension during reading. The data suggest that telic verbs facilitate online processing by means of consolidating the event schemas in episodic memory and by easing the computation of syntactico-thematic hierarchies in the left inferior frontal gyrus. The results demonstrate that syntax-semantics integration relies on trade-offs among a distributed network of regions for maximum comprehension efficiency.
Language learners privilege structured meaning over surface frequency
Culbertson, Jennifer; Adger, David
2014-01-01
Although it is widely agreed that learning the syntax of natural languages involves acquiring structure-dependent rules, recent work on acquisition has nevertheless attempted to characterize the outcome of learning primarily in terms of statistical generalizations about surface distributional information. In this paper we investigate whether surface statistical knowledge or structural knowledge of English is used to infer properties of a novel language under conditions of impoverished input. We expose learners to artificial-language patterns that are equally consistent with two possible underlying grammars—one more similar to English in terms of the linear ordering of words, the other more similar on abstract structural grounds. We show that learners’ grammatical inferences overwhelmingly favor structural similarity over preservation of superficial order. Importantly, the relevant shared structure can be characterized in terms of a universal preference for isomorphism in the mapping from meanings to utterances. Whereas previous empirical support for this universal has been based entirely on data from cross-linguistic language samples, our results suggest it may reflect a deep property of the human cognitive system—a property that, together with other structure-sensitive principles, constrains the acquisition of linguistic knowledge. PMID:24706789
Categorizing words through semantic memory navigation
NASA Astrophysics Data System (ADS)
Borge-Holthoefer, J.; Arenas, A.
2010-03-01
Semantic memory is the cognitive system devoted to storage and retrieval of conceptual knowledge. Empirical data indicate that semantic memory is organized in a network structure. Everyday experience shows that word search and retrieval processes provide fluent and coherent speech, i.e. are efficient. This implies either that semantic memory encodes, besides thousands of words, different kinds of links for different relationships (introducing greater complexity and storage costs), or that the structure evolves to facilitate the differentiation of long-lasting semantic relations from incidental, phenomenological ones. Assuming the latter possibility, we explore a mechanism to disentangle the underlying semantic backbone, which comprises conceptual structure (extraction of categorical relations between pairs of words), from the rest of the information present in the structure. To this end, we first present and characterize an empirical data set modeled as a network, then we simulate a stochastic cognitive navigation on this topology. We schematize this latter process as uncorrelated random walks from node to node, which converge to a network of feature vectors. By doing so we both introduce a novel mechanism for information retrieval and point at the problem of category formation in close connection to linguistic and non-linguistic experience.
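The idea of turning random walks on a word network into feature vectors can be sketched in Python. This is a loose illustration under stated assumptions, not the authors' algorithm: the toy graph, the walk length, the number of walks and the use of cosine similarity to compare vectors are all hypothetical choices.

```python
import random

# Toy undirected word-association network (hypothetical data).
GRAPH = {
    "dog": ["cat", "bone", "pet"],
    "cat": ["dog", "pet", "mouse"],
    "pet": ["dog", "cat"],
    "bone": ["dog"],
    "mouse": ["cat", "cheese"],
    "cheese": ["mouse"],
}


def visit_vector(graph, start, walks=200, steps=5, seed=0):
    """Relative visit frequency of every node over short walks from `start`."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    counts = {node: 0 for node in graph}
    for _ in range(walks):
        node = start
        for _ in range(steps):
            node = rng.choice(graph[node])  # uncorrelated step to a neighbour
            counts[node] += 1
    total = walks * steps
    return {node: count / total for node, count in counts.items()}


def cosine(u, v):
    """Cosine similarity between two vectors stored as dicts over the same keys."""
    dot = sum(u[key] * v[key] for key in u)
    norm_u = sum(value * value for value in u.values()) ** 0.5
    norm_v = sum(value * value for value in v.values()) ** 0.5
    return dot / (norm_u * norm_v)


# One feature vector per word; similar vectors suggest a shared category.
vectors = {word: visit_vector(GRAPH, word) for word in GRAPH}
similarity_dog_cat = cosine(vectors["dog"], vectors["cat"])
```

Linking words whose vectors are most similar would yield a new network over the same vocabulary, which is one way to read the "network of feature vectors" mentioned in the abstract.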
The logical syntax of number words: theory, acquisition and processing.
Musolino, Julien
2009-04-01
Recent work on the acquisition of number words has emphasized the importance of integrating linguistic and developmental perspectives [Musolino, J. (2004). The semantics and acquisition of number words: Integrating linguistic and developmental perspectives. Cognition, 93, 1-41; Papafragou, A., Musolino, J. (2003). Scalar implicatures: Experiments at the semantics-pragmatics interface. Cognition, 86, 253-282; Hurewitz, F., Papafragou, A., Gleitman, L., Gelman, R. (2006). Asymmetries in the acquisition of numbers and quantifiers. Language Learning and Development, 2, 76-97; Huang, Y. T., Snedeker, J., Spelke, L. (submitted for publication). What exactly do numbers mean?]. Specifically, these studies have shown that data from experimental investigations of child language can be used to illuminate core theoretical issues in the semantic and pragmatic analysis of number terms. In this article, I extend this approach to the logico-syntactic properties of number words, focusing on the way numerals interact with each other (e.g. Three boys are holding two balloons) as well as with other quantified expressions (e.g. Three boys are holding each balloon). On the basis of their intuitions, linguists have claimed that such sentences give rise to at least four different interpretations, reflecting the complexity of the linguistic structure and syntactic operations involved. Using psycholinguistic experimentation with preschoolers (n=32) and adult speakers of English (n=32), I show that (a) for adults, the intuitions of linguists can be verified experimentally, (b) by the age of 5, children have knowledge of the core aspects of the logical syntax of number words, (c) in spite of this knowledge, children nevertheless differ from adults in systematic ways, (d) the differences observed between children and adults can be accounted for on the basis of an independently motivated, linguistically-based processing model [Geurts, B. (2003). Quantifying kids. Language Acquisition, 11(4), 197-218]. In doing so, this work ties together research on the acquisition of the number vocabulary with a growing body of work on the development of quantification and sentence processing abilities in young children [Geurts, 2003; Lidz, J., Musolino, J. (2002). Children's command of quantification. Cognition, 84, 113-154; Musolino, J., Lidz, J. (2003). The scope of isomorphism: Turning adults into children. Language Acquisition, 11(4), 277-291; Trueswell, J., Sekerina, I., Hilland, N., Logrip, M. (1999). The kindergarten-path effect: Studying on-line sentence processing in young children. Cognition, 73, 89-134; Noveck, I. (2001). When children are more logical than adults: Experimental investigations of scalar implicature. Cognition, 78, 165-188; Noveck, I., Guelminger, R., Georgieff, N., & Labruyere, N. (2007). What autism can tell us about every. . . not sentences. Journal of Semantics, 24(1), 73-90]. On a more general level, this work confirms the importance of integrating formal and developmental perspectives [Musolino, 2004], this time by highlighting the explanatory power of linguistically-based models of language acquisition and by showing that the complex structure postulated by linguists has important implications for developmental accounts of the number vocabulary.
ERIC Educational Resources Information Center
Schwarz, Michel P.M.
1981-01-01
Discusses general principles of language testing, stressing objectivity and reliability as the key terms. However, maintains that it is impossible to obtain a direct measure of linguistic competence and consequently questions the value of standard grading procedures. Instead, proposes an evaluation system based on the achievement of specific…
ERIC Educational Resources Information Center
Márquez, Manuel; Chaves, Beatriz
2016-01-01
The application of a methodology based on S.C. Dik's Functionalist Grammar linguistic principles, which is addressed to the teaching of Latin to secondary students, has resulted in a quantitative improvement in students' acquisition process of knowledge. To do so, we have used a self-learning tool, an ad hoc dictionary, of which the use in…
NASA Technical Reports Server (NTRS)
Howard, Ayanna; Bayard, David
2006-01-01
Fuzzy Feature Observation Planner for Small Body Proximity Observations (FuzzObserver) is a developmental computer program, to be used along with other software, for autonomous planning of maneuvers of a spacecraft near an asteroid, comet, or other small astronomical body. Selection of terrain features and estimation of the position of the spacecraft relative to these features is an essential part of such planning. FuzzObserver contributes to the selection and estimation by generating recommendations for spacecraft trajectory adjustments to maintain the spacecraft's ability to observe sufficient terrain features for estimating position. The input to FuzzObserver consists of data from terrain images, including sets of data on features acquired during descent toward, or traversal of, a body of interest. The name of this program reflects its use of fuzzy logic to reason about the terrain features represented by the data and extract corresponding trajectory-adjustment rules. Linguistic fuzzy sets and conditional statements enable fuzzy systems to make decisions based on heuristic rule-based knowledge derived by engineering experts. A major advantage of using fuzzy logic is that it involves simple arithmetic calculations that can be performed rapidly enough to be useful for planning within the short times typically available for spacecraft maneuvers.
Chapter 16: text mining for translational bioinformatics.
Cohen, K Bretonnel; Hunter, Lawrence E
2013-04-01
Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research (translating basic science results into new interventions) and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.
Computational Investigations of Multiword Chunks in Language Learning.
McCauley, Stewart M; Christiansen, Morten H
2017-07-01
Second-language learners rarely arrive at native proficiency in a number of linguistic domains, including morphological and syntactic processing. Previous approaches to understanding the different outcomes of first- versus second-language learning have focused on cognitive and neural factors. In contrast, we explore the possibility that children and adults may rely on different linguistic units throughout the course of language learning, with specific focus on the granularity of those units. Following recent psycholinguistic evidence for the role of multiword chunks in online language processing, we explore the hypothesis that children rely more heavily on multiword units in language learning than do adults learning a second language. To this end, we take an initial step toward using large-scale, corpus-based computational modeling as a tool for exploring the granularity of speakers' linguistic units. Employing a computational model of language learning, the Chunk-Based Learner, we compare the usefulness of chunk-based knowledge in accounting for the speech of second-language learners versus children and adults speaking their first language. Our findings suggest that while multiword units are likely to play a role in second-language learning, adults may learn less useful chunks, rely on them to a lesser extent, and arrive at them through different means than children learning a first language. Copyright © 2017 Cognitive Science Society, Inc.
VisualUrText: A Text Analytics Tool for Unstructured Textual Data
NASA Astrophysics Data System (ADS)
Zainol, Zuraini; Jaymes, Mohd T. H.; Nohuddin, Puteri N. E.
2018-05-01
The amount of unstructured text on the Internet is growing tremendously. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90% of future data growth will be in the form of unstructured text databases that may potentially contain interesting patterns and trends. Text mining is a well-known technique for discovering interesting patterns and trends, that is, non-trivial knowledge, from massive unstructured text data. Text mining covers multidisciplinary fields involving information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning, statistics and computational linguistics. This paper discusses the development of a text analytics tool that is proficient in extracting, processing and analyzing unstructured text data and visualizing the cleaned text data in multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendrogram. This tool, VisualUrText, is developed to assist students and researchers in extracting interesting patterns and trends in document analyses.
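A Document Term Matrix, the first of the output forms listed above, can be built with the Python standard library alone. The sketch below is not how VisualUrText is implemented (the abstract does not say); the tokenizer, the function names and the two sample documents are assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lower-case the text and keep alphabetic runs only (a crude cleaner)."""
    return re.findall(r"[a-z]+", text.lower())


def document_term_matrix(documents):
    """Return (vocabulary, matrix), where matrix[i][j] counts term j in document i."""
    counts = [Counter(tokenize(document)) for document in documents]
    vocabulary = sorted(set().union(*counts))
    matrix = [[count[term] for term in vocabulary] for count in counts]
    return vocabulary, matrix


documents = [
    "Text mining finds patterns.",
    "Patterns in text, text everywhere.",
]
vocabulary, matrix = document_term_matrix(documents)

# Column sums give the corpus-wide term frequencies behind a frequency graph.
term_frequency = {
    term: sum(row[j] for row in matrix) for j, term in enumerate(vocabulary)
}
```

The same counts could feed a word cloud, and distances between matrix rows could feed a dendrogram; a real tool would also remove stop words and apply stemming before counting.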
Refining Automatically Extracted Knowledge Bases Using Crowdsourcing.
Li, Chunhua; Zhao, Pengpeng; Sheng, Victor S; Xian, Xuefeng; Wu, Jian; Cui, Zhiming
2017-01-01
Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost.
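To make the idea of semantic constraints more concrete, here is a minimal sketch, not the authors' algorithm. It assumes a single kind of constraint (a relation is functional, so a subject may have only one object) and a simple ranking that sends the least certain conflicting facts to the crowd first. The facts, the `FUNCTIONAL` set and both function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical candidate facts: (subject, relation, object, extractor confidence).
facts = [
    ("Ada", "born_in", "London", 0.9),
    ("Ada", "born_in", "Paris", 0.4),
    ("Ada", "works_at", "Acme", 0.7),
    ("Bob", "born_in", "Rome", 0.8),
]

# Assumed constraint: these relations allow one object per subject.
FUNCTIONAL = {"born_in"}

def conflicting_facts(facts):
    # Group facts of functional relations by (subject, relation).
    groups = defaultdict(list)
    for fact in facts:
        subject, relation, _, _ = fact
        if relation in FUNCTIONAL:
            groups[(subject, relation)].append(fact)
    # A group violates the constraint when it holds more than one distinct object.
    return [f for g in groups.values() if len({x[2] for x in g}) > 1 for f in g]

def rank_for_crowd(facts, budget):
    # Ask first about conflicting facts whose confidence is closest to 0.5.
    candidates = conflicting_facts(facts)
    return sorted(candidates, key=lambda f: abs(f[3] - 0.5))[:budget]
```

Facts that violate no constraint are never sent out, which is one simple way to prune unnecessary questions under a limited budget.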
ERIC Educational Resources Information Center
Montgomery, James W.
2004-01-01
Many children with specific language impairment (SLI) exhibit sentence comprehension difficulties. In some instances, these difficulties appear to be related to poor linguistic knowledge and, in other instances, to inferior general processing abilities. Two processing deficiencies evidenced by these children include reduced linguistic processing…
The Role of Conceptual and Linguistic Ontologies in Interpreting Spatial Discourse
ERIC Educational Resources Information Center
Bateman, John; Tenbrink, Thora; Farrar, Scott
2007-01-01
This article argues that a clear division between two sources of information--one oriented to world knowledge, the other to linguistic semantics--offers a framework within which mechanisms for modelling the highly flexible relation between language and interpretation necessary for natural discourse can be specified and empirically validated.…
Preparing Linguistically Responsive Teachers: Laying the Foundation in Preservice Teacher Education
ERIC Educational Resources Information Center
Lucas, Tamara; Villegas, Ana Maria
2013-01-01
It takes teachers many years to develop expertise in the complex set of knowledge, skills, and orientations needed to teach culturally and linguistically diverse (CLD) students well. The process begins in preservice preparation and continues into the early years of teaching and throughout a teacher's career. This article examines preservice…
Preparing PETE Students for Culturally and Linguistically Diverse Learners
ERIC Educational Resources Information Center
Culp, Brian; Schmidlein, Robert
2012-01-01
By the year 2030, it is predicted that culturally and linguistically diverse (CLD) learners will comprise approximately half of the public school population in the United States. Unfortunately, many pre-service educators enter the teaching field each year lacking knowledge of the experiences and needs of these students. This trend has particular…
ERIC Educational Resources Information Center
O'Connor, Brendan Harold
2012-01-01
This dissertation is a linguistic ethnography of a high school Astronomy/Oceanography classroom in southern Arizona, where an exceptionally promising, novice, white science teacher and mostly Mexican-American students confronted issues of identity and difference through interactions both related and unrelated to science learning. Through close…
Digitizing Ethiopic: Coding for Linguistic Continuity in the Face of Digital Extinction
ERIC Educational Resources Information Center
Zaugg, Isabelle Alice
2017-01-01
Despite the growing sophistication of digital technologies, it appears they are contributing to language extinction on a par with devastating losses in biodiversity. With language extinction comes loss of identity, inter-generational cohesion, culture, and a global wealth of knowledge to address future problems facing humanity. Linguists estimate…
[Communicating effectively: neuro-linguistic programming in the psychiatric interview].
Ducasse, Déborah; Fond, Guillaume
2014-01-01
Neuro-linguistic programming is a set of practices and knowledge which seeks to "model" and then imitate the best communication practices. Applying the key concepts to the care relationship in mental health care helps to improve the quality of the contact, the clarity of the communication and to create an openness to change.
ERIC Educational Resources Information Center
Wenden, Anita L.
2007-01-01
Despite the multifaceted role language plays in promoting direct and indirect violence, activities that would develop the linguistic knowledge and critical language skills for understanding how discourse shapes individual and group beliefs and prompts social action are conspicuously absent from peace education. This article aims to address this…
ERIC Educational Resources Information Center
Yan, Jing
2016-01-01
Explanation and justification require cognitive ability which selects and organises relevant information in a logical way, and linguistic ability which enables speakers to encode the information with linguistic knowledge. This study aims to investigate the development of Chinese oral explanation and justification in Singapore primary students. The…
Sources and Suggestions to Lower Listening Comprehension Anxiety in the EFL Classroom: A Case Study
ERIC Educational Resources Information Center
Sharif, Mohd. Yasin; Ferdous, Farhiba
2012-01-01
Listening is a creative skill that demands active involvement. Listeners draw on knowledge from both linguistic and non-linguistic sources. Listening comprehension (LC) tasks, which are always accompanied by anxiety, need closer examination. In the listening process a low-anxiety classroom environment inspires the listeners to participate…
Haman, Ewa; Łuniewska, Magdalena; Hansen, Pernille; Simonsen, Hanne Gram; Chiat, Shula; Bjekić, Jovana; Blažienė, Agnė; Chyl, Katarzyna; Dabašinskienė, Ineta; Engel de Abreu, Pascale; Gagarina, Natalia; Gavarró, Anna; Håkansson, Gisela; Harel, Efrat; Holm, Elisabeth; Kapalková, Svetlana; Kunnari, Sari; Levorato, Chiara; Lindgren, Josefin; Mieszkowska, Karolina; Montes Salarich, Laia; Potgieter, Anneke; Ribu, Ingeborg; Ringblom, Natalia; Rinker, Tanja; Roch, Maja; Slančová, Daniela; Southwood, Frenette; Tedeschi, Roberta; Tuncer, Aylin Müge; Ünal-Logacev, Özlem; Vuksanović, Jasmina; Armon-Lotem, Sharon
2017-01-01
This article investigates the cross-linguistic comparability of the newly developed lexical assessment tool Cross-linguistic Lexical Tasks (LITMUS-CLT). LITMUS-CLT is a part of the Language Impairment Testing in Multilingual Settings (LITMUS) battery (Armon-Lotem, de Jong & Meir, 2015). Here we analyse results on receptive and expressive word knowledge tasks for nouns and verbs across 17 languages from eight different language families: Baltic (Lithuanian), Bantu (isiXhosa), Finnic (Finnish), Germanic (Afrikaans, British English, South African English, German, Luxembourgish, Norwegian, Swedish), Romance (Catalan, Italian), Semitic (Hebrew), Slavic (Polish, Serbian, Slovak) and Turkic (Turkish). The participants were 639 monolingual children aged 3;0-6;11 living in 15 different countries. Differences in vocabulary size were small between 16 of the languages; but isiXhosa-speaking children knew significantly fewer words than speakers of the other languages. There was a robust effect of word class: accuracy was higher for nouns than verbs. Furthermore, comprehension was more advanced than production. Results are discussed in the context of cross-linguistic comparisons of lexical development in monolingual and bilingual populations.
Psychometric Evaluation of a Cultural Competency Assessment Instrument for Health Professionals
Haywood, Sonja H.; Goode, Tawara; Gao, Yong; Smith, Kristyn; Bronheim, Suzanne; Flocke, Susan A; Zyzanski, Steve
2012-01-01
Background Few valid and reliable measures exist for health care professionals interested in determining their levels of cultural and linguistic competence. Objective To evaluate the measurement properties of the Cultural Competence Health Practitioner Assessment (CCHPA-129). Methods The CCHPA-129 is a 129-item web-based instrument, developed by the National Center for Cultural Competence (NCCC). Responses on the CCHPA-129 were examined using factor analysis; Rasch modeling; and Differential Item Functioning (DIF) across race, ethnicity, gender, and profession. Subjects 2504 practitioners, including 1864 nurses (RN/LPN/BSN); 341 clinicians (PA/NP); and 299 physicians (MD/DO), who completed the CCHPA-129 online between 2005 and 2008. Results Three factors representing domains of knowledge, adapting practice, and promoting health for culturally and linguistically diverse populations accounted for 46% of the variance. Among Knowledge factor items, 53% (23/43) fit the Rasch model, item difficulties ranged from −1.01 logits (least difficult) to +1.11 logits (most difficult), separation index (SI) 13.82, and Cronbach’s α 0.92. Forty-seven percent (21/44) of Adapting Practice factor items fit the model, item difficulties −0.07 to +1.11 logits, SI 11.59, Cronbach’s α 0.88; and 58% (23/39) of Promoting Health factor items fit the model, item difficulties −1.01 to +1.38 logits, SI 22.64, Cronbach’s α 0.92. Early evidence of validity was established by known groups having statistically different scores. Conclusion The 67-item CCHPA-67 is psychometrically sound. This shortened instrument can be used to establish associations between practitioners’ cultural and linguistic competence and health outcomes as well as to evaluate interventions to increase practitioners’ cultural and linguistic competence. PMID:22437625
Visual analysis of online social media to open up the investigation of stance phenomena
Kucher, Kostiantyn; Schamp-Bjerede, Teri; Kerren, Andreas; Paradis, Carita; Sahlgren, Magnus
2015-01-01
Online social media are a perfect text source for stance analysis. Stance in human communication is concerned with speaker attitudes, beliefs, feelings and opinions. Expressions of stance are associated with the speakers' view of what they are talking about and what is up for discussion and negotiation in the intersubjective exchange. Taking stance is thus crucial for the social construction of meaning. Increased knowledge of stance can be useful for many application fields such as business intelligence, security analytics, or social media monitoring. In order to process large amounts of text data for stance analyses, linguists need interactive tools to explore the textual sources as well as the processed data based on computational linguistics techniques. Both original texts and derived data are important for refining the analyses iteratively. In this work, we present a visual analytics tool for online social media text data that can be used to open up the investigation of stance phenomena. Our approach complements traditional linguistic analysis techniques and is based on the analysis of utterances associated with two stance categories: sentiment and certainty. Our contributions include (1) the description of a novel web-based solution for analyzing the use and patterns of stance meanings and expressions in human communication over time; and (2) specialized techniques used for visualizing analysis provenance and corpus overview/navigation. We demonstrate our approach by means of text media on a highly controversial scandal with regard to expressions of anger and provide an expert review from linguists who have been using our tool. PMID:29249903
Visual analysis of online social media to open up the investigation of stance phenomena.
Kucher, Kostiantyn; Schamp-Bjerede, Teri; Kerren, Andreas; Paradis, Carita; Sahlgren, Magnus
2016-04-01
Online social media are a perfect text source for stance analysis. Stance in human communication is concerned with speaker attitudes, beliefs, feelings and opinions. Expressions of stance are associated with the speakers' view of what they are talking about and what is up for discussion and negotiation in the intersubjective exchange. Taking stance is thus crucial for the social construction of meaning. Increased knowledge of stance can be useful for many application fields such as business intelligence, security analytics, or social media monitoring. In order to process large amounts of text data for stance analyses, linguists need interactive tools to explore the textual sources as well as the processed data based on computational linguistics techniques. Both original texts and derived data are important for refining the analyses iteratively. In this work, we present a visual analytics tool for online social media text data that can be used to open up the investigation of stance phenomena. Our approach complements traditional linguistic analysis techniques and is based on the analysis of utterances associated with two stance categories: sentiment and certainty. Our contributions include (1) the description of a novel web-based solution for analyzing the use and patterns of stance meanings and expressions in human communication over time; and (2) specialized techniques used for visualizing analysis provenance and corpus overview/navigation. We demonstrate our approach by means of text media on a highly controversial scandal with regard to expressions of anger and provide an expert review from linguists who have been using our tool.
Multi-dimensionality and variability in folk classification of stingless bees (Apidae: Meliponini).
Zamudio, Fernando; Hilgert, Norma I
2015-05-23
Not long ago Eugene Hunn suggested using a combination of cognitive, linguistic, ecological and evolutionary theories in order to account for the dynamic character of ethnoecology in the study of folk classification systems. In this way he intended to question a certain homogeneity in folk classification models and to deepen the analysis and interpretation of variability in folk classifications. This paper studies how a rural, culturally mixed population of the Atlantic Forest of Misiones (Argentina) classified honey-producing stingless bees according to the linguistic, cognitive and ecological dimensions of folk classification. We also analyze the socio-ecological meaning of binomialization in naming and the meaning of general local variability in the designation of stingless bees. We used three different approaches: the classical approach developed by Brent Berlin, which relies heavily on linguistic criteria; the approach developed by Eleanor Rosch, which relies on psychological (cognitive) principles of categorization; and, finally, an approach that captures the ecological dimension of folk classification in local narratives. For the second approach, we developed ways of measuring the degree of prototypicality based on a total of 107 comparisons of the type "X is similar to Y" identified in personal narratives. Various logical and grouping strategies coexist and were identified as: graded of lateral linkage, hierarchical and functional. Similarity judgments among folk taxa resulted in an implicit logic of classification graded according to taxa's prototypicality. While there is high agreement on naming stingless bees with monomial names, a considerable number of underrepresented binomial names and a lack of names were observed. Two possible explanations for the reported local naming variability are presented. We support the multidimensionality of folk classification systems.
This confirms the specificity of local classification systems but also reflects the use of grouping strategies and mechanisms commonly observed in other cultural groups, such as the use of similarity judgments between more or less prototypical organisms. We also support the idea that alternative naming results from a process of fragmentation of knowledge or incomplete transmission of knowledge. These processes rest on the fact that culturally based knowledge, on the one hand, and biological knowledge of nature, on the other, can be acquired through different learning pathways.
Electronic processing of informed consents in a global pharmaceutical company environment.
Vishnyakova, Dina; Gobeill, Julien; Oezdemir-Zaech, Fatma; Kreim, Olivier; Vachon, Therese; Clade, Thierry; Haenning, Xavier; Mikhailov, Dmitri; Ruch, Patrick
2014-01-01
We present an electronic capture tool to process informed consents, which must be recorded when running a clinical trial. This tool aims to extract information expressing the duration of the consent given by the patient to authorize the exploitation of biomarker-related information collected during clinical trials. The system integrates a language detection module (LDM) to route a document to the appropriate information extraction module (IEM). The IEM is based on language-specific sets of linguistic rules for the identification of relevant textual facts. The achieved accuracy of both the LDM and IEM is 99%. The architecture of the system is described in detail.
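The routing described in this abstract, where a language detector selects a language-specific rule set, can be sketched in a few lines. Everything below is hypothetical: the stopword lists, the regular expressions and the function names are stand-ins chosen for the example, and the actual LDM and IEM are certainly more elaborate.

```python
import re

# Hypothetical stopword hints per language; a real LDM would be far richer.
STOPWORDS = {
    "en": {"the", "of", "and", "for", "is"},
    "it": {"il", "di", "per", "un", "la"},
}

# Hypothetical language-specific rules that capture a consent duration in years.
DURATION_RULES = {
    "en": re.compile(r"for (?:a period of )?(\d+) years?"),
    "it": re.compile(r"per (?:un periodo di )?(\d+) ann[oi]"),
}

def detect_language(text):
    # Score each language by how many of its stopwords occur in the text.
    words = re.findall(r"\w+", text.lower())
    scores = {lang: sum(w in sw for w in words) for lang, sw in STOPWORDS.items()}
    return max(scores, key=scores.get)

def extract_duration_years(text):
    # Route the document to the rule set of the detected language.
    lang = detect_language(text)
    match = DURATION_RULES[lang].search(text.lower())
    return lang, (int(match.group(1)) if match else None)
```

Keeping detection and extraction separate means a new language only needs a new entry in each table, which is the main appeal of this kind of modular design.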
The Effects of Transferred Vocabulary Knowledge on the Development of L2 Reading Proficiency.
ERIC Educational Resources Information Center
Koda, Keiko
1989-01-01
Examination of the effects of transferred vocabulary knowledge on college students' (N=24) acquisition of Japanese linguistic knowledge, verbal processing skills, and reading comprehension indicated that vocabulary knowledge was most highly correlated with reading comprehension. This initial advantage magnified its effects over time as task…
García-Remesal, Miguel; Maojo, Victor; Crespo, José
2010-01-01
In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. These candidates are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. On this set, we achieved 87.76% precision and 97.70% recall. This method can facilitate different research tasks, including text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.
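A two-stage pipeline of this kind can be approximated with a regular expression (itself a finite-state recognizer) followed by a few filtering rules. The sketch below is only an illustration under stated assumptions: the minimum length `MIN_LEN`, the tolerance for single spaces or hyphens inside a sequence, and the rule rejecting strings that mix T and U are all invented for the example and are not the rules of the published knowledge base.

```python
import re

# Assumed heuristic: a sequence has at least MIN_LEN nucleotide letters.
MIN_LEN = 10
# Candidate stage: runs of A/C/G/T/U, optionally split by single spaces or hyphens.
CANDIDATE = re.compile(r"\b[ACGTU](?:[ -]?[ACGTU])+\b")

def candidate_sequences(text):
    found = []
    for m in CANDIDATE.finditer(text):
        # Refinement stage: merge fragments split by spaces or hyphens.
        seq = re.sub(r"[ -]", "", m.group(0))
        # Filter stage: drop short matches and strings mixing DNA (T) with RNA (U).
        mixes_dna_rna = "T" in seq and "U" in seq
        if len(seq) >= MIN_LEN and not mixes_dna_rna:
            found.append(seq)
    return found
```

The length filter is what discards ordinary capitalised fragments such as "A CAT"; lowering `MIN_LEN` would raise recall at the cost of precision.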
Bonifacci, Paola; Tobia, Valentina; Bernabini, Luca; Marzocchi, Gian Marco
2016-01-01
Many studies have suggested that the concept of “number” is relatively independent from linguistic skills, although an increasing number of studies suggest that language abilities may play a pivotal role in the development of arithmetic skills. The condition of bilingualism can offer a unique perspective into the role of linguistic competence in numerical development. The present study was aimed at evaluating the relationship between language skills and early numeracy through a multilevel investigation in monolingual and bilingual minority children attending preschool. The sample included 156 preschool children. Of these, 77 were bilingual minority children (mean age = 58.27 ± 5.90), and 79 were monolinguals (mean age = 58.45 ± 6.03). The study focused on three levels of analysis: group differences in language and number skills, concurrent linguistic predictors of early numeracy and, finally, profile analysis of linguistic skills in children with impaired vs. adequate numeracy skills. The results showed that, apart from the expected differences in linguistic measures, bilinguals differed from monolinguals in numerical skills with a verbal component, such as semantic knowledge of digits, but they did not differ in a pure non-verbal component such as quantity comparison. The multigroup structural equation model indicated that letter knowledge was a significant predictor of the verbal component of numeracy for both groups. Phonological awareness was a significant predictor of numeracy skills only in the monolingual group. Profile analysis showed that children with a selective weakness in the non-verbal component of numeracy had fully adequate verbal skills. Results from the present study suggest that only some specific components of language competence predict numerical processing, although linguistic proficiency may not be a prerequisite for developing adequate early numeracy skills. PMID:27458413
NASA Astrophysics Data System (ADS)
Lebedev, A. A.; Maksimov, N. V.; Smirnova, E. V.
2017-01-01
The paper presents a model of information interactions based on a probabilistic concept of meanings. The proposed hypothesis about the wave nature of information and the use of the mathematical apparatus of quantum mechanics make it possible to consider the phenomena of interference and diffraction with respect to linguistic variables, and to quantify the dynamics of terms in subject areas. The retrospective INIS database of the IAEA was used as the experimental base.
Background Knowledge in Learning-Based Relation Extraction
ERIC Educational Resources Information Center
Do, Quang Xuan
2012-01-01
In this thesis, we study the importance of background knowledge in relation extraction systems. We not only demonstrate the benefits of leveraging background knowledge to improve the systems' performance but also propose a principled framework that allows one to effectively incorporate knowledge into statistical machine learning models for…
Design of fuzzy systems using neurofuzzy networks.
Figueiredo, M; Gomide, F
1999-01-01
This paper introduces a systematic approach for fuzzy system design based on a class of neural fuzzy networks built upon a general neuron model. The network structure is such that it encodes the knowledge learned in the form of if-then fuzzy rules and processes data following fuzzy reasoning principles. The technique provides a mechanism to obtain rules covering the whole input/output space as well as the membership functions (including their shapes) for each input variable. Such characteristics are of utmost importance in fuzzy systems design and application. In addition, after learning, it is very simple to extract fuzzy rules in the linguistic form. The network has universal approximation capability, a property very useful in, e.g., modeling and control applications. Here we focus on function approximation problems as a vehicle to illustrate its usefulness and to evaluate its performance. Comparisons with alternative approaches are also included. Both noise-free and noisy data have been studied and considered in the computational experiments. The neural fuzzy network developed here and, consequently, the underlying approach, has been shown to provide good results from the accuracy, complexity, and system design points of view.
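For readers unfamiliar with if-then fuzzy rules in linguistic form, the sketch below shows how a tiny hand-written rule base approximates a function. It is not the network from the paper: the triangular membership functions, the three rules and the weighted-average inference (a zero-order Takagi-Sugeno style scheme) are assumptions chosen to keep the example short, and nothing here is learned from data.

```python
def triangle(x, left, peak, right):
    """Triangular membership function with support (left, right)."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)

# Hypothetical rule base: IF x is <label> THEN y = <consequent>.
RULES = [
    ("low",    (-1.0, 0.0, 1.0), 0.0),
    ("medium", (0.0, 1.0, 2.0), 1.0),
    ("high",   (1.0, 2.0, 3.0), 4.0),
]

def infer(x):
    # Firing strength of each rule is the membership of x in its antecedent.
    weights = [triangle(x, *mf) for _, mf, _ in RULES]
    total = sum(weights)
    if total == 0:
        raise ValueError("x is outside the support of every rule")
    # Output is the firing-strength-weighted average of the rule consequents.
    return sum(w * y for w, (_, _, y) in zip(weights, RULES)) / total
```

With these three rules the system reproduces y = x² at the peaks 0, 1 and 2 and interpolates linearly between them, which gives a feel for how a learned rule base covers an input space.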
ERIC Educational Resources Information Center
Ghapanchi, Zargham; Taheryan, Atefeh
2012-01-01
This study examined the influence of language knowledge, metacognitive knowledge and metacognitive strategy use on speaking and listening proficiency. Ninety-six freshman and sophomore Iranian university students (male = 6, female = 90) participated in the study. Two kinds of questionnaires and one language knowledge test were administered.…
ERIC Educational Resources Information Center
Hernández, Anita C.; Montelongo, José A.; Herter, Roberta J.
2016-01-01
Educators can take advantage of Latino English learners' linguistic backgrounds by teaching Spanish-English cognate vocabulary using the Children's Choices picture books. Cognates are words that have identical or nearly identical spellings and meanings in two languages because of their Latin and Greek origins. Students can learn to recognize…
ERIC Educational Resources Information Center
Swanson, Julie Dingle
2016-01-01
Javits Gifted and Talented Education Program has provided a wealth of knowledge on culturally and linguistically diverse (CLD) gifted learners and how to support teachers in their work with CLD students. This study examined five impactful Javits projects through qualitative inquiry centered on how innovative practice takes root or not. Using…
ERIC Educational Resources Information Center
Zisselsberger, Margarita
2016-01-01
The increasing number of culturally and linguistically diverse (CLD) students in the United States has created a priority for examining the perspectives, dispositions, and attitudes that best support these learners' writing. This study explores the link between a teacher's developing pedagogical language knowledge and humanizing practices and her…
ERIC Educational Resources Information Center
Herring, William Rodney, Jr.
2009-01-01
A number of arguments appeared in the late-nineteenth-century United States about "correctness" in language, arguments for and against enforcing a standard of correctness and arguments about what should count as correct in language. Insofar as knowledge about and facility with "correct" linguistic usage could affect one's standing in the social…
The Power of Folk Linguistic Knowledge in Language Policy
ERIC Educational Resources Information Center
Albury, Nathan John
2017-01-01
Just as an expanded view of language policy now affords agency to many more actors across society than authorities and linguists alone, it also accepts that the dispositions these agents bring to language affairs influence language policy processes and outcomes. However, this paper makes the case that language policy may also be guided, to some…
Application of Learner Corpora to Second Language Learning and Teaching: An Overview
ERIC Educational Resources Information Center
Xu, Qi
2016-01-01
The paper gives an overview of learner corpora and their application to second language learning and teaching. It is proposed that there are four core components in learner corpus research, namely, corpus linguistics expertise, a good background in linguistic theory, knowledge of SLA theory, and a good understanding of foreign language teaching…
ERIC Educational Resources Information Center
Avenia-Tapper, Brianna; Llosa, Lorena
2015-01-01
This article addresses the issue of language-related construct-irrelevant variance on content area tests from the perspective of systemic functional linguistics. We propose that the construct relevance of language used in content area assessments, and consequent claims of construct-irrelevant variance and bias, should be determined according to…
Tools for Analyzing Verbal Art in the Field
ERIC Educational Resources Information Center
Turpin, Myfany; Henderson, Lana
2015-01-01
Song is a universal human phenomenon that can shed much light on the nature of language. Despite this, field linguists are not always equipped with the knowledge and skills to analyze song texts and draw out their significances to other areas of language. Furthermore, it is not uncommon for a language community to ask linguists working in the…
ERIC Educational Resources Information Center
Silverman, Rebecca D.; Coker, David; Proctor, C. Patrick; Harring, Jeffrey; Piantedosi, Kelly W.; Hartranft, Anna M.
2015-01-01
The purpose of this study was to explore relationships between language variables and writing outcomes with linguistically diverse students in grades 3-5. The participants were 197 children from three schools in one district in the mid-Atlantic United States. We assessed students' vocabulary knowledge and morphological and syntactical skill as…
ERIC Educational Resources Information Center
Brisk, Maria Estela; Hodgson-Drysdale, Tracy; O'Connor, Cheryl
2011-01-01
This study examined the teaching of report writing in PreK-5 through the lens of systemic functional linguistics theory. Teachers were part of a university-public school collaboration that included professional development on teaching genres, text organization, and language features. Grounded in this knowledge, teachers explicitly taught report…
A Framework for Representing and Jointly Reasoning over Linguistic and Non-Linguistic Knowledge
ERIC Educational Resources Information Center
Murugesan, Arthi
2009-01-01
Natural language poses several challenges to developing computational systems for modeling it. Natural language is not a precise problem but is rather ridden with a number of uncertainties in the form of either alternate words or interpretations. Furthermore, natural language is a generative system where the problem size is potentially infinite.…
Words as cultivators of others minds
Schilhab, Theresa S. S.
2015-01-01
The embodied–grounded view of cognition and language holds that sensorimotor experiences in the form of ‘re-enactments’ or ‘simulations’ are significant to the individual’s development of concepts and competent language use. However, a typical objection to the explanatory force of this view is that, in everyday life, we engage in linguistic exchanges about much more than might be directly accessible to our senses. For instance, when knowledge-sharing occurs as part of deep conversations between a teacher and student, language is the salient tool by which to obtain understanding, through the unfolding of explanations. Here, the acquisition of knowledge is realized through language, and the constitution of knowledge seems entirely linguistic. In this paper, based on a review of selected studies within contemporary embodied cognitive science, I propose that such linguistic exchanges, though occurring independently of direct experience, are in fact disguised forms of embodied cognition, leading to the reconciliation of the opposing views. I suggest that, in conversation, interlocutors use Words as Cultivators (WAC) of other minds as a direct result of their embodied–grounded origin, rendering WAC a radical interpretation of the Words as social Tools (WAT) proposal. The WAC hypothesis endorses the view of language as dynamic, continuously integrating with, and negotiating, cognitive processes in the individual. One such dynamic feature results from the ‘linguification process’, a term by which I refer to the socially produced mapping of a word to its referent which, mediated by the interlocutor, turns words into cultivators of others minds. In support of the linguification process hypothesis and WAC, I review relevant embodied–grounded research, and selected studies of instructed fear conditioning and guided imagery. PMID:26594187
Rhythm in language acquisition.
Langus, Alan; Mehler, Jacques; Nespor, Marina
2017-10-01
Spoken language is governed by rhythm. Linguistic rhythm is hierarchical and the rhythmic hierarchy partially mimics the prosodic as well as the morpho-syntactic hierarchy of spoken language. It can thus provide learners with cues about the structure of the language they are acquiring. We identify three universal levels of linguistic rhythm - the segmental level, the level of the metrical feet and the phonological phrase level - and discuss why primary lexical stress is not rhythmic. We survey experimental evidence on rhythm perception in young infants and native speakers of various languages to determine the properties of linguistic rhythm that are present at birth, those that mature during the first year of life and those that are shaped by the linguistic environment of language learners. We conclude with a discussion of the major gaps in current knowledge on linguistic rhythm and highlight areas of interest for future research that are most likely to yield significant insights into the nature, the perception, and the usefulness of linguistic rhythm. Copyright © 2016 Elsevier Ltd. All rights reserved.
Good-enough linguistic representations and online cognitive equilibrium in language processing.
Karimi, Hossein; Ferreira, Fernanda
2016-01-01
We review previous research showing that representations formed during language processing are sometimes just "good enough" for the task at hand and propose the "online cognitive equilibrium" hypothesis as the driving force behind the formation of good-enough representations in language processing. Based on this view, we assume that the language comprehension system by default prefers to achieve as early as possible and remain as long as possible in a state of cognitive equilibrium where linguistic representations are successfully incorporated with existing knowledge structures (i.e., schemata) so that a meaningful and coherent overall representation is formed, and uncertainty is resolved or at least minimized. We also argue that the online equilibrium hypothesis is consistent with current theories of language processing, which maintain that linguistic representations are formed through a complex interplay between simple heuristics and deep syntactic algorithms and also theories that hold that linguistic representations are often incomplete and lacking in detail. We also propose a model of language processing that makes use of both heuristic and algorithmic processing, is sensitive to online cognitive equilibrium, and, we argue, is capable of explaining the formation of underspecified representations. We review previous findings providing evidence for underspecification in relation to this hypothesis and the associated language processing model and argue that most of these findings are compatible with them.
A Critical Reflection on Knowledge Hierarchies, Language and Development
ERIC Educational Resources Information Center
Langthaler, Margarita; Witjes, Nina; Slezak, Gabriele
2012-01-01
Purpose: The purpose of this paper is to contribute to the discussion about the developmental value of knowledge by reflecting on the "knowledge for development" (K4D) paradigm. In particular, it draws attention to the interaction between linguistic and communicative processes and the areas of power, knowledge and education. This is…
Ahmed, Wamiq M; Lenz, Dominik; Liu, Jia; Paul Robinson, J; Ghafoor, Arif
2008-03-01
High-throughput biological imaging uses automated imaging devices to collect a large number of microscopic images for analysis of biological systems and validation of scientific hypotheses. Efficient manipulation of these datasets for knowledge discovery requires high-performance computational resources, efficient storage, and automated tools for extracting and sharing such knowledge among different research sites. Newly emerging grid technologies provide powerful means for exploiting the full potential of these imaging techniques. Efficient utilization of grid resources requires the development of knowledge-based tools and services that combine domain knowledge with analysis algorithms. In this paper, we first investigate how grid infrastructure can facilitate high-throughput biological imaging research, and present an architecture for providing knowledge-based grid services for this field. We identify two levels of knowledge-based services. The first level provides tools for extracting spatiotemporal knowledge from image sets and the second level provides high-level knowledge management and reasoning services. We then present cellular imaging markup language, an extensible markup language-based language for modeling of biological images and representation of spatiotemporal knowledge. This scheme can be used for spatiotemporal event composition, matching, and automated knowledge extraction and representation for large biological imaging datasets. We demonstrate the expressive power of this formalism by means of different examples and extensive experimental results.
Associating Human-Centered Concepts with Social Networks Using Fuzzy Sets
NASA Astrophysics Data System (ADS)
Yager, Ronald R.
The rapidly growing global interconnectivity, brought about to a large extent by the Internet, has dramatically increased the importance and diversity of social networks. Modern social networks cut across a spectrum from benign recreationally focused websites such as Facebook to occupationally oriented websites such as LinkedIn to criminally focused groups such as drug cartels to devastation and terror focused groups such as Al-Qaeda. Many organizations are interested in analyzing and extracting information related to these social networks. Among these are governmental police and security agencies as well as marketing and sales organizations. To aid these organizations there is a need for technologies to model social networks and intelligently extract information from these models. While established technologies exist for the modeling of relational networks [1-7], few technologies exist to extract information from these in a way that is compatible with human perception and understanding. Databases are an example of a technology in which we have tools for representing our information as well as tools for querying and extracting the information contained. Our goal is in some sense analogous. We want to use the relational network model to represent information, in this case about relationships and interconnections, and then be able to query the social network using intelligent human-centered concepts. To extend our capabilities to interact with social relational networks we need to associate human concepts and ideas with these networks. Since human beings predominantly use linguistic terms in which to reason and understand, we need to build bridges between human conceptualization and the formal mathematical representation of the social network. Consider for example a concept such as "leader". An analyst may be able to express, in linguistic terms, using a network relevant vocabulary, properties of a leader. Our task is to translate this linguistic description into a mathematical formalism that allows us to determine how true it is that a particular node is a leader. In this work we look at the use of fuzzy set methodologies [8-10] to provide a bridge between the human analyst and the formal model of the network.
Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
Xian, Xuefeng; Cui, Zhiming
2017-01-01
Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost. PMID:28588611
Xu, Rong; Li, Li; Wang, QuanQiu
2013-01-01
Motivation: Systems approaches to studying phenotypic relationships among diseases are emerging as an active area of research for both novel disease gene discovery and drug repurposing. Currently, systematic study of disease phenotypic relationships on a phenome-wide scale is limited because large-scale machine-understandable disease–phenotype relationship knowledge bases are often unavailable. Here, we present an automatic approach to extract disease–manifestation (D-M) pairs (one specific type of disease–phenotype relationship) from the wide body of published biomedical literature. Data and Methods: Our method leverages external knowledge and limits the amount of human effort required. For the text corpus, we used 119 085 682 MEDLINE sentences (21 354 075 citations). First, we used D-M pairs from existing biomedical ontologies as prior knowledge to automatically discover D-M–specific syntactic patterns. We then extracted additional pairs from MEDLINE using the learned patterns. Finally, we analysed correlations between disease manifestations and disease-associated genes and drugs to demonstrate the potential of this newly created knowledge base in disease gene discovery and drug repurposing. Results: In total, we extracted 121 359 unique D-M pairs with a high precision of 0.924. Among the extracted pairs, 120 419 (99.2%) have not been captured in existing structured knowledge sources. We have shown that disease manifestations correlate positively with both disease-associated genes and drug treatments. Conclusions: The main contribution of our study is the creation of a large-scale and accurate D-M phenotype relationship knowledge base. This unique knowledge base, when combined with existing phenotypic, genetic and proteomic datasets, can have profound implications in our deeper understanding of disease etiology and in rapid drug repurposing. Availability: http://nlp.case.edu/public/data/DMPatternUMLS/ Contact: rxx@case.edu PMID:23828786
ERIC Educational Resources Information Center
Taylor, Orlando L.
In discussing the rich linguistic history of Afro-Americans, the author points out that black people had a linguistic system when they came to the New World and frequently had a knowledge of a form of English which had been influenced by Black Portuguese and West African languages. Despite many assertions to the contrary, Black English, "the…
ERIC Educational Resources Information Center
Blair, Rebecca; Savage, Robert
2006-01-01
This paper reports a study exploring the associations between measures of two levels of phonological representation: recognition (epi-linguistic) and production (meta-linguistic) tasks, and very early reading and writing skills. Thirty-eight pre-reading Ottawa-area children, aged 4-5 years, named environmental print (EP), wrote their own name,…
ERIC Educational Resources Information Center
de la Fuente, Anahí Alba; Lacroix, Hugues
2015-01-01
In foreign language classrooms we often find that, in addition to their mother tongue (L1), learners already speak--or are learning--at least one other language. As a result, they already have an array of linguistic and cognitive skills that may prove very useful if they are adequately exploited during the language learning process. However, in…
Improving Cyber-Security of Smart Grid Systems via Anomaly Detection and Linguistic Domain Knowledge
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ondrej Linda; Todd Vollmer; Milos Manic
The planned large-scale deployment of smart grid network devices will generate a large amount of information exchanged over various types of communication networks. The implementation of these critical systems will require appropriate cyber-security measures. A network anomaly detection solution is considered in this work. In common network architectures multiple communications streams are simultaneously present, making it difficult to build an anomaly detection solution for the entire system. In addition, common anomaly detection algorithms require specification of a sensitivity threshold, which inevitably leads to a tradeoff between false-positive and false-negative rates. In order to alleviate these issues, this paper proposes a novel anomaly detection architecture. The designed system applies the previously developed network security cyber-sensor method to individual selected communication streams, allowing for learning accurate normal network behavior models. Furthermore, the developed system dynamically adjusts the sensitivity threshold of each anomaly detection algorithm based on domain knowledge about the specific network system. It is proposed to model this domain knowledge using Interval Type-2 Fuzzy Logic rules, which linguistically describe the relationship between various features of the network communication and the possibility of a cyber attack. The proposed method was tested on an experimental smart grid system, demonstrating enhanced cyber-security.
Incorporating Semantics into Data Driven Workflows for Content Based Analysis
NASA Astrophysics Data System (ADS)
Argüello, M.; Fernandez-Prieto, M. J.
Finding meaningful associations between text elements and knowledge structures within clinical narratives in a highly verbal domain, such as psychiatry, is a challenging goal. The research presented here uses a small corpus of case histories and brings into play pre-existing knowledge, and therefore complements other approaches that use large corpora (millions of words) and no pre-existing knowledge. The paper describes a variety of experiments for content-based analysis: Linguistic Analysis using NLP-oriented approaches, Sentiment Analysis, and Semantically Meaningful Analysis. Although it is not standard practice, the paper advocates providing automatic support to annotate the functionality as well as the data for each experiment by performing semantic annotation that uses OWL and OWL-S. Lessons learnt can be transmitted to legacy clinical databases facing the conversion of clinical narratives according to prominent Electronic Health Records standards.
Knowledge Acquisition of Generic Queries for Information Retrieval
Seol, Yoon-Ho; Johnson, Stephen B.; Cimino, James J.
2002-01-01
Several studies have identified clinical questions posed by health care professionals to understand the nature of information needs during clinical practice. To support access to digital information sources, it is necessary to integrate the information needs with a computer system. We have developed a conceptual guidance approach in information retrieval, based on a knowledge base that contains the patterns of information needs. The knowledge base uses a formal representation of clinical questions based on the UMLS knowledge sources, called the Generic Query model. To improve the coverage of the knowledge base, we investigated a method for extracting plausible clinical questions from the medical literature. This poster presents the Generic Query model, shows how it is used to represent the patterns of clinical questions, and describes the framework used to extract knowledge from the medical literature.
NASA Technical Reports Server (NTRS)
Howard, Ayanna
2005-01-01
The Fuzzy Logic Engine is a software package that enables users to embed fuzzy-logic modules into their application programs. Fuzzy logic is useful as a means of formulating human expert knowledge and translating it into software to solve problems. Fuzzy logic provides flexibility for modeling relationships between input and output information and is distinguished by its robustness with respect to noise and variations in system parameters. In addition, linguistic fuzzy sets and conditional statements allow systems to make decisions based on imprecise and incomplete information. The user of the Fuzzy Logic Engine need not be an expert in fuzzy logic: it suffices to have a basic understanding of how linguistic rules can be applied to the user's problem. The Fuzzy Logic Engine is divided into two modules: (1) a graphical-interface software tool for creating linguistic fuzzy sets and conditional statements and (2) a fuzzy-logic software library for embedding fuzzy processing capability into current application programs. The graphical-interface tool was developed using the Tcl/Tk programming language. The fuzzy-logic software library was written in the C programming language.
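To show what "linguistic fuzzy sets and conditional statements" can look like in practice, here is a minimal, generic sketch. It is not the Fuzzy Logic Engine's API (that library is written in C and its interface is not described in this record). The variable names, the triangular set boundaries, and the three rules are hypothetical, and the output is combined with a simple weighted average (a zero-order Sugeno-style scheme) chosen for brevity.

```python
def triangle(x: float, a: float, b: float, c: float) -> float:
    """Triangular membership function with feet at a and c and peak at b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


# Linguistic fuzzy sets for a hypothetical input "temperature" (degrees C).
TEMPERATURE = {
    "cold": lambda t: triangle(t, -10.0, 0.0, 20.0),
    "warm": lambda t: triangle(t, 10.0, 20.0, 30.0),
    "hot": lambda t: triangle(t, 20.0, 40.0, 50.0),
}

# Conditional statements: IF temperature IS <label> THEN fan speed = <percent>.
RULES = [("cold", 0.0), ("warm", 50.0), ("hot", 100.0)]


def fan_speed(t: float) -> float:
    """Fire every rule and return the firing-strength-weighted average output."""
    fired = [(TEMPERATURE[label](t), output) for label, output in RULES]
    total = sum(strength for strength, _ in fired)
    if total == 0.0:
        raise ValueError("input lies outside every fuzzy set")
    return sum(strength * output for strength, output in fired) / total


print(fan_speed(20.0))
print(fan_speed(25.0))
```

An input of 25 partly belongs to both "warm" and "hot", so both rules fire and the result falls between their outputs, which is the kind of graded decision on imprecise input that the abstract refers to.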
Cross, Zachariah R.; Kohler, Mark J.; Schlesewsky, Matthias; Gaskell, M. G.; Bornkessel-Schlesewsky, Ina
2018-01-01
We hypothesize a beneficial influence of sleep on the consolidation of the combinatorial mechanisms underlying incremental sentence comprehension. These predictions are grounded in recent work examining the effect of sleep on the consolidation of linguistic information, which demonstrates that sleep-dependent neurophysiological activity consolidates the meaning of novel words and simple grammatical rules. However, the sleep-dependent consolidation of sentence-level combinatorics has not been studied to date. Here, we propose that dissociable aspects of sleep neurophysiology consolidate two different types of combinatory mechanisms in human language: sequence-based (order-sensitive) and dependency-based (order-insensitive) combinatorics. The distinction between the two types of combinatorics is motivated both by cross-linguistic considerations and the neurobiological underpinnings of human language. Unifying this perspective with principles of sleep-dependent memory consolidation, we posit that a function of sleep is to optimize the consolidation of sequence-based knowledge (the when) and the establishment of semantic schemas of unordered items (the what) that underpin cross-linguistic variations in sentence comprehension. This hypothesis builds on the proposal that sleep is involved in the construction of predictive codes, a unified principle of brain function that supports incremental sentence comprehension. Finally, we discuss neurophysiological measures (EEG/MEG) that could be used to test these claims, such as the quantification of neuronal oscillations, which reflect basic mechanisms of information processing in the brain. PMID:29445333
Second language learners who are at-risk for reading disabilities: A growth mixture model study.
Yeung, Susanna S
2018-05-11
This one-year longitudinal study examined the developmental trajectories of English reading in Chinese children learning English as a second language (ESL) and identified cognitive profiles of children who are at risk for English reading disability. One hundred and eighty-four Chinese ESL children from eight Hong Kong kindergartens were measured four times during their last year of kindergarten for phonological awareness, letter knowledge, vocabulary and English word reading. Growth mixture modeling was applied to classify the children based on their growth trajectories in English word reading. Four subgroups of word reading growth were classified, namely high-achieving, fast-growth, slow-growth and low-achieving groups. The cognitive-linguistic skills were compared across different groups with age, non-verbal intelligence and receptive vocabulary in L1 controlled. The results showed that low-achieving groups, who were expected to be at-risk for L2 reading disability, showed deficits in letter-name knowledge, phonemic awareness, and receptive and expressive vocabulary. Fast-growth and high-achieving groups were not distinguishable on the measured cognitive-linguistic skills. Children in the slow-growth group were significantly weaker in phonemic awareness, receptive vocabulary and expressive vocabulary than children in the high-achieving group. Our findings identified specific cognitive-linguistic deficits that were associated with children who are at-risk for reading disability. Implications for the early identification of L2 reading disability were discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.
Structure identification in fuzzy inference using reinforcement learning
NASA Technical Reports Server (NTRS)
Berenji, Hamid R.; Khedkar, Pratap
1993-01-01
In our previous work on the GARIC architecture, we have shown that the system can start with the surface structure of the knowledge base (i.e., the linguistic expression of the rules) and learn the deep structure (i.e., the fuzzy membership functions of the labels used in the rules) by using reinforcement learning. Assuming the surface structure, GARIC refines the fuzzy membership functions used in the consequents of the rules using a gradient descent procedure. This hybrid fuzzy logic and reinforcement learning approach can learn to balance a cart-pole system and to back up a truck to its docking location after a few trials. In this paper, we discuss how to do structure identification using reinforcement learning in fuzzy inference systems. This involves identifying both the surface and the deep structure of the knowledge base. The term set of fuzzy linguistic labels used in describing the values of each control variable must be derived. In this process, splitting a label refers to creating new labels which are more granular than the original label, and merging two labels creates a more general label. Splitting and merging of labels directly transform the structure of the action selection network used in GARIC by increasing or decreasing the number of hidden layer nodes.
Crowdsourcing Language Change with Smartphone Applications
Leemann, Adrian; Kolly, Marie-José; Purves, Ross; Britain, David; Glaser, Elvira
2016-01-01
Crowdsourcing linguistic phenomena with smartphone applications is relatively new. In linguistics, apps have predominantly been developed to create pronunciation dictionaries, to train acoustic models, and to archive endangered languages. This paper presents the first account of how apps can be used to collect data suitable for documenting language change: we created an app, Dialäkt Äpp (DÄ), which predicts users’ dialects. For 16 linguistic variables, users select a dialectal variant from a drop-down menu. DÄ then geographically locates the user’s dialect by suggesting a list of communes where dialect variants most similar to their choices are used. Underlying this prediction are 16 maps from the historical Linguistic Atlas of German-speaking Switzerland, which documents the linguistic situation around 1950. Where users disagree with the prediction, they can indicate what they consider to be their dialect’s location. With this information, the 16 variables can be assessed for language change. Thanks to the playfulness of its functionality, DÄ has reached many users; our linguistic analyses are based on data from nearly 60,000 speakers. Results reveal a relative stability for phonetic variables, while lexical and morphological variables seem more prone to change. Crowdsourcing large amounts of dialect data with smartphone apps has the potential to complement existing data collection techniques and to provide evidence that traditional methods cannot, with normal resources, hope to gather. Nonetheless, it is important to emphasize a range of methodological caveats, including sparse knowledge of users’ linguistic backgrounds (users only indicate age, sex) and users’ self-declaration of their dialect. These are discussed and evaluated in detail here. Findings remain intriguing nevertheless: as a means of quality control, we report that traditional dialectological methods have revealed trends similar to those found by the app. This underlines the validity of the crowdsourcing method. We are presently extending DÄ architecture to other languages. PMID:26726775
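The prediction step described in this abstract (suggesting the communes whose atlas variants are most similar to the user's choices) could be sketched roughly as follows. The abstract does not specify the similarity measure, so a plain count of matching variants is assumed here. The commune names, variable names, and variant codes are placeholders invented for the example and are not data from the Linguistic Atlas of German-speaking Switzerland.

```python
from typing import Dict, List, Tuple

# commune -> {linguistic variable -> variant recorded in the atlas}
Atlas = Dict[str, Dict[str, str]]


def rank_communes(atlas: Atlas, choices: Dict[str, str], top_n: int = 3) -> List[Tuple[str, int]]:
    """Rank communes by how many of the user's chosen variants they share."""
    scores = []
    for commune, variants in atlas.items():
        matches = sum(
            1 for variable, variant in choices.items()
            if variants.get(variable) == variant
        )
        scores.append((commune, matches))
    # Most matches first; ties are broken alphabetically so the order is stable.
    scores.sort(key=lambda item: (-item[1], item[0]))
    return scores[:top_n]


# Placeholder atlas with two variables instead of the app's 16.
ATLAS = {
    "commune_1": {"var_1": "a", "var_2": "b"},
    "commune_2": {"var_1": "a", "var_2": "c"},
    "commune_3": {"var_1": "x", "var_2": "y"},
}

print(rank_communes(ATLAS, {"var_1": "a", "var_2": "c"}))
```

Comparing the top-ranked commune with the location a user reports for their own dialect is then one way to flag variables where present-day usage departs from the historical maps.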
Automated extraction of knowledge for model-based diagnostics
NASA Technical Reports Server (NTRS)
Gonzalez, Avelino J.; Myler, Harley R.; Towhidnejad, Massood; Mckenzie, Frederic D.; Kladke, Robin R.
1990-01-01
The concept of accessing computer-aided design (CAD) databases and extracting a process model automatically is investigated as a possible source for the generation of knowledge bases for model-based reasoning systems. The resulting system, referred to as automated knowledge generation (AKG), uses an object-oriented programming structure and constraint techniques as well as an internal database of component descriptions to generate a frame-based structure that describes the model. The procedure has been designed to be general enough to be easily coupled to CAD systems that feature a database capable of providing label and connectivity data from the drawn system. The AKG system is capable of defining knowledge bases in formats required by various model-based reasoning tools.
Lynx: a database and knowledge extraction engine for integrative medicine
Sulakhe, Dinanath; Balasubramanian, Sandhya; Xie, Bingqing; Feng, Bo; Taylor, Andrew; Wang, Sheng; Berrocal, Eduardo; Dave, Utpal; Xu, Jinbo; Börnigen, Daniela; Gilliam, T. Conrad; Maltsev, Natalia
2014-01-01
We have developed Lynx (http://lynx.ci.uchicago.edu)—a web-based database and a knowledge extraction engine, supporting annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Its underlying knowledge base (LynxKB) integrates various classes of information from >35 public databases and private collections, as well as manually curated data from our group and collaborators. Lynx provides advanced search capabilities and a variety of algorithms for enrichment analysis and network-based gene prioritization to assist the user in extracting meaningful knowledge from LynxKB and experimental data, whereas its service-oriented architecture provides public access to LynxKB and its analytical tools via user-friendly web services and interfaces. PMID:24270788
Automation for System Safety Analysis
NASA Technical Reports Server (NTRS)
Malin, Jane T.; Fleming, Land; Throop, David; Thronesbery, Carroll; Flores, Joshua; Bennett, Ted; Wennberg, Paul
2009-01-01
This presentation describes work to integrate a set of tools to support early model-based analysis of failures and hazards due to system-software interactions. The tools perform and assist analysts in the following tasks: 1) extract model parts from text for architecture and safety/hazard models; 2) combine the parts with library information to develop the models for visualization and analysis; 3) perform graph analysis and simulation to identify and evaluate possible paths from hazard sources to vulnerable entities and functions, in nominal and anomalous system-software configurations and scenarios; and 4) identify resulting candidate scenarios for software integration testing. There has been significant technical progress in model extraction from Orion program text sources, architecture model derivation (components and connections) and documentation of extraction sources. Models have been derived from Internal Interface Requirements Documents (IIRDs) and FMEA documents. Linguistic text processing is used to extract model parts and relationships, and the Aerospace Ontology also aids automated model development from the extracted information. Visualizations of these models assist analysts in requirements overview and in checking consistency and completeness.
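Task 3 in this abstract, finding possible paths from hazard sources to vulnerable entities, is at its core a graph search. The sketch below shows one simple way to enumerate such paths with a depth-first walk over a directed component graph. It is an assumption-laden illustration rather than the tools' actual method, and the component names and connections are hypothetical.

```python
from typing import Dict, Iterable, List, Set


def hazard_paths(
    graph: Dict[str, List[str]],
    sources: Iterable[str],
    targets: Set[str],
) -> List[List[str]]:
    """Enumerate cycle-free paths from any hazard source to any vulnerable target."""
    paths: List[List[str]] = []

    def walk(node: str, path: List[str]) -> None:
        # Record the path when a target is reached via at least one connection.
        if node in targets and len(path) > 1:
            paths.append(list(path))
        for nxt in graph.get(node, []):
            if nxt not in path:  # never revisit a node already on this path
                path.append(nxt)
                walk(nxt, path)
                path.pop()

    for source in sources:
        walk(source, [source])
    return paths


# Hypothetical architecture model: component -> components it can affect.
MODEL = {
    "heater": ["sensor", "valve"],
    "sensor": ["controller"],
    "valve": ["controller"],
    "controller": [],
}

for p in hazard_paths(MODEL, ["heater"], {"controller"}):
    print(" -> ".join(p))
```

Each path returned is a candidate propagation scenario that an analyst could review or turn into an integration test case.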
NASA Astrophysics Data System (ADS)
Chen, Xinying
2014-12-01
Researchers have been discussing the language system theoretically for many years [1]. A well accepted assumption is that language is a complex adaptive system [2] which is hierarchical [3] and contains multiple levels along the meaning-form dimension [4]. Over the last decade or so, driven by the availability of digital language data and the popularity of statistical approaches, many researchers interested in theoretical questions have started to quantitatively describe microscopic linguistic features at a certain level of a language system by using authentic language data. Despite the fruitful findings, one question remains unclear: what does a whole language system look like? To answer this question, the network approach, an analysis method that emphasizes the macro features of structures, has been introduced into linguistic studies [5]. By analyzing the static and dynamic linguistic networks constructed from authentic language data, many macro and micro linguistic features, such as lexical, syntactic or semantic features, have been discovered and successfully applied in linguistic typological studies, revealing the huge potential of linguistic network research [6].
KAM (Knowledge Acquisition Module): A tool to simplify the knowledge acquisition process
NASA Technical Reports Server (NTRS)
Gettig, Gary A.
1988-01-01
Analysts, knowledge engineers and information specialists are faced with increasing volumes of time-sensitive data in text form, either as free text or highly structured text records. Rapid access to the relevant data in these sources is essential. However, due to the volume and organization of the contents, and limitations of human memory and association, frequently: (1) important information is not located in time; (2) reams of irrelevant data are searched; and (3) interesting or critical associations are missed due to physical or temporal gaps involved in working with large files. The Knowledge Acquisition Module (KAM) is a microcomputer-based expert system designed to assist knowledge engineers, analysts, and other specialists in extracting useful knowledge from large volumes of digitized text and text-based files. KAM formulates non-explicit, ambiguous, or vague relations, rules, and facts into a manageable and consistent formal code. A library of system rules or heuristics is maintained to control the extraction of rules, relations, assertions, and other patterns from the text. These heuristics can be added, deleted or customized by the user. The user can further control the extraction process with optional topic specifications. This allows the user to cluster extracts based on specific topics. Because KAM formalizes diverse knowledge, it can be used by a variety of expert systems and automated reasoning applications. KAM can also perform important roles in computer-assisted training and skill development. Current research efforts include the applicability of neural networks to aid in the extraction process and the conversion of these extracts into standard formats.
ERIC Educational Resources Information Center
Shintani, Natsuko
2017-01-01
This study examines the effects of the timing of explicit instruction (EI) on grammatical accuracy. A total of 123 learners were divided into two groups: those with some productive knowledge of past-counterfactual conditionals (+Prior Knowledge) and those without such knowledge (-Prior Knowledge). Each group was divided into four conditions. Two…
2014-01-01
Background Maternity health care available in Canada is based on the needs of women born in Canada and often lacks the flexibility to meet the needs of immigrant women. The purpose of this study was to explore immigrant Chinese women’s experiences in accessing maternity care, the utilization of maternity health services, and the obstacles they perceived in Canada. Methods This descriptive phenomenology study used in-depth semi-structured interviews to examine immigrant Chinese women’s experiences. Fifteen participants were recruited from the Chinese community in Toronto, Canada by using purposive sampling. The interviews were digitally recorded and transcribed verbatim into written Chinese. The transcripts were analyzed using Colaizzi’s (1978) phenomenological method. Results Six themes were extracted from the interviews: (1) preference for linguistically and culturally competent healthcare providers, with obstetricians over midwives, (2) strategies to deal with the inconvenience of the Canadian healthcare system, (3) multiple resources to obtain pregnancy information, (4) the merits of the Canadian healthcare system, (5) the need for culturally sensitive care, and (6) the emergence of alternative supports and the use of private services. Conclusions The findings provide new knowledge and understanding of immigrant Chinese women’s experiences in accessing maternity health services within a large metropolitan Canadian city. Participants described two unique experiences within the themes: preference for linguistically and culturally competent healthcare providers, with obstetricians over midwives, and the emergence of alternative supports and the use of private services. Few studies of immigrant maternity service access have identified these experiences, which may be linked to cultural difference. Further investigation with women from different cultural backgrounds is needed to develop a comprehensive understanding of immigrant women’s experiences with maternity care.
PMID:24602231
The Sociophonetic and Acoustic Vowel Dynamics of Michigan's Upper Peninsula English
NASA Astrophysics Data System (ADS)
Rankinen, Wil A.
The present sociophonetic study examines the English variety in Michigan's Upper Peninsula (UP) based upon a 130-speaker sample from Marquette County. The linguistic variables of interest include seven monophthongs and four diphthongs: 1) front lax, 2) low back, and 3) high back monophthongs and 4) short and 5) long diphthongs. The sample is stratified by the predictor variables of heritage-location, bilingualism, age, sex and class. The aim of the thesis is twofold: 1) to determine the extent of potential substrate effects on a 71-speaker older-aged bilingual and monolingual subset of these UP English speakers focusing on the predictor variables of heritage-location and bilingualism, and 2) to determine the extent of potential exogenous influences on an 85-speaker subset of UP English monolingual speakers by focusing on the predictor variables of heritage-location, age, sex and class. All data were extracted from a reading passage task collected during a sociolinguistic interview and measured instrumentally. The findings of this apparent-time data reveal the presence of lingering effects from substrate sources and developing effects from exogenous sources based upon American and Canadian models of diffusion. The linguistic changes-in-progress from above, led by middle-class females, are taking shape in the speech of UP residents, who are propagating linguistic phenomena typically associated with varieties of Canadian English (i.e., low-back merger, Canadian shift, and Canadian raising); however, the findings also report resistance to such norms by working-class females. Finally, the data also reveal substrate effects demonstrating cases of dialect leveling and maintenance. As a result, the speech spoken in Michigan's Upper Peninsula can presently be described as a unique variety of English comprised of lingering substrate effects as well as exogenous effects modeled from both American and Canadian English linguistic norms.
ERIC Educational Resources Information Center
Pizzioli, Fabrizio; Schelstraete, Marie-Anne
2008-01-01
Purpose: The hypothesis that the linguistic deficit presented by children with specific language impairment (SLI) is caused by limited cognitive resources (e.g., S. Ellis Weismer & L. Hesketh, 1996) was tested against the hypothesis of a limitation in linguistic knowledge (e.g., M. L. Rice, K. Wexler, & P. Cleave, 1995). Method: The study examined…
Fedorenko, Evelina; Nieto-Castañon, Alfonso; Kanwisher, Nancy
2011-01-01
Work in theoretical linguistics and psycholinguistics suggests that human linguistic knowledge forms a continuum between individual lexical items and abstract syntactic representations, with most linguistic representations falling between the two extremes and taking the form of lexical items stored together with the syntactic/semantic contexts in which they frequently occur. Neuroimaging evidence further suggests that no brain region is selectively sensitive to only lexical information or only syntactic information. Instead, all the key brain regions that support high-level linguistic processing have been implicated in both lexical and syntactic processing, suggesting that our linguistic knowledge is plausibly represented in a distributed fashion in these brain regions. Given this distributed nature of linguistic representations, multi-voxel pattern analyses (MVPAs) can help uncover important functional properties of the language system. In the current study we use MVPAs to ask two questions: 1) Do language brain regions differ in how robustly they represent lexical vs. syntactic information? and 2) Do any of the language brain regions distinguish between “pure” lexical information (lists of words) and “pure” abstract syntactic information (jabberwocky sentences) in the pattern of activity? We show that lexical information is represented more robustly than syntactic information across many language regions (with no language region showing the opposite pattern), as evidenced by a better discrimination between conditions that differ along the lexical dimension (sentences vs. jabberwocky, and word lists vs. nonword lists) than between conditions that differ along the syntactic dimension (sentences vs. word lists, and jabberwocky vs. nonword lists). This result suggests that lexical information may play a more critical role than syntax in the representation of linguistic meaning.
We also show that several language regions reliably discriminate between “pure” lexical information and “pure” abstract syntactic information in their patterns of neural activity. PMID:21945850
Fedorenko, Evelina; Nieto-Castañon, Alfonso; Kanwisher, Nancy
2012-03-01
Work in theoretical linguistics and psycholinguistics suggests that human linguistic knowledge forms a continuum between individual lexical items and abstract syntactic representations, with most linguistic representations falling between the two extremes and taking the form of lexical items stored together with the syntactic/semantic contexts in which they frequently occur. Neuroimaging evidence further suggests that no brain region is selectively sensitive to only lexical information or only syntactic information. Instead, all the key brain regions that support high-level linguistic processing have been implicated in both lexical and syntactic processing, suggesting that our linguistic knowledge is plausibly represented in a distributed fashion in these brain regions. Given this distributed nature of linguistic representations, multi-voxel pattern analyses (MVPAs) can help uncover important functional properties of the language system. In the current study we use MVPAs to ask two questions: (1) Do language brain regions differ in how robustly they represent lexical vs. syntactic information? and (2) Do any of the language brain regions distinguish between "pure" lexical information (lists of words) and "pure" abstract syntactic information (jabberwocky sentences) in the pattern of activity? We show that lexical information is represented more robustly than syntactic information across many language regions (with no language region showing the opposite pattern), as evidenced by a better discrimination between conditions that differ along the lexical dimension (sentences vs. jabberwocky, and word lists vs. nonword lists) than between conditions that differ along the syntactic dimension (sentences vs. word lists, and jabberwocky vs. nonword lists). This result suggests that lexical information may play a more critical role than syntax in the representation of linguistic meaning.
We also show that several language regions reliably discriminate between "pure" lexical information and "pure" abstract syntactic information in their patterns of neural activity.
ERIC Educational Resources Information Center
Kinney, Angela
2015-01-01
This study focused on household funds of knowledge or "historically accumulated bodies of knowledge and skills essential for household functioning and well-being" (Gonzalez, Andrade, Civil, & Moll, 2001). A Funds of Knowledge approach provides both a methodological and theoretical lens for educators to understand both themselves and…
ERIC Educational Resources Information Center
Bowles, Melissa A.
2011-01-01
Although claims about explicit and implicit language knowledge are central to many debates in SLA, little research has been dedicated to measuring the two knowledge types (R. Ellis, 2004, 2005). The purpose of this study was to validate the use of the battery of tests reported in Ellis (2005) to measure implicit and explicit language knowledge.…
2012-01-01
Background In recent years, biological event extraction has emerged as a key natural language processing task, aiming to address the information overload problem in accessing the molecular biology literature. The BioNLP shared task competitions have contributed to this recent interest considerably. The first competition (BioNLP'09) focused on extracting biological events from Medline abstracts from a narrow domain, while the theme of the latest competition (BioNLP-ST'11) was generalization and a wider range of text types, event types, and subject domains were considered. We view event extraction as a building block in larger discourse interpretation and propose a two-phase, linguistically-grounded, rule-based methodology. In the first phase, a general, underspecified semantic interpretation is composed from syntactic dependency relations in a bottom-up manner. The notion of embedding underpins this phase and it is informed by a trigger dictionary and argument identification rules. Coreference resolution is also performed at this step, allowing extraction of inter-sentential relations. The second phase is concerned with constraining the resulting semantic interpretation by shared task specifications. We evaluated our general methodology on core biological event extraction and speculation/negation tasks in three main tracks of BioNLP-ST'11 (GENIA, EPI, and ID). Results We achieved competitive results in GENIA and ID tracks, while our results in the EPI track leave room for improvement. One notable feature of our system is that its performance across abstracts and article bodies is stable. Coreference resolution results in minor improvement in system performance. Due to our interest in discourse-level elements, such as speculation/negation and coreference, we provide a more detailed analysis of our system performance in these subtasks.
Conclusions The results demonstrate the viability of a robust, linguistically-oriented methodology, which clearly distinguishes general semantic interpretation from shared task specific aspects, for biological event extraction. Our error analysis pinpoints some shortcomings, which we plan to address in future work within our incremental system development methodology. PMID:22759461
DOE Office of Scientific and Technical Information (OSTI.GOV)
Darby, John L.
LinguisticBelief is a Java computer code that evaluates combinations of linguistic variables using an approximate reasoning rule base. Each variable is comprised of fuzzy sets, and a rule base describes the reasoning on combinations of the variables' fuzzy sets. Uncertainty is considered and propagated through the rule base using the belief/plausibility measure. The mathematics of fuzzy sets, approximate reasoning, and belief/plausibility are complex. Without an automated tool, this complexity precludes their application to all but the simplest of problems. LinguisticBelief automates the use of these techniques, allowing complex problems to be evaluated easily. LinguisticBelief can be used free of charge on any Windows XP machine. This report documents the use and structure of the LinguisticBelief code, and the deployment package for installation on client machines.
d/Deaf and Hard of Hearing Multilingual Learners: The Development of Communication and Language.
Pizzo, Lianna
2016-01-01
The author examines the theory and research relevant to educating d/Deaf and Hard of Hearing Multilingual Learners (DMLs). There is minimal research on this population, yet a synthesis of related theory, research, and practice on spoken-language bilinguals can be used to add to the body of knowledge on these learners. Specifically, the author reports on three major areas: (a) population characteristics of DMLs, (b) theories relevant to understanding the language development of DMLs, and (c) considerations for programs in designing and implementing educational services for DMLs. In the interest of ensuring that children receive the foundation for linguistic success, aspects of linguistically responsive teaching (Lucas & Villegas, 2013) are addressed, with a focus on adopting an asset-based perspective on educating DMLs that honors all of a child's language, identity, and cultural memberships.
Subtle Implicit Language Facts Emerge from the Functions of Constructions
Goldberg, Adele E.
2016-01-01
Much has been written about the unlikelihood of innate, syntax-specific, universal knowledge of language (Universal Grammar) on the grounds that it is biologically implausible, unresponsive to cross-linguistic facts, theoretically inelegant, and implausible and unnecessary from the perspective of language acquisition. While relevant, much of this discussion fails to address the sorts of facts that generative linguists often take as evidence in favor of the Universal Grammar Hypothesis: subtle, intricate, knowledge about language that speakers implicitly know without being taught. This paper revisits a few often-cited such cases and argues that, although the facts are sometimes even more complex and subtle than is generally appreciated, appeals to Universal Grammar fail to explain the phenomena. Instead, such facts are strongly motivated by the functions of the constructions involved. The following specific cases are discussed: (a) the distribution and interpretation of anaphoric one, (b) constraints on long-distance dependencies, (c) subject-auxiliary inversion, and (d) cross-linguistic linking generalizations between semantics and syntax. PMID:26858662
A New Model for Fuzzy Personalized Route Planning Using Fuzzy Linguistic Preference Relation
NASA Astrophysics Data System (ADS)
Nadi, S.; Houshyaripour, A. H.
2017-09-01
This paper proposes a new model for personalized route planning under uncertain conditions. Personalized routing involves different sources of uncertainty. These uncertainties can arise from users' ambiguity about their preferences, imprecise criteria values and the modelling process. The proposed model uses Fuzzy Linguistic Preference Relation Analytical Hierarchical Process (FLPRAHP) to analyse users' preferences under uncertainty. Routing is a multi-criteria task, especially in transportation networks, where users wish to optimize their routes based on different criteria. However, due to the lack of knowledge about the preferences of different users and the uncertainties in the criteria values, we propose a new personalized fuzzy routing method based on fuzzy ranking using center of gravity. The model employs the FLPRAHP method to aggregate uncertain criteria values with respect to uncertain user preferences while improving consistency with the fewest possible comparisons. An illustrative example presents the effectiveness and capability of the proposed model to calculate the best personalized route under fuzziness and uncertainty.
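The center-of-gravity ranking mentioned in this abstract can be illustrated with a minimal sketch. It assumes each route's aggregated score is a triangular fuzzy number (low, mode, high), whose centroid is the mean of its three points; the abstract does not state which membership functions the authors use, and the route names and scores below are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumption: a fuzzy score is a triangular fuzzy number (low, mode, high).
TriFuzzy = Tuple[float, float, float]


def centroid(score: TriFuzzy) -> float:
    """Center of gravity of a triangular fuzzy number."""
    low, mode, high = score
    return (low + mode + high) / 3.0


def rank_routes(scores: Dict[str, TriFuzzy]) -> List[str]:
    """Order route names from highest to lowest defuzzified score."""
    return sorted(scores, key=lambda name: centroid(scores[name]), reverse=True)


# Hypothetical aggregated fuzzy scores for three candidate routes.
routes = {
    "route_a": (0.2, 0.5, 0.8),
    "route_b": (0.4, 0.6, 0.7),
    "route_c": (0.1, 0.3, 0.9),
}
ranking = rank_routes(routes)
```

Defuzzifying before sorting keeps the comparison to a single number per route; other fuzzy ranking schemes would compare the fuzzy numbers directly.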
Considering context: reliable entity networks through contextual relationship extraction
NASA Astrophysics Data System (ADS)
David, Peter; Hawes, Timothy; Hansen, Nichole; Nolan, James J.
2016-05-01
Existing information extraction techniques can only partially address the problem of exploiting unreadably large amounts of text. When discussion of events and relationships is limited to simple, past-tense, factual descriptions of events, current NLP-based systems can identify events and relationships and extract a limited amount of additional information. But the simple subset of available information that existing tools can extract from text is only useful to a small set of users and problems. Automated systems need to find and separate information based on what is threatened or planned to occur, has occurred in the past, or could potentially occur. We address the problem of advanced event and relationship extraction with our event and relationship attribute recognition system, which labels generic, planned, recurring, and potential events. The approach is based on a combination of new machine learning methods, novel linguistic features, and crowd-sourced labeling. The attribute labeler closes the gap between structured event and relationship models and the complicated and nuanced language that people use to describe them. Our operational-quality event and relationship attribute labeler enables Warfighters and analysts to more thoroughly exploit information in unstructured text. This is made possible through 1) More precise event and relationship interpretation, 2) More detailed information about extracted events and relationships, and 3) More reliable and informative entity networks that acknowledge the different attributes of entity-entity relationships.
MMKG: An approach to generate metallic materials knowledge graph based on DBpedia and Wikipedia
NASA Astrophysics Data System (ADS)
Zhang, Xiaoming; Liu, Xin; Li, Xin; Pan, Dongyu
2017-02-01
The research and development of metallic materials are playing an important role in today's society, and in the meanwhile lots of metallic materials knowledge is generated and available on the Web (e.g., Wikipedia) for materials experts. However, due to the diversity and complexity of metallic materials knowledge, the knowledge utilization may encounter much inconvenience. The idea of knowledge graph (e.g., DBpedia) provides a good way to organize the knowledge into a comprehensive entity network. Therefore, the motivation of our work is to generate a metallic materials knowledge graph (MMKG) using available knowledge on the Web. In this paper, an approach is proposed to build MMKG based on DBpedia and Wikipedia. First, we use an algorithm based on directly linked sub-graph semantic distance (DLSSD) to preliminarily extract metallic materials entities from DBpedia according to some predefined seed entities; then based on the results of the preliminary extraction, we use an algorithm, which considers both semantic distance and string similarity (SDSS), to achieve the further extraction. Second, due to the absence of materials properties in DBpedia, we use an ontology-based method to extract properties knowledge from the HTML tables of corresponding Wikipedia Web pages for enriching MMKG. Materials ontology is used to locate materials properties tables as well as to identify the structure of the tables. The proposed approach is evaluated by precision, recall, F1 and time performance, and meanwhile the appropriate thresholds for the algorithms in our approach are determined through experiments. The experimental results show that our approach returns expected performance. A tool prototype is also designed to facilitate the process of building the MMKG as well as to demonstrate the effectiveness of our approach.
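The precision, recall and F1 measures used to evaluate this kind of entity extraction can be sketched as follows. This is only an illustration of the standard set-based definitions, not the authors' evaluation code; the entity names are hypothetical.

```python
from typing import Set, Tuple


def precision_recall_f1(extracted: Set[str], gold: Set[str]) -> Tuple[float, float, float]:
    """Set-based precision, recall and F1 of extracted entities against a gold set."""
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical extraction output and gold-standard entity list.
extracted = {"Steel", "Bronze", "Brass", "Granite"}
gold = {"Steel", "Bronze", "Brass", "Titanium", "Aluminium"}
p, r, f1 = precision_recall_f1(extracted, gold)
```

Raising a similarity threshold typically trades recall for precision, which is why such thresholds are usually tuned experimentally against F1.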
Information Extraction for System-Software Safety Analysis: Calendar Year 2008 Year-End Report
NASA Technical Reports Server (NTRS)
Malin, Jane T.
2009-01-01
This annual report describes work to integrate a set of tools to support early model-based analysis of failures and hazards due to system-software interactions. The tools perform and assist analysts in the following tasks: 1) extract model parts from text for architecture and safety/hazard models; 2) combine the parts with library information to develop the models for visualization and analysis; 3) perform graph analysis and simulation to identify and evaluate possible paths from hazard sources to vulnerable entities and functions, in nominal and anomalous system-software configurations and scenarios; and 4) identify resulting candidate scenarios for software integration testing. There has been significant technical progress in model extraction from Orion program text sources, architecture model derivation (components and connections) and documentation of extraction sources. Models have been derived from Internal Interface Requirements Documents (IIRDs) and FMEA documents. Linguistic text processing is used to extract model parts and relationships, and the Aerospace Ontology also aids automated model development from the extracted information. Visualizations of these models assist analysts in requirements overview and in checking consistency and completeness.
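Task 3 above, identifying possible paths from hazard sources to vulnerable entities, can be illustrated with a small graph search. The sketch below assumes the architecture model is a directed graph stored as an adjacency dictionary; the component names are hypothetical, and the report does not say which search algorithm the tools use.

```python
from typing import Dict, List


def simple_paths(graph: Dict[str, List[str]], source: str, target: str) -> List[List[str]]:
    """Enumerate all cycle-free paths from source to target with an iterative DFS."""
    paths = []
    stack = [(source, [source])]
    while stack:
        node, path = stack.pop()
        if node == target:
            paths.append(path)
            continue
        for neighbor in graph.get(node, []):
            if neighbor not in path:  # skip nodes already on this path
                stack.append((neighbor, path + [neighbor]))
    return paths


# Hypothetical component-connection model: edges point along possible propagation.
architecture = {
    "power_bus": ["controller", "heater"],
    "controller": ["valve_driver"],
    "heater": ["valve_driver"],
    "valve_driver": ["propellant_valve"],
}
paths = simple_paths(architecture, "power_bus", "propellant_valve")
```

Each returned path is a candidate propagation scenario that an analyst could review or turn into an integration test case.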
Linguistic Sources of Skinner's Verbal Behavior
Matos, Maria Amelia; da F. Passos, Maria de Lourdes R.
2006-01-01
Formal and functional analyses of verbal behavior have been often considered to be divergent and incompatible. Yet, an examination of the history of part of the analytical approach used in Verbal Behavior (Skinner, 1957/1992) for the identification and conceptualization of verbal operant units discloses that it corresponds well with formal analyses of languages. Formal analyses have been carried out since the invention of writing and fall within the scope of traditional grammar and structural linguistics, particularly in analyses made by the linguist Leonard Bloomfield. The relevance of analytical instruments originated from linguistic studies (which examine and describe the practices of verbal communities) to the analysis of verbal behavior, as proposed by Skinner, relates to the conception of a verbal community as a prerequisite for the acquisition of verbal behavior. A deliberately interdisciplinary approach is advocated in this paper, with the systematic adoption of linguistic analyses and descriptions adding relevant knowledge to the design of experimental research in verbal behavior. PMID:22478454
Integrating Multiple On-line Knowledge Bases for Disease-Lab Test Relation Extraction.
Zhang, Yaoyun; Soysal, Ergin; Moon, Sungrim; Wang, Jingqi; Tao, Cui; Xu, Hua
2015-01-01
A computable knowledge base containing relations between diseases and lab tests would be a great resource for many biomedical informatics applications. This paper describes our initial step towards establishing a comprehensive knowledge base of disease and lab tests relations utilizing three public on-line resources. LabTestsOnline, MedlinePlus and Wikipedia are integrated to create a freely available, computable disease-lab test knowledgebase. Disease and lab test concepts are identified using MetaMap and relations between diseases and lab tests are determined based on source-specific rules. Experimental results demonstrate a high precision for relation extraction, with Wikipedia achieving the highest precision of 87%. Combining the three sources reached a recall of 51.40%, when compared with a subset of disease-lab test relations extracted from a reference book. Moreover, we found additional disease-lab test relations from on-line resources, indicating they are complementary to existing reference books for building a comprehensive disease and lab test relation knowledge base.
ERIC Educational Resources Information Center
Son, Elena
2015-01-01
The under-preparation in math at the high school and college levels, as well as the low participation of ethnically and linguistically diverse individuals in STEM fields are concerning because their preparation for work in these areas is essential for the U.S. to remain competitive in the innovative knowledge economy. While there is now a…
Ahmad, Sohail; Ismail, Ahmad Izuanuddin; Khan, Tahir Mehmood; Akram, Waqas; Mohd Zim, Mohd Arif; Ismail, Nahlah Elkudssiah
2017-04-01
The stigmatisation degree, self-esteem and knowledge either directly or indirectly influence the control and self-management of asthma. To date, there is no valid and reliable instrument that can assess these key issues collectively. The main aim of this study was to test the reliability and validity of the newly devised and translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" among adult asthma patients using the Rasch measurement model. This cross-sectional study recruited thirty adult asthma patients from two respiratory specialist clinics in Selangor, Malaysia. The newly devised self-administered questionnaire was adapted from relevant publications and translated into the Malay language using international standard translation guidelines. Content and face validation was done. The data were extracted and analysed for real item reliability and construct validation using the Rasch model. The translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" showed high real item reliability values of 0.90, 0.86 and 0.89 for stigmatisation degree, self-esteem, and knowledge of asthma, respectively. Furthermore, all values of point measure correlation (PTMEA Corr) analysis were within the acceptable specified range of the Rasch model. Infit/outfit mean square values and Z standard (ZSTD) values of each item verified the construct validity and suggested retaining all the items in the questionnaire. The reliability analyses and output tables of item measures for construct validation proved the translated Malaysian version of "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" as a valid and highly reliable questionnaire.
Hassanpour, Saeed; O'Connor, Martin J; Das, Amar K
2013-08-12
A variety of informatics approaches have been developed that use information retrieval, NLP and text-mining techniques to identify biomedical concepts and relations within scientific publications or their sentences. These approaches have not typically addressed the challenge of extracting more complex knowledge such as biomedical definitions. In our efforts to facilitate knowledge acquisition of rule-based definitions of autism phenotypes, we have developed a novel semantic-based text-mining approach that can automatically identify such definitions within text. Using an existing knowledge base of 156 autism phenotype definitions and an annotated corpus of 26 source articles containing such definitions, we evaluated and compared the average rank of correctly identified rule definition or corresponding rule template using both our semantic-based approach and a standard term-based approach. We examined three separate scenarios: (1) the snippet of text contained a definition already in the knowledge base; (2) the snippet contained an alternative definition for a concept in the knowledge base; and (3) the snippet contained a definition not in the knowledge base. Our semantic-based approach had a higher average rank than the term-based approach for each of the three scenarios (scenario 1: 3.8 vs. 5.0; scenario 2: 2.8 vs. 4.9; and scenario 3: 4.5 vs. 6.2), with each comparison significant at the p-value of 0.05 using the Wilcoxon signed-rank test. Our work shows that leveraging existing domain knowledge in the information extraction of biomedical definitions significantly improves the correct identification of such knowledge within sentences. Our method can thus help researchers rapidly acquire knowledge about biomedical definitions that are specified and evolving within an ever-growing corpus of scientific publications.
ERIC Educational Resources Information Center
DOLBY, J.L.; AND OTHERS
THE STUDY IS CONCERNED WITH THE LINGUISTIC PROBLEM INVOLVED IN TEXT COMPRESSION--EXTRACTING, INDEXING, AND THE AUTOMATIC CREATION OF SPECIAL-PURPOSE CITATION DICTIONARIES. IN SPITE OF EARLY SUCCESS IN USING LARGE-SCALE COMPUTERS TO AUTOMATE CERTAIN HUMAN TASKS, THESE PROBLEMS REMAIN AMONG THE MOST DIFFICULT TO SOLVE. ESSENTIALLY, THE PROBLEM IS TO…
Akama, Hiroyuki; Miyake, Maki; Jung, Jaeyoung; Murphy, Brian
2015-01-01
In this study, we introduce an original distance definition for graphs, called the Markov-inverse-F measure (MiF). This measure enables the integration of classical graph theory indices with new knowledge pertaining to structural feature extraction from semantic networks. MiF improves the conventional Jaccard and/or Simpson indices, and reconciles both the geodesic information (random walk) and co-occurrence adjustment (degree balance and distribution). We measure the effectiveness of graph-based coefficients through the application of linguistic graph information for a neural activity recorded during conceptual processing in the human brain. Specifically, the MiF distance is computed between each of the nouns used in a previous neural experiment and each of the in-between words in a subgraph derived from the Edinburgh Word Association Thesaurus of English. From the MiF-based information matrix, a machine learning model can accurately obtain a scalar parameter that specifies the degree to which each voxel in (the MRI image of) the brain is activated by each word or each principal component of the intermediate semantic features. Furthermore, correlating the voxel information with the MiF-based principal components, a new computational neurolinguistics model with a network connectivity paradigm is created. This allows two dimensions of context space to be incorporated with both semantic and neural distributional representations.
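For context, the conventional Jaccard and Simpson indices that MiF is said to improve on can be computed from the neighbor sets of two nodes in a word-association graph. The sketch below shows only these baseline indices, not the MiF measure itself, whose definition is not given in the abstract; the neighbor sets are hypothetical.

```python
from typing import Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index: shared neighbors divided by all distinct neighbors."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def simpson(a: Set[str], b: Set[str]) -> float:
    """Simpson (overlap) index: shared neighbors divided by the smaller set size."""
    smaller = min(len(a), len(b))
    return len(a & b) / smaller if smaller else 0.0


# Hypothetical neighbor sets for two words in an association graph.
neighbors = {
    "dog": {"cat", "bone", "bark", "pet"},
    "cat": {"dog", "pet", "milk"},
}
j = jaccard(neighbors["dog"], neighbors["cat"])
s = simpson(neighbors["dog"], neighbors["cat"])
```

Both indices look only at direct neighbors, which is the limitation that path-based (random walk) information is meant to address.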
NASA Astrophysics Data System (ADS)
Stevenson, Alma R.
2013-12-01
This qualitative, sociolinguistic research study examines how bilingual Latino/a students use their linguistic resources in the classroom and laboratory during science instruction. This study was conducted in a school in the southwestern United States serving an economically depressed, predominantly Latino population. The object of study was a fifth grade science class composed entirely of language minority students transitioning out of bilingual education. Therefore, English was the means of instruction in science, supported by informal peer-to-peer Spanish-language communication. This study is grounded in a social constructivist paradigm. From this standpoint, learning science is a social process where social, cultural, and linguistic factors are all considered crucial to the process of acquiring scientific knowledge. The study was descriptive in nature, examining specific linguistic behaviors with the purpose of identifying and analyzing the linguistic functions of students' utterances while participating in science learning. The results suggest that students purposefully adapt their use of linguistic resources in order to facilitate their participation in science learning. What is underscored in this study is the importance of explicitly acknowledging, supporting, and incorporating bilingual students' linguistic resources both in Spanish and English into the science classroom in order to optimize students' participation and facilitate their understanding.
Reusing Design Knowledge Based on Design Cases and Knowledge Map
ERIC Educational Resources Information Center
Yang, Cheng; Liu, Zheng; Wang, Haobai; Shen, Jiaoqi
2013-01-01
Design knowledge was reused for innovative design work to support designers with product design knowledge and help designers who lack rich experiences to improve their design capacity and efficiency. First, based on the ontological model of product design knowledge constructed by taxonomy, implicit and explicit knowledge was extracted from some…
An Overview of OWL, a Language for Knowledge Representation.
ERIC Educational Resources Information Center
Szolovits, Peter; And Others
This is a description of the motivation and overall organization of the OWL language for knowledge representation. OWL consists of a linguistic memory system (LMS), a memory of concepts in terms of which all English phrases and all knowledge of an application domain are represented; a theory of English grammar which tells how to map English…
Measuring University-Level L2 Learners' Implicit and Explicit Linguistic Knowledge
ERIC Educational Resources Information Center
Zhang, Runhan
2015-01-01
Although many theoretical issues revolving around implicit and explicit knowledge in second language (L2) acquisition hinge on the ability to measure these two types of knowledge, few empirical studies have attempted to do so. However, R. Ellis (2005) did develop a battery of tests intended to provide relatively separate measures. This study aims…
Supporting the Development of Number Fact Knowledge in Five- and Six-Year-Olds
ERIC Educational Resources Information Center
Young-Loveridge, Jenny; Bicknell, Brenda
2014-01-01
This paper focuses on children's number fact knowledge from a study that explored the impact of using multiplication and division contexts for developing number understanding with 34 five- and six-year-old children from diverse cultural and linguistic backgrounds. After a series of focused lessons, children's knowledge of number facts, including…
The Centre for Speech, Language and the Brain (CSLB) concept property norms.
Devereux, Barry J; Tyler, Lorraine K; Geertzen, Jeroen; Randall, Billi
2014-12-01
Theories of the representation and processing of concepts have been greatly enhanced by models based on information available in semantic property norms. This information relates both to the identity of the features produced in the norms and to their statistical properties. In this article, we introduce a new and large set of property norms that are designed to be a more flexible tool to meet the demands of many different disciplines interested in conceptual knowledge representation, from cognitive psychology to computational linguistics. As well as providing all features listed by 2 or more participants, we also show the considerable linguistic variation that underlies each normalized feature label and the number of participants who generated each variant. Our norms are highly comparable with the largest extant set (McRae, Cree, Seidenberg, & McNorgan, 2005) in terms of the number and distribution of features. In addition, we show how the norms give rise to a coherent category structure. We provide these norms in the hope that the greater detail available in the Centre for Speech, Language and the Brain norms should further promote the development of models of conceptual knowledge. The norms can be downloaded at www.csl.psychol.cam.ac.uk/propertynorms.
Hopf, Suzanne C
2018-02-01
Receipt of accessible and appropriate specialist services and resources by all people with communication and/or swallowing disability is a human right; however, it is a right rarely achieved in either Minority or Majority World contexts. This paper considers communication specialists' efforts to provide sustainable services for people with communication difficulties living in Majority World countries. The commentary draws on human rights literature, particularly Article 19 of the Universal Declaration of Human Rights and the Communication Capacity Research program that includes: (1) gathering knowledge from policy and literature; (2) gathering knowledge from the community; (3) understanding speech, language and literacy use and proficiency; and (4) developing culturally and linguistically appropriate resources and assessments. To inform the development of resources and assessments that could be used by speech-language pathologists as well as other communication specialists in Fiji, the Communication Capacity Research program involved collection and analysis of data from multiple sources including 144 community members, 75 school students and their families, and 25 teachers. The Communication Capacity Research program may be applicable for achieving the development of evidence-based, culturally and linguistically sustainable SLP services in similar contexts.
Cultural Artifacts as Scaffolds for Genre Development.
ERIC Educational Resources Information Center
Kamberelis, George; Bovino, Thomas D.
1999-01-01
Shows that children in the primary grades possessed considerable working knowledge of the cultural conventions of narrative genres but much less working knowledge of the cultural conventions of informational genres. Reveals grade-related developmental differences for some dimensions of linguistic and textual organization. Shows that cultural…
Discourse Analysis and Social Construction.
ERIC Educational Resources Information Center
Bazerman, Charles
1990-01-01
A brief review of social constructivism as a general movement and how it has been applied in particular to scientific knowledge precedes a review of investigations into the role language and linguistic activities have in the social construction of knowledge. A 39-citation unannotated bibliography is included. (CB)
Fuzziness In Approximate And Common-Sense Reasoning In Knowledge-Based Robotics Systems
NASA Astrophysics Data System (ADS)
Dodds, David R.
1987-10-01
Fuzzy functions, a major key to inexact reasoning, are described as they are applied to the fuzzification of robot co-ordinate systems. Linguistic-variables, a means of labelling ranges in fuzzy sets, are used as computationally pragmatic means of representing spatialization metaphors, themselves an extraordinarily rich basis for understanding concepts in orientational terms. Complex plans may be abstracted and simplified in a system which promotes conceptual planning by means of the orientational representation.
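As a rough illustration of how linguistic variables can label ranges of a robot coordinate, the sketch below fuzzifies a single axis value with triangular membership functions. The term names, the breakpoints and the choice of triangular shapes are all assumptions made for this example; the abstract does not specify them.

```python
def triangular(x: float, left: float, peak: float, right: float) -> float:
    """Triangular membership: 0 outside (left, right), 1 at peak, linear between."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


# Hypothetical linguistic variable "distance along one robot axis" (metres).
DISTANCE_TERMS = {
    "near": (0.0, 1.0, 2.0),
    "medium": (1.0, 3.0, 5.0),
    "far": (4.0, 6.0, 8.0),
}


def fuzzify(x: float, terms=DISTANCE_TERMS) -> dict:
    """Map a crisp coordinate to its degree of membership in each linguistic term."""
    return {label: triangular(x, *abc) for label, abc in terms.items()}


memberships = fuzzify(1.5)
```

A crisp value of 1.5 m belongs partly to "near" and partly to "medium", which is the kind of graded, overlapping labelling that inexact reasoning over spatial terms relies on.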
Controlled English for Effective Communication during Coalition Operations
2013-06-01
Linguistic variations and cultural differences often create unexpected challenges for effective communication and thus for Command and Control (C2...CE), and CE-based tools to improve cross-linguistic/cross-cultural communication. We will discuss various types of linguistic variations and cultural...human-computer interaction, reasoning, and explanation CE and CE-based tools can play an important role in facilitating cross-linguistic and cross
2007-05-01
The paper falls under the tradition of cultural relativism, and the modern version of linguistic relativity. Culture is a multi-faceted concept that...a theory of culture as socially distributed knowledge. Due to the particular subject related to meaning and linguistic contact, the paper makes...culture being studied, and language reflects the particular history and traditions of those who speak it, this amount of
NASA Astrophysics Data System (ADS)
Ehrentreich, F.; Dietze, U.; Meyer, U.; Abbas, S.; Schulz, H.
1995-04-01
A main task within the SpecInfo project is to develop interpretation tools that can handle many more of the complicated, more specific spectrum-structure correlations. In the first step, the empirical knowledge about the assignment of structural groups and their characteristic IR bands was collected from the literature and represented in a computer-readable, well-structured form. Vague, verbal rules are managed by the introduction of linguistic variables. The next step was the development of automatic rule-generating procedures. We combined and extended the IDIOTS algorithm with the algorithm by Blaffert, which relies on set theory. The procedures were successfully applied to the SpecInfo database. The realization of the preceding items is a prerequisite for the improvement of the computerized structure elucidation procedure.
Toward a theory of distributed word expert natural language parsing
NASA Technical Reports Server (NTRS)
Rieger, C.; Small, S.
1981-01-01
An approach to natural language meaning-based parsing in which the unit of linguistic knowledge is the word rather than the rewrite rule is described. In the word expert parser, knowledge about language is distributed across a population of procedural experts, each representing a word of the language, and each an expert at diagnosing that word's intended usage in context. The parser is structured around a coroutine control environment in which the generator-like word experts ask questions and exchange information in coming to collective agreement on sentence meaning. The word expert theory is advanced as a better cognitive model of human language expertise than the traditional rule-based approach. The technical discussion is organized around examples taken from the prototype LISP system which implements parts of the theory.
Inefficient conjunction search made efficient by concurrent spoken delivery of target identity.
Reali, Florencia; Spivey, Michael J; Tyler, Melinda J; Terranova, Joseph
2006-08-01
Visual search based on a conjunction of two features typically elicits reaction times that increase linearly as a function of the number of distractors, whereas search based on a single feature is essentially unaffected by set size. These and related findings have often been interpreted as evidence of a serial search stage that follows a parallel search stage. However, a wide range of studies has been showing a form of blending of these two processes. For example, when a spoken instruction identifies the conjunction target concurrently with the visual display, the effect of set size is significantly reduced, suggesting that incremental linguistic processing of the first feature adjective and then the second feature adjective may facilitate something approximating a parallel extraction of objects during search for the target. Here, we extend these results to a variety of experimental designs. First, we replicate the result with a mixed-trials design (ruling out potential strategies associated with the blocked design of the original study). Second, in a mixed-trials experiment, the order of adjective types in the spoken query varies randomly across conditions. In a third experiment, we extend the effect to a triple-conjunction search task. A fourth (control) experiment demonstrates that these effects are not due to an efficient odd-one-out search that ignores the linguistic input. This series of experiments, along with attractor-network simulations of the phenomena, provide further evidence toward understanding linguistically mediated influences in real-time visual search processing.
Non-linguistic learning in aphasia: Effects of training method and stimulus characteristics
Vallila-Rohter, Sofia; Kiran, Swathi
2013-01-01
Purpose: The purpose of the current study was to explore non-linguistic learning ability in patients with aphasia, examining the impact of stimulus typicality and feedback on success with learning. Method: Eighteen patients with aphasia and eight healthy controls participated in this study. All participants completed four computerized, non-linguistic category-learning tasks. We probed learning ability under two methods of instruction: feedback-based (FB) and paired-associate (PA). We also examined the impact of task complexity on learning ability, comparing two stimulus conditions: typical (Typ) and atypical (Atyp). Performance was compared between groups and across conditions. Results: Results demonstrated that healthy controls were able to successfully learn categories under all conditions. For our patients with aphasia, two patterns of performance arose. One subgroup of patients was able to maintain learning across task manipulations and conditions. The other subgroup of patients demonstrated a sensitivity to task complexity, learning successfully only in the typical training conditions. Conclusions: Results support the hypothesis that impairments of general learning are present in aphasia. Some patients demonstrated the ability to extract category information under complex training conditions, while others learned only under conditions that were simplified and emphasized salient category features. Overall, the typical training condition facilitated learning for all participants. Findings have implications for therapy, which are discussed. PMID:23695914
Høy, A
1996-12-02
In a previous article in the Journal of the Danish Medical Association, "Medical Terminology Linguistitis (I)", the author pointed out the importance of involving the users in connection with the definition of a language policy concerning Danish medical terminology and future medical doctors' need for language proficiency. A recent inquiry, involving 11 medical doctors (Ph.D. students), suggests that there is in fact a need for a language policy. Also, the respondents expressed strong antipathy against a full nationalization of the terms which can in some cases be observed in the Danish version of ICD-10. On the basis of this survey, the following questions are discussed in this article: Future medical doctors' need for knowledge of Latin and English, medical doctors' opinions concerning a terminology based on Latin, or Danish, and some important aspects concerning the definition of a language policy in the medical area.
Knowledge in motion: The cultural politics of modern science translations in Arabic.
Elshakry, Marwa S
2008-12-01
This essay examines the problem of the global circulation of modern scientific knowledge by looking at science translations in modern Arabic. In the commercial centers of the late Ottoman Empire, emerging transnational networks lay behind the development of new communities of knowledge, many of which sought to break with old linguistic and literary norms to redefine the basis of their authority. Far from acting as neutral purveyors of "universal truths," scientific translations thus served as key instruments in this ongoing process of sociopolitical and epistemological transformation and mediation. Fierce debates over translators' linguistic strategies and choices involved deliberations over the character of language and the nature of "science" itself. They were also crucially shaped by such geopolitical factors as the rise of European imperialism and anticolonial nationalism in the region. The essay concludes by arguing for the need for greater attention to the local factors involved in the translation of scientific concepts across borders.
Bhutani, Jaikrit; Kalra, Sanjay; Bhutani, Sukriti; Kalra, Bharti
2013-01-01
Introduction: Cross-cultural differences in the perception of menopausal symptoms are well known, and such differences in the perception of hypoglycemic symptoms have been reported in Russian-speaking and Caucasian postmenopausal women. Aims and objectives: This study assessed cross-linguistic and cross-cultural differences in the symptomatology of self-reported hypoglycemia between Punjabi-speaking and Hindi-speaking diabetic postmenopausal women. Material and Methods: Thirty Punjabi-speaking and 20 Hindi-speaking diabetic postmenopausal women aged over 50 years were recruited for this study. Each subject was asked, "What happens to you when you have low sugar?" in the language of her choice, and spontaneous answers were recorded verbatim. Statistical analysis: The data so obtained were first tallied by the paper-and-pen method to obtain the frequency of self-reporting of various symptoms and then analyzed using Statistical Package for Social Science ver. 19.0. Results: Symptoms of hollowness, cold sweats and headache correlated significantly (P < 0.0001, P = 0.0001 and P = 0.03 respectively). One difference was noted in women from rural vs. urban backgrounds: inability to concentrate was more frequent in urban women (4/23) vs. rural women (0/27) (P < 0.0001). Discussion: To our knowledge, this is the first exploratory work highlighting the differences in self-reported hypoglycemia symptomatology based on linguistic background. In India and other countries with multi-ethnic, multi-linguistic societies, linguistic competence in hypoglycemia history taking is important. Limitations: Incidence of hypoglycemia in the subjects enrolled was not assessed. Many of the subjects in the Punjabi-speaking cohort were bilingual. Some symptoms of hypoglycemia may have been missed or over-reported by participants. Conclusion: Diabetes care professionals should be aware that persons with diabetes from varying linguistic backgrounds may report symptoms of hypoglycemia differently. PMID:24251188
ERIC Educational Resources Information Center
Christensen, Ken Ramshoj; Kizach, Johannes; Nyvad, Anne Mette
2013-01-01
In the syntax literature, it is commonly assumed that a constraint on linguistic competence blocks extraction of "wh-"expressions (e.g. "what" or "which book") from embedded questions, referred to as "wh-"islands. Furthermore, it is assumed that there is an argument/adjunct asymmetry in extraction from "wh-"islands. We report results from two…
Educator Beliefs and Cultural Knowledge: Implications for School Improvement Efforts
ERIC Educational Resources Information Center
Nelson, Sarah W.; Guerra, Patricia L.
2014-01-01
Purpose: This qualitative study reports on beliefs practicing educators hold about diverse students and families. Specifically, this study examined educator beliefs related to culturally, linguistically, and economically diverse students and families along with participants' knowledge of culture and its application in practice. Research Design:…
The Future of Digital Working: Knowledge Migration and Learning
ERIC Educational Resources Information Center
Malcolm, Irene
2014-01-01
Against the backdrop of intensified migration linked to globalisation, this article considers the implications of knowledge migration for future digital workers. It draws empirically on a socio-material analysis of the international software localisation industry. Localisers' work requires linguistic, cultural and software engineering skills to…
Knowing How We Know: Evidentiality and Cognitive Development
ERIC Educational Resources Information Center
Matsui, Tomoko; Fitneva, Stanka A.
2009-01-01
Evidentials are grammatical elements such as affixes and particles indicating the source of knowledge. We provide an overview of this grammatical category and consider three research domains to which developmental studies on evidentiality contribute: the acquisition of linguistic means to characterize knowledge, the conceptual understanding of…
ERIC Educational Resources Information Center
Cooper, Henry S. F., Jr.
2000-01-01
In northeastern Peru, U.S. mammalogists are tapping the knowledge of the indigenous Matses people to catalog the fauna of the rainforest. One zoologist turned linguist has recruited Matses research assistants; together they are creating a classroom text on mammals in Matses, which will help preserve both local ethnobiological knowledge and the…
Linguistic Processing of Accented Speech Across the Lifespan
Cristia, Alejandrina; Seidl, Amanda; Vaughn, Charlotte; Schmale, Rachel; Bradlow, Ann; Floccia, Caroline
2012-01-01
In most of the world, people have regular exposure to multiple accents. Therefore, learning to quickly process accented speech is a prerequisite to successful communication. In this paper, we examine work on the perception of accented speech across the lifespan, from early infancy to late adulthood. Unfamiliar accents initially impair linguistic processing by infants, children, younger adults, and older adults, but listeners of all ages come to adapt to accented speech. Emergent research also goes beyond these perceptual abilities, by assessing links with production and the relative contributions of linguistic knowledge and general cognitive skills. We conclude by underlining points of convergence across ages, and the gaps left to face in future work. PMID:23162513
Preserved processing of musical structure in a person with agrammatic aphasia.
Slevc, L Robert; Faroqi-Shah, Yasmeen; Saxena, Sadhvi; Okada, Brooke M
2016-12-01
Evidence for shared processing of structure (or syntax) in language and in music conflicts with neuropsychological dissociations between the two. However, while harmonic structural processing can be impaired in patients with spared linguistic syntactic abilities (Peretz, I. (1993). Auditory atonalia for melodies. Cognitive Neuropsychology, 10, 21-56. doi:10.1080/02643299308253455), evidence for the opposite dissociation (preserved harmonic processing despite agrammatism) is largely lacking. Here, we report one such case: HV, a former musician with Broca's aphasia and agrammatic speech, was impaired in making linguistic, but not musical, acceptability judgments. Similarly, she showed no sensitivity to linguistic structure, but normal sensitivity to musical structure, in implicit priming tasks. To our knowledge, this is the first non-anecdotal report of a patient with agrammatic aphasia demonstrating preserved harmonic processing abilities, supporting claims that aspects of musical and linguistic structure rely on distinct neural mechanisms.
NASA Astrophysics Data System (ADS)
Colucci-Gray, Laura; Perazzone, Anna; Dodman, Martin; Camino, Elena
2013-03-01
In this three-part article we seek to establish connections between the emerging framework of sustainability science and the methodological basis of research and practice in science education in order to bring forth knowledge and competences for sustainability. The first and second parts deal with the implications of taking a sustainability view in relation to knowledge processes. The complexity, uncertainty and urgency of global environmental problems challenge the foundations of reductionist Western science. Within this debate, the proposal of sustainability science advocates inter-disciplinary and inter-paradigmatic collaboration, and it includes the requirements of post-normal science, proposing a respectful dialogue between experts and non-experts in the construction of new scientific knowledge. Such a change of epistemology is rooted in participation, deliberation and the gathering of extended facts, where cultural framings and values are the hard components in the face of soft facts. A reflection on language and communication processes is thus the focus of knowledge practices and educational approaches aimed at sustainability. Language contains the roots of conceptual thinking (including scientific knowledge), and each culture and society is defined and limited by the language that is used to describe and act upon the world. Within a scenario of sustainability, a discussion of scientific language is in order to retrace the connections between language and culture, and to promote a holistic view based on pluralism and dialogue. Drawing on the linguistic reflection, the third part gives examples of teaching and learning situations involving prospective science teachers in action-research contexts: these activities are set out to promote linguistic integration and to introduce reflexive processes into science learning. Discussion will focus on the methodological features of a learning process that is akin to a communal and emancipatory research process within a sustainability scenario.
Enhancing acronym/abbreviation knowledge bases with semantic information.
Torii, Manabu; Liu, Hongfang
2007-10-11
In the biomedical domain, a terminology knowledge base that associates acronyms/abbreviations (denoted as SFs) with their definitions (denoted as LFs) is highly needed. For the construction of such a terminology knowledge base, we investigate the feasibility of building a system that automatically assigns semantic categories to LFs extracted from text. Given a collection of pairs (SF,LF) derived from text, we i) assess the coverage of LFs and pairs (SF,LF) in the UMLS and justify the need for a semantic category assignment system; and ii) automatically derive name phrases annotated with semantic category and construct a system using machine learning. Utilizing ADAM, an existing collection of (SF,LF) pairs extracted from MEDLINE, our system achieved an f-measure of 87% when assigning eight UMLS-based semantic groups to LFs. The system has been incorporated into a web interface which integrates SF knowledge from multiple SF knowledge bases. Web site: http://gauss.dbb.georgetown.edu/liblab/SFThesurus.
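For readers unfamiliar with the metric, the f-measure reported above is conventionally the harmonic mean of precision and recall. The sketch below shows that standard computation under the assumption that the balanced (F1) variant is meant; the counts are made up for illustration and are not taken from the study.

```python
def f_measure(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """F-beta score from true positives, false positives and false negatives."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative counts only: 8 correct category assignments,
# 2 spurious assignments and 4 missed ones.
score = f_measure(tp=8, fp=2, fn=4)
```

With beta = 1 the score weights precision and recall equally; a larger beta would favour recall.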
A linguistic geometry for 3D strategic planning
NASA Technical Reports Server (NTRS)
Stilman, Boris
1995-01-01
This paper is a new step in the development and application of Linguistic Geometry. This formal theory is intended to discover the inner properties of human expert heuristics, which have been successful in a certain class of complex control systems, and apply them to different systems. In this paper we investigate heuristics extracted in the form of hierarchical networks of planning paths of autonomous agents. Employing Linguistic Geometry tools, the dynamic hierarchy of networks is represented as a hierarchy of formal attribute languages. The main ideas of this methodology are illustrated with a new pilot example: the solution of an extremely complex 3D optimization problem of strategic planning for the space combat of autonomous vehicles. This example demonstrates deep and highly selective search in comparison with conventional search algorithms.
ERIC Educational Resources Information Center
Pearl, Lisa S.
2011-01-01
Parametric systems have been proposed as models of how humans represent knowledge about language, motivated in part as a way to explain children's rapid acquisition of linguistic knowledge. Given this, it seems reasonable to examine if children with knowledge of parameters could in fact acquire the adult system from the data available to them.…
ERIC Educational Resources Information Center
Oh, Eunjou
2016-01-01
The present study investigated the relative contributions of vocabulary knowledge, grammar knowledge, and processing speed to second language listening and reading comprehension. Seventy-five Korean university students participated in the study. Results showed the three tested components had a significant portion of shared variance in explaining…
Working Memory for Linguistic and Non-linguistic Manual Gestures: Evidence, Theory, and Application
Rudner, Mary
2018-01-01
Linguistic manual gestures are the basis of sign languages used by deaf individuals. Working memory and language processing are intimately connected and thus when language is gesture-based, it is important to understand related working memory mechanisms. This article reviews work on working memory for linguistic and non-linguistic manual gestures and discusses theoretical and applied implications. Empirical evidence shows that there are effects of load and stimulus degradation on working memory for manual gestures. These effects are similar to those found for working memory for speech-based language. Further, there are effects of pre-existing linguistic representation that are partially similar across language modalities. But above all, deaf signers score higher than hearing non-signers on an n-back task with sign-based stimuli, irrespective of their semantic and phonological content, but not with non-linguistic manual actions. This pattern may be partially explained by recent findings relating to cross-modal plasticity in deaf individuals. It suggests that in linguistic gesture-based working memory, semantic aspects may outweigh phonological aspects when processing takes place under challenging conditions. The close association between working memory and language development should be taken into account in understanding and alleviating the challenges faced by deaf children growing up with cochlear implants as well as other clinical populations. PMID:29867655
Rule Extracting based on MCG with its Application in Helicopter Power Train Fault Diagnosis
NASA Astrophysics Data System (ADS)
Wang, M.; Hu, N. Q.; Qin, G. J.
2011-07-01
A method was proposed for extracting decision rules for fault diagnosis from incomplete historical test records, in support of knowledge-based damage assessment of helicopter power train structures. The method directly extracts the optimal generalized decision rules from incomplete information based on granular computing (GrC). Based on semantic analysis of unknown attribute values, the granule was extended to handle incomplete information. The maximum characteristic granule (MCG) was defined based on the characteristic relation and used to construct the resolution function matrix. The optimal general decision rule was introduced and, using the basic equivalent forms of propositional logic, rules were extracted and reduced from the incomplete information table. The application of the method was presented through a power train fault diagnosis example, and its validity for knowledge acquisition was demonstrated.
Liu, Bo; Wu, Huayi; Wang, Yandong; Liu, Wenming
2015-01-01
Main road features extracted from remotely sensed imagery play an important role in many civilian and military applications, such as updating Geographic Information System (GIS) databases, urban structure analysis, spatial data matching and road navigation. Current methods for road feature extraction from high-resolution imagery are typically based on threshold value segmentation. It is difficult, however, to completely separate road features from the background. We present a new method for extracting main roads from high-resolution grayscale imagery based on directional mathematical morphology and prior knowledge obtained from the Volunteered Geographic Information found in OpenStreetMap. The two salient steps in this strategy are: (1) using directional mathematical morphology to enhance the contrast between roads and non-roads; (2) using OpenStreetMap roads as prior knowledge to segment the remotely sensed imagery. Experiments were conducted on two ZiYuan-3 images and one QuickBird high-resolution grayscale image to compare our proposed method to other commonly used techniques for road feature extraction. The results demonstrated the validity and better performance of the proposed method for urban main road feature extraction. PMID:26397832
Common Ground? How the Encoding of Specialist Vocabulary Affects Peer-to-Peer Online Discourse
ERIC Educational Resources Information Center
Paus, Elisabeth; Jucks, Regina
2012-01-01
Using the same specialist terms in online discourse can indicate knowledge overlaps between partners. However, linguistic overlaps do not automatically ensure overlaps in conceptual representations. In particular, learning situations, which typically focus on knowledge acquisition, require a sufficient understanding of domain-specific concepts.…
Template Authoring Environment for the Automatic Generation of Narrative Content
ERIC Educational Resources Information Center
Caropreso, Maria Fernanda; Inkpen, Diana; Keshtkar, Fazel; Khan, Shahzad
2012-01-01
Natural Language Generation (NLG) systems can make data accessible in an easily digestible textual form; but using such systems requires sophisticated linguistic and sometimes even programming knowledge. We have designed and implemented an environment for creating and modifying NLG templates that requires no programming knowledge, and can operate…
ERIC Educational Resources Information Center
Rosenberg, Katharina
2012-01-01
In conversations between immigrants and officials, problems of understanding are often noticeable. About 280 recordings realised at the Argentine Aliens' Department and at several public authorities in Germany show that knowledge divergences regarding linguistic, cultural and institutional knowledge result in (sometimes grave) difficulties of…
ERIC Educational Resources Information Center
Aiello, Angelo; And Others
1986-01-01
A form is presented for language teacher self-evaluation concerning attitudes and knowledge about learning theories, general linguistics, sociolinguistics, pragmatics, discourse analysis, teaching methodology, the communicative approach, class activities, class management, instructional support, and evaluation. (MSE)
NPI Licensing and Beyond: Children's Knowledge of the Semantics of "Any"
ERIC Educational Resources Information Center
Tieu, Lyn; Lidz, Jeffrey
2016-01-01
This article presents a study of preschool-aged children's knowledge of the semantics of the negative polarity item (NPI) "any". NPIs like "any" differ in distribution from non-polarity-sensitive indefinites like "a": "Any" is restricted to downward-entailing linguistic environments (Fauconnier 1975, 1979;…
Epistemics and Expertise in Peer Tutoring Interactions: Co-Constructing Knowledge of Spanish
ERIC Educational Resources Information Center
Back, Michele
2016-01-01
Peer tutoring is viewed as a valuable component of additional language learning due to the presence of a more knowledgeable interlocutor. Yet researchers and language program directors alike often ignore the linguistic and cultural differences that peer tutors possess, instead categorizing them homogeneously as "experts" or "native…
Linguistic Predictors of Cultural Identification in Bilinguals
ERIC Educational Resources Information Center
Schroeder, Scott R.; Lam, Tuan Q.; Marian, Viorica
2017-01-01
Most of the world's population has knowledge of at least two languages. Many of these bilinguals are also exposed to and identify with at least two cultures. Because language knowledge enables participation in cultural practices and expression of cultural beliefs, bilingual experience and cultural identity are interconnected. However, the specific…
Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov
Xu, Jun; Lee, Hee-Jin; Zeng, Jia; Wu, Yonghui; Zhang, Yaoyun; Huang, Liang-Chin; Johnson, Amber; Holla, Vijaykumar; Bailey, Ann M; Cohen, Trevor; Meric-Bernstam, Funda; Bernstam, Elmer V
2016-01-01
Objective: Clinical trials investigating drugs that target specific genetic alterations in tumors are important for promoting personalized cancer therapy. The goal of this project is to create a knowledge base of cancer treatment trials with annotations about genetic alterations from ClinicalTrials.gov. Methods: We developed a semi-automatic framework that combines advanced text-processing techniques with manual review to curate genetic alteration information in cancer trials. The framework consists of a document classification system to identify cancer treatment trials from ClinicalTrials.gov and an information extraction system to extract gene and alteration pairs from the Title and Eligibility Criteria sections of clinical trials. By applying the framework to trials at ClinicalTrials.gov, we created a knowledge base of cancer treatment trials with genetic alteration annotations. We then evaluated each component of the framework against manually reviewed sets of clinical trials and generated descriptive statistics of the knowledge base. Results and Discussion: The automated cancer treatment trial identification system achieved a high precision of 0.9944. Together with the manual review process, it identified 20 193 cancer treatment trials from ClinicalTrials.gov. The automated gene-alteration extraction system achieved a precision of 0.8300 and a recall of 0.6803. After validation by manual review, we generated a knowledge base of 2024 cancer trials that are labeled with specific genetic alteration information. Analysis of the knowledge base revealed the trend of increased use of targeted therapy for cancer, as well as top frequent gene-alteration pairs of interest. We expect this knowledge base to be a valuable resource for physicians and patients who are seeking information about personalized cancer therapy. PMID:27013523
Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.
Xu, Jun; Lee, Hee-Jin; Zeng, Jia; Wu, Yonghui; Zhang, Yaoyun; Huang, Liang-Chin; Johnson, Amber; Holla, Vijaykumar; Bailey, Ann M; Cohen, Trevor; Meric-Bernstam, Funda; Bernstam, Elmer V; Xu, Hua
2016-07-01
Clinical trials investigating drugs that target specific genetic alterations in tumors are important for promoting personalized cancer therapy. The goal of this project is to create a knowledge base of cancer treatment trials with annotations about genetic alterations from ClinicalTrials.gov. We developed a semi-automatic framework that combines advanced text-processing techniques with manual review to curate genetic alteration information in cancer trials. The framework consists of a document classification system to identify cancer treatment trials from ClinicalTrials.gov and an information extraction system to extract gene and alteration pairs from the Title and Eligibility Criteria sections of clinical trials. By applying the framework to trials at ClinicalTrials.gov, we created a knowledge base of cancer treatment trials with genetic alteration annotations. We then evaluated each component of the framework against manually reviewed sets of clinical trials and generated descriptive statistics of the knowledge base. The automated cancer treatment trial identification system achieved a high precision of 0.9944. Together with the manual review process, it identified 20 193 cancer treatment trials from ClinicalTrials.gov. The automated gene-alteration extraction system achieved a precision of 0.8300 and a recall of 0.6803. After validation by manual review, we generated a knowledge base of 2024 cancer trials that are labeled with specific genetic alteration information. Analysis of the knowledge base revealed the trend of increased use of targeted therapy for cancer, as well as top frequent gene-alteration pairs of interest. We expect this knowledge base to be a valuable resource for physicians and patients who are seeking information about personalized cancer therapy. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
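The reported precision and recall can be combined into a single F1 score, which the abstract itself does not state. The following is a minimal sketch, assuming the standard harmonic-mean definition; the function name `f1_score` is illustrative and not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract for the gene-alteration extraction system.
f1 = f1_score(0.8300, 0.6803)
print(round(f1, 4))
```

Under this definition the extraction system's F1 comes out at roughly 0.75, noticeably below its precision because recall is the weaker of the two figures.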
Information categorization approach to literary authorship disputes
NASA Astrophysics Data System (ADS)
Yang, Albert C.-C.; Peng, C.-K.; Yien, H.-W.; Goldberger, Ary L.
2003-11-01
Scientific analysis of the linguistic styles of different authors has generated considerable interest. We present a generic approach to measuring the similarity of two symbolic sequences that requires minimal background knowledge about a given human language. Our analysis is based on word rank order-frequency statistics and phylogenetic tree construction. We demonstrate the applicability of this method to historic authorship questions related to the classic Chinese novel “The Dream of the Red Chamber,” to the plays of William Shakespeare, and to the Federalist papers. This method may also provide a simple approach to other large databases based on their information content.
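The idea of comparing texts by word rank order-frequency statistics can be illustrated with a small sketch. This is not the authors' exact measure: it assumes a simplified distance (the mean absolute rank difference over shared words), whitespace tokenization, and hypothetical helper names `rank_map` and `rank_distance`.

```python
from collections import Counter


def rank_map(text: str, top_n: int = 100) -> dict:
    """Map each of the top_n most frequent words to its frequency rank (1 = most frequent)."""
    counts = Counter(text.lower().split())
    # Sort by descending count, breaking ties alphabetically for determinism.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]
    return {word: rank for rank, (word, _) in enumerate(ordered, start=1)}


def rank_distance(text_a: str, text_b: str, top_n: int = 100) -> float:
    """Mean absolute rank difference over words ranked in both texts (assumed metric)."""
    ranks_a = rank_map(text_a, top_n)
    ranks_b = rank_map(text_b, top_n)
    shared = set(ranks_a) & set(ranks_b)
    if not shared:
        return float("inf")  # no common vocabulary to compare
    return sum(abs(ranks_a[w] - ranks_b[w]) for w in shared) / len(shared)
```

A pairwise matrix of such distances is the kind of input a tree-construction method would take, which is how the abstract describes grouping texts by author.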
The role of language in shaping international migration
Adserà, Alícia; Pytliková, Mariola
2016-01-01
This paper examines the importance of language in international migration from multiple angles by studying the role of linguistic proximity, widely spoken languages, linguistic enclaves and language-based immigration policy requirements. To this aim we collect a unique dataset on immigration flows and stocks in 30 OECD destinations from all world countries over the period 1980–2010, and construct a set of linguistic proximity measures. Migration rates increase with linguistic proximity and with English at destination. Softer linguistic requirements for naturalization and larger linguistic communities at destination encourage more migrants to move. Linguistic proximity matters less when local linguistic networks are larger. PMID:27330195
Biological network extraction from scientific literature: state of the art and challenges.
Li, Chen; Liakata, Maria; Rebholz-Schuhmann, Dietrich
2014-09-01
Networks of molecular interactions explain complex biological processes, and all known information on molecular events is contained in a number of public repositories including the scientific literature. Metabolic and signalling pathways are often viewed separately, even though both types are composed of interactions involving proteins and other chemical entities. It is necessary to be able to combine data from all available resources to judge the functionality, complexity and completeness of any given network overall, but especially the full integration of relevant information from the scientific literature is still an ongoing and complex task. Currently, the text-mining research community is steadily moving towards processing the full body of the scientific literature by making use of rich linguistic features such as full text parsing, to extract biological interactions. The next step will be to combine these with information from scientific databases to support hypothesis generation for the discovery of new knowledge and the extension of biological networks. The generation of comprehensive networks requires technologies such as entity grounding, coordination resolution and co-reference resolution, which are not fully solved and are required to further improve the quality of results. Here, we analyse the state of the art for the extraction of network information from the scientific literature and the evaluation of extraction methods against reference corpora, discuss challenges involved and identify directions for future research. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Louwerse, Max M; Benesh, Nick
2012-01-01
Spatial mental representations can be derived from linguistic and non-linguistic sources of information. This study tested whether these representations could be formed from statistical linguistic frequencies of city names, and to what extent participants differed in their performance when they estimated spatial locations from language or maps. In a computational linguistic study, we demonstrated that co-occurrences of cities in Tolkien's Lord of the Rings trilogy and The Hobbit predicted the authentic longitude and latitude of those cities in Middle Earth. In a human study, we showed that human spatial estimates of the location of cities were very similar regardless of whether participants read Tolkien's texts or memorized a map of Middle Earth. However, text-based location estimates obtained from statistical linguistic frequencies better predicted the human text-based estimates than the human map-based estimates. These findings suggest that language encodes spatial structure of cities, and that human cognitive map representations can come from implicit statistical linguistic patterns, from explicit non-linguistic perceptual information, or from both. Copyright © 2012 Cognitive Science Society, Inc.
Evidence-based Neuro Linguistic Psychotherapy: a meta-analysis.
Zaharia, Cătălin; Reiner, Melita; Schütz, Peter
2015-12-01
Neuro Linguistic Programming (NLP) Framework has enjoyed enormous popularity in the field of applied psychology. NLP has been used in business, education, law, medicine and psychotherapy to identify people's patterns and alter their responses to stimuli, so they are better able to regulate their environment and themselves. NLP looks at achieving goals, creating stable relationships, eliminating barriers such as fears and phobias, building self-confidence, and self-esteem, and achieving peak performance. Neuro Linguistic Psychotherapy (NLPt) encompasses NLP as framework and set of interventions in the treatment of individuals with different psychological and/or social problems. We aimed systematically to analyse the available data regarding the effectiveness of Neuro Linguistic Psychotherapy (NLPt). The present work is a meta-analysis of studies, observational or randomized controlled trials, for evaluating the efficacy of Neuro Linguistic Programming in individuals with different psychological and/or social problems. The databases searched to identify studies in English and German language: CENTRAL in the Cochrane Library; PubMed; ISI Web of Knowledge (include results also from Medline and the Web of Science); PsycINFO (including PsycARTICLES); Psyndex; Deutschsprachige Diplomarbeiten der Psychologie (database of theses in Psychology in German language), Social SciSearch; National library of health and two NLP-specific research databases: one from the NLP Community (http://www.nlp.de/cgi-bin/research/nlprdb.cgi?action=res_entries) and one from the NLP Group (http://www.nlpgrup.com/bilimselarastirmalar/bilimsel-arastirmalar-4.html#Zweig154). From a total number of 425 studies, 350 were removed and considered not relevant based on the title and abstract. Included, in the final analysis, are 12 studies with numbers of participants ranging between 12 and 115 subjects. The vast majority of studies were prospective observational. 
The present paper represents the first meta-analysis evaluating the effectiveness of NLP therapy for individuals with social/psychological problems. The overall meta-analysis found that NLP therapy may add an overall standardized mean difference of 0.54 with a confidence interval of CI=[0.20; 0.88]. Neuro-Linguistic Psychotherapy, as a psychotherapeutic modality grounded in scientifically developed theoretical frameworks, methodologies and interventions, including models developed by NLP, shows results that can hold their ground in comparison with other psychotherapeutic methods.
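The reported effect size and interval can be unpacked with a short calculation. This sketch assumes the interval is a 95% confidence interval built with a normal approximation (critical value 1.96); the abstract does not state the confidence level, so the standard error and z statistic below are illustrative only.

```python
def se_from_ci(lower: float, upper: float, z_crit: float = 1.96) -> float:
    """Standard error implied by a symmetric normal-approximation interval (assumed 95%)."""
    return (upper - lower) / (2 * z_crit)


# Values reported in the abstract.
smd, lower, upper = 0.54, 0.20, 0.88

se = se_from_ci(lower, upper)  # implied standard error of the pooled SMD
z = smd / se                   # z statistic under the same assumption
print(round(se, 3), round(z, 2))
```

The interval is symmetric about 0.54 and excludes zero, which is consistent with the abstract's reading of a positive pooled effect.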
Social Cognition in Preschoolers: Effects of Early Experience and Individual Differences
Bulgarelli, Daniela; Molina, Paola
2016-01-01
Social cognition is the way in which people process, remember, and use information in social contexts to explain and predict their own behavior and that of others. Children’s social cognition may be influenced by multiple factors, both external and internal to the child. In the current study, two aspects of social cognition were examined: Theory of Mind and Emotion Understanding. The aim of this study was to analyze the effects of type of early care (0–3 years of age), maternal education, parents’ country of birth, and child’s language on the social cognition of 118 Italian preschoolers. To our knowledge, the joint effect of these variables on social cognition has not previously been investigated in the literature. The measures used to collect social cognition and linguistic data were not parent- or teacher-reports, but based on direct assessment of the children through two standardized tests, the Test of Emotion Comprehension and the ToM Storybooks. Relationships among the variables showed a complex pattern. Overall, maternal education and linguistic competence showed a systematic effect on social cognition; the linguistic competence mediated the effect of maternal education. In children who had experienced centre-based care in the first 3 years of life, the effect of maternal education disappeared, supporting the protective role of centre-based care for children with less educated mothers. The children with native and foreign parents did not significantly differ on the social cognition tasks. Limits of the study, possible educational outcomes and future research lines were discussed. PMID:27895605
Purvis, Caralyn J; McNeill, Brigid C; Everatt, John
2016-04-01
Low metalinguistic knowledge of pre-service and in-service teachers is likely to restrict the provision of evidence-based literacy instruction in the classroom. Despite such concerns, relatively few studies have examined the effects of teacher preparation coursework in building pre-service teachers' language structure knowledge. This study examined the effects of 7 h of language structure coursework, delivered over 7 weeks, on 121 New Zealand pre-service teachers in their initial year of study. Changes in participants' phonological awareness, morphological awareness, and orthographic knowledge were tracked across the teaching period. The impact of the coursework for participants who presented with strong spelling (n = 24) and poor spelling (n = 24) ability was also compared. The cohort demonstrated significant gains across all measures. Strong spellers responded more favourably to the teaching than poor spellers, even when accounting for initial levels of meta-linguistic knowledge. Implications for the development of teacher preparation programmes that enhance the provision of effective literacy instruction are discussed.
Kalindi, Sylvia Chanda; Chung, Kevin Kien Hoa
2018-01-01
This study investigated the role of morphological awareness in understanding Chinese word reading and dictation among Chinese-speaking adolescent readers in Hong Kong as well as the cognitive-linguistic profile of early adolescent readers with dyslexia. Fifty-four readers with dyslexia in Grades 5 and 6 were compared with 54 chronological age-matched (CA) typical readers on the following measures of cognitive-linguistic and literacy skills: morphological awareness, phonological awareness, visual-orthographic knowledge, rapid naming, vocabulary knowledge, verbal short-term memory (STM), Chinese word reading, and dictation (or spelling). The results indicated that early adolescent readers with dyslexia performed less well than the typical readers on all cognitive-linguistic and literacy measures except the phonological measures. Both groups' scores showed substantial correlations between morphological awareness and Chinese word reading and dictation. Visual-orthographic knowledge and rapid naming were also associated with dictation in early adolescent readers with and without dyslexia, respectively. Moderated multiple regression analyses further revealed that morphological awareness and rapid naming explained unique variance in word reading and dictation for the readers with dyslexia and typical readers separately after controlling readers' age and group effect. These results highlight the potential importance of morphological awareness and rapid naming in Chinese word reading and writing in Chinese early adolescents' literacy development and impairment.
Zhao, Chao; Jiang, Jingchi; Guan, Yi; Guo, Xitong; He, Bin
2018-05-01
Electronic medical records (EMRs) contain medical knowledge that can be used for clinical decision support (CDS). Our objective is to develop a general system that can extract and represent knowledge contained in EMRs to support three CDS tasks (test recommendation, initial diagnosis, and treatment plan recommendation) given the condition of a patient. We extracted four kinds of medical entities from records and constructed an EMR-based medical knowledge network (EMKN), in which nodes are entities and edges reflect their co-occurrence in a record. Three bipartite subgraphs (bigraphs) were extracted from the EMKN, one to support each task. One part of the bigraph was the given condition (e.g., symptoms), and the other was the condition to be inferred (e.g., diseases). Each bigraph was regarded as a Markov random field (MRF) to support the inference. We proposed three graph-based energy functions and three likelihood-based energy functions. Two of these functions are based on knowledge representation learning and can provide distributed representations of medical entities. Two EMR datasets and three metrics were utilized to evaluate the performance. As a whole, the evaluation results indicate that the proposed system outperformed the baseline methods. The distributed representation of medical entities does reflect similarity relationships with respect to knowledge level. Combining EMKN and MRF is an effective approach for general medical knowledge representation and inference. Different tasks, however, require individually designed energy functions. Copyright © 2018 Elsevier B.V. All rights reserved.
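The network construction described above (entities as nodes, co-occurrence within a record as edges, then one bipartite subgraph per task) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: entities are modelled as hypothetical `(type, name)` tuples, edge weights are plain co-occurrence counts, and the Markov random field inference step is not shown.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence(records):
    """Count how often each pair of distinct entities appears in the same record."""
    edges = Counter()
    for entities in records:
        # Sorting gives every unordered pair one canonical key.
        for a, b in combinations(sorted(set(entities)), 2):
            edges[(a, b)] += 1
    return edges


def bipartite_edges(edges, left_type, right_type):
    """Keep only edges joining a left_type entity to a right_type entity."""
    sub = {}
    for (a, b), weight in edges.items():
        by_type = {a[0]: a, b[0]: b}
        if a[0] != b[0] and left_type in by_type and right_type in by_type:
            sub[(by_type[left_type], by_type[right_type])] = weight
    return sub


# Toy records; the entity names are invented for illustration.
records = [
    [("symptom", "fever"), ("disease", "flu")],
    [("symptom", "fever"), ("disease", "flu"), ("test", "cbc")],
]
edges = build_cooccurrence(records)
symptom_disease = bipartite_edges(edges, "symptom", "disease")
```

Here `symptom_disease` plays the role of the bigraph for the initial diagnosis task, with symptoms as the given side and diseases as the side to be inferred.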
Nicholas, Marjorie; Sinotte, Michele P.; Helm-Estabrooks, Nancy
2011-01-01
Learning how to use a computer-based communication system can be challenging for people with severe aphasia even if the system is not word-based. This study explored cognitive and linguistic factors relative to how they affected individual patients’ ability to communicate expressively using C-Speak Aphasia (CSA), an alternative communication computer program that is primarily picture-based. Ten individuals with severe non-fluent aphasia received at least six months of training with CSA. To assess carryover of training, untrained functional communication tasks (i.e., answering autobiographical questions, describing pictures, making telephone calls, describing a short video, and two writing tasks) were repeatedly probed in two conditions: 1) using CSA in addition to natural forms of communication, and 2) using only natural forms of communication, e.g., speaking, writing, gesturing, drawing. Four of the ten participants communicated more information on selected probe tasks using CSA than they did without the computer. Response to treatment also was examined in relation to baseline measures of non-linguistic executive function skills, pictorial semantic abilities, and auditory comprehension. Only nonlinguistic executive function skills were significantly correlated with treatment response. PMID:21506045
Transferred L1 Strategies and L2 Syntactic Structure in L2 Sentence Comprehension.
ERIC Educational Resources Information Center
Koda, Keiko
1993-01-01
The application of language processing skills between 2 languages with dissimilar morphosyntactic features was investigated with 72 American university students learning Japanese. Results suggest that learners' first- and second-language knowledge both play a significant role and that the linguistic knowledge and coding capability for text…
ERIC Educational Resources Information Center
Lu, Lin-Miao
2014-01-01
With a specific focus on power relations in creating and distributing knowledge in society, this study examines the government-published children's series "Historical Picture of Taiwan" produced in Taiwan in the Martial Law era (1949-1987) to uncover ideological assumptions and persuasions permeating both linguistic and visual…
Enacting Acts of Authentication in a Robotics Competition: An Interpretivist Study
ERIC Educational Resources Information Center
Verma, Geeta; Puvirajah, Anton; Webb, Horace
2015-01-01
While the science classroom primarily remains a site for knowledge acquisition through teacher directed experiences, other sites exist outside of the classroom that allow for student generation of scientific knowledge. These sites provide opportunities for linguistic and social interactions to play a powerful role in situating students'…
ERIC Educational Resources Information Center
Connor, Carol McDonald; Day, Stephanie L.; Phillips, Beth; Sparapani, Nicole; Ingebrand, Sarah W.; McLean, Leigh; Barrus, Angela; Kaschak, Michael P.
2016-01-01
Many assume that cognitive and linguistic processes, such as semantic knowledge (SK) and self-regulation (SR), subserve learned skills like reading. However, complex models of interacting and bootstrapping effects of SK, SR, instruction, and reading hypothesize reciprocal effects. Testing this "lattice" model with children (n = 852)…
Documenting Indigenous Knowledge and Languages: Research Planning & Protocol.
ERIC Educational Resources Information Center
Leonard, Beth
2001-01-01
The author's experiences of learning her heritage language of Deg Xinag, an Athabascan language spoken in Alaska, serve as a backdrop for discussing issues in learning endangered indigenous languages. When Deg Xinag is taught by linguists, obvious differences between English and Deg Xinag are not articulated, due to the lack of knowledge of…
The Impact of New Technologies on the Literacy Attainment of Deaf Children
ERIC Educational Resources Information Center
Harris, Margaret
2015-01-01
To become successful readers, hearing children require competence in both decoding--the ability to read individual words, underpinned by phonological skills and letter-sound knowledge--and linguistic comprehension--the ability to understand what they read--underpinned by language skills, including vocabulary knowledge. Children who are born with a…
Issues Regarding the Use of Interpreters and Translators in a School Setting.
ERIC Educational Resources Information Center
Medina, Victoria
This paper sets forth guidelines regarding the use of interpreters/translators for use in assessment of students from linguistically and culturally different environments. Training components for such personnel are listed according to general knowledge, cultural knowledge, and specific skills. Limitations of using a third party in the assessment…
Representing sentence information
NASA Astrophysics Data System (ADS)
Perkins, Walton A., III
1991-03-01
This paper describes a computer-oriented representation for sentence information. Whereas many Artificial Intelligence (AI) natural language systems start with a syntactic parse of a sentence into the linguist's components: noun, verb, adjective, preposition, etc., we argue that it is better to parse the input sentence into 'meaning' components: attribute, attribute value, object class, object instance, and relation. AI systems need a representation that will allow rapid storage and retrieval of information and convenient reasoning with that information. The attribute-of-object representation has proven useful for handling information in relational databases (which are well known for their efficiency in storage and retrieval) and for reasoning in knowledge-based systems. On the other hand, the linguist's syntactic representation of the words in sentences has not been shown to be useful for information handling and reasoning. We think it is an unnecessary and misleading intermediate form. Our sentence representation is semantic based in terms of attribute, attribute value, object class, object instance, and relation. Every sentence is segmented into one or more components with the form: 'attribute' of 'object' 'relation' 'attribute value'. Using only one format for all information gives the system simplicity and good performance as a RISC architecture does for hardware. The attribute-of-object representation is not new; it is used extensively in relational databases and knowledge-based systems. However, we will show that it can be used as a meaning representation for natural language sentences with minor extensions. In this paper we describe how a computer system can parse English sentences into this representation and generate English sentences from this representation. Much of this has been tested with computer implementation.
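The single component format named in the abstract ('attribute' of 'object' 'relation' 'attribute value') can be pictured as a small record type. The sketch below is an assumption-laden illustration, not the paper's system: the class name `Fact`, the helper `lookup`, and the example facts are invented, and the distinction between object class and object instance is collapsed into one `obj` field.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """One component: 'attribute' of 'object' 'relation' 'attribute value'."""
    attribute: str
    obj: str
    relation: str
    value: str

    def render(self) -> str:
        # Generate a simple English-like string from the stored component.
        return f"{self.attribute} of {self.obj} {self.relation} {self.value}"


def lookup(facts, obj, attribute):
    """Retrieve attribute values for an object, as a relational query would."""
    return [f.value for f in facts if f.obj == obj and f.attribute == attribute]


# Invented example facts, e.g. from "Mary's car is red."
facts = [
    Fact("color", "car-1", "is", "red"),
    Fact("owner", "car-1", "is", "Mary"),
]
```

Because every fact has the same four fields, storage, retrieval, and generation all operate on one format, which is the uniformity the abstract compares to a RISC design.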
The Actualization of Literary Learning Model Based on Verbal-Linguistic Intelligence
ERIC Educational Resources Information Center
Hali, Nur Ihsan
2017-01-01
This article is inspired by Howard Gardner's concept of linguistic intelligence and also from some authors' previous writings. All of them became the authors' reference in developing ideas on constructing a literary learning model based on linguistic intelligence. The writing of this article is not done by collecting data empirically, but by…
Statistical Measures for Usage-Based Linguistics
ERIC Educational Resources Information Center
Gries, Stefan Th.; Ellis, Nick C.
2015-01-01
The advent of usage-/exemplar-based approaches has resulted in a major change in the theoretical landscape of linguistics, but also in the range of methodologies that are brought to bear on the study of language acquisition/learning, structure, and use. In particular, methods from corpus linguistics are now frequently used to study distributional…
Event-based Plausibility Immediately Influences On-line Language Comprehension
Matsuki, Kazunaga; Chow, Tracy; Hare, Mary; Elman, Jeffrey L.; Scheepers, Christoph; McRae, Ken
2011-01-01
In some theories of sentence comprehension, linguistically-relevant lexical knowledge such as selectional restrictions is privileged in terms of the time-course of its access and influence. We examined whether event knowledge computed by combining multiple concepts can rapidly influence language understanding even in the absence of selectional restriction violations. Specifically, we investigated whether instruments can combine with actions to influence comprehension of ensuing patients. Instrument-verb-patient triplets were created in a norming study designed to tap directly into event knowledge. In self-paced reading (Experiment 1), participants were faster to read patient nouns such as hair when they were typical of the instrument-action pair (Donna used the shampoo to wash vs. the hose to wash). Experiment 2 showed that these results were not due to direct instrument-patient relations. Experiment 3 replicated Experiment 1 using eyetracking, with effects of event typicality observed in first fixation and gaze durations on the patient noun. This research demonstrates that conceptual event-based expectations are computed and used rapidly and dynamically during on-line language comprehension. We discuss relationships among plausibility and predictability, as well as their implications. We conclude that selectional restrictions may be best considered as event-based conceptual knowledge, rather than lexical-grammatical knowledge. PMID:21517222
Songs as an aid for language acquisition.
Schön, Daniele; Boyer, Maud; Moreno, Sylvain; Besson, Mireille; Peretz, Isabelle; Kolinsky, Régine
2008-02-01
In previous research, Saffran and colleagues [Saffran, J. R., Aslin, R. N., & Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274, 1926-1928; Saffran, J. R., Newport, E. L., & Aslin, R. N. (1996). Word segmentation: The role of distributional cues. Journal of Memory and Language, 35, 606-621.] have shown that adults and infants can use the statistical properties of syllable sequences to extract words from continuous speech. They also showed that a similar learning mechanism operates with musical stimuli [Saffran, J. R., Johnson, R. E. K., Aslin, N., & Newport, E. L. (1999). Statistical learning of tone sequences by human infants and adults. Cognition, 70, 27-52.]. In this work we combined linguistic and musical information and we compared language learning based on speech sequences to language learning based on sung sequences. We hypothesized that, compared to speech sequences, a consistent mapping of linguistic and musical information would enhance learning. Results confirmed the hypothesis, showing a strong learning facilitation of song compared to speech. Most importantly, the present results show that learning a new language, especially in the first learning phase wherein one needs to segment new words, may largely benefit from the motivational and structuring properties of music in song.
Newkirk-Turner, Brandi L; Johnson, Valerie E
2018-04-05
The purpose of this tutorial is to discuss the use of curriculum-based language assessment (CBLA) with students who are English language learners and students who speak nonmainstream varieties of English, such as African American English. The article begins with a discussion of the discourse of mathematics and the role of the speech-language pathologist (SLP), followed by a review of studies that includes those that examined the performance of English language learner and nonmainstream dialect-speaking students on word-based math items. The literature review highlights the linguistic and content biases associated with word-based math problems. Useful strategies that SLPs and educators can incorporate in culturally and linguistically appropriate assessments are discussed. The tutorial ends with a discussion of CBLA as a viable assessment approach to use with culturally and linguistically diverse students. Tests used at national, state, and school levels to assess students' math abilities have associated linguistic bias and content bias often leading to an inaccurate depiction of culturally and linguistically diverse students' math skills. CBLA as an assessment method can be used by school-based SLPs to gather valid and useful information about culturally and linguistically diverse students' language for learning math. By using CBLA, SLPs can help modify curricular tasks in broader contexts in an effort to make math, including high-level math, "accessible and achievable for all" students (American Speech-Language-Hearing Association, 2017).
Argumentation Based Joint Learning: A Novel Ensemble Learning Approach
Xu, Junyi; Yao, Li; Li, Le
2015-01-01
Recently, ensemble learning methods have been widely used to improve classification performance in machine learning. In this paper, we present a novel ensemble learning method: argumentation based multi-agent joint learning (AMAJL), which integrates ideas from multi-agent argumentation, ensemble learning, and association rule mining. In AMAJL, argumentation technology is introduced as an ensemble strategy to integrate multiple base classifiers and generate a high-performance ensemble classifier. We design an argumentation framework named Arena as a communication platform for knowledge integration. Through argumentation based joint learning, high quality individual knowledge can be extracted, and thus a refined global knowledge base can be generated and used independently for classification. We perform numerous experiments on multiple public datasets using AMAJL and other benchmark methods. The results demonstrate that our method can effectively extract high quality knowledge for the ensemble classifier and improve classification performance. PMID:25966359
Cross-Cultural Communication in Oncology: Challenges and Training Interests.
Weber, Orest; Sulstarova, Brikela; Singy, Pascal
2016-01-01
To survey oncology nurses and oncologists about difficulties in taking care of culturally and linguistically diverse patients and about interests in cross-cultural training. Descriptive, cross-sectional. Web-based survey. 108 oncology nurses and 44 oncologists. 31-item questionnaire derived from preexisting surveys in the United States and Switzerland. Self-rated difficulties in taking care of culturally and linguistically diverse patients and self-rated interests in cross-cultural training. All respondents reported communication difficulties in encounters with culturally and linguistically diverse patients. Respondents considered the absence of written materials in other languages, absence of a shared common language with patients, and sensitive subjects (e.g., end of life, sexuality) to be particularly problematic. Respondents also expressed a high level of interest in all aspects of cross-cultural training (task-oriented skills, background knowledge, reflexivity, and attitudes). Nurses perceived several difficulties related to care of migrants as more problematic than physicians did and were more interested in all aspects of cross-cultural training. The need for cross-cultural training is high among oncology clinicians, particularly among nurses. The results reported in the current study may help nurses in decision-making positions and educators in introducing elements of cross-cultural education into oncology curricula for nurses. Cross-cultural training should be offered to oncology nurses.
Analysis of a Knowledge-Management-Based Process of Transferring Project Management Skills
ERIC Educational Resources Information Center
Ioi, Toshihiro; Ono, Masakazu; Ishii, Kota; Kato, Kazuhiko
2012-01-01
Purpose: The purpose of this paper is to propose a method for the transfer of knowledge and skills in project management (PM) based on techniques in knowledge management (KM). Design/methodology/approach: The literature contains studies on methods to extract experiential knowledge in PM, but few studies exist that focus on methods to convert…
ERIC Educational Resources Information Center
Montalvo, Ricardo; Combes, Bertina H.; Kea, Cathy D.
2014-01-01
Response to intervention (RtI) originates from national legislation and critical research of evidence-based practices for low performing students and students at-risk of failing or receiving special education services. RtI proactively facilitates culturally and linguistically responsive pedagogy for culturally and linguistically diverse (CLD)…
Professional Training of Computational Linguists at the University of Stuttgart
ERIC Educational Resources Information Center
Darmoroz, Halyna
2017-01-01
The paper deals with the aspects of professional training of specialists in computational linguistics by the example of the University of Stuttgart. First of all, we have attempted to define the essence of the terms "applied linguistics" and "computational linguistics" based on the views of Ukrainian and foreign scholars. We…
Discovering body site and severity modifiers in clinical texts
Dligach, Dmitriy; Bethard, Steven; Becker, Lee; Miller, Timothy; Savova, Guergana K
2014-01-01
Objective To research computational methods for discovering body site and severity modifiers in clinical texts. Methods We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. Results The performance of our method for discovering body site modifiers achieves F1 of 0.740–0.908 and our method for discovering severity modifiers achieves F1 of 0.905–0.929. Discussion Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. Conclusions We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES). PMID:24091648
The effects of linguistic modification on ESL students' comprehension of nursing course test items.
Bosher, Susan; Bowles, Melissa
2008-01-01
Recent research has indicated that language may be a source of construct-irrelevant variance for non-native speakers of English, or English as a second language (ESL) students, when they take exams. As a result, exams may not accurately measure knowledge of nursing content. One accommodation often used to level the playing field for ESL students is linguistic modification, a process by which the reading load of test items is reduced while the content and integrity of the item are maintained. Research on the effects of linguistic modification has been conducted on examinees in the K-12 population, but is just beginning in other areas. This study describes the collaborative process by which items from a pathophysiology exam were linguistically modified and subsequently evaluated for comprehensibility by ESL students. Findings indicate that in a majority of cases, modification improved examinees' comprehension of test items. Implications for test item writing and future research are discussed.
Are computers effective lie detectors? A meta-analysis of linguistic cues to deception.
Hauch, Valerie; Blandón-Gitlin, Iris; Masip, Jaume; Sporer, Siegfried L
2015-11-01
This meta-analysis investigates linguistic cues to deception and whether these cues can be detected with computer programs. We integrated operational definitions for 79 cues from 44 studies where software had been used to identify linguistic deception cues. These cues were allocated to six research questions. As expected, the meta-analyses demonstrated that, relative to truth-tellers, liars experienced greater cognitive load, expressed more negative emotions, distanced themselves more from events, expressed fewer sensory-perceptual words, and referred less often to cognitive processes. However, liars were not more uncertain than truth-tellers. These effects were moderated by event type, involvement, emotional valence, intensity of interaction, motivation, and other moderators. Although the overall effect size was small, theory-driven predictions for certain cues received support. These findings not only further our knowledge about the usefulness of linguistic cues to detect deception with computers in applied settings but also elucidate the relationship between language and deception.
Preserved Statistical Learning of Tonal and Linguistic Material in Congenital Amusia
Omigie, Diana; Stewart, Lauren
2011-01-01
Congenital amusia is a lifelong disorder whereby individuals have pervasive difficulties in perceiving and producing music. In contrast, typical individuals display a sophisticated understanding of musical structure, even in the absence of musical training. Previous research has shown that they acquire this knowledge implicitly, through exposure to music's statistical regularities. The present study tested the hypothesis that congenital amusia may result from a failure to internalize statistical regularities – specifically, lower-order transitional probabilities. To explore the specificity of any potential deficits to the musical domain, learning was examined with both tonal and linguistic material. Participants were exposed to structured tonal and linguistic sequences and, in a subsequent test phase, were required to identify items which had been heard in the exposure phase, as distinct from foils comprising elements that had been present during exposure, but presented in a different temporal order. Amusic and control individuals showed comparable learning, for both tonal and linguistic material, even when the tonal stream included pitch intervals around one semitone. However, analysis of binary confidence ratings revealed that amusic individuals have less confidence in their abilities and that their performance in learning tasks may not be contingent on explicit knowledge formation or level of awareness to the degree shown in typical individuals. The current findings suggest that the difficulties amusic individuals have with real-world music cannot be accounted for by an inability to internalize lower-order statistical regularities but may arise from other factors. PMID:21779263
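To make the notion of lower-order transitional probabilities concrete, here is a minimal Python sketch that estimates first-order transitional probabilities from a symbol stream. The function name and the toy stream built from two three-element "words" are assumptions for illustration, not the stimuli or analysis used in the study.

```python
from collections import Counter
from typing import Dict, Sequence, Tuple


def transitional_probabilities(seq: Sequence[str]) -> Dict[Tuple[str, str], float]:
    """First-order TP(b | a) = count(a followed by b) / count(a as a predecessor)."""
    pair_counts = Counter(zip(seq, seq[1:]))
    # The last element never acts as a predecessor, so it is excluded here.
    first_counts = Counter(seq[:-1])
    return {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}


# Hypothetical stream concatenating two "words", ABC and DEF, in varying order.
stream = list("ABCDEFABCABCDEFDEFABC")
tp = transitional_probabilities(stream)
```

In this toy stream, transitions inside a word (A to B, B to C) have probability 1.0, while transitions across a word boundary (C to D, C to A) are lower, which is the kind of regularity a listener could use to segment the stream.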
Computational Modeling for Language Acquisition: A Tutorial With Syntactic Islands.
Pearl, Lisa S; Sprouse, Jon
2015-06-01
Given the growing prominence of computational modeling in the acquisition research community, we present a tutorial on how to use computational modeling to investigate learning strategies that underlie the acquisition process. This is useful for understanding both typical and atypical linguistic development. We provide a general overview of why modeling can be a particularly informative tool and some general considerations when creating a computational acquisition model. We then review a concrete example of a computational acquisition model for complex structural knowledge referred to as syntactic islands. This includes an overview of syntactic islands knowledge, a precise definition of the acquisition task being modeled, the modeling results, and how to meaningfully interpret those results in a way that is relevant for questions about knowledge representation and the learning process. Computational modeling is a powerful tool that can be used to understand linguistic development. The general approach presented here can be used to investigate any acquisition task and any learning strategy, provided both are precisely defined.
Wu, Jia-ting; Wang, Jian-qiang; Wang, Jing; Zhang, Hong-yu; Chen, Xiao-hong
2014-01-01
Based on linguistic term sets and hesitant fuzzy sets, the concept of hesitant fuzzy linguistic sets was introduced. The focus of this paper is on multicriteria decision-making (MCDM) problems in which the criteria have different priority levels and the criteria values take the form of hesitant fuzzy linguistic numbers (HFLNs). A new approach to solving these problems is proposed, which is based on the generalized prioritized aggregation operator of HFLNs. First, new operations and a comparison method for HFLNs are provided and some linguistic scale functions are applied. Subsequently, two prioritized aggregation operators and a generalized prioritized aggregation operator of HFLNs are developed and applied to MCDM problems. Finally, an illustrative example demonstrates the effectiveness and feasibility of the proposed method, and the results are compared with those of an existing approach.
NASA Astrophysics Data System (ADS)
McIntosh Ciechanowski, Kathryn E.
Driven by questions surrounding the documented "fourth-grade slump" in student test scores and about the content learning of English language learners, this dissertation examines the science and social studies literacy practices of third grade bilingual Latino/as in an urban school. Using qualitative and quantitative methods, I examined three questions: (a) What content area demands are evident in instruction and in the assigned texts that children read? (b) What sociocultural knowledge do students draw on in the reading and writing of content area texts? How does it shape their reading and writing? and (c) What linguistic knowledge do students draw on in the reading and writing of content area texts? How does it shape their reading and writing? These questions are premised on three key tenets from the extant research literature. First, research has documented that middle grade students struggle to make sense of content texts, which could be caused by not only a scarcity of expository texts in early grades but also by discipline-specific demands in the content texts. Second, although all students may struggle to read specialized texts, students from non-mainstream backgrounds may struggle more because they do not possess the social and linguistic capital valued in mainstream schools. Third, sociocultural research has documented the importance of social and cultural funds of knowledge in classroom learning and knowledge construction. Guided by these tenets, I observed for six months in 2 classes and recorded field notes, interviewed participants, collected artifacts, and conducted pre- and post-unit assessments. Analytic methods included quantitative evaluation of assessments and constant comparative and discourse analyses. Findings indicate that the textbooks posed linguistic and conceptual demands and represented multiple discourses including the discourses of the natural and social sciences. 
To make sense of texts, students drew from various sociocultural resources such as popular culture, family, and children's literature. The teacher was more likely to take up these resources (although briefly) when they tightly aligned with instructional goals. Bilingual students faced great complexity as they drew upon linguistic resources to learn technical language and content in two languages and within multiple academic and everyday discourses.
From language identification to language distance
NASA Astrophysics Data System (ADS)
Gamallo, Pablo; Pichel, José Ramom; Alegria, Iñaki
2017-10-01
In this paper, we define two quantitative distances to measure how far apart two languages are. The distance measure that we have identified as more accurate is based on the perplexity of n-gram models extracted from text corpora. An experiment to compare forty-four European languages has been performed. For this purpose, we computed the distances for all the possible language pairs and built a network whose nodes are languages and edges are distances. The network we have built on the basis of linguistic distances represents the current map of similarities and divergences among the main languages of Europe.
ERIC Educational Resources Information Center
Portmess, Lisa
2013-01-01
Media representations of massive open online courses (MOOCs) such as those offered by Coursera, edX and Udacity reflect tension and ambiguity in their bold promise of democratized education and global knowledge sharing. An approach to MOOCs that emphasizes the tacit epistemology of such representations suggests a richer account of the ambiguities…
The Metalinguistic Knowledge of Undergraduate Students of English Language or Linguistics
ERIC Educational Resources Information Center
Alderson, J. Charles; Hudson, Richard
2013-01-01
It is often asserted that UK school-leavers know less grammatical terminology than in earlier years. However, objective data on this supposed phenomenon are somewhat scarce. The study reported in this paper aimed to see whether and to what extent knowledge about language (KaL) has declined over three decades, and how this might relate to…
ERIC Educational Resources Information Center
Buxton, Cory A.; Salinas, Ale; Mahotiere, Margarette; Lee, Okhee; Secada, Walter G.
2015-01-01
Background: In exploring how emergent bilingual learners' prior knowledge from home and play contexts might influence their scientific reasoning, this study drew upon two distinct research traditions: (a) experimental research from the developmental and cognitive psychology tradition, and (b) research on culturally and linguistically diverse…
ERIC Educational Resources Information Center
Dabrowska, Ewa; Tomasello, Michael
2008-01-01
Rapid acquisition of linguistic categories or constructions is sometimes regarded as evidence of innate knowledge. In this paper, we examine Polish children's early understanding of an idiosyncratic, language-specific construction involving the instrumental case--which could not be due to innate knowledge. Thirty Polish-speaking children aged 2; 6…
The Role of E-Vocabularies in the Description and Retrieval of Digital Educational Resources
ERIC Educational Resources Information Center
Fernández-Pampillón, Ana M.
2017-01-01
Vocabularies are linguistic resources that make it possible to access knowledge through words. They can constitute a mechanism to identify, describe, explore, and access all the digital resources with informational content pertaining to a specific knowledge domain. In this regard, they play a key role as systems for the representation and…
20 CFR 404.510 - When an individual is “without fault” in a deduction overpayment.
Code of Federal Regulations, 2010 CFR
2010-04-01
... linguistic limitations (including any lack of facility with the English language) the individual has. Except... good faith that he was entitled to checks subsequently received. (h) Lack of knowledge that bonuses... amount for such year. (k) Lack of knowledge by a wife, husband, or child entitled to wife's, husband's...
Covington, Michael A.; Lunden, S.L. Anya; Cristofaro, Sarah L.; Wan, Claire Ramsay; Bailey, C. Thomas; Broussard, Beth; Fogarty, Robert; Johnson, Stephanie; Zhang, Shayi; Compton, Michael T.
2012-01-01
Background Aprosody, or flattened speech intonation, is a recognized negative symptom of schizophrenia, though it has rarely been studied from a linguistic/phonological perspective. To bring the latest advances in computational linguistics to the phenomenology of schizophrenia and related psychotic disorders, a clinical first-episode psychosis research team joined with a phonetics/computational linguistics team to conduct a preliminary, proof-of-concept study. Methods Video recordings from a semi-structured clinical research interview were available from 47 first-episode psychosis patients. Audio tracks of the video recordings were extracted, and after review of quality, 25 recordings were available for phonetic analysis. These files were de-noised and a trained phonologist extracted a 1-minute sample of each patient’s speech. WaveSurfer 1.8.5 was used to create, from each speech sample, a file of formant values (F0, F1, F2, where F0 is the fundamental frequency and F1 and F2 are resonance bands indicating the moment-by-moment shape of the oral cavity). Variability in these phonetic indices was correlated with severity of Positive and Negative Syndrome Scale negative symptom scores using Pearson correlations. Results A measure of variability of tongue front-to-back position—the standard deviation of F2—was statistically significantly correlated with the severity of negative symptoms (r=−0.446, p=0.03). Conclusion This study demonstrates a statistically significant and meaningful correlation between negative symptom severity and phonetically measured reductions in tongue movements during speech in a sample of first-episode patients just initiating treatment. Further studies of negative symptoms, applying computational linguistics methods, are warranted. PMID:23102940
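The core analysis in the abstract above (per-speaker variability of F2, then a Pearson correlation with symptom severity) can be sketched as follows. The F2 tracks and severity scores below are made-up numbers for illustration only; they are not data from the study, and the helper `pearson_r` is written out by hand rather than taken from the authors' tooling.

```python
import math
from statistics import mean, stdev
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation between two equal-length samples."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Made-up F2 tracks in Hz, one list per speaker (hypothetical values).
f2_tracks = [
    [1200, 1800, 1400, 2000, 1100],
    [1400, 1600, 1500, 1700, 1450],
    [1500, 1550, 1520, 1560, 1510],
]
# Made-up negative symptom severity scores for the same speakers.
negative_scores = [10, 18, 27]

# Variability of tongue front-to-back position: standard deviation of F2.
f2_sd = [stdev(track) for track in f2_tracks]
r = pearson_r(f2_sd, negative_scores)
```

With these invented values, speakers with flatter F2 tracks have higher severity scores, so `r` comes out negative, matching the direction of the reported correlation but saying nothing about its size.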
ERIC Educational Resources Information Center
Gruninger, Yvonne
Extracts from a journal kept by a French-speaking adult student in an intensive German language class are presented here. It is a critique of the audio-visual method, texts, teaching techniques, and students' reactions during the two parts of the course at the University of Berne. The extracts deal particularly with a comparison of the texts used…
Chasin, Rachel; Rumshisky, Anna; Uzuner, Ozlem; Szolovits, Peter
2014-01-01
Objective To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graph-based approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to the topic-modeling techniques that use domain-specific knowledge sources. Materials and methods The graph-based methods use variations of PageRank and distance-based similarity metrics, operating over the Unified Medical Language System (UMLS). Topic-modeling methods use unlabeled data from the Multiparameter Intelligent Monitoring in Intensive Care (MIMIC II) database to derive models for each ambiguous word. We investigate the impact of using different linguistic features for topic models, including UMLS-based and syntactic features. We use a sense-tagged clinical dataset from the Mayo Clinic for evaluation. Results The topic-modeling methods achieve 66.9% accuracy on a subset of the Mayo Clinic's data, while the graph-based methods only reach the 40–50% range, with a most-frequent-sense baseline of 56.5%. Features derived from the UMLS semantic type and concept hierarchies do not produce a gain over bag-of-words features in the topic models, but identifying phrases from UMLS and using syntax does help. Discussion Although topic models outperform graph-based methods, semantic features derived from the UMLS prove too noisy to improve performance beyond bag-of-words. Conclusions Topic modeling for WSD provides superior results in the clinical domain; however, integration of knowledge remains to be effectively exploited. PMID:24441986
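The most-frequent-sense baseline mentioned in the results is simple enough to sketch directly. The Python function below always predicts each ambiguous word's most common gold sense and reports the resulting accuracy; the word/sense pairs are hypothetical examples, not items from the Mayo Clinic dataset, and estimating the most frequent sense from the evaluation data itself is an assumption about how such a baseline is computed.

```python
from collections import Counter, defaultdict
from typing import Iterable, Tuple


def most_frequent_sense_accuracy(instances: Iterable[Tuple[str, str]]) -> float:
    """Accuracy of always predicting each word's most frequent gold sense."""
    by_word = defaultdict(Counter)
    total = 0
    for word, sense in instances:
        by_word[word][sense] += 1
        total += 1
    # For each word, the instances carrying its top sense are the correct ones.
    correct = sum(c.most_common(1)[0][1] for c in by_word.values())
    return correct / total


# Hypothetical sense-tagged instances: (ambiguous word, gold sense).
tagged = [
    ("cold", "temperature"),
    ("cold", "illness"),
    ("cold", "illness"),
    ("ra", "rheumatoid_arthritis"),
    ("ra", "right_atrium"),
    ("ra", "rheumatoid_arthritis"),
    ("ra", "room_air"),
]
baseline = most_frequent_sense_accuracy(tagged)
```

Any disambiguation method is usually judged against this figure, which is why the abstract reports the graph-based results as falling below the baseline and the topic-model results as exceeding it.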
Henseler, Ilona; Regenbrecht, Frank; Obrig, Hellmuth
2014-03-01
One way to investigate the neuronal underpinnings of language competence is to correlate patholinguistic profiles of aphasic patients to corresponding lesion sites. Constituting the beginnings of aphasiology and neurolinguistics over a century ago, this approach has been revived and refined in the past decade by statistical approaches mapping continuous variables (providing metrics that are not simply categorical) on voxel-wise lesion information (voxel-based lesion-symptom mapping, VLSM). Here we investigate whether and how voxel-based lesion-symptom mapping allows us to delineate specific lesion patterns for differentially fine-grained clinical classifications. The latter encompass 'classical' syndrome-based approaches (e.g. Broca's aphasia), more symptom-oriented descriptions (e.g. agrammatism) and further refinement to linguistic sub-functions (e.g. lexico-semantic deficits for inanimate versus animate items). From a large database of patients treated for aphasia of different aetiologies (n = 1167), a carefully selected group of 102 first-ever ischaemic stroke patients with chronic aphasia (∅ 12 months) was included in a VLSM analysis. Specifically, we investigated how performance in the Aachen Aphasia Test, the standard clinical test battery for chronic aphasia in German, relates to distinct brain lesions. The Aachen Aphasia Test evaluates aphasia on different levels: a non-parametric discriminant procedure yields probabilities for the allocation to one of the four 'standard' syndromes (Broca, Wernicke, global and amnestic aphasia), whereas standardized subtests target linguistic modalities (e.g. repetition), or even more specific symptoms (e.g. phoneme repetition). Because some subtests of the Aachen Aphasia Test (e.g. for the linguistic level of lexico-semantics) rely on rather coarse and heterogeneous test items, we complemented the analysis with a number of more detailed clinically used tests in selected, mostly mildly affected subgroups of patients.
Our results indicate that: (i) Aachen Aphasia Test-based syndrome allocation allows for an unexpectedly concise differentiation between 'Broca's' and 'Wernicke's' aphasia corresponding to non-overlapping anterior and posterior lesion sites; whereas (ii) analyses for modalities and specific symptoms yielded more circumscribed but partially overlapping lesion foci, often cutting across the above syndrome territories; and (iii) especially for lexico-semantic capacities more specialized clinical test-batteries are required to delineate precise lesion patterns at this linguistic level. In sum this is the first report on a successful lesion-delineation of syndrome-based aphasia classification highlighting the relevance of vascular distribution for the syndrome level while confirming and extending a number of more linguistically motivated differentiations, based on clinically used tests. We consider such a comprehensive view reaching from the syndrome to a fine-grained symptom-oriented assessment mandatory to converge neurolinguistic, patholinguistic and clinical-therapeutic knowledge on language-competence and impairment.
A Cognition Account of Differences between Children's Comprehension and Production of Language.
ERIC Educational Resources Information Center
Rice, Mabel
1984-01-01
Suggests that there are no sharp distinctions among children's linguistic comprehension, production, and knowledge. Instead, all performance and understanding are embedded in a fluctuating, interrelated thought system. (PD)
Vavatzanidis, Niki Katerina; Mürbe, Dirk; Friederici, Angela; Hahne, Anja
2015-12-01
One main incentive for supplying hearing-impaired children with a cochlear implant is the prospect of oral language acquisition. Only scarce knowledge exists, however, of what congenitally deaf children actually perceive when receiving their first auditory input, and specifically what speech-relevant features they are able to extract from the new modality. We therefore presented congenitally deaf infants and young children implanted before the age of 4 years with an oddball paradigm of long and short vowel variants of the syllable /ba/. We measured the EEG at regular intervals to study their discriminative ability starting with the first activation of the implant up to 8 months later. We were thus able to time-track the emerging ability to differentiate one of the most basic linguistic features that bears semantic differentiation and helps in word segmentation, namely, vowel length. Results show that already 2 months after the first auditory input, but not directly after implant activation, these early implanted children differentiate between long and short syllables. Surprisingly, after only 4 months of hearing experience, the ERPs have reached the same properties as those of the normal-hearing control group, demonstrating the plasticity of the brain with respect to the new modality. We thus show that a simple but linguistically highly relevant feature such as vowel length reaches age-appropriate electrophysiological levels as fast as 4 months after the first acoustic stimulation, providing an important basis for further language acquisition.
Connor, Carol McDonald; Day, Stephanie L.; Phillips, Beth; Sparapani, Nicole; Ingebrand, Sarah W.; McLean, Leigh; Barrus, Angela; Kaschak, Michael P.
2016-01-01
Many assume that cognitive and linguistic processes, such as semantic knowledge (SK) and self-regulation (SR), subserve learned skills like reading. However, complex models of interacting and bootstrapping effects of SK, SR, instruction, and reading hypothesize reciprocal effects. Testing this “lattice” model with children (n = 852) followed from 1st–2nd grade (5.9–10.4 years of age) revealed reciprocal effects for reading and SR, and reading and SK, but not SR and SK. More effective literacy instruction reduced reading stability over time. Findings elucidate the synergistic and reciprocal effects of learning to read on other important linguistic, self-regulatory, and cognitive processes, the value of using complex models of development to inform intervention design, and how learned skills may influence development during middle childhood. PMID:27264645
Web-Based Knowledge Exchange through Social Links in the Workplace
ERIC Educational Resources Information Center
Filipowski, Tomasz; Kazienko, Przemyslaw; Brodka, Piotr; Kajdanowicz, Tomasz
2012-01-01
Knowledge exchange between employees is an essential feature of recent commercial organisations on the competitive market. Based on the data gathered by various information technology (IT) systems, social links can be extracted and exploited in knowledge exchange systems of a new kind. Users of such a system ask their queries and the system…
ERIC Educational Resources Information Center
Wu, Yun-Wu; Weng, Apollo; Weng, Kuo-Hua
2017-01-01
The purpose of this study is to design a knowledge conversion and management digital learning system for architecture design learning, helping students to share, extract, use and create their design knowledge through web-based interactive activities based on socialization, internalization, combination and externalization process in addition to…
New Method for Knowledge Management Focused on Communication Pattern in Product Development
NASA Astrophysics Data System (ADS)
Noguchi, Takashi; Shiba, Hajime
In the field of manufacturing, the importance of utilizing knowledge and know-how has been growing. Against this background, there is a need for new methods to efficiently accumulate and extract effective knowledge and know-how. To facilitate the extraction of knowledge and know-how needed by engineers, we first defined business process information, which includes schedule/progress information, document data, information about communication among the parties concerned, and information which corresponds to these three types of information. Based on our definitions, we proposed an IT system (FlexPIM: Flexible and collaborative Process Information Management) to register and accumulate business process information with the least effort. In order to efficiently extract effective information from huge volumes of accumulated business process information, we propose a new extraction method that focuses on “actions” and communication patterns. The validity of this method has been verified for some communication patterns.
Verdon, Sarah; McLeod, Sharynne; Wong, Sandie
2015-01-01
Speech-language pathologists (SLPs) are working with an increasing number of families from culturally and linguistically diverse backgrounds as the world's population continues to become more internationally mobile. The heterogeneity of these diverse populations makes it impossible to identify and document a one size fits all strategy for working with culturally and linguistically diverse families. This paper explores approaches to practice by SLPs identified as specialising in multilingual and multicultural practice in culturally and linguistically diverse contexts from around the world. Data were obtained from ethnographic observation of 14 sites in 5 countries on 4 continents. The sites included hospital settings, university clinics, school-based settings, private practices and Indigenous community-based services. There were 652 individual artefacts collected from the sites which included interview transcripts, photographs, videos, narrative reflections, informal and formal field notes. The data were analysed using Cultural-Historical Activity Theory (Engeström, 1987). From the analysis six overarching Principles of Culturally Competent Practice (PCCP) were identified. These were: (1) identification of culturally appropriate and mutually motivating therapy goals, (2) knowledge of languages and culture, (3) use of culturally appropriate resources, (4) consideration of the cultural, social and political context, (5) consultation with families and communities, and (6) collaboration between professionals. These overarching principles align with the six position statements developed by the International Expert Panel on Multilingual Children's Speech (2012) which aim to enhance the cultural competence of speech pathologists and their practice. The international examples provided in the current study demonstrate the individualised ways that these overarching principles are enacted in a range of different organisational, social, cultural and political contexts. 
Tensions experienced in enacting the principles are also discussed. This paper emphasises the potential for individual SLPs to enhance their practice by adopting these overarching principles to support the individual children and families in diverse contexts around the world. Copyright © 2015 Elsevier Inc. All rights reserved.
Words and possible words in early language acquisition.
Marchetto, Erika; Bonatti, Luca L
2013-11-01
In order to acquire language, infants must extract its building blocks (words) and master the rules governing their legal combinations from speech. These two problems are not independent, however: words also have internal structure. Thus, infants must extract two kinds of information from the same speech input. They must find the actual words of their language. Furthermore, they must identify its possible words, that is, the sequences of sounds that, being morphologically well formed, could be words. Here, we show that infants' sensitivity to possible words appears to be more primitive and fundamental than their ability to find actual words. We expose 12- and 18-month-old infants to an artificial language containing a conflict between statistically coherent and structurally coherent items. We show that 18-month-olds can extract possible words when the familiarization stream contains marks of segmentation, but cannot do so when the stream is continuous. Yet, they can find actual words from a continuous stream by computing statistical relationships among syllables. By contrast, 12-month-olds can find possible words when familiarized with a segmented stream, but seem unable to extract statistically coherent items from a continuous stream that contains minimal conflicts between statistical and structural information. These results suggest that sensitivity to word structure is in place earlier than the ability to analyze distributional information. The ability to compute nontrivial statistical relationships becomes fully effective relatively late in development, when infants have already acquired a considerable amount of linguistic knowledge. Thus, mechanisms for structure extraction that do not rely on extensive sampling of the input are likely to have a much larger role in language acquisition than general-purpose statistical abilities. Copyright © 2013. Published by Elsevier Inc.
Ravikumar, Ke; Liu, Haibin; Cohn, Judith D; Wall, Michael E; Verspoor, Karin
2012-10-05
We propose a method for automatic extraction of protein-specific residue mentions from the biomedical literature. The method searches text for mentions of amino acids at specific sequence positions and attempts to correctly associate each mention with a protein also named in the text. The methods presented in this work will enable improved protein functional site extraction from articles, ultimately supporting protein function prediction. Our method made use of linguistic patterns for identifying the amino acid residue mentions in text. Further, we applied an automated graph-based method to learn syntactic patterns corresponding to protein-residue pairs mentioned in the text. We finally present an approach to automated construction of relevant training and test data using the distant supervision model. The performance of the method was assessed by extracting protein-residue relations from a new automatically generated test set of sentences containing high-confidence examples found using distant supervision. It achieved an F-measure of 0.84 on an automatically created silver corpus and 0.79 on a manually annotated gold data set for this task, outperforming previous methods. The primary contributions of this work are to (1) demonstrate the effectiveness of distant supervision for automatic creation of training data for protein-residue relation extraction, substantially reducing the effort and time involved in manual annotation of a data set and (2) show that the graph-based relation extraction approach we used generalizes well to the problem of protein-residue association extraction. This work paves the way towards effective extraction of protein functional residues from the literature.
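As a rough illustration of what a surface pattern for amino acid residue mentions might look like, the following minimal Python sketch matches a three-letter amino acid code followed by a sequence position. The pattern, the function name `find_residue_mentions`, and the example sentence are assumptions made for illustration only; the abstract does not specify the linguistic patterns actually used, and this sketch does not attempt the protein association step.

```python
import re

# The 20 standard three-letter amino acid codes.
AA3 = ["Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
       "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val"]

# Hypothetical pattern (an assumption, not the published one):
# a three-letter code, an optional hyphen or space, then a position number.
RESIDUE = re.compile(r"\b(" + "|".join(AA3) + r")[- ]?(\d+)\b")


def find_residue_mentions(sentence):
    """Return (amino_acid, position) pairs found in the sentence."""
    return [(m.group(1), int(m.group(2))) for m in RESIDUE.finditer(sentence)]


print(find_residue_mentions("Mutation of Ser-123 and His57 abolished activity."))
```

A pattern this simple will over-match (for example, "Met 3" in ordinary prose), which is one reason a real system would need further filtering and an association step.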
ERIC Educational Resources Information Center
Mowarin, Macaulay; Tonukari, Emmanuel Ufuoma
2010-01-01
This essay discusses the linguistic and cultural factors that have acted as impediments to Nigeria's breakthrough into the knowledge era. It identifies language deficit in English by most Nigerians, under-developed state of most Nigerian languages, absence of creative education and the presence of certain cultural taboos which stifles the…
ERIC Educational Resources Information Center
Daddow, Angela
2016-01-01
With the massification of higher education in a knowledge-driven economy, Western universities have struggled to keep pace with the cultural, linguistic, educational and economic diversity of university students and the complex realities of their lifeworlds. This has generated systemic inequities for diverse or "non-traditional"…
ERIC Educational Resources Information Center
Graber, Kathryn Elizabeth
2012-01-01
How might institutional projects to improve the status of minority languages and publics have unintended and contradictory consequences? This dissertation examines media and language practices in order to illuminate the everyday sociocultural processes by which the value of knowledge is figured. It focuses on news media institutions in the Buryat…
The Pursuit of Quality over Quantity in TESOL Teacher Education: Coursework versus Test Only
ERIC Educational Resources Information Center
Sehlaoui, Abdelilah Salim; Shinge, Manjula
2013-01-01
The purpose of this study was to examine whether licensed in-service teachers of English for speakers of other languages (ESOL) in K-12 schools are more knowledgeable in the area of applied linguistics than their nonlicensed counterparts, and whether the ESOL-licensed teachers who have taken courses toward their licensure are more knowledgeable in…
ERIC Educational Resources Information Center
Heidrick, Ingrid T.
2017-01-01
This study compares monolinguals and different kinds of bilinguals with respect to their knowledge of the type of lexical phenomenon known as collocation. Collocations are word combinations that speakers use recurrently, forming the basis of conventionalized lexical patterns that are shared by a linguistic community. Examples of collocations…
ERIC Educational Resources Information Center
Dahl, Trine
2009-01-01
This article deals with how economists present their new knowledge claim in the genre of the research article. In the discipline of economics today, the claim is typically included not only in the obvious results/discussion section(s) but also in three other locations of the article: the abstract, the introduction, and the conclusion. The present…
ERIC Educational Resources Information Center
Jacewicz, Ewa; Fox, Robert Allen
2014-01-01
Purpose: The purpose of this study was to investigate how linguistic knowledge interacts with indexical knowledge in older children's perception under demanding listening conditions created by extensive talker variability. Method: Twenty-five 9- to 12-year-old children, 12 from North Carolina (NC) and 13 from Wisconsin (WI), identified 12 vowels…
ERIC Educational Resources Information Center
Silva, Luis Humberto Rodríguez; Roehr-Brackin, Karen
2016-01-01
This article draws on an approach that conceptualizes L2 learning difficulty in terms of implicit and explicit knowledge. In a study with first language Mexican Spanish university-level learners (n = 30), their teachers (n = 11), and applied linguistics experts (n = 3), we investigated the relationship between (a) these groups' difficulty…
20 CFR 416.552 - Waiver of adjustment or recovery-without fault.
Code of Federal Regulations, 2010 CFR
2010-04-01
... physical, mental, educational, or linguistic limitations (including any lack of facility with the English..., knowledge of the occurrence of events that should have been reported, efforts to comply with the reporting...
Extracting semantically enriched events from biomedical literature
2012-01-01
Background: Research into event-based text mining from the biomedical literature has been growing in popularity to facilitate the development of advanced biomedical text mining systems. Such technology permits advanced search, which goes beyond document or sentence-based retrieval. However, existing event-based systems typically ignore additional information within the textual context of events that can determine, amongst other things, whether an event represents a fact, hypothesis, experimental result or analysis of results, whether it describes new or previously reported knowledge, and whether it is speculated or negated. We refer to such contextual information as meta-knowledge. The automatic recognition of such information can permit the training of systems allowing finer-grained searching of events according to the meta-knowledge that is associated with them. Results: Based on a corpus of 1,000 MEDLINE abstracts, fully manually annotated with both events and associated meta-knowledge, we have constructed a machine learning-based system that automatically assigns meta-knowledge information to events. This system has been integrated into EventMine, a state-of-the-art event extraction system, in order to create a more advanced system (EventMine-MK) that not only extracts events from text automatically, but also assigns five different types of meta-knowledge to these events. The meta-knowledge assignment module of EventMine-MK performs with macro-averaged F-scores in the range of 57-87% on the BioNLP’09 Shared Task corpus. EventMine-MK has been evaluated on the BioNLP’09 Shared Task subtask of detecting negated and speculated events. Our results show that EventMine-MK can outperform other state-of-the-art systems that participated in this task. Conclusions: We have constructed the first practical system that extracts both events and associated, detailed meta-knowledge information from biomedical literature. 
The automatically assigned meta-knowledge information can be used to refine search systems, in order to provide an extra search layer beyond entities and assertions, dealing with phenomena such as rhetorical intent, speculations, contradictions and negations. This finer grained search functionality can assist in several important tasks, e.g., database curation (by locating new experimental knowledge) and pathway enrichment (by providing information for inference). To allow easy integration into text mining systems, EventMine-MK is provided as a UIMA component that can be used in the interoperable text mining infrastructure, U-Compare. PMID:22621266
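For readers unfamiliar with the macro-averaged F-score reported above, a minimal Python sketch follows. It computes the F-score per label from true-positive, false-positive and false-negative counts and then takes the unweighted mean across labels. The label names and counts are invented for illustration and are not results from the paper.

```python
def f1(tp, fp, fn):
    """F-score (harmonic mean of precision and recall) for one label."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def macro_f1(counts):
    """Unweighted mean of per-label F-scores.

    counts maps each label to a (tp, fp, fn) tuple.
    """
    scores = [f1(*c) for c in counts.values()]
    return sum(scores) / len(scores)


# Hypothetical counts, for illustration only.
example = {"negated": (8, 2, 2), "speculated": (5, 5, 0)}
print(round(macro_f1(example), 4))
```

Because every label contributes equally to the mean, rare labels weigh as much as frequent ones, which is the usual motivation for reporting a macro average.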
Extracting semantically enriched events from biomedical literature.
Miwa, Makoto; Thompson, Paul; McNaught, John; Kell, Douglas B; Ananiadou, Sophia
2012-05-23
Research into event-based text mining from the biomedical literature has been growing in popularity to facilitate the development of advanced biomedical text mining systems. Such technology permits advanced search, which goes beyond document or sentence-based retrieval. However, existing event-based systems typically ignore additional information within the textual context of events that can determine, amongst other things, whether an event represents a fact, hypothesis, experimental result or analysis of results, whether it describes new or previously reported knowledge, and whether it is speculated or negated. We refer to such contextual information as meta-knowledge. The automatic recognition of such information can permit the training of systems allowing finer-grained searching of events according to the meta-knowledge that is associated with them. Based on a corpus of 1,000 MEDLINE abstracts, fully manually annotated with both events and associated meta-knowledge, we have constructed a machine learning-based system that automatically assigns meta-knowledge information to events. This system has been integrated into EventMine, a state-of-the-art event extraction system, in order to create a more advanced system (EventMine-MK) that not only extracts events from text automatically, but also assigns five different types of meta-knowledge to these events. The meta-knowledge assignment module of EventMine-MK performs with macro-averaged F-scores in the range of 57-87% on the BioNLP'09 Shared Task corpus. EventMine-MK has been evaluated on the BioNLP'09 Shared Task subtask of detecting negated and speculated events. Our results show that EventMine-MK can outperform other state-of-the-art systems that participated in this task. We have constructed the first practical system that extracts both events and associated, detailed meta-knowledge information from biomedical literature. 
The automatically assigned meta-knowledge information can be used to refine search systems, in order to provide an extra search layer beyond entities and assertions, dealing with phenomena such as rhetorical intent, speculations, contradictions and negations. This finer grained search functionality can assist in several important tasks, e.g., database curation (by locating new experimental knowledge) and pathway enrichment (by providing information for inference). To allow easy integration into text mining systems, EventMine-MK is provided as a UIMA component that can be used in the interoperable text mining infrastructure, U-Compare.
Jacobs, Robin J; Caballero, Joshua; Ownby, Raymond L; Kane, Michael N
2014-11-30
Low health literacy is associated with poor medication adherence in persons with human immunodeficiency virus (HIV), which can lead to poor health outcomes. As linguistic minorities, Spanish-dominant Hispanics (SDH) face challenges such as difficulties in obtaining and understanding accurate information about HIV and its treatment. Traditional health educational methods (e.g., pamphlets, talking) may not be as effective as delivering through alternate venues. Technology-based health information interventions have the potential for being readily available on desktop computers or over the Internet. The purpose of this research was to adapt a theoretically-based computer application (initially developed for English-speaking HIV-positive persons) that will provide linguistically and culturally appropriate tailored health education to Spanish-dominant Hispanics with HIV (HIV + SDH). A mixed methods approach using quantitative and qualitative interviews with 25 HIV + SDH and 5 key informants guided by the Information-Motivation-Behavioral (IMB) Skills model was used to investigate cultural factors influencing medication adherence in HIV + SDH. We used a triangulation approach to identify major themes within cultural contexts relevant to understanding factors related to motivation to adhere to treatment. From these data we adapted an automated computer-based health literacy intervention to be delivered in Spanish. Culture-specific motivational factors for treatment adherence in HIV + SDH persons that emerged from the data were stigma, familismo (family), mood, and social support. Using these data, we developed a culturally and linguistically adapted, tailored intervention that provides information about HIV infection, treatment, and medication-related problem-solving skills (proven effective in English-speaking populations) that can be delivered using touch-screen computers, tablets, and smartphones to be tested in a future study. 
Using a theoretically-grounded Internet-based eHealth education intervention that builds on knowledge and also targets core cultural determinants of adherence may prove a highly effective approach to improve health literacy and medication decision-making in this group.
Huysmans, Elke; Bolk, Elske; Zekveld, Adriana A; Festen, Joost M; de Groot, Annette M B; Goverts, S Theo
2016-01-01
The authors first examined the influence of moderate to severe congenital hearing impairment (CHI) on the correctness of samples of elicited spoken language. Then, the authors used this measure as an indicator of linguistic proficiency and examined its effect on performance in language reception, independent of bottom-up auditory processing. In groups of adults with normal hearing (NH, n = 22), acquired hearing impairment (AHI, n = 22), and moderate to severe CHI (n = 21), the authors assessed linguistic proficiency by analyzing the morphosyntactic correctness of their spoken language production. Language reception skills were examined with a task for masked sentence recognition in the visual domain (text), at a readability level of 50%, using grammatically correct sentences and sentences with distorted morphosyntactic cues. The actual performance on the tasks was compared between groups. Adults with CHI made more morphosyntactic errors in spoken language production than adults with NH, while no differences were observed between the AHI and NH groups. This outcome pattern persisted when comparisons were restricted to subgroups of AHI and CHI adults, matched for current auditory speech reception abilities. The data yielded no differences between groups in performance in masked text recognition of grammatically correct sentences in a test condition in which subjects could fully take advantage of their linguistic knowledge. Also, no difference between groups was found in the sensitivity to morphosyntactic distortions when processing short masked sentences, presented visually. These data showed that problems with the correct use of specific morphosyntactic knowledge in spoken language production are a long-term effect of moderate to severe CHI, independent of current auditory processing abilities. 
However, moderate to severe CHI generally does not impede performance in masked language reception in the visual modality, as measured in this study with short, degraded sentences. Aspects of linguistic proficiency that are affected by CHI thus do not seem to play a role in masked sentence recognition in the visual modality.
Subtle linguistic cues influence perceived blame and financial liability.
Fausey, Caitlin M; Boroditsky, Lera
2010-10-01
When bad things happen, how do we decide who is to blame and how much they should be punished? In the present studies, we examined whether subtly different linguistic descriptions of accidents influence how much people blame and punish those involved. In three studies, participants judged how much people involved in particular accidents should be blamed and how much they should have to pay for the resulting damage. The language used to describe the accidents differed subtly across conditions: Either agentive (transitive) or non-agentive (intransitive) verb forms were used. Agentive descriptions led participants to attribute more blame and request higher financial penalties than did nonagentive descriptions. Further, linguistic framing influenced judgments, even when participants reasoned about a well-known event, such as the "wardrobe malfunction" of Super Bowl 2004. Importantly, this effect of language held, even when people were able to see a video of the event. These results demonstrate that even when people have rich established knowledge and visual information about events, linguistic framing can shape event construal, with important real-world consequences. Subtle differences in linguistic descriptions can change how people construe what happened, attribute blame, and dole out punishment. Supplemental results and analyses may be downloaded from http://pbr.psychonomic-journals.org/content/supplemental.
Language experience changes subsequent learning
Onnis, Luca; Thiessen, Erik
2013-01-01
What are the effects of experience on subsequent learning? We explored the effects of language-specific word order knowledge on the acquisition of sequential conditional information. Korean and English adults were engaged in a sequence learning task involving three different sets of stimuli: auditory linguistic (nonsense syllables), visual non-linguistic (nonsense shapes), and auditory non-linguistic (pure tones). The forward and backward probabilities between adjacent elements generated two equally probable and orthogonal perceptual parses of the elements, such that any significant preference at test must be due to either general cognitive biases, or prior language-induced biases. We found that language modulated parsing preferences with the linguistic stimuli only. Intriguingly, these preferences are congruent with the dominant word order patterns of each language, as corroborated by corpus analyses, and are driven by probabilistic preferences. Furthermore, although the Korean individuals had received extensive formal explicit training in English and lived in an English-speaking environment, they exhibited statistical learning biases congruent with their native language. Our findings suggest that mechanisms of statistical sequential learning are implicated in language across the lifespan, and experience with language may affect cognitive processes and later learning. PMID:23200510
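The forward and backward probabilities between adjacent elements mentioned above can be made concrete with a small Python sketch. It estimates, for each adjacent pair (a, b) in a sequence, the forward probability P(b follows | a) and the backward probability P(a precedes | b) from raw counts. The function name and the toy syllable stream are assumptions for illustration; they are not the stimuli used in the study.

```python
from collections import Counter


def transitional_probabilities(sequence):
    """Estimate forward and backward transitional probabilities.

    forward[(a, b)]  = count(a, b) / count(a in first position of a pair)
    backward[(a, b)] = count(a, b) / count(b in second position of a pair)
    """
    pair_counts = Counter(zip(sequence, sequence[1:]))
    first_counts = Counter(sequence[:-1])
    second_counts = Counter(sequence[1:])
    forward = {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}
    backward = {(a, b): n / second_counts[b] for (a, b), n in pair_counts.items()}
    return forward, backward


# Toy stream of nonsense syllables (hypothetical).
stream = ["ba", "di", "ba", "ku", "ba", "di"]
fwd, bwd = transitional_probabilities(stream)
print(fwd[("ba", "di")], bwd[("ba", "di")])
```

In this toy stream the pair ("ba", "di") has a lower forward than backward probability, which shows how the two statistics can favor different groupings of the same input.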
Linguistics from a Conceptual Viewpoint (Aspects of Aspects of a Theory of Syntax).
ERIC Educational Resources Information Center
Schank, Roger C.
Some of the assertions made by Chomsky in "Aspects of the Theory of Syntax" are considered. In particular, the notion of a "competence" model in linguistics is criticized. Formal postulates for a conceptually-based linguistic theory are presented. (Author/JD)
Knowledge and Policy: Research and Knowledge Transfer
ERIC Educational Resources Information Center
Ozga, Jenny
2007-01-01
Knowledge transfer (KT) is the emergent "third sector" of higher education activity--alongside research and teaching. Its commercialization origins are evidenced in its concerns to extract maximum value from research, and in the policy push to make research-based knowledge trapped in disciplinary silos more responsive to the growing…
The Typicality Ranking Task: A New Method to Derive Typicality Judgments from Children.
Djalal, Farah Mutiasari; Ameel, Eef; Storms, Gert
2016-01-01
An alternative method for deriving typicality judgments, applicable in young children that are not familiar with numerical values yet, is introduced, allowing researchers to study gradedness at younger ages in concept development. Contrary to the long tradition of using rating-based procedures to derive typicality judgments, we propose a method that is based on typicality ranking rather than rating, in which items are gradually sorted according to their typicality, and that requires a minimum of linguistic knowledge. The validity of the method is investigated and the method is compared to the traditional typicality rating measurement in a large empirical study with eight different semantic concepts. The results show that the typicality ranking task can be used to assess children's category knowledge and to evaluate how this knowledge evolves over time. Contrary to earlier held assumptions in studies on typicality in young children, our results also show that preference is not so much a confounding variable to be avoided, but that both variables are often significantly correlated in older children and even in adults.
The Typicality Ranking Task: A New Method to Derive Typicality Judgments from Children
Ameel, Eef; Storms, Gert
2016-01-01
An alternative method for deriving typicality judgments, applicable in young children that are not familiar with numerical values yet, is introduced, allowing researchers to study gradedness at younger ages in concept development. Contrary to the long tradition of using rating-based procedures to derive typicality judgments, we propose a method that is based on typicality ranking rather than rating, in which items are gradually sorted according to their typicality, and that requires a minimum of linguistic knowledge. The validity of the method is investigated and the method is compared to the traditional typicality rating measurement in a large empirical study with eight different semantic concepts. The results show that the typicality ranking task can be used to assess children’s category knowledge and to evaluate how this knowledge evolves over time. Contrary to earlier held assumptions in studies on typicality in young children, our results also show that preference is not so much a confounding variable to be avoided, but that both variables are often significantly correlated in older children and even in adults. PMID:27322371
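One simple way to compare a typicality ranking with traditional typicality ratings is to convert the ratings to ranks and compute a rank correlation. The Python sketch below does this with the Spearman formula for untied ranks. The abstract does not state which statistic the authors used, so the choice of Spearman correlation, the function names, and the example values are all assumptions for illustration.

```python
def ranks(values, descending=True):
    """Convert scores to ranks (1 = most typical); assumes no tied scores."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=descending)
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def spearman_no_ties(rank_a, rank_b):
    """Spearman rank correlation for two rankings without ties."""
    n = len(rank_a)
    d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d2 / (n * (n * n - 1))


# Hypothetical data for four items of one category.
adult_ratings = [6.5, 3.0, 5.1, 1.2]   # rating task, higher = more typical
child_ranking = [1, 2, 3, 4]           # ranking task, 1 = most typical
print(spearman_no_ties(ranks(adult_ratings), child_ranking))
```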
A model for indexing medical documents combining statistical and symbolic knowledge.
Avillach, Paul; Joubert, Michel; Fieschi, Marius
2007-10-11
To develop and evaluate an information processing method based on terminologies, in order to index medical documents in any given documentary context. We designed a model using both symbolic general knowledge extracted from the Unified Medical Language System (UMLS) and statistical knowledge extracted from a domain of application. Using statistical knowledge allowed us to contextualize the general knowledge for every particular situation. For each document studied, the extracted terms are ranked to highlight the most significant ones. The model was tested on a set of 17,079 French standardized discharge summaries (SDSs). The most important ICD-10 term of each SDS was ranked 1st or 2nd by the method in nearly 90% of the cases. The use of several terminologies leads to more precise indexing. The improvement achieved in the model's implementation performances as a result of using semantic relationships is encouraging.
A Model for Indexing Medical Documents Combining Statistical and Symbolic Knowledge.
Avillach, Paul; Joubert, Michel; Fieschi, Marius
2007-01-01
OBJECTIVES: To develop and evaluate an information processing method based on terminologies, in order to index medical documents in any given documentary context. METHODS: We designed a model using both symbolic general knowledge extracted from the Unified Medical Language System (UMLS) and statistical knowledge extracted from a domain of application. Using statistical knowledge allowed us to contextualize the general knowledge for every particular situation. For each document studied, the extracted terms are ranked to highlight the most significant ones. The model was tested on a set of 17,079 French standardized discharge summaries (SDSs). RESULTS: The most important ICD-10 term of each SDS was ranked 1st or 2nd by the method in nearly 90% of the cases. CONCLUSIONS: The use of several terminologies leads to more precise indexing. The improvement achieved in the model’s implementation performances as a result of using semantic relationships is encouraging. PMID:18693792
Enhancing international medical graduates' communication: the contribution of applied linguistics.
Dahm, Maria R; Yates, Lynda; Ogden, Kathryn; Rooney, Kim; Sheldon, Brooke
2015-08-01
International medical graduates (IMGs) make up one-third of the Australian medical workforce. Those from non-English-language backgrounds can face cultural and communication barriers, yet linguistic support is variable and medical educators are often required to provide feedback on both medical and communication issues. However, some communication difficulties may be very specific to the experiences of IMGs as second language users. This interdisciplinary study combines perspectives from applied linguistics experts and clinical educators to address IMGs' difficulties from multiple dimensions and to enhance feedback quality. Five video-recorded patient encounters with five IMGs were collected at Launceston General Hospital. Three clinical educators gave quantitative and qualitative feedback using the Rating Instrument for Clinical Consulting Skills, and two applied linguistics experts analysed the data for language, pragmatic and communication difficulties. The comparison of the educators' language-related feedback with linguistic analyses of the same interactions facilitated the exploration of differences in the difficulties identified by the two expert groups. Although the clinical educators were able to use their tacit intuitive understanding of communication issues to identify IMG difficulties, they less frequently addressed the underlying issues or suggested specific remedies in their feedback. This pilot study illustrates the effectiveness of interdisciplinary collaboration in highlighting the specific discourse features contributing to IMG communication difficulties and thus assists educators in deconstructing their intuitive knowledge. The authors suggest that linguistic insights can therefore improve communications training by assisting educators to provide more targeted feedback. © 2015 John Wiley & Sons Ltd.
Evans, Vyvyan
2016-01-01
Recent research in language and cognitive science proposes that the linguistic system evolved to provide an “executive” control system on the evolutionarily more ancient conceptual system (e.g., Barsalou et al., 2008; Evans, 2009, 2015a,b; Bergen, 2012). In short, the claim is that embodied representations in the linguistic system interface with non-linguistic representations in the conceptual system, facilitating rich meanings, or simulations, enabling linguistically mediated communication. In this paper I build on these proposals by examining the nature of what I identify as design features for this control system. In particular, I address how the ideational function of language—our ability to deploy linguistic symbols to convey meanings of great complexity—is facilitated. The central proposal of this paper is as follows. The linguistic system of any given language user, of any given linguistic system—spoken or signed—facilitates access to knowledge representation—concepts—in the conceptual system, which subserves this ideational function. In the most general terms, the human meaning-making capacity is underpinned by two distinct, although tightly coupled representational systems: the conceptual system and the linguistic system. Each system contributes to meaning construction in qualitatively distinct ways. This leads to the first design feature: given that the two systems are representational—they are populated by semantic representations—the nature and function of the representations are qualitatively different. This proposed design feature I term the bifurcation in semantic representation. After all, it stands to reason that if a linguistic system has a different function, vis-à-vis the conceptual system, which is of far greater evolutionary antiquity, then the semantic representations will be complementary, and as such, qualitatively different, reflecting the functional distinctions of the two systems, in collectively giving rise to meaning. 
I consider the nature of these qualitatively distinct representations. And second, language itself is adapted to the conceptual system (the semantic potential) that it marshals in the meaning construction process. Hence, a linguistic system itself exhibits a bifurcation, in terms of the symbolic resources at its disposal. This design feature I dub the bifurcation in linguistic organization. As I shall argue, this relates to two distinct reference strategies available for symbolic encoding in language: what I dub words-to-world reference and words-to-words reference. In slightly different terms, this design feature of language amounts to a distinction between a lexical subsystem, and a grammatical subsystem. PMID:26925000
Evans, Vyvyan
2016-01-01
Recent research in language and cognitive science proposes that the linguistic system evolved to provide an "executive" control system on the evolutionarily more ancient conceptual system (e.g., Barsalou et al., 2008; Evans, 2009, 2015a,b; Bergen, 2012). In short, the claim is that embodied representations in the linguistic system interface with non-linguistic representations in the conceptual system, facilitating rich meanings, or simulations, enabling linguistically mediated communication. In this paper I build on these proposals by examining the nature of what I identify as design features for this control system. In particular, I address how the ideational function of language (our ability to deploy linguistic symbols to convey meanings of great complexity) is facilitated. The central proposal of this paper is as follows. The linguistic system of any given language user, of any given linguistic system (spoken or signed), facilitates access to knowledge representation (concepts) in the conceptual system, which subserves this ideational function. In the most general terms, the human meaning-making capacity is underpinned by two distinct, although tightly coupled representational systems: the conceptual system and the linguistic system. Each system contributes to meaning construction in qualitatively distinct ways. This leads to the first design feature: given that the two systems are representational (they are populated by semantic representations), the nature and function of the representations are qualitatively different. This proposed design feature I term the bifurcation in semantic representation. After all, it stands to reason that if a linguistic system has a different function, vis-à-vis the conceptual system, which is of far greater evolutionary antiquity, then the semantic representations will be complementary, and as such, qualitatively different, reflecting the functional distinctions of the two systems, in collectively giving rise to meaning. 
I consider the nature of these qualitatively distinct representations. And second, language itself is adapted to the conceptual system (the semantic potential) that it marshals in the meaning construction process. Hence, a linguistic system itself exhibits a bifurcation, in terms of the symbolic resources at its disposal. This design feature I dub the bifurcation in linguistic organization. As I shall argue, this relates to two distinct reference strategies available for symbolic encoding in language: what I dub words-to-world reference and words-to-words reference. In slightly different terms, this design feature of language amounts to a distinction between a lexical subsystem, and a grammatical subsystem.
Connor, Carol McDonald; Day, Stephanie L; Phillips, Beth; Sparapani, Nicole; Ingebrand, Sarah W; McLean, Leigh; Barrus, Angela; Kaschak, Michael P
2016-11-01
Many assume that cognitive and linguistic processes, such as semantic knowledge (SK) and self-regulation (SR), subserve learned skills like reading. However, complex models of interacting and bootstrapping effects of SK, SR, instruction, and reading hypothesize reciprocal effects. Testing this "lattice" model with children (n = 852) followed from first to second grade (5.9-10.4 years of age) revealed reciprocal effects for reading and SR, and reading and SK, but not SR and SK. More effective literacy instruction reduced reading stability over time. Findings elucidate the synergistic and reciprocal effects of learning to read on other important linguistic, self-regulatory, and cognitive processes; the value of using complex models of development to inform intervention design; and how learned skills may influence development during middle childhood. © 2016 The Authors. Child Development © 2016 Society for Research in Child Development, Inc.
Long-term associative learning predicts verbal short-term memory performance.
Jones, Gary; Macken, Bill
2018-02-01
Studies using tests such as digit span and nonword repetition have implicated short-term memory across a range of developmental domains. Such tests ostensibly assess specialized processes for the short-term manipulation and maintenance of information that are often argued to enable long-term learning. However, there is considerable evidence for an influence of long-term linguistic learning on performance in short-term memory tasks that brings into question the role of a specialized short-term memory system separate from long-term knowledge. Using natural language corpora, we show experimentally and computationally that performance on three widely used measures of short-term memory (digit span, nonword repetition, and sentence recall) can be predicted from simple associative learning operating on the linguistic environment to which a typical child may have been exposed. The findings support the broad view that short-term verbal memory performance reflects the application of long-term language knowledge to the experimental setting.
Blind Linguistic Steganalysis against Translation Based Steganography
NASA Astrophysics Data System (ADS)
Chen, Zhili; Huang, Liusheng; Meng, Peng; Yang, Wei; Miao, Haibo
Translation based steganography (TBS) is a kind of relatively new and secure linguistic steganography. It takes advantage of the "noise" created by automatic translation of natural language text to encode the secret information. To date, there has been little research on steganalysis against this kind of linguistic steganography. In this paper, a blind steganalytic method, named natural frequency zoned word distribution analysis (NFZ-WDA), is presented. This method improves on a previously proposed linguistic steganalysis method based on word distribution, which targeted the detection of linguistic steganography such as nicetext and texto. The new method aims to detect the application of TBS and uses no information about TBS itself; its only resource is a word frequency dictionary obtained from a large corpus, a so-called natural frequency dictionary, so it is totally blind. To verify the effectiveness of NFZ-WDA, two experiments are carried out, with two-class and multi-class SVM classifiers respectively. The experimental results show that the steganalytic method is promising.
Children and adults integrate talker and verb information in online processing.
Borovsky, Arielle; Creel, Sarah
2015-01-01
Children seem able to efficiently interpret a variety of linguistic cues during speech comprehension, yet have difficulty interpreting sources of non-linguistic and paralinguistic information that accompany speech. The current study asked whether (paralinguistic) voice-activated role knowledge is rapidly interpreted in coordination with a linguistic cue (a sentential action) during speech comprehension in an eye-tracked sentence comprehension task with children (aged 3-10) and college-aged adults. Participants were initially familiarized with two talkers who identified their respective roles (e.g. PRINCESS and PIRATE) before hearing a previously-introduced talker name an action and object (“I want to hold the sword,” in the pirate's voice). As the sentence was spoken, eye-movements were recorded to four objects that varied in relationship to the sentential talker and action (Target: SWORD, Talker-Related: SHIP, Action-Related: WAND, and Unrelated: CARRIAGE). The task was to select the named image. Even young child listeners rapidly combined inferences about talker identity with the action, allowing them to fixate on the Target before it was mentioned, although there were developmental and vocabulary differences on this task. Results suggest that children, like adults, store real-world knowledge of a talker's role and actively use this information to interpret speech. PMID:24611671
Between physics and metaphysics: structure as a boundary concept.
Tau, Ramiro
2015-03-01
The notion of structure is used in a great number of theories, scientific research programs and world views. However, its uses and definitions are as diverse as the objects of the scientific disciplines where it can be found. Without trying to recreate the structuralist aspiration of the mid-twentieth century, which believed it had found in this notion a common transdisciplinary language, I discuss a specific aspect of this concept that could be considered a constant across different perspectives. This aspect refers to the location of the notions of structure as boundaries in the different scientific theories. With this, I argue that the definition or presentation of a structure configures in itself the frontier for scientific knowledge, defining at the same time implied ontological assumptions. In order to discuss this hypothesis, and taking into consideration the double origin of contemporary notions of structure (the mathematical and the linguistic lines), I review several theoretical perspectives which made explicit the relation between structures and knowledge, and their relation with the real: the arguments on physical knowledge by Eddington, structural anthropology, structural linguistics, Lacanian psychoanalysis and Piaget's genetic psychology.
Kwok, Cannas; Lim, Danforn
2016-09-01
This paper aims to evaluate the impact of the culturally sensitive and linguistically appropriate education program on the following: (i) awareness of screening practices (breast awareness, mammogram, and Pap smear test); (ii) screening intention within the next six months; and (iii) knowledge about breast and cervical cancer among Chinese-Australian women. Titled "Happy and Healthy Life in Sydney," this was a quasi-experimental study with both pre- and post-test design. A convenience sample of 288 Chinese women was recruited through Chinese organizations such as churches and community centers. Participants completed the questionnaires before and after the educational program. The results show that the program was effective in promoting awareness of breast and cervical cancer screening and resulted in increased participative intentions in both mammogram and Pap smear testing within the next 6 months. Results also indicate that knowledge and belief scores were significantly increased. Our study supports that educational programs which use culturally sensitive and linguistically appropriate strategies are effective in improving both knowledge of breast and cervical cancer and awareness of their early detection practices among Chinese-Australian women.
Corpus Linguistics and the Design of a Response Message
NASA Astrophysics Data System (ADS)
Atwell, E.
2002-01-01
Most research related to SETI, the Search for Extra-Terrestrial Intelligence, is focussed on techniques for detection of possible incoming signals from extra-terrestrial intelligent sources (e.g. Turnbull et al. 1999), and algorithms for analysis of these signals to identify intelligent language-like characteristics (e.g. Elliott and Atwell 1999, 2000). However, another issue for research and debate is the nature of our response, should a signal arrive and be detected. The design of potentially the most significant communicative act in history should not be decided solely by astrophysicists; the Corpus Linguistics research community has a contribution to make to what is essentially a Corpus design and implementation project. (Vakoch 1998) advocated that the message constructed to transmit to extraterrestrials should include a broad, representative collection of perspectives rather than a single viewpoint or genre; this should strike a chord with Corpus Linguists for whom a central principle is that a corpus must be "balanced" to be representative (Meyer 2001). One idea favoured by SETI researchers is to transmit an encyclopaedia summarising human knowledge, such as the Encyclopaedia Britannica, to give ET communicators an overview and "training set" key to analysis of subsequent messages. Furthermore, this should be sent in several versions in parallel: the text; page-images, to include illustrations left out of the text-file and perhaps some sort of abstract linguistic representation of the text, using a functional or logic language (Ollongren 1999, Freudenthal 1960). The idea of "enriching" the message corpus with annotations at several levels should also strike a chord with Corpus Linguists who have long known that Natural language exhibits highly complex multi-layering sequencing, structural and functional patterns, as difficult to model as sequences and structures found in more traditional physical and biological sciences. 
Some corpora have been annotated with several levels or layers of linguistic knowledge, for example the SEC corpus (Taylor and Knowles 1988) and the ISLE corpus (Menzel et al. 2000). Tagged and parsed corpora can be used by corpus linguists as a testbed to guide their development of grammars (e.g. Souter and Atwell 1994); and they can be used to train Natural Language Learning or data-mining models of complex sequence data (e.g. Brill 1993, Hughes 1993, Atwell 1996). Corpus linguists have a range of standards and tools for design and annotation of representative corpus resources, and experience of which annotation types are more amenable to Natural Language Learning algorithms. An Advisory panel of corpus linguists could help design and implement an extended Multi-annotated Interstellar Corpus of English, incorporating ideas from Corpus Linguistics such as:
- Augment the Encyclopaedia Britannica with a collection of samples representing the diversity of language in real use.
- As an additional "key", transmit a dictionary aimed at language learners which has also been a rich source for NLP
- Supply our ET communicators with several levels of linguistic annotation, to give them a richer training set for their
- Add translations of the English text into other human languages: Humanity should not be represented by English alone,
This calls for a large-scale corpus annotation project, requiring an Interstellar Corpus Advisory Panel, analogous to the BNC or MATE advisory panels, to include experts in English grammar and semantics, English language learning, computational Natural language Learning algorithms, and corpus design, implementation, annotation, standardisation, and analysis.
NASA's online machine aided indexing system
NASA Technical Reports Server (NTRS)
Silvester, June P.; Genuardi, Michael T.; Klingbiel, Paul H.
1993-01-01
This report describes the NASA Lexical Dictionary, a machine aided indexing system used online at the National Aeronautics and Space Administration's Center for Aerospace Information (CASI). This system is comprised of a text processor that is based on the computational, non-syntactic analysis of input text, and an extensive 'knowledge base' that serves to recognize and translate text-extracted concepts. The structure and function of the various NLD system components are described in detail. Methods used for the development of the knowledge base are discussed. Particular attention is given to a statistically-based text analysis program that provides the knowledge base developer with a list of concept-specific phrases extracted from large textual corpora. Production and quality benefits resulting from the integration of machine aided indexing at CASI are discussed along with a number of secondary applications of NLD-derived systems including on-line spell checking and machine aided lexicography.
Ash salts and bodily affects: Witoto environmental knowledge as sexual education
NASA Astrophysics Data System (ADS)
Alvaro Echeverri, Juan; Enokakuiodo Román-Jitdutjaaño, Oscar
2013-03-01
This letter addresses the indigenous discourse on a set of plant species used by the Witoto Indians of Northwest Amazonia to extract ash or vegetable salt, obtained from the combustion of the tissues of vegetable species, filtering of the ashes, and desiccation of the resulting brine. It aims to demonstrate how the study of the human condition is carried out through a reading of natural entities. The method employed is the indexical analysis of a discourse uttered by the elder Enokakuiodo in the Witoto language from 1995 to 1998, in a verbal genre called rafue, one of several genres of the ‘language of the yard of coca’. The species used to extract ash salt are conceived of as coming from the body of the Creator and as an image of the human body. The rafue of salt performs, in words and gestures, a narrative of human affects and capacities by reading ecological, biological, cultural and linguistic indices from a set of plant species. This discourse on plant species is a discourse on the control and management of bodily affects and capacities, represented as ash salts, that are lessons about sexual development which the Creator left for humanity as a guide—a ‘sexual education’.
Si, Guangsen; Xu, Zeshui
2018-01-01
The hesitant fuzzy linguistic term set provides an effective tool to represent uncertain decision information. However, the semantics corresponding to the linguistic terms in it cannot accurately reflect the decision-makers’ subjective cognition. In general, different decision-makers’ sensitivities towards the semantics are different. Such sensitivities can be represented by the cumulative prospect theory value function. Inspired by this, we propose a linguistic scale function to transform the semantics corresponding to linguistic terms into linguistic preference values. Furthermore, we propose the hesitant fuzzy linguistic preference utility set, based on which the decision-makers can flexibly express their distinct semantics and obtain decision results that are consistent with their cognition. For calculations and comparisons over the hesitant fuzzy linguistic preference utility sets, we introduce some distance measures and comparison laws. Afterwards, to apply the hesitant fuzzy linguistic preference utility sets in emergency management, we develop a method to obtain objective weights of attributes and then propose a hesitant fuzzy linguistic preference utility-TOPSIS method to select the best fire rescue plan. Finally, the validity of the proposed method is verified by comparing it with two other representative methods: the hesitant fuzzy linguistic-TOPSIS method and the hesitant fuzzy linguistic-VIKOR method. PMID:29614019
Liao, Huchang; Si, Guangsen; Xu, Zeshui; Fujita, Hamido
2018-04-03
The hesitant fuzzy linguistic term set provides an effective tool to represent uncertain decision information. However, the semantics corresponding to the linguistic terms in it cannot accurately reflect the decision-makers' subjective cognition. In general, different decision-makers' sensitivities towards the semantics are different. Such sensitivities can be represented by the cumulative prospect theory value function. Inspired by this, we propose a linguistic scale function to transform the semantics corresponding to linguistic terms into linguistic preference values. Furthermore, we propose the hesitant fuzzy linguistic preference utility set, based on which the decision-makers can flexibly express their distinct semantics and obtain decision results that are consistent with their cognition. For calculations and comparisons over the hesitant fuzzy linguistic preference utility sets, we introduce some distance measures and comparison laws. Afterwards, to apply the hesitant fuzzy linguistic preference utility sets in emergency management, we develop a method to obtain objective weights of attributes and then propose a hesitant fuzzy linguistic preference utility-TOPSIS method to select the best fire rescue plan. Finally, the validity of the proposed method is verified by comparing it with two other representative methods: the hesitant fuzzy linguistic-TOPSIS method and the hesitant fuzzy linguistic-VIKOR method.
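As a rough illustration of the idea in the abstract above, the sketch below maps linguistic term indices to preference values through the cumulative prospect theory value function of Tversky and Kahneman (1992). The function names, the symmetric index range -tau..tau, the normalisation to [0, 1], and the default parameters (alpha = beta = 0.88, lambda = 2.25, the estimates usually quoted from Tversky and Kahneman) are assumptions made for illustration; the abstract does not give the authors' actual linguistic scale function.

```python
def cpt_value(x, alpha=0.88, beta=0.88, lam=2.25):
    """Cumulative prospect theory value function.

    Gains (x >= 0) are valued as x**alpha; losses (x < 0) are valued as
    -lam * (-x)**beta, so losses weigh more than equal-sized gains when lam > 1.
    """
    if x >= 0:
        return x ** alpha
    return -lam * ((-x) ** beta)


def linguistic_scale(index, tau=3, alpha=0.88, beta=0.88, lam=2.25):
    """Hypothetical linguistic scale function (an assumption, not the paper's).

    Maps a term index in {-tau, ..., tau} (0 is the neutral term) to a
    preference value in [0, 1] by normalising the CPT value of index / tau.
    """
    x = index / tau  # relative position of the term in [-1, 1]
    v = cpt_value(x, alpha, beta, lam)
    lowest = cpt_value(-1.0, alpha, beta, lam)
    highest = cpt_value(1.0, alpha, beta, lam)
    return (v - lowest) / (highest - lowest)


if __name__ == "__main__":
    # Seven-term scale, e.g. "very poor" (-3) up to "very good" (3).
    for i in range(-3, 4):
        print(i, round(linguistic_scale(i), 4))
```

With these defaults the neutral term does not land at 0.5: because losses are weighted by lambda, the negative half of the scale is stretched, which is one way a decision-maker's differing sensitivity to the semantics could be expressed.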
Concept maps: A tool for knowledge management and synthesis in web-based conversational learning.
Joshi, Ankur; Singh, Satendra; Jaswal, Shivani; Badyal, Dinesh Kumar; Singh, Tejinder
2016-01-01
Web-based conversational learning provides an opportunity for shared knowledge base creation through collaboration and collective wisdom extraction. Usually, the amount of information generated in such forums is very large and multidimensional (in alignment with the desirable preconditions for constructivist knowledge creation), and sometimes the nature of expected new information cannot be anticipated in advance. Thus, concept maps (crafted from constructed data) as "process summary" tools may be a solution to improve critical thinking and learning by making connections between the facts or knowledge shared by the participants during online discussion. This exploratory paper begins with the description of this innovation, tried on a web-based interacting platform (email list management software), FAIMER-Listserv, and the qualitative evidence generated through peer feedback. This process description is further supported by a theoretical construct which shows how social constructivism (inclusive of autonomy and complexity) affects conversational learning. The paper rationalizes the use of the concept map as a mid-summary tool for extracting information and further sense making out of this apparent intricacy.
Role of the motor system in language knowledge.
Berent, Iris; Brem, Anna-Katharine; Zhao, Xu; Seligson, Erica; Pan, Hong; Epstein, Jane; Stern, Emily; Galaburda, Albert M; Pascual-Leone, Alvaro
2015-02-17
All spoken languages express words by sound patterns, and certain patterns (e.g., blog) are systematically preferred to others (e.g., lbog). What principles account for such preferences: does the language system encode abstract rules banning syllables like lbog, or does their dislike reflect the increased motor demands associated with speech production? More generally, we ask whether linguistic knowledge is fully embodied or whether some linguistic principles could potentially be abstract. To address this question, here we gauge the sensitivity of English speakers to the putative universal syllable hierarchy (e.g., blif ≻ bnif ≻ bdif ≻ lbif) while undergoing transcranial magnetic stimulation (TMS) over the cortical motor representation of the left orbicularis oris muscle. If syllable preferences reflect motor simulation, then worse-formed syllables (e.g., lbif) should (i) elicit more errors; (ii) engage more strongly motor brain areas; and (iii) elicit stronger effects of TMS on these motor regions. In line with the motor account, we found that repetitive TMS pulses impaired participants' global sensitivity to the number of syllables, and functional MRI confirmed that the cortical stimulation site was sensitive to the syllable hierarchy. Contrary to the motor account, however, ill-formed syllables were least likely to engage the lip sensorimotor area and they were least impaired by TMS. Results suggest that speech perception automatically triggers motor action, but this effect is not causally linked to the computation of linguistic structure. We conclude that the language and motor systems are intimately linked, yet distinct. Language is designed to optimize motor action, but its knowledge includes principles that are disembodied and potentially abstract.
Role of the motor system in language knowledge
Berent, Iris; Brem, Anna-Katharine; Zhao, Xu; Seligson, Erica; Pan, Hong; Epstein, Jane; Stern, Emily; Galaburda, Albert M.; Pascual-Leone, Alvaro
2015-01-01
All spoken languages express words by sound patterns, and certain patterns (e.g., blog) are systematically preferred to others (e.g., lbog). What principles account for such preferences: does the language system encode abstract rules banning syllables like lbog, or does their dislike reflect the increased motor demands associated with speech production? More generally, we ask whether linguistic knowledge is fully embodied or whether some linguistic principles could potentially be abstract. To address this question, here we gauge the sensitivity of English speakers to the putative universal syllable hierarchy (e.g., blif≻bnif≻bdif≻lbif) while undergoing transcranial magnetic stimulation (TMS) over the cortical motor representation of the left orbicularis oris muscle. If syllable preferences reflect motor simulation, then worse-formed syllables (e.g., lbif) should (i) elicit more errors; (ii) engage more strongly motor brain areas; and (iii) elicit stronger effects of TMS on these motor regions. In line with the motor account, we found that repetitive TMS pulses impaired participants’ global sensitivity to the number of syllables, and functional MRI confirmed that the cortical stimulation site was sensitive to the syllable hierarchy. Contrary to the motor account, however, ill-formed syllables were least likely to engage the lip sensorimotor area and they were least impaired by TMS. Results suggest that speech perception automatically triggers motor action, but this effect is not causally linked to the computation of linguistic structure. We conclude that the language and motor systems are intimately linked, yet distinct. Language is designed to optimize motor action, but its knowledge includes principles that are disembodied and potentially abstract. PMID:25646465
PKDE4J: Entity and relation extraction for public knowledge discovery.
Song, Min; Kim, Won Chul; Lee, Dahee; Heo, Go Eun; Kang, Keun Young
2015-10-01
Due to an enormous number of scientific publications that cannot be handled manually, there is a rising interest in text-mining techniques for automated information extraction, especially in the biomedical field. Such techniques provide effective means of information search, knowledge discovery, and hypothesis generation. Most previous studies have primarily focused on the design and performance improvement of either named entity recognition or relation extraction. In this paper, we present PKDE4J, a comprehensive text-mining system that integrates dictionary-based entity extraction and rule-based relation extraction in a highly flexible and extensible framework. Starting with the Stanford CoreNLP, we developed the system to cope with multiple types of entities and relations. The system also has fairly good performance in terms of accuracy as well as the ability to configure text-processing components. We demonstrate its competitive performance by evaluating it on many corpora and find that it surpasses existing systems with average F-measures of 85% for entity extraction and 81% for relation extraction. Copyright © 2015 Elsevier Inc. All rights reserved.
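The two ideas named in this abstract, dictionary-based entity extraction and F-measure evaluation, can be illustrated with a minimal sketch. It does not reproduce PKDE4J or Stanford CoreNLP; the function names, the toy dictionary, and the span format are assumptions made only for illustration.

```python
import re


def extract_entities(text, dictionary):
    """Return a set of (start, end, label) spans for dictionary terms found in text.

    `dictionary` maps a surface term to an entity label (hypothetical format).
    """
    spans = set()
    for term, label in dictionary.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            spans.add((match.start(), match.end(), label))
    return spans


def f_measure(predicted, gold):
    """Balanced F-measure (harmonic mean of precision and recall) over exact span matches."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data, invented for this example only.
toy_dictionary = {"aspirin": "DRUG", "headache": "DISEASE"}
toy_text = "Aspirin relieves headache."
predicted_spans = extract_entities(toy_text, toy_dictionary)
```

Exact span matching is the strictest scoring choice; a real evaluation might also allow partial overlaps, which this sketch does not attempt.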
Some Evidence of Continuing Linguistic Acquisitions in Learning Adolescents.
ERIC Educational Resources Information Center
Thomas, Elizabeth K.; Walmsley, Sean A.
The linguistic development of 42 learning disabled students 10-16 years old was examined. Responses were elicited to five linguistic structures, including the distinction between "ask" and "tell", pronominal restriction, and the minimum distance principle. Data were analyzed in terms of three groups based on Verbal and Performance differentials on…
Mapping Dialectal Variation Using the Algonquian Linguistic Atlas
ERIC Educational Resources Information Center
Cenerini, Chantale; Junker, Marie-Odile; Rosen, Nicole
2017-01-01
The Algonquian Linguistic Atlas (www.atlas-ling.ca) is an online multimedia linguistic atlas of Algonquian languages in Canada, built based on a template of conversational topics. It includes Algonquian languages primarily from the Cree-Innu-Naskapi continuum, but also from Blackfoot, Mi'kmaw, and Ojibwe (including Algonquin), with other languages…
On Research Methodology in Applied Linguistics in 2002-2008
ERIC Educational Resources Information Center
Martynychev, Andrey
2010-01-01
This dissertation examined the status of data-based research in applied linguistics through an analysis of published research studies in nine peer-reviewed applied linguistics journals ("Applied Language Learning, The Canadian Modern Language Review / La Revue canadienne des langues vivantes, Current Issues in Language Planning, Dialog on Language…
Caruso, Francisco; Silveira, Cristina
2009-01-01
A new method for working with scientific, healthcare, historic, sociological, linguistic and other concepts through comic books is presented for youth from public high schools in Rio de Janeiro. The method is based on the pedagogy inspired by Bachelard, according to which scientific knowledge and artistic production are integrated by the stimulus to creativity. It shows how it is capable of contributing to the recuperation of students' self-esteem and increasing motivation to study and how through a creative process and emphasis on a critical spirit, youths construct their citizenship, based on re-readings and translations of a new world built of sciences, dreams and images, which are made concrete in comics, some of which illustrate the text.
[Astrologic and medical manuscript of the 18th Century].
Kugener, Henri
2010-01-01
We present a manuscript from the 18th century, an extract taken from the "Great and the Little Albert" attributed to Albertus Magnus. The linguistic variety in the paper is typical for a text composed in Luxembourg. Added to this text are two incantations and a short cartomancy paper.
Analysing Representations of Otherness Using Different Text-Types.
ERIC Educational Resources Information Center
Murphy-LeJeune, Elizabeth; And Others
1996-01-01
Demonstrates how the teacher can use texts to confront learners with cultural representations. Four texts are used to represent a literary extract, a student essay, an advertising document, and a newspaper article. The article illustrates approaches that borrow from stylistics, linguistics, and discourse analysis. (21 references) (Author/CK)
Semicommunication and Accommodation: Observations from the Linguistic Situation in Scandinavia.
ERIC Educational Resources Information Center
Braunmuller, Kurt
2002-01-01
Focuses on semicommunication and accommodation and discusses two longer extracts from a large corpus of authentic communication from Scandinavia. Various aspects of a comprehensive model of semicommunication are presented and discussed, showing code switching and accommodation are not considered antagonistic but rather as scalar phenomena covering…
A decision method based on uncertainty reasoning of linguistic truth-valued concept lattice
NASA Astrophysics Data System (ADS)
Yang, Li; Xu, Yang
2010-04-01
Decision making with linguistic information is currently an active research topic. This paper begins by establishing the theoretical basis for linguistic information processing, constructs the linguistic truth-valued concept lattice for a decision information system, and then uses uncertainty reasoning to make the decision. That is, we first use the linguistic truth-valued lattice implication algebra to unify the different kinds of linguistic expressions; second, we construct the linguistic truth-valued concept lattice and decision concept lattice according to the concrete decision information system; and third, we establish the internal and external uncertainty reasoning methods and discuss their rationality. We apply these uncertainty reasoning methods to decision making and present some generation methods of decision rules. Finally, we illustrate the application of this decision method with an example.
Quiñones, Karin D; Su, Hua; Marshall, Byron; Eggers, Shauna; Chen, Hsinchun
2007-09-01
Explosive growth in biomedical research has made automated information extraction, knowledge integration, and visualization increasingly important and critically needed. The Arizona BioPathway (ABP) system extracts and displays biological regulatory pathway information from the abstracts of journal articles. This study uses relations extracted from more than 200 PubMed abstracts presented in a tabular and graphical user interface with built-in search and aggregation functionality. This paper presents a task-centered assessment of the usefulness and usability of the ABP system focusing on its relation aggregation and visualization functionalities. Results suggest that our graph-based visualization is more efficient in supporting pathway analysis tasks and is perceived as more useful and easier to use as compared to a text-based literature-viewing method. Relation aggregation significantly contributes to knowledge-acquisition efficiency. Together, the graphic and tabular views in the ABP Visualizer provide a flexible and effective interface for pathway relation browsing and analysis. Our study contributes to pathway-related research and biological information extraction by assessing the value of a multiview, relation-based interface that supports user-controlled exploration of pathway information across multiple granularities.
Hemispheric association and dissociation of voice and speech information processing in stroke.
Jones, Anna B; Farrall, Andrew J; Belin, Pascal; Pernet, Cyril R
2015-10-01
As we listen to someone speaking, we extract both linguistic and non-linguistic information. Knowing how these two sets of information are processed in the brain is fundamental for the general understanding of social communication, speech recognition and therapy of language impairments. We investigated the pattern of performances in phoneme versus gender categorization in left and right hemisphere stroke patients, and found an anatomo-functional dissociation in the right frontal cortex, establishing a new syndrome in voice discrimination abilities. In addition, phoneme and gender performances were more often associated than dissociated in the left hemisphere patients, suggesting common neural underpinnings. Copyright © 2015 Elsevier Ltd. All rights reserved.
Knowledge-Based Image Analysis.
1981-04-01
ETL-0258. Knowledge-based image analysis. George C. Stockman, Barbara A. Lambird, David Lavine, Laveen N. Kanal...extraction, verification, region classification, pattern recognition, image analysis...Table of Contents: Knowledge Based Image Analysis; Preface
Information Extraction Using Controlled English to Support Knowledge-Sharing and Decision-Making
2012-06-01
or language variants. CE-based information extraction will greatly facilitate the processes in the cognitive and social domains that enable forces...processor is run to turn the atomic CE into a more “stylistically felicitous” CE, using techniques such as: aggregating all information about an entity
"Voices of the people": linguistic research among Germany's prisoners of war during World War I.
Kaplan, Judith
2013-01-01
This paper investigates the history of the Royal Prussian Phonographic Commission, a body that collected and archived linguistic, ethnographic, and anthropological data from prisoners-of-war (POWs) in Germany during World War I. Recent literature has analyzed the significance of this research for the rise of conservative physical anthropology. Taking a complementary approach, the essay charts new territory in seeking to understand how the prison-camp studies informed philology and linguistics specifically. I argue that recognizing philological commitments of the Phonographic Commission is essential to comprehending the project contextually. My approach reveals that linguists accommodated material and contemporary evidence to older text-based research models, sustaining dynamic theories of language. Through a case study based on the Iranian philologist F. C. Andreas (1846-1930), the paper ultimately argues that linguistics merits greater recognition in the historiography of the behavioral sciences. © 2013 Wiley Periodicals, Inc.
A flower image retrieval method based on ROI feature.
Hong, An-Xiang; Chen, Gang; Li, Jun-Li; Chi, Zhe-Ru; Zhang, Dan
2004-07-01
Flower image retrieval is a very important step for computer-aided plant species recognition. In this paper, we propose an efficient segmentation method based on color clustering and domain knowledge to extract flower regions from flower images. For flower retrieval, we use the color histogram of a flower region to characterize the color features of flower and two shape-based features sets, Centroid-Contour Distance (CCD) and Angle Code Histogram (ACH), to characterize the shape features of a flower contour. Experimental results showed that our flower region extraction method based on color clustering and domain knowledge can produce accurate flower regions. Flower retrieval results on a database of 885 flower images collected from 14 plant species showed that our Region-of-Interest (ROI) based retrieval approach using both color and shape features can perform better than a method based on the global color histogram proposed by Swain and Ballard (1991) and a method based on domain knowledge-driven segmentation and color names proposed by Das et al.(1999).
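The color and shape features mentioned in this abstract can be sketched roughly as follows. This is not the authors' implementation: the bin count, the use of histogram intersection as the similarity measure, the contour-point centroid, and all function names are assumptions chosen for illustration.

```python
import math


def color_histogram(pixels, bins_per_channel=4):
    """Quantize (r, g, b) pixels with values 0..255 into a normalized joint histogram."""
    step = 256 // bins_per_channel
    hist = [0.0] * (bins_per_channel ** 3)
    for r, g, b in pixels:
        index = ((r // step) * bins_per_channel + (g // step)) * bins_per_channel + (b // step)
        hist[index] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def histogram_intersection(hist_a, hist_b):
    """Similarity of two normalized histograms: 1.0 for identical, 0.0 for disjoint."""
    return sum(min(a, b) for a, b in zip(hist_a, hist_b))


def centroid_contour_distances(contour):
    """Distances from the mean of the contour points to each point, scaled to [0, 1].

    Using the mean of the contour points as the centroid is a simplification.
    """
    cx = sum(x for x, _ in contour) / len(contour)
    cy = sum(y for _, y in contour) / len(contour)
    distances = [math.hypot(x - cx, y - cy) for x, y in contour]
    longest = max(distances)
    return [d / longest for d in distances] if longest else distances
```

In a region-based approach, `pixels` would contain only the segmented flower region rather than the whole image, which is the distinction the abstract draws against a global color histogram.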
Analysis of collaborative communication for linguistic cues of cognitive load.
Khawaja, M Asif; Chen, Fang; Marcus, Nadine
2012-08-01
Analyses of novel linguistic and grammatical features, extracted from transcribed speech of people working in a collaborative environment, were performed for cognitive load measurement. Prior studies have attempted to assess users' cognitive load with several measures, but most of them are intrusive and disrupt normal task flow. An effective measurement of people's cognitive load can help improve their performance by deploying appropriate output and support strategies accordingly. The authors studied 33 members of bushfire management teams working collaboratively in computerized incident control rooms and involved in complex bushfire management tasks. The participants' communication was analyzed for some novel linguistic features as potential indices of cognitive load, which included sentence length, use of agreement and disagreement phrases, and use of personal pronouns, including both singular and plural pronoun types. Results showed users' different linguistic and grammatical patterns with various cognitive load levels. Specifically, with high load, people spoke more and used longer sentences, used more words that indicated disagreement with other team members, and exhibited increased use of plural personal pronouns and decreased use of singular pronouns. The article provides encouraging evidence for the use of linguistic and grammatical analysis for measuring users' cognitive load and proposes some novel features as cognitive load indices. The proposed approach may be applied to many data-intense and safety-critical task scenarios, such as emergency management departments, for example, bushfire or traffic incident management centers; air traffic control rooms; and call centers, where speech is used as part of everyday tasks.
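The kinds of features listed in this abstract (sentence length, disagreement words, singular and plural pronouns) could be computed from a transcript along the following lines. The word lists and the function name are hypothetical and far smaller than anything a real study would use; the sketch only shows the counting step, not the authors' analysis.

```python
import re

# Hypothetical, deliberately tiny word lists for illustration.
SINGULAR_PRONOUNS = {"i", "me", "my", "mine"}
PLURAL_PRONOUNS = {"we", "us", "our", "ours"}
DISAGREEMENT_WORDS = {"no", "disagree", "wrong"}


def linguistic_features(transcript):
    """Count simple surface features of a transcribed utterance."""
    # Split on sentence-final punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", transcript) if s.strip()]
    words = re.findall(r"[a-z']+", transcript.lower())
    return {
        "word_count": len(words),
        "mean_sentence_length": len(words) / len(sentences) if sentences else 0.0,
        "singular_pronouns": sum(w in SINGULAR_PRONOUNS for w in words),
        "plural_pronouns": sum(w in PLURAL_PRONOUNS for w in words),
        "disagreement_words": sum(w in DISAGREEMENT_WORDS for w in words),
    }


features = linguistic_features("We need more crews. No, I disagree with our plan.")
```

Comparing such counts across low-load and high-load task segments is the kind of analysis the abstract describes; the statistical comparison itself is outside this sketch.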
A knowledge-driven approach to cluster validity assessment.
Bolshakova, Nadia; Azuaje, Francisco; Cunningham, Pádraig
2005-05-15
This paper presents an approach to assessing cluster validity based on similarity knowledge extracted from the Gene Ontology. The program is freely available for non-profit use on request from the authors.
ERIC Educational Resources Information Center
Rice Doran, Patricia
2015-01-01
This article provides an overview of the Universal Design for Learning (UDL) framework, which is based on brain-structure research and which incorporates multiple means of instruction, action and expression, and engagement. The article describes the relevance of this framework to linguistically diverse and culturally and linguistically diverse…
Linguistic Variability and Intellectual Development. Miami Linguistics Series No. 9.
ERIC Educational Resources Information Center
von Humboldt, Wilhelm
Although this edition of Wilhelm von Humboldt's "Linguistic Variability and Intellectual Development" is based entirely on the original German edition, the translators (George C. Buck and Frithjof A. Raven) and the publisher have attempted to clarify certain aspects of this work for the modern-day reader. These features include the addition of…
Reflections on Mixing Methods in Applied Linguistics Research
ERIC Educational Resources Information Center
Hashemi, Mohammad R.
2012-01-01
This commentary advocates the use of mixed methods research--that is the integration of qualitative and quantitative methods in a single study--in applied linguistics. Based on preliminary findings from a research project in progress, some reflections on the current practice of mixing methods as a new trend in applied linguistics are put forward.…
ERIC Educational Resources Information Center
Ge, Haoyan; Matthews, Stephen; Cheung, Lawrence Yam-leung; Yip, Virginia
2017-01-01
This corpus-based study demonstrates a case of bidirectional cross-linguistic influence in the acquisition of right-dislocation by Cantonese-English bilingual children and interprets the results in relation to Hulk and Müller's hypothesis for cross-linguistic influence. Longitudinal data reveal qualitative and quantitative differences between…
Pedagogical Implications on Medical Students' Linguistic Needs
ERIC Educational Resources Information Center
Hwang, Yanling
2011-01-01
In this paper, an extended teaching implication is developed based on the study of medical students' linguistic needs in Taiwan (Hwang, Lin, 2010). The aims of the previous study were to provide a description of the linguistic needs and perceptions of medical students and faculty members in Taiwan. However, this paper puts more thought on the…
Linguistic Multi-Competence of Fiji School Students and Their Conversational Partners
ERIC Educational Resources Information Center
Hopf, Suzanne C.; McLeod, Sharynne; McDonagh, Sarah H.
2018-01-01
This study explored linguistic multi-competence in Fiji students and their conversational partners through a description of linguistic diversity in one school community. Students' caregivers (n = 75), teachers (n = 25) and year 4 students (n = 40) in an urban school of Fiji completed paper-based questionnaires regarding: 75 students, 75 mothers,…
English Medium Instruction (EMI) as Linguistic Capital in Nepal: Promises and Realities
ERIC Educational Resources Information Center
Sah, Pramod Kumar; Li, Guofang
2018-01-01
This article reports on a critical qualitative case study of an EMI-based, underresourced public school in Nepal through Bourdieu's lens of linguistic capital. As the data analysis revealed, parents, students, and teachers regarded EMI as a privileged form of linguistic capital for developing advanced English skills, enhancing educational…
Developmental Dyslexia as Developmental and Linguistic Variation: Editor's Commentary.
ERIC Educational Resources Information Center
Leong, Che Kan
2002-01-01
This commentary reviews forthcoming articles on the scientific study of dyslexia, genetic and neurophysiological aspects of dyslexia, cross-linguistic aspects of literacy development and dyslexia, and theory-based practice. It concludes that educators should continue to strive to promote theory-based research and evidence-based practice to achieve…
Zhang, Kejiang; Achari, Gopal; Pei, Yuansheng
2010-10-01
Different types of uncertain information (linguistic, probabilistic, and possibilistic) exist in site characterization. Their representation and propagation significantly influence the management of contaminated sites. In the absence of a framework with which to properly represent and integrate these quantitative and qualitative inputs together, decision makers cannot fully take advantage of the available and necessary information to identify all the plausible alternatives. A systematic methodology was developed in the present work to incorporate linguistic, probabilistic, and possibilistic information into the Preference Ranking Organization METHod for Enrichment Evaluation (PROMETHEE), a subgroup of Multi-Criteria Decision Analysis (MCDA) methods for ranking contaminated sites. The identification of criteria based on the paradigm of comparative risk assessment provides a rationale for risk-based prioritization. Uncertain linguistic, probabilistic, and possibilistic information identified in characterizing contaminated sites can be properly represented as numerical values, intervals, probability distributions, fuzzy sets or possibility distributions, and linguistic variables according to their nature. These different kinds of representation are first transformed into a 2-tuple linguistic representation domain. The propagation of hybrid uncertainties is then carried out in the same domain. This methodology can use the original site information directly as much as possible. The case study shows that this systematic methodology provides more reasonable results. © 2010 SETAC.
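The abstract does not spell out how the 2-tuple linguistic domain is defined. Assuming the commonly used 2-tuple fuzzy linguistic model, in which a value β on a scale of g+1 ordered labels is written as the nearest label plus a symbolic offset α in [-0.5, 0.5), the conversion can be sketched as below. The label set and function names are hypothetical, and the mean aggregation is only one possible operator.

```python
import math

# Hypothetical five-term label scale, indexed 0..4.
LABELS = ["very low", "low", "medium", "high", "very high"]


def to_two_tuple(beta, labels=LABELS):
    """Map beta in [0, g] to (label, alpha), with alpha = beta - index of the nearest label."""
    top = len(labels) - 1
    if not 0 <= beta <= top:
        raise ValueError("beta must lie within the label index range")
    # floor(beta + 0.5) rounds halves upward, keeping alpha in [-0.5, 0.5).
    index = min(int(math.floor(beta + 0.5)), top)
    return labels[index], beta - index


def from_two_tuple(label, alpha, labels=LABELS):
    """Inverse mapping: recover the numeric value beta from (label, alpha)."""
    return labels.index(label) + alpha


def mean_two_tuple(two_tuples, labels=LABELS):
    """Aggregate several 2-tuples by averaging their numeric values (one possible operator)."""
    betas = [from_two_tuple(label, alpha, labels) for label, alpha in two_tuples]
    return to_two_tuple(sum(betas) / len(betas), labels)
```

Because the offset α is retained, converting to labels and back loses no information, which is consistent with the abstract's remark that the original site information can be used as directly as possible.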
Ontology-Based Search of Genomic Metadata.
Fernandez, Javier D; Lenzerini, Maurizio; Masseroli, Marco; Venco, Francesco; Ceri, Stefano
2016-01-01
The Encyclopedia of DNA Elements (ENCODE) is a huge and still expanding public repository of more than 4,000 experiments and 25,000 data files, assembled by a large international consortium since 2007; unknown biological knowledge can be extracted from these huge and largely unexplored data, leading to data-driven genomic, transcriptomic, and epigenomic discoveries. Yet, searching for relevant datasets for knowledge discovery has only limited support: metadata describing ENCODE datasets are quite simple and incomplete, and not described by a coherent underlying ontology. Here, we show how to overcome this limitation, by adopting an ENCODE metadata searching approach which uses high-quality ontological knowledge and state-of-the-art indexing technologies. Specifically, we developed S.O.S. GeM (http://www.bioinformatics.deib.polimi.it/SOSGeM/), a system supporting effective semantic search and retrieval of ENCODE datasets. First, we constructed a Semantic Knowledge Base by starting with concepts extracted from ENCODE metadata, matched to and expanded on biomedical ontologies integrated in the well-established Unified Medical Language System. We prove that this inference method is sound and complete. Then, we leveraged the Semantic Knowledge Base to semantically search ENCODE data from arbitrary biologists' queries. This allows correctly finding more datasets than those extracted by a purely syntactic search, as supported by the other available systems. We empirically show the relevance of found datasets to the biologists' queries.
Chinese children's early knowledge about writing.
Zhang, Lan; Yin, Li; Treiman, Rebecca
2017-09-01
Much research on literacy development has focused on learners of alphabetic writing systems. Researchers have hypothesized that children learn about the formal characteristics of writing before they learn about the relations between units of writing and units of speech. We tested this hypothesis by examining young Chinese children's understanding of writing. Mandarin-speaking 2- to 5-year-olds completed a graphic task, which tapped their knowledge about the formal characteristics of writing, and a phonological task, which tapped their knowledge about the correspondence between Chinese characters and syllables. The 3- to 5-year-olds performed better on the graphic task than the phonological task, indicating that learning how writing appears visually begins earlier than learning that writing corresponds to linguistic units, even in a writing system in which written units correspond to syllables. Statement of contribution What is already known on this subject? Learning about writing's visual form, how it looks, is an important part of emergent literacy. Knowledge of how writing symbolizes linguistic units may emerge later. What does this study add? We test the hypothesis that Chinese children learn about writing's visual form earlier than its symbolic nature. Chinese 3- to 5-year-olds know more about visual features than character-syllable links. Results show learning of the visual appearance of a notation system is developmentally precocious. © 2016 The British Psychological Society.
Cutler, Anne; Broersma, Mirjam
2017-01-01
Children adopted early in life into another linguistic community typically forget their birth language but retain, unaware, relevant linguistic knowledge that may facilitate (re)learning of birth-language patterns. Understanding the nature of this knowledge can shed light on how language is acquired. Here, international adoptees from Korea with Dutch as their current language, and matched Dutch-native controls, provided speech production data on a Korean consonantal distinction unlike any Dutch distinctions, at the outset and end of an intensive perceptual training. The productions, elicited in a repetition task, were identified and rated by Korean listeners. Adoptees' production scores improved significantly more across the training period than control participants' scores, and, for adoptees only, relative production success correlated significantly with the rate of learning in perception (which had, as predicted, also surpassed that of the controls). Of the adoptee group, half had been adopted at 17 months or older (when talking would have begun), while half had been prelinguistic (under six months). The former group, with production experience, showed no advantage over the group without. Thus the adoptees' retained knowledge of Korean transferred from perception to production and appears to be abstract in nature rather than dependent on the amount of experience. PMID:28280567
Fransen, Mirjam P; Dekker, Evelien; Timmermans, Daniëlle R M; Uiters, Ellen; Essink-Bot, Marie-Louise
2017-02-01
To explore the accessibility of standardized printed information materials of the national Dutch colorectal cancer screening program among low health literate screening invitees and to assess the effect of the information on their knowledge about colorectal cancer and the screening program. Linguistic tools were used to analyze the text and design characteristics. The accessibility, comprehensibility and relevance of the information materials were explored in interviews and in observations (n=25). The effect of the information on knowledge was assessed in an online survey (n=127). The materials employed a simple text and design. However, respondents expressed problems with the amount of information, and the difference between screening and diagnostic follow-up. Knowledge significantly increased in 10 out of 16 items after reading the information but remained low for colorectal cancer risk, sensitivity of testing, and the voluntariness of colorectal cancer screening. Despite intelligible linguistic and design characteristics, screening invitees with low health literacy had problems in accessing, comprehending and applying standard information materials on colorectal cancer screening, and lacked essential knowledge for informed decision-making about participation. To enable equal access to informed decision-making, information strategies need to be adjusted to the skills of low health literate screening invitees. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Language use and stereotyping: the role of approach and avoidance motivation goals.
Gil de Montes, Lorena; Ortiz, Garbiñe; Valencia, José F; Larrañaga, Maider; Agirrezabal, Arrate
2012-11-01
The use of more abstract language to describe expected behaviors as opposed to unexpected behaviors has traditionally been considered a way of stereotype maintenance. This tendency is known as linguistic expectancy bias. Two experiments examined the influence of approach and avoidance motivational orientations on the production of this linguistic expectancy bias. It was predicted that approach strategic orientation is likely to describe expectancy consistent behaviors at a higher level of linguistic abstraction than expectancy inconsistent behaviors. In contrast, avoidance strategic orientation is likely to describe both expectancy consistent behaviors and expectancy inconsistent behaviors at a lower level of linguistic abstraction, thus facilitating the disappearance of linguistic expectancy bias. Two experiments confirmed these expectations, using strategic orientation manipulations based either on communication goals or on motor action, and measuring linguistic abstraction either on forced-choice answer format or on free descriptions. Implications for the generalisation of linguistic expectancy bias are discussed.
It ain't what you say, it's how you say it: linguistic and cultural diversity in the classroom.
Robinson, Cynthia Cole; Clardy, Pauline
2011-01-01
The disparity between the cultural and linguistic diversity of the teaching population and the student population continues to grow as teacher education programs enroll and graduate primarily white teacher candidates (83.7%). At the same time, the diversity of the K-12 student body has increased with 65% of public school students being from culturally and linguistically diverse backgrounds (National Center for Education Statistics, 2007). This chasm between the diversity of the teaching force and student population is of concern as many teachers report that they do not have the cultural knowledge and experience of working or living in diverse environments, yet will be faced with teaching a very diverse student population. Hence, the need for teacher candidates and current teachers to be explicitly taught the skills needed to successfully teach diverse student populations is urgent. In this article, we explore the following phenomena: how linguistic and cultural diversity is regarded in teacher education programs, as well as teacher candidates' and current K-12 teachers' dispositions towards students who do not share their cultural backgrounds or language (including those who vary in their dialects). Finally, we will present strategies that teacher educators can use to embrace and empower culturally and linguistically diverse (CLD) teacher candidates, as well as prepare teacher candidates to teach diverse student populations.
Broz, Frank; Nehaniv, Chrystopher L; Belpaeme, Tony; Bisio, Ambra; Dautenhahn, Kerstin; Fadiga, Luciano; Ferrauto, Tomassino; Fischer, Kerstin; Förster, Frank; Gigliotta, Onofrio; Griffiths, Sascha; Lehmann, Hagen; Lohan, Katrin S; Lyon, Caroline; Marocco, Davide; Massera, Gianluca; Metta, Giorgio; Mohan, Vishwanathan; Morse, Anthony; Nolfi, Stefano; Nori, Francesco; Peniak, Martin; Pitsch, Karola; Rohlfing, Katharina J; Sagerer, Gerhard; Sato, Yo; Saunders, Joe; Schillingmann, Lars; Sciutti, Alessandra; Tikhanoff, Vadim; Wrede, Britta; Zeschel, Arne; Cangelosi, Angelo
2014-07-01
This article presents results from a multidisciplinary research project on the integration and transfer of language knowledge into robots as an empirical paradigm for the study of language development in both humans and humanoid robots. Within the framework of human linguistic and cognitive development, we focus on how three central types of learning interact and co-develop: individual learning about one's own embodiment and the environment, social learning (learning from others), and learning of linguistic capability. Our primary concern is how these capabilities can scaffold each other's development in a continuous feedback cycle as their interactions yield increasingly sophisticated competencies in the agent's capacity to interact with others and manipulate its world. Experimental results are summarized in relation to milestones in human linguistic and cognitive development and show that the mutual scaffolding of social learning, individual learning, and linguistic capabilities creates the context, conditions, and requisites for learning in each domain. Challenges and insights identified as a result of this research program are discussed with regard to possible and actual contributions to cognitive science and language ontogeny. In conclusion, directions for future work are suggested that continue to develop this approach toward an integrated framework for understanding these mutually scaffolding processes as a basis for language development in humans and robots. Copyright © 2014 Cognitive Science Society, Inc.
Conceptual Knowledge Acquisition in Biomedicine: A Methodological Review
Payne, Philip R.O.; Mendonça, Eneida A.; Johnson, Stephen B.; Starren, Justin B.
2007-01-01
The use of conceptual knowledge collections or structures within the biomedical domain is pervasive, spanning a variety of applications including controlled terminologies, semantic networks, ontologies, and database schemas. A number of theoretical constructs and practical methods or techniques support the development and evaluation of conceptual knowledge collections. This review will provide an overview of the current state of knowledge concerning conceptual knowledge acquisition, drawing from multiple contributing academic disciplines such as biomedicine, computer science, cognitive science, education, linguistics, semiotics, and psychology. In addition, multiple taxonomic approaches to the description and selection of conceptual knowledge acquisition and evaluation techniques will be proposed in order to partially address the apparent fragmentation of the current literature concerning this domain. PMID:17482521
ECO: A Framework for Entity Co-Occurrence Exploration with Faceted Navigation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Halliday, K. D.
2010-08-20
Even as highly structured databases and semantic knowledge bases become more prevalent, a substantial amount of human knowledge is reported as written prose. Typical textual reports, such as news articles, contain information about entities (people, organizations, and locations) and their relationships. Automatically extracting such relationships from large text corpora is a key component of corporate and government knowledge bases. The primary goal of the ECO project is to develop a scalable framework for extracting and presenting these relationships for exploration using an easily navigable faceted user interface. ECO uses entity co-occurrence relationships to identify related entities. The system aggregates and indexes information on each entity pair, allowing the user to rapidly discover and mine relational information.
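The co-occurrence idea described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the ECO implementation: the function name `cooccurrence_counts` and the sample documents are hypothetical, and entity extraction is assumed to have already happened upstream.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents):
    """Count how many documents mention each unordered entity pair.

    `documents` is a list of entity lists, one per document
    (hypothetical input; real entity extraction is out of scope here).
    """
    counts = Counter()
    for entities in documents:
        # Deduplicate and sort so each pair has one canonical ordering.
        unique = sorted(set(entities))
        counts.update(combinations(unique, 2))
    return counts


# Hand-made example data, for illustration only.
docs = [
    ["Alice", "Acme Corp", "Paris"],
    ["Alice", "Acme Corp"],
    ["Bob", "Paris"],
]
pair_counts = cooccurrence_counts(docs)
```

Pairs with higher counts would be candidates for "related entities"; an index keyed by pair could then back a faceted interface of the kind the abstract mentions.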
Sensory Intelligence for Extraction of an Abstract Auditory Rule: A Cross-Linguistic Study.
Guo, Xiao-Tao; Wang, Xiao-Dong; Liang, Xiu-Yuan; Wang, Ming; Chen, Lin
2018-02-21
In a complex linguistic environment, while speech sounds can greatly vary, some shared features are often invariant. These invariant features constitute so-called abstract auditory rules. Our previous study has shown that with auditory sensory intelligence, the human brain can automatically extract the abstract auditory rules in the speech sound stream, presumably serving as the neural basis for speech comprehension. However, whether the sensory intelligence for extraction of abstract auditory rules in speech is inherent or experience-dependent remains unclear. To address this issue, we constructed a complex speech sound stream using auditory materials in Mandarin Chinese, in which syllables had a flat lexical tone but differed in other acoustic features to form an abstract auditory rule. This rule was occasionally and randomly violated by the syllables with the rising, dipping or falling tone. We found that both Chinese and foreign speakers detected the violations of the abstract auditory rule in the speech sound stream at a pre-attentive stage, as revealed by the whole-head recordings of mismatch negativity (MMN) in a passive paradigm. However, MMNs peaked earlier in Chinese speakers than in foreign speakers. Furthermore, Chinese speakers showed different MMN peak latencies for the three deviant types, which paralleled recognition points. These findings indicate that the sensory intelligence for extraction of abstract auditory rules in speech sounds is innate but shaped by language experience. Copyright © 2018 IBRO. Published by Elsevier Ltd. All rights reserved.
Rodrigues, J M; Trombert-Paviot, B; Baud, R; Wagner, J; Meusnier-Carriot, F
1998-01-01
GALEN has developed a language independent common reference model based on a medically oriented ontology and practical tools and techniques for managing healthcare terminology including natural language processing. GALEN-IN-USE is the current phase which applied the modelling and the tools to the development or the updating of coding systems for surgical procedures in different national coding centers co-operating within the European Federation of Coding Centre (EFCC) to create a language independent knowledge repository for multicultural Europe. We used an integrated set of artificial intelligence terminology tools named CLAssification Manager workbench to process French professional medical language rubrics into intermediate dissections and to the Grail reference ontology model representation. From this language independent concept model representation we generate controlled French natural language. The French national coding centre is then able to retrieve the initial professional rubrics with different categories of concepts, to compare the professional language proposed by expert clinicians to the French generated controlled vocabulary and to finalize the linguistic labels of the coding system in relation with the meanings of the conceptual system structure.
DBpedia and the Live Extraction of Structured Data from Wikipedia
ERIC Educational Resources Information Center
Morsey, Mohamed; Lehmann, Jens; Auer, Soren; Stadler, Claus; Hellmann, Sebastian
2012-01-01
Purpose: DBpedia extracts structured information from Wikipedia, interlinks it with other knowledge bases and freely publishes the results on the web using Linked Data and SPARQL. However, the DBpedia release process is heavyweight and releases are sometimes based on several months old data. DBpedia-Live solves this problem by providing a live…
Data interoperability software solution for emergency reaction in the Europe Union
NASA Astrophysics Data System (ADS)
Casado, R.; Rubiera, E.; Sacristan, M.; Schütte, F.; Peters, R.
2014-09-01
Emergency management becomes more challenging in international crisis episodes because of cultural, semantic and linguistic differences between all stakeholders, especially first responders. Misunderstandings between first responders make decision-making slower and more difficult. However, the spread and development of networks and IT-based Emergency Management Systems (EMS) have improved emergency responses, which have become more coordinated. Despite improvements made in recent years, EMS have still not solved problems related to cultural, semantic and linguistic differences, which are the real cause of slower decision-making. In addition, from a technical perspective, the consolidation of current EMS and the different formats used to exchange information pose another problem to be solved in any solution proposed for information interoperability between heterogeneous EMS in different contexts. To overcome these problems we present a software solution based on semantic and mediation technologies. EMERGency ELements (EMERGEL) (Fundacion CTIC and AntwortING Ingenieurbüro PartG 2013), a common and modular ontology shared by all the stakeholders, has been defined. It offers the best solution to gather all stakeholders' knowledge in a unique and flexible data model, taking into account different countries' cultural and linguistic issues. To deal with the diversity of data protocols and formats, we have designed a Service Oriented Architecture for Data Interoperability (named DISASTER) providing a flexible extensible solution to solve the mediation issues. Web Services have been adopted as the specific technology to implement this paradigm, as they have the most significant academic and industrial visibility and attraction. Contributions of this work have been validated through the design and development of a cross-border realistic prototype scenario, actively involving both emergency managers and emergency first responders: the Netherlands-Germany border fire.
Data interoperability software solution for emergency reaction in the Europe Union
NASA Astrophysics Data System (ADS)
Casado, R.; Rubiera, E.; Sacristan, M.; Schütte, F.; Peters, R.
2015-07-01
Emergency management becomes more challenging in international crisis episodes because of cultural, semantic and linguistic differences between all stakeholders, especially first responders. Misunderstandings between first responders make decision making slower and more difficult. However, the spread and development of networks and IT-based emergency management systems (EMSs) have improved emergency responses, which have become more coordinated. Despite improvements made in recent years, EMSs have still not solved problems related to cultural, semantic and linguistic differences, which are the real cause of slower decision making. In addition, from a technical perspective, the consolidation of current EMSs and the different formats used to exchange information pose another problem to be solved in any solution proposed for information interoperability between heterogeneous EMSs in different contexts. To overcome these problems, we present a software solution based on semantic and mediation technologies. EMERGency ELements (EMERGEL) (Fundacion CTIC and AntwortING Ingenieurbüro PartG, 2013), a common and modular ontology shared by all the stakeholders, has been defined. It offers the best solution to gather all stakeholders' knowledge in a unique and flexible data model, taking into account different countries' cultural and linguistic issues. To deal with the diversity of data protocols and formats, we have designed a service-oriented architecture for data interoperability (named DISASTER: Data Interoperability Solution At STakeholders Emergency Reaction) providing a flexible extensible solution to solve the mediation issues. Web services have been adopted as the specific technology to implement this paradigm, as they have the most significant academic and industrial visibility and attraction.
Contributions of this work have been validated through the design and development of a cross-border realistic prototype scenario, actively involving both emergency managers and emergency-first responders: the Netherlands-Germany border fire.
Managing search complexity in linguistic geometry.
Stilman, B
1997-01-01
This paper is a new step in the development of linguistic geometry. This formal theory is intended to discover and generalize the inner properties of human expert heuristics, which have been successful in a certain class of complex control systems, and apply them to different systems. In this paper, we investigate heuristics extracted in the form of hierarchical networks of planning paths of autonomous agents. Employing linguistic geometry tools, the dynamic hierarchy of networks is represented as a hierarchy of formal attribute languages. The main ideas of this methodology are shown in the paper on two pilot examples of the solution of complex optimization problems. The first example is a problem of strategic planning for air combat, in which concurrent actions of four vehicles are simulated as serial interleaving moves. The second example is a problem of strategic planning for the space combat of eight autonomous vehicles (with interleaving moves) that requires generation of a search tree of depth 25 with branching factor 30. This is beyond the capabilities of modern and conceivable future computers (employing conventional approaches). In both examples the linguistic geometry tools showed deep and highly selective searches in comparison with conventional search algorithms. For the first example a sketch of the proof of optimality of the solution is considered.
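To put the quoted search-tree figures in perspective, the size of a full tree of depth 25 with branching factor 30 can be computed directly. The sketch below assumes a uniform tree searched exhaustively, which is a simplification made here and not a claim from the abstract; the function names are hypothetical.

```python
def full_tree_leaves(branching_factor, depth):
    """Number of leaf positions in a uniform tree: b ** d."""
    return branching_factor ** depth


def full_tree_nodes(branching_factor, depth):
    """Total nodes, root included: the geometric series 1 + b + ... + b ** d."""
    return (branching_factor ** (depth + 1) - 1) // (branching_factor - 1)


leaves = full_tree_leaves(30, 25)
nodes = full_tree_nodes(30, 25)
```

Here `leaves` is 30 to the power 25, on the order of 10^36 positions, which is why exhaustive enumeration is impractical and a highly selective search is needed.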
Demjén, Zsófia
2014-01-01
This paper demonstrates how a range of linguistic methods can be harnessed in pursuit of a deeper understanding of the 'lived experience' of psychological disorders. It argues that such methods should be applied more in medical contexts, especially in medical humanities. Key extracts from The Unabridged Journals of Sylvia Plath are examined, as a case study of the experience of depression. Combinations of qualitative and quantitative linguistic methods, and inter- and intra-textual comparisons are used to consider distinctive patterns in the use of metaphor, personal pronouns and (the semantics of) verbs, as well as other relevant aspects of language. Qualitative techniques provide in-depth insights, while quantitative corpus methods make the analyses more robust and ensure the breadth necessary to gain insights into the individual experience. Depression emerges as a highly complex and sometimes potentially contradictory experience for Plath, involving both a sense of apathy and inner turmoil. It involves a sense of a split self, trapped in a state that one cannot overcome, and intense self-focus, a turning in on oneself and a view of the world that is both more negative and more polarized than the norm. It is argued that a linguistic approach is useful beyond this specific case.
First Extraction of Transversity from a Global Analysis of Electron-Proton and Proton-Proton Data
NASA Astrophysics Data System (ADS)
Radici, Marco; Bacchetta, Alessandro
2018-05-01
We present the first extraction of the transversity distribution in the framework of collinear factorization based on the global analysis of pion-pair production in deep-inelastic scattering and in proton-proton collisions with a transversely polarized proton. The extraction relies on the knowledge of dihadron fragmentation functions, which are taken from the analysis of electron-positron annihilation data. For the first time, the transversity is extracted from a global analysis similar to what is usually done for the spin-averaged and helicity distributions. The knowledge of transversity is important for, among other things, detecting possible signals of new physics in high-precision low-energy experiments.
Linguistic methodology for the analysis of aviation accidents
NASA Technical Reports Server (NTRS)
Goguen, J. A.; Linde, C.
1983-01-01
A linguistic method for the analysis of small group discourse was developed, and the use of this method on transcripts of commercial air transport accidents is demonstrated. The method identifies the discourse types that occur and determines their linguistic structure; it identifies significant linguistic variables based upon these structures or other linguistic concepts such as speech act and topic; it tests hypotheses that support significance and reliability of these variables; and it indicates the implications of the validated hypotheses. These implications fall into three categories: (1) to train crews to use more nearly optimal communication patterns; (2) to use linguistic variables as indices for aspects of crew performance such as attention; and (3) to provide guidelines for the design of aviation procedures and equipment, especially those that involve speech.
From Information Society to Knowledge Society: The Ontology Issue
NASA Astrophysics Data System (ADS)
Roche, Christophe
2002-09-01
Information society, virtual enterprise, e-business rely more and more on communication and knowledge sharing between heterogeneous actors. But, no communication is possible, and all the more so no co-operation or collaboration, if those actors do not share the same or at least a compatible meaning for the terms they use. Ontology, understood as an agreed vocabulary of common terms and meanings, is a solution to that problem. Nevertheless, although there is quite a lot of experience in using ontologies, several barriers remain which stand against a real use of ontology. As a matter of fact, it is very difficult to build, reuse and share ontologies. We claim that the ontology problem requires a multidisciplinary approach based on sound epistemological, logical and linguistic principles. This article presents the Ontological Knowledge Station (OK Station©), a software environment for building and using ontologies which relies on such principles. The OK Station is currently being used in several industrial applications.
McDonough, A Manuela; Vargas, Marcela; Nguyen-Rodriguez, Selena; Garcia, Melawhy; Galvez, Gino; Rios-Ellis, Britt
2016-01-01
Although cervical cancer can be prevented through screening and follow-up, Latinas' rate of Pap tests remains low due to knowledge gaps and cultural and attitudinal factors. This study used a single-group pre-/post-test design to evaluate the effectiveness of Mujer Sana, Familia Fuerte (Healthy Woman, Strong Family), an intervention intended to improve Latinas' cervical cancer prevention knowledge, attitudes, self-efficacy to obtain a Pap test, and intention to get tested. The intervention is delivered through a single session by promotores de salud, who use a culturally competent, linguistically appropriate toolkit. A total of 5,211 Latinas participated in the study. The evaluation indicated that participants had increases in knowledge, positive attitudes, self-efficacy, and intention to test. Latinas have a low rate of cervical cancer screening but a high rate of cervical cancer, and Mujer Sana, Familia Fuerte shows promise as a public health practice for use with this population.
Salud de la mujer: using fotonovelas to increase health literacy among Latinas.
Sberna Hinojosa, Melanie; Hinojosa, Ramon; Nelson, David A; Delgado, Angelica; Witzack, Bernadette; Gonzalez, Magdalisse; Farias, Rene; Ahmed, Syed; Meurer, Linda
2010-01-01
There is an identified need for health literacy strategies to be culturally sensitive and linguistically appropriate. The goal of our community-based participatory research (CBPR) project related to health and nutrition is to demonstrate that active community involvement in the creation of health education fotonovelas that are relevant to culture, ethnicity, gender, social class, and language can increase the health literacy of women in a disadvantaged community. We recruited 12 women to take part in our pilot fotonovela intervention about healthy eating and nutrition. Pre- and post-test assessments of knowledge, attitudes, and behavior around nutrition were given at baseline and will be collected after the completion of the project. We hypothesize that post-test assessments of our participants will reveal increased nutrition knowledge as well as positive changes in attitudes and behavior toward healthy eating. We believe that our fotonovelas will represent experiences of community members and encourage good health practices by increasing knowledge and cooperation among community members.
McDonough, A. Manuela; Vargas, Marcela; Nguyen-Rodriguez, Selena; Garcia, Melawhy; Galvez, Gino; Rios-Ellis, Britt
2018-01-01
Objective: Although cervical cancer can be prevented through screening and follow-up, Latinas’ rate of Pap tests remains low due to knowledge gaps and cultural and attitudinal factors. Methods: This study used a single-group pre-/post-test design to evaluate the effectiveness of Mujer Sana, Familia Fuerte (Healthy Woman, Strong Family), an intervention intended to improve Latinas’ cervical cancer prevention knowledge, attitudes, self-efficacy to obtain a Pap test, and intention to get tested. The intervention is delivered through a single session by promotores de salud, who use a culturally competent, linguistically appropriate toolkit. A total of 5,211 Latinas participated in the study. Results: The evaluation indicated that participants had increases in knowledge, positive attitudes, self-efficacy, and intention to test. Conclusion: Latinas have a low rate of cervical cancer screening but a high rate of cervical cancer, and Mujer Sana, Familia Fuerte shows promise as a public health practice for use with this population. PMID:27180696
Infusing Culturally Responsive Science Curriculum into Early Childhood Teacher Preparation
NASA Astrophysics Data System (ADS)
Yoon, Jiyoon; Martin, Leisa A.
2017-08-01
Previous research studies in early childhood teacher education have indicated that teacher candidates are not adequately prepared to demonstrate the knowledge and skills needed to teach science to all children including culturally and linguistically diverse students. To address this issue, the researchers provided 31 early childhood teacher candidates with instructions through a culturally responsive science education curriculum that integrates American and Korean science curriculum corresponding to the American and Korean standards for teacher education. The results showed a statistically significant increase in their Personal Science Teaching Efficacy (PSTE). In addition, the teacher candidates were able to create a multicultural/diverse lesson in the developing and proficiency levels based on Ambrosio's lesson matrix. This study provides teacher candidates' knowledge as well as an additional resource for developing their self-efficacy and understanding the role of multicultural/diverse lesson planning for science instruction. Also, teacher candidates could be better prepared by understanding how other countries approach science education and integrating this knowledge to enrich their own science instruction.
Pajak, Bozena; Fine, Alex B; Kleinschmidt, Dave F; Jaeger, T Florian
2016-12-01
We present a framework of second and additional language (L2/Ln) acquisition motivated by recent work on socio-indexical knowledge in first language (L1) processing. The distribution of linguistic categories covaries with socio-indexical variables (e.g., talker identity, gender, dialects). We summarize evidence that implicit probabilistic knowledge of this covariance is critical to L1 processing, and propose that L2/Ln learning uses the same type of socio-indexical information to probabilistically infer latent hierarchical structure over previously learned and new languages. This structure guides the acquisition of new languages based on their inferred place within that hierarchy, and is itself continuously revised based on new input from any language. This proposal unifies L1 processing and L2/Ln acquisition as probabilistic inference under uncertainty over socio-indexical structure. It also offers a new perspective on crosslinguistic influences during L2/Ln learning, accommodating gradient and continued transfer (both negative and positive) from previously learned to novel languages, and vice versa.
Pajak, Bozena; Fine, Alex B.; Kleinschmidt, Dave F.; Jaeger, T. Florian
2015-01-01
We present a framework of second and additional language (L2/Ln) acquisition motivated by recent work on socio-indexical knowledge in first language (L1) processing. The distribution of linguistic categories covaries with socio-indexical variables (e.g., talker identity, gender, dialects). We summarize evidence that implicit probabilistic knowledge of this covariance is critical to L1 processing, and propose that L2/Ln learning uses the same type of socio-indexical information to probabilistically infer latent hierarchical structure over previously learned and new languages. This structure guides the acquisition of new languages based on their inferred place within that hierarchy, and is itself continuously revised based on new input from any language. This proposal unifies L1 processing and L2/Ln acquisition as probabilistic inference under uncertainty over socio-indexical structure. It also offers a new perspective on crosslinguistic influences during L2/Ln learning, accommodating gradient and continued transfer (both negative and positive) from previously learned to novel languages, and vice versa. PMID:28348442
Gelada vocal sequences follow Menzerath's linguistic law.
Gustison, Morgan L; Semple, Stuart; Ferrer-I-Cancho, Ramon; Bergman, Thore J
2016-05-10
Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath's law is a linguistic law that states that the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath's law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath's law reflects compression, the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language.
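The negative relationship between construct size and constituent size stated by Menzerath's law can be checked with a simple correlation. The sketch below is only an illustration: the abstract does not say which statistic the authors used, so Pearson's r is an assumption made here, and the numbers are invented, not gelada data.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Invented illustrative values: sequence size (number of calls) and
# mean call duration in seconds for sequences of that size.
sequence_sizes = [1, 2, 3, 4, 5, 6]
mean_call_durations = [0.90, 0.80, 0.72, 0.65, 0.60, 0.58]

r = pearson_r(sequence_sizes, mean_call_durations)
```

A negative `r` is the pattern the law predicts: longer sequences are built from shorter calls.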
Language experience changes subsequent learning.
Onnis, Luca; Thiessen, Erik
2013-02-01
What are the effects of experience on subsequent learning? We explored the effects of language-specific word order knowledge on the acquisition of sequential conditional information. Korean and English adults were engaged in a sequence learning task involving three different sets of stimuli: auditory linguistic (nonsense syllables), visual non-linguistic (nonsense shapes), and auditory non-linguistic (pure tones). The forward and backward probabilities between adjacent elements generated two equally probable and orthogonal perceptual parses of the elements, such that any significant preference at test must be due to either general cognitive biases, or prior language-induced biases. We found that language modulated parsing preferences with the linguistic stimuli only. Intriguingly, these preferences are congruent with the dominant word order patterns of each language, as corroborated by corpus analyses, and are driven by probabilistic preferences. Furthermore, although the Korean individuals had received extensive formal explicit training in English and lived in an English-speaking environment, they exhibited statistical learning biases congruent with their native language. Our findings suggest that mechanisms of statistical sequential learning are implicated in language across the lifespan, and experience with language may affect cognitive processes and later learning. Copyright © 2012 Elsevier B.V. All rights reserved.
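The forward and backward probabilities between adjacent elements mentioned in this abstract are commonly defined as P(next | current) and P(current | next). The following sketch estimates both from a sequence; the function name and the toy stream are hypothetical and do not reproduce the study's stimuli.

```python
from collections import Counter


def transitional_probabilities(sequence):
    """Estimate forward and backward transitional probabilities.

    forward[(a, b)]  = count(a followed by b) / count(a as first element)
    backward[(a, b)] = count(a followed by b) / count(b as second element)
    """
    pair_counts = Counter(zip(sequence, sequence[1:]))
    first_counts = Counter(sequence[:-1])
    second_counts = Counter(sequence[1:])
    forward = {(a, b): c / first_counts[a] for (a, b), c in pair_counts.items()}
    backward = {(a, b): c / second_counts[b] for (a, b), c in pair_counts.items()}
    return forward, backward


# Toy stream, for illustration only.
stream = ["a", "b", "a", "c", "d", "b"]
forward, backward = transitional_probabilities(stream)
```

In this toy stream the pair ("a", "c") has a forward probability of 0.5 but a backward probability of 1.0, which shows how the two statistics can support different groupings of the same elements.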
List, Johann-Mattis; Pathmanathan, Jananan Sylvestre; Lopez, Philippe; Bapteste, Eric
2016-08-20
For a long time biologists and linguists have been noticing surprising similarities between the evolution of life forms and languages. Most of the proposed analogies have been rejected. Some, however, have persisted, and some even turned out to be fruitful, inspiring the transfer of methods and models between biology and linguistics up to today. Most proposed analogies were based on a comparison of the research objects rather than the processes that shaped their evolution. Focusing on process-based analogies, however, has the advantage of minimizing the risk of overstating similarities, while at the same time reflecting the common strategy to use processes to explain the evolution of complexity in both fields. We compared important evolutionary processes in biology and linguistics and identified processes specific to only one of the two disciplines as well as processes which seem to be analogous, potentially reflecting core evolutionary processes. These new process-based analogies support novel methodological transfer, expanding the application range of biological methods to the field of historical linguistics. We illustrate this by showing (i) how methods dealing with incomplete lineage sorting offer an introgression-free framework to analyze highly mosaic word distributions across languages; (ii) how sequence similarity networks can be used to identify composite and borrowed words across different languages; (iii) how research on partial homology can inspire new methods and models in both fields; and (iv) how constructive neutral evolution provides an original framework for analyzing convergent evolution in languages resulting from common descent (Sapir's drift). Apart from new analogies between evolutionary processes, we also identified processes which are specific to either biology or linguistics. This shows that general evolution cannot be studied from within one discipline alone. 
In order to get a full picture of evolution, biologists and linguists need to complement their studies, trying to identify cross-disciplinary and discipline-specific evolutionary processes. The fact that we found many process-based analogies favoring transfer from biology to linguistics further shows that certain biological methods and models have a broader scope than previously recognized. This opens fruitful paths for collaboration between the two disciplines. This article was reviewed by W. Ford Doolittle and Eugene V. Koonin.
Language, Cognition, and the Right Hemisphere: A Response to Gazzaniga.
ERIC Educational Resources Information Center
Levy, Jerre
1983-01-01
Disputes several assumptions made by Gazzaniga in the preceding article, namely: (1) that any capacity to extract meaning from spoken or written words indicates linguistic competence; and (2) that the right hemisphere is passive and nonresponsive and that the limits of its cognitive abilities are manifested in simple matching-to-sample tasks. (GC)
Linguistic Markers of Stance in Early and Advanced Academic Writing: A Corpus-Based Comparison
ERIC Educational Resources Information Center
Aull, Laura L.; Lancaster, Zak
2014-01-01
This article uses corpus methods to examine linguistic expressions of stance in over 4,000 argumentative essays written by incoming first-year university students in comparison with the writing of upper-level undergraduate students and published academics. The findings reveal linguistic stance markers shared across the first-year essays despite…
Examining the Utility of Topic Models for Linguistic Analysis of Couple Therapy
ERIC Educational Resources Information Center
Doeden, Michelle A.
2012-01-01
This study examined the basic utility of topic models, a computational linguistics model for text-based data, to the investigation of the process of couple therapy. Linguistic analysis offers an additional lens through which to examine clinical data, and the topic model is presented as a novel methodology within couple and family psychology that…
ERIC Educational Resources Information Center
Duncan, Sharon E.; De Avila, Edward A.
Language Assessment Scales, Level 2 (LAS II) are used to assess the linguistic proficiency of limited-English-speaking or non-English-speaking adolescents. LAS II, like its predecessor, LAS I, provides a picture of oral linguistic proficiency based on a student's performance across four linguistic subsystems: phonemic, lexical, syntactic and…
Sociolinguistics and the Counselling Process
ERIC Educational Resources Information Center
Conville, Richard L.; Ivey, Allen E.
1975-01-01
Sociolinguistics is the study of language as part of culture and society. Counselling, basically a linguistic-communicative process, has too often failed to consider systematic knowledge from related fields. This article discusses basic concepts of sociolinguistics and considers their relation to the counselling process. (Author)
Self-assessment procedure using fuzzy sets
NASA Astrophysics Data System (ADS)
Mimi, Fotini
2000-10-01
Self-Assessment processes, initiated by a company itself and carried out by its own people, are considered to be the starting point for a regular strategic or operative planning process to ensure continuous quality improvement. Their importance has increased with the growing relevance and acceptance of international quality awards such as the Malcolm Baldrige National Quality Award, the European Quality Award and the Deming Prize. Award winners in particular use the instrument of a systematic and regular Self-Assessment, and not only because they have to verify their quality and business results for at least three years. The Total Quality Model of the European Foundation for Quality Management (EFQM), used for the European Quality Award, is the basis for Self-Assessment in Europe. This paper presents a method supporting self-assessment, based on the methodology of fuzzy control systems, which provides an effective means of converting the linguistic approximation into an automatic control strategy. In particular, the elements of the Quality Model mentioned above are interpreted as linguistic variables. The LR-type of a fuzzy interval is used for their representation. The input data have a qualitative character based on empirical investigation and expert knowledge, and therefore the base variables are ordinally scaled. The aggregation process takes place on the basis of a hierarchical structure. Finally, in order to make the method more practical to use, a PC-based software system is developed and implemented.
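An LR-type fuzzy interval is usually described by a core [m_low, m_high] with full membership, a left spread alpha, a right spread beta, and two reference functions L and R. The sketch below uses the common linear reference function, which yields a trapezoid; this choice, the parameter names, and the example values are assumptions for illustration and are not taken from the paper.

```python
def linear_reference(u):
    """Linear reference function: 1 at u = 0, falling to 0 at u = 1."""
    return max(0.0, 1.0 - u)


def lr_membership(x, m_low, m_high, alpha, beta,
                  left=linear_reference, right=linear_reference):
    """Membership degree of x in an LR fuzzy interval.

    Core [m_low, m_high] has membership 1; alpha and beta (both > 0)
    are the left and right spreads.
    """
    if x < m_low:
        return left((m_low - x) / alpha)
    if x > m_high:
        return right((x - m_high) / beta)
    return 1.0


# Hypothetical linguistic term "good" on an ordinal 0-10 scale.
good = dict(m_low=4, m_high=6, alpha=2, beta=2)
degree = lr_membership(3, **good)
```

With these values a score of 3 belongs to "good" with degree 0.5, a score inside the core with degree 1.0, and scores at or beyond 2 and 8 with degree 0.0.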
Linguistic and pragmatic constraints on utterance interpretation
NASA Astrophysics Data System (ADS)
Hinkelman, Elizabeth A.
1990-05-01
In order to model how people understand language, it is necessary to understand not only grammar and logic but also how people use language to affect their environment. This area of study is known as natural language pragmatics. Speech acts, for instance, are the offers, promises, announcements, etc., that people make by talking. The same expression may be different acts in different contexts, and yet not every expression performs every act. We want to understand how people are able to recognize others' intentions and implications in saying something. Previous plan-based theories of speech act interpretation do not account for the conventional aspect of speech acts. They can, however, be made sensitive to both linguistic and propositional information. This dissertation presents a method of speech act interpretation which uses patterns of linguistic features (e.g., mood, verb form, sentence adverbials, thematic roles) to identify a range of speech act interpretations for the utterance. These are then filtered and elaborated by inferences about agents' goals and plans. In many cases the plan reasoning consists of short, local inference chains (that are in fact conversational implicatures), and extended reasoning is necessary only for the most difficult cases. The method is able to accommodate a wide range of cases, from those which seem very idiomatic to those which must be analyzed using knowledge about the world and human behavior. It explains how "Can you pass the salt?" can be a request while "Are you able to pass the salt?" is not.
The proper treatment of language acquisition and change in a population setting.
Niyogi, Partha; Berwick, Robert C
2009-06-23
Language acquisition maps linguistic experience, primary linguistic data (PLD), onto linguistic knowledge, a grammar. Classically, computational models of language acquisition assume a single target grammar and one PLD source, the central question being whether the target grammar can be acquired from the PLD. However, real-world learners confront populations with variation, i.e., multiple target grammars and PLDs. Removing this idealization has inspired a new class of population-based language acquisition models. This paper contrasts 2 such models. In the first, iterated learning (IL), each learner receives PLD from one target grammar but different learners can have different targets. In the second, social learning (SL), each learner receives PLD from possibly multiple targets, e.g., from 2 parents. We demonstrate that these 2 models have radically different evolutionary consequences. The IL model is dynamically deficient in 2 key respects. First, the IL model admits only linear dynamics and so cannot describe phase transitions, attested rapid changes in languages over time. Second, the IL model cannot properly describe the stability of languages over time. In contrast, the SL model leads to nonlinear dynamics, bifurcations, and possibly multiple equilibria and so suffices to model both the case of stable language populations, mixtures of more than 1 language, as well as rapid language change. The 2 models also make distinct, empirically testable predictions about language change. Using historical data, we show that the SL model more faithfully replicates the dynamics of the evolution of Middle English.
Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge.
Lau, Jey Han; Clark, Alexander; Lappin, Shalom
2017-07-01
The question of whether humans represent grammatical knowledge as a binary condition on membership in a set of well-formed sentences, or as a probabilistic property, has been the subject of debate among linguists, psychologists, and cognitive scientists for many decades. Acceptability judgments present a serious problem for both classical binary and probabilistic theories of grammaticality. These judgments are gradient in nature, and so cannot be directly accommodated in a binary formal grammar. However, it is also not possible to simply reduce acceptability to probability. The acceptability of a sentence is not the same as the likelihood of its occurrence, which is, in part, determined by factors like sentence length and lexical frequency. In this paper, we present the results of a set of large-scale experiments using crowd-sourced acceptability judgments that demonstrate gradience to be a pervasive feature in acceptability judgments. We then show how one can predict acceptability judgments on the basis of probability by augmenting probabilistic language models with an acceptability measure. This is a function that normalizes probability values to eliminate the confounding factors of length and lexical frequency. We describe a sequence of modeling experiments with unsupervised language models drawn from state-of-the-art machine learning methods in natural language processing. Several of these models achieve very encouraging levels of accuracy in the acceptability prediction task, as measured by the correlation between the acceptability measure scores and mean human acceptability values. We consider the relevance of these results to the debate on the nature of grammatical competence, and we argue that they support the view that linguistic knowledge can be intrinsically probabilistic. Copyright © 2016 Cognitive Science Society, Inc.
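One measure of the kind described in this abstract is the syntactic log-odds ratio (SLOR), which subtracts the unigram log probability of a sentence from the model's log probability and divides by sentence length. The sketch below assumes the log probabilities have already been computed by some language model; the function name and the toy values are hypothetical and are not taken from the paper's experiments.

```python
import math


def slor(model_logprob, unigram_logprobs):
    """Syntactic log-odds ratio: (log p_model(s) - log p_unigram(s)) / |s|.

    model_logprob    -- log probability of the whole sentence under the model
    unigram_logprobs -- one unigram log probability per token
    """
    n = len(unigram_logprobs)
    if n == 0:
        raise ValueError("sentence must contain at least one token")
    # Subtracting the unigram term removes the lexical-frequency effect;
    # dividing by n removes the length effect.
    return (model_logprob - sum(unigram_logprobs)) / n


# Hypothetical three-token sentence, natural-log probabilities.
unigram_lp = [math.log(0.01), math.log(0.002), math.log(0.05)]
model_lp = math.log(1e-5)
score = slor(model_lp, unigram_lp)
```

Scores of this kind can then be correlated with mean human ratings, which is the evaluation the abstract describes.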
A common neural substrate for perceiving and knowing about color
Simmons, W. Kyle; Ramjee, Vimal; Beauchamp, Michael S.; McRae, Ken; Martin, Alex; Barsalou, Lawrence W.
2013-01-01
Functional neuroimaging research has demonstrated that retrieving information about object-associated colors activates the left fusiform gyrus in posterior temporal cortex. Although regions near the fusiform have previously been implicated in color perception, it remains unclear whether color knowledge retrieval actually activates the color perception system. Evidence to this effect would be particularly strong if color perception cortex was activated by color knowledge retrieval triggered strictly with linguistic stimuli. To address this question, subjects performed two tasks while undergoing fMRI. First, subjects performed a property verification task using only words to assess conceptual knowledge. On each trial, subjects verified whether a named color or motor property was true of a named object (e.g., TAXI-yellow, HAIR-combed). Next, subjects performed a color perception task. A region of the left fusiform gyrus that was highly responsive during color perception also showed greater activity for retrieving color than motor property knowledge. These data provide the first evidence for a direct overlap in the neural bases of color perception and stored information about object-associated color, and they significantly add to accumulating evidence that conceptual knowledge is grounded in the brain’s modality-specific systems. PMID:17575989
A common neural substrate for perceiving and knowing about color.
Simmons, W Kyle; Ramjee, Vimal; Beauchamp, Michael S; McRae, Ken; Martin, Alex; Barsalou, Lawrence W
2007-09-20
Functional neuroimaging research has demonstrated that retrieving information about object-associated colors activates the left fusiform gyrus in posterior temporal cortex. Although regions near the fusiform have previously been implicated in color perception, it remains unclear whether color knowledge retrieval actually activates the color perception system. Evidence to this effect would be particularly strong if color perception cortex was activated by color knowledge retrieval triggered strictly with linguistic stimuli. To address this question, subjects performed two tasks while undergoing fMRI. First, subjects performed a property verification task using only words to assess conceptual knowledge. On each trial, subjects verified whether a named color or motor property was true of a named object (e.g., TAXI-yellow, HAIR-combed). Next, subjects performed a color perception task. A region of the left fusiform gyrus that was highly responsive during color perception also showed greater activity for retrieving color than motor property knowledge. These data provide the first evidence for a direct overlap in the neural bases of color perception and stored information about object-associated color, and they significantly add to accumulating evidence that conceptual knowledge is grounded in the brain's modality-specific systems.
Long-range comparison between genes and languages based on syntactic distances.
Colonna, Vincenza; Boattini, Alessio; Guardiano, Cristina; Dall'ara, Irene; Pettener, Davide; Longobardi, Giuseppe; Barbujani, Guido
2010-01-01
This study proposes a new approach for comparing genetic and linguistic diversity in populations belonging to distantly related groups. Comparisons of linguistic and genetic differences have proved powerful tools to reconstruct human demographic history. Current models assume on both sides that similarities reflect either descent from common ancestry or the balance between isolation and contact. Most linguistic phylogenies are ultimately based on lexical evidence (roughly, words and morphemes with their sounds and meanings). However, measures of lexical divergence are reliable only for closely related languages, so large-scale comparisons of genetic and linguistic diversity have appeared problematic so far. Syntax (abstract rules to combine words into sentences) appears more measurable, universally comparable, and stable than the lexicon, and hence certain syntactic similarities might reflect deeper linguistic relationships, such as those between distant language families. In this study, we compared, for the first time, genetic data to a matrix of syntactic differences among selected populations of three continents. Comparing two databases of microsatellite (Short Tandem Repeat) markers and Single Nucleotide Polymorphisms (SNPs) with a linguistic matrix based on the values of 62 grammatical parameters, we show that there is indeed a correlation of syntactic and genetic distances. We also identified a few outliers and suggest a possible interpretation of the overall pattern. These results strongly support the possibility of better investigating population history by combining genetic data with linguistic information of a new type, provided by a theoretically more sophisticated method to assess the relationships between distantly related languages and language families. Copyright © 2010 S. Karger AG, Basel.
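A correlation between two distance matrices, as reported in this abstract, is commonly computed over the off-diagonal entries; this is the statistic that a Mantel test evaluates. The abstract does not say which test the authors applied, so the sketch below is only an assumption about the general technique, with hypothetical function names and made-up toy distances. A significance estimate would additionally require a permutation step, which is omitted here.

```python
import math
from itertools import combinations


def upper_triangle(matrix):
    """Return the entries above the diagonal of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i, j in combinations(range(n), 2)]


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def matrix_correlation(a, b):
    """Correlate two symmetric distance matrices over their upper triangles."""
    return pearson(upper_triangle(a), upper_triangle(b))


# Toy, made-up distances among three populations (not data from the study).
syntactic = [[0.0, 0.1, 0.4], [0.1, 0.0, 0.5], [0.4, 0.5, 0.0]]
genetic = [[0.0, 0.02, 0.08], [0.02, 0.0, 0.10], [0.08, 0.10, 0.0]]
r = matrix_correlation(syntactic, genetic)
```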
2013-01-01
Background A large-scale, highly accurate, machine-understandable drug-disease treatment relationship knowledge base is important for computational approaches to drug repurposing. The large body of published biomedical research articles and clinical case reports available on MEDLINE is a rich source of FDA-approved drug-disease indications as well as drug-repurposing knowledge that is crucial for applying FDA-approved drugs to new diseases. However, much of this information is buried in free text and not captured in any existing databases. The goal of this study is to extract a large number of accurate drug-disease treatment pairs from published literature. Results In this study, we developed a simple but highly accurate pattern-learning approach to extract treatment-specific drug-disease pairs from 20 million biomedical abstracts available on MEDLINE. We extracted a total of 34,305 unique drug-disease treatment pairs, the majority of which are not included in existing structured databases. Our algorithm achieved a precision of 0.904 and a recall of 0.131 in extracting all pairs, and a precision of 0.904 and a recall of 0.842 in extracting frequent pairs. In addition, we have shown that the extracted pairs strongly correlate with both drug target genes and therapeutic classes, and therefore may have high potential in drug discovery. Conclusions We demonstrated that our simple pattern-learning relationship extraction algorithm is able to accurately extract many drug-disease pairs from the free text of biomedical literature that are not captured in structured databases. The large-scale, accurate, machine-understandable drug-disease treatment knowledge base resulting from our study, in combination with pairs from structured databases, will have high potential in computational drug repurposing tasks. PMID:23742147
Xu, Rong; Wang, QuanQiu
2015-02-01
Anticancer drug-associated side effect knowledge often exists in multiple heterogeneous and complementary data sources. A comprehensive anticancer drug-side effect (drug-SE) relationship knowledge base is important for computation-based drug target discovery, drug toxicity prediction and drug repositioning. In this study, we present a two-step approach that combines table classification and relationship extraction to extract drug-SE pairs from a large number of high-profile oncological full-text articles. The data consist of 31,255 tables downloaded from the Journal of Oncology (JCO). We first trained a statistical classifier to classify tables into SE-related and -unrelated categories. We then extracted drug-SE pairs from SE-related tables. We compared drug side effect knowledge extracted from JCO tables to that derived from FDA drug labels. Finally, we systematically analyzed relationships between anticancer drug-associated side effects and drug-associated gene targets, metabolism genes, and disease indications. The statistical table classifier is effective in classifying tables into SE-related and -unrelated (precision: 0.711; recall: 0.941; F1: 0.810). We extracted a total of 26,918 drug-SE pairs from SE-related tables with a precision of 0.605, a recall of 0.460, and an F1 of 0.520. Drug-SE pairs extracted from JCO tables are largely complementary to those derived from FDA drug labels; as many as 84.7% of the pairs extracted from JCO tables have not been included in a side effect database constructed from FDA drug labels. Side effects associated with anticancer drugs positively correlate with drug target genes, drug metabolism genes, and disease indications. Copyright © 2014 Elsevier Inc. All rights reserved.
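The F1 values quoted in this abstract are harmonic means of precision and recall. A minimal sketch of that standard formula follows, applied to the table-classifier figures reported above; the function name is hypothetical.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (the standard F1 definition)."""
    if precision + recall == 0:
        return 0.0  # conventional value when both scores are zero
    return 2 * precision * recall / (precision + recall)


# Table-classifier precision and recall as reported in the abstract.
table_f1 = f1_score(0.711, 0.941)
```

Because the harmonic mean is dominated by the smaller of the two values, a high recall cannot compensate for a low precision, or the reverse.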
Jadhav, Ashutosh; Sheth, Amit; Pathak, Jyotishman
2014-01-01
Since the early 2000s, Internet usage for health information searching has increased significantly. Studying search queries can help us understand users’ “information need” and how they formulate search queries (“expression of information need”). Although cardiovascular diseases (CVD) affect a large percentage of the population, few studies have investigated how and what users search for regarding CVD. We address this knowledge gap in the community by analyzing a large corpus of 10 million CVD-related search queries from MayoClinic.com. Using UMLS MetaMap and UMLS semantic types/concepts, we developed a rule-based approach to categorize the queries into 14 health categories. We analyzed structural properties, types (keyword-based/Wh-questions/Yes-No questions) and linguistic structure of the queries. Our results show that the most searched health categories are ‘Diseases/Conditions’, ‘Vital-Signs’, ‘Symptoms’ and ‘Living-with’. CVD queries are longer and are predominantly keyword-based. This study extends our knowledge about online health information searching and provides useful insights for Web search engines and health websites. PMID:25954380
Kreitewolf, Jens; Friederici, Angela D; von Kriegstein, Katharina
2014-11-15
Hemispheric specialization for linguistic prosody is a controversial issue. While it is commonly assumed that linguistic prosody and emotional prosody are preferentially processed in the right hemisphere, neuropsychological work directly comparing processes of linguistic prosody and emotional prosody suggests a predominant role of the left hemisphere for linguistic prosody processing. Here, we used two functional magnetic resonance imaging (fMRI) experiments to clarify the role of left and right hemispheres in the neural processing of linguistic prosody. In the first experiment, we sought to confirm previous findings showing that linguistic prosody processing compared to other speech-related processes predominantly involves the right hemisphere. Unlike previous studies, we controlled for stimulus influences by employing a prosody and speech task using the same speech material. The second experiment was designed to investigate whether a left-hemispheric involvement in linguistic prosody processing is specific to contrasts between linguistic prosody and emotional prosody or whether it also occurs when linguistic prosody is contrasted against other non-linguistic processes (i.e., speaker recognition). Prosody and speaker tasks were performed on the same stimulus material. In both experiments, linguistic prosody processing was associated with activity in temporal, frontal, parietal and cerebellar regions. Activation in temporo-frontal regions showed differential lateralization depending on whether the control task required recognition of speech or speaker: recognition of linguistic prosody predominantly involved right temporo-frontal areas when it was contrasted against speech recognition; when contrasted against speaker recognition, recognition of linguistic prosody predominantly involved left temporo-frontal areas. 
The results show that linguistic prosody processing involves functions of both hemispheres and suggest that recognition of linguistic prosody is based on an inter-hemispheric mechanism which exploits both a right-hemispheric sensitivity to pitch information and a left-hemispheric dominance in speech processing. Copyright © 2014 Elsevier Inc. All rights reserved.
Multidimensional Analysis of Linguistic Networks
NASA Astrophysics Data System (ADS)
Araújo, Tanya; Banisch, Sven
Network-based approaches play an increasingly important role in the analysis of data, even in systems in which a network representation is not immediately apparent. This is particularly true for linguistic networks, which are usually induced from a linguistic data set for which a network perspective is only one of several options for representation. Here we introduce a multidimensional framework for network construction and analysis with a special focus on linguistic networks. The framework is used to show that the higher the abstraction level of network induction, the harder the interpretation of the topological indicators used in network analysis. Several examples are provided, allowing different linguistic networks to be compared with one another as well as with networks in other fields of application of network theory. The computation and the intelligibility of some statistical indicators frequently used in linguistic networks are discussed. The discussion suggests that the field of linguistic networks, by applying statistical tools inspired by network studies in other domains, may, in its current state, make only a limited contribution to the development of linguistic theory.
NASA Astrophysics Data System (ADS)
Thomson, Norman
2003-01-01
Using Keiyo (Kenya) knowledge, learning and oral narratives about snakes, the paper advances the argument that science educators have a pivotal role as orthographers in 'preserving and promoting science for all'. Linguists, and a growing number of scientists, realize that in processes of globalisation, many indigenous languages and cultures are facing extinction, especially languages that remain unwritten, such as the Keiyo language. Within these languages are several thousand years of indigenous science education that include knowledge, teaching and learning about local environments. Science educators are a missing link in the ongoing conversations between biologists, linguists and indigenous cultures. Today, it is also known that reptiles are at greater risk of extinction than amphibians. In an area noted for its reptiles (Kenya's Rift Valley), Keiyo elders and students (n = 748) were interviewed or given a questionnaire to determine indigenous names for snakes and how Keiyo oral narratives of snakes are used in teaching and learning. They provided names for 19 of 34 (55%) snake species and 278 narratives that include snakes. The data are being used to document the Keiyo language and construct relevant written science curriculum materials for Keiyo children.
Chemical name extraction based on automatic training data generation and rich feature set.
Yan, Su; Spangler, W Scott; Chen, Ying
2013-01-01
The automation of extracting chemical names from text has significant value to biomedical and life science research. A major barrier in this task is the difficulty of obtaining a sizable, good-quality data set to train a reliable entity extraction model. Another difficulty is the selection of informative features of chemical names, since comprehensive domain knowledge of chemical nomenclature is required. Leveraging random text generation techniques, we explore the idea of automatically creating training sets for the task of chemical name extraction. Assuming the availability of an incomplete list of chemical names, called a dictionary, we are able to generate well-controlled, random, yet realistic chemical-like training documents. We statistically analyze the construction of chemical names based on the incomplete dictionary, and propose a series of new features, without relying on any domain knowledge. Compared to state-of-the-art models learned from manually labeled data and domain knowledge, our solution shows better or comparable results in annotating real-world data with less human effort. Moreover, we report an interesting observation about the language of chemical names: both the structural and semantic components of chemical names follow a Zipfian distribution, as in many natural languages.
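A common diagnostic for a Zipfian distribution is to rank items by frequency and check that log frequency falls roughly linearly with log rank, with a slope near -1. The abstract does not describe how the authors tested this, so the sketch below is only an illustration of that general diagnostic; the function names and the toy name fragments are hypothetical.

```python
import math
from collections import Counter


def rank_frequencies(tokens):
    """Frequencies of the distinct tokens, sorted from most to least common."""
    return sorted(Counter(tokens).values(), reverse=True)


def loglog_slope(freqs):
    """Least-squares slope of log(frequency) against log(rank)."""
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


# Toy fragments chosen so that frequency is exactly 12 / rank.
tokens = ["meth"] * 12 + ["eth"] * 6 + ["prop"] * 4 + ["but"] * 3
slope = loglog_slope(rank_frequencies(tokens))
```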
Learning a Health Knowledge Graph from Electronic Medical Records.
Rotmensch, Maya; Halpern, Yoni; Tlimat, Abdulhakim; Horng, Steven; Sontag, David
2017-07-20
Demand for clinical decision support systems in medicine and self-diagnostic symptom checkers has substantially increased in recent years. Existing platforms rely on knowledge bases manually compiled through a labor-intensive process or automatically derived using simple pairwise statistics. This study explored an automated process to learn high quality knowledge bases linking diseases and symptoms directly from electronic medical records. Medical concepts were extracted from 273,174 de-identified patient records and maximum likelihood estimation of three probabilistic models was used to automatically construct knowledge graphs: logistic regression, naive Bayes classifier and a Bayesian network using noisy OR gates. A graph of disease-symptom relationships was elicited from the learned parameters and the constructed knowledge graphs were evaluated and validated, with permission, against Google's manually-constructed knowledge graph and against expert physician opinions. Our study shows that direct and automated construction of high quality health knowledge graphs from medical records using rudimentary concept extraction is feasible. The noisy OR model produces a high quality knowledge graph reaching precision of 0.85 for a recall of 0.6 in the clinical evaluation. Noisy OR significantly outperforms all tested models across evaluation frameworks (p < 0.01).
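The noisy OR gate named in this abstract has a simple closed form: a symptom is absent only if every present disease independently fails to cause it and a background "leak" cause also fails. The sketch below shows that standard formula; the parameter values are made up for illustration and are not learned from the study's records, and the function name is hypothetical.

```python
def noisy_or(present_diseases, activation, leak=0.0):
    """Probability that a symptom is present under a noisy-OR gate.

    present_diseases -- diseases the patient has
    activation       -- dict: disease -> probability it alone causes the symptom
    leak             -- probability the symptom appears with no modeled cause
    """
    p_absent = 1.0 - leak
    for disease in present_diseases:
        # Each present disease independently fails to trigger the symptom.
        p_absent *= 1.0 - activation[disease]
    return 1.0 - p_absent


# Hypothetical parameters for a single symptom.
activation = {"flu": 0.6, "cold": 0.3}
p_fever = noisy_or(["flu", "cold"], activation, leak=0.05)
```

In a setup like this, the per-disease activation parameters give a natural weight for ranking disease-symptom edges.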
Yin, Kedong; Yang, Benshuo; Li, Xuemei
2018-01-24
In this paper, we investigate multiple attribute group decision making (MAGDM) problems in which decision makers represent their evaluations of alternatives by trapezoidal fuzzy two-dimensional uncertain linguistic variables. To begin with, we introduce the definition, properties, expectation, and operational laws of trapezoidal fuzzy two-dimensional linguistic information. Then, to improve the accuracy of decision making in cases where the attributes are interrelated, we analyze the partitioned Bonferroni mean (PBM) operator in a trapezoidal fuzzy two-dimensional variable environment and develop two operators: the trapezoidal fuzzy two-dimensional linguistic partitioned Bonferroni mean (TF2DLPBM) aggregation operator and the trapezoidal fuzzy two-dimensional linguistic weighted partitioned Bonferroni mean (TF2DLWPBM) aggregation operator. Furthermore, we develop a novel method to solve MAGDM problems based on the TF2DLWPBM aggregation operator. Finally, a practical example is presented to illustrate the effectiveness of this method and to analyze the impact of different parameters on the results of decision making.
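The operators in this abstract build on the classical Bonferroni mean, which averages products of pairs of distinct inputs and so reflects interrelationships among attributes. The sketch below implements only that crisp, unpartitioned base operator on plain numbers; it is not the paper's TF2DLPBM or TF2DLWPBM operator, and the function name and example scores are hypothetical.

```python
def bonferroni_mean(values, p=1.0, q=1.0):
    """Classical Bonferroni mean of non-negative numbers.

    BM^{p,q}(a_1..a_n) = ( 1/(n(n-1)) * sum_{i != j} a_i^p * a_j^q )^(1/(p+q))
    """
    n = len(values)
    if n < 2:
        raise ValueError("at least two values are required")
    total = sum(
        values[i] ** p * values[j] ** q
        for i in range(n)
        for j in range(n)
        if i != j  # only pairs of distinct attributes contribute
    )
    return (total / (n * (n - 1))) ** (1.0 / (p + q))


# Hypothetical crisp attribute scores for one alternative.
bm = bonferroni_mean([1.0, 2.0, 3.0])
```

A partitioned variant would apply this operator within groups of related attributes and then combine the group results.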
Yin, Kedong; Yang, Benshuo
2018-01-01
In this paper, we investigate multiple attribute group decision making (MAGDM) problems in which decision makers represent their evaluations of alternatives by trapezoidal fuzzy two-dimensional uncertain linguistic variables. To begin with, we introduce the definition, properties, expectation, and operational laws of trapezoidal fuzzy two-dimensional linguistic information. Then, to improve the accuracy of decision making in cases where the attributes are interrelated, we analyze the partitioned Bonferroni mean (PBM) operator in a trapezoidal fuzzy two-dimensional variable environment and develop two operators: the trapezoidal fuzzy two-dimensional linguistic partitioned Bonferroni mean (TF2DLPBM) aggregation operator and the trapezoidal fuzzy two-dimensional linguistic weighted partitioned Bonferroni mean (TF2DLWPBM) aggregation operator. Furthermore, we develop a novel method to solve MAGDM problems based on the TF2DLWPBM aggregation operator. Finally, a practical example is presented to illustrate the effectiveness of this method and to analyze the impact of different parameters on the results of decision making. PMID:29364849
ERIC Educational Resources Information Center
Xie, Qin
2015-01-01
Corpus linguistics has transformed the landscape of empirical research on languages in recent decades. The proliferation of corpus technology has enabled researchers worldwide to conduct research in their own geographical locations with few hindrances. It has become increasingly commonplace for researchers to compile their own corpora for specific…
20 CFR 416.732 - No penalty deduction if you have good cause for failure to report timely.
Code of Federal Regulations, 2010 CFR
2010-04-01
..., educational, or linguistic limitations (including any lack of facility with the English language) you may have... willful. “Not willful” means that— (i) You did not have full knowledge of the existence of your obligation...
Educating Consultants for Multicultural Practice of Consultee-Centered Consultation
ERIC Educational Resources Information Center
Ingraham, Colette L.
2017-01-01
Literature about educating consultants with knowledge, skills, and dispositions to work effectively within culturally and linguistically diverse schools is scarce. Research suggests that additional attention is needed on the preparation of consultants to practice with multicultural competence. This article reviews theories and research and…
DesAutels, Spencer J; Fox, Zachary E; Giuse, Dario A; Williams, Annette M; Kou, Qing-Hua; Weitkamp, Asli; Patel, Neal R; Bettinsoli Giuse, Nunzia
2016-01-01
Clinical decision support (CDS) knowledge, embedded over time in mature medical systems, presents an interesting and complex opportunity for information organization, maintenance, and reuse. To have a holistic view of all decision support requires an in-depth understanding of each clinical system as well as expert knowledge of the latest evidence. This approach to clinical decision support presents an opportunity to unify and externalize the knowledge within rules-based decision support. Driven by an institutional need to prioritize decision support content for migration to new clinical systems, the Center for Knowledge Management and Health Information Technology teams applied their unique expertise to extract content from individual systems, organize it through a single extensible schema, and present it for discovery and reuse through a newly created Clinical Support Knowledge Acquisition and Archival Tool (CS-KAAT). CS-KAAT can build and maintain the underlying knowledge infrastructure needed by clinical systems.
2013-01-01
Background We introduce a Knowledge-based Decision Support System (KDSS) to address the protein complex extraction problem. Using a Knowledge Base (KB) that encodes expertise about the proposed scenario, our KDSS is able to suggest both strategies and tools according to the features of the input dataset. Our system provides a navigable workflow for the current experiment and also offers support in configuring and running every processing component of that workflow. This last feature makes our system a crossover between classical DSS and Workflow Management Systems. Results We briefly present the KDSS architecture and the basic concepts used in the design of the knowledge base and the reasoning component. The system is then tested using a subset of the Saccharomyces cerevisiae protein-protein interaction dataset. We used this subset because it has been well studied in the literature by several research groups in the field of complex extraction, so we could easily compare the results obtained through our KDSS with theirs. Our system suggests both a preprocessing and a clustering strategy, and for each of them it proposes and, where applicable, runs suitable algorithms. Our system's final results are then composed of a workflow of tasks, which can be reused for other experiments, and the specific numerical results for that particular trial. Conclusions The proposed approach, using the KDSS knowledge base, provides a novel workflow that gives the best results compared with the other workflows produced by the system. This workflow and its numeric results have been compared with other approaches to PPI network analysis found in the literature, offering similar results. PMID:23368995
Johnson, Jeffrey P.; Villard, Sarah; Kiran, Swathi
2017-01-01
Purpose This study was conducted to investigate the static and dynamic relationships between impairment-level cognitive-linguistic abilities and activity-level functional communication skills in persons with aphasia (PWA). Method In Experiment 1, a battery of standardized assessments was administered to a group of PWA (N = 72) to examine associations between cognitive-linguistic ability and functional communication at a single time point. In Experiment 2, impairment-based treatment was administered to a subset of PWA from Experiment 1 (n = 39) in order to examine associations between change in cognitive-linguistic ability and change in function and associations at a single time point. Results In both experiments, numerous significant associations were found between scores on tests of cognitive-linguistic ability and a test of functional communication at a single time point. In Experiment 2, significant treatment-induced gains were seen on both types of measures in participants with more severe aphasia, yet cognitive-linguistic change scores were not significantly correlated with functional communication change scores. Conclusions At a single time point, cognitive-linguistic and functional communication abilities are associated in PWA. However, although changes on standardized assessments reflecting improvements in both types of skills can occur following an impairment-based therapy, these changes may not be significantly associated with each other. PMID:28196373
Online Knowledge-Based Model for Big Data Topic Extraction.
Khan, Muhammad Taimoor; Durrani, Mehr; Khalid, Shehzad; Aziz, Furqan
2016-01-01
Lifelong machine learning (LML) models learn from experience while maintaining a knowledge base, without user intervention. Unlike traditional single-domain models, they can easily scale up to explore big data. Existing LML models have high data dependency, consume more resources, and do not support streaming data. This paper proposes an online LML model (OAMC) that supports streaming data with reduced data dependency. By engineering the knowledge base and introducing new knowledge features, the learning pattern of the model is improved for data arriving in pieces. OAMC improves accuracy, measured as topic coherence, by 7% for streaming data while halving the processing cost.
Language acquisition and use: learning and applying probabilistic constraints.
Seidenberg, M S
1997-03-14
What kinds of knowledge underlie the use of language and how is this knowledge acquired? Linguists equate knowing a language with knowing a grammar. Classic "poverty of the stimulus" arguments suggest that grammar identification is an intractable inductive problem and that acquisition is possible only because children possess innate knowledge of grammatical structure. An alternative view is emerging from studies of statistical and probabilistic aspects of language, connectionist models, and the learning capacities of infants. This approach emphasizes continuity between how language is acquired and how it is used. It retains the idea that innate capacities constrain language learning, but calls into question whether they include knowledge of grammatical structure.
Towards an Age-Phenome Knowledge-base
2011-01-01
Background Currently, data about age-phenotype associations are not systematically organized and cannot be studied methodically. Searching for scientific articles describing phenotypic changes reported as occurring at a given age is not possible for most ages. Results Here we present the Age-Phenome Knowledge-base (APK), in which knowledge about age-related phenotypic patterns and events can be modeled and stored for retrieval. The APK contains evidence connecting specific ages or age groups with phenotypes, such as disease and clinical traits. Using a simple text mining tool developed for this purpose, we extracted instances of age-phenotype associations from journal abstracts related to non-insulin-dependent Diabetes Mellitus. In addition, links between age and phenotype were extracted from clinical data obtained from the NHANES III survey. The knowledge stored in the APK is made available to the relevant research community in the form of 'Age-Cards'; each card holds the collection of all the information stored in the APK about a particular age. These Age-Cards are presented in a wiki, allowing community review, amendment and contribution of additional information. In addition to the wiki interaction, complex searches can also be conducted, although these require the user to have some knowledge of database query construction. Conclusions The combination of a knowledge-model-based repository with community participation in the evolution and refinement of the knowledge-base makes the APK a useful and valuable environment for collecting and curating existing knowledge of the connections between age and phenotypes. PMID:21651792
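The extraction and grouping idea described in this abstract can be illustrated with a minimal sketch. It is not the APK authors' tool: the phenotype list, the age pattern, and the function names `extract_associations` and `build_age_cards` are all assumptions made for illustration. The sketch finds age mentions and phenotype terms that co-occur in a sentence and groups the phenotypes by age, loosely mirroring the 'Age-Card' notion.

```python
import re
from collections import defaultdict

# Hypothetical phenotype vocabulary (assumption); a real system would
# rely on a curated terminology rather than a hard-coded list.
PHENOTYPES = ["type 2 diabetes", "hypertension", "obesity"]

# Matches phrases such as "at age 45", "aged 45", or "45-year-old".
AGE_PATTERN = re.compile(
    r"\b(?:at age|aged)\s+(\d{1,3})\b|\b(\d{1,3})-year-old\b",
    re.IGNORECASE,
)


def extract_associations(sentence):
    """Return (age, phenotype) pairs that co-occur in one sentence."""
    # findall yields one tuple per match; exactly one group is non-empty.
    ages = [int(a or b) for a, b in AGE_PATTERN.findall(sentence)]
    lowered = sentence.lower()
    found = [p for p in PHENOTYPES if p in lowered]
    return [(age, p) for age in ages for p in found]


def build_age_cards(sentences):
    """Group extracted phenotypes by age, one 'card' per age."""
    cards = defaultdict(set)
    for sentence in sentences:
        for age, phenotype in extract_associations(sentence):
            cards[age].add(phenotype)
    return {age: sorted(phenotypes) for age, phenotypes in cards.items()}


if __name__ == "__main__":
    demo = [
        "Obesity was common in 45-year-old patients with hypertension.",
        "Type 2 diabetes was diagnosed at age 60.",
    ]
    print(build_age_cards(demo))
```

Sentence-level co-occurrence is a deliberately crude criterion chosen to keep the sketch short; it will produce false positives whenever an age and a phenotype appear in the same sentence without being related.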
Mediating Language Learning: Teacher Interactions with ESL Students in a Content-Based Classroom.
ERIC Educational Resources Information Center
Gibbons, Pauline
2003-01-01
Draws on constructs of "mediation" from sociocultural theory and "mode continuum" from systemic functional linguistics to investigate how student-teacher talk in a content-based classroom contributes to learners' language development. Shows how teachers mediate between students' linguistic levels in English and their…
Concept-Based Content of Professional Linguistic Education
ERIC Educational Resources Information Center
Makshantseva, Nataliia Veniaminovna; Bankova, Liudmila Lvovna
2016-01-01
The article deals with the professional education of future linguists built on a conceptual approach. The topic is exemplified by the Russian language and a successful attempt to implement the concept-based approach to forming the content of professional language education. Within the framework of the proposed research, the concept is…