NASA Astrophysics Data System (ADS)
Znikina, Ludmila; Rozhneva, Elena
2017-11-01
The article deals with the distribution of informative intensity of the English-language scientific text based on its structural features contributing to the process of formalization of the scientific text and the preservation of the adequacy of the text with derived semantic information in relation to the primary. Discourse analysis is built on specific compositional and meaningful examples of scientific texts taken from the mining field. It also analyzes the adequacy of the translation of foreign texts into another language, the relationships between elements of linguistic systems, the degree of a formal conformance, translation with the specific objectives and information needs of the recipient. Some key words and ideas are emphasized in the paragraphs of the English-language mining scientific texts. The article gives the characteristic features of the structure of paragraphs of technical text and examples of constructions in English scientific texts based on a mining theme with the aim to explain the possible ways of their adequate translation.
Cognitive Implications of Nominalizations in the Advancement of Scientific Discourse
ERIC Educational Resources Information Center
Bello, Iria
2016-01-01
Nominalizations are well-known features of scientific writing. Scholars have been intrigued by their form and by their functions. While these features have been widely studied, the cognitive side of nominalizations in scientific texts still needs further attention. Nominalizations contribute to the advancement of discourse and at the same time add…
Teaching Scientific Metaphors through Informational Text Read-Alouds
ERIC Educational Resources Information Center
Barnes, Erica M.; Oliveira, Alandeom W.
2018-01-01
Elementary students are expected to use various features of informational texts to build knowledge in the content areas. In science informational texts, scientific metaphors are commonly used to make sense of complex and invisible processes. Although elementary students may be familiar with literary metaphors as used in narratives, they may be…
ERIC Educational Resources Information Center
Bromme, Rainer; Scharrer, Lisa; Stadtler, Marc; Hömberg, Johanna; Torspecken, Ronja
2015-01-01
Scientific texts are a genre in which adherence to specific discourse conventions allows for conclusions on the scientific integrity of the information and thus on its validity. This study examines whether genre-typical features of scientific discourse influence how laypeople handle conflicting science-based knowledge claims. In two experiments…
ERIC Educational Resources Information Center
Lesley, Mellinee
2014-01-01
Through a content analysis of 200 "tweets," this study was an exploration into the distinct features of text posted to NASA's "Twitter" site and the potential for these posts to serve as more engaging scientific text than traditional textbooks for adolescents. Results of the content analysis indicated the tweets and linked…
NASA Astrophysics Data System (ADS)
Defrance, Nancy L.
Technology offers promise of 'leveling the playing field' for struggling readers. That is, instructional support features within digital texts may enable all readers to learn. This quasi-experimental study examined the effects on learning of two support features, which offered unique opportunities to interact with text. The Highlight & Animate Feature highlighted an important idea in prose, while simultaneously animating its representation in an adjacent graphic. It invited readers to integrate ideas depicted in graphics and prose, using each one to interpret the other. The Manipulable Graphics had parts that the reader could operate to discover relationships among phenomena. It invited readers to test or refine the ideas that they brought to, or gleaned from, the text. Use of these support features was compulsory. Twenty fifth-grade struggling readers read a graphic-rich digital science text in a clinical interview setting, under one of two conditions: using either the Highlight & Animate Feature or the Manipulable Graphics. Participants in both conditions made statistically significant gains on a multiple-choice measure of knowledge of the topic of the text. While there were no significant differences by condition in the amount of knowledge gained, there were significant differences in the quality of knowledge expressed. Transcripts revealed that understandings about light and vision, expressed by those who used the Highlight & Animate Feature, were more often conceptually and linguistically 'complete.' That is, their understandings included both a description of phenomena as well as an explanation of underlying scientific principles, which participants articulated using the vocabulary of the text. This finding may be attributed to the multiple opportunities to integrate graphics (depicting the behavior of phenomena) and prose (providing the scientific explanation of those phenomena), which characterized the Highlight & Animate Condition.
Those who used the Manipulable Graphics were more likely to express complete understandings when they were able to structure a systematic investigation of the graphic and when the graphic was designed to confront their own naive conceptions about light and vision. The Manipulable Graphics also provided a foothold for those who entered the study with very little prior knowledge of the topic.
Arrow of Time: Metaphorical Construals of Entropy and the Second Law of Thermodynamics
ERIC Educational Resources Information Center
Amin, Tamer G.; Jeppsson, Fredrik; Haglund, Jesper; Stromdahl, Helge
2012-01-01
Various features of scientific discourse have been characterized in the science education literature, and challenges students face in appropriating these features have been explored. Using the framework of conceptual metaphor, this paper sought to identify explicit and implicit metaphors in pedagogical texts dealing with the concept of entropy and…
Reading Picture Books and Learning Science: Engaging Young Children with Informational Text
ERIC Educational Resources Information Center
Mantzicopoulos, Panayota; Patrick, Helen
2011-01-01
The authors draw from the research literature and from their work with the Scientific Literacy Project (SLP) in kindergarten classrooms to address the inclusion of science picture books in the curriculum. They describe features and functions of informational texts, discuss teachers' common concerns about providing young children with experiences…
NASA Astrophysics Data System (ADS)
Wilson, B. D.; McGibbney, L. J.; Mattmann, C. A.; Ramirez, P.; Joyce, M.; Whitehall, K. D.
2015-12-01
Quantifying scientific relevancy is of increasing importance to NASA and the research community. Scientific relevancy may be defined by mapping the impacts of a particular NASA mission, instrument, and/or retrieved variables to disciplines such as climate predictions, natural hazards detection and mitigation processes, education, and scientific discoveries. Related to relevancy is the ability to expose data with similar attributes. This in turn depends upon our ability to extract latent, implicit document features from scientific data and resources and make them explicit, accessible and useable for search activities, amongst others. This paper presents MemexGATE, a server side application, command line interface and computing environment for running large scale metadata extraction, general architecture text engineering, document classification and indexing tasks over document resources such as social media streams, scientific literature archives, legal documentation, etc. This work builds on existing experiences using MemexGATE (funded, developed and validated through the DARPA Memex Program, PI Mattmann) for extracting and leveraging latent content features from document resources within the Materials Research domain. We extend the software's functionality to the domain of scientific literature, with emphasis on the expansion of gazetteer lists, named entity rules, and natural language construct labeling (e.g. synonym, antonym, hyponym, etc.) efforts to enable extraction of latent content features from data hosted by a wide variety of scientific literature vendors (AGU Meeting Abstract Database, Springer, Wiley Online, Elsevier, etc.) hosting earth science literature.
Such literature makes both implicit and explicit references to NASA datasets and to relationships between such concepts stored across EOSDIS DAACs; hence we envisage that a significant part of this effort will also include development and understanding of relevancy signals which can ultimately be utilized for improved search and relevancy ranking across scientific literature.
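The gazetteer-based extraction mentioned in the abstract above can be illustrated with a minimal Python sketch. This is not MemexGATE code and does not use its API; the gazetteer entries, the entity type names, and the function `tag_entities` are all hypothetical, chosen only to show how a list of known surface forms can make implicit references explicit.

```python
import re

# Hypothetical gazetteer: surface form -> entity type.
# These entries are illustrative only and are not taken from MemexGATE.
GAZETTEER = {
    "modis": "Instrument",
    "aqua": "Mission",
    "sea surface temperature": "Variable",
}


def tag_entities(text, gazetteer=GAZETTEER):
    """Return sorted (start, end, matched_text, entity_type) tuples."""
    hits = []
    for surface, entity_type in gazetteer.items():
        # Whole-word, case-insensitive match of the gazetteer entry.
        pattern = r"\b" + re.escape(surface) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            hits.append((match.start(), match.end(), match.group(0), entity_type))
    return sorted(hits)


if __name__ == "__main__":
    sample = "MODIS on Aqua retrieves sea surface temperature."
    for hit in tag_entities(sample):
        print(hit)
```

A production system would add named entity rules and synonym handling on top of such lookups; the sketch covers only the exact-match step.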
USSR Report: Cybernetics, Computers and Automation Technology. No. 69.
1983-05-06
computers in multiprocessor and multistation design , control and scientific research automation systems. The results of comparing the efficiency of...Podvizhnaya, Scientific Research Institute of Control Computers, Severodonetsk] [Text] The most significant change in the design of the SM-2M compared to...UPRAVLYAYUSHCHIYE SISTEMY I MASHINY, Nov-Dec 82) 95 APPLICATIONS Kiev Automated Control System, Design Features and Prospects for Development (V. A
ERIC Educational Resources Information Center
Liddicoat, Anthony J.
2004-01-01
This article investigates one aspect of scientific style in French: the use of tenses. It investigates the claims made in the literature that the verb system of scientific French is atemporal. The frequency of tensed finite forms in 10 French language journal articles on biological sciences is examined. The rhetorical function of past and future…
Modifiable futures: science fiction at the bench.
Milburn, Colin
2010-09-01
Science fiction remains an alien dimension of the history of science. Historical and literary studies of science have become increasingly attentive to various "literary technologies" in scientific practice, the metaphorical features of scientific discourse, and the impact of popular science writing on the social development of scientific knowledge. But the function of science fiction and even literature as such in the history of scientific and technological innovation has often been obscured, misconstrued, or repudiated owing to conventional notions of authorship, influence, and the organic unity of texts. The better to address those close encounters where scientific practice makes use of speculative fiction, this essay proposes that we instead analyze such exchanges as processes of appropriation, remixing, and modification.
NASA Astrophysics Data System (ADS)
Langbeheim, Elon; Safran, Samuel A.; Yerushalmi, Edit
2013-01-01
We present design guidelines for using Adapted Primary Literature (APL) as part of current interdisciplinary topics to introductory physics students. APL is a text genre that allows students to comprehend a scientific article, while maintaining the core features of the communication among scientists, thus representing an authentic scientific discourse. We describe the adaptation of a research paper by Nobel Laureate Paul Flory on phase equilibrium in polymer-solvent mixtures that was presented to high school students in a project-based unit on soft matter. The adaptation followed two design strategies: a) Making explicit the interplay between the theory and experiment. b) Re-structuring the text to map the theory onto the students' prior knowledge. Specifically, we map the theory of polymer-solvent systems onto a model for binary mixtures of small molecules of equal size that was already studied in class.
mSciences: An Affinity Space for Science Teachers
ERIC Educational Resources Information Center
Mota, Jorge; Morais, Carla; Moreira, Luciano; Paiva, João C.
2017-01-01
The project "Multimedia in science teaching: five years of research and teaching in Portugal" was successful in featuring the national research on multimedia in science education and in providing the community with a simple reference tool--a repository of open access scientific texts. The current work aims to describe the theoretical…
Moradi, Milad; Ghadiri, Nasser
2018-01-01
Automatic text summarization tools help users in the biomedical domain to acquire their intended information from various textual resources more efficiently. Some biomedical text summarization systems base their sentence selection approach on the frequency of concepts extracted from the input text. However, it seems that exploring measures other than raw frequency for identifying valuable content within an input document, or considering correlations existing between concepts, may be more useful for this type of summarization. In this paper, we describe a Bayesian summarization method for biomedical text documents. The Bayesian summarizer initially maps the input text to the Unified Medical Language System (UMLS) concepts; then it selects the important ones to be used as classification features. We introduce six different feature selection approaches to identify the most important concepts of the text and select the most informative content according to the distribution of these concepts. We show that with the use of an appropriate feature selection approach, the Bayesian summarizer can improve the performance of biomedical summarization. Using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit, we perform extensive evaluations on a corpus of scientific papers in the biomedical domain. The results show that when the Bayesian summarizer utilizes the feature selection methods that do not use the raw frequency, it can outperform the biomedical summarizers that rely on the frequency of concepts, as well as domain-independent and baseline methods. Copyright © 2017 Elsevier B.V. All rights reserved.
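For orientation, the frequency-based sentence selection that the abstract above takes as its point of comparison can be sketched in a few lines of Python. This is not the authors' Bayesian method and does not map text to UMLS concepts; as a loudly stated assumption, lowercase word tokens longer than three characters stand in for concepts, and the function names are hypothetical.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def concepts(sentence):
    # Assumption: word tokens longer than 3 characters stand in for concepts.
    return [t for t in re.findall(r"[a-z]+", sentence.lower()) if len(t) > 3]


def summarize_by_frequency(text, k=1):
    """Return the k sentences with the highest mean concept frequency,
    in their original document order."""
    sentences = split_sentences(text)
    freq = Counter(c for s in sentences for c in concepts(s))

    def score(sentence):
        distinct = set(concepts(sentence))
        if not distinct:
            return 0.0
        return sum(freq[c] for c in distinct) / len(distinct)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]


if __name__ == "__main__":
    doc = ("Aspirin reduces fever. "
           "Aspirin also reduces inflammation and fever. "
           "The weather was mild.")
    print(summarize_by_frequency(doc, k=1))
```

The abstract's argument is that replacing the raw-frequency score with better-chosen concept features can improve on this kind of baseline.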
Knowledge Discovery in Literature Data Bases
NASA Astrophysics Data System (ADS)
Albrecht, Rudolf; Merkl, Dieter
The concept of knowledge discovery as defined through "establishing previously unknown and unsuspected relations of features in a data base" is, cum grano salis, relatively easy to implement for data bases containing numerical data. Increasingly we find at our disposal data bases containing scientific literature. Computer assisted detection of unknown relations of features in such data bases would be extremely valuable and would lead to new scientific insights. However, the current representation of scientific knowledge in such data bases is not conducive to computer processing. Any correlation of features still has to be done by the human reader, a process which is plagued by ineffectiveness and incompleteness. On the other hand we note that considerable progress is being made in an area where reading all available material is totally prohibitive: the World Wide Web. Robots and Web crawlers mine the Web continuously and construct data bases which allow the identification of pages of interest in near real time. An obvious step is to categorize and classify the documents in the text data base. This can be used to identify papers worth reading, or which are of unexpected cross-relevance. We show the results of first experiments using unsupervised classification based on neural networks.
The Sciences: An Integrated Approach, 2nd Edition (by James Trefil and Robert M. Hazen)
NASA Astrophysics Data System (ADS)
Hoffman, Reviewed By Megan M.
2000-01-01
"You're going to teach the organic chemistry section of the Natural Science class?" - one of my biology colleagues asked me last semester - "Better you than me!" "You are?" added a chemistry professor, with interest. Yet these same people ardently believe that all our students should have a basic understanding of carbon's remarkable bonding capabilities and how they relate to life on Earth. If our art or economics majors can learn about organic chemistry and genetics and astronomy, our faculty should be able to teach those same topics, regardless of their acknowledged specialties. The basis of a scientifically literate society is not expertise in specific arcane subfields of science. Scientific literacy is a general understanding of what science is, what science can and cannot do, and what scientific accomplishments have occurred over the centuries. If you subscribe to this definition of scientific literacy, James Trefil and Robert M. Hazen's The Sciences: An Integrated Approach can help you and your general science students. The self-avowed purpose of this text is to address science illiteracy in America. Trefil and Hazen propose that the best way to combat scientific illiteracy is to provide integrated science courses that focus on a broad understanding of science, rather than the specialized knowledge available to a science major. The new edition of The Sciences has been influenced by the 1996 publication of the National Research Council's National Science Education Standards. While the first edition of Trefil and Hazen's book admirably addressed the integration of the natural and physical sciences, in this second edition, the authors have increased the connections between science and real-world situations and have made a more conscious effort to emphasize the process of science and the overlapping nature of scientific disciplines. The text is based on 25 "scientific concepts", one per chapter. 
These concepts are clearly explained in relatively jargon-free language and are then tied explicitly to familiar situations and life experiences. For instance, a power outage at a baseball game helps set the scene for quantum mechanics and Heisenberg's uncertainty principle, while jump-starting a car illustrates the conversion of energy from potential through kinetic to chemical. Most of the fine pedagogical features of the first edition have been continued, including descriptions of relevant technologies, historical aspects of various discoveries, and clear descriptions of mathematical approaches to the topics. The second edition of The Sciences has increased the accessibility of science and scientific concepts by adding several new features to the successful features of the first edition: "The Ongoing Process of Science" addresses current scientific questions; "Stop and Think" encourages students to consider further implications of the topic at hand; and "Science News" provides excerpts from the periodical of the same name. In addition, previous features that highlighted connections to human physiology have been broadened to include all living things, thus allowing students to make connections between the familiar and the more abstract, for instance magnetic navigation in birds (Electricity and Magnetism), upright human posture (Plate Tectonics) and blood clotting (The Chemical Bond). A final addition to each chapter is "Great Ideas Across the Sciences", which ties the Great Idea on which the chapter is based to each of the natural sciences. This latter addition is one that students might easily overlook, but it has great potential for opening class discussion on how, for instance, the science of entropy relates to weather, arthritis, volcanoes, and gasoline use (Chapter 4). Trefil and Hazen offer a basis for understanding physics, chemistry, biology, earth science, and cosmology. 
While the text and figures provide a basic description of these topics, this book will not produce physicists, chemists, etc. Keep the general-science purpose of the text in mind when you begin to feel that the chapters on your favorite topic are leaving out details or ideas that you consider crucial to scientific literacy in your area. My first impression of the chapter on Classical and Modern Genetics was that it did not spend enough time on Mendel and his foundational contributions to biology. Consequently, I went well beyond the text material in my lecture on Mendelian genetics. To my regret, I learned that this extra, "crucial" material was more intimidating than enlightening. While there are sure to be critics who will wish that certain topics were covered in more depth or who will want topics added or deleted, my conclusion after teaching from this book is that Trefil and Hazen have provided a clear, well-considered, and extremely useful text for a general science course.
Science Is Women's Work: Photos and Biographies of American Women in the Sciences.
ERIC Educational Resources Information Center
Gallop, Nancy
Girls show an early interest in science, but are deterred from pursuing science careers as they get older due to society's stereotypes. This text identifies the many women in history who have made significant contributions to all scientific fields. The volume features the biographies of Maria Mitchell (Astronomer); Ellen Richards (Chemist,…
Choo, Jae-Uk
2014-12-01
As the sciences advanced rapidly in the modern European world, outstanding achievements have been made in medicine, chemistry, biology, physiology, physics and others, which have been co-influencing each of the scientific disciplines. Accordingly, such medical and scientific phenomena began to be reflected in novels. In particular, Mary Shelley's Frankenstein includes the diverse aspects of the change and development in medicine and science. Drawing on the medical and scientific information reflected in Frankenstein and on Frankenstein's experiments in the text, this research investigates the aspects of medical and scientific development taking place in the nineteenth century in three ways. First, the medical and scientific development of the nineteenth century has been reviewed by summarizing both the information on alchemy in which Frankenstein shows his interest and the new science in general that M. Waldman introduces in the text. Second, the actual features of medical and scientific development have been examined through some examples of the experimental methods that M. Waldman implicitly uttered to Frankenstein. Third, it has been checked how the medical and scientific development is related to the main issues of mechanism and vitalism which can be explained as principles of life. Even though this research deals with the developmental process of medicine & science and origin & principles of life implied in Mary Shelley's Frankenstein, its significance is that it is interdisciplinary research focusing on how deeply the medical and scientific discourse of Mary Shelley's period has been embedded in the nineteenth century novel.
Featured Article: Genotation: Actionable knowledge for the scientific reader
Willis, Ethan; Sakauye, Mark; Jose, Rony; Chen, Hao; Davis, Robert L
2016-01-01
We present an article viewer application that allows a scientific reader to easily discover and share knowledge by linking genomics-related concepts to knowledge of disparate biomedical databases. High-throughput data streams generated by technical advancements have contributed to scientific knowledge discovery at an unprecedented rate. Biomedical Informaticists have created a diverse set of databases to store and retrieve the discovered knowledge. The diversity and abundance of such resources present biomedical researchers a challenge with knowledge discovery. These challenges highlight a need for a better informatics solution. We use a text mining algorithm, Genomine, to identify gene symbols from the text of a journal article. The identified symbols are supplemented with information from the GenoDB knowledgebase. Self-updating GenoDB contains information from NCBI Gene, Clinvar, Medgen, dbSNP, KEGG, PharmGKB, Uniprot, and Hugo Gene databases. The journal viewer is a web application accessible via a web browser. The features described herein are accessible on www.genotation.org. The Genomine algorithm identifies gene symbols with an accuracy shown by .65 F-Score. GenoDB currently contains information regarding 59,905 gene symbols, 5633 drug–gene relationships, 5981 gene–disease relationships, and 713 pathways. This application provides scientific readers with actionable knowledge related to concepts of a manuscript. The reader will be able to save and share supplements to be visualized in a graphical manner. This provides convenient access to details of complex biological phenomena, enabling biomedical researchers to generate novel hypotheses to further our knowledge in human health. This manuscript presents a novel application that integrates genomic, proteomic, and pharmacogenomic information to supplement content of a biomedical manuscript and enable readers to automatically discover actionable knowledge. PMID:26900164
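The abstract above reports Genomine's accuracy as a .65 F-Score without giving the underlying precision, recall, or F-score variant. As a reading aid only, the following Python sketch shows how a balanced F-score (F1, assumed here) is computed from counts of true positives, false positives and false negatives. The function name and the counts in the usage example are hypothetical and are not results from the article.

```python
def f_score(tp, fp, fn, beta=1.0):
    """F-beta score from true positives, false positives and false negatives.

    beta=1.0 gives the balanced F1 score (harmonic mean of precision and recall).
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Hypothetical counts: 13 gene symbols found correctly,
    # 7 spurious hits, 7 symbols missed.
    print(round(f_score(tp=13, fp=7, fn=7), 2))
```

With equal precision and recall, as in these made-up counts, the F1 score equals both of them.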
Featured Article: Genotation: Actionable knowledge for the scientific reader.
Nagahawatte, Panduka; Willis, Ethan; Sakauye, Mark; Jose, Rony; Chen, Hao; Davis, Robert L
2016-06-01
We present an article viewer application that allows a scientific reader to easily discover and share knowledge by linking genomics-related concepts to knowledge of disparate biomedical databases. High-throughput data streams generated by technical advancements have contributed to scientific knowledge discovery at an unprecedented rate. Biomedical Informaticists have created a diverse set of databases to store and retrieve the discovered knowledge. The diversity and abundance of such resources present biomedical researchers a challenge with knowledge discovery. These challenges highlight a need for a better informatics solution. We use a text mining algorithm, Genomine, to identify gene symbols from the text of a journal article. The identified symbols are supplemented with information from the GenoDB knowledgebase. Self-updating GenoDB contains information from NCBI Gene, Clinvar, Medgen, dbSNP, KEGG, PharmGKB, Uniprot, and Hugo Gene databases. The journal viewer is a web application accessible via a web browser. The features described herein are accessible on www.genotation.org. The Genomine algorithm identifies gene symbols with an accuracy shown by .65 F-Score. GenoDB currently contains information regarding 59,905 gene symbols, 5633 drug-gene relationships, 5981 gene-disease relationships, and 713 pathways. This application provides scientific readers with actionable knowledge related to concepts of a manuscript. The reader will be able to save and share supplements to be visualized in a graphical manner. This provides convenient access to details of complex biological phenomena, enabling biomedical researchers to generate novel hypotheses to further our knowledge in human health. This manuscript presents a novel application that integrates genomic, proteomic, and pharmacogenomic information to supplement content of a biomedical manuscript and enable readers to automatically discover actionable knowledge.
© 2016 by the Society for Experimental Biology and Medicine.
Biological network extraction from scientific literature: state of the art and challenges.
Li, Chen; Liakata, Maria; Rebholz-Schuhmann, Dietrich
2014-09-01
Networks of molecular interactions explain complex biological processes, and all known information on molecular events is contained in a number of public repositories including the scientific literature. Metabolic and signalling pathways are often viewed separately, even though both types are composed of interactions involving proteins and other chemical entities. It is necessary to be able to combine data from all available resources to judge the functionality, complexity and completeness of any given network overall, but especially the full integration of relevant information from the scientific literature is still an ongoing and complex task. Currently, the text-mining research community is steadily moving towards processing the full body of the scientific literature by making use of rich linguistic features such as full text parsing, to extract biological interactions. The next step will be to combine these with information from scientific databases to support hypothesis generation for the discovery of new knowledge and the extension of biological networks. The generation of comprehensive networks requires technologies such as entity grounding, coordination resolution and co-reference resolution, which are not fully solved and are required to further improve the quality of results. Here, we analyse the state of the art for the extraction of network information from the scientific literature and the evaluation of extraction methods against reference corpora, discuss challenges involved and identify directions for future research. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Reading for meaning: The foundational knowledge every teacher of science should have
NASA Astrophysics Data System (ADS)
Patterson, Alexis; Roman, Diego; Friend, Michelle; Osborne, Jonathan; Donovan, Brian
2018-02-01
Reading is fundamental to science and not an adjunct to its practice. In other words, understanding the meaning of the various forms of written discourse employed in the creation, discussion, and communication of scientific knowledge is inherent to how science works. The language used in science, however, sets up a barrier that, in order to be overcome, requires all students to have a clear understanding of the features of the multimodal informational texts employed in science and the strategies they can use to decode the scientific concepts communicated in informational texts. We argue that all teachers of science must develop a functional understanding of reading comprehension as part of their professional knowledge and skill. After describing our rationale for including knowledge about reading as a professional knowledge base every teacher of science should have, we outline the knowledge about language teachers must develop, the knowledge about the challenges that reading comprehension of science texts poses for students, and the knowledge about instructional strategies science teachers should know to support their students' reading comprehension of science texts. Implications regarding the essential role that knowledge about reading should play in the preparation of science teachers are also discussed here.
NASA Astrophysics Data System (ADS)
Esquinca, Alberto
This is a study of language use in the context of an inquiry-based science curriculum in which conceptual understanding ratings are used to split texts into groups of "successful" and "unsuccessful" texts. "Successful" texts could include known features of science language. 420 texts generated by students in 14 classrooms from three school districts, culled from a prior study on the effectiveness of science notebooks to assess understanding, in addition to the aforementioned ratings, are the data sources. In science notebooks, students write in the process of learning (here, a unit on electricity). The analytical framework is systemic functional linguistics (Halliday and Matthiessen, 2004; Eggins, 2004), specifically the concepts of genre, register and nominalization. Genre classification involves an analysis of the purpose and register features in the text (Schleppegrell, 2004). The use of features of the scientific academic register, namely the use of relational processes and nominalization (Halliday and Martin, 1993), requires transitivity analysis and noun analysis. Transitivity analysis, consisting of the identification of the process type, is conducted on 4737 ranking clauses. A manual count of each noun used in the corpus allows for a typology of nouns. Four school science genres, procedures, procedural recounts, reports and explanations, are found. Most texts (85.4%) are factual, and 14.1% are classified as explanations, the analytical genre. Logistic regression analysis indicates that there is no significant probability that the texts classified as explanation are placed in the group of "successful" texts. In addition, material process clauses predominate in the corpus, followed by relational process clauses. Results of a logistic regression analysis indicate that there is a significant probability (Chi square = 15.23, p < .0001) that texts with a high rate of relational processes are placed in the group of "successful" texts.
In addition, 59.5% of 6511 nouns are references to physical materials, followed by references to abstract concepts (35.54%). Only two of the concept nouns were found to be nominalized referents in definition model sentences. In sum, the corpus has recognizable genres and features science language, and relational processes are more prevalent in "successful" texts. However, the pervasive feature of science language, nominalization, is scarce.
Groves, Tamar; Figuerola, Carlos G; Quintanilla, Miguel Á
2016-08-01
This article presents our study of science coverage in the digital Spanish press over the last decade. We employed automated information retrieval procedures to create a corpus of 50,763 text units dealing with science and technology, and used automated text-analysis procedures in order to provide a general picture of the structure, characteristics and evolution of science news in Spain. We found that science accounted for between 6% and 7% of coverage, with a clearly high proportion of biomedicine and a predominance of science over technology, although we also detected an increase in technological content during the second half of the decade. Analysing the extrinsic and intrinsic features of science culture, we found a predominance of intrinsic features that still need further analysis. Our attempt to use specialised software to examine big data was effective, and allowed us to reach these preliminary conclusions. © The Author(s) 2015.
Temporality of Features in Near-Death Experience Narratives
Martial, Charlotte; Cassol, Héléna; Antonopoulos, Georgios; Charlier, Thomas; Heros, Julien; Donneau, Anne-Françoise; Charland-Verville, Vanessa; Laureys, Steven
2017-01-01
Background: After an occurrence of a Near-Death Experience (NDE), Near-Death Experiencers (NDErs) usually report extremely rich and detailed narratives. Phenomenologically, an NDE can be described as a set of distinguishable features. Some authors have proposed regular patterns of NDEs; however, the actual temporality sequence of NDE core features remains a little-explored area. Objectives: The aim of the present study was to investigate the frequency distribution of these features (globally and according to the position of features in narratives) as well as the most frequently reported temporality sequences of features. Methods: We collected 154 French freely expressed written NDE narratives (i.e., Greyson NDE scale total score ≥ 7/32). A text analysis was conducted on all narratives in order to infer temporal ordering and frequency distribution of NDE features. Results: Our analyses highlighted the following most frequently reported sequence of consecutive NDE features: Out-of-Body Experience, Experiencing a tunnel, Seeing a bright light, Feeling of peace. Yet, this sequence was encountered in a very limited number of NDErs. Conclusion: These findings may suggest that NDE temporality sequences can vary across NDErs. Exploring associations and relationships among features encountered during NDEs may complete the rigorous definition and scientific comprehension of the phenomenon. PMID:28659779
An Imagination Effect in Learning from Scientific Text
ERIC Educational Resources Information Center
Leopold, Claudia; Mayer, Richard E.
2015-01-01
Asking students to imagine the spatial arrangement of the elements in a scientific text constitutes a learning strategy intended to foster deep processing of the instructional material. Two experiments investigated the effects of mental imagery prompts on learning from scientific text. Students read a computer-based text on the human respiratory…
NASA Astrophysics Data System (ADS)
Aldahmash, Abdulwali H.; Mansour, Nasser S.; Alshamrani, Saeed M.; Almohi, Saeed
2016-12-01
This study examines Saudi Arabian middle school science textbooks' coverage of the essential features of scientific inquiry. All activities in the middle school science textbooks and workbooks were analyzed by using the scientific inquiry 'essential features' rubric. The results indicated that the essential features are included in about 59% of the analyzed science activities. However, feature 2, 'making learner give priority to evidence in responding to questions' and feature 3, 'allowing learner to formulate explanations from evidence' appeared more frequently than the other three features (feature 1: engaging learner in scientifically oriented questions, feature 4: helping learner connect explanations to scientific knowledge, and feature 5: helping learner communicate and justify explanations to others), whether in the activities as a whole, or in the activities included in each of the four science domains (physical science, Earth science, life science and chemistry). These features are represented in almost all activities. This means that almost all activities in the middle school science textbooks and the workbooks include features 2 and 3. Meanwhile, the mean level of inclusion of the five essential features of scientific inquiry found in the middle school science textbooks and workbooks as a whole is 2.55. However, results found for features 1, 4, 5 and for in-level inclusion of the inquiry features in each of the science domains indicate that the inclusion of the essential inquiry features is teacher-centred. As a result, neither science textbooks nor workbooks provide students with the opportunity or encouragement to develop their inquiry skills. Consequently, the results suggest important directions for educational administrators and policy-makers in the preparation and use of science educational content.
ERIC Educational Resources Information Center
Kim, Sangsoo; Park, Jongwon
2018-01-01
Observing scientific events or objects is a complex process that occurs through the interaction between the observer's knowledge or expectations, the surrounding context, physiological features of the human senses, scientific inquiry processes, and the use of observational instruments. Scientific observation has various features specific to this…
Transmutation of Matter in Byzantium: The Case of Michael Psellos, the Alchemist
NASA Astrophysics Data System (ADS)
Katsiampoura, Gianna
2008-06-01
There is thus nothing paradoxical about the inclusion of alchemy in the ensemble of the physical sciences nor in the preoccupation with it on the part of learned men engaged in scientific study. In the context of the Medieval model, where discourse on the physical world was ambiguous, often unclear, and lacking the support of experimental verification, the transmutation of matter, which was the subject of alchemy, even if not attended by a host of occult features, was a process that was thought to have a probable basis in reality. What is interesting in this connection is the utilization of the scientific categories of the day for discussion of transmutation of matter and the attempt, in most of the texts that survive, to avoid methods reminiscent of magic.
An Overview of Biomolecular Event Extraction from Scientific Documents
Vanegas, Jorge A.; Matos, Sérgio; González, Fabio; Oliveira, José L.
2015-01-01
This paper presents a review of state-of-the-art approaches to automatic extraction of biomolecular events from scientific texts. Events involving biomolecules such as genes, transcription factors, or enzymes, for example, have a central role in biological processes and functions and provide valuable information for describing physiological and pathogenesis mechanisms. Event extraction from biomedical literature has a broad range of applications, including support for information retrieval, knowledge summarization, and information extraction and discovery. However, automatic event extraction is a challenging task due to the ambiguity and diversity of natural language and higher-level linguistic phenomena, such as speculations and negations, which occur in biological texts and can lead to misunderstanding or incorrect interpretation. Many strategies have been proposed in the last decade, originating from different research areas such as natural language processing, machine learning, and statistics. This review summarizes the most representative approaches in biomolecular event extraction and presents an analysis of the current state of the art and of commonly used methods, features, and tools. Finally, current research trends and future perspectives are also discussed. PMID:26587051
[Recent developments on the scientific research in optometry and visual science in China].
Qu, Jia
2010-10-01
This article reviews scientific research in optometry and visual science in China over the past 5 to 6 years. It describes the advances and achievements in fundamental research on myopia and in applied research in visual science. It also analyzes the role of research in guiding the resolution of clinical vision problems and the significance of research for community eye-care services. Drawing on the current situation and on research results, the article indicates that the dual biological and optical nature of the eye gives research in ophthalmology and optometry its distinctive character, and that multidisciplinary cooperation has promoted innovation in optometry and visual research. In the future, optometry and visual science in China will face growing expectations for original and systematic research. The prevention and treatment of myopia will remain a long-term and difficult research theme in these fields.
Anatomical entity mention recognition at literature scale
Pyysalo, Sampo; Ananiadou, Sophia
2014-01-01
Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced. Results: We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of Unified Medical Language System (UMLS)- and Open Biomedical Ontologies (OBO)-based lexical resources, word representations induced from unlabeled text, statistical truecasing and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire open access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PubMed Central literature database. Availability and implementation: All tools and resources introduced in this work are available from http://nactem.ac.uk/anatomytagger. Contact: sophia.ananiadou@manchester.ac.uk Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:24162468
ERIC Educational Resources Information Center
Aldahmash, Abdulwali H.; Mansour, Nasser S.; Alshamrani, Saeed M.; Almohi, Saeed
2016-01-01
This study examines Saudi Arabian middle school science textbooks' coverage of the essential features of scientific inquiry. All activities in the middle school science textbooks and workbooks were analyzed by using the scientific inquiry "essential features" rubric. The results indicated that the essential features are included in about…
Assessment of Burmese Refugee Students' Meaning Making of Scientific Informational Texts
ERIC Educational Resources Information Center
Croce, Keri-Anne
2014-01-01
This two and a half year study examines how non-native English-speaking Burmese refugee students from first to third grades made meaning of scientific informational texts. The study is framed by sociocultural theory and transactional theory. Primary data were drawn from 160 student retellings of scientific informational texts. Secondary data…
Flemming, Danny; Feinkohl, Insa; Cress, Ulrike; Kimmerle, Joachim
2015-01-01
We examined in two empirical studies how situational and personal aspects of uncertainty influence laypeople's understanding of the uncertainty of scientific information, with a focus on the detection of tentativeness and perception of scientific credibility. In the first study (N = 48), we investigated the impact of a perceived conflict due to contradicting information as a situational, text-inherent aspect of uncertainty. The aim of the second study (N = 61) was to explore the role of general self-efficacy as an intra-personal uncertainty factor. In Study 1, participants read one of two versions of an introductory text in a between-group design. This text provided them with an overview of the neurosurgical procedure of deep brain stimulation (DBS). The text expressed a positive attitude toward DBS in one experimental condition or focused on the negative aspects of this method in the other condition. Then participants in both conditions read the same text that dealt with a study about DBS as experimental treatment in a small sample of patients with major depression. Perceived conflict between the two texts was found to increase the perception of tentativeness and to decrease the perception of scientific credibility, implying that text-inherent aspects have significant effects on critical appraisal. The results of Study 2 demonstrated that participants with higher general self-efficacy detected the tentativeness to a lesser degree and assumed a higher level of scientific credibility, indicating a more naïve understanding of scientific information. This appears to be contradictory to large parts of previous findings that showed positive effects of high self-efficacy on learning. Both studies showed that perceived tentativeness and perceived scientific credibility of medical information contradicted each other.
We conclude that there is a need for supporting laypeople in understanding the uncertainty of scientific information and that scientific writers should consider how to present scientific results when compiling pertinent texts. PMID:26648902
Using clustering and a modified classification algorithm for automatic text summarization
NASA Astrophysics Data System (ADS)
Aries, Abdelkrime; Oufaida, Houda; Nouali, Omar
2013-01-01
In this paper we describe a modified classification method intended for extractive summarization. The classification in this method doesn't need a learning corpus; it uses the input text instead. First, we cluster the document sentences to exploit the diversity of topics, then we use a learning algorithm (here we used Naive Bayes) on each cluster, considering it as a class. After obtaining the classification model, we calculate the score of a sentence in each class, using a scoring model derived from the classification algorithm. These scores are then used to reorder the sentences and extract the first ones as the output summary. We conducted some experiments using a corpus of scientific papers, and we have compared our results to another summarization system called UNIS. We also examine the impact of clustering threshold tuning on the resulting summary, as well as the impact of adding more features to the classifier. We found that this method is interesting and gives good performance, and that the addition of new features (which is simple using this method) can improve the summary's accuracy.
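The pipeline described in this abstract (cluster the sentences, treat each cluster as a Naive Bayes class, score every sentence against every class, then extract the top-ranked sentences) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the greedy threshold clustering, the prior-weighted scoring formula, and all function names and default values are assumptions introduced here, since the abstract does not specify them.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase bag-of-words tokenizer (assumed; the paper's features are not given)."""
    return re.findall(r"[a-z]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity between two term-count Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cluster_sentences(bags, threshold):
    """Greedy single-pass clustering (an assumption): a sentence joins the first
    cluster whose accumulated term counts are similar enough, else starts a new one."""
    clusters = []  # each entry: (term counts of the cluster, member sentence indices)
    for i, bag in enumerate(bags):
        for centroid, members in clusters:
            if cosine(bag, centroid) >= threshold:
                centroid.update(bag)
                members.append(i)
                break
        else:
            clusters.append((Counter(bag), [i]))
    return clusters


def summarize(sentences, threshold=0.2, size=2):
    """Return the `size` best-scoring sentences, kept in document order."""
    bags = [Counter(tokenize(s)) for s in sentences]
    clusters = cluster_sentences(bags, threshold)
    vocab_size = len({t for bag in bags for t in bag})
    n = len(sentences)
    scores = [0.0] * n
    for centroid, members in clusters:
        prior = len(members) / n            # Naive Bayes class prior
        total = sum(centroid.values())      # number of tokens in this class
        for i, bag in enumerate(bags):
            length = sum(bag.values()) or 1
            # Multinomial Naive Bayes log-likelihood with add-one smoothing.
            loglik = sum(
                count * math.log((centroid[term] + 1) / (total + vocab_size))
                for term, count in bag.items()
            )
            # Hypothetical scoring model: prior-weighted, length-normalised likelihood.
            scores[i] += prior * math.exp(loglik / length)
    ranked = sorted(range(n), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:size])]
```

Raising `threshold` produces more, smaller clusters, which is one way to experiment with the clustering-threshold tuning the abstract mentions; adding features would mean extending `tokenize` beyond plain words.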
NASA Astrophysics Data System (ADS)
Shea, Nicole Anne
Science curriculum is often used as a means to train students as future scientists with less emphasis placed on preparing students to reason about issues they may encounter in their daily lives (Feinstein, Allen, & Jenkins, 2013; Roth & Barton, 2004). The general public is required to think scientifically to some degree throughout their life and often across a variety of issues. From an empirical standpoint, we do not have a robust understanding of what scientific knowledge the public finds useful for reasoning about socio-scientific issues in their everyday lives (Feinstein, 2011). We also know very little about how the situational features of an issue influence reasoning strategy (i.e., the use of knowledge to generate arguments). Rapid advances in science, particularly in genetics, increasingly challenge the public to reason about socio-scientific issues. This raises questions about the public's ability to participate knowledgeably in socio-scientific debates, and to provide informed consent for a variety of novel scientific procedures. This dissertation aims to answer the question: How do individuals use their genetic content knowledge to reason about authentic issues they may encounter in their daily lives? Individuals' scientific knowledge is a critical aspect of scientific literacy, but what scientific literacy looks like in practice as individuals use their content knowledge to reason about issues comprised of different situational features is still unclear. The purpose of this dissertation is to explore what knowledge is actually used by individuals to generate and support arguments about a variety of socio-scientific issues, and how the features of those issues influence reasoning strategy. Three studies were conducted to answer questions reflecting this purpose. Findings from this dissertation provide important insights into what scientific literacy looks like in practice.
Political astronomy: Comet and meteor observations by Muslim historians
NASA Astrophysics Data System (ADS)
Chander Kapoor, Ramesh
2015-08-01
Eclipses and unexpected phenomena like comets, meteors, novae and earthquakes were viewed among various cultures as violating the established order of the heavens. They were considered to be ill omens for kings and emperors and were routinely monitored. The present work looks into the texts of history and literature by Muslim historians and chroniclers in West Asia and India that carry stray references to such phenomena. The accounts often relate the apparitions to specific disastrous events or prognosticate revolts, deaths, epidemics and earthquakes, all of which took place in later times. Obviously, the occurrences interested the astrologers more. Comet appearances would last for days and weeks but nearly all the writings lack sequential observations. Meteor showers are annual features, but the Islamic calendar, being lunar, would not easily lead one to notice the periodic nature of the incidents, let alone sense a periodicity in comet appearances. These are non-astronomy texts with little scientific content but, being from different ages, they permit us to see how astronomical perceptions changed over time. The recorded details and firm chronology, tested against modern back calculations, can provide valuable information on them, keeping in mind the text and the context in which the original reference was made. We also notice a qualitative change in the Indian writings of the 18th century and later, where the authors begin to show the influence of exposure to European scientific progress.
ERIC Educational Resources Information Center
Gilmanshina, Suriya I.; Gilmanshin, Iskander R.; Sagitova, Rimma N.; Galeeva, Asiya I.
2016-01-01
The aim of this article is to disclose features of scientific explanation in teaching of chemistry in the environment of new information of school students' developmental education. The leading approach to the study of this problem is the information and environmental approach that comprehensively addresses the problem of scientific explanation in…
Citation Sentiment Analysis in Clinical Trial Papers
Xu, Jun; Zhang, Yaoyun; Wu, Yonghui; Wang, Jingqi; Dong, Xiao; Xu, Hua
2015-01-01
In scientific writing, positive credits and negative criticisms can often be seen in the text mentioning the cited papers, providing useful information about whether a study can be reproduced or not. In this study, we focus on citation sentiment analysis, which aims to determine the sentiment polarity that the citation context carries towards the cited paper. A citation sentiment corpus was annotated first on clinical trial papers. The effectiveness of n-gram and sentiment lexicon features, and of problem-specified structure features, for citation sentiment analysis was then examined using the annotated corpus. The combined features from the word n-grams, the sentiment lexicons and the structure information achieved the highest Micro F-score of 0.860 and Macro-F score of 0.719, indicating that it is feasible to use machine learning methods for citation sentiment analysis in biomedical publications. A comprehensive comparison between citation sentiment analysis of clinical trial papers and other general domains was conducted, which additionally highlights the unique challenges within this domain. PMID:26958274
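The gap between the micro and macro F-scores reported above is typical when one class dominates. As a hedged illustration of how the two averages are commonly computed for single-label classification, the sketch below uses hypothetical labels (`pos`, `neu`, `neg`) and a made-up toy sample; neither the label set nor the data comes from the study.

```python
from collections import Counter


def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when there is nothing to score."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def micro_macro_f1(gold, predicted):
    """Return (micro-F1, macro-F1) for single-label predictions."""
    labels = sorted(set(gold) | set(predicted))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[g] += 1  # gold class gains a false negative
    # Micro: pool the counts over all classes, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(tp[label], fp[label], fn[label]) for label in labels) / len(labels)
    return micro, macro


# Toy, invented example: a frequent neutral class and rare polar classes.
gold = ["pos", "pos", "neu", "neu", "neu", "neu", "neg", "neu"]
pred = ["pos", "neu", "neu", "neu", "neu", "neu", "neu", "neu"]
micro, macro = micro_macro_f1(gold, pred)
```

Because the macro average weights every class equally, a class that is never predicted correctly (here `neg`) pulls it well below the micro average, which is dominated by the frequent class.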
Neurofibromatosis: part 2--clinical management.
Batista, Pollyanna Barros; Bertollo, Eny Maria Goloni; Costa, Danielle de Souza; Eliam, Lucas; Cunha, Karin Soares Gonçalves; Cunha-Melo, José Renan; Darrigo Junior, Luiz Guilherme; Geller, Mauro; Gianordoli-Nascimento, Ingrid Faria; Madeira, Luciana Gonçalves; Mendes, Hérika Martins; Miranda, Débora Marques de; Mata-Machado, Nikolas Andre; Morato, Eric Grossi; Pavarino, Érika Cristina; Pereira, Luciana Baptista; Rezende, Nilton Alves de; Rodrigues, Luíza de Oliveira; Sette, Jorge Bezerra Cavalcanti
2015-06-01
Part 1 of this guideline addressed the differential diagnosis of the neurofibromatoses (NF): neurofibromatosis type 1 (NF1), neurofibromatosis type 2 (NF2) and schwannomatosis (SCH). NF shares some features such as the genetic origin of the neural tumors and cutaneous manifestations, and affects nearly 80 thousand Brazilians. Increasing scientific knowledge on NF has allowed better clinical management and reduced rate of complications and morbidity, resulting in higher quality of life for NF patients. Most medical doctors are able to perform NF diagnosis, but the wide range of clinical manifestations and the inability to predict the onset or severity of new features, consequences, or complications make NF management a real clinical challenge, requiring the support of different specialists for proper treatment and genetic counseling, especially in NF2 and SCH. The present text suggests guidelines for the clinical management of NF, with emphasis on NF1.
ERIC Educational Resources Information Center
Braun, Isabel; Nuckles, Matthias
2014-01-01
Scholarly scientific literature conveys epistemological assumptions scientists operate on. Popular scientific literature and instructional science texts deviate in their portrayal of science from these epistemological assumptions. Thus, scholarly scientific literature holds more potential for improving students' epistemological understanding…
Object-Oriented Scientific Programming with Fortran 90
NASA Technical Reports Server (NTRS)
Norton, C.
1998-01-01
Fortran 90 is a modern language that introduces many important new features beneficial for scientific programming. We discuss our experiences in plasma particle simulation and unstructured adaptive mesh refinement on supercomputers, illustrating the features of Fortran 90 that support the object-oriented methodology.
NASA Astrophysics Data System (ADS)
Wong, Siu Ling; Kwan, Jenny; Hodson, Derek; Yung, Benny Hin Wai
2009-01-01
Interviews with key scientists who had conducted research on Severe Acute Respiratory Syndrome (SARS), together with analysis of media reports, documentaries and other literature published during and after the SARS epidemic, revealed many interesting aspects of the nature of science (NOS) and scientific inquiry in contemporary scientific research in the rapidly growing field of molecular biology. The story of SARS illustrates vividly some NOS features advocated in the school science curriculum, including the tentative nature of scientific knowledge, theory-laden observation and interpretation, multiplicity of approaches adopted in scientific inquiry, the inter-relationship between science and technology, and the nexus of science, politics, social and cultural practices. The story also provided some insights into a number of NOS features less emphasised in the school curriculum, for example the need to combine and coordinate expertise in a number of scientific fields, the intense competition between research groups (suspended during the SARS crisis), the significance of affective issues relating to intellectual honesty and the courage to challenge authority, and the pressure of funding issues on the conduct of research and the ‘peace of mind’ of researchers. These less emphasised elements provided empirical evidence that NOS knowledge, like scientific knowledge itself, changes over time. They reflected the need for teachers and curriculum planners to revisit and reconsider whether the features of NOS currently included in the school science curriculum are fully reflective of the practice of science in the 21st century. In this paper, we also report on how we made use of extracts from the news reports and documentaries on SARS, together with episodes from the scientists’ interviews, to develop a multimedia instructional package for explicitly teaching the prominent features of NOS and scientific inquiry identified in the SARS research.
Recommendation on the Status of Scientific Researchers
ERIC Educational Resources Information Center
Reynolds, Paul Davidson
1975-01-01
This article contains the verbatim text of the Recommendation on the Status of Scientific Researchers adopted by the General Conference of Unesco at its eighteenth session in 1974. Also listed are international instruments and other texts concerning workers in general or scientific researchers in particular. For journal availability see SO 504…
Synoptic reporting in tumor pathology: advantages of a web-based system.
Qu, Zhenhong; Ninan, Shibu; Almosa, Ahmed; Chang, K G; Kuruvilla, Supriya; Nguyen, Nghia
2007-06-01
The American College of Surgeons Commission on Cancer (ACS-CoC) mandates that pathology reports at ACS-CoC-approved cancer programs include all scientifically validated data elements for each site and tumor specimen. The College of American Pathologists (CAP) has produced cancer checklists in static text formats to assist reporting. To be inclusive, the CAP checklists are pages long, requiring extensive text editing and multiple intermediate steps. We created a set of dynamic tumor-reporting templates, using Microsoft Active Server Page (ASP.NET), with drop-down list and data-compile features, and added a reminder function to indicate missing information. Users can access this system on the Internet, prepare the tumor report by selecting relevant data from drop-down lists with an embedded tumor staging scheme, and directly transfer the final report into a laboratory information system by using the copy-and-paste function. By minimizing extensive text editing and eliminating intermediate steps, this system can reduce reporting errors, improve work efficiency, and increase compliance.
MSL: Facilitating automatic and physical analysis of published scientific literature in PDF format.
Ahmed, Zeeshan; Dandekar, Thomas
2015-01-01
Published scientific literature contains millions of figures, including information about the results obtained from different scientific experiments, e.g. PCR-ELISA data, microarray analysis, gel electrophoresis, mass spectrometry data, DNA/RNA sequencing, diagnostic imaging (CT/MRI and ultrasound scans), and medicinal imaging like electroencephalography (EEG), magnetoencephalography (MEG), echocardiography (ECG), positron-emission tomography (PET) images. The importance of biomedical figures has been widely recognized in the scientific and medical communities, as they play a vital role in providing major original data and experimental and computational results in concise form. One major challenge for implementing a system for scientific literature analysis is extracting and analyzing text and figures from published PDF files by physical and logical document analysis. Here we present a product line architecture based bioinformatics tool 'Mining Scientific Literature (MSL)', which supports the extraction of text and images by interpreting all kinds of published PDF files using advanced data mining and image processing techniques. It provides modules for the marginalization of extracted text based on different coordinates and keywords, visualization of extracted figures and extraction of embedded text from all kinds of biological and biomedical figures using applied Optical Character Recognition (OCR). Moreover, for further analysis and usage, it generates the system's output in different formats including text, PDF, XML and image files. Hence, MSL is an easy to install and use analysis tool to interpret published scientific literature in PDF format.
Forensic scientists' conclusions: how readable are they for non-scientist report-users?
Howes, Loene M; Kirkbride, K Paul; Kelty, Sally F; Julian, Roberta; Kemp, Nenagh
2013-09-10
Scientists have an ethical responsibility to assist non-scientists to understand their findings and expert opinions before they are used as decision-aids within the criminal justice system. The communication of scientific expert opinion to non-scientist audiences (e.g., police, lawyers, and judges) through expert reports is an important but under-researched issue. Readability statistics were used to assess 111 conclusions from a proficiency test in forensic glass analysis. The conclusions were written using an average of 23 words per sentence, and approximately half of the conclusions were expressed using the active voice. At an average Flesch-Kincaid Grade level of university undergraduate (Grade 13), and Flesch Reading Ease score of difficult (42), the conclusions were written at a level suitable for people with some tertiary education in science, suggesting that the intended non-scientist readers would find them difficult to read. To further analyse the readability of conclusions, descriptive features of text were used: text structure; sentence structure; vocabulary; elaboration; and coherence and unity. Descriptive analysis supported the finding that texts were written at a level difficult for non-scientists to read. Specific aspects of conclusions that may pose difficulties for non-scientists were located. Suggestions are included to assist scientists to write conclusions with increased readability for non-scientist readers, while retaining scientific integrity. In the next stage of research, the readability of expert reports in their entirety is to be explored. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
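For reference, the two readability statistics cited in the abstract above follow standard published formulas. The sketch below is a minimal illustration, not the authors' tooling: it assumes the word, sentence, and syllable counts are already available (syllable counting is itself heuristic and is not shown), and the function name is hypothetical.

```python
def flesch_scores(words: int, sentences: int, syllables: int) -> tuple[float, float]:
    """Return (Flesch Reading Ease, Flesch-Kincaid Grade Level) from raw counts."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    words_per_sentence = words / sentences
    syllables_per_word = syllables / words
    # Higher Reading Ease means easier text; higher grade level means harder text.
    reading_ease = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    grade_level = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return reading_ease, grade_level
```

Both scores depend only on average sentence length and average syllables per word, which is why long sentences such as the 23-word average reported above push a text toward the "difficult" range.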
Westergaard, David; Stærfeldt, Hans-Henrik; Tønsberg, Christian; Jensen, Lars Juhl; Brunak, Søren
2018-02-01
Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823–2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein–protein, disease–gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only. PMID:29447159
Storyboards and Science: Introducing the Planetary Data Storyboard
NASA Astrophysics Data System (ADS)
King, T. A.; Del Villar, A.; Alkhawaja, A.; Grayzeck, E. J.; Galica, C.; Odess, J.; Erickson, K. J.
2015-12-01
Every discovery has a story and storytelling is an ancient form of education. The stories of scientific discovery are often very formal and technical and not always very accessible. As in the past, today most scientific storytelling is done as in-person presentations in the form of slide shows or movies that unfold according to the design of their author. Things have changed. Using today's technologies, telling stories can be a rich multi-media experience with a blending of text, animations, movies and infographics. Also, with presentations on the web, the presentation can provide links to more details and the audience (reader) can jump to the linked information. Even so, the most common form of today's storytelling is a narrative that starts with a page, a link to a single movie or a slide show. We introduce a promising new form of scientific storytelling, the storyboard. With a storyboard, a story is presented as a set of panels that contain representative images of an event and may have associated notes or instructions. The panels are arranged in a timeline that allows the audience to experience the discovery in the same way it occurred. A panel can also link to a more detailed source such as a publication, the data that was collected or items derived from the research (like movies or animations). Scientific storyboards can make scientific discovery more accessible to people by presenting events in an easy-to-follow layout. Scientific storyboards can also help to teach the scientific method by following the experiences of a researcher as they investigate a phenomenon or try to understand a new set of observations. We illustrate the unique features of scientific storyboards with the Planetary Data Storyboard using data archived by the Planetary Data System.
NASA Astrophysics Data System (ADS)
Teixeira, Carlos; Paulo, Gallo; Nogueira, Maria Inês
2015-04-01
Communication's Purpose: To identify the artistic expression that uses the language of cartoons and comics for public communication, with Earth education for better planetary sustainability as its reference. Object/Theme: Cartoons and comics published in newspapers on five continents and made available online. Theoretical: This study rests on the assumption that the public communication of science through cartoons and comics constitutes a textual genre, because they present complex scientific themes in playful language, using humor and artistic traits accessible to the lay public. Scientific cartoons and comics aim to draw public attention to scientific discoveries and science themes using illustrative graphic features and short texts, both set in a humorous structure. Cartoons and comics created for the public communication of science take an unintentionally pedagogical/formal approach, transmitting information in an unpretentious way through graphic and artistic communication. Because knowledge is conveyed in this specific format of communication, scientific cartoons and comics can contribute to the scientific empowerment of society, in addition to being instruments that can arouse scientific curiosity. Scientific cartoons and comics use objective language and short sentences, and also employ words that may have a double meaning. This can be considered an incentive for people's reflection. Method: Cartoons and comics published in newspapers on five continents and made available online, in English, Portuguese and Spanish, were analyzed. Keywords: science communication; public communication of science and technology; cartoons; comics
Tachycardia detection in ICDs by Boston Scientific: Algorithms, pearls, and pitfalls.
Zanker, Norbert; Schuster, Diane; Gilkerson, James; Stein, Kenneth
2016-09-01
The aim of this study was to summarize how implantable cardioverter defibrillators (ICDs) by Boston Scientific sense, detect, discriminate rhythms, and classify episodes. Modern devices include multiple programming selections, diagnostic features, therapy options, memory functions, and device-related history features. Device operation includes logical steps from sensing, detection, discrimination, therapy delivery to history recording. The program is designed to facilitate the application of the device algorithms to the individual patient's clinical needs. Features and functions described in this article represent a selective excerpt by the authors from Boston Scientific publicly available product resources. Programming of ICDs may affect patient outcomes. Patient-adapted and optimized programming requires understanding of device operation and concepts.
Endnote Web tutorial for BJCVS/RBCCV
de Oliveira, Marcos Aurélio Barboza; dos Santos, Carlos Alberto; Brandi, Antônio Carlos; Botelho, Paulo Henrique Husseini; Sciarra, Adília Maria Pires; Braile, Domingo Marcolino
2015-01-01
At present, many useful tools for reference management are available. They can be either offline software or websites accessible to all users on the internet. Their aim is to facilitate the production of scientific text. To accomplish that, however, the required bibliographic style must be effectively supported, and the program has to be free. In this tutorial, we present Endnote Web®, a bibliographic reference management program that meets these two requirements: it contains the Brazilian Journal of Cardiovascular Surgery reference format, and its use is free of charge after sign-in from an IP-registered terminal in Web of Science®. PMID:26107457
Science Teaching as Educational Interrogation of Scientific Research
ERIC Educational Resources Information Center
Ginev, Dimitri
2013-01-01
The main argument of this article is that science teaching based on a pedagogy of questions is to be modeled on a hermeneutic conception of scientific research as a process of the constitution of texts. This process is spelled out in terms of hermeneutic phenomenology. A text constituted by scientific practices is at once united by a hermeneutic…
Exploring access to scientific literature using content-based image retrieval
NASA Astrophysics Data System (ADS)
Deserno, Thomas M.; Antani, Sameer; Long, Rodney
2007-03-01
The number of articles published in the scientific medical literature is continuously increasing, and Web access to the journals is becoming common. Databases such as SPIE Digital Library, IEEE Xplore, indices such as PubMed, and search engines such as Google provide the user with sophisticated full-text search capabilities. However, information in images and graphs within these articles is entirely disregarded. In this paper, we quantify the potential impact of using content-based image retrieval (CBIR) to access this non-text data. Based on the Journal Citations Report (JCR), the journal Radiology was selected for this study. In 2005, 734 articles were published electronically in this journal. This included 2,587 figures, which yields a rate of 3.52 figures per article. Furthermore, 56.4% of these figures are composed of several individual panels, i.e. the figure combines different images and/or graphs. According to the Image Cross-Language Evaluation Forum (ImageCLEF), the error rate of automatic identification of medical images is about 15%. Therefore, it is expected that, by applying ImageCLEF-like techniques, already 95.5% of articles could be retrieved by means of CBIR. The challenge for CBIR in scientific literature, however, is the use of local texture properties to analyze individual image panels in composite illustrations. Using local features for content-based image representation, 8.81 images per article are available, and the predicted correctness rate may increase to 98.3%. From this study, we conclude that CBIR may have a high impact in medical literature research and suggest that additional research in this area is warranted.
Ghost Hunting as a Means to Illustrate Scientific Methodology and Enhance Critical Thinking
ERIC Educational Resources Information Center
Rockwell, Steven C.
2012-01-01
The increasing popularity of television shows featuring paranormal investigations has led to a renewed enthusiasm in ghost hunting activities, and belief in the paranormal in general. These shows typically feature a group of investigators who, while claiming to utilize proper scientifically correct methodologies, violate many core scientific…
MSL: Facilitating automatic and physical analysis of published scientific literature in PDF format
Ahmed, Zeeshan; Dandekar, Thomas
2018-01-01
Published scientific literature contains millions of figures, including information about the results obtained from different scientific experiments, e.g. PCR-ELISA data, microarray analysis, gel electrophoresis, mass spectrometry data, DNA/RNA sequencing, diagnostic imaging (CT/MRI and ultrasound scans), and medicinal imaging like electroencephalography (EEG), magnetoencephalography (MEG), echocardiography (ECG), positron-emission tomography (PET) images. The importance of biomedical figures has been widely recognized in the scientific and medical communities, as they play a vital role in providing major original data, experimental and computational results in concise form. One major challenge for implementing a system for scientific literature analysis is extracting and analyzing text and figures from published PDF files by physical and logical document analysis. Here we present a product line architecture based bioinformatics tool ‘Mining Scientific Literature (MSL)’, which supports the extraction of text and images by interpreting all kinds of published PDF files using advanced data mining and image processing techniques. It provides modules for the marginalization of extracted text based on different coordinates and keywords, visualization of extracted figures and extraction of embedded text from all kinds of biological and biomedical figures using applied Optical Character Recognition (OCR). Moreover, for further analysis and usage, it generates the system’s output in different formats including text, PDF, XML and image files. Hence, MSL is an easy to install and use analysis tool to interpret published scientific literature in PDF format. PMID:29721305
Climates of risk: a field analysis of global climate change in US media discourse, 1997-2004.
Sonnett, John
2010-11-01
How are industry and environmentalist discourses of climate risk related to dominant scientific and political discourses? This study operationalizes Bourdieu's concept of symbolic capital in order to map dimensions of risk description and prescription onto a journalistic field of industry, environmentalist, scientific, and political media. Results show that conventional definitions of risk mirror an opposition between scientific and political discourses. Prescriptions for action on risk are partly autonomous from definitions however. Environmentalist and scientific media feature more proactive discourse, and industry and political media feature more reactive discourse. Implications for future research on climate risk and relational studies of media discourse are discussed.
Guiding Students through Expository Text with Text Feature Walks
ERIC Educational Resources Information Center
Kelley, Michelle J.; Clausen-Grace, Nicki
2010-01-01
The Text Feature Walk is a structure created and employed by the authors that guides students in the reading of text features in order to access prior knowledge, make connections, and set a purpose for reading expository text. Results from a pilot study are described in order to illustrate the benefits of using the Text Feature Walk over…
Representation of scientific methodology in secondary science textbooks
NASA Astrophysics Data System (ADS)
Binns, Ian C.
The purpose of this investigation was to assess the representation of scientific methodology in secondary science textbooks. More specifically, this study looked at how textbooks introduced scientific methodology and to what degree the examples from the rest of the textbook, the investigations, and the images were consistent with the text's description of scientific methodology, if at all. The sample included eight secondary science textbooks from two publishers, McGraw-Hill/Glencoe and Harcourt/Holt, Rinehart & Winston. Data consisted of all student text and teacher text that referred to scientific methodology. Second, all investigations in the textbooks were analyzed. Finally, any images that depicted scientists working were also collected and analyzed. The text analysis and activity analysis used the ethnographic content analysis approach developed by Altheide (1996). The rubrics used for the text analysis and activity analysis were initially guided by the Benchmarks (AAAS, 1993), the NSES (NRC, 1996), and the nature of science literature. Preliminary analyses helped to refine each of the rubrics and grounded them in the data. Image analysis used stereotypes identified in the DAST literature. Findings indicated that all eight textbooks presented mixed views of scientific methodology in their initial descriptions. Five textbooks placed more emphasis on the traditional view and three placed more emphasis on the broad view. Results also revealed that the initial descriptions, examples, investigations, and images all emphasized the broad view for Glencoe Biology and the traditional view for Chemistry: Matter and Change. The initial descriptions, examples, investigations, and images in the other six textbooks were not consistent. Overall, the textbook with the most appropriate depiction of scientific methodology was Glencoe Biology and the textbook with the least appropriate depiction of scientific methodology was Physics: Principles and Problems. 
These findings suggest that compared to earlier investigations, textbooks have begun to improve in how they represent scientific methodology. However, there is still much room for improvement. Future research needs to consider how textbooks impact teachers' and students' understandings of scientific methodology.
Processing and Recall of Seductive Details in Scientific Text
ERIC Educational Resources Information Center
Lehman, Stephen; Schraw, Gregory; McCrudden, Matthew T.; Hartley, Kendall
2007-01-01
This study examined how seductive details affect on-line processing of a technical, scientific text. In Experiment 1, each sentence from the experimental text was rated for interest and importance. Participants rated seductive details as being more interesting but less important than main ideas. In Experiment 2, we examined the effect of seductive…
Text feature extraction based on deep learning: a review.
Liang, Hong; Sun, Xiao; Sun, Yunlei; Gao, Yuan
2017-01-01
Selection of text feature items is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. Hand-designing an effective feature is a lengthy process, whereas deep learning can acquire new, effective feature representations from training data for new applications. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data instead of adopting handcrafted features, which depend mainly on the prior knowledge of designers and are highly unlikely to take advantage of big data. Deep learning can automatically learn feature representations from big data, with models that include millions of parameters. This review first outlines the common methods used in text feature extraction, then expands on frequently used deep learning methods in text feature extraction and their applications, and finally forecasts the application of deep learning in feature extraction.
NASA Astrophysics Data System (ADS)
Sofronieva, Tzveta
2014-03-01
Many of the major figures in the history of science have produced literary works, but the relationship between their poetic texts and their scientific work is often underestimated. This paper illuminates the poetry of Erwin Schrödinger—one of the premier figures in twentieth-century science, and an accomplished poet in both English and his native German. It discusses existing perceptions of his poetry and challenges the assumptions that his poetic work was a mere hobby unrelated to his other achievements by focusing on the interplay between poetic images and scientific ideas in his German-language poems. It emphasizes that more research is needed on the understated role of bilingualism and of—often marginalized—writing in an adopted language in science and in poetry, with the premise that this feature of Schrödinger's life deserves more study. It argues that Schrödinger's literary imagination and his bilingualism are an integral part of his approach to reality and considers Schrödinger's literary work to be an important aspect of his intellectual heritage.
Major Strands in Scientific Inquiry through Cluster Analysis of Research Abstracts
ERIC Educational Resources Information Center
Yeh, Yi-Fen; Jen, Tsung-Hau; Hsu, Ying-Shao
2012-01-01
Scientific inquiry involves a variety of abilities scientists use to investigate the natural world. In order to develop students' scientific inquiry, researchers and educators have developed different curricula and a variety of instructional resources, which make features and descriptors of scientific inquiry in teaching and learning even more…
Using a Feature Film to Promote Scientific Enquiry
ERIC Educational Resources Information Center
Hadzigeorgiou, Yannis; Kodakos, Tassos; Garganourakis, Vassilios
2010-01-01
This article reports on an action research project undertaken with the primary aim of investigating the extent to which a feature film, whose plot included Tesla's demonstrations on the wireless transmission of electrical energy, can promote scientific enquiry. The class that participated in this project was an 11th grade class in a rural area of…
The Features of Peer Argumentation in Middle School Students' Scientific Inquiry
ERIC Educational Resources Information Center
Kim, Heekyong; Song, Jinwoong
2006-01-01
This study examined the features of peer argumentation in middle school students' scientific inquiry. Participants were two boys and six girls in grade 8 of a middle school in Seoul, Korea. Students engaged in open inquiry activities in small groups. Each group prepared the report for peer review and then, during the peer discussion, presented…
ERIC Educational Resources Information Center
Ault, Marilyn; Craig-Hare, Jana; Frey, Bruce; Ellis, James D.; Bulgren, Janis
2015-01-01
Reason Racer is an online, rate-based, multiplayer game that applies specific game features in order to engage middle school students in introductory knowledge of and thinking related to scientific argumentation. Game features include rapid and competitive play, timed performance, immediate feedback, and high rates of response across many…
Manuscript Architect: a Web application for scientific writing in virtual interdisciplinary groups
Pietrobon, Ricardo; Nielsen, Karen C; Steele, Susan M; Menezes, Andreia P; Martins, Henrique; Jacobs, Danny O
2005-01-01
Background Although scientific writing plays a central role in the communication of clinical research findings and consumes a significant amount of time from clinical researchers, few Web applications have been designed to systematically improve the writing process. This application had as its main objective the separation of the multiple tasks associated with scientific writing into smaller components. It was also aimed at providing a mechanism where sections of the manuscript (text blocks) could be assigned to different specialists. Manuscript Architect was built using Java language in conjunction with the classic lifecycle development method. The interface was designed for simplicity and economy of movements. Manuscripts are divided into multiple text blocks that can be assigned to different co-authors by the first author. Each text block contains notes to guide co-authors regarding the central focus of each text block, previous examples, and an additional field for translation when the initial text is written in a language different from the one used by the target journal. Usability was evaluated using formal usability tests and field observations. Results The application presented excellent usability and integration with the regular writing habits of experienced researchers. Workshops were developed to train novice researchers, presenting an accelerated learning curve. The application has been used in over 20 different scientific articles and grant proposals. Conclusion The current version of Manuscript Architect has proven to be very useful in the writing of multiple scientific texts, suggesting that virtual writing by interdisciplinary groups is an effective manner of scientific writing when interdisciplinary work is required. PMID:15960855
Huh, Sun
2013-01-01
ScienceCentral, a free or open access, full-text archive of scientific journal literature at the Korean Federation of Science and Technology Societies, was under test in September 2013. Since it is a Journal Article Tag Suite-based full text database, extensible markup language files of all languages can be presented, according to Unicode Transformation Format 8-bit encoding. It is comparable to PubMed Central: however, there are two distinct differences. First, its scope comprises all science fields; second, it accepts all language journals. Launching ScienceCentral is the first step for free access or open access academic scientific journals of all languages to leap to the world, including scientific journals from Croatia.
Feature extraction for document text using Latent Dirichlet Allocation
NASA Astrophysics Data System (ADS)
Prihatini, P. M.; Suryawan, I. K.; Mandia, IN
2018-01-01
Feature extraction is one of the stages in an information retrieval system and is used to extract the unique feature values of a text document. Feature extraction can be done by several methods, one of which is Latent Dirichlet Allocation. However, research on text feature extraction using the Latent Dirichlet Allocation method is rarely found for Indonesian text. Therefore, in this research, text feature extraction is implemented for Indonesian text. The research method consists of data acquisition, text pre-processing, initialization, topic sampling and evaluation. The evaluation is done by comparing the Precision, Recall and F-Measure values of Latent Dirichlet Allocation with those of Term Frequency Inverse Document Frequency KMeans, which is commonly used for feature extraction. The evaluation results show that the Precision, Recall and F-Measure values of the Latent Dirichlet Allocation method are higher than those of the Term Frequency Inverse Document Frequency KMeans method. This shows that the Latent Dirichlet Allocation method is able to extract features and cluster Indonesian text better than the Term Frequency Inverse Document Frequency KMeans method.
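The three evaluation measures named in the abstract above have standard set-based definitions. The sketch below is illustrative only: it assumes the balanced F-measure (F1), which the abstract does not specify, and the function name and the representation of results as sets of document identifiers are hypothetical.

```python
def precision_recall_f1(predicted: set, relevant: set) -> tuple[float, float, float]:
    """Compare a set of predicted items against the set of truly relevant items."""
    true_positives = len(predicted & relevant)
    # Precision: share of predicted items that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of relevant items that were found.
    recall = true_positives / len(relevant) if relevant else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # Balanced F-measure: harmonic mean of precision and recall (an assumption here).
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

Computing the same three values for each method on the same labeled documents gives the kind of side-by-side comparison the abstract describes.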
NCI at Frederick Scientific Library Reintroduces Scientific Publications Database | Poster
A 20-year-old database of scientific publications by NCI at Frederick, FNLCR, and affiliated employees has gotten a significant facelift. Maintained by the Scientific Library, the redesigned database—which is linked from each of the Scientific Library’s web pages—offers features that were not available in previous versions, such as additional search limits and non-traditional metrics for scholarly and scientific publishing known as altmetrics.
Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use
ERIC Educational Resources Information Center
White, Sheida
2012-01-01
This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…
ERIC Educational Resources Information Center
Pease, Craig M.; Bull, J. J.
1992-01-01
Offers a concise, abstract description of the scientific method different from the historical, philosophical, and case-study approaches, which lead to comprehension of this method. Discusses features of scientific models, dynamic interactions underlying scientific progress, ways that scientist successfully understand nature, mechanisms for…
Hahn, P; Dullweber, F; Unglaub, F; Spies, C K
2014-06-01
Searching for relevant publications is becoming more difficult with the increasing number of scientific articles. Text mining as a specific form of computer-based data analysis may be helpful in this context. Highlighting relations between authors and finding relevant publications concerning a specific subject using text analysis programs are illustrated graphically by 2 performed examples. © Georg Thieme Verlag KG Stuttgart · New York.
Sentiment analysis of feature ranking methods for classification accuracy
NASA Astrophysics Data System (ADS)
Joseph, Shashank; Mugauri, Calvin; Sumathy, S.
2017-11-01
Text pre-processing and feature selection are important and critical steps in text mining. Text pre-processing of large volumes of data is a difficult task, as unstructured raw data is converted into a structured format. Traditional methods of processing and weighting took much time and were less accurate. To overcome this challenge, feature ranking techniques have been devised. A feature set from text pre-processing is fed as input for feature selection. Feature selection helps improve text classification accuracy. Of the three feature selection categories available, the filter category is the focus. Five feature ranking methods are analyzed, namely document frequency, standard deviation, information gain, chi-square, and weighted log-likelihood ratio.
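As an illustration of one filter-style ranking method named above, the chi-square score of a term for a class can be computed from a 2x2 table of document counts. This is a generic sketch of the standard statistic, not the authors' implementation; the function and argument names are hypothetical.

```python
def chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi-square score of a term for a class from a 2x2 document-count table.

    a: documents in the class that contain the term
    b: documents outside the class that contain the term
    c: documents in the class that lack the term
    d: documents outside the class that lack the term
    """
    n = a + b + c + d
    denominator = (a + c) * (b + d) * (a + b) * (c + d)
    if denominator == 0:
        # A row or column is empty, so the term carries no class information.
        return 0.0
    return n * (a * d - c * b) ** 2 / denominator


def rank_terms(tables: dict[str, tuple[int, int, int, int]]) -> list[str]:
    """Order terms from most to least class-dependent by chi-square score."""
    return sorted(tables, key=lambda term: chi_square(*tables[term]), reverse=True)
```

A term that occurs equally often inside and outside the class scores zero, while a term confined to one class scores highest; keeping the top-ranked terms is what makes this a filter method.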
The readability of scientific texts is decreasing over time
2017-01-01
Clarity and accuracy of reporting are fundamental to the scientific process. Readability formulas can estimate how difficult a text is to read. Here, in a corpus consisting of 709,577 abstracts published between 1881 and 2015 from 123 scientific journals, we show that the readability of science is steadily decreasing. Our analyses show that this trend is indicative of a growing use of general scientific jargon. These results are concerning for scientists and for the wider public, as they impact both the reproducibility and accessibility of research findings. PMID:28873054
A matter of font type: The effect of serifs on the evaluation of scientific abstracts.
Kaspar, Kai; Wehlitz, Thea; von Knobelsdorff, Sara; Wulf, Tim; von Saldern, Marie Antoinette Oktavie
2015-10-01
Text-based communication is one of the substantial ways of spreading scientific information. While the content and contextual aspects of written words have been widely researched, the impact of font characteristics on text perception is an almost blank page. The following study deals with the influence of serifs on the evaluation of online-presented scientific abstracts. Yet there is only evidence for faster reading times when texts are presented in sans-serif fonts, although the opposite is stated in parts of the literature. The present work examines if the presence or absence of serifs also have an impact on the appraisal of scientific texts when all other important font characteristics do not change. For this purpose, 188 university students participated in an online experiment and rated different aspects of scientific abstracts as well as of the research outlined in the abstracts. The results show that missing serifs led to increased reading speed. However, and in contrast to the perceptual fluency hypothesis, the presence of serifs had a positive effect on all evaluation dimensions. The results of a second study with 187 participants also indicated that reading fluency counteracted the liking of texts. Implications for future studies and media production are discussed. © 2015 International Union of Psychological Science.
GeneView: a comprehensive semantic search engine for PubMed.
Thomas, Philippe; Starlinger, Johannes; Vowinkel, Alexander; Arzt, Sebastian; Leser, Ulf
2012-07-01
Research results are primarily published in scientific literature and curation efforts cannot keep up with the rapid growth of published literature. The plethora of knowledge remains hidden in large text repositories like MEDLINE. Consequently, life scientists have to spend a great amount of time searching for specific information. The enormous ambiguity among most names of biomedical objects such as genes, chemicals and diseases often produces too large and unspecific search results. We present GeneView, a semantic search engine for biomedical knowledge. GeneView is built upon a comprehensively annotated version of PubMed abstracts and openly available PubMed Central full texts. This semi-structured representation of biomedical texts enables a number of features extending classical search engines. For instance, users may search for entities using unique database identifiers or they may rank documents by the number of specific mentions they contain. Annotation is performed by a multitude of state-of-the-art text-mining tools for recognizing mentions from 10 entity classes and for identifying protein-protein interactions. GeneView currently contains annotations for >194 million entities from 10 classes for ∼21 million citations with 271,000 full text bodies. GeneView can be searched at http://bc3.informatik.hu-berlin.de/.
Automating document classification for the Immune Epitope Database
Wang, Peng; Morgan, Alexander A; Zhang, Qing; Sette, Alessandro; Peters, Bjoern
2007-01-01
Background: The Immune Epitope Database contains information on immune epitopes curated manually from the scientific literature. Like similar projects in other knowledge domains, significant effort is spent on identifying which articles are relevant for this purpose. Results: Here we report our experience in automating this process using Naïve Bayes classifiers trained on 20,910 abstracts classified by domain experts. Improvements on the basic classifier performance were made by (a) utilizing information stored in PubMed beyond the abstract itself, (b) applying standard feature selection criteria, and (c) extracting domain-specific feature patterns that, e.g., identify peptide sequences. We have implemented the classifier into the curation process, determining whether abstracts are clearly relevant, clearly irrelevant, or whether no certain classification can be made, in which case the abstracts are manually classified. Testing this classification scheme on an independent dataset, we achieve 95% sensitivity and specificity in the 51.1% of abstracts that were automatically classified. Conclusion: By implementing text classification, we have sped up the reference selection process without sacrificing sensitivity or specificity of the human expert classification. This study provides both practical recommendations for users of text classification tools and a large dataset that can serve as a benchmark for tool developers. PMID:17655769
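The three-way triage described in this abstract (clearly relevant, clearly irrelevant, or sent to manual review) can be sketched with a small multinomial Naïve Bayes classifier and two probability cut-offs. This is a minimal illustration, not the authors' implementation. The training texts, the Laplace smoothing, and the 0.2 / 0.8 thresholds are all assumptions chosen for the example.

```python
import math
from collections import Counter


def tokenize(text):
    return text.lower().split()


class NaiveBayes:
    """Minimal multinomial Naive Bayes with Laplace (add-one) smoothing."""

    def __init__(self):
        self.word_counts = {True: Counter(), False: Counter()}
        self.doc_counts = {True: 0, False: 0}
        self.vocab = set()

    def fit(self, texts, labels):
        # Assumes both classes occur at least once in the training data.
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.doc_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)

    def prob_relevant(self, text):
        """Posterior probability that the text is relevant."""
        total_docs = sum(self.doc_counts.values())
        log_scores = {}
        for label in (True, False):
            total_words = sum(self.word_counts[label].values())
            score = math.log(self.doc_counts[label] / total_docs)
            for tok in tokenize(text):
                if tok not in self.vocab:
                    continue  # words never seen in training carry no evidence
                score += math.log((self.word_counts[label][tok] + 1)
                                  / (total_words + len(self.vocab)))
            log_scores[label] = score
        # Normalize in a numerically stable way.
        top = max(log_scores.values())
        exp = {k: math.exp(v - top) for k, v in log_scores.items()}
        return exp[True] / (exp[True] + exp[False])


def triage(p_relevant, low=0.2, high=0.8):
    """Hypothetical cut-offs: confident calls are automated, the rest go to a curator."""
    if p_relevant >= high:
        return "relevant"
    if p_relevant <= low:
        return "irrelevant"
    return "manual"


# Invented toy training set (True = relevant to epitope curation).
texts = ["epitope binding peptide assay",
         "t cell epitope mapping peptide",
         "galaxy redshift survey",
         "stellar spectra survey telescope"]
labels = [True, True, False, False]

model = NaiveBayes()
model.fit(texts, labels)

for abstract in ["peptide epitope assay",
                 "galaxy telescope survey",
                 "quantum cryptography"]:
    print(abstract, "->", triage(model.prob_relevant(abstract)))
```

Widening the gap between the two cut-offs sends more abstracts to manual review but makes the automated decisions more reliable, which is the trade-off behind reporting accuracy only for the automatically classified fraction.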
Huh, Sun
2013-01-01
ScienceCentral, a free or open access, full-text archive of scientific journal literature at the Korean Federation of Science and Technology Societies, was under test in September 2013. Since it is a Journal Article Tag Suite-based full text database, extensible markup language files of all languages can be presented, according to Unicode Transformation Format 8-bit encoding. It is comparable to PubMed Central; however, there are two distinct differences. First, its scope comprises all science fields; second, it accepts journals in all languages. Launching ScienceCentral is the first step for free access or open access academic scientific journals of all languages to leap to the world, including scientific journals from Croatia. PMID:24266292
Text Recycling in Scientific Writing.
Moskovitz, Cary
2018-03-15
Text recycling, often called "self-plagiarism", is the practice of reusing textual material from one's prior documents in a new work. The practice presents a complex set of ethical and practical challenges to the scientific community, many of which have not been addressed in prior discourse on the subject. This essay identifies and discusses these factors in a systematic fashion, concluding with a new definition of text recycling that takes these factors into account. Topics include terminology, what is not text recycling, factors affecting judgements about the appropriateness of text recycling, and visual materials.
Rughiniş, Cosima; Ciocănel, Alexandra; Vasile, Sorina
2017-09-27
We discuss homeopathy's placebo effect as the result of a distributed therapeutic agency involving humans, objects, and texts. Homeopathy has been involved in controversies for centuries, and the dispute whether it is therapy or quackery is as lively as ever. Still, homeopathy has retained significant popularity and acceptance within the medical establishment. We bracket the issue of biochemical effectiveness of homeopathic remedies as we only discuss homeopathy's potential to elicit a placebo response within its therapeutic alliance, in virtue of its social, symbolic, and material features. The review is based on literature discussing homeopathic effectiveness, including historical, biographical, sociological, and epistemological perspectives. We build upon research that clarifies the therapeutic relationship, examining its activities and meanings for practitioners and patients. Previous analyses discussing homeopathy's placebo effect stress the importance of the individualized consultation that functions as psychotherapy and generates empathy and hope. We enlarge the discussion, highlighting homeopathy's distributed therapeutic agency across humans, texts, and materials. The historical evolution of homeopathy in relation to biomedicine and science is important to understand its institutional integration into mainstream medicine and its appeal to scientifically minded doctors. Anecdotes of healing and the message of no-harm encourage patients to try homeopathy and hope for the best. The esthetics and ritual of remedies, coupled with computers' scientific legitimacy and time-saving power constitute a material infrastructure of therapeutic persuasion. Through its relation with biomedicine, its doctrine, consultation design, and treatment rituals, homeopathy offers a powerful medium to elicit a placebo response in a therapeutic alliance. 
By virtue of its proximity and radical difference from the scientific and biomedical enterprises, its material and textual organization, its storytelling and esthetics, homeopathy offers doctors and patients the opportunity and the tools to collaborate, to witness healing, and to hope for success against adversity.
NASA Astrophysics Data System (ADS)
Hapgood, Susanna Elizabeth
This interpretive case study describes a 10-day inquiry science program of study of motion down inclined planes during which a class of 21 second graders investigated scientific relationships such as mass and speed, speed and momentum, and mass and momentum via both text-based experiences ("second-hand investigations") and hands-on materials-based experiments ("first-hand investigations"). Data sources included over 11 hours of videotaped instruction in addition to children's written work, class-generated artifacts, and paper-and-pencil pre- and posttests. Content analyses informed by both sociocultural and developmental perspectives revealed that, in addition to a significant increase in pre- to posttest scores, children in the class engaged in several processes integral to inquiry, namely, (a) using data as evidence, (b) evaluating investigative procedures, and (c) making sense of multiple forms of representations. In addition, the study describes the range of and shifts in children's ideas about scientific relationships fundamental to developing an understanding of motion. Many children were observed to make causal attributions involving a relationship between two variables, such as the mass and momentum of a ball rolling down a ramp. Discussed are mediating factors such as the teacher's role in scaffolding the class's investigations and features of the innovative "scientists' notebook" texts, which were integral to the instruction. Also presented is evidence of first-hand and second-hand investigations working in concert to provide the elementary school students with rich opportunities to learn and to express their developing understandings of scientific ideas. This study provides a rare glimpse of primary-grade inquiry-based science instruction within a classroom context.
Teixeira, Marlon Amaro Coelho; Belloze, Kele Teixeira; Cavalcanti, Maria Cláudia; Silva-Junior, Floriano P
2018-04-01
Semantic text annotation enables the association of semantic information (ontology concepts) to text expressions (terms), which are readable by software agents. In the scientific scenario, this is particularly useful because it reveals a lot of scientific discoveries that are hidden within academic articles. The biomedical area has more than 300 ontologies, most of them composed of over 500 concepts. These ontologies can be used to annotate scientific papers and thus facilitate data extraction. However, in the context of scientific research, a simple keyword-based query using the interface of a digital library of scientific texts can return more than a thousand hits. The analysis of such a large set of texts, annotated with such numerous and large ontologies, is not an easy task. Therefore, the main objective of this work is to provide a method that could facilitate this task. This work describes a method called Text and Ontology ETL (TOETL) to build an analytical view over such texts. First, a corpus of selected papers is semantically annotated using distinct ontologies. Then, the annotation data is extracted, organized and aggregated into the dimensional schema of a data mart. Besides the TOETL method, this work illustrates its application through the development of the TaP DM (Target Prioritization data mart). This data mart focuses on the research of gene essentiality, a key concept to be considered when searching for genes showing potential as anti-infective drug targets. This work reveals that the proposed approach is a relevant tool to support decision making in the prioritization of new drug targets, being more efficient than the traditional keyword-based tools. Copyright © 2018 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Gonçalves Nigro, Rogerio; Frateschi Trivelato, Silvia
2012-11-01
The purpose of this article is to assess the knowledge, application of knowledge, and attitudes associated with the reading of different genres of expository science texts. We assigned approximately half of a sample consisting of 220 students 14-15 years of age, chosen at random, to read an excerpt from a popular scientific text, and the other half to read an excerpt from a textbook addressing the same topic. Readers took knowledge and application tests immediately after the reading and again 15 days later. Students also took knowledge and reading proficiency pre-tests, and attitude tests related to the selected texts. Overall, girls scored higher than boys and readers of the popular scientific text scored higher than their colleagues who read the textbook excerpt. We noted interaction between 'reader gender' and 'genre of the text read' in terms of long-term learning based on the reading. Attitude regarding the text read appears as an important factor in explaining behavior of boys who read the popular scientific text. Surprisingly, knowledge and application test scores were not statistically different among girls with different degrees of reading proficiency who read the textbook excerpt. In addition, on the application tests, among the boys who read the popular scientific text, good readers scored lower than their colleagues who read the textbook excerpt. In our opinion, this study can serve to show that 'reading in science education' is not a trivial matter and we feel that the subject merits more in-depth investigation.
Drawing on Text Features for Reading Comprehension and Composing
ERIC Educational Resources Information Center
Risko, Victoria J.; Walker-Dalhouse, Doris
2011-01-01
Students read multiple-genre texts such as graphic novels, poetry, brochures, digitized texts with videos, and informational and narrative texts. Features such as overlapping illustrations and implied cause-and-effect relationships can affect students' comprehension. Teaching with these texts and drawing attention to organizational features hold…
ERIC Educational Resources Information Center
Arthurs, Leilani A.; Van Den Broeke, Matthew S.
2016-01-01
The ability to explain scientific phenomena is a key feature of scientific literacy, and engaging students' prior knowledge, especially their alternate conceptions, is an effective strategy for enhancing scientific literacy and developing expertise. The gap in knowledge about the alternate conceptions that novices have about many of Earth's…
Integrated scientific data bases review on asulacrine and associated toxicity.
Afzal, Attia; Sarfraz, Muhammad; Wu, Zimei; Wang, Guangji; Sun, Jianguo
2016-08-01
Asulacrine (ASL), a weakly basic and highly lipophilic drug, was synthesized in the 1980s in a cancer research laboratory in Auckland through modifications to the acridine portion of amsacrine on the 3-, 4- and 5-substitution patterns. In contrast to its precursor amsacrine (m-AMSA), ASL was effective not only against leukemia and the Lewis lung tumor system but also against a wide variety of solid tumors. Its metabolic pathway differs from that of amsacrine; hence different side effects, hepatotoxicity and excretion were observed. Asulacrine is in phase II clinical trials and has shown promising results, but its toxicity, especially phlebitis, is a stumbling block to its clinical implementation. This review is an effort to give researchers a possible clue, based on scientifically proven results, for solving the mystery of the associated toxicity, phlebitis. The review covers the available literature on asulacrine and other acridine derivatives regarding pharmacology, pharmacokinetics, quantitative structure-activity relationships and toxicology, gathered via electronic searches of scientific databases such as PubMed and others. All abstracts and full-text articles available to date were discussed and analyzed. The tabulated comparisons and circuitry mechanism of ASL are added features of the review, which give a complete understanding of hidden aspects of the possible root cause of the associated toxicity, phlebitis. Copyright © 2016. Published by Elsevier Ireland Ltd.
Hao, Xin; Cui, Shuai; Li, Wenfu; Yang, Wenjing; Qiu, Jiang; Zhang, Qinglin
2013-10-09
Insight can be the first step toward creating a groundbreaking product. As evident in anecdotes and major inventions in history, heuristic events (heuristic prototypes) prompted inventors to acquire insight when solving problems. Bionic imitation in scientific innovation is an example of this kind of problem solving. In particular, heuristic prototypes (e.g., the lotus effect; the very high water repellence exhibited by lotus leaves) help solve insight problems (e.g., non-stick surfaces). We speculated that the biological functional feature of prototypes is a critical factor in inducing insightful scientific problem solving. In this functional magnetic resonance imaging (fMRI) study, we selected scientific innovation problems and utilized a "learning prototypes-solving problems" two-phase paradigm to test the supposition. We also explored its neural mechanisms. Functional MRI data showed that the activation of the middle temporal gyrus (MTG, BA 37) and the middle occipital gyrus (MOG, BA 19) was associated with the highlighted functional feature condition. fMRI data also indicated that the MTG (BA 37) could be responsible for the semantic processing of functional features and for the formation of novel associations based on related functions. In addition, the MOG (BA 19) could be involved in the visual imagery of formation and application of function association between the heuristic prototype and problem. Our findings suggest that both semantic processing and visual imagery could be crucial components underlying scientific problem solving. © 2013 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Coll, Richard K.; Lay, Mark C.; Taylor, Neil
2008-01-01
Scientific literacy is explored in this paper which describes two studies that seek to understand a particular feature of the nature of science; namely scientists' habits of mind. The research investigated scientists' views of scientific evidence and how scientists judge evidence claims. The first study is concerned with scientists' views of what…
ERIC Educational Resources Information Center
Thomm, Eva; Bromme, Rainer
2012-01-01
The Internet is a convenient source of information about science-based topics (e.g., health matters). Whereas experts are familiar with the conventions of "true" scientific discourse and the assessment of scientific information, laypeople may have great difficulty choosing among, evaluating, and deciding on the vast amount of information…
Experiential Thinking in Creationism—A Textual Analysis
Nieminen, Petteri; Ryökäs, Esko; Mustonen, Anne-Mari
2015-01-01
Creationism is a religiously motivated worldview in denial of biological evolution that has been very resistant to change. We performed a textual analysis by examining creationist and pro-evolutionary texts for aspects of “experiential thinking”, a cognitive process different from scientific thought. We observed characteristics of experiential thinking as follows: testimonials (present in 100% of sampled creationist texts), such as quotations, were a major form of proof. Confirmation bias (100% of sampled texts) was represented by ignoring or dismissing information that would contradict the creationist hypothesis. Scientifically irrelevant or flawed information was re-interpreted as relevant for the falsification of evolution (75–90% of sampled texts). Evolutionary theory was associated to moral issues by demonizing scientists and linking evolutionary theory to atrocities (63–93% of sampled texts). Pro-evolutionary rebuttals of creationist claims also contained testimonials (93% of sampled texts) and referred to moral implications (80% of sampled texts) but displayed lower prevalences of stereotypical thinking (47% of sampled texts), confirmation bias (27% of sampled texts) and pseudodiagnostics (7% of sampled texts). The aspects of experiential thinking could also be interpreted as argumentative fallacies. Testimonials lead, for instance, to ad hominem and appeals to authorities. Confirmation bias and simplification of data give rise to hasty generalizations and false dilemmas. Moral issues lead to guilt by association and appeals to consequences. Experiential thinking and fallacies can contribute to false beliefs and the persistence of the claims. We propose that science educators would benefit from the systematic analysis of experiential thinking patterns and fallacies in creationist texts and pro-evolutionary rebuttals in order to concentrate on scientific misconceptions instead of the scientifically irrelevant aspects of the creationist—evolutionist debate. 
PMID:25734650
An Evaluation of Text Mining Tools as Applied to Selected Scientific and Engineering Literature.
ERIC Educational Resources Information Center
Trybula, Walter J.; Wyllys, Ronald E.
2000-01-01
Addresses an approach to the discovery of scientific knowledge through an examination of data mining and text mining techniques. Presents the results of experiments that investigated knowledge acquisition from a selected set of technical documents by domain experts. (Contains 15 references.) (Author/LRW)
Reading Online News Media for Science Content: A Social Psychological Approach
ERIC Educational Resources Information Center
Roth, Wolff-Michael
2010-01-01
Reading multimodal (popularized) scientific texts is studied predominantly in terms of said-to-be-required technical decoding skills. In this article I suggest that there are other interesting approaches to studying the reading of multimodal (popularized) scientific texts, approaches that are grounded in social psychological concerns. These…
Problems of Simultaneous Interpreting of Scientific Discussion.
ERIC Educational Resources Information Center
Chachibaia, Nelly
This article focuses on the problems of simultaneous translation (SI) of scientific discussion at the Conference on Training Translators and Interpreters in the New Millennium, the development of which greatly depends on extralinguistic, external conference conditions. Text linguistics considers text not only as a grammatical unit larger than a…
Tieberghien, Julie
2014-03-01
Drug policy is one of the most polarised subjects of public debate and media coverage, which frequently tend to be dramatic and event-centred. Although the role of the media in directing the drug discourse is widely acknowledged, limited research has been conducted in examining the particular role of the media in the science-policy nexus. We sought to determine how the (mis)representation of scientific knowledge in the media may, or may not, have an impact on the contribution of scientific knowledge to the drug-policy making process. Using a case study of the Belgian drug-policy debates between 1996 and 2003, we conducted a discourse analysis of 1,067 specially selected newspaper articles and 164 policy documents. Our analysis focused on textual elements that feature intra-discourse differences, how players and scientific knowledge are represented in the text, the arguments used and claims made, and the various types of research utilisation. Media discourse strongly influenced the public's and policy makers' understanding as well as the content of the Belgian drug policy debate between 1996 and 2003. As a major source of scientific knowledge, media coverage supported the 'enlightenment' role of scientific knowledge in the policy-making process by broadening and even determining frames of reference. However, as the presentation of scientific knowledge in the media was often inaccurate or distorted due to the lack of contextual information or statistical misinformation, the media may also support the selective utilisation of scientific knowledge. Many challenges as well as opportunities lie ahead for researchers who want to influence the policy-making process, since most research fails to go beyond academic publications. Although media is a valuable linking mechanism between science and policy, by no means does it provide scientists with a guarantee of a more 'evidence-based' drug policy. Copyright © 2013 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Jian, Yu-Cin; Wu, Chao-Jung
2015-02-01
We investigated strategies used by readers when reading a science article with a diagram and assessed whether semantic and spatial representations were constructed while reading the diagram. Seventy-one undergraduate participants read a scientific article while their eye movements were tracked and then completed a reading comprehension test. Our results showed that the text-diagram referencing strategy was commonly used. However, some readers adopted other reading strategies, such as reading the diagram or text first. We found that all readers who had referred to the diagram spent roughly the same amount of time reading and performed equally well. However, some participants who ignored the diagram performed more poorly on questions that tested understanding of basic facts. This result suggests that dual coding theory may explain the phenomenon. Eye movement patterns indicated that at least some readers had extracted semantic information of the scientific terms when first looking at the diagram. Readers who read the scientific terms on the diagram first tended to spend less time looking at the same terms in the text, which they read afterwards. Moreover, clearly presented diagrams can help readers process both semantic and spatial information, thereby facilitating an overall understanding of the article. In addition, although text-first and diagram-first readers spent similar total reading time on the text and diagram parts of the article, respectively, text-first readers had significantly fewer saccades of text and diagram than diagram-first readers. This result might be explained as text-directed reading.
Heming, Thomas A; Nandagopal, Shobha
2012-11-01
Medical education requires student comprehension of both technical (scientific/medical) and non-technical (general) vocabulary. Our experience with "English as a second language" (ESL) Arab students suggested they often have problems comprehending scientific statements because of weaknesses in their understanding of non-scientific vocabulary. This study aimed to determine whether ESL students have difficulties with general vocabulary that could hinder their understanding of scientific/medical texts. A survey containing English text was given to ESL students in the premedical years of an English-medium medical school in an Arabic country. The survey consisted of sample questions from the Medical College Admission Test (USA). Students were instructed to identify all unknown words in the text. ESL students commenced premedical studies with substantial deficiencies in English vocabulary. Students from English-medium secondary schools had a selective deficiency in scientific/medical terminology which disappeared with time. Students from Arabic-medium secondary schools had equal difficulty with general and scientific/medical vocabulary. Deficiencies in both areas diminished with time but remained even after three years of English-medium higher education. Typically, when teaching technical subjects to ESL students, attention is focused on subject-unique vocabulary and associated modifiers. This study highlights that ESL students also face difficulties with the general vocabulary used to construct statements employing technical words. Such students would benefit from increases in general vocabulary knowledge.
The Ties that Bind: Emergent Literacy and Scientific Inquiry
ERIC Educational Resources Information Center
Whitin, Phyllis
2007-01-01
This study describes one kindergarten classroom in which informational books and other nonfiction resources were used in the context of a long-term scientific study. Children became proficient in locating information and interpreting content-specific textual features in the process of making sense of their scientific observations and sharing them…
ERIC Educational Resources Information Center
Rönnebeck, Silke; Bernholt, Sascha; Ropohl, Mathias
2016-01-01
Despite the importance of scientific inquiry in science education, researchers and educators disagree considerably regarding what features define this instructional approach. While a large body of literature addresses theoretical considerations, numerous empirical studies investigate scientific inquiry on quite different levels of detail and also…
Story Telling With Storyboards: Enhancements and Experiences
NASA Astrophysics Data System (ADS)
King, T. A.; Grayzeck, E. J.; Galica, C.; Erickson, K. J.
2016-12-01
A year ago a tool to help tell stories, called the Planetary Data Storyboard, was introduced. This tool is designed to use today's technologies to tell stories that are rich multi-media experiences, blending text, animations, movies and infographics. The Storyboard tool presents a set of panels that contain representative images of an event with associated notes or instructions. The panels are arranged in a timeline that allows a user to experience a discovery or event in the same way it occurred. Each panel can link to a more detailed source such as a publication, the data that was collected or items derived from the research (like movies or animations). A storyboard can be used to make science discovery more accessible to people by presenting events in an easy-to-follow layout. A storyboard can also help to teach the scientific method, by following the experiences of a researcher as they investigate a phenomenon or try to understand a new set of observations. We present the new features of the Storyboard tool and show example stories for scientific discoveries.
Effective Tooling for Linked Data Publishing in Scientific Research
DOE Office of Scientific and Technical Information (OSTI.GOV)
Purohit, Sumit; Smith, William P.; Chappell, Alan R.
Challenges that make it difficult to find, share, and combine published data, such as data heterogeneity and resource discovery, have led to increased adoption of semantic data standards and data publishing technologies. To make data more accessible, interconnected and discoverable, some domains are being encouraged to publish their data as Linked Data. Consequently, this trend greatly increases the amount of data that semantic web tools are required to process, store, and interconnect. In attempting to process and manipulate large data sets, tools, ranging from simple text editors to modern triplestores, eventually break down upon reaching undefined thresholds. This paper offers a systematic approach that data publishers can use to categorize suitable tools to meet their data publishing needs. We present a real-world use case, the Resource Discovery for Extreme Scale Collaboration (RDESC), which features a scientific dataset (maximum size of 1.4 billion triples) used to evaluate a toolbox for data publishing in climate research. This paper also introduces a semantic data publishing software suite developed for the RDESC project.
NASA Astrophysics Data System (ADS)
Michalsky, Tova
2013-07-01
This study investigated the effectiveness of cognitive-metacognitive versus motivational components of the IMPROVE self-regulatory model, used while reading scientific texts, for 10th graders' scientific literacy and self-regulated learning (SRL). Three treatment groups (N = 198) received one type of self-addressable questions while reading scientific texts: cognitive-metacognitive (CogMet), motivational (Mot), or combined (CogMetMot). Control group received no self-addressed questions (noSRL). One measure assessed scientific literacy, and two measures assessed SRL: (a) as an aptitude-pre/post questionnaires assessing self-perceived SRL, and (b) as an event-audiotaping participants' thinking-aloud SRL behaviors in real-time learning experiences and data coding illustrating SRL changes. Findings indicated that treatment groups significantly outperformed the non-treatment group. No differences emerged between CogMet and Mot, whereas fully combined SRL support (CogMetMot) was most effective. Theoretical and practical implications of this preliminary study are discussed.
ERIC Educational Resources Information Center
Park, Do-Yong; Park, Mira
2013-01-01
The purpose of this study was to investigate the inquiry features demonstrated in the inquiry tasks of a high school Earth Science curriculum. One of the most widely used curricula, Holt Earth Science, was chosen for this case study to examine how Earth Science logical reasoning and authentic scientific inquiry were related to one another and how…
Gimli: open source and high-performance biomedical name recognition
2013-01-01
Background Automatic recognition of biomedical names is an essential task in biomedical information extraction, presenting several complex and unsolved challenges. In recent years, various solutions have been implemented to tackle this problem. However, limitations regarding system characteristics, customization and usability still hinder their wider application outside text mining research. Results We present Gimli, an open-source, state-of-the-art tool for automatic recognition of biomedical names. Gimli includes an extended set of implemented and user-selectable features, such as orthographic, morphological, linguistic-based, conjunctions and dictionary-based. A simple and fast method to combine different trained models is also provided. Gimli achieves an F-measure of 87.17% on GENETAG and 72.23% on JNLPBA corpus, significantly outperforming existing open-source solutions. Conclusions Gimli is an off-the-shelf, ready to use tool for named-entity recognition, providing trained and optimized models for recognition of biomedical entities from scientific text. It can be used as a command line tool, offering full functionality, including training of new models and customization of the feature set and model parameters through a configuration file. Advanced users can integrate Gimli in their text mining workflows through the provided library, and extend or adapt its functionalities. Based on the underlying system characteristics and functionality, both for final users and developers, and on the reported performance results, we believe that Gimli is a state-of-the-art solution for biomedical NER, contributing to faster and better research in the field. Gimli is freely available at http://bioinformatics.ua.pt/gimli. PMID:23413997
NASA Astrophysics Data System (ADS)
Ha, Minsu; Nehm, Ross H.
2016-06-01
Automated computerized scoring systems (ACSSs) are being increasingly used to analyze text in many educational settings. Nevertheless, the impact of misspelled words (MSW) on scoring accuracy remains to be investigated in many domains, particularly jargon-rich disciplines such as the life sciences. Empirical studies confirm that MSW are a pervasive feature of human-generated text and that despite improvements, spell-check and auto-replace programs continue to be characterized by significant errors. Our study explored four research questions relating to MSW and text-based computer assessments: (1) Do English language learners (ELLs) produce equivalent magnitudes and types of spelling errors as non-ELLs? (2) To what degree do MSW impact concept-specific computer scoring rules? (3) What impact do MSW have on computer scoring accuracy? and (4) Are MSW more likely to impact false-positive or false-negative feedback to students? We found that although ELLs produced twice as many MSW as non-ELLs, MSW were relatively uncommon in our corpora. The MSW in the corpora were found to be important features of the computer scoring models. Although MSW did not significantly or meaningfully impact computer scoring efficacy across nine different computer scoring models, MSW had a greater impact on the scoring algorithms for naïve ideas than key concepts. Linguistic and concept redundancy in student responses explains the weak connection between MSW and scoring accuracy. Lastly, we found that MSW tend to have a greater impact on false-positive feedback. We discuss the implications of these findings for the development of next-generation science assessments.
Semantic Web Compatible Names and Descriptions for Organisms
NASA Astrophysics Data System (ADS)
Wang, H.; Wilson, N.; McGuinness, D. L.
2012-12-01
Modern scientific names are critical for understanding the biological literature and provide a valuable way to understand evolutionary relationships. To validly publish a name, a description is required to separate the described group of organisms from those described by other names at the same level of the taxonomic hierarchy. The frequent revision of descriptions due to new evolutionary evidence has led to situations where a single given scientific name may over time have multiple descriptions associated with it and a given published description may apply to multiple scientific names. Because of these many-to-many relationships between scientific names and descriptions, the usage of scientific names as a proxy for descriptions is inevitably ambiguous. Another issue lies in the fact that the precise application of scientific names often requires careful microscopic work, or increasingly, genetic sequencing, as scientific names are focused on the evolutionary relatedness between and within named groups such as species, genera, families, etc. This is problematic to many audiences, especially field biologists, who often do not have access to the instruments and tools required to make identifications on a microscopic or genetic basis. To better connect scientific names to descriptions and find a more convenient way to support computer assisted identification, we proposed the Semantic Vernacular System, a novel naming system that creates named, machine-interpretable descriptions for groups of organisms, and is compatible with the Semantic Web. Unlike the evolutionary relationship based scientific naming system, it emphasizes the observable features of organisms. By independently naming the descriptions composed of sets of observational features, as well as maintaining connections to scientific names, it preserves the observational data used to identify organisms.
The system is designed to support a peer-review mechanism for creating new names, and uses a controlled vocabulary encoded in the Web Ontology Language to represent the observational features. A prototype of the system is currently under development in collaboration with the Mushroom Observer website. It allows users to propose new names and descriptions for fungi, provide feedback on those proposals, and ultimately have them formally approved. It relies on SPARQL queries and semantic reasoning for data management. This effort will offer the mycology community a knowledge base of fungal observational features and a tool for identifying fungal observations. It will also serve as an operational specification of how the Semantic Vernacular System can be used in practice in one scientific community (in this case mycology).
Zur Wortbildung in wissenschaftlichen Texten (Word Formation in Scientific Texts)
ERIC Educational Resources Information Center
Rogalla, Hanna; Rogalla, Willy
1976-01-01
Discusses a German frequency list of 1,500 to 2,000 scientific words, which is being developed, and the importance of learning word-building principles. Substantive and adjective suffixes are listed according to frequency, followed by remarks on copulative compounds, with examples and frequency ranking, and, finally, prefixes. (Text is in German.)…
Processes Utilized by High School Students Reading Scientific Text
ERIC Educational Resources Information Center
Clinger, Alicia Farr
2014-01-01
In response to an increased emphasis on disciplinary literacy in the secondary science classroom, an investigation of the literacy processes utilized by high school students while reading scientific text was undertaken. A think-aloud protocol was implemented to collect data on the processes students used when not prompted while reading a magazine…
76 FR 11373 - Fisheries of the Northeastern United States; Atlantic Herring; Amendment 4
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-02
...). This action specifies that ABC is to be recommended by the Council's Scientific and Statistical..., paragraphs (a) introductory text, (b)(1), (b)(2), (b)(3), (b)(4), (e), and (f) introductory text are revised... OFL. The Council's Scientific and Statistical Committee (SSC) shall recommend ABC to the Council...
Implementation and Evaluation of the Course Dossier Methodology
ERIC Educational Resources Information Center
Khanam, Wahidun N.; Kalman, Calvin S.
2017-01-01
It has been argued that for novice students to acquire a full understanding of scientific texts, they also need to pursue a recurrent construction of their comprehension of scientific concepts. The course dossier method has students examine concepts in multiple passes: (a) through reflective writing on text before it is considered in the…
ERIC Educational Resources Information Center
Hartwell, Laura M.; Jacques, Marie-Paule
2012-01-01
Both reading and writing abstracts require specific language skills and conceptual capacities, which may challenge advanced learners. This paper draws explicitly upon the "Emergence" and "Scientext" research projects which focused on the lexis of scientific texts in French and English. The teaching objective of the project…
Fourth and fifth grade Latino(a) students making meaning of scientific informational texts
NASA Astrophysics Data System (ADS)
Croce, Keri-Anne
Using a socio-psycholinguistic perspective of literacy and a social-semiotic analysis of texts, this study investigates how six students made meaning of informational texts. The students came to school from a variety of English and Spanish language backgrounds. The research question being asked was 'How do Latino(a) fourth and fifth grade students make meaning of English informational texts?' Miscue analysis was used as a tool to investigate how students who have been labeled non-struggling readers by their classroom teacher and are from various language backgrounds approached five informational texts. In order to investigate students' responses to the nature of informational texts, this dissertation draws on commonly occurring structures within texts. Primary data collected included read alouds and retellings of five texts, retrospective miscue analysis, and interviews with six participant students. Two of these participants are discussed within this dissertation. Secondary data included classroom observations and teacher interviews. This study proposes that non-native speakers may use scientific concept placeholders as they transact with informational texts. The use of scientific concept placeholders by a reader indicates that the reader is engaged in the meaning making process and possesses evolving scientific knowledge about a phenomenon. The findings suggest that Latino(a) students' understandings of English informational texts are influenced not only by a student's language development but also by (1) the nature of the text; (2) the reading strategies that a student uses, such as the use of placeholders; and (3) the influence of the researcher during the aided retelling. This study contributes methodological tools to assess English language learners' reading.
The conclusions presented within this study also support the idea that students from a variety of language backgrounds slightly altered their reliance on certain cuing systems as they encountered various sub-genres within an informational text. I conclude that reading assessment should account for how a student approaches different structural elements of a text.
Software Attribution for Geoscience Applications in the Computational Infrastructure for Geodynamics
NASA Astrophysics Data System (ADS)
Hwang, L.; Dumit, J.; Fish, A.; Soito, L.; Kellogg, L. H.; Smith, M.
2015-12-01
Scientific software is largely developed by individual scientists and represents a significant intellectual contribution to the field. As the scientific culture and funding agencies move towards an expectation that software be open-source, there is a corresponding need for mechanisms to cite software, both to provide credit and recognition to developers, and to aid in discoverability of software and scientific reproducibility. We assess the geodynamic modeling community's current citation practices by examining more than 300 predominantly self-reported publications utilizing scientific software in the past 5 years that is available through the Computational Infrastructure for Geodynamics (CIG). Preliminary results indicate that authors cite and attribute software either through citing (in rank order) peer-reviewed scientific publications, a user's manual, and/or a paper describing the software code. Attributions may be found directly in the text, in acknowledgements, in figure captions, or in footnotes. What is considered citable varies widely. Citations predominantly lack software version numbers or persistent identifiers to find the software package. Versioning may be implied through reference to a versioned user manual. Authors sometimes report code features used and whether they have modified the code. As an open-source community, CIG requests that researchers contribute their modifications to the repository. However, such modifications may not be contributed back to a repository code branch, decreasing the chances of discoverability and reproducibility. Survey results through CIG's Software Attribution for Geoscience Applications (SAGA) project suggest that lack of knowledge, tools, and workflows to cite codes are barriers to effectively implementing the emerging citation norms. Generated on-demand attributions on software landing pages and a prototype extensible plug-in to automatically generate attributions in codes are the first steps towards reproducibility.
Exploring supervised and unsupervised methods to detect topics in biomedical text
Lee, Minsuk; Wang, Weiqing; Yu, Hong
2006-01-01
Background Topic detection is a task that automatically identifies topics (e.g., "biochemistry" and "protein structure") in scientific articles based on information content. Topic detection will benefit many other natural language processing tasks including information retrieval, text summarization and question answering; and is a necessary step towards the building of an information system that provides an efficient way for biologists to seek information from an ocean of literature. Results We have explored the methods of Topic Spotting, a task of text categorization that applies the supervised machine-learning technique naïve Bayes to assign automatically a document into one or more predefined topics; and Topic Clustering, which applies unsupervised hierarchical clustering algorithms to aggregate documents into clusters such that each cluster represents a topic. We have applied our methods to detect topics of more than fifteen thousand articles that represent over sixteen thousand entries in the Online Mendelian Inheritance in Man (OMIM) database. We have explored bag of words as the features. Additionally, we have explored semantic features; namely, the Medical Subject Headings (MeSH) that are assigned to the MEDLINE records, and the Unified Medical Language System (UMLS) semantic types that correspond to the MeSH terms, in addition to bag of words, to facilitate the tasks of topic detection. Our results indicate that incorporating the MeSH terms and the UMLS semantic types as additional features enhances the performance of topic detection and that naïve Bayes has the highest accuracy, 66.4%, for predicting the topic of an OMIM article as one of the total twenty-five topics. Conclusion Our results indicate that the supervised topic spotting methods outperformed the unsupervised topic clustering; on the other hand, the unsupervised topic clustering methods have the advantages of being robust and applicable in real world settings. PMID:16539745
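As a rough illustration of the supervised "topic spotting" idea in the record above, the following minimal sketch trains a multinomial naïve Bayes classifier on bag-of-words counts. It is not the authors' implementation: the class name NaiveBayesTopicSpotter, the whitespace tokenizer, the add-one smoothing, and the four toy documents are all assumptions made for illustration, and it assigns a single topic per document rather than one or more.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lower-case whitespace tokenizer (a simplifying assumption)."""
    return text.lower().split()


class NaiveBayesTopicSpotter:
    """Hypothetical single-label multinomial naive Bayes over bag-of-words counts."""

    def fit(self, docs, topics):
        self.topic_counts = Counter(topics)       # number of documents per topic
        self.word_counts = defaultdict(Counter)   # word frequencies per topic
        for doc, topic in zip(docs, topics):
            self.word_counts[topic].update(tokenize(doc))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, doc):
        total_docs = sum(self.topic_counts.values())
        best_topic, best_score = None, -math.inf
        for topic, n_docs in self.topic_counts.items():
            counts = self.word_counts[topic]
            # Add-one (Laplace) smoothing over the training vocabulary.
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for word in tokenize(doc):
                if word in self.vocab:             # ignore unseen words
                    score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_topic, best_score = topic, score
        return best_topic


# Toy training data, invented for illustration (not OMIM records).
docs = [
    "enzyme kinetics substrate binding",
    "enzyme catalysis metabolic pathway",
    "protein fold helix domain",
    "crystal structure helix sheet",
]
topics = ["biochemistry", "biochemistry", "protein structure", "protein structure"]

model = NaiveBayesTopicSpotter().fit(docs, topics)
print(model.predict("helix domain structure"))
print(model.predict("enzyme substrate"))
```

Semantic features such as MeSH terms could, under the same assumptions, be appended to each document as extra tokens before training.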
A multi-ontology approach to annotate scientific documents based on a modularization technique.
Gomes, Priscilla Corrêa E Castro; Moura, Ana Maria de Carvalho; Cavalcanti, Maria Cláudia
2015-12-01
Scientific text annotation has become an important task for biomedical scientists. Nowadays, there is an increasing need for the development of intelligent systems to support new scientific findings. Public databases available on the Web provide useful data, but much more useful information is only accessible in scientific texts. Text annotation may help as it relies on the use of ontologies to maintain annotations based on a uniform vocabulary. However, it is difficult to use an ontology, especially one that covers a large domain. In addition, since scientific texts explore multiple domains, which are covered by distinct ontologies, it becomes even more difficult to deal with such a task. Moreover, there are dozens of ontologies in the biomedical area, and they are usually big in terms of the number of concepts. It is in this context that ontology modularization can be useful. This work presents an approach to annotate scientific documents using modules of different ontologies, which are built according to a module extraction technique. The main idea is to analyze a set of single-ontology annotations on a text to find out the user interests. Based on these annotations a set of modules is extracted from a set of distinct ontologies, and made available to the user for complementary annotation. The reduced size and focus of the extracted modules tend to facilitate the annotation task. An experiment was conducted to evaluate this approach, with the participation of a bioinformatician specialist of the Laboratory of Peptides and Proteins of the IOC/Fiocruz, who was interested in discovering new drug targets aiming at the combat of tropical diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
Uzun, Günalp; Mutluoğlu, Mesut; Bakir, Alev; Senocak, Mustafa S
2013-01-01
The full-text publication of abstracts presented at any given scientific meeting in peer-reviewed journals is accepted as a measure of scientific quality of that particular meeting. The aim of this study is to determine the full-text publication rate of abstracts presented at the 2005 Scientific Meeting of the Undersea and Hyperbaric Medical Society (UHMS). We identified the scientific abstracts presented at the 2005 UHMS meeting and searched the PubMed database (June 2005 to July 2010) for their corresponding full-text publication. We recorded the following parameters for each of the abstracts: number of authors, number of centers involved in the study, statistical methods used, country of origin of the study, study type, and subject of the abstract. We recorded the time to publication and the title of the journal if the abstract had been published in a peer-reviewed journal. Overall, we identified 187 abstracts presented at the 2005 UHMS meeting. Two of the abstracts were excluded from the study because they had been retracted from the meeting and six more because they had been already published as full-text articles at the time the meeting was held. Of the 179 abstracts, 62 (34.6%) were published as full-text articles within the succeeding five years. The mean (+/- SD) time to publication was 18.5 (+/- 13.6) months. Multivariate analysis with logistic regression identified "country of origin" and "the subject of the abstract" as independent predictors of full-text publication. We found that only one-third of the abstracts presented at the 2005 UHMS meeting were published as full-text articles within the succeeding five years. Although this rate is consistent with similar studies from various disciplines, further research is needed to identify the specific barriers to full-text publication of abstracts in the field of underwater and hyperbaric medicine.
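The arithmetic behind the reported publication rate can be retraced in a few lines. The sketch below uses only the counts stated in the abstract above; the variable and function names are illustrative assumptions.

```python
# Counts as reported in the abstract above.
identified = 187          # abstracts presented at the 2005 UHMS meeting
retracted = 2             # excluded: retracted from the meeting
already_published = 6     # excluded: already published as full text at meeting time
published_within_5y = 62  # published as full-text articles in the following five years


def publication_rate(published: int, eligible: int) -> float:
    """Return the full-text publication rate as a percentage."""
    if eligible <= 0:
        raise ValueError("eligible must be positive")
    return 100.0 * published / eligible


eligible = identified - retracted - already_published
rate = publication_rate(published_within_5y, eligible)
print(f"{published_within_5y}/{eligible} = {rate:.1f}%")
```

The result, 62 of 179 eligible abstracts or about 34.6%, matches the "one-third" summary in the abstract.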
Mini-Journal Inquiry Laboratory: A Case Study in a General Chemistry Kinetics Experiment
ERIC Educational Resources Information Center
Zhao, Ningfeng; Wardeska, Jeffrey G.
2011-01-01
The mini-journal curriculum for undergraduate science laboratories mirrors the format of scientific literature and helps students improve their learning through direct scientific practices. The lab embodies the essential features of scientific inquiry and replaces the traditional "cookbook" lab to engage students in active learning. A case study…
From the Horse's Mouth: What Scientists Say about Scientific Investigation and Scientific Knowledge
ERIC Educational Resources Information Center
Wong, Siu Ling; Hodson, Derek
2009-01-01
This study sought to identify prominent features of the nature of science (NOS) embedded in authentic scientific inquiry. Thirteen well-established scientists from different parts of the world, working in experimental or theoretical research, in both traditional fields such as astrophysics and rapidly growing research fields such as molecular…
Constructing a Scientific Explanation--A Narrative Account
ERIC Educational Resources Information Center
Yeo, Jennifer; Gilbert, John K.
2014-01-01
Studies analyzing explanations that have been constructed by science students have found that they were generally weak and lacked necessary features. The goal of this study was to establish the competencies that one needs to construct a scientific explanation. Scientific explanations can be looked at in three ways, in terms of their function, form…
The Use of Popular Science Articles in Teaching Scientific Literacy
ERIC Educational Resources Information Center
Parkinson, Jean; Adendorff, Ralph
2004-01-01
This article considers the use of popular science articles in teaching scientific literacy. Comparing the discourse features of popular science with research article and textbook science--the last two being target forms for students--it argues that popular science articles cannot serve as models for scientific writing. It does, however, suggest…
Common Characteristics of Models in Present-Day Scientific Practice
ERIC Educational Resources Information Center
Van Der Valk, Ton; Van Driel, Jan H.; De Vos, Wobbe
2007-01-01
Teaching the use of models in scientific research requires a description, in general terms, of how scientists actually use models in their research activities. This paper aims to arrive at defining common characteristics of models that are used in present-day scientific research. Initially, a list of common features of models and modelling, based…
Use and mis-use of supplementary material in science publications.
Pop, Mihai; Salzberg, Steven L
2015-11-03
Supplementary material is a ubiquitous feature of scientific articles, particularly in journals that limit the length of the articles. While the judicious use of supplementary material can improve the readability of scientific articles, its excessive use threatens the scientific review process and by extension the integrity of the scientific literature. In many cases supplementary material today is so extensive that it is reviewed superficially or not at all. Furthermore, citations buried within supplementary files rob other scientists of recognition of their contribution to the scientific record. These issues are exacerbated by the lack of guidance on the use of supplementary information from the journals to authors and reviewers. We propose that the removal of artificial length restrictions plus the use of interactive features made possible by modern electronic media can help to alleviate these problems. Many journals, in fact, have already removed article length limitations (as is the case for BMC Bioinformatics and other BioMed Central journals). We hope that the issues raised in our article will encourage publishers and scientists to work together towards a better use of supplementary information in scientific publishing.
Li, Yongyan
2013-06-01
Text-based plagiarism, or textual copying, typically in the form of replicating or patchwriting sentences in a row from sources, seems to be an issue of growing concern among scientific journal editors. Editors have emphasized that senior authors (typically supervisors of science students) should take the responsibility for educating novices against text-based plagiarism. To address a research gap in the literature as to how scientist supervisors perceive the issue of textual copying and what they do in educating their students, this paper reports an interview study with 14 supervisors at a research-oriented Chinese university. The study throws light on the potentiality of senior authors mentoring novices in English as an Additional Language (EAL) contexts and has implications for the efforts that can be made in the wider scientific community to support scientists in writing against text-based plagiarism.
The citation merit of scientific publications.
Crespo, Juan A; Ortuño-Ortín, Ignacio; Ruiz-Castillo, Javier
2012-01-01
We propose a new method to assess the merit of any set of scientific papers in a given field based on the citations they receive. Given a field and a citation impact indicator, such as the mean citation or the [Formula: see text]-index, the merit of a given set of [Formula: see text] articles is identified with the probability that a randomly drawn set of [Formula: see text] articles from a given pool of articles in that field has a lower citation impact according to the indicator in question. The method allows for comparisons between sets of articles of different sizes and fields. Using a dataset acquired from Thomson Scientific that contains the articles published in the periodical literature in the period 1998-2007, we show that the novel approach yields rankings of research units different from those obtained by a direct application of the mean citation or the [Formula: see text]-index.
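The definition in the record above, merit as the probability that a randomly drawn set of the same size has a lower citation impact, can be approximated by simulation. The sketch below is a Monte Carlo estimate and not the authors' procedure: the function names, the number of draws, and sampling without replacement are assumptions, and because the second indicator's name is elided in the record, the h-index helper is included only as one plausible choice of indicator.

```python
import random
from statistics import mean
from typing import Callable, Sequence


def h_index(citations: Sequence[int]) -> int:
    """Largest h such that at least h articles have at least h citations each."""
    h = 0
    for rank, cites in enumerate(sorted(citations, reverse=True), start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


def citation_merit(
    unit: Sequence[int],
    pool: Sequence[int],
    indicator: Callable[[Sequence[int]], float] = mean,
    draws: int = 10_000,
    seed: int = 0,
) -> float:
    """Estimate the share of random same-size sets from `pool` (a list of
    citation counts) whose indicator value is lower than that of `unit`."""
    rng = random.Random(seed)
    target = indicator(unit)
    n = len(unit)
    lower = sum(indicator(rng.sample(pool, n)) < target for _ in range(draws))
    return lower / draws


# Degenerate toy pool, invented for illustration: every article is uncited.
pool = [0] * 100
print(citation_merit([5, 5, 5], pool))   # every random draw has a lower mean
print(citation_merit([0, 0, 0], pool))   # no random draw has a lower mean
print(h_index([10, 8, 5, 4, 3]))
```

Because the estimate is a probability, it stays comparable across sets of different sizes and fields, which is the property the abstract emphasizes.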
Wagner, Mathias; Vicinus, Benjamin; Muthra, Sherieda T; Richards, Tereza A; Linder, Roland; Frick, Vilma Oliveira; Groh, Andreas; Rubie, Claudia; Weichert, Frank
2016-06-01
The continuous growth of medical sciences literature indicates the need for automated text analysis. Scientific writing which is neither unitary, transcending social situation nor defined by a timeless idea is subject to constant change as it develops in response to evolving knowledge, aims at different goals, and embodies different assumptions about nature and communication. The objective of this study was to evaluate whether publication dates should be considered when performing text mining. A search of PUBMED for combined references to chemokine identifiers and particular cancer related terms was conducted to detect changes over the past 36 years. Text analyses were performed using freeware available from the World Wide Web. TOEFL Scores of territories hosting institutional affiliations as well as various readability indices were investigated. Further assessment was conducted using Principal Component Analysis. Laboratory examination was performed to evaluate the quality of attempts to extract content from the examined linguistic features. The PUBMED search yielded a total of 14,420 abstracts (3,190,219 words). The range of findings in laboratory experimentation were coherent with the variability of the results described in the analyzed body of literature. Increased concurrence of chemokine identifiers together with cancer related terms was found at the abstract and sentence level, whereas complexity of sentences remained fairly stable. The findings of the present study indicate that concurrent references to chemokines and cancer increased over time whereas text complexity remained stable. Copyright © 2016 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Uzunöz, Abdulkadir
2018-01-01
The purpose of this study is to identify the conceptual mistakes frequently encountered in teaching geography such as latitude-parallel concepts, and to prepare conceptual change text based on the Scientific Storyline Method, in order to resolve the identified misconceptions. In this study, the special case method, which is one of the qualitative…
Feature selection from a facial image for distinction of sasang constitution.
Koo, Imhoi; Kim, Jong Yeol; Kim, Myoung Geun; Kim, Keun Ho
2009-09-01
Recently, oriental medicine has received attention for providing personalized medicine through consideration of the unique nature and constitution of individual patients. With the eventual goal of globalization, the current trend in oriental medicine research is the standardization by adopting western scientific methods, which could represent a scientific revolution. The purpose of this study is to establish methods for finding statistically significant features in a facial image with respect to distinguishing constitution and to show the meaning of those features. From facial photo images, facial elements are analyzed in terms of the distance, angle and the distance ratios, for which there are 1225, 61 250 and 749 700 features, respectively. Due to the very large number of facial features, it is quite difficult to determine truly meaningful features. We suggest a process for the efficient analysis of facial features including the removal of outliers, control for missing data to guarantee data confidence and calculation of statistical significance by applying ANOVA. We show the statistical properties of selected features according to different constitutions using the nine distances, 10 angles and 10 rates of distance features that are finally established. Additionally, the Sasang constitutional meaning of the selected features is shown here.
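The three feature counts quoted above (1225 distances, 61 250 angles, 749 700 distance ratios) are consistent with simple combinatorics over a fixed set of facial landmark points. The sketch below is an inference, not something the abstract states: the landmark count of 50 and the angle formula are assumptions chosen because they reproduce the reported numbers.

```python
from math import comb


def feature_counts(n_points: int) -> tuple[int, int, int]:
    """Candidate feature counts for n_points landmarks (assumed formulas)."""
    distances = comb(n_points, 2)   # one distance per pair of points
    angles = n_points * distances   # assumption: one angle per (point, pair); arithmetic match only
    ratios = comb(distances, 2)     # one ratio per pair of distances
    return distances, angles, ratios


# 50 landmarks is an assumed value, not given in the abstract.
print(feature_counts(50))
```

Under these assumptions the function returns the three counts reported in the abstract, which suggests why a significance-based selection step was needed to reduce the candidates to the final 29 features.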
Feature Selection from a Facial Image for Distinction of Sasang Constitution
Koo, Imhoi; Kim, Jong Yeol; Kim, Myoung Geun
2009-01-01
Recently, oriental medicine has received attention for providing personalized medicine through consideration of the unique nature and constitution of individual patients. With the eventual goal of globalization, the current trend in oriental medicine research is the standardization by adopting western scientific methods, which could represent a scientific revolution. The purpose of this study is to establish methods for finding statistically significant features in a facial image with respect to distinguishing constitution and to show the meaning of those features. From facial photo images, facial elements are analyzed in terms of the distance, angle and the distance ratios, for which there are 1225, 61 250 and 749 700 features, respectively. Due to the very large number of facial features, it is quite difficult to determine truly meaningful features. We suggest a process for the efficient analysis of facial features including the removal of outliers, control for missing data to guarantee data confidence and calculation of statistical significance by applying ANOVA. We show the statistical properties of selected features according to different constitutions using the nine distances, 10 angles and 10 rates of distance features that are finally established. Additionally, the Sasang constitutional meaning of the selected features is shown here. PMID:19745013
Physical Science Experiments for Scientific Glassblowing Technicians.
ERIC Educational Resources Information Center
Tillis, Samuel E.; Donaghay, Herbert C.
The twenty experiments in this text have been designed to give the scientific glassblowing technician the opportunity to use scientific glass apparatus in the study of physical science. Primary emphasis of these experiments is on the practical application of the physical science program as a working tool for the scientific glassblowing technician.…
Applications of Precipitation Feature Databases from GPM core and constellation Satellites
NASA Astrophysics Data System (ADS)
Liu, C.
2017-12-01
Using the observations from Global Precipitation Measurement (GPM) core and constellation satellites, global precipitation was quantitatively described from the perspective of precipitation systems and their properties. This presentation will introduce the development of precipitation feature databases, and several scientific questions that have been tackled using these databases, including the topics of global snow precipitation, extreme intensive convection, hail storms, extreme precipitation, and microphysical properties derived with dual frequency radars at the top of convective cores. As more and more observations of constellation satellites become available, it is anticipated that the precipitation feature approach will help to address a large variety of scientific questions in the future. For anyone who is interested, all the current precipitation feature databases are freely open to the public at: http://atmos.tamucc.edu/trmm/.
The Relationship in Biology between the Nature of Science and Scientific Inquiry
ERIC Educational Resources Information Center
Kremer, Kerstin; Specht, Christiane; Urhahne, Detlef; Mayer, Jürgen
2014-01-01
Informed understandings of nature of science and scientific inquiry are generally accepted goals of biology education. This article points out central features of scientific inquiry with relation to biology and the nature of science in general terms and focuses on the relationship of students' inquiry skills in biology and their beliefs on the…
Web Based Semi-automatic Scientific Validation of Models of the Corona and Inner Heliosphere
NASA Astrophysics Data System (ADS)
MacNeice, P. J.; Chulaki, A.; Taktakishvili, A.; Kuznetsova, M. M.
2013-12-01
Validation is a critical step in preparing models of the corona and inner heliosphere for future roles supporting either or both the scientific research community and the operational space weather forecasting community. Validation of forecasting quality tends to focus on a short list of key features in the model solutions, with an unchanging order of priority. Scientific validation exposes a much larger range of physical processes and features, and as the models evolve to better represent features of interest, the research community tends to shift its focus to other areas which are less well understood and modeled. Given the more comprehensive and dynamic nature of scientific validation, and the limited resources available to the community to pursue this, it is imperative that the community establish a semi-automated process which engages the model developers directly in an ongoing and evolving validation process. In this presentation we describe the ongoing design and development of a web-based facility to enable this type of validation of models of the corona and inner heliosphere, and report on the growing list of model results being generated and on strategies we have been developing to account for model results that incorporate adaptively refined numerical grids.
Rissing, Steven W.
2013-01-01
Most American colleges and universities offer gateway biology courses to meet the needs of three undergraduate audiences: biology and related science majors, many of whom will become biomedical researchers; premedical students meeting medical school requirements and preparing for the Medical College Admissions Test (MCAT); and students completing general education (GE) graduation requirements. Biology textbooks for these three audiences present a topic scope and sequence that correlates with the topic scope and importance ratings of the biology content specifications for the MCAT regardless of the intended audience. Texts for “nonmajors,” GE courses appear derived directly from their publisher's majors text. Topic scope and sequence of GE texts reflect those of “their” majors text and, indirectly, the MCAT. MCAT term density of GE texts equals or exceeds that of their corresponding majors text. Most American universities require a GE curriculum to promote a core level of academic understanding among their graduates. This includes civic scientific literacy, recognized as an essential competence for the development of public policies in an increasingly scientific and technological world. Deriving GE biology and related science texts from majors texts designed to meet very different learning objectives may defeat the scientific literacy goals of most schools’ GE curricula. PMID:24006392
Astronomy through the Skylab scientific airlocks.
NASA Technical Reports Server (NTRS)
Henize, K. G.; Weinberg, J. L.
1973-01-01
Description of Skylab astronomy experiments (other than the Apollo Telescope Mount experiments) designed to study the earth's atmosphere, particles near the spacecraft, various components of the background skylight, the spectra of the sun, and the features of stars, nebulae, and galaxies. Emphasis is placed on the eight experiments that will operate through the scientific airlocks in the Orbital Workshop. The major features of equipment to be used in each experiment are outlined together with characteristics and relevance of information expected in each case.
Mukherjee, Partha; Leroy, Gondy; Kauchak, David; Navarrete, Brianda Armenta; Diaz, Damian Y.; Colina, Sonia
2017-01-01
Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compared them with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, fewer negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However, in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied. PMID:29854201
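The median split and group comparison described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions: the text ids, difficulty scores and adverb counts are invented, and Welch's form of the t statistic is used for convenience because the abstract does not say which t-test variant was applied. No p-value is computed.

```python
from math import sqrt
from statistics import mean, median, variance


def median_split(scores):
    """Split text ids into easy (<= median) and difficult (> median) groups."""
    cut = median(scores.values())
    easy = [text_id for text_id, s in scores.items() if s <= cut]
    hard = [text_id for text_id, s in scores.items() if s > cut]
    return easy, hard


def welch_t(a, b):
    """Welch's t statistic for two independent samples (unequal variances)."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


# Hypothetical data: difficulty score and adverb count per text.
difficulty = {"t1": 1.0, "t2": 2.0, "t3": 3.0, "t4": 4.0}
adverbs = {"t1": 6, "t2": 8, "t3": 3, "t4": 5}

easy, hard = median_split(difficulty)
t_stat = welch_t([adverbs[t] for t in easy], [adverbs[t] for t in hard])
print(easy, hard, round(t_stat, 3))
```

A positive statistic here means the easy group has the higher mean for the feature; turning it into a significance decision would need the t distribution, which a statistics library provides.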
ERIC Educational Resources Information Center
Yang, Fang-Ying; Chang, Cheng-Chieh; Chen, Li-Ling; Chen, Yi-Chun
2016-01-01
The main purpose of this study was to explore learners' beliefs about science reading and scientific epistemic beliefs, and how these beliefs were associated with their understanding of science texts. About 400 10th graders were involved in the development and validation of the Beliefs about Science Reading Inventory (BSRI). To find the effects…
ERIC Educational Resources Information Center
Jian, Yu-Cin
2017-01-01
This study investigated the cognitive processes and reader characteristics of sixth graders who had good and poor performance when reading scientific text with diagrams. We first measured the reading ability and reading self-efficacy of sixth-grade participants, and then recorded their eye movements while they were reading an illustrated…
ERIC Educational Resources Information Center
Ishiwa, Koto; Sanjose, Vicente; Otero, Jose
2013-01-01
Background: A number of studies report that few questions are asked in classrooms and that many of them are shallow questions. Aims: This study investigates the way in which reading goals determine questioning on scientific texts. Reading goals were manipulated through two different tasks: reading for understanding versus reading to solve a…
ERIC Educational Resources Information Center
Wallen, Erik; Plass, Jan L.; Brunken, Roland
2005-01-01
Students participated in a study (n = 98) investigating the effectiveness of three types of annotations on three learning outcome measures. The annotations were designed to support the cognitive processes in the comprehension of scientific texts, with a function to aid either the process of selecting relevant information, organizing the…
Relevance, textual unity, and politeness in writing about science
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kreml, N.M.P.
1992-01-01
The question of whether there are social implications of linguistic choices in unifying a text is investigated empirically by this study which accounts for the interpretation of implicatures in conversation and written texts. It considers Relevance Theory (Sperber and Wilson 1988, Blakemore 1987, Blass 1990) to be the explanation of the unity of the text, as opposed to semantic theories of cohesion (Halliday and Hasan 1976) or pragmatic theories of coherence (van Dijk 1977). This study presents a model of three types of textual unifiers: overt (referring specifically to the text), embedded (referring to intra- and extra-textual information), and inference (not referring to the text at all). It hypothesizes that different genres are characterized by the predominance of different types of textual unifiers, and that readers will prefer those texts that rely on inferential unifiers which emphasize the reader's ability to participate in creating the meaning of the text. Eighteen texts of 275 words each are selected from three genres: scientific magazines, introductory science textbooks, and essays on science. The texts are found to vary significantly by genre in the type of textual unifier used. An Overtness Index expresses the ratio of the marked forms: science textbooks have more Overt unifiers (such as connective phrases) and thus a high Overtness Index; essays rely more on Inference unifiers (not represented by words) and thus have a low Overtness Index. The texts are submitted to 188 readers, and a significantly high number of all types of readers prefer the texts with the lower Overtness Indices, namely the essays. Thus a low Overtness Index is one feature of texts preferred by readers, supporting the hypotheses that genres of texts vary in the type of unifier used and that readers prefer texts that allow them to participate in constructing the meaning of the text.
TEES 2.2: Biomedical Event Extraction for Diverse Corpora
2015-01-01
Background: The Turku Event Extraction System (TEES) is a text mining program developed for the extraction of events, complex biomedical relationships, from scientific literature. Based on a graph-generation approach, the system detects events with the use of a rich feature set built via dependency parsing. The TEES system has achieved record performance in several of the shared tasks of its domain, and continues to be used in a variety of biomedical text mining tasks. Results: The TEES system was quickly adapted to the BioNLP'13 Shared Task in order to provide a public baseline for derived systems. An automated approach was developed for learning the underlying annotation rules of event type, allowing immediate adaptation to the various subtasks, and leading to a first place in four out of eight tasks. The system for the automated learning of annotation rules is further enhanced in this paper to the point of requiring no manual adaptation to any of the BioNLP'13 tasks. Further, the scikit-learn machine learning library is integrated into the system, bringing a wide variety of machine learning methods usable with TEES in addition to the default SVM. A scikit-learn ensemble method is also used to analyze the importances of the features in the TEES feature sets. Conclusions: The TEES system was introduced for the BioNLP'09 Shared Task and has since then demonstrated good performance in several other shared tasks. By applying the current TEES 2.2 system to multiple corpora from these past shared tasks, an overarching analysis of the most promising methods and possible pitfalls in the evolving field of biomedical event extraction is presented. PMID:26551925
TEES 2.2: Biomedical Event Extraction for Diverse Corpora.
Björne, Jari; Salakoski, Tapio
2015-01-01
The Turku Event Extraction System (TEES) is a text mining program developed for the extraction of events, complex biomedical relationships, from scientific literature. Based on a graph-generation approach, the system detects events with the use of a rich feature set built via dependency parsing. The TEES system has achieved record performance in several of the shared tasks of its domain, and continues to be used in a variety of biomedical text mining tasks. The TEES system was quickly adapted to the BioNLP'13 Shared Task in order to provide a public baseline for derived systems. An automated approach was developed for learning the underlying annotation rules of event type, allowing immediate adaptation to the various subtasks, and leading to a first place in four out of eight tasks. The system for the automated learning of annotation rules is further enhanced in this paper to the point of requiring no manual adaptation to any of the BioNLP'13 tasks. Further, the scikit-learn machine learning library is integrated into the system, bringing a wide variety of machine learning methods usable with TEES in addition to the default SVM. A scikit-learn ensemble method is also used to analyze the importances of the features in the TEES feature sets. The TEES system was introduced for the BioNLP'09 Shared Task and has since then demonstrated good performance in several other shared tasks. By applying the current TEES 2.2 system to multiple corpora from these past shared tasks, an overarching analysis of the most promising methods and possible pitfalls in the evolving field of biomedical event extraction is presented.
Relevance popularity: A term event model based feature selection scheme for text classification.
Feng, Guozhong; An, Baiguo; Yang, Fengqin; Wang, Han; Zhang, Libiao
2017-01-01
Feature selection is a practical approach for improving the performance of text classification methods by optimizing the feature subsets input to classifiers. In traditional feature selection methods such as information gain and chi-square, the number of documents that contain a particular term (i.e. the document frequency) is often used. However, the frequency of a given term appearing in each document has not been fully investigated, even though it is a promising feature to produce accurate classifications. In this paper, we propose a new feature selection scheme based on a term event multinomial naive Bayes probabilistic model. According to the model assumptions, the matching score function, which is based on the prediction probability ratio, can be factorized. Finally, we derive a feature selection measurement for each term after replacing inner parameters by their estimators. On a benchmark English text dataset (20 Newsgroups) and a Chinese text dataset (MPH-20), our numerical experiment results obtained from using two widely used text classifiers (naive Bayes and support vector machine) demonstrate that our method outperformed the representative feature selection methods.
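The distinction this abstract draws, between how many documents contain a term and how often the term occurs inside each document, can be illustrated with a short sketch. The toy corpus is invented, and the code only contrasts the two statistics; it does not implement the feature selection measure proposed in the paper.

```python
from collections import Counter


def document_frequency(docs):
    """Number of documents that contain each term at least once."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # each document contributes at most 1 per term
    return df


def term_frequencies(docs):
    """Per-document occurrence counts, the information document frequency discards."""
    return [Counter(tokens) for tokens in docs]


# Hypothetical tokenized corpus of three documents.
docs = [
    ["gene", "gene", "gene", "cell"],
    ["gene", "market"],
    ["market", "market"],
]

df = document_frequency(docs)
tf = term_frequencies(docs)
print(df["gene"], df["market"])        # both terms appear in 2 documents
print(tf[0]["gene"], tf[1]["gene"])    # but "gene" occurs 3 times in one and once in another
```

Two terms with the same document frequency can have very different within-document counts, which is the signal a term event model can exploit.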
Information extraction from full text scientific articles: where are the keywords?
Shah, Parantu K; Perez-Iratxeta, Carolina; Bork, Peer; Andrade, Miguel A
2003-05-29
To date, many of the methods for information extraction of biological information from scientific articles are restricted to the abstract of the article. However, full text articles in electronic version, which offer larger sources of data, are currently available. Several questions arise as to whether the effort of scanning full text articles is worthwhile, or whether the information that can be extracted from the different sections of an article can be relevant. In this work we addressed those questions, showing that the keyword content of the different sections of a standard scientific article (abstract, introduction, methods, results, and discussion) is very heterogeneous. Although the abstract contains the best ratio of keywords per total of words, other sections of the article may be a better source of biologically relevant data.
Medeiros, Aline da Silva
2018-03-01
This article reflects on the scientific authorship of Pedro Luiz Napoleão Chernoviz, based on his Dicionário de medicina popular, which was published in six editions between 1842 and 1890. The first part of the text discusses Chernoviz's position within the regimes of scientific authorship which were present in the medical community in Rio de Janeiro. Next, we analyze the author's arguments justifying a text that popularized medical science while this field strove for exclusivity in the practice of medicine. Finally, we suggest new meanings around Chernoviz's scientific authorship based on how the Dicionário was used and read by an initiated public.
JoVE: the Journal of Visualized Experiments.
Vardell, Emily
2015-01-01
The Journal of Visualized Experiments (JoVE) is the world's first scientific video journal and is designed to communicate research and scientific methods in an innovative, intuitive way. JoVE includes a wide range of biomedical videos, from biology to immunology and bioengineering to clinical and translational medicine. This column describes the browsing and searching capabilities of JoVE, as well as its additional features (including the JoVE Scientific Education Database designed for students in scientific fields).
Linguistic Features of Middle School Environmental Education Texts.
ERIC Educational Resources Information Center
Chenhansa, Suporn; Schleppegrell, Mary
1998-01-01
The language used in environmental education texts has linguistic features that affect students' comprehension of concepts and their ability to envision solutions to environmental problems. Findings indicate that features of texts such as abstract nouns and lack of explicit agents impede students' full comprehension of complex issues and obscure…
Study on Hybrid Image Search Technology Based on Texts and Contents
NASA Astrophysics Data System (ADS)
Wang, H. T.; Ma, F. L.; Yan, C.; Pan, H.
2018-05-01
Image search based on texts and on contents was studied first, each separately. For text-based search, an image feature extraction method was put forward that integrates statistical and topic features, addressing the limitation of extracting keywords from the statistical features of words alone. For content-based search, a search-by-image method based on multi-feature fusion was put forward, addressing the imprecision of content-based image search that relies on a single feature. Because the text-based and content-based methods differ and are difficult to fuse directly, a layered search method was then put forward that relies primarily on text-based image search and additionally on content-based image search. The feasibility and effectiveness of the hybrid search algorithm were experimentally verified.
Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M
2012-10-01
In biological research, establishing the prior art by searching and collecting information already present in the domain is as important as the experiments themselves. To obtain a complete overview of the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured, and condensed. The information content in scientific literature is largely unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics.
The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from cattle and pigs. For this purpose, the rather noncomprehensive resources of pig and cattle gene and protein terminologies were enriched with orthologue synonyms and integrated in the NER platform ProMiner, which is successfully used in the human genomics domain. Based on the performance tests done, the present system achieved a fair performance with precision 0.64, recall 0.74, and an F1 measure of 0.69 in a test scenario based on cattle literature.
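The three figures reported at the end of this abstract are related by the standard definition of the F1 measure as the harmonic mean of precision and recall. The short sketch below recomputes it from the quoted precision and recall; the function name is an illustrative choice and is not taken from the paper.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for the cattle literature test scenario.
print(round(f1_score(0.64, 0.74), 2))  # prints 0.69
```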
Activity Structures and the Unfolding of Problem-Solving Actions in High-School Chemistry Classrooms
NASA Astrophysics Data System (ADS)
Criswell, Brett A.; Rushton, Greg T.
2014-02-01
In this paper, we argue for a more systematic approach for studying the relationship between classroom practices and scientific practices—an approach that will likely better support the systemic reforms being promoted in the Next Generation Science Standards in the USA and similar efforts in other countries. One component of that approach is looking at how the nature of the activity structure may influence the relative alignment between classroom and scientific practices. To that end, we build on previously published research related to the practices utilized by five high-school chemistry teachers as they enacted problem-solving activities in which students were likely to generate proposals that were not aligned with normative scientific understandings. In that prior work, our analysis had emphasized micro-level features of the talk interactions and how they related to the way students' ideas were explored; in the current paper, the analysis zooms out to consider the macro-level nature of the enactments associated with the activity structure of each lesson examined. Our data show that there were two general patterns to the activity structure across the 14 lessons scrutinized, and that each pattern had associated with it a constellation of features that impinged on the way the problem space was navigated. A key finding is that both activity structures (the expansive and the open) had features that aligned with scientific practices espoused in the Next Generation Science Standards—and both had features that were not aligned with those practices. We discuss the nature of these two structures, evidence of the relationship of each structure to key features of how the lessons unfolded, and the implications of these findings for both future research and the training of teachers.
ERIC Educational Resources Information Center
Britt, M. Anne; Richter, Tobias; Rouet, Jean-François
2014-01-01
In this article, we examine the mental processes and representations that are required of laypersons when learning about science issues from texts. We begin by defining scientific literacy as the ability to understand and critically evaluate scientific content in order to achieve one's goals. We then present 3 challenges of learning from…
The Origin of Chondrules and Chondrites
NASA Astrophysics Data System (ADS)
Sears, Derek W. G.
2005-01-01
Drawing on research from the various scientific disciplines involved, this text summarizes the origin and history of chondrules and chondrites. Including citations to every published paper on the topic, it forms a comprehensive bibliography of the latest research. In addition, extensive illustrations provide a clear visual representation of the scientific theories. The text will be a valuable reference for graduate students and researchers in planetary science, geology and astronomy.
ENHANCING SCIENTIFIC COLLABORATION THROUGH QUALITY ASSURANCE
The basic features of the Quality Assurance Program have been in existence since the early 1980's, but this poster will highlight some topics that have emerged more recently, in particular the Agency's laboratory competency policy, the information quality guidelines, and scientif...
Mujtaba, Ghulam; Shuib, Liyana; Raj, Ram Gopal; Rajandram, Retnagowri; Shaikh, Khairunisa
2018-07-01
Automatic text classification techniques are useful for classifying plaintext medical documents. This study aims to automatically predict the cause of death from free text forensic autopsy reports by comparing various schemes for feature extraction, term weighting or feature value representation, text classification, and feature reduction. For experiments, the autopsy reports belonging to eight different causes of death were collected, preprocessed and converted into 43 master feature vectors using various schemes for feature extraction, representation, and reduction. Six different text classification techniques were applied on these 43 master feature vectors to construct a classification model that can predict the cause of death. Finally, classification model performance was evaluated using four performance measures, i.e. overall accuracy, macro precision, macro-F-measure, and macro recall. From experiments, it was found that unigram features obtained the highest performance compared to bigram, trigram, and hybrid-gram features. Furthermore, in feature representation schemes, term frequency, and term frequency with inverse document frequency obtained similar and better results when compared with binary frequency, and normalized term frequency with inverse document frequency. Furthermore, the chi-square feature reduction approach outperformed Pearson correlation, and information gain approaches. Finally, in text classification algorithms, support vector machine classifier outperforms random forest, Naive Bayes, k-nearest neighbor, decision tree, and ensemble-voted classifier. Our results and comparisons hold practical importance and serve as references for future works. Moreover, the comparison outputs will act as state-of-the-art techniques to compare future proposals with existing automated text classification techniques. Copyright © 2017 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
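The four evaluation measures named in this abstract can be computed from true and predicted labels as sketched below. This is an illustration under stated assumptions: the labels are invented, and macro-F is taken here as the unweighted mean of per-class F1 scores, one common convention that the abstract does not confirm.

```python
def macro_metrics(y_true, y_pred):
    """Overall accuracy plus unweighted per-class averages of precision, recall and F1."""
    pairs = list(zip(y_true, y_pred))
    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    n = len(labels)
    return {
        "accuracy": sum(t == p for t, p in pairs) / len(pairs),
        "macro_precision": sum(precisions) / n,
        "macro_recall": sum(recalls) / n,
        "macro_f1": sum(f1s) / n,
    }


# Hypothetical labels for four reports and two causes of death.
y_true = ["cause_a", "cause_a", "cause_b", "cause_b"]
y_pred = ["cause_a", "cause_b", "cause_b", "cause_b"]
metrics = macro_metrics(y_true, y_pred)
print(metrics)
```

Macro averaging gives every class the same weight regardless of how many reports it has, which matters when some causes of death are rare.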
Full-Text Searching on Major Supermarket Systems: Dialog, Data-Star, and Nexis.
ERIC Educational Resources Information Center
Tenopir, Carol; Berglund, Sharon
1993-01-01
Examines the similarities, differences, and full-text features of the three most-used online systems for full-text searching in general libraries: DIALOG, Data-Star, and NEXIS. Overlapping databases, unique sources, search features, proximity operators, set building, language enhancement and word equivalencies, and display features are discussed.…
Fate of abstracts presented at the 2008 European Congress of Physical and Rehabilitation Medicine.
Allart, E; Beaucamp, F; Tiffreau, V; Thevenon, A
2015-08-01
The subsequent full-text publication of abstracts presented at a scientific congress reflects the latter's scientific quality. The aim of this paper was to evaluate the publication rate for abstracts presented at the 2008 European Congress of Physical and Rehabilitation Medicine (ECPRM), characterize the publications and identify factors that were predictive of publication. The study is a bibliographic search. We used the PubMed database to search for subsequent publication of abstracts. We screened the abstracts' characteristics for features that were predictive of publication, such as the status of the authors, the topic and the type of work. We performed univariate analyses and a logistic regression analysis. Of 779 abstracts presented at ECPRM 2008, 169 (21.2%) were subsequently published. The mean time to publication was 12±15.7 months and the mean impact factor of the publishing journals was 2.05±2.1. In a univariate analysis, university status (P<10^-6), geographic origin (P=10^-3), oral presentation (P<10^-6), and original research (P<10^-6) (and particularly multicentre trials [P<0.01] and randomized controlled trials [P=10^-3]) were predictive of publication. In a logistic regression analysis, oral presentation (odds ratio [OR]=0.37) and university status (OR=0.36) were significant, independent predictors of publication. The ECPRM 2008 publication rate and impact factor were relatively low when compared with most other national and international conferences in this field. University status, the type of abstract and oral presentation were predictive of subsequent publication.
NASA Astrophysics Data System (ADS)
Buslov, A. S.; Kotov, Yu. D.; Yurov, V. N.; Bessonov, M. V.; Kalmykov, P. A.; Oreshnikov, E. M.; Alimov, A. M.; Tumanov, A. V.; Zhuchkova, E. A.
2011-06-01
This paper deals with the organizational structure of ground-based receiving, processing, and dissemination of scientific information created by the Astrophysics Institute of the Scientific Research Nuclear University, Moscow Engineering Physics Institute. Hardware structure and software features are described. The principles are given for forming sets of control commands for scientific equipment (SE) devices, and statistical data are presented on the operation of the facility during flight tests of the spacecraft (SC) in the course of one year.
PaperBLAST: Text Mining Papers for Information about Homologs
Price, Morgan N.; Arkin, Adam P.
2017-08-15
Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST’s database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins’ functions.
PaperBLAST: Text Mining Papers for Information about Homologs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Price, Morgan N.; Arkin, Adam P.
Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST’s database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins’ functions.
PaperBLAST: Text Mining Papers for Information about Homologs
Arkin, Adam P.
2017-01-01
ABSTRACT Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST’s database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. PaperBLAST is available at http://papers.genomics.lbl.gov/. IMPORTANCE With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins’ functions. PMID:28845458
PaperBLAST: Text Mining Papers for Information about Homologs.
Price, Morgan N; Arkin, Adam P
2017-01-01
Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST's database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. PaperBLAST is available at http://papers.genomics.lbl.gov/. IMPORTANCE With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins' functions.
Shatkay, Hagit; Pan, Fengxia; Rzhetsky, Andrey; Wilbur, W. John
2008-01-01
Motivation: Much current research in biomedical text mining is concerned with serving biologists by extracting certain information from scientific text. We note that there is no ‘average biologist’ client; different users have distinct needs. For instance, as noted in past evaluation efforts (BioCreative, TREC, KDD) database curators are often interested in sentences showing experimental evidence and methods. Conversely, lab scientists searching for known information about a protein may seek facts, typically stated with high confidence. Text-mining systems can target specific end-users and become more effective, if the system can first identify text regions rich in the type of scientific content that is of interest to the user, retrieve documents that have many such regions, and focus on fact extraction from these regions. Here, we study the ability to characterize and classify such text automatically. We have recently introduced a multi-dimensional categorization and annotation scheme, developed to be applicable to a wide variety of biomedical documents and scientific statements, while intended to support specific biomedical retrieval and extraction tasks. Results: The annotation scheme was applied to a large corpus in a controlled effort by eight independent annotators, where three individual annotators independently tagged each sentence. We then trained and tested machine learning classifiers to automatically categorize sentence fragments based on the annotation. We discuss here the issues involved in this task, and present an overview of the results. The latter strongly suggest that automatic annotation along most of the dimensions is highly feasible, and that this new framework for scientific sentence categorization is applicable in practice. Contact: shatkay@cs.queensu.ca PMID:18718948
Cherry Featured in NCI’s Spotlight on Scientists Video Series | Poster
James Cherry, Ph.D., learned at an early age that education is crucial to success. He credits his mentors, among them his grandmother; Shepherd University professor Burton Lidgerding, Ph.D.; David Munroe, Ph.D., Frederick National Lab; and Robert J. Hohman, Ph.D., National Institute of Allergy and Infectious Diseases, for guiding him to the career he has today. Cherry, scientific program director, Office of Scientific Operations (OSO), NCI at Frederick, is one of the scientists featured in NCI’s Spotlight on Scientists video series.
Representation of Scientific Methodology in Secondary Science Textbooks
ERIC Educational Resources Information Center
Binns, Ian C.
2009-01-01
The purpose of this investigation was to assess the representation of scientific methodology in secondary science textbooks. More specifically, this study looked at how textbooks introduced scientific methodology and to what degree the examples from the rest of the textbook, the investigations, and the images were consistent with the text's…
Representation of Scientific Methodology in Secondary Science Textbooks
ERIC Educational Resources Information Center
Binns, Ian C.; Bell, Randy L.
2015-01-01
This study explored how eight widely used secondary science textbooks described scientific methodology and to what degree the textbooks' examples and investigations were consistent with this description. Data consisted of all text from student and teacher editions that referred to scientific methodology and all investigations. Analysis used an…
The Influence of Group Dynamics on Collaborative Scientific Argumentation
ERIC Educational Resources Information Center
Ryu, Suna; Sandoval, William A.
2015-01-01
Research has addressed what instructional conditions may inhibit or promote scientific argumentation. Little research, however, has paid attention to interpersonal factors that influence collaborative argumentation. The present study examines the ways interpersonal factors affected group dynamics, which influence the features of collaborative…
Pesticides in the atmosphere: distribution, trends, and governing factors
Majewski, Michael S.; Capel, Paul D.
1996-01-01
Most people know about the presence and health effects of pesticide residues in the water they drink. However, they may not realize the impact of atmospheric transportation and deposition of pesticides on water quality. Scientific studies of pesticides in various atmospheric matrices (air, rain, snow, aerosols, and fog) provide some of the answers. Pesticides in the Atmosphere focuses on the review and interpretation of direct measurements of pesticides in the environment. An exhaustive compilation, the book examines hundreds of studies in detailed tabular listings, with accompanying maps that include such features as spatial and temporal domain studies, target analytes, detection limits, and compounds detected. Working with the foundation of forty years of scientific studies, the editors synthesize this research to characterize the common threads and main conclusions. They use this information to identify where we need to improve our understanding of pesticides in the atmosphere and their significance to water quality. Pesticides in the Atmosphere serves as a resource, text, and reference to a wide spectrum of scientists, water managers, and students. It includes extensive compilations of references, interpretive analyses and conclusions. For those not familiar with the atmospheric transportation and deposition of pesticides it provides a comprehensive introduction.
Geopotential research mission, science, engineering and program summary
NASA Technical Reports Server (NTRS)
Keating, T. (Editor); Taylor, P. (Editor); Kahn, W. (Editor); Lerch, F. (Editor)
1986-01-01
This report is based upon the accumulated scientific and engineering studies pertaining to the Geopotential Research Mission (GRM). The scientific need and justification for the measurement of the Earth's gravity and magnetic fields are discussed. Emphasis is placed upon the studies and conclusions of scientific organizations and NASA advisory groups. The engineering design and investigations performed over the last 4 years are described, and a spacecraft design capable of fulfilling all scientific objectives is presented. In addition, critical features of the scientific requirements and state-of-the-art limitations of spacecraft design, mission flight performance, and data processing are discussed.
ERIC Educational Resources Information Center
Kotani, Katsunori; Yoshimi, Takehiko; Isahara, Hitoshi
2011-01-01
The present paper introduces and evaluates a readability measurement method designed for learners of EFL (English as a foreign language). The proposed readability measurement method (a regression model) estimates the text readability based on linguistic features, such as lexical, syntactic and discourse features. Text readability refers to the…
The Texts of Literacy Instruction: Obstacles to or Opportunities for Educational Equity?
ERIC Educational Resources Information Center
Hiebert, Elfrieda H.
2017-01-01
Texts are a central part of reading. Yet our understandings of appropriate text features and distributions of text diets at different points in students' reading development are limited. The thesis of the essay is that, if the trajectory of struggling readers is to change, attention is needed to the features of texts and students' text diets,…
Toward Routine Automatic Pathway Discovery from On-line Scientific Text Abstracts.
Ng; Wong
1999-01-01
We are entering a new era of research where the latest scientific discoveries are often first reported online and are readily accessible by scientists worldwide. This rapid electronic dissemination of research breakthroughs has greatly accelerated the current pace in genomics and proteomics research. The race to the discovery of a gene or a drug has now become increasingly dependent on how quickly a scientist can scan through the voluminous amount of information available online to construct the relevant picture (such as protein-protein interaction pathways) as it takes shape amongst the rapidly expanding pool of globally accessible biological data (e.g. GENBANK) and scientific literature (e.g. MEDLINE). We describe a prototype system for automatic pathway discovery from on-line text abstracts, combining technologies that (1) retrieve research abstracts from online sources, (2) extract relevant information from the free texts, and (3) present the extracted information graphically and intuitively. Our work demonstrates that this framework allows us to routinely scan online scientific literature for automatic discovery of knowledge, giving modern scientists the necessary competitive edge in managing the information explosion in this electronic age.
Sherlock Holmes as a Social Scientist.
ERIC Educational Resources Information Center
Ward, Veronica; Orbell, John
1988-01-01
Presents a way of teaching the scientific method through studying the adventures of Sherlock Holmes. Asserting that Sherlock Holmes used the scientific method to solve cases, the authors construct Holmes' method through excerpts from novels featuring his adventures. Discusses basic assumptions, paradigms, theory building, and testing. (SLM)
The Literacy Component of Mathematical and Scientific Literacy
ERIC Educational Resources Information Center
Yore, Larry D.; Pimm, David; Tuan, Hsiao-Lin
2007-01-01
This opening article of the Special Issue makes an argument for parallel definitions of scientific literacy and mathematical literacy that have shared features: importance of general cognitive and metacognitive abilities and reasoning/thinking and discipline-specific language, habits-of-mind/emotional dispositions, and information communication…
A Primer on Disseminating Applied Quantitative Research
ERIC Educational Resources Information Center
Bell, Bethany A.; DiStefano, Christine; Morgan, Grant B.
2010-01-01
Transparency and replication are essential features of scientific inquiry, yet scientific communications of applied quantitative research are often lacking in much-needed procedural information. In an effort to promote researchers dissemination of their quantitative studies in a cohesive, detailed, and informative manner, the authors delineate…
Some Grammatical Problems in Scientific English.
ERIC Educational Resources Information Center
Halliday, M. A. K.
While native and non-native English-speakers may approach scientific English differently, the same features cause difficulty for both groups. The difficulties generally occur more with grammar and the complex relationships between terms than with vocabulary, and may be classified in seven categories: interlocking definitions, technical…
Using R for large spatiotemporal data sets
NASA Astrophysics Data System (ADS)
Pebesma, Edzer
2017-04-01
Writing and sharing scientific software is a means to communicate scientific ideas for finding scientific consensus, no more and no less than writing and sharing scientific papers is. Important factors for successful communication are adopting an open source environment, and using a language that is understood by many. For many scientists, R's combination of rich data abstraction and highly exposed data structures makes it an attractive communication tool. This paper discusses the development of spatial and spatiotemporal data handling and analysis with R since 2000, and will point to some of R's strengths and weaknesses in a historical perspective. We will also discuss a new, S3-based package for feature data ("Simple Features for R"), and point to a way forward into the data science realm, where pipeline-based workflows are assumed. Finally, we will discuss how, in a similar vein, massive satellite or climate model data sets, potentially held in a cloud environment, can be handled and analyzed with R.
Geographic names of the Antarctic
Alberts, Fred G.
1995-01-01
This gazetteer contains 12,710 names approved by the United States Board on Geographic Names and the Secretary of the Interior for features in Antarctica and the area extending northward to the Antarctic Convergence. Included in this geographic area, the Antarctic region, are the off-lying South Shetland Islands, the South Orkney Islands, the South Sandwich Islands, South Georgia, Bouvetøya, Heard Island, and the Balleny Islands. These names have been approved for use by U.S. Government agencies. Their use by the Antarctic specialist and the public is highly recommended for the sake of accuracy and uniformity. This publication, which supersedes previous Board gazetteers or lists for the area, contains names approved as recently as December 1994. The basic name coverage of this gazetteer corresponds to that of maps at the scale of 1:250,000 or larger for coastal Antarctica, the off-lying islands, and isolated mountains and ranges of the continent. Much of the interior of Antarctica is a featureless ice plateau. That area has been mapped at a smaller scale and is nearly devoid of toponyms. All of the names are for natural features, such as mountains, glaciers, peninsulas, capes, bays, islands, and subglacial entities. The names of scientific stations have not been listed alphabetically, but they may appear in the texts of some decisions. For the names of submarine features, reference should be made to the Gazetteer of Undersea Features, 4th edition, U.S. Board on Geographic Names, 1990.
Public Knowledge, Private Minds: Meaning Making on the Pathways of Science Communication
NASA Astrophysics Data System (ADS)
Davis, Pryce R.
Every day people are inundated with news reports about the latest scientific research. How these texts enlighten or misinform the general public is a central question in both the research literature and discussions in popular culture. However, both research and popular discussion often take on deficit views of these texts, and the capabilities of readers to critically engage with them, and treat them as static, one-way conduits that transfer information to a passive audience. In contrast, I advocate treating popular science texts as the result of a chain of consumption and production that are actively shaped by the varied perspectives of scientists, communicators, and members of the general public. My work envisions all of these actors as science learners who simultaneously act as both producers and consumers of science, and who interact with one another through in-the-moment meaning making. This dissertation examines how the meaning of scientific research is filtered and transformed in moments of interaction and knowledge construction as it moves along this pathway of science communication from scientists to the general public. I present the results of a study that attempts to follow pieces of recent scientific research as they work their way from scientists to publication as popular science news stories, and ultimately to the public. To that end, I collected data from three types of actors involved in the paths of science communication, as well as the texts they read and generate. These actors include (1) the scientists who performed the research, (2) the reporters tasked with writing about it for popular dissemination, and (3) members of the public who must read and interpret the research. The texts I analyze include: peer-reviewed scientific journal articles, university-produced news briefs, popular press science stories, and various text-based conversations between scientists and reporters.
Through an analysis of texts, individual interviews, and video-recorded interactions between actors, I demonstrate how individual meaning making shapes scientific understanding and how the problems observed in the public's understanding of science are by-products of properties of the process of science communication itself rather than the fault of individual actors.
NASA Astrophysics Data System (ADS)
Ding, Dan Xiong
The passive voice is a major stylistic feature of modern scientific discourse, but such a feature did not dominate scientific writing until the 1890s. It has its roots in the philosophical thoughts of experimental science of Francis Bacon and his followers such as Thomas Sprat and John Locke. In the early seventeenth century, Bacon called for a new science that emphasized collective knowledge of nature. Such a science was a cooperative and public enterprise in which scientists should work as a group to advance knowledge of nature. When science was moving gradually toward a public enterprise from the early seventeenth century, the passive voice gradually replaced the active voice in science writing as a dominant stylistic feature. The passive voice in scientific writing is thus historically and socially conditioned. Scientists take advantage of the linguistic functions of the passive voice to serve their rhetorical and pragmatic purposes such as presenting experiments as they are for others to reproduce and verify the results. It embodies two major conventions of scientific communities: (1) science is a public enterprise and (2) it is also a cooperative venture. Other conventions are related to these two: the collective authority of a scientific community is above the personal authority of any one individual scientist; science is not an infallible force, so any research result needs to be verified by a scientific community before it becomes knowledge; scientists use the passive voice to approach their writing to make it appear as if it were objective; and science is a human profession. Therefore, we need to teach science students to use the passive voice, and more importantly, why and when to use it. We should emphasize writing practice to have students see that they use passives rhetorically to present experimental processes, materials and methods.
Scaffolding Students' Independent Decoding of Unfamiliar Text with a Prototype of an eBook-Feature
ERIC Educational Resources Information Center
Gissel, Stig T.
2015-01-01
This study was undertaken to design, evaluate and refine an eBook-feature that supports students' decoding of unfamiliar text. The feature supports students' independent reading of eBooks with text-to-speech, graded support in the form of syllabification and rhyme analogy, and by dividing the word material into different categories based on the…
Visual affective classification by combining visual and text features.
Liu, Ningning; Wang, Kai; Jin, Xin; Gao, Boyang; Dellandréa, Emmanuel; Chen, Liming
2017-01-01
Affective analysis of images in social networks has drawn much attention, and the texts surrounding images are proven to provide valuable semantic meanings about image content, which can hardly be represented by low-level visual features. In this paper, we propose a novel approach for the visual affective classification (VAC) task. This approach combines visual representations along with novel text features through a fusion scheme based on Dempster-Shafer (D-S) Evidence Theory. Specifically, we not only investigate different types of visual features and fusion methods for VAC, but also propose textual features to effectively capture emotional semantics from the short text associated with images based on word similarity. Experiments are conducted on three publicly available databases: the International Affective Picture System (IAPS), the Artistic Photos and the MirFlickr Affect set. The results demonstrate that the proposed approach combining visual and textual features provides promising results for the VAC task.
Visual affective classification by combining visual and text features
Liu, Ningning; Wang, Kai; Jin, Xin; Gao, Boyang; Dellandréa, Emmanuel; Chen, Liming
2017-01-01
Affective analysis of images in social networks has drawn much attention, and the texts surrounding images are proven to provide valuable semantic meanings about image content, which can hardly be represented by low-level visual features. In this paper, we propose a novel approach for the visual affective classification (VAC) task. This approach combines visual representations along with novel text features through a fusion scheme based on Dempster-Shafer (D-S) Evidence Theory. Specifically, we not only investigate different types of visual features and fusion methods for VAC, but also propose textual features to effectively capture emotional semantics from the short text associated with images based on word similarity. Experiments are conducted on three publicly available databases: the International Affective Picture System (IAPS), the Artistic Photos and the MirFlickr Affect set. The results demonstrate that the proposed approach combining visual and textual features provides promising results for the VAC task. PMID:28850566
NASA Astrophysics Data System (ADS)
Belfatti, Monica A.
Recently developed common core standards echo calls by educators for ensuring that upper elementary students become proficient readers of informational texts. Informational texts have been theorized as causing difficulty for students because they contain linguistic and visual features different from more familiar narrative genres (Lemke, 2004). It has been argued that learning to read informational texts, particularly those with science subject matter, requires making sense of words, images, and the relationships among them (Pappas, 2006). Yet, conspicuously absent in the research are empirical studies documenting ways students make use of textual resources to build textual and conceptual understandings during classroom literacy instruction. This 10-month practitioner research study was designed to investigate the ways a group of ethnically and linguistically diverse fourth graders in one metropolitan school made sense of science information books during dialogically organized literature discussions. In this nontraditional instructional context, I wondered whether and how young students might make use of science informational text features, both words and images, in the midst of collaborative textual and conceptual inquiry. Drawing on methods of constructivist grounded theory and classroom discourse analysis, I analyzed student and teacher talk in 25 discussions of earth and life science books. Digital voice recordings and transcriptions served as the main data sources for this study. I found that, without teacher prompts or mandates to do so, fourth graders raised a wide range of textual and conceptual inquiries about words, images, scientific figures, and phenomena. In addition, my analysis yielded a typology of ways students constructed relationships between words and images within and across page openings of the information books read for their sense-making endeavors. 
The diversity of constructed word-image relationships aided students in raising, exploring, and contesting textual and conceptual ideas. Moreover, through their joint inquiries, students marshaled and evaluated a rich array of resources. Students' sense-making of information books was not contained by the words and images alone; it involved a situated, complex process of making sense of multiple texts, discourses, and epistemologies. These findings suggest educators, theorists, and policy makers reconsider acontextual, linear, hierarchical models for developing elementary students as sense-makers of nonfiction.
Generative Mechanistic Explanation Building in Undergraduate Molecular and Cellular Biology
ERIC Educational Resources Information Center
Southard, Katelyn M.; Espindola, Melissa R.; Zaepfel, Samantha D.; Bolger, Molly S.
2017-01-01
When conducting scientific research, experts in molecular and cellular biology (MCB) use specific reasoning strategies to construct mechanistic explanations for the underlying causal features of molecular phenomena. We explored how undergraduate students applied this scientific practice in MCB. Drawing from studies of explanation building among…
ERIC Educational Resources Information Center
Weston, Michele; Haudek, Kevin C.; Prevost, Luanna; Urban-Lurain, Mark; Merrill, John
2015-01-01
One challenge in science education assessment is that students often focus on surface features of questions rather than the underlying scientific principles. We investigated how student written responses to constructed-response questions about photosynthesis vary based on two surface features of the question: the species of plant and the order of…
Text-Based Conferencing: Features vs. Functionality
ERIC Educational Resources Information Center
Anderson, Lynn; McCarthy, Cathy
2005-01-01
This report examines three text-based conferencing products: "WowBB", "Invision Power Board", and "vBulletin". Their selection was prompted by a feature-by-feature comparison of the same products on the "WowBB" website. The comparison chart painted a misleading impression of "WowBB's" features in relation to the other two products; so the…
Do Particular Design Features Assist People with Aphasia to Comprehend Text? An Exploratory Study
ERIC Educational Resources Information Center
Wilson, Lucy; Read, Jennifer
2016-01-01
Background: Much of the evidence underlying guidelines for producing accessible information for people with aphasia focuses on client preference for particular design features. There is limited evidence regarding the effects of these features on comprehension. Aims: To examine the effects of specific design features on text comprehension. It was…
Helping Students Bridge Inferences in Science Texts Using Graphic Organizers
ERIC Educational Resources Information Center
Roman, Diego; Jones, Francesca; Basaraba, Deni; Hironaka, Stephanie
2016-01-01
The difficulties that students face when reading science texts go beyond understanding vocabulary and syntactic structures. Comprehension of science texts requires students to infer how these texts function as a unit to communicate scientific meaning. To help students in this process, science texts sometimes employ logical connectives (e.g.,…
Uncovering the Effect of Text Structure in Learning from a Science Text: An Eye-Tracking Study
ERIC Educational Resources Information Center
Ariasi, Nicola; Mason, Lucia
2011-01-01
This study examined whether reading a refutational or non-refutational text would induce different cognitive processing, as revealed by eye-movement analyses. Unlike a standard expository text, a refutational text acknowledges a reader's alternative conceptions about a topic, refutes them, and then introduces scientific conceptions as viable…
Text Structuration Leading to an Automatic Summary System: RAFI.
ERIC Educational Resources Information Center
Lehman, Abderrafih
1999-01-01
Describes the design and construction of Resume Automatique a Fragments Indicateurs (RAFI), a system of automatic text summary which sums up scientific and technical texts. The RAFI system transforms a long source text into several versions of more condensed texts, using discourse analysis, to make searching easier; it could be adapted to the…
Graphic design and scientific research: the experience of the INGV Laboratorio Grafica e Immagini
NASA Astrophysics Data System (ADS)
Riposati, Daniela; D'Addezio, Giuliana; Chesi, Angela; Di Laura, Francesca; Palone, Sabrina
2016-04-01
The Laboratorio Grafica e Immagini is the INGV reference structure for the graphic and visual communication supporting institutional and research activities. Part of the activity is focused on the production of different materials concerning the INGV Educational and Outreach projects on the main themes of Geophysics and natural hazards. The forefront results of research activity, in fact, are periodically transferred to the public through an intense and comprehensive plan of scientific dissemination. In 10 years of activity, the Laboratorio has become an essential point of reference for this production, widely known within the scientific community. Positive experiences are the result of a close relationship between graphic design and scientific research, in particular the collaborative work between designers and researchers. In projects such as the realization of museum exhibitions or the production of illustrative brochures, generally designed for a broad public, the goal is to make understanding easier and to support the scientific message, making concepts enjoyable and fruitful through the emotional involvement that visual images can arouse. Our graphic and editorial products, composed of signs and images using different tools on different media (colors, lettering, graphic design, visual design, web design, etc.), combine to create a strong identity, the "INGV style", that makes them easily recognizable in Educational and Outreach projects: in one word, "branding". For example, a project product package might include a logo or other artwork, organized text and pure design elements such as shapes and colour, which unify the piece. Colour is used not only to help the "brand" stand out from the international overview, but in our case to have a unifying outcome across all the INGV sections.
We also analysed the restyling of different materials, one of the most important features of graphic design, especially when using pre-existing products or diverse elements, including web elements.
Informational Text and the CCSS
ERIC Educational Resources Information Center
Aspen Institute, 2012
2012-01-01
What constitutes an informational text covers a broad swath of different types of texts. Biographies & memoirs, speeches, opinion pieces & argumentative essays, and historical, scientific or technical accounts of a non-narrative nature are all included in what the Common Core State Standards (CCSS) envisions as informational text. Also included…
Refutation Texts for Effective Climate Change Education
ERIC Educational Resources Information Center
Nussbaum, E. Michael; Cordova, Jacqueline R.; Rehmat, Abeera P.
2017-01-01
Refutation texts, which are texts that rebut scientific misconceptions and explain the normative concept, can be effective devices for addressing misconceptions and affecting conceptual change. However, few, if any, refutation texts specifically related to climate change have been validated for effectiveness. In this project, we developed and…
The Poster features the news, local events, and people of the scientific, administrative, and support communities at NCI at Frederick, Frederick, Maryland. It is published by Scientific Publications, Graphics & Media, Leidos Biomedical Research, for NCI at Frederick. The content of this publication does not necessarily reflect the views or policies of the Department of Health
Designing Project-Based Instruction to Foster Generative and Mechanistic Understandings in Genetics
ERIC Educational Resources Information Center
Duncan, Ravit Golan; Tseng, Katie Ann
2011-01-01
The acquisition of scientific knowledge is fraught with difficulties and challenges for the learner. The very nature of some scientific domains contributes to the learning difficulties students experience. Phenomena in these domains are composed of multiple organization levels featuring complicated interactions within and across these levels.…
36 CFR 290.3 - Nomination, evaluation, and designation of significant caves.
Code of Federal Regulations, 2012 CFR
2012-07-01
... governmental agencies and the public, including those who utilize caves for scientific, educational, or... is located as new cave discoveries are made. Caves nominated but not approved for designation may be... mineralogic features that are fragile, represent formation processes that are of scientific interest, or that...
36 CFR 290.3 - Nomination, evaluation, and designation of significant caves.
Code of Federal Regulations, 2013 CFR
2013-07-01
... governmental agencies and the public, including those who utilize caves for scientific, educational, or... is located as new cave discoveries are made. Caves nominated but not approved for designation may be... mineralogic features that are fragile, represent formation processes that are of scientific interest, or that...
36 CFR 290.3 - Nomination, evaluation, and designation of significant caves.
Code of Federal Regulations, 2014 CFR
2014-07-01
... governmental agencies and the public, including those who utilize caves for scientific, educational, or... is located as new cave discoveries are made. Caves nominated but not approved for designation may be... mineralogic features that are fragile, represent formation processes that are of scientific interest, or that...
36 CFR 290.3 - Nomination, evaluation, and designation of significant caves.
Code of Federal Regulations, 2011 CFR
2011-07-01
... governmental agencies and the public, including those who utilize caves for scientific, educational, or... is located as new cave discoveries are made. Caves nominated but not approved for designation may be... mineralogic features that are fragile, represent formation processes that are of scientific interest, or that...
Visual Discourse in Scientific Conference Papers: A Genre-based Study.
ERIC Educational Resources Information Center
Rowley-Jolivet, Elizabeth
2002-01-01
Investigates the role of visual communication in a spoken research genre: the scientific research paper. Analyzes 2,048 visuals projected during 90 papers given at five international conferences in three fields (geology, medicine, physics), in order to bring out the recurrent features of the visual dimension. (Author/VWL)
Program on Public Conceptions of Science, Newsletter 14.
ERIC Educational Resources Information Center
Shelanski, Vivien, Ed.
Three special features related to increasing attention given to the relationships between scientific and social, political, moral and legal issues are presented. One article is presented which questions whether the traditional scientific norms provided adequate guidance for scientists in their interaction with public officials, the news media, and…
Lexical Errors in Second Language Scientific Writing: Some Conceptual Implications
ERIC Educational Resources Information Center
Carrió Pastor, María Luisa; Mestre-Mestre, Eva María
2014-01-01
Nowadays, scientific writers are required to have not only a thorough knowledge of their subject field, but also a sound command of English as a lingua franca. In this paper, the lexical errors produced in scientific texts written in English by non-native researchers are identified to propose a classification of the categories they contain. This study…
Districts Gear up for Shift to Informational Texts
ERIC Educational Resources Information Center
Gewertz, Catherine
2012-01-01
The Common Core State Standards' emphasis on informational text arose in part from research suggesting that employers and college instructors found students weak at comprehending technical manuals, scientific and historical journals, and other texts pivotal to work in those arenas. The common core's vision of informational text includes literary…
ERIC Educational Resources Information Center
Becerra Cortés, Ximena
2013-01-01
This paper reports on an innovative and action research project which focused on the use of the dictionary and the prior knowledge of Colombian high school students to improve their reading comprehension of short scientific texts. Data collection instruments included students' work gathered during two workshops, field notes, and a questionnaire.…
Detecting spam comments on Indonesia’s Instagram posts
NASA Astrophysics Data System (ADS)
Septiandri, Ali Akbar; Wibisono, Okiriza
2017-01-01
In this paper we experimented with several feature sets for detecting spam comments in social media content authored by Indonesian public figures. We define spam comments as comments which have promotional purposes (e.g. referring other users to products and services) and are thus not related to the content to which the comments are posted. Three sets of features are evaluated for detecting spam: (1) hand-engineered features such as comment length, number of capital letters, and number of emojis, (2) keyword features such as whether the comment contains advertising words or product-related words, and (3) text features, namely, bag-of-words, TF-IDF, and fastText embeddings, each combined with latent semantic analysis. With 24,000 manually-annotated comments scraped from Instagram posts authored by more than 100 Indonesian public figures, we compared the performance of these feature sets and their combinations using 3 popular classification algorithms: Naïve Bayes, SVM, and XGBoost. We find that using all three feature sets (with fastText embedding for the text features) gave the best F1-score of 0.9601 on a holdout dataset. More interestingly, fastText embedding combined with hand-engineered features (i.e. without keyword features) yields a similar F1-score of 0.9523, and McNemar’s test failed to reject the hypothesis that the two results are not significantly different. This result is important as keyword features are largely dependent on the dataset and may not be as generalisable as the other feature sets when applied to new data. For future work, we hope to collect a bigger and more diverse dataset of Indonesian spam comments, improve our model’s performance and generalisability, and publish a programming package for others to reliably detect spam comments.
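The hand-engineered and keyword features listed in this abstract could be sketched roughly as below. This is a minimal illustration only, not the authors' code: the function name `hand_engineered_features`, the keyword set `AD_KEYWORDS`, and the emoji character ranges are all assumptions made for the example.

```python
import re

# Hypothetical advertising keywords, for illustration only; the abstract
# does not list the lexicon the authors actually used.
AD_KEYWORDS = {"promo", "diskon", "order", "jual"}

# Approximate emoji matcher covering common Unicode emoji blocks.
# This is a simplification, not a complete emoji definition.
EMOJI_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def hand_engineered_features(comment: str) -> dict:
    """Compute simple per-comment features of the kind the abstract names."""
    tokens = re.findall(r"\w+", comment.lower())
    return {
        "length": len(comment),                              # comment length in characters
        "n_capitals": sum(ch.isupper() for ch in comment),   # number of capital letters
        "n_emojis": len(EMOJI_RE.findall(comment)),          # number of emojis (approximate)
        "has_ad_keyword": any(t in AD_KEYWORDS for t in tokens),  # keyword feature
    }


if __name__ == "__main__":
    print(hand_engineered_features("PROMO diskon 50% \U0001F60D\U0001F60D"))
```

A dictionary like this would normally be turned into a numeric vector and concatenated with the text features before being passed to a classifier.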
Multiple External Representations: Bridges or Barriers to Climate Literacy?
NASA Astrophysics Data System (ADS)
Holzer, M. A.
2012-12-01
The continuous barrage of science-related headlines and other media sources warns us of the need to heed the imperative for a science-literate society. Climate change, genetics, and evolution are a few of the charged and complex scientific topics requiring public understanding of the science to fully grasp the enormous reach of these topics in our daily lives. For instance, our global climate is changing as evidenced by the analysis of Earth-observing satellite data, in-situ data, and proxy data records. How we as a global society decide to address the needs associated with a changing climate is contingent upon having a population that understands how the climate system functions, and can therefore make informed decisions on how to mitigate the effects of climate change. Communication in science relies heavily on the use of multiple representations to support the claims presented. However, these multiple representations require spatial and temporal skills to interpret information portrayed in them, and how a person engages with complex text and the multiple representations varies with the level of expertise one has with the content area. For example, a climatologist will likely identify anomalous data more quickly than a novice when presented with a graph of temperature change over time. These representations are used throughout textbooks as well as popular reading materials such as newspapers and magazines without much consideration for how a reader engages with complex text, diagrams, images, and graphs. If the ability to read and interact with scientific text found in popular literature is perceived as a worthy goal of scientific literacy, then it is imperative that readers understand the relationship between multiple representations and the text while interacting with the science literature they are reading.
For example, in climate-related articles multiple representations not only support the content, but they are part of the content not to be overlooked by a reader. Climatologists recognize the wealth of data and content found in these representations and therefore find themselves in a position where they can effectively interact with the author and their claims. This expert ability to seamlessly integrate text with the associated representations is at one end of the continuum of scientific text comprehension, but what abilities define a novice and those in between expert and novice in this continuum of scientific text comprehension? This talk will describe an ongoing research project with the overarching goal to establish the balance of this continuum in order to identify scaffolds that will assist non-expert readers to negotiate meaning from complex scientific text inclusive of multiple representations found in popular literature in climatology. It will inform those creating data representations on how best to create the representations so that claims and causal relationships may be derived from the literature or media source.
Rudichenko, V M
2012-01-01
This article analyzes gender-related features of hyperuricaemia and gout: women are much older at the onset of gouty arthritis (probably owing largely to menopause itself), have more associated comorbid diseases such as hypertension and kidney failure, and drink fewer alcoholic beverages. The typical localisation of the lesion in the first toe is less frequent in women, and women are more likely to use diuretics. These clinical features are of some importance for the broad practice of general practitioners (family doctors). Gender features of polyarticular gout are not uniform. Research has confirmed a possible genetic basis of uric acid metabolism, which influences some phenotypic features of the organism. Several genes are known to influence serum uric acid: PDZK1, GCKR, SLC2A9, ABCG2, LRRC16A, SLC17A3, SLC16A9 and SLC22A12. However, the conclusions of these studies confirm the need for further scientific clarification of the importance of the different factors behind gender differences.
Science students' critical examination of scientific information related to socioscientific issues
NASA Astrophysics Data System (ADS)
Dankert Kolstø, Stein; Bungum, Berit; Arnesen, Erik; Isnes, Anders; Kristensen, Terje; Mathiassen, Ketil; Mestad, Idar; Quale, Andreas; Sissel Vedvik Tonning, Anne; Ulvik, Marit
2006-07-01
It is widely accepted that to be scientifically literate one needs to have the ability to make thoughtful decisions about socioscientific issues (SSI). This includes critical assessment of scientific claims and arguments involved. In this study we asked 89 science education students with substantial academic qualifications in science, working in groups of two and three, to assess the reliability of scientific claims in an article of their own choice, but related to a socioscientific issue, and to present their evaluation in a short text. In analyzing the students' texts, we focused on the criteria they had explicitly and implicitly used in their evaluations. Through a qualitative analysis, we identified 13 different criteria focusing on empirical and theoretical adequacy, completeness of presented information, social aspects, and manipulative strategies. An inspection of the students' evaluations revealed that they drew upon knowledge of possible institutional interests, different signs of competence and an appreciation of concurrent expert views, but also methodological norms in science, specialized content knowledge, and an appreciation of evidence and disclosure of sources. The number of criteria used and the quality of their application varied, indicating that critical examination of texts with a science dimension needs to be emphasized in science teacher education.
Leonti, Marco
2011-04-12
Apart from empirically learned medicinal and pharmacological properties, the selection of medicinal plants is dependent on cognitive features, ecological factors and cultural history. In literate societies the transmission of medicinal plant knowledge through texts and, more recently, other media containing local as well as non-local knowledge has a more immediate and a more prolonged effect than oral transmission. Therefore, I try to visualize how field based studies in ethnobiology and especially medical ethnobotany and ethnopharmacology run the risk of repeating information and knowledge and illustrate the importance of differentiating and acknowledging the origin, transmission and rationale of plant use made by humans. Reviewing literature dealing with the traditional parameters (e.g. hot/cold dichotomy, organoleptic properties, doctrine of signatures) influencing the selection and transmission of plant use in a juxtaposition to our recent finding of causal influence of text on local plant use. Discussing the passing down of knowledge by text as a special case of oblique/one-to-many knowledge transmission. Historical texts on materia medica, popular books on plant use, clinical studies, and informants of ethnobotanical field studies generate a circle of information and knowledge, which progressively conditions the results of ethnobotanical field studies. While text reporting on phytotherapeutical trends may cause innovation through the introduction of "new" applications to local customs, persistently repeating well established folk remedies leads to the consolidation of such uses adding a conservative dimension to a local pharmacopoeia, which might not actually be there to that extent. Such a "shaping" of what might appear to be the results of a field investigation is clearly outside the ordinary principles of scientific enquiry. 
The traditional pillars of ethnobotanical field studies - that is, "input to drug discovery" and "conservation of cultural heritage" - are also incompatible with this process. Ethnobotanical field studies aimed at a contribution to natural products research and/or the conservation of cultural heritage, as well as those aimed at an assessment and validation of local pharmacopoeias should differentiate between local plant use and widespread as well as modern knowledge reported in popular textbooks and scientific literature. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Variability in Text Features in Six Grade 1 Basal Reading Programs
ERIC Educational Resources Information Center
Foorman, Barbara R.; Francis, David J.; Davidson, Kevin C.; Harm, Michael W.; Griffin, Jennifer
2004-01-01
California and Texas mandate 75% to 80% decodable texts for first-grade reading programs, yet these percentages have no empirical base. This study examines the text selections in 6 first-grade programs from the perspective of lexical, semantic, and syntactic features. The composition of text differed across the 6 programs with respect to length,…
Letter to the editor : Impartial review is key.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crabtree, G. W.; Materials Science Division
The News Feature, 'Misconduct in physics: Time to wise up?' [Nature 418, 120-121; 2002], raises important issues that the physical-science community must face. Argonne National Laboratory's code of ethics calls for a response very similar to that of Bell Labs, namely: 'The Laboratory director may appoint an ad-hoc scientific review committee to investigate internal or external charges of scientific misconduct, fraud, falsification of data, misinterpretation of data, or other activities involving scientific or technical matters.'
Report of the Annual Scientific Session of the American College of Cardiology (ACC) 2018, Orlando.
Hashimoto, Takuya; Ako, Junya
2018-04-28
The 67th Annual Scientific Session and Expo of the American College of Cardiology (ACC) was held at the Orange County Convention Center, Orlando, from March 10-12, 2018. This meeting offered 2,700 accepted abstracts presented in oral and poster sessions by 2,100 experts and 37 Late-Breaking Clinical Trials and Featured Clinical Research presentations. This report introduces the key presentations and highlights from the ACC 2018 Scientific Session.
Scientific Networks on Data Landscapes: Question Difficulty, Epistemic Success, and Convergence
Grim, Patrick; Singer, Daniel J.; Fisher, Steven; Bramson, Aaron; Berger, William J.; Reade, Christopher; Flocken, Carissa; Sales, Adam
2014-01-01
A scientific community can be modeled as a collection of epistemic agents attempting to answer questions, in part by communicating about their hypotheses and results. We can treat the pathways of scientific communication as a network. When we do, it becomes clear that the interaction between the structure of the network and the nature of the question under investigation affects epistemic desiderata, including accuracy and speed to community consensus. Here we build on previous work, both our own and others’, in order to get a firmer grasp on precisely which features of scientific communities interact with which features of scientific questions in order to influence epistemic outcomes. Here we introduce a measure on the landscape meant to capture some aspects of the difficulty of answering an empirical question. We then investigate both how different communication networks affect whether the community finds the best answer and the time it takes for the community to reach consensus on an answer. We measure these two epistemic desiderata on a continuum of networks sampled from the Watts-Strogatz spectrum. It turns out that finding the best answer and reaching consensus exhibit radically different patterns. The time it takes for a community to reach a consensus in these models roughly tracks mean path length in the network. Whether a scientific community finds the best answer, on the other hand, tracks neither mean path length nor clustering coefficient. PMID:24683416
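The abstract ties time to consensus to mean path length on networks sampled from the Watts-Strogatz spectrum. As a rough illustration of those two ingredients only (not of the authors' epistemic model, which the abstract does not specify), the following standard-library sketch builds a Watts-Strogatz-style graph and measures its mean shortest-path length. The function names, the rewiring details, and the parameter values are assumptions chosen for the example.

```python
import random
from collections import deque


def watts_strogatz(n, k, p, seed=0):
    """Ring of n nodes, each joined to its k nearest neighbours (k even);
    every lattice edge is rewired with probability p."""
    rng = random.Random(seed)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(1, k // 2 + 1):
            adj[i].add((i + j) % n)
            adj[(i + j) % n].add(i)
    for j in range(1, k // 2 + 1):
        for i in range(n):
            old = (i + j) % n
            if rng.random() < p:
                # New endpoint must not create a self-loop or duplicate edge.
                candidates = [v for v in range(n) if v != i and v not in adj[i]]
                if candidates:
                    new = rng.choice(candidates)
                    adj[i].discard(old)
                    adj[old].discard(i)
                    adj[i].add(new)
                    adj[new].add(i)
    return adj


def mean_path_length(adj):
    """Average shortest-path length over all reachable ordered pairs (BFS)."""
    total, pairs = 0, 0
    for src in adj:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


if __name__ == "__main__":
    for p in (0.0, 0.1, 1.0):
        print(p, round(mean_path_length(watts_strogatz(20, 4, p)), 3))
```

With `p = 0` the graph is a regular ring lattice; raising `p` moves it toward a random graph, which is the spectrum the abstract refers to. Pairs in disconnected components are simply skipped here, which is one of several possible conventions.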
Scientific Networks on Data Landscapes: Question Difficulty, Epistemic Success, and Convergence.
Grim, Patrick; Singer, Daniel J; Fisher, Steven; Bramson, Aaron; Berger, William J; Reade, Christopher; Flocken, Carissa; Sales, Adam
2013-12-01
A scientific community can be modeled as a collection of epistemic agents attempting to answer questions, in part by communicating about their hypotheses and results. We can treat the pathways of scientific communication as a network. When we do, it becomes clear that the interaction between the structure of the network and the nature of the question under investigation affects epistemic desiderata, including accuracy and speed to community consensus. Here we build on previous work, both our own and others', in order to get a firmer grasp on precisely which features of scientific communities interact with which features of scientific questions in order to influence epistemic outcomes. Here we introduce a measure on the landscape meant to capture some aspects of the difficulty of answering an empirical question. We then investigate both how different communication networks affect whether the community finds the best answer and the time it takes for the community to reach consensus on an answer. We measure these two epistemic desiderata on a continuum of networks sampled from the Watts-Strogatz spectrum. It turns out that finding the best answer and reaching consensus exhibit radically different patterns. The time it takes for a community to reach a consensus in these models roughly tracks mean path length in the network. Whether a scientific community finds the best answer, on the other hand, tracks neither mean path length nor clustering coefficient.
Yalçınkaya, Merter; Bagatur, Erdem
2013-01-01
The aim of this study was to evaluate the publication rates of full-text articles after presentation of abstracts at a Turkish National Orthopaedics and Traumatology Congress, determine the time lag from the congress date to publication of full-text articles and assess the consistency between abstracts and the subsequent publications. All abstracts from the scientific program of the 20th Turkish National Orthopaedics and Traumatology Congress (2007) were identified and computerized PubMed searches were conducted to determine whether an abstract had been followed by publication of a full-text article and key features were compared to evaluate their consistency. The time lag to publication and the impact factors of the journals where the articles were published were noted. Of the 770 abstracts (264 oral, 506 poster presentations), 227 (29.5%) were followed by a full-text article: 116 (44%) of the 264 oral and 111 (22%) of the 506 poster presentations were published. The mean time to publication was 14.9±16.075 (range: 33 to 55) months. Thirty-three (14.5%) were published prior to the presentation at the congress. The likelihood of publication decreased after the third year (26 of 227, 11.5%). A total of 182 (80.2%) articles showed inconsistencies with the abstract; 74 (32.6%) minor, 14 (6.2%) major, and 94 (41.4%) minor and major inconsistencies. The mean impact factor of the journals was 1.152±0.858. The vast majority of abstracts presented at this congress were not followed by publication of a full-text article. Additionally, frequent inconsistencies between the final published article and the original abstract indicated the inadequacy of quality of reporting in abstracts.
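The percentages reported in this abstract can be rechecked from the counts it gives. The short sketch below does only that arithmetic; the helper name `pct` and the variable names are assumptions, and all numbers are copied from the abstract.

```python
def pct(part, whole, digits=1):
    """Percentage of `part` in `whole`, rounded to `digits` decimal places."""
    return round(100 * part / whole, digits)


# Counts as stated in the abstract.
oral, poster = 264, 506
total_abstracts = oral + poster            # 770
published_oral, published_poster = 116, 111
published = published_oral + published_poster  # 227
minor, major, both = 74, 14, 94
inconsistent = minor + major + both        # 182

if __name__ == "__main__":
    print("overall publication rate:", pct(published, total_abstracts))
    print("oral publication rate:", pct(published_oral, oral, 0))
    print("poster publication rate:", pct(published_poster, poster, 0))
    print("published before the congress:", pct(33, published))
    print("published after the third year:", pct(26, published))
    print("articles with inconsistencies:", pct(inconsistent, published))
```

Under these counts the subgroup totals add up to the reported overall figures, and the denominators for the inconsistency percentages are the 227 published articles rather than the 770 abstracts.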
The Magellan Venus explorer's guide
NASA Technical Reports Server (NTRS)
Young, Carolynn (Editor)
1990-01-01
The Magellan radar-mapping mission to the planet Venus is described. Scientific highlights include the history of U.S. and Soviet missions, as well as ground-based radar observations, that have provided the current knowledge about the surface of Venus. Descriptions of the major Venusian surface features include controversial theories about the origin of some of the features. The organization of the Magellan science investigators into discipline-related task groups for data-analysis purposes is presented. The design of the Magellan spacecraft and the ability of its radar sensor to conduct radar imaging, altimetry, and radiometry measurements are discussed. Other topics report on the May 1989 launch, the interplanetary cruise, the Venus orbit-insertion maneuver, and the in-orbit mapping strategy. The objectives of a possible extended mission emphasize the gravity experiment and explain why high-resolution gravity data cannot be acquired during the primary mission. A focus on the people of Magellan reveals how they fly the spacecraft and prepare for major mission events. Special items of interest associated with the Magellan mission are contained in windows interspersed throughout the text. Finally, short summaries describe the major objectives and schedules for several exciting space missions planned to take us into the 21st century.
CORRIGENDUM: The growth of aligned carbon nanotubes on quartz substrate by spray pyrolysis of hexane
NASA Astrophysics Data System (ADS)
Sadeghian, Zahra
2008-07-01
Some of the text in this paper was copied directly from other papers cited by the author. Whilst this does not affect the scientific content and is therefore not scientific plagiarism, the author acknowledges that such usage of text attributed and copyrighted to other unrelated authors is unacceptable practice. Reference [21] in the paper should read: [21] Afre R A, Soga T, Jimbo T, Kumar M, Ando Y and Sharon M 2005 Chem. Phys. Lett. 414 6-10
ERIC Educational Resources Information Center
Cacciatore, Kristen L.; Sevian, Hannah
2006-01-01
We present an alternative to a traditional first-year chemistry laboratory experiment. This experiment has four key features: students utilize stoichiometry, learn and apply principles of green chemistry, engage in authentic scientific inquiry, and discover why each part of a scientific lab report is necessary. The importance and essential…
Pseudoscience in Instructional Technology: The Case of Learner Control Research.
ERIC Educational Resources Information Center
Reeves, Thomas C.
Scientific research that is conducted without the structure of a supporting scientific paradigm should be labeled pseudoscience in that such research is deceptive or false science. It is argued that much of the research in educational technology is pseudoscience, with the focus on learner control research. Learner control is the design feature of…
ERIC Educational Resources Information Center
Mueller, Jon F.; Coon, Heather M.
2013-01-01
The ability to distinguish between correlational and causal claims is core knowledge for scientific literacy. News reports of scientific research prominently feature these claims. Thus, this knowledge has significant real-world application, and distinguishing among claims is critical to making sense of the reported research. We constructed an…
ERIC Educational Resources Information Center
Littlejohn, Emily
2018-01-01
"Adaptation" originally began as a scientific term, but from 1860 to today it most often refers to an altered version of a text, film, or other literary source. When this term was first analyzed, humanities scholars often measured adaptations against their source texts, frequently privileging "original" texts. However, this…
Book reviews in medical journals.
Kroenke, K
1986-01-01
In a study of book reviews published in four general medical journals over a six-month period, 480 reviews were analyzed. Twenty-five features that reviewers address when evaluating a text were identified, and the frequency of commentary for each feature was determined. The mean number of features addressed per review was 9.0. Reviews averaged 389 words, but review length did not correlate with the length or scope of the book, with the number of features addressed, nor with the reviewer's assessment of the text. Extraneous commentary by the reviewer occurred in 16% of the reviews. This editorializing appeared in lengthier reviews that addressed fewer features. Favorable reviews were far more common than unfavorable ones (88.5% vs. 11.5%). Consequently, for the fifty-five books reviewed in more than one journal, agreement regarding rating of the text was high (86%). Results of this study may provide useful guidelines for reviewers of medical texts. PMID:3947772
World Wide Web Based Image Search Engine Using Text and Image Content Features
NASA Astrophysics Data System (ADS)
Luo, Bo; Wang, Xiaogang; Tang, Xiaoou
2003-01-01
Using both text and image content features, a hybrid image retrieval system for the World Wide Web is developed in this paper. We first use a text-based image meta-search engine to retrieve images from the Web based on the text information on the image host pages to provide an initial image set. Because of the high-speed and low-cost nature of the text-based approach, we can easily retrieve a broad coverage of images with a high recall rate and a relatively low precision. An image content based ordering is then performed on the initial image set. All the images are clustered into different folders based on the image content features. In addition, the images can be re-ranked by the content features according to the user feedback. Such a design makes it truly practical to use both text and image content for image retrieval over the Internet. Experimental results confirm the efficiency of the system.
Different Words for the Same Concept: Learning Collaboratively from Multiple Documents
ERIC Educational Resources Information Center
Jucks, Regina; Paus, Elisabeth
2013-01-01
This study investigated how varying the lexical encodings of technical terms in multiple texts influences learners' dyadic processing of scientific-related information. Fifty-seven pairs of college students read journalistic texts on depression. Each partner in a dyad received one text; for half of the dyads the partner's text contained different…
ERIC Educational Resources Information Center
Guilford, Jacquelyn; Bustamante, Annette; Mackura, Kelly; Hirsch, Susan; Lyon, Edward; Estrada, Kelly
2017-01-01
Learning science is language intensive. Students might have to interpret the meaning of models, support claims with evidence, communicate arguments, and discuss phenomena and scientific principles. For English Language Learners (ELLs), engaging in scientific and engineering practices includes additional challenges. This article describes a series…
NASA Astrophysics Data System (ADS)
Martynov, M. B.; Merkulov, P. V.; Lomakin, I. V.; Vyatlev, P. A.; Simonov, A. V.; Leun, E. V.; Barabanov, A. A.; Nasyrov, A. F.
2017-12-01
The advanced Russian project Laplace-P is aimed at developing and launching two scientific spacecraft (SC), Laplace-P1 (LP1 SC) and Laplace-P2 (LP2 SC), designed for remote and in-situ studies of the system of Jupiter and its moon Ganymede. The LP1 and LP2 spacecraft carry an orbiter and a lander onboard, respectively. One of the orbiter's objectives is to map the surface of Ganymede from the artificial satellite's orbit and to acquire the data for the landing site selection. The main objective of the lander is to carry out in-situ investigations of Ganymede's surface. The paper describes the scientific goals and objectives of the mission, its special features, and the LP1 and LP2 mission profiles during all of the phases, from the launch to the landing on the surface of Ganymede.
Mlalila, Nichrous; Kadam, Dattatreya M; Swai, Hulda; Hilonga, Askwar
2016-09-01
In recent decades, the manufacturing industry has advanced globally owing to the growing application of nanotechnology. The food industry has also been changing tremendously, from passive packaging to innovative packaging, to cope with global trends, technological advancements, and consumer preferences. Active research is taking place in the food industry and other scientific fields to develop innovative packages, including smart, intelligent, and active food packaging, for more effective and efficient packaging materials with balanced environmental issues. However, in the food industry the features behind smart packaging are narrowly defined to be distinguished from intelligent packaging as in other scientific fields, where smart materials are under critical investigation. This review presents some scientific concepts and features pertaining to innovative food packaging. The review opens a new research window in innovative food packaging to cover the existing disparities for further precise research and development of the food packaging industry.
Emotional and deliberative reactions to a public crisis: Mad Cow disease in France.
Sinaceur, Marwan; Heath, Chip; Cole, Steve
2005-03-01
Although most theories of choice are cognitive, recent research has emphasized the role of emotions. We used a novel context--the Mad Cow crisis in France--to investigate how emotions alter choice even when consequences are held constant. A field study showed that individuals reduced beef consumption in months after many newspaper articles featured the emotional label "Mad Cow," but beef consumption was unaffected after articles featured scientific labels for the same disease. The reverse pattern held for the disease-related actions of a government bureaucracy. A lab study showed that the Mad Cow label induces people to make choices based solely on emotional reactions, whereas scientific labels induce people to consider their own probability judgments. Although the Mad Cow label produces less rational behavior than scientific labels, it is two to four times more common in the environment.
Text-based plagiarism in scientific publishing: issues, developments and education.
Li, Yongyan
2013-09-01
Text-based plagiarism, or copying language from sources, has recently become an issue of growing concern in scientific publishing. Use of CrossCheck (a computational text-matching tool) by journals has sometimes exposed an unexpected amount of textual similarity between submissions and databases of scholarly literature. In this paper I provide an overview of the relevant literature, to examine how journal gatekeepers perceive textual appropriation, and how automated plagiarism-screening tools have been developed to detect text matching, with the technique now available for self-check of manuscripts before submission; I also discuss issues around English as an additional language (EAL) authors and in particular EAL novices being the typical offenders of textual borrowing. The final section of the paper proposes a few educational directions to take in tackling text-based plagiarism, highlighting the roles of the publishing industry, senior authors and English for academic purposes professionals.
CRIE: An automated analyzer for Chinese texts.
Sung, Yao-Ting; Chang, Tao-Hsing; Lin, Wei-Chun; Hsieh, Kuan-Sheng; Chang, Kuo-En
2016-12-01
Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.
Brinkley, Dawn Y.; Ackerman, Robert A.; Ehrenreich, Samuel E.; Underwood, Marion K.
2017-01-01
This research examined adolescents’ written text messages with sexual content to investigate how sexting relates to sexual activity and borderline personality features. Participants (N = 181, 85 girls) completed a measure of borderline personality features prior to 10th grade and were subsequently given smartphones configured to capture the content of their text messages. Four days of text messaging were micro-coded for content related to sex. Following 12th grade, participants reported on their sexual activity and again completed a measure of borderline personality features. Results showed that engaging in sexting at age 16 was associated with reporting an early sexual debut, having sexual intercourse experience, having multiple sex partners, and engaging in drug use in combination with sexual activity two years later. Girls engaging in sex talk were more likely to have had sexual intercourse by age 18. Text messaging about hypothetical sex in grade 10 also predicted borderline personality features at age 18. These findings suggest that sending text messages with sexual content poses risks for adolescents. Programs to prevent risky sexual activity and to promote psychological health could be enhanced by teaching adolescents to use digital communication responsibly. PMID:28824224
Brinkley, Dawn Y; Ackerman, Robert A; Ehrenreich, Samuel E; Underwood, Marion K
2017-05-01
This research examined adolescents' written text messages with sexual content to investigate how sexting relates to sexual activity and borderline personality features. Participants (N = 181, 85 girls) completed a measure of borderline personality features prior to 10th grade and were subsequently given smartphones configured to capture the content of their text messages. Four days of text messaging were micro-coded for content related to sex. Following 12th grade, participants reported on their sexual activity and again completed a measure of borderline personality features. Results showed that engaging in sexting at age 16 was associated with reporting an early sexual debut, having sexual intercourse experience, having multiple sex partners, and engaging in drug use in combination with sexual activity two years later. Girls engaging in sex talk were more likely to have had sexual intercourse by age 18. Text messaging about hypothetical sex in grade 10 also predicted borderline personality features at age 18. These findings suggest that sending text messages with sexual content poses risks for adolescents. Programs to prevent risky sexual activity and to promote psychological health could be enhanced by teaching adolescents to use digital communication responsibly.
WE-E-218-01: Writing and Reviewing Papers in Medical Physics.
Hendee, W; Slattery, P; Rogers, D; Karellas, A
2012-06-01
There is an art to writing a scientific paper so that it communicates accurately, succinctly, and comprehensively. Developing this art comes with experience, and sharing that experience with younger physicists is an obligation of senior scientists, especially those with editorial responsibilities for the journal. In this workshop, the preparation of a scientific manuscript will be dissected so participants can appreciate how each part is developed and then assembled into a complete paper. Then the review process for the paper will be discussed, including how to examine a paper and write an insightful and constructive review. Finally, we will consider the challenge of accommodating the concerns and recommendations of a reviewer in preparing a revision of the paper. A second feature of the workshop will be a discussion of the process of electronic submission of a paper for consideration by Medical Physics. The web-based PeerX-Press engine for manuscript submission and management will be examined, with attention to special features such as epaps and line-referencing. Finally, new features of Medical Physics will be explained, such as Vision 20/20 manuscripts, Physics Letters and the standardized formatting of book reviews. 1. Improve the participants' abilities to write a scientific manuscript. 2. Understand the review process for Medical Physics manuscripts and how to participate in and benefit from it. 3. Appreciate the many features of the PeerX-Press electronic management process for Medical Physics manuscripts. 4. Develop a knowledge of new features of Medical Physics. © 2012 American Association of Physicists in Medicine.
Feature engineering for MEDLINE citation categorization with MeSH.
Jimeno Yepes, Antonio Jose; Plaza, Laura; Carrillo-de-Albornoz, Jorge; Mork, James G; Aronson, Alan R
2015-04-08
Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations. Traditional features like unigrams and bigrams exhibit strong performance compared to other feature sets. Little or no improvement is obtained when using meta-data or citation structure. Noun phrases are too sparse and thus have lower performance compared to more traditional features. Conceptual annotation of the texts by MetaMap shows similar performance compared to unigrams, but adding concepts from the UMLS taxonomy does not improve the performance of using only mapped concepts. The combination of all the features performs largely better than any individual feature set considered. In addition, this combination improves the performance of a state-of-the-art MeSH indexer. Concerning the machine learning algorithms, we find that those that are more resilient to class imbalance largely obtain better performance. We conclude that even though traditional features such as unigrams and bigrams have strong performance compared to other features, it is possible to combine them to effectively improve the performance of the bag-of-words representation. We have also found that the combination of the learning algorithm and feature sets has an influence in the overall performance of the system. Moreover, using learning algorithms resilient to class imbalance largely improves performance. However, when using a large set of features, consideration needs to be taken with algorithms due to the risk of over-fitting. 
Specific combinations of learning algorithms and features for individual MeSH headings could further increase the performance of an indexing system.
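As an illustration of the "traditional" features this abstract refers to, the following Python sketch builds a combined unigram and bigram count representation for one citation. It is a minimal sketch under stated assumptions: whitespace tokenization and the helper name `ngram_features` are hypothetical choices, not the feature pipeline used in the paper.

```python
from collections import Counter

def ngram_features(text, n_values=(1, 2)):
    """Count word n-grams (unigrams and bigrams by default) in a text."""
    tokens = text.lower().split()  # naive whitespace tokenization (assumption)
    feats = Counter()
    for n in n_values:
        for i in range(len(tokens) - n + 1):
            feats[" ".join(tokens[i:i + n])] += 1
    return feats

# Hypothetical citation title used only to show the shape of the output.
features = ngram_features("heart failure and heart disease")
```

Each key of `features` would become one dimension of the bag-of-words vector passed to a classifier; other feature sets (noun phrases, metadata, concepts) could be merged into the same `Counter` when combining representations.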
ERIC Educational Resources Information Center
Ault, Marilyn; Craig-Hare, Jana; Frey, Bruce
2016-01-01
Reason Racer is an online, rate-based, multiplayer game designed to engage middle school students in the knowledge and skills related to scientific argumentation. Several game features are included as design considerations unrelated to science content or argumentation. One specific feature, a competitive racing component that occurs in between…
ERIC Educational Resources Information Center
Wu, Pai-Hsing; Wu, Hsin-Kai; Kuo, Che-Yu; Hsu, Ying-Shao
2015-01-01
Computer-based learning tools include design features to enhance learning, but learners may not always perceive the existence of these features and use them in desirable ways. There might be a gap between what the tool features are designed to offer (intended affordance) and how they are actually used (actual affordance). This study thus aims at…
Arabic OCR: toward a complete system
NASA Astrophysics Data System (ADS)
El-Bialy, Ahmed M.; Kandil, Ahmed H.; Hashish, Mohamed; Yamany, Sameh M.
1999-12-01
Latin and Chinese OCR systems have been studied extensively in the literature, yet little work has been done on Arabic character recognition. This is due to the technical challenges found in Arabic text. Because of its cursive nature, powerful and stable text segmentation is needed. Also, features capturing the characteristics of the rich Arabic character representation are needed to build the Arabic OCR. In this paper a novel segmentation technique which is font and size independent is introduced. This technique can segment a cursive written text line even if the line suffers from a small skew. The technique is not sensitive to the location of the centerline of the text line and can segment different font sizes and types (for different character sets) occurring on the same line. Feature extraction is considered one of the most important phases of the text reading system. Ideally, the features extracted from a character image should capture the essential characteristics of this character that are independent of the font type and size. In such an ideal case, the classifier stores a single prototype per character. However, it is practically challenging to find such an ideal set of features. In this paper, a set of features that reflect the topological aspects of Arabic characters is proposed. These proposed features integrated with a topological matching technique introduce an Arabic text reading system that is semi Omni.
Semi-automated surface mapping via unsupervised classification
NASA Astrophysics Data System (ADS)
D'Amore, M.; Le Scaon, R.; Helbert, J.; Maturilli, A.
2017-09-01
Due to the increasing volume of data returned from space missions, the human search for correlations and identification of interesting features becomes more and more unfeasible. Statistical extraction of features via machine learning methods will increase the scientific output of remote sensing missions and aid the discovery of yet unknown features hidden in datasets. These methods exploit algorithms trained on features from multiple instruments, returning classification maps that explore intra-dataset correlation, allowing for the discovery of unknown features. We present two applications, one for Mercury and one for Vesta.
The Scientific Method - Critical and Creative Thinking
NASA Astrophysics Data System (ADS)
Cotton, John; Scalise, Randall
2011-10-01
The "scientific method" is not just for scientists! Combined with critical thinking, the scientific method can enable students to distinguish credible sources of information from nonsense and become intelligent consumers of information. Professors John Cotton and Randall Scalise illustrate these principles using a series of examples and demonstrations that is enlightening, educational, and entertaining. This lecture/demonstration features highlights from their course (whose unofficial title is "debunking pseudoscience"), which enables students to detect pseudoscience in its many guises: paranormal phenomena, free-energy devices, alternative medicine, and many others.
Report of the American Heart Association (AHA) Scientific Sessions 2016, New Orleans.
Amaki, Makoto; Konagai, Nao; Fujino, Masashi; Kawakami, Shouji; Nakao, Kazuhiro; Hasegawa, Takuya; Sugano, Yasuo; Tahara, Yoshio; Yasuda, Satoshi
2016-12-22
The American Heart Association (AHA) Scientific Sessions 2016 were held on November 12-16 at the Ernest N. Morial Convention Center, New Orleans, LA. This 5-day event featured cardiovascular clinical practice covering all aspects of basic, clinical, population, and translational content. One of the hot topics at AHA 2016 was precision medicine. The key presentations and highlights from the AHA Scientific Sessions 2016, including "precision medicine" as one of the hot topics, are herein reported.
ERIC Educational Resources Information Center
Halbauer, Siegfried
1976-01-01
It was considered that students of intensive scientific Russian courses could learn vocabulary more efficiently if they were taught word stems and how to combine them with prefixes and suffixes to form scientific words. The computer programs developed to identify the most important stems are discussed. (Text is in German.) (FB)
2017-01-01
Evidence-based dietary information represented as unstructured text is crucial information that needs to be accessed in order to help dietitians follow the new knowledge that arrives daily with newly published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They are focused on, for example, extracting gene mentions, protein mentions, relationships between genes and proteins, chemical concepts, and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge this is the first attempt at extracting dietary concepts. DrNER is a rule-based NER that consists of two phases. The first one involves the detection and determination of the entity mentions, and the second one involves the selection and extraction of the entities. We evaluate the method by using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. Evaluation of the method showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations. PMID:28644863
Helios: Understanding Solar Evolution Through Text Analytics
DOE Office of Scientific and Technical Information (OSTI.GOV)
Randazzese, Lucien
This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.
Is the recall of verbal-spatial information from working memory affected by symptoms of ADHD?
Caterino, Linda C; Verdi, Michael P
2012-10-01
OBJECTIVE: The Kulhavy model for text learning using organized spatial displays proposes that learning will be increased when participants view visual images prior to related text. In contrast to previous studies, this study also included students who exhibited symptoms of ADHD. Participants were presented with either a map-text or text-map condition. The map-text condition led to a significantly higher performance than the text-map condition, overall. However, students who endorsed more symptoms of inattention and hyperactivity-impulsivity scored more poorly when asked to recall text facts, text features, and map features and were less able to correctly place map features on a reconstructed map than were students who endorsed fewer symptoms. The results of the study support the Kulhavy model for typical students; however, the benefit of viewing a display prior to text was not seen for students with ADHD symptoms, thus supporting previous studies that have demonstrated that ADHD appears to negatively affect operations that occur in working memory.
Layout-aware text extraction from full-text PDF of scientific articles.
Ramakrishnan, Cartic; Patnia, Abhishek; Hovy, Eduard; Burns, Gully Apc
2012-05-28
The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the 'Layout-Aware PDF Text Extraction' (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96, Recall = 0.89 and F1 = 0.91. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF.
Finally, we discuss preliminary error analysis for our system and identify further areas of improvement. LA-PDFText is an open-source tool for accurately extracting text from full-text scientific articles. The release of the system is available at http://code.google.com/p/lapdftext/.
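The three-stage process named in this abstract (detect contiguous blocks, classify them with rules, stitch them into sections) can be sketched in a few lines of Python. This is a toy illustration only, assuming input lines already carry a vertical position; the gap threshold, the prefix rules in `RULES`, and all function names are hypothetical and do not reproduce the LA-PDFText implementation.

```python
def detect_blocks(lines, gap=15):
    """Stage 1: group (y, text) lines into blocks; a vertical gap > `gap` starts a new block."""
    blocks, current, last_y = [], [], None
    for y, text in sorted(lines):
        if last_y is not None and y - last_y > gap:
            blocks.append(current)
            current = []
        current.append(text)
        last_y = y
    if current:
        blocks.append(current)
    return [" ".join(b) for b in blocks]

# Hypothetical rules: a block is labeled by the heading word it starts with.
RULES = [("abstract", "abstract"), ("methods", "methods"), ("results", "results")]

def classify(block):
    """Stage 2: assign a rhetorical category using simple prefix rules."""
    head = block.lower()
    for prefix, label in RULES:
        if head.startswith(prefix):
            return label
    return "other"

def stitch(blocks):
    """Stage 3: join blocks of the same category, preserving reading order."""
    sections = {}
    for b in blocks:
        sections.setdefault(classify(b), []).append(b)
    return {label: " ".join(parts) for label, parts in sections.items()}

# Toy page: (vertical position, line text).
lines = [
    (10, "Abstract We study X."),
    (20, "It works."),
    (60, "Methods We parsed PDFs."),
    (100, "Results Accuracy was high."),
]
blocks = detect_blocks(lines)
sections = stitch(blocks)
```

A real system would also need horizontal positions and font information to handle multi-column layouts; the sketch keeps only the vertical gap to show how the stages hand data to one another.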
Layout-aware text extraction from full-text PDF of scientific articles
2012-01-01
Background: The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the ‘Layout-Aware PDF Text Extraction’ (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results: Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96, Recall = 0.89 and F1 = 0.91. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF.
Finally, we discuss preliminary error analysis for our system and identify further areas of improvement. Conclusions: LA-PDFText is an open-source tool for accurately extracting text from full-text scientific articles. The release of the system is available at http://code.google.com/p/lapdftext/. PMID:22640904
Naukaoklimacie.pl: Between Science Blog and Mythbuster.
NASA Astrophysics Data System (ADS)
Malinowski, S. P.; Popkiewicz, M.; Kardaś, A.; Bielewicz, A.
2015-12-01
"Naukaoklimacie" is a Polish counterpart of the well-known portal SkepticalScience.com. The name is a play on words: this cluster of two Polish words can be translated into English as "Climate Science" or "Science about Climate". Naukaoklimacie.pl and the associated Facebook page form an ongoing project, now over two years old, aimed at providing the Polish-speaking community with the fundamentals of climate science. It gives insight into recent climate science achievements, rebuts climate misinformation, and busts climate myths. During two years of activity we have published over 250 texts; our Facebook page has over 4k fans and reaches 4-12 thousand people per week; and the main-page articles are quoted in the press and used as reading texts for students. Unlike in many countries, in Poland there is a real problem in finding trustworthy information on climate change and the science behind it. Neither universities nor governmental agencies present climate science to Polish society. Naukaoklimacie.pl fills this gap in a unique way. The editorial group of the portal consists of two atmospheric scientists, a physicist, and a professional journalist, and is supported by a scientific council consisting of 14 active scientists specializing in various aspects of climate, atmosphere, biodiversity, atmospheric chemistry, etc. All the texts published on the webpage are reviewed by scientists who are specialists in the subject of the text, usually from the scientific council and sometimes external specialists. All the texts provide links to the original scientific publications. Naukaoklimacie.pl is not only an internet activity. We meet people at Festivals of Science and Science Open Days. We are also present in the mainstream media: the editors and members of the scientific council have been interviewed by the press and TV.
Medical Concept Normalization in Social Media Posts with Recurrent Neural Networks.
Tutubalina, Elena; Miftahutdinov, Zulfat; Nikolenko, Sergey; Malykh, Valentin
2018-06-12
Text mining of scientific libraries and social media has already proven itself as a reliable tool for drug repurposing and hypothesis generation. The task of mapping a disease mention to a concept in a controlled vocabulary, typically to the standard thesaurus in the Unified Medical Language System (UMLS), is known as medical concept normalization. This task is challenging due to the differences in the use of medical terminology between health care professionals and social media texts coming from the lay public. To bridge this gap, we use sequence learning with recurrent neural networks and semantic representation of one- or multi-word expressions: we develop end-to-end architectures directly tailored to the task, including bidirectional Long Short-Term Memory, Gated Recurrent Units with an attention mechanism, and additional semantic similarity features based on UMLS. Our evaluation against a standard benchmark shows that recurrent neural networks improve results over an effective baseline for classification based on convolutional neural networks. A qualitative examination of mentions discovered in a dataset of user reviews collected from popular online health information platforms as well as a quantitative evaluation both show improvements in the semantic representation of health-related expressions in social media. Copyright © 2018. Published by Elsevier Inc.
ERIC Educational Resources Information Center
Adesope, Olusola O.; Cavagnetto, Andy; Hunsu, Nathaniel J.; Anguiano, Carlos; Lloyd, Joshua
2017-01-01
This study used a between-subjects experimental design to examine the effects of three different computer-based instructional strategies (concept map, refutation text, and expository scientific text) on science learning. Concept maps are node-link diagrams that show concepts as nodes and relationships among the concepts as labeled links.…
ERIC Educational Resources Information Center
Dutke, Stephan; Grefe, Anna Christina; Leopold, Claudia
2016-01-01
In an experiment with 65 high-school students, we tested the hypothesis that personalizing learning materials would increase students' learning performance and motivation to study the learning materials. Students studied either a 915-word standard text on the anatomy and functionality of the human eye or a personalized version of the same text in…
Content Analysis of Articles Published in Iranian Scientific Nursing Journals From 2009 Through 2011
Tahamtan, Iman; Bagheri, Zeinab; Janani, Payman; Majidi, Somayye; Ghasemi, Elham; Negarandeh, Reza
2014-01-01
Background: Little is known about the features of Iranian nursing journals, specifically the subject areas used in articles, study designs, sampling methods, international collaboration of Iranian nursing scholars, specialty and academic rank of authors, and the most frequently contributing academic institutions in articles. Objectives: The aim of this study was to analyze the content of the articles published in Iranian scientific nursing journals. Materials and Methods: Quantitative content analysis was implemented to study Iranian nursing journals, which were approved by the commission for accreditation and improvement of Iranian medical journals in 2011. Thus, 763 articles from six journals, published from 2009 through 2011, were investigated. Data were extracted from the abstracts and, when necessary, from the full text of articles by visiting the websites of these journals. Descriptive statistics were used to analyze the data. Results: The main subjects of published articles in Iranian scientific nursing journals were, in descending order, renal dialysis (n = 21), intensive care unit (n = 16), nursing education (n = 15), patient satisfaction (n = 13), quality of life (n = 12), health education (n = 11), patient education (n = 11), pain (n = 10), and education (n = 9). The majority of authors had a nursing and midwifery specialty (52.59%), followed by an epidemiology/biostatistics specialty (7.72%). Isfahan, Tehran, Shahid Beheshti, Iran, Baqiyatallah, and Urmia universities of medical sciences had, in that order, the largest number of publications in the studied journals. Only three papers (0.39%) were published through international collaboration. Conclusions: Iranian nursing journals should publish special issues in the neglected subject areas. These journals should encourage authors to publish research evidence with higher quality. PMID:25741512
Comparative and Contrastive Observations on Scientific Titles Written in English and Spanish
ERIC Educational Resources Information Center
Soler, Viviana
2011-01-01
This research focuses on the structural construction of scientific titles in English and Spanish in research papers (RP) and review papers (RVP) in the biological and social sciences. The questions raised were (i) whether structural construction is a key distinctive feature between RP and RVP titles; (ii) whether the inherent peculiarities of…
Large-Scale Assessment, Rationality, and Scientific Management: The Case of No Child Left Behind
ERIC Educational Resources Information Center
Roach, Andrew T.; Frank, Jennifer
2007-01-01
This article examines the ways in which NCLB and the movement towards large-scale assessment systems are based on Weber's concept of formal rationality and tradition of scientific management. Building on these ideas, the authors use Ritzer's McDonaldization thesis to examine some of the core features of large-scale assessment and accountability…
ERIC Educational Resources Information Center
Fuselier, Linda; Murphy, Claudia; Bender, Anita; Falcón, Kandace Creel
2015-01-01
Background and purpose: The purpose of this exploratory case study is to describe how scholars negotiated disciplinary divides to develop and communicate to their students an understanding of the basic features of scientific knowledge. Our goals were to examine boundary crossing in interdisciplinary collaboration and to assess the efficacy of…
ERIC Educational Resources Information Center
Bråten, Ivar; Braasch, Jason L. G.; Strømsø, Helge I.; Ferguson, Leila E.
2015-01-01
Students read six documents that varied in terms of their perspectives on a scientific issue and the trustworthiness of the source features. After reading, students wrote essays, rank-ordered the documents according to perceived trustworthiness, and provided reasons for their rank-order decisions. Students put the most trust in a textbook and a…
ERIC Educational Resources Information Center
Freeland, Peter
2013-01-01
Charles Darwin supposed that evolution involved a process of gradual change, generated randomly, with the selection and retention over many generations of survival-promoting features. Some theists have never accepted this idea. "Intelligent design" is a relatively recent theory, supposedly based on scientific evidence, which attempts to…
Unweaving Time and Food Chains: Two Classroom Exercises in Scientific and Emotional Literacy.
ERIC Educational Resources Information Center
Alsop, Steve; Watts, Mike
2002-01-01
Discusses affective dimensions in school science. Uses data from two case studies and explores ways in which science has the potential to stimulate and challenge emotions. Discusses the importance of affect in learning, how emotions might feature more centrally in science classrooms, and how definitions of scientific literacy might more explicitly…
ERIC Educational Resources Information Center
Roff, Lori; Stringer, Lola
The food science course developed in Missouri combines basic scientific and mathematics principles in a hands-on instructional format as a part of the family and consumer sciences education curriculum. Throughout the course, students conduct controlled experiments and use scientific laboratory techniques and information to explore the biological…
Automatic Detection of Sand Ripple Features in Sidescan Sonar Imagery
2014-07-09
Among the features used in forensic scientific fingerprint analysis are terminations or bifurcations of print ridges. Sidescan sonar imagery of ripple...always be pathological cases. The size of the blocks of pixels used in determining the ripple wavelength is evident in the output images on the right in
Scientific and Technical English.
ERIC Educational Resources Information Center
Vaclavik, Jaroslav
Technical English differs from everyday English because of the specialized contexts in which it is used and because of the specialized interests of scientists and engineers. This text provides exercises in technical and scientific exposition in the following fields: mathematics, physics, temperature effects, mechanics, dynamics, conservation of…
ERIC Educational Resources Information Center
Ward, Jeremy
2001-01-01
Examines chemical engineering students' attitudes to text and other parts of English language textbooks. A questionnaire was administered to a group of undergraduates. Results reveal one way students get around the problem of textbook reading. (Author/VWL)
Van Landeghem, Sofie; Abeel, Thomas; Saeys, Yvan; Van de Peer, Yves
2010-09-15
In the field of biomolecular text mining, black box behavior of machine learning systems currently limits understanding of the true nature of the predictions. However, feature selection (FS) is capable of identifying the most relevant features in any supervised learning setting, providing insight into the specific properties of the classification algorithm. This allows us to build more accurate classifiers while at the same time bridging the gap between the black box behavior and the end-user who has to interpret the results. We show that our FS methodology successfully discards a large fraction of machine-generated features, improving classification performance of state-of-the-art text mining algorithms. Furthermore, we illustrate how FS can be applied to gain understanding in the predictions of a framework for biomolecular event extraction from text. We include numerous examples of highly discriminative features that model either biological reality or common linguistic constructs. Finally, we discuss a number of insights from our FS analyses that will provide the opportunity to considerably improve upon current text mining tools. The FS algorithms and classifiers are available in Java-ML (http://java-ml.sf.net). The datasets are publicly available from the BioNLP'09 Shared Task web site (http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/).
KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences.
Ernst, Patrick; Siu, Amy; Weikum, Gerhard
2015-05-14
Biomedical knowledge bases (KBs) have become important assets in life sciences. Prior work on KB construction has three major limitations. First, most biomedical KBs are manually built and curated, and cannot keep up with the rate at which new findings are published. Second, for automatic information extraction (IE), the text genre of choice has been scientific publications, neglecting sources like health portals and online communities. Third, most prior work on IE has focused on the molecular level or chemogenomics only, like protein-protein interactions or gene-drug relationships, or has solely addressed highly specific topics such as drug effects. We address these three limitations by a versatile and scalable approach to automatic KB construction. Using a small number of seed facts for distant supervision of pattern-based extraction, we harvest a huge number of facts in an automated manner without requiring any explicit training. We extend previous techniques for pattern-based IE with confidence statistics, and we combine this recall-oriented stage with logical reasoning for consistency constraint checking to achieve high precision. To our knowledge, this is the first method that uses consistency checking for biomedical relations. Our approach can be easily extended to incorporate additional relations and constraints. We ran extensive experiments not only for scientific publications, but also for encyclopedic health portals and online communities, creating different KBs based on different configurations. We assess the size and quality of each KB, in terms of number of facts and precision. The best configured KB, KnowLife, contains more than 500,000 facts at a precision of 93% for 13 relations covering genes, organs, diseases, symptoms, treatments, as well as environmental and lifestyle risk factors. KnowLife is a large knowledge base for health and life sciences, automatically constructed from different Web sources. As a unique feature, KnowLife is harvested from different text genres such as scientific publications, health portals, and online communities. Thus, it has the potential to serve as a one-stop portal for a wide range of relations and use cases. To showcase the breadth and usefulness, we make the KnowLife KB accessible through the health portal (http://knowlife.mpi-inf.mpg.de).
Design Approaches to Support Preservice Teachers in Scientific Modeling
NASA Astrophysics Data System (ADS)
Kenyon, Lisa; Davis, Elizabeth A.; Hug, Barbara
2011-02-01
Engaging children in scientific practices is hard for beginning teachers. One such scientific practice with which beginning teachers may have limited experience is scientific modeling. We have iteratively designed preservice teacher learning experiences and materials intended to help teachers achieve learning goals associated with scientific modeling. Our work has taken place across multiple years at three university sites, with preservice teachers focused on early childhood, elementary, and middle school teaching. Based on results from our empirical studies supporting these design decisions, we discuss design features of our modeling instruction in each iteration. Our results suggest some successes in supporting preservice teachers in engaging students in modeling practice. We propose design principles that can guide science teacher educators in incorporating modeling in teacher education.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geveci, Berk; Maynard, Robert
The XVis project brings together the key elements of research to enable scientific discovery at extreme scale. Scientific computing will no longer be purely about how fast computations can be performed. Energy constraints, processor changes, and I/O limitations necessitate significant changes in both the software applications used in scientific computation and the ways in which scientists use them. Components for modeling, simulation, analysis, and visualization must work together in a computational ecosystem, rather than working independently as they have in the past. The XVis project brought together collaborators from predominant DOE projects for visualization on accelerators, combining their respective features into a new visualization toolkit called VTK-m.
Rinaldi, Fabio; Schneider, Gerold; Kaljurand, Kaarel; Hess, Michael; Andronis, Christos; Konstandi, Ourania; Persidis, Andreas
2007-02-01
The number of new discoveries (as published in the scientific literature) in the biomedical area is growing at an exponential rate. This growth makes it very difficult to filter the most relevant results, and thus the extraction of the core information becomes very expensive. Therefore, there is a growing interest in text processing approaches that can deliver selected information from scientific publications, which can limit the amount of human intervention normally needed to gather those results. This paper presents and evaluates an approach aimed at automating the process of extracting functional relations (e.g. interactions between genes and proteins) from scientific literature in the biomedical domain. The approach, using a novel dependency-based parser, is based on a complete syntactic analysis of the corpus. We have implemented a state-of-the-art text mining system for biomedical literature, based on a deep-linguistic, full-parsing approach. The results are validated on two different corpora: the manually annotated genomics information access (GENIA) corpus and the automatically annotated Arabidopsis thaliana circadian rhythms (ATCR) corpus. We show how a deep-linguistic approach (contrary to common belief) can be used in a real-world text mining application, offering high-precision relation extraction while retaining sufficient recall.
Effects of Strategy Instructions on Learning from Text and Pictures
ERIC Educational Resources Information Center
Leopold, Claudia; Doerner, Marcel; Leutner, Detlev; Dutke, Stephan
2015-01-01
In two experiments, we compared effects of instructions that encourage learners to create referential connections between words and pictures with instructions that distract learners from creating referential connections. In Experiment 1, students read a scientific text under four conditions. In the text-picture condition, students read the…
ERIC Educational Resources Information Center
Leopold, Claudia; Leutner, Detlev
2015-01-01
In three experiments, students were trained to use strategies for learning from scientific texts: text highlighting (Experiment 1), knowledge mapping (Experiment 2), and visualizing (Experiment 3). Each experiment compared a control condition, cognitive strategy training, and a combined cognitive strategy plus metacognitive self-regulation…
An Overall Perspective of Machine Translation with Its Shortcomings
ERIC Educational Resources Information Center
Akbari, Alireza
2014-01-01
The demand for language translation has increased strikingly in recent years due to cross-cultural communication and the exchange of information. In order to communicate well, text should be translated correctly and completely in each field, such as legal documents, technical texts, scientific texts, publicity leaflets, and instructional materials. In this…
Word-level recognition of multifont Arabic text using a feature vector matching approach
NASA Astrophysics Data System (ADS)
Erlandson, Erik J.; Trenkle, John M.; Vogt, Robert C., III
1996-03-01
Many text recognition systems recognize text imagery at the character level and assemble words from the recognized characters. An alternative approach is to recognize text imagery at the word level, without analyzing individual characters. This approach avoids the problem of individual character segmentation, and can overcome local errors in character recognition. A word-level recognition system for machine-printed Arabic text has been implemented. Arabic is a script language, and is therefore difficult to segment at the character level. Character segmentation has been avoided by recognizing text imagery of complete words. The Arabic recognition system computes a vector of image-morphological features on a query word image. This vector is matched against a precomputed database of vectors from a lexicon of Arabic words. Vectors from the database with the highest match score are returned as hypotheses for the unknown image. Several feature vectors may be stored for each word in the database. Database feature vectors generated using multiple fonts and noise models allow the system to be tuned to its input stream. Used in conjunction with database pruning techniques, this Arabic recognition system has obtained promising word recognition rates on low-quality multifont text imagery.
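The matching step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the cosine match score, the three-dimensional toy vectors, the transliterated lexicon entries, and the function names are all assumptions made for illustration. It shows a query vector compared against a database that stores several feature vectors per word, with the best-scoring words returned as hypotheses.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_word(query, database, top_k=3):
    """Return the top_k (word, score) hypotheses for a query vector.

    database maps each lexicon word to a list of feature vectors
    (for example one per font or noise model); a word's score is the
    score of its best-matching stored vector.
    """
    scores = {
        word: max(cosine(query, vec) for vec in vectors)
        for word, vectors in database.items()
    }
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_k]


# Hypothetical database: two stored vectors ("fonts") per lexicon word.
database = {
    "kitab": [[0.9, 0.1, 0.3], [0.8, 0.2, 0.4]],
    "qalam": [[0.1, 0.9, 0.2], [0.2, 0.8, 0.1]],
    "bayt": [[0.3, 0.3, 0.9], [0.4, 0.2, 0.8]],
}
hypotheses = match_word([0.85, 0.15, 0.35], database, top_k=2)
```

Taking the maximum over a word's stored vectors is one simple way to let several fonts or noise models vote for the same word; the actual system may score and prune candidates differently.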
Comparisons and Selections of Features and Classifiers for Short Text Classification
NASA Astrophysics Data System (ADS)
Wang, Ye; Zhou, Zhi; Jin, Shan; Liu, Debin; Lu, Mi
2017-10-01
Short text is considerably different from traditional long text documents due to its shortness and conciseness, which somehow hinders the applications of conventional machine learning and data mining algorithms in short text classification. According to traditional artificial intelligence methods, we divide short text classification into three steps, namely preprocessing, feature selection and classifier comparison. In this paper, we have illustrated step-by-step how we approach our goals. Specifically, in feature selection, we compared the performance and robustness of the four methods of one-hot encoding, tf-idf weighting, word2vec and paragraph2vec, and in the classification part, we deliberately chose and compared Naive Bayes, Logistic Regression, Support Vector Machine, K-nearest Neighbor and Decision Tree as our classifiers. Then, we compared and analysed the classifiers horizontally with each other and vertically with feature selections. Regarding the datasets, we crawled more than 400,000 short text files from Shanghai and Shenzhen Stock Exchanges and manually labeled them into two classes, the big and the small. There are eight labels in the big class, and 59 labels in the small class.
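Of the four representations compared in this abstract, tf-idf weighting is the easiest to show in a few lines. The sketch below is an illustration only, assuming one common variant (term frequency normalized by document length, idf = log(N / df)); the abstract does not say which variant the authors used, and the three toy documents are invented.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Compute tf-idf weights for a list of tokenized documents.

    Assumes tf = count / document length and idf = log(N / df).
    Returns one {term: weight} dictionary per document.
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        # Count each term once per document for the document frequency.
        doc_freq.update(set(tokens))
    weights = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Invented short texts in the style of stock-exchange announcements.
docs = [
    "board approves dividend plan".split(),
    "board announces share buyback".split(),
    "company announces dividend increase".split(),
]
weights = tf_idf(docs)
```

In this toy corpus a term that occurs in only one document, such as "approves", receives a higher weight than "board", which occurs in two; this is the property that makes tf-idf more discriminative than one-hot encoding for short texts.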
Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei
2016-01-01
Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.
Extraction of Pharmacokinetic Evidence of Drug–Drug Interactions from the Literature
Kolchinsky, Artemy; Lourenço, Anália; Wu, Heng-Yi; Li, Lang; Rocha, Luis M.
2015-01-01
Drug-drug interaction (DDI) is a major cause of morbidity and mortality and a subject of intense scientific interest. Biomedical literature mining can aid DDI research by extracting evidence for large numbers of potential interactions from published literature and clinical databases. Though DDI is investigated in domains ranging in scale from intracellular biochemistry to human populations, literature mining has not been used to extract specific types of experimental evidence, which are reported differently for distinct experimental goals. We focus on pharmacokinetic evidence for DDI, essential for identifying causal mechanisms of putative interactions and as input for further pharmacological and pharmacoepidemiology investigations. We used manually curated corpora of PubMed abstracts and annotated sentences to evaluate the efficacy of literature mining on two tasks: first, identifying PubMed abstracts containing pharmacokinetic evidence of DDIs; second, extracting sentences containing such evidence from abstracts. We implemented a text mining pipeline and evaluated it using several linear classifiers and a variety of feature transforms. The most important textual features in the abstract and sentence classification tasks were analyzed. We also investigated the performance benefits of using features derived from PubMed metadata fields, various publicly available named entity recognizers, and pharmacokinetic dictionaries. Several classifiers performed very well in distinguishing relevant and irrelevant abstracts (reaching F1≈0.93, MCC≈0.74, iAUC≈0.99) and sentences (F1≈0.76, MCC≈0.65, iAUC≈0.83). We found that word bigram features were important for achieving optimal classifier performance and that features derived from Medical Subject Headings (MeSH) terms significantly improved abstract classification. 
We also found that some drug-related named entity recognition tools and dictionaries led to slight but significant improvements, especially in classification of evidence sentences. Based on our thorough analysis of classifiers and feature transforms and the high classification performance achieved, we demonstrate that literature mining can aid DDI discovery by supporting automatic extraction of specific types of experimental evidence. PMID:25961290
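The abstract above reports both F1 and the Matthews correlation coefficient (MCC). A short sketch of the standard definitions shows why the two can differ for the same classifier. The confusion-matrix counts below are hypothetical and are not taken from the study.

```python
import math


def f1_score(tp, fp, fn):
    """F1: harmonic mean of precision and recall (ignores true negatives)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, which uses all four cells."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical counts for a relevant/irrelevant abstract classifier.
tp, tn, fp, fn = 90, 80, 10, 20
f1 = f1_score(tp, fp, fn)
m = mcc(tp, tn, fp, fn)
```

With these counts F1 is about 0.86 while MCC is about 0.70. MCC also accounts for true negatives, so it is typically lower than F1 when the negative class is handled less well than the positive class.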
Optimel: Software for selecting the optimal method
NASA Astrophysics Data System (ADS)
Popova, Olga; Popov, Boris; Romanov, Dmitry; Evseeva, Marina
Optimel, software for selecting the optimal method, automates the selection of a solution method from the domain of optimization methods. Its practical novelty is that it saves time and money in exploratory studies whose objective is to select the most appropriate method for solving an optimization problem. Its theoretical novelty is that a new method of knowledge structuring was used to obtain the domain. The Optimel domain uses an extended set of methods and their properties, which makes it possible to identify the level of scientific studies, enhance the user's level of expertise, broaden the user's prospects, and open up new research objectives. Optimel can be used both in scientific research institutes and in educational institutions.
Atypical combinations and scientific impact.
Uzzi, Brian; Mukherjee, Satyam; Stringer, Michael; Jones, Ben
2013-10-25
Novelty is an essential feature of creative ideas, yet the building blocks of new ideas are often embodied in existing knowledge. From this perspective, balancing atypical knowledge with conventional knowledge may be critical to the link between innovativeness and impact. Our analysis of 17.9 million papers spanning all scientific fields suggests that science follows a nearly universal pattern: The highest-impact science is primarily grounded in exceptionally conventional combinations of prior work yet simultaneously features an intrusion of unusual combinations. Papers of this type were twice as likely to be highly cited works. Novel combinations of prior work are rare, yet teams are 37.7% more likely than solo authors to insert novel combinations into familiar knowledge domains.
On the Reconstruction of Text Phylogeny Trees: Evaluation and Analysis of Textual Relationships
Marmerola, Guilherme D.; Dias, Zanoni; Goldenstein, Siome; Rocha, Anderson
2016-01-01
Over the history of mankind, textual records change, sometimes due to mistakes during transcription and sometimes on purpose, as a way to rewrite facts and reinterpret history. There are several classical cases, such as the logarithmic tables, and the transmission of antique and medieval scholarship. Today, text documents are largely edited and redistributed on the Web. Articles on news portals and collaborative platforms (such as Wikipedia), source code, posts on social networks, and even scientific publications or literary works are some examples in which textual content can be subject to changes in an evolutionary process. In this scenario, given a set of near-duplicate documents, it is worthwhile to find which one is the original and the history of changes that created the whole set. Such functionality would have immediate applications in news tracking services, detection of plagiarism, textual criticism, and copyright enforcement, for instance. However, this is not an easy task, as textual features pointing to the documents’ evolutionary direction may not be evident and are often dataset dependent. Moreover, side information, such as time stamps, is neither always available nor reliable. In this paper, we propose a framework for reliably reconstructing text phylogeny trees and seamlessly exploring new approaches in a wide range of text-reuse scenarios. We employ distinct combinations of dissimilarity measures and reconstruction strategies within the proposed framework and evaluate each approach with extensive experiments, including a set of artificial near-duplicate documents with known phylogeny and documents collected from Wikipedia, whose modifications were made by Internet users. We also present results from qualitative experiments in two different applications: text plagiarism and reconstruction of evolutionary trees for manuscripts (stemmatology). PMID:27992446
Federal Register 2010, 2011, 2012, 2013, 2014
2011-08-05
...; The Science of Compassion--Future Directions in End-of-Life and Palliative Care SUMMARY: Notice is... science at the end-of-life. On August 11-12, the summit will feature keynote presentations, three plenary...), Department of Health and Human Services, will convene a scientific summit titled ``The Science of Compassion...
ERIC Educational Resources Information Center
Tekerci, Hacer; Kandir, Adalet
2017-01-01
Purpose: This study aimed to examine the effects of the Sense-Based Science Education Program on the scientific process skills of 60-66-month-old children. Research Methods: This study, which is experimental in character, used a pre-test/final-test/observing-test control-group experimental design together with qualitative research.…
ERIC Educational Resources Information Center
Bierschenk, Bernhard
Two kinds of perspectives governing the provision and preservation of knowledge, a universal and an ecological perspective, are discussed in this paper. In the first case, scientific observations are represented through a semantic interpretation of facts. This is illustrated with a series of experiments on semantic feature perception in the recall…
Nyström, Pär; Falck-Ytter, Terje; Gredebäck, Gustaf
2016-06-01
This article describes a new open source scientific workflow system, the TimeStudio Project, dedicated to the behavioral and brain sciences. The program is written in MATLAB and features a graphical user interface for the dynamic pipelining of computer algorithms developed as TimeStudio plugins. TimeStudio includes both a set of general plugins (for reading data files, modifying data structures, visualizing data structures, etc.) and a set of plugins specifically developed for the analysis of event-related eyetracking data as a proof of concept. It is possible to create custom plugins to integrate new or existing MATLAB code anywhere in a workflow, making TimeStudio a flexible workbench for organizing and performing a wide range of analyses. The system also features an integrated sharing and archiving tool for TimeStudio workflows, which can be used to share workflows both during the data analysis phase and after scientific publication. TimeStudio thus facilitates the reproduction and replication of scientific studies, increases the transparency of analyses, and reduces individual researchers' analysis workload. The project website ( http://timestudioproject.com ) contains the latest releases of TimeStudio, together with documentation and user forums.
Primary Teachers' beliefs about Scientific Creativity in the Classroom Context
NASA Astrophysics Data System (ADS)
Liu, Shu-Chiu; Lin, Huann-shyang
2014-07-01
While a number of studies have investigated people's perceptions or conceptions of creativity, there is a lack of studies looking into science teachers' views. The study aimed to explore the meanings of scientific creativity in the classroom context as perceived by a selective group of upper primary (Grades 3-6; student ages 8-12) science teachers (n = 16) in Taiwan. Using a self-report, open-ended questionnaire and follow-up interviews, the participants responded to questions as to (1) what quality, behaviours and abilities characterise a creative learner in their science classrooms, (2) what a science classroom should be like if it is to facilitate scientific creativity, and (3) whether and what particular elements of the inquiry approach are incorporated in such a classroom. The analyses revealed that the teachers captured the central features of creativity and proposed diverse ideas about how to foster creativity in school science, but seemed to overlook some aspects, such as convergent thinking, problem-finding, and linking the arts and science. These missing features are regarded as important for scientific creativity in contemporary research. The findings were discussed along with their implications for teacher education and future research.
Binder, Andrew R; Hillback, Elliott D; Brossard, Dominique
2016-04-01
Research indicates that uncertainty in science news stories affects public assessment of risk and uncertainty. However, the form in which uncertainty is presented may also affect people's risk and uncertainty assessments. For example, a news story that features an expert discussing both what is known and what is unknown about a topic may convey a different form of scientific uncertainty than a story that features two experts who hold conflicting opinions about the status of scientific knowledge of the topic, even when both stories contain the same information about knowledge and its boundaries. This study focuses on audience uncertainty and risk perceptions regarding the emerging science of nanotechnology by manipulating whether uncertainty in a news story about potential risks is attributed to expert sources in the form of caveats (individual uncertainty) or conflicting viewpoints (collective uncertainty). Results suggest that the type of uncertainty portrayed does not impact audience feelings of uncertainty or risk perceptions directly. Rather, the presentation of the story influences risk perceptions only among those who are highly deferent to scientific authority. Implications for risk communication theory and practice are discussed. © 2015 Society for Risk Analysis.
Liu, Yuanchao; Liu, Ming; Wang, Xin
2015-01-01
The objective of text clustering is to divide document collections into clusters based on the similarity between documents. In this paper, an extension-based feature modeling approach towards semantically sensitive text clustering is proposed along with the corresponding feature space construction and similarity computation method. By combining the similarity in traditional feature space and that in extension space, the adverse effects of the complexity and diversity of natural language can be addressed and clustering semantic sensitivity can be improved correspondingly. The generated clusters can be organized using different granularities. The experimental evaluations on well-known clustering algorithms and datasets have verified the effectiveness of our approach. PMID:25794172
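The abstract above describes combining similarity in the traditional feature space with similarity in an extension space. A minimal sketch of that idea follows; it is not the authors' method. The related-term table `RELATED`, the weight `alpha`, and the linear combination are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-count mappings."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def extend(tokens, related):
    """Project tokens into a hypothetical 'extension space' of related terms."""
    ext = Counter()
    for tok in tokens:
        for rel in related.get(tok, ()):
            ext[rel] += 1
    return ext


def combined_similarity(doc_a, doc_b, related, alpha=0.5):
    """Weighted sum of surface similarity and extension-space similarity.

    alpha is an assumed mixing weight, not a value from the paper.
    """
    base = cosine(Counter(doc_a), Counter(doc_b))
    ext = cosine(extend(doc_a, related), extend(doc_b, related))
    return alpha * base + (1.0 - alpha) * ext


# Hypothetical related-term table; a real system might derive it from a thesaurus.
RELATED = {"car": ["vehicle"], "automobile": ["vehicle"], "engine": ["motor"]}
doc1 = ["car", "engine"]
doc2 = ["automobile", "engine"]

print(cosine(Counter(doc1), Counter(doc2)))      # surface similarity only
print(combined_similarity(doc1, doc2, RELATED))  # surface + extension space
```

In this toy case the two documents share only one surface term, yet they match fully in the extension space, so the combined score is higher than the plain cosine score.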
Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text
Xin, Yu; Hochberg, Ephraim; Joshi, Rohit; Uzuner, Ozlem; Szolovits, Peter
2015-01-01
Objective Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxes. Moreover, training data for these automated approaches are often sparsely annotated at best. The authors target unsupervised learning for modeling clinical narrative text, aiming at improving both accuracy and interpretability. Methods The authors introduce a novel framework named subgraph augmented non-negative tensor factorization (SANTF). In addition to relying on atomic features (e.g., words in clinical narrative text), SANTF automatically mines higher-order features (e.g., relations of lymphoid cells expressing antigens) from clinical narrative text by converting sentences into a graph representation and identifying important subgraphs. The authors compose a tensor using patients, higher-order features, and atomic features as its respective modes. We then apply non-negative tensor factorization to cluster patients, and simultaneously identify latent groups of higher-order features that link to patient clusters, as in clinical guidelines where a panel of immunophenotypic features and laboratory results are used to specify diagnostic criteria. Results and Conclusion SANTF demonstrated over 10% improvement in averaged F-measure on patient clustering compared to widely used non-negative matrix factorization (NMF) and k-means clustering methods. Multiple baselines were established by modeling patient data using patient-by-features matrices with different feature configurations and then performing NMF or k-means to cluster patients. Feature analysis identified latent groups of higher-order features that lead to medical insights. We also found that the latent groups of atomic features help to better correlate the latent groups of higher-order features. PMID:25862765
Fall Take a Hike Features a New Poster Puzzler Challenge | Poster
The recent Take a Hike event, sponsored by Occupational Health Services, featured a new twist: A Poster Puzzler challenge courtesy of Scientific Publications, Graphics and Media. Participants were asked to identify words on six objects along the Hike path based on photographs that showed the objects with the words blurred out.
Students' Problem-Solving in Mechanics: Preference of a Process Based Model.
ERIC Educational Resources Information Center
Stavy, Ruth; And Others
Research in science and mathematics education has indicated that students often use inappropriate models for solving problems because they tend to mentally represent a problem according to surface features instead of referring to scientific concepts and features. The objective of the study reported in this paper was to determine whether 34 Israeli…
ERIC Educational Resources Information Center
Topolovcan, Tomislav
2016-01-01
This paper provides a critical analysis of art-based research in education, that is, in constructivist learning and teaching. It presents the methodological features and advantages of art-based research in terms of the axiological, ontological and epistemological features of the constructivist, participatory and critical scientific paradigm, and…
Vaccine adverse event text mining system for extracting features from vaccine safety reports.
Botsis, Taxiarchis; Buttolph, Thomas; Nguyen, Michael D; Winiecki, Scott; Woo, Emily Jane; Ball, Robert
2012-01-01
To develop and evaluate a text mining system for extracting key clinical features from vaccine adverse event reporting system (VAERS) narratives to aid in the automated review of adverse event reports. Based upon clinical significance to VAERS reviewing physicians, we defined the primary (diagnosis and cause of death) and secondary features (eg, symptoms) for extraction. We built a novel vaccine adverse event text mining (VaeTM) system based on a semantic text mining strategy. The performance of VaeTM was evaluated using a total of 300 VAERS reports in three sequential evaluations of 100 reports each. Moreover, we evaluated the VaeTM contribution to case classification; an information retrieval-based approach was used for the identification of anaphylaxis cases in a set of reports and was compared with two other methods: a dedicated text classifier and an online tool. The performance metrics of VaeTM were text mining metrics: recall, precision and F-measure. We also conducted a qualitative difference analysis and calculated sensitivity and specificity for classification of anaphylaxis cases based on the above three approaches. VaeTM performed best in extracting diagnosis, second level diagnosis, drug, vaccine, and lot number features (lenient F-measure in the third evaluation: 0.897, 0.817, 0.858, 0.874, and 0.914, respectively). In terms of case classification, high sensitivity was achieved (83.1%); this was equal to that of the text classifier (83.1%) and better than that of the online tool (40.7%). Our VaeTM implementation of a semantic text mining strategy shows promise in providing accurate and efficient extraction of key features from VAERS narratives.
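The evaluation metrics named in the abstract above (recall, precision, F-measure, sensitivity, specificity) follow standard definitions. The sketch below computes them from confusion-matrix counts. The counts are invented for illustration and are not taken from the VaeTM study.

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall, and balanced F-measure from extraction counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity (true positive rate) and specificity (true negative rate)."""
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Hypothetical counts, for illustration only.
p, r, f = precision_recall_f1(tp=8, fp=2, fn=2)
sens, spec = sensitivity_specificity(tp=9, fn=1, tn=80, fp=10)
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

Sensitivity and recall are the same quantity. The abstract uses "recall" for feature extraction and "sensitivity" for case classification.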
Federal Register 2010, 2011, 2012, 2013, 2014
2010-04-27
... Effectiveness of Proposed Rule Change To Remove a Feature and Revise Outdated Text Regarding Certain Execution... Proposed Rule Change The Exchange is proposing to eliminate a feature and revise outdated text regarding certain of its execution rules. The text of the proposed rule change is available on CBOE's Web site at...
Frequently Asked Questions | DOepatents
OSTI? Where can I find information about doing business with DOE? How can I find additional information Scientific and Technical Information (OSTI) to demonstrate the Department's contribution to scientific the patent application, full text, and other descriptive information accessible to the public. New
History Forum Addresses Creation/Evolution Controversy.
ERIC Educational Resources Information Center
Schweinsberg, John
1997-01-01
A series of programs entitled Creationism and Evolution: The History of a Controversy was presented at the University of Alabama in Huntsville. The controversy was addressed from an historical and sociological, rather than a scientific perspective. Speakers addressed the evolution of scientific creationism, ancient texts versus sedimentary rocks…
NASA Astrophysics Data System (ADS)
Vosniadou, Stella; Skopeliti, Irini
2017-10-01
The present research tested the hypothesis that the reading of science text can create new misconceptions in students with incongruent prior knowledge, and that these new misconceptions will be similar to the fragmented and synthetic conceptions obtained in prior developmental research. Ninety-nine third- and fifth-grade children read and recalled one of two texts that provided scientific or phenomenal explanations of the day/night cycle. All the participants gave explanations of the phenomenon in question prior to reading one of the texts and after they read it. The results showed that the participants who provided explanations of the day/night cycle at pretest incongruent with the scientific explanation recalled less information and generated more invalid inferences. An analysis of the participants' posttest explanations indicated that these readers formed new misconceptions similar to the fragmented and synthetic conceptions obtained in developmental research. The implications of the above for text comprehension and science education research are discussed.
NASA Astrophysics Data System (ADS)
Kintsakis, Athanassios M.; Psomopoulos, Fotis E.; Symeonidis, Andreas L.; Mitkas, Pericles A.
Hermes introduces a new "describe once, run anywhere" paradigm for the execution of bioinformatics workflows in hybrid cloud environments. It combines the traditional features of parallelization-enabled workflow management systems and of distributed computing platforms in a container-based approach. It offers seamless deployment, overcoming the burden of setting up and configuring the software and network requirements. Most importantly, Hermes fosters the reproducibility of scientific workflows by supporting standardization of the software execution environment, thus leading to consistent scientific workflow results and accelerating scientific output.
Bret, Patrice
2016-04-01
Eighteenth-century scientific translation was not just a linguistic or intellectual affair. It included numerous material aspects requiring a social organization to marshal the indispensable human and non-human actors. Paratexts and actors' correspondences provide a good observatory to get information about aspects such as shipments and routes, processes of translation and language acquisition (dictionaries, grammars and other helpful materials, such as translated works in both languages), texts acquisition and dissemination (including author's additions and corrections, oral presentations in academic meetings and announcements of forthcoming translations). The nature of scientific translation changed in France during the second half of the eighteenth century. Beside solitary translators, it also happened to become a collective enterprise, dedicated to providing abridgements (Collection académique, 1755-79) or enriching the learned journals with full translations of the most recent foreign texts (Guyton de Morveau's 'Bureau de traduction de Dijon', devoted to chemistry and mineralogy, 1781-90). That new trend clearly had a decisive influence on the nature of the scientific press itself. A way to set up science as a social activity in the provincial capital of Dijon, translation required a local and international network for acquiring the linguistic and scientific expertise, along with the original texts, as quickly as possible. Laboratory results and mineralogical observations were used to compare material facts (colour, odour, shape of crystals, etc.) with those described in the original text. By providing a double kind of validation - with both the experiments and the translations - the laboratory thus happened to play a major role in translation.
Text analysis devices, articles of manufacture, and text analysis methods
Turner, Alan E; Hetzler, Elizabeth G; Nakamura, Grant C
2013-05-28
Text analysis devices, articles of manufacture, and text analysis methods are described according to some aspects. In one aspect, a text analysis device includes processing circuitry configured to analyze initial text to generate a measurement basis usable in analysis of subsequent text, wherein the measurement basis comprises a plurality of measurement features from the initial text, a plurality of dimension anchors from the initial text and a plurality of associations of the measurement features with the dimension anchors, and wherein the processing circuitry is configured to access a viewpoint indicative of a perspective of interest of a user with respect to the analysis of the subsequent text, and wherein the processing circuitry is configured to use the viewpoint to generate the measurement basis.
Agile parallel bioinformatics workflow management using Pwrake.
Mishima, Hiroyuki; Sasaki, Kensaku; Tanaka, Masahiro; Tatebe, Osamu; Yoshiura, Koh-Ichiro
2011-09-08
In bioinformatics projects, scientific workflow systems are widely used to manage computational procedures. Full-featured workflow systems have been proposed to fulfil the demand for workflow management. However, such systems tend to be over-weighted for actual bioinformatics practices. We realize that quick deployment of cutting-edge software implementing advanced algorithms and data formats, and continuous adaptation to changes in computational resources and the environment are often prioritized in scientific workflow management. These features have a greater affinity with the agile software development method through iterative development phases after trial and error. Here, we show the application of a scientific workflow system Pwrake to bioinformatics workflows. Pwrake is a parallel workflow extension of Ruby's standard build tool Rake, the flexibility of which has been demonstrated in the astronomy domain. Therefore, we hypothesize that Pwrake also has advantages in actual bioinformatics workflows. We implemented the Pwrake workflows to process next generation sequencing data using the Genomic Analysis Toolkit (GATK) and Dindel. GATK and Dindel workflows are typical examples of sequential and parallel workflows, respectively. We found that in practice, actual scientific workflow development iterates over two phases, the workflow definition phase and the parameter adjustment phase. We introduced separate workflow definitions to help focus on each of the two developmental phases, as well as helper methods to simplify the descriptions. This approach increased iterative development efficiency. Moreover, we implemented combined workflows to demonstrate modularity of the GATK and Dindel workflows. Pwrake enables agile management of scientific workflows in the bioinformatics domain. The internal domain specific language design built on Ruby gives the flexibility of rakefiles for writing scientific workflows. 
Furthermore, readability and maintainability of rakefiles may facilitate sharing workflows among the scientific community. Workflows for GATK and Dindel are available at http://github.com/misshie/Workflows.
Agile parallel bioinformatics workflow management using Pwrake
2011-01-01
Background In bioinformatics projects, scientific workflow systems are widely used to manage computational procedures. Full-featured workflow systems have been proposed to fulfil the demand for workflow management. However, such systems tend to be over-weighted for actual bioinformatics practices. We realize that quick deployment of cutting-edge software implementing advanced algorithms and data formats, and continuous adaptation to changes in computational resources and the environment are often prioritized in scientific workflow management. These features have a greater affinity with the agile software development method through iterative development phases after trial and error. Here, we show the application of a scientific workflow system Pwrake to bioinformatics workflows. Pwrake is a parallel workflow extension of Ruby's standard build tool Rake, the flexibility of which has been demonstrated in the astronomy domain. Therefore, we hypothesize that Pwrake also has advantages in actual bioinformatics workflows. Findings We implemented the Pwrake workflows to process next generation sequencing data using the Genomic Analysis Toolkit (GATK) and Dindel. GATK and Dindel workflows are typical examples of sequential and parallel workflows, respectively. We found that in practice, actual scientific workflow development iterates over two phases, the workflow definition phase and the parameter adjustment phase. We introduced separate workflow definitions to help focus on each of the two developmental phases, as well as helper methods to simplify the descriptions. This approach increased iterative development efficiency. Moreover, we implemented combined workflows to demonstrate modularity of the GATK and Dindel workflows. Conclusions Pwrake enables agile management of scientific workflows in the bioinformatics domain. The internal domain specific language design built on Ruby gives the flexibility of rakefiles for writing scientific workflows. 
Furthermore, readability and maintainability of rakefiles may facilitate sharing workflows among the scientific community. Workflows for GATK and Dindel are available at http://github.com/misshie/Workflows. PMID:21899774
A Preliminary Investigation of the Influences of Refutation Text and Instructional Design
ERIC Educational Resources Information Center
Schroeder, Noah L.
2016-01-01
Teachers are often tasked with changing their students' conceptions about scientific topics. One strategy that has been found effective for conceptual change is the use of refutation text. However, reviewing the literature revealed that many practical questions around the use of refutation text have not been adequately addressed. A secondary issue is…
Teaching Students to Compose Informational Poetic Riddles to Further Scientific Understanding
ERIC Educational Resources Information Center
Frye, Elizabeth M.; Bradbury, Leslie; Gross, Lisa A.
2016-01-01
In most elementary schools, students spend more time reading and writing narrative texts and less time with informational texts. Yet, the Common Core State Standards advocate that informational texts comprise nearly half of K-8 students' entire academic reading, including content areas like science and social studies. The authors propose remixing…
Using Refutational Text in Mathematics Education
ERIC Educational Resources Information Center
Lem, Stephanie; Onghena, Patrick; Verschaffel, Lieven; Van Dooren, Wim
2017-01-01
Refutational text is one of the many instructional techniques that have been proposed to be used in education as a way to achieve effective learning. The aim of refutational text is to transform misconceptions into conceptions that are in line with current scientific concepts. This is done by explicitly stating a misconception, refuting it, and…
Processing and Representation of Arguments in One-Sided Texts about Disputed Topics
ERIC Educational Resources Information Center
Wolfe, Michael B.; Tanner, Shawna M.; Taylor, Andrew R.
2013-01-01
We examine students' processing and representation of arguments and counterarguments in one-sided scientific texts. In Experiment 1, students read texts about evolution and TV violence. Sentence reading times indicated that subjects slowed down reading to the extent that arguments were both more consistent, and inconsistent, with the text…
Global and Local Features Based Classification for Bleed-Through Removal
NASA Astrophysics Data System (ADS)
Hu, Xiangyu; Lin, Hui; Li, Shutao; Sun, Bin
2016-12-01
The text on one side of historical documents often seeps through and appears on the other side, so the bleed-through is a common problem in historical document images. It makes the document images hard to read and the text difficult to recognize. To improve the image quality and readability, the bleed-through has to be removed. This paper proposes a global and local features extraction based bleed-through removal method. The Gaussian mixture model is used to get the global features of the images. Local features are extracted by the patch around each pixel. Then, the extreme learning machine classifier is utilized to classify the scanned images into the foreground text and the bleed-through component. Experimental results on real document image datasets show that the proposed method outperforms the state-of-the-art bleed-through removal methods and preserves the text strokes well.
SparkText: Biomedical Text Mining on Big Data Framework.
Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M
Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.
SparkText: Biomedical Text Mining on Big Data Framework
He, Karen Y.; Wang, Kai
2016-01-01
Background Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652
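Naïve Bayes is one of the classifiers the SparkText abstract names for predicting cancer type from article text. The sketch below is a small multinomial Naïve Bayes classifier with add-one smoothing in plain Python. It is not SparkText's implementation, which runs on Apache Spark, and the training documents are invented toy data.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_docs):
    """Collect per-class document counts and term counts from (tokens, label) pairs."""
    class_counts = Counter()
    term_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in labeled_docs:
        class_counts[label] += 1
        term_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, term_counts, vocab


def predict(model, tokens):
    """Return the class with the highest log-posterior (add-one smoothing)."""
    class_counts, term_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total_docs)
        total_terms = sum(term_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore terms never seen in training
            score += math.log(
                (term_counts[label][tok] + 1) / (total_terms + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training data, invented for illustration.
TRAIN = [
    (["brca1", "mammography", "tumor"], "breast"),
    (["psa", "prostatectomy", "tumor"], "prostate"),
    (["smoking", "bronchoscopy", "tumor"], "lung"),
]
model = train_naive_bayes(TRAIN)
print(predict(model, ["psa", "tumor"]))
print(predict(model, ["bronchoscopy", "smoking"]))
```

A distributed framework would compute the same counts in parallel across partitions of the corpus; the scoring rule stays the same.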
NASA scientific and technical information for the 1990s
NASA Technical Reports Server (NTRS)
Cotter, Gladys A.
1990-01-01
Projections for NASA scientific and technical information (STI) in the 1990s are outlined. NASA STI for the 1990s will maintain a quality bibliographic and full-text database, emphasizing electronic input and products supplemented by networked access to a wide variety of sources, particularly numeric databases.
Using JournalMap to improve discovery and visualization of rangeland scientific knowledge
USDA-ARS's Scientific Manuscript database
Most of the ecological research conducted around the world is tied to specific places; however, that location information is locked up in the text and figures of scientific articles in myriad forms that are not easily searchable. While access to ecological literature has improved dramatically in the...
Using text mining techniques to extract phenotypic information from the PhenoCHF corpus
2015-01-01
Background Phenotypic information locked away in unstructured narrative text presents significant barriers to information accessibility, both for clinical practitioners and for computerised applications used for clinical research purposes. Text mining (TM) techniques have previously been applied successfully to extract different types of information from text in the biomedical domain. They have the potential to be extended to allow the extraction of information relating to phenotypes from free text. Methods To stimulate the development of TM systems that are able to extract phenotypic information from text, we have created a new corpus (PhenoCHF) that is annotated by domain experts with several types of phenotypic information relating to congestive heart failure. To ensure that systems developed using the corpus are robust to multiple text types, it integrates text from heterogeneous sources, i.e., electronic health records (EHRs) and scientific articles from the literature. We have developed several different phenotype extraction methods to demonstrate the utility of the corpus, and tested these methods on a further corpus, i.e., ShARe/CLEF 2013. Results Evaluation of our automated methods showed that PhenoCHF can facilitate the training of reliable phenotype extraction systems, which are robust to variations in text type. These results have been reinforced by evaluating our trained systems on the ShARe/CLEF corpus, which contains clinical records of various types. Like other studies within the biomedical domain, we found that solutions based on conditional random fields produced the best results, when coupled with a rich feature set. Conclusions PhenoCHF is the first annotated corpus aimed at encoding detailed phenotypic information. The unique heterogeneous composition of the corpus has been shown to be advantageous in the training of systems that can accurately extract phenotypic information from a range of different text types. 
Although the scope of our annotation is currently limited to a single disease, the promising results achieved can stimulate further work into the extraction of phenotypic information for other diseases. The PhenoCHF annotation guidelines and annotations are publicly available at https://code.google.com/p/phenochf-corpus. PMID:26099853
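The abstract above reports that conditional random fields worked best when coupled with a rich feature set, but it does not list the features. The sketch below shows the kind of per-token feature function commonly passed to a CRF sequence labeller. Every feature name here is an assumption chosen for illustration, not the PhenoCHF feature set.

```python
def token_features(tokens, i):
    """Build a feature dictionary for the token at position i (hypothetical features)."""
    tok = tokens[i]
    return {
        "lower": tok.lower(),
        "suffix3": tok[-3:].lower(),
        "is_capitalized": tok[:1].isupper(),
        "has_digit": any(c.isdigit() for c in tok),
        # Context features let the model use neighbouring words.
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def sentence_features(tokens):
    """Feature dictionaries for every token in a sentence, in order."""
    return [token_features(tokens, i) for i in range(len(tokens))]


# Invented example sentence, not drawn from the corpus.
sent = ["Patient", "has", "ejection", "fraction", "of", "35%"]
feats = sentence_features(sent)
for tok, f in zip(sent, feats):
    print(tok, f)
```

A list of such dictionaries per sentence, paired with a list of labels per token, is the usual input shape for CRF toolkits.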
Using text mining techniques to extract phenotypic information from the PhenoCHF corpus.
Alnazzawi, Noha; Thompson, Paul; Batista-Navarro, Riza; Ananiadou, Sophia
2015-01-01
Phenotypic information locked away in unstructured narrative text presents significant barriers to information accessibility, both for clinical practitioners and for computerised applications used for clinical research purposes. Text mining (TM) techniques have previously been applied successfully to extract different types of information from text in the biomedical domain. They have the potential to be extended to allow the extraction of information relating to phenotypes from free text. To stimulate the development of TM systems that are able to extract phenotypic information from text, we have created a new corpus (PhenoCHF) that is annotated by domain experts with several types of phenotypic information relating to congestive heart failure. To ensure that systems developed using the corpus are robust to multiple text types, it integrates text from heterogeneous sources, i.e., electronic health records (EHRs) and scientific articles from the literature. We have developed several different phenotype extraction methods to demonstrate the utility of the corpus, and tested these methods on a further corpus, i.e., ShARe/CLEF 2013. Evaluation of our automated methods showed that PhenoCHF can facilitate the training of reliable phenotype extraction systems, which are robust to variations in text type. These results have been reinforced by evaluating our trained systems on the ShARe/CLEF corpus, which contains clinical records of various types. Like other studies within the biomedical domain, we found that solutions based on conditional random fields produced the best results, when coupled with a rich feature set. PhenoCHF is the first annotated corpus aimed at encoding detailed phenotypic information. The unique heterogeneous composition of the corpus has been shown to be advantageous in the training of systems that can accurately extract phenotypic information from a range of different text types. 
Although the scope of our annotation is currently limited to a single disease, the promising results achieved can stimulate further work into the extraction of phenotypic information for other diseases. The PhenoCHF annotation guidelines and annotations are publicly available at https://code.google.com/p/phenochf-corpus.
Idea Paper: The Lifecycle of Software for Scientific Simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dubey, Anshu; McInnes, Lois C.
The software lifecycle is a well-researched topic that has produced many models to meet the needs of different types of software projects. However, one class of projects, software development for scientific computing, has received relatively little attention from lifecycle researchers. In particular, software for end-to-end computations for obtaining scientific results has received few lifecycle proposals and no formalization of a development model. An examination of development approaches employed by the teams implementing large multicomponent codes reveals a great deal of similarity in their strategies. This idea paper formalizes these related approaches into a lifecycle model for end-to-end scientific application software, featuring loose coupling between submodels for development of infrastructure and scientific capability. We also invite input from stakeholders to converge on a model that captures the complexity of this development process and provides needed lifecycle guidance to the scientific software community.
Reading for tracing evidence: developing scientific knowledge through science text
NASA Astrophysics Data System (ADS)
Probosari, R. M.; Widyastuti, F.; Sajidan, S.; Suranto, S.; Prayitno, B. A.
2018-05-01
The purposes of this study were to investigate students’ learning progression in reading activity and science concept comprehension, and how they apply it in scientific communication in the classroom. Fifty-nine biology education students participated in this study. This classroom research was developed to portray students’ reading activity, factors affecting reading comprehension, and the development of reading motivation. Qualitative analysis was used to describe the whole activity, including the instruction, process, and product of reading activity. The results indicated that each student has their own way of interpreting the information from scientific text, but generally, they can filter and apply it in their arguments as part of reasoning and evidence. The findings can be used to direct reading activity toward the goal of inquiry in order to support the nature of reading as evidence.
Cole, James C.; Larson, Ed; Farmer, Lang; Kellogg, Karl S.
2008-01-01
The report contains the illustrated guidebook that was used for the fall field trip of the Colorado Scientific Society on September 6-7, 2008. It summarizes new information about the Tertiary geologic history of the northern Front Range and the Never Summer Mountains, particularly the late Oligocene volcanic and intrusive rocks designated the Braddock Peak complex. Minor modifications were made in response to technical reviews by D.J. Lidke and C.A. Ruleman (U.S. Geological Survey) regarding clarity and consistency, and text editing by M.A. Kidd. However, the text remains essentially similar to the guidebook that was circulated to the participants on the Colorado Scientific Society 2008 field trip. Several notes were added following the trip (as indicated) to address developments since the guidebook was written.
Electromagnetic Induction Rediscovered Using Original Texts.
ERIC Educational Resources Information Center
Barth, Michael
2000-01-01
Describes a teaching unit on electromagnetic induction using historic texts. Uses some of Faraday's diary entries from 1831 to introduce the phenomenon of electromagnetic induction and teach about the properties of electricity, of taking conclusions from experiment, and scientific methodology. (ASK)
ERIC Educational Resources Information Center
Svedholm, Annika M.; Lindeman, Marjaana
2013-01-01
Lay conceptions of energy often conflict with scientific knowledge, hinder science learning and scientific literacy, and provide a basis for ungrounded beliefs. In a sample of Finnish upper secondary school students, energy was attributed with features of living and animate beings and thought of as a mental property. These ontologically confused…
Overview of machine vision methods in x-ray imaging and microtomography
NASA Astrophysics Data System (ADS)
Buzmakov, Alexey; Zolotov, Denis; Chukalina, Marina; Nikolaev, Dmitry; Gladkov, Andrey; Ingacheva, Anastasia; Yakimchuk, Ivan; Asadchikov, Victor
2018-04-01
Digital X-ray imaging has become widely used in science, medicine, and non-destructive testing. This allows modern digital image analysis to be used for automatic information extraction and interpretation. We give a short review of applications of machine vision in scientific X-ray imaging and microtomography, including image processing, feature detection and extraction, image compression to increase camera throughput, microtomography reconstruction, visualization, and setup adjustment.
Structuring supplemental materials in support of reproducibility.
Greenbaum, Dov; Rozowsky, Joel; Stodden, Victoria; Gerstein, Mark
2017-04-05
Supplements are increasingly important to the scientific record, particularly in genomics. However, they are often underutilized. Optimally, supplements should make results findable, accessible, interoperable, and reusable (i.e., "FAIR"). Moreover, properly off-loading to them the data and detail in a paper could make the main text more readable. We propose a hierarchical organization for supplements, with some parts paralleling and "shadowing" the main text and other elements branching off from it, and we suggest a specific formatting to make this structure explicit. Furthermore, sections of the supplement could be presented in multiple scientific "dialects", including machine-readable and lay-friendly formats.
[The role of Hunayn, physician and translator].
Habbi, J
1994-01-01
Hunayn ibn Ishāq is one of the most important translators of scientific and medical texts in the 'abbāsid era. He played a fundamental role in the transmission of Greek medical science to the Arab world, because of his deep knowledge of the Syriac, Arabic and Persian languages. He translated about 300 texts, especially medical ones, and he founded a school in which many disciples were instructed in the art of translation, like his son Ishāq and his nephew Hubays. Hunayn composed a lexicon of scientific terminology, using a new method of translation which represents his great innovation.
Incorporating Feature-Based Annotations into Automatically Generated Knowledge Representations
NASA Astrophysics Data System (ADS)
Lumb, L. I.; Lederman, J. I.; Aldridge, K. D.
2006-12-01
Earth Science Markup Language (ESML) is efficient and effective in representing scientific data in an XML-based formalism. However, features of the data being represented are not accounted for in ESML. Such features might derive from events (e.g., a gap in data collection due to instrument servicing), identifications (e.g., a scientifically interesting area/volume in an image), or some other source. In order to account for features in an ESML context, we consider them from the perspective of annotation, i.e., the addition of information to existing documents without changing the originals. Although it is possible to extend ESML to incorporate feature-based annotations internally (e.g., by extending the XML schema for ESML), there are a number of complicating factors that we identify. Rather than pursuing the ESML-extension approach, we focus on an external representation for feature-based annotations via XML Pointer Language (XPointer). In previous work (Lumb & Aldridge, HPCS 2006, IEEE, doi:10.1109/HPCS.2006.26), we have shown that it is possible to extract relationships from ESML-based representations, and capture the results in the Resource Description Framework (RDF). Thus we explore and report on this same requirement for XPointer-based annotations of ESML representations. As in our past efforts, the Global Geodynamics Project (GGP) allows us to illustrate with a real-world example this approach for introducing annotations into automatically generated knowledge representations.
ERIC Educational Resources Information Center
Sugai, George; Horner, Robert H.
2009-01-01
The Individuals with Disabilities Education Act and No Child Left Behind emphasize the use of scientifically based research to improve outcomes for students. From this emphasis, response-to-intervention has evolved. We present one perspective on the defining features of response-to-intervention and application of those features to school-wide…
Black, Maureen M.; Saavedra, Jose M.
2016-01-01
Interventions targeting parenting-focused modifiable factors to prevent obesity and promote healthy growth in the first 1000 days of life are needed. Scale-up of interventions to global populations is necessary to reverse trends in weight status among infants and toddlers, and large-scale dissemination will require understanding of effective strategies. Utilizing nutrition education theories, this paper describes the design of a digital-based nutrition guidance system targeted to first-time mothers to prevent obesity during the first two years. The multicomponent system consists of scientifically substantiated content, tools, and telephone-based professional support delivered in an anticipatory and sequential manner via the internet, email, and text messages, focusing on educational modules addressing the modifiable factors associated with childhood obesity. Digital delivery formats leverage consumer media trends and provide the opportunity for scale-up, unavailable to previous interventions reliant on resource-heavy clinic and home-based counseling. Designed initially for use in the United States, this system's core features are applicable to all contexts and constitute an approach fostering healthy growth, not just obesity prevention. The multicomponent features, combined with a global concern for optimal growth and positive trends in mobile internet use, represent this system's future potential to effect change in nutrition practice in developing countries. PMID:27635257
Escape Excel: A tool for preventing gene symbol and accession conversion errors.
Welsh, Eric A; Stewart, Paul A; Kuenzi, Brent M; Eschrich, James A
2017-01-01
Microsoft Excel automatically converts certain gene symbols, database accessions, and other alphanumeric text into dates, scientific notation, and other numerical representations. These conversions lead to subsequent, irreversible, corruption of the imported text. A recent survey of popular genomic literature estimates that one-fifth of all papers with supplementary gene lists suffer from this issue. Here, we present an open-source tool, Escape Excel, which prevents these erroneous conversions by generating an escaped text file that can be safely imported into Excel. Escape Excel is implemented in a variety of formats (http://www.github.com/pstew/escape_excel), including a command line based Perl script, a Windows-only Excel Add-In, an OS X drag-and-drop application, a simple web-server, and as a Galaxy web environment interface. Test server implementations are accessible as a Galaxy interface (http://apostl.moffitt.org) and simple non-Galaxy web server (http://apostl.moffitt.org:8000/). Escape Excel detects and escapes a wide variety of problematic text strings so that they are not erroneously converted into other representations upon importation into Excel. Examples of problematic strings include date-like strings, time-like strings, leading zeroes in front of numbers, and long numeric and alphanumeric identifiers that should not be automatically converted into scientific notation. It is hoped that greater awareness of these potential data corruption issues, together with diligent escaping of text files prior to importation into Excel, will help to reduce the amount of Excel-corrupted data in scientific analyses and publications.
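The kind of detection and escaping described in this abstract can be illustrated with a minimal Python sketch. It is not the Escape Excel code: the patterns below are a small, hypothetical subset of the problematic strings the abstract lists, and wrapping a field as `="..."` is assumed here as one common way to make Excel keep a value as text on import.

```python
import re

# Hypothetical, deliberately incomplete patterns for strings that Excel
# tends to convert on import; the real tool covers many more cases.
PATTERNS = [
    # Date-like gene symbols such as SEPT2, MARCH1, DEC1
    re.compile(r"^(?:JAN|FEB|MAR|MARCH|APR|MAY|JUN|JUL|AUG|SEP|SEPT|OCT|NOV|DEC)-?\d{1,2}$",
               re.IGNORECASE),
    # Identifiers that look like scientific notation, e.g. 2310009E13
    re.compile(r"^\d+E\d+$", re.IGNORECASE),
    # Numbers with leading zeroes, e.g. 00123
    re.compile(r"^0\d+$"),
    # Long numeric identifiers (12 or more digits)
    re.compile(r"^\d{12,}$"),
]


def needs_escape(field: str) -> bool:
    """Return True if the field matches one of the illustrative patterns."""
    return any(p.match(field) for p in PATTERNS)


def escape_field(field: str) -> str:
    """Wrap a risky field as ="..." (assumed escaping scheme), doubling inner quotes."""
    if needs_escape(field):
        return '="' + field.replace('"', '""') + '"'
    return field


def escape_line(line: str, sep: str = "\t") -> str:
    """Escape every field of one delimited text line."""
    return sep.join(escape_field(f) for f in line.split(sep))


if __name__ == "__main__":
    print(escape_line("MARCH1\tTP53\t0042"))
```

A real workflow would apply `escape_line` to every row of a tab-delimited file before importing it; ordinary symbols such as TP53 pass through unchanged.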
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buican, I.; Hriscu, V.; Amador, M.
For the past four and a half years, the International Communication Committee at Los Alamos National Laboratory has been working to develop a set of guidelines for writing technical and scientific documents in International English, that is, English for those whose native language is not English. Originally designed for documents intended for presentation in English to an international audience of technical experts, the International English guidelines apply equally well to the preparation of English text for translation. This is the second workshop in a series devoted to the topic of translation. The authors focus on the advantages of using International English, rather than various methods of simplifying language, to prepare scientific and technical text for translation.
... and health departments, description of several norovirus surveillance systems... Resources & References Scientific articles and educational materials related to norovirus... Multimedia Lists norovirus web features, podcasts, videos, infographics and web widget... Norovirus ...
Set of Frequent Word Item sets as Feature Representation for Text with Indonesian Slang
NASA Astrophysics Data System (ADS)
Sa'adillah Maylawati, Dian; Putri Saptawati, G. A.
2017-01-01
Indonesian slang is commonly used in social media. Due to its unstructured syntax, it is difficult to extract features based on Indonesian grammar for text mining. We therefore propose the Set of Frequent Word Item sets (SFWI) as a text representation that is considered a good match for Indonesian slang. In addition, SFWI is able to keep the meaning of Indonesian slang with regard to the order of appearance in a sentence. We use the FP-Growth algorithm, with a sentence-separation function added to the algorithm, to extract the SFWI features. The experiments were done with text data from social media such as Facebook, Twitter, and personal websites. The results of the experiments show that Indonesian slang was more correctly interpreted based on SFWI.
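The idea of frequent word itemsets with sentences as transactions can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' method: it uses brute-force enumeration instead of FP-Growth, it ignores word order within a sentence, and the function name, thresholds, and sample sentences are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_word_itemsets(sentences, min_support=2, max_size=3):
    """Count word itemsets per sentence and keep those meeting min_support.

    Each sentence is treated as one transaction. Itemsets are stored as
    alphabetically sorted tuples, so word order is NOT preserved here.
    """
    counts = Counter()
    for sentence in sentences:
        words = sorted(set(sentence.lower().split()))
        for size in range(1, max_size + 1):
            for itemset in combinations(words, size):
                counts[itemset] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


if __name__ == "__main__":
    # Made-up example sentences, for illustration only.
    sample = [
        "gue lagi males banget",
        "gue males kerja",
        "lagi males banget nih",
    ]
    for itemset, support in sorted(frequent_word_itemsets(sample).items()):
        print(itemset, support)
```

Brute-force enumeration grows quickly with sentence length, which is one reason a pattern-growth algorithm such as FP-Growth is preferred on real corpora.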
NASA Astrophysics Data System (ADS)
Rogers, L. D.; Valderrama Graff, P.; Bandfield, J. L.; Christensen, P. R.; Klug, S. L.; Deva, B.; Capages, C.
2007-12-01
The Mars Public Mapping Project is a web-based education and public outreach tool developed by the Mars Space Flight Facility at Arizona State University. This tool allows the general public to identify and map geologic features on Mars, utilizing Thermal Emission Imaging System (THEMIS) visible images, allowing public participation in authentic scientific research. In addition, participants are able to rate each image (based on a 1 to 5 star scale) to help build a catalog of some of the more appealing and interesting martian surface features. Once participants have identified observable features in an image, they are able to view a map of the global distribution of the many geologic features they just identified. This automatic feedback, through a global distribution map, allows participants to see how their answers compare to the answers of other participants. Participants check boxes "yes, no, or not sure" for each feature that is listed on the Mars Public Mapping Project web page, including surface geologic features such as gullies, sand dunes, dust devil tracks, wind streaks, lava flows, several types of craters, and layers. Each type of feature has a quick and easily accessible description and example image. When a participant moves their mouse over each example thumbnail image, a window pops up with a picture and a description of the feature. This provides a form of "on the job training" for the participants that can vary with their background level. For users who are more comfortable with Mars geology, there is also an advanced feature identification section accessible by a drop down menu. This includes additional features that may be identified, such as streamlined islands, valley networks, chaotic terrain, yardangs, and dark slope streaks. The Mars Public Mapping Project achieves several goals: 1) It engages the public in a manner that encourages active participation in scientific research and learning about geologic features and processes. 
2) It helps to build a mappable database that can be used by researchers (and the public in general) to quickly access image based data that contains particular feature types. 3) It builds a searchable database of images containing specific geologic features that the public deem to be visually appealing. Other education and public outreach programs at the Mars Space Flight Facility, such as the Rock Around the World and the Mars Student Imaging Project, have shown an increase in demand for programs that allow "kids of all ages" to participate in authentic scientific research. The Mars Public Mapping Project is a broadly accessible program that continues this theme by building a set of activities that is useful for both the public and scientists.
Abroms, Lorien C; Ahuja, Meenakshi; Kodl, Yvonne; Thaweethai, Lalida; Sims, Justin; Winickoff, Jonathan P; Windsor, Richard A
2012-01-01
Text messaging programs on mobile phones have shown some promise in helping people quit smoking. Text2Quit is an automated, personalized, and interactive mobile health program that sends text messages and e-mails timed around a participant's quit date over the course of 3 months. The text messages include pre- and post-quit educational messages, peer ex-smoker messages, medication reminders and relapse messages, and multiple opportunities for interaction. Study participants were university students (N = 23) enrolled in the Text2Quit program. Participants were surveyed at baseline and at 2 and 4 weeks after enrollment. The majority of participants agreed that they liked the program at 2 and 4 weeks after enrollment (90.5% and 82.3%, respectively). Support for text messages was found to be moderate and higher than that of the e-mail and web components. Of participants, 75% reported reading most or all of the texts. On average, users made 11.8 responses to the texts over a 4-week period, although responses declined after the quit date. The interactive feature for tracking cigarettes was the most used interactive feature, followed by the craving trivia game. This pilot test provides some support for the Text2Quit program. A future iteration of the program will include additional tracking features in both the pre-quit and post-quit protocols and an easier entry into the not-quit protocol. Future studies are recommended that identify the value of the interactive and personalized features that characterize this program.
ABROMS, LORIEN C.; AHUJA, MEENAKSHI; KODL, YVONNE; THAWEETHAI, LALIDA; SIMS, JUSTIN; WINICKOFF, JONATHAN; WINDSOR, RICHARD A.
2012-01-01
Text messaging programs on mobile phones have shown some promise in helping people quit smoking. Text2Quit is an automated, personalized and interactive mobile health program that sends text messages and emails timed around a participant’s quit date over the course of 3 months. The text messages include pre- and post-quit educational messages, peer ex-smoker messages, medication reminders and relapse messages, as well as multiple opportunities for interaction. Study participants were university students (n=23) enrolled in the Text2Quit program. Participants were surveyed at baseline and at 2 and 4 weeks post-enrollment. The vast majority of participants agreed that they liked the program at 2 and 4 weeks post-enrollment (90.5% and 82.3%, respectively). Support for text messages was found to be moderate, and higher than that of the email and web components. Seventy-five percent of participants reported reading most or all of the texts. On average, users made 11.8 responses to the texts over a 4 week period, although responses declined following the quit date. The interactive feature for tracking cigarettes was the most used interactive feature, followed by the craving trivia game. This pilot test provides some support for the Text2Quit program. A future iteration of the program will include additional tracking features in both the pre-quit and post-quit protocol and an easier entry into the not-quit protocol. Future studies are recommended that identify the value of the interactive and personalized features that characterize this program. PMID:22548598
The Effects of Text Analysis on Drafting and Justifying Research Questions
ERIC Educational Resources Information Center
Padilla, Maria Antonia; Solorzano, Wendy Guadalupe; Pacheco, Virginia
2009-01-01
Introduction: A correspondence has been seen between the level at which one can read scientific texts and his/her performance in writing this type of texts. Besides being able to read at the most complex levels, formulating research problems requires explicit training in writing. The objective of the present study was to evaluate whether…
Effects of Text-Belief Consistency and Reading Task on the Strategic Validation of Multiple Texts
ERIC Educational Resources Information Center
Maier, Johanna; Richter, Tobias
2016-01-01
In the comprehension of multiple controversial scientific texts, readers with strong prior beliefs tend to construct a one-sided mental representation that is biased towards belief-consistent information. In the present study, we examined whether an argument in contrast to a summary task instruction can increase the resource allocation to and…
The Interplay of Firsthand and Text-Based Investigations in Science Education. Ciera Report.
ERIC Educational Resources Information Center
Palincsar, Annemarie Sullivan; Magnusson, Shirley J.
This paper presents the results of a study concerning the use of text in support of firsthand scientific inquiry instruction in the early elementary grades. A partial transcript of two teaching sessions in which an expert classroom teacher incorporated text into her inquiry instruction is investigated. The knowledge gained from these sessions…
Hancock, Matthew C; Magnan, Jerry F
2016-10-01
In the assessment of nodules in CT scans of the lungs, a number of image-derived features are diagnostically relevant. Currently, many of these features are defined only qualitatively, so they are difficult to quantify from first principles. Nevertheless, these features (through their qualitative definitions and interpretations thereof) are often quantified via a variety of mathematical methods for the purpose of computer-aided diagnosis (CAD). To determine the potential usefulness of quantified diagnostic image features as inputs to a CAD system, we investigate the predictive capability of statistical learning methods for classifying nodule malignancy. We utilize the Lung Image Database Consortium dataset and only employ the radiologist-assigned diagnostic feature values for the lung nodules therein, as well as our derived estimates of the diameter and volume of the nodules from the radiologists' annotations. We calculate theoretical upper bounds on the classification accuracy that are achievable by an ideal classifier that only uses the radiologist-assigned feature values, and we obtain an accuracy of 85.74 [Formula: see text], which is, on average, 4.43% below the theoretical maximum of 90.17%. The corresponding area-under-the-curve (AUC) score is 0.932 ([Formula: see text]), which increases to 0.949 ([Formula: see text]) when diameter and volume features are included and has an accuracy of 88.08 [Formula: see text]. Our results are comparable to those in the literature that use algorithmically derived image-based features, which supports our hypothesis that lung nodules can be classified as malignant or benign using only quantified, diagnostic image features, and indicates the competitiveness of this approach. 
We also analyze how the classification accuracy depends on specific features and feature subsets, and we rank the features according to their predictive power, statistically demonstrating the top four to be spiculation, lobulation, subtlety, and calcification.
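One common way to bound the accuracy of any classifier that sees only discrete feature values is sketched below in Python. It assumes the bound comes from label conflicts among identical feature vectors; whether this matches the authors' calculation is an assumption, and the function name and toy data are hypothetical.

```python
from collections import Counter, defaultdict


def accuracy_upper_bound(feature_rows, labels):
    """Best accuracy achievable by a deterministic classifier on this dataset.

    Samples with identical feature vectors must receive the same prediction,
    so the best possible choice is the majority label within each group.
    """
    by_features = defaultdict(Counter)
    for row, label in zip(feature_rows, labels):
        by_features[tuple(row)][label] += 1
    best_correct = sum(max(c.values()) for c in by_features.values())
    return best_correct / len(labels)


if __name__ == "__main__":
    # Toy data: (spiculation, lobulation) ratings with malignancy labels.
    rows = [(5, 1), (5, 1), (5, 1), (2, 3), (2, 3)]
    labels = ["malignant", "malignant", "benign", "benign", "benign"]
    print(accuracy_upper_bound(rows, labels))
```

In the toy data, one of the three nodules rated (5, 1) carries a conflicting label, so at most four of the five samples can be classified correctly.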
Lu, Yingjie
2013-01-01
To facilitate patient involvement in online health communities and help patients obtain the informational and emotional support they need, this paper proposes a topic identification approach for automatically identifying the topics of health-related messages in an online health community, thus assisting patients in efficiently reaching the messages most relevant to their queries. A feature-based classification framework was presented for automatic topic identification in our study. We first collected the messages related to some predefined topics in an online health community. Then we combined three different types of features (n-gram-based features, domain-specific features, and sentiment features) to build four feature sets for health-related text representation. Finally, three different text classification techniques (C4.5, Naïve Bayes, and SVM) were adopted to evaluate our topic classification model. By comparing different feature sets and different classification techniques, we found that n-gram-based features, domain-specific features, and sentiment features were all effective in distinguishing different types of health-related topics. In addition, a feature reduction technique based on information gain was also effective in improving the topic classification performance. In terms of classification techniques, SVM significantly outperformed C4.5 and Naïve Bayes. The experimental results demonstrated that the proposed approach could identify the topics of online health-related messages efficiently.
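Feature reduction based on information gain can be sketched as follows. This is a minimal Python illustration that treats each message as a set of tokens and scores a term by how much knowing its presence reduces label entropy; the function names, toy messages, and topic labels are hypothetical and not taken from the study.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(docs, labels, term):
    """Entropy reduction from splitting documents on presence of `term`."""
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    total = len(labels)
    conditional = sum(len(part) / total * entropy(part)
                      for part in (with_term, without_term) if part)
    return entropy(labels) - conditional


if __name__ == "__main__":
    # Toy messages as token sets, with made-up topic labels.
    docs = [{"pain", "dose"}, {"dose", "pill"}, {"sad", "alone"}, {"sad", "hope"}]
    labels = ["treatment", "treatment", "emotional", "emotional"]
    vocabulary = sorted(set().union(*docs))
    ranked = sorted(vocabulary,
                    key=lambda t: information_gain(docs, labels, t),
                    reverse=True)
    print(ranked)
```

Keeping only the top-ranked terms is the reduction step; the cut-off is a tuning choice that the abstract does not specify.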
ERIC Educational Resources Information Center
Balluerka, Nekane
1995-01-01
Effects of 3 different instructional aids on the acquisition of information from a scientific passage were studied with 104 Spanish undergraduates. Written instructions, preparing a written outline, and seeing an illustration all led to higher performance. The outline condition led to the highest performance for questions requiring information…
Dynamic Framing in the Communication of Scientific Research: Texts and Interactions
ERIC Educational Resources Information Center
Davis, Pryce R.; Russ, Rosemary S.
2015-01-01
The fields of science education and science communication share the overarching goal of helping non-experts and non-members of the professional science community develop knowledge of the content and processes of scientific research. However, the specific audiences, methods, and aims employed in the two fields have evolved quite differently and as…
Attitude, Certainty and Allusions to Common Knowledge in Scientific Research Articles
ERIC Educational Resources Information Center
Koutsantoni, Dimitra
2004-01-01
Acceptance of claims made in scientific research articles depends on the "stance" authors take and their resources for "appraisal" (Martin and White, http://www.grammatics.com/appraisal). "Stance" has been defined as "the ways authors project themselves into their texts to communicate their relationship to subject matter and the readers",…
Lexical Cohesion and Specialized Knowledge in Science and Popular Science Texts.
ERIC Educational Resources Information Center
Myers, Greg
1991-01-01
Examines cohesion in the introductions to some scientific articles and compares the patterns to those from popularizations. Discusses a computational model of cohesion. Argues that readers of scientific articles must have a knowledge of lexical relations to see the implicit cohesion, whereas readers of popularizations must see the cohesive…
Practicing the Four Seasons of Ethnography Methodology while Searching for Identity in Mexico
ERIC Educational Resources Information Center
Pitts, Margaret Jane
2012-01-01
This narrative is an account of my field experiences and challenges practicing Gonzalez's (2000) Four Seasons of Ethnography methodology in Mexico City. I describe the complexities and tensions inherent in managing two scientific paradigms: Western scientific logic vs. a more organic ontology. The experiential knowledge produced in this text is…
Visualising abortion: emotion discourse and fetal imagery in a contemporary abortion debate.
Hopkins, Nick; Zeedyk, Suzanne; Raitt, Fiona
2005-07-01
This paper presents an analysis of a recent UK anti-abortion campaign in which the use of fetal imagery--especially images of fetal remains--was a prominent issue. A striking feature of the texts produced by the group behind the campaign was the emphasis given to the emotions of those viewing such imagery. Traditionally, social scientific analyses of mass communication have problematised references to emotion and viewed them as being of significance because of their power to subvert the rational appraisal of message content. However, we argue that emotion discourse may be analysed from a different perspective. As the categorisation of the fetus is a social choice and contested, it follows that all protagonists in the abortion debate (whether pro- or anti-abortion) are faced with the task of constructing the fetus as a particular entity rather than another, and that they must seek to portray their preferred categorisation as objective and driven by an 'out-there' reality. Following this logic, we show how the emotional experience of viewing fetal imagery was represented so as to ground an anti-abortion construction of the fetus as objective. We also show how the arguments of the (pro-abortion) opposition were construed as totally discrepant with such emotions and so were invalidated as deceitful distortions of reality. The wider significance of this analysis for social scientific analyses of the abortion debate is discussed.
Fulcher, Ben D; Jones, Nick S
2017-11-22
Phenotype measurements frequently take the form of time series, but we currently lack a systematic method for relating these complex data streams to scientifically meaningful outcomes, such as relating the movement dynamics of organisms to their genotype or measurements of brain dynamics of a patient to their disease diagnosis. Previous work addressed this problem by comparing implementations of thousands of diverse scientific time-series analysis methods in an approach termed highly comparative time-series analysis. Here, we introduce hctsa, a software tool for applying this methodological approach to data. hctsa includes an architecture for computing over 7,700 time-series features and a suite of analysis and visualization algorithms to automatically select useful and interpretable time-series features for a given application. Using exemplar applications to high-throughput phenotyping experiments, we show how hctsa allows researchers to leverage decades of time-series research to quantify and understand informative structure in time-series data.
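The general idea of turning a time series into a vector of named features can be sketched in Python as below. This is not the hctsa interface and reproduces none of its features by name: the three statistics and the function names are hypothetical stand-ins chosen only to show the shape of such a feature vector.

```python
import math


def lag1_autocorrelation(series):
    """Lag-1 autocorrelation, normalised by the total sum of squares."""
    mean = sum(series) / len(series)
    deviations = [x - mean for x in series]
    denominator = sum(d * d for d in deviations)
    if denominator == 0:
        return 0.0  # constant series: define the autocorrelation as zero
    numerator = sum(a * b for a, b in zip(deviations, deviations[1:]))
    return numerator / denominator


def feature_vector(series):
    """Map one time series to a small dictionary of illustrative features."""
    mean = sum(series) / len(series)
    variance = sum((x - mean) ** 2 for x in series) / len(series)
    return {
        "mean": mean,
        "std": math.sqrt(variance),
        "lag1_autocorr": lag1_autocorrelation(series),
    }


if __name__ == "__main__":
    print(feature_vector([1, 2, 3, 4, 5]))
```

With thousands of such features per series, the resulting matrix of series by features can be passed to standard feature selection and classification methods.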
Castaldi, S; Giacometti, M; Toigo, W; Bert, F; Siliquini, R
2015-09-29
In Public Health, a thorough review of abstract quality evaluations and the publication history of studies presented at scientific meetings has never been conducted. To analyse the long-term outcome of high-quality abstracts submitted to conferences of the Italian Society of Hygiene and Public Health (SItI) from 2005 to 2007, we conducted a second analysis of previously published material, aiming to estimate the full-text publication rate of high-quality abstracts presented at Italian public health meetings and to identify predictors of full-text publication. The search was undertaken through scientific databases and search engines and through the web sites of the major Italian journals of Public Health. For each publication confirmed as a full-text paper, the journal name, impact factor, year of publication, gender of the first author, type of study design, characteristics of the results, and sample size were collected. The overall publication rate of the abstracts presented was 23.5%; most of the papers were published in Public Health journals (average impact factor: 3.007). Non-university affiliation was associated with a lower probability of publication, while some of the conference topics, as well as presentation in poster form, were associated with an increased likelihood of publication. The method presented in this study provides a good framework for the evaluation of the scientific evidence. The findings should be taken into consideration by the Scientific Societies during the contribution selection phase, with the aim of achieving a continuous improvement of work quality. In the future, it would be interesting to survey the abstract authors to identify reasons for unpublished data.
JOVIAL/Ada Microprocessor Study.
1982-04-01
Study Final Technical Report interesting feature of the nodes is that they provide multiple virtual terminals, so it is possible to monitor several...Terminal Interface Tasking Exception Handling A more elaborate system could allow such features as spooling, background jobs or multiple users. To a large...Another editor feature is the buffer. Buffers may hold small amounts of text or entire text objects. They allow multiple files to be edited simultaneously
Sung, Yao-Ting; Chen, Ju-Ling; Cha, Ji-Her; Tseng, Hou-Chiang; Chang, Tao-Hsing; Chang, Kuo-En
2015-06-01
Multilevel linguistic features have been proposed for discourse analysis, but there have been few applications of multilevel linguistic features to readability models and also few validations of such models. Most traditional readability formulae are based on generalized linear models (GLMs; e.g., discriminant analysis and multiple regression), but these models have to comply with certain statistical assumptions about data properties and include all of the data in formulae construction without pruning the outliers in advance. The use of such readability formulae tends to produce a low text classification accuracy, while using a support vector machine (SVM) in machine learning can enhance the classification outcome. The present study constructed readability models by integrating multilevel linguistic features with SVM, which is more appropriate for text classification. Taking the Chinese language as an example, this study developed 31 linguistic features as the predicting variables at the word, semantic, syntax, and cohesion levels, with grade levels of texts as the criterion variable. The study compared four types of readability models by integrating unilevel and multilevel linguistic features with GLMs and an SVM. The results indicate that adopting a multilevel approach in readability analysis provides a better representation of the complexities of both texts and the reading comprehension process.
A Novel Feature Selection Technique for Text Classification Using Naïve Bayes.
Dey Sarkar, Subhajit; Goswami, Saptarsi; Agarwal, Aman; Aktar, Javed
2014-01-01
With the proliferation of unstructured data, text classification or text categorization has found many applications in topic classification, sentiment analysis, authorship identification, spam detection, and so on. There are many classification algorithms available. Naïve Bayes remains one of the oldest and most popular classifiers. Implementation of naïve Bayes is simple, and it also requires less training data. From the literature review, it is found that naïve Bayes performs poorly compared to other classifiers in text classification. As a result, this makes the naïve Bayes classifier unusable in spite of the simplicity and intuitiveness of the model. In this paper, we propose a two-step feature selection method based first on univariate feature selection and then on feature clustering, where we use the univariate feature selection method to reduce the search space and then apply clustering to select relatively independent feature sets. We demonstrate the effectiveness of our method by a thorough evaluation and comparison over 13 datasets. The performance improvement thus achieved makes naïve Bayes comparable or superior to other classifiers. The proposed algorithm is shown to outperform other traditional methods like greedy search based wrapper or CFS.
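The two-step structure (univariate filtering, then picking relatively independent features) can be sketched in Python as follows. This is a loose illustration, not the proposed algorithm: a greedy redundancy filter based on Jaccard similarity of document sets stands in for the paper's clustering step, the univariate scores are supplied by the caller, and all names, thresholds, and data are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity between two sets of document ids."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def two_step_select(term_docs, scores, top_k, max_similarity):
    """Select terms in two steps.

    Step 1: keep the top_k terms by a univariate score (given in `scores`).
    Step 2: walk the ranked terms and keep a term only if its document set
            is not too similar to that of any term already kept. This greedy
            filter is a stand-in for clustering, used here for brevity.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)[:top_k]
    selected = []
    for term in ranked:
        if all(jaccard(term_docs[term], term_docs[kept]) <= max_similarity
               for kept in selected):
            selected.append(term)
    return selected


if __name__ == "__main__":
    # Toy data: which documents contain each term, plus made-up scores.
    term_docs = {
        "cheap": {0, 1, 2},
        "discount": {0, 1, 2},
        "meeting": {3, 4},
        "the": {0, 1, 2, 3, 4},
    }
    scores = {"cheap": 0.9, "discount": 0.8, "meeting": 0.7, "the": 0.1}
    print(two_step_select(term_docs, scores, top_k=3, max_similarity=0.5))
```

Reducing redundancy matters for naïve Bayes in particular because the model assumes features are conditionally independent given the class.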
Liakata, Maria; Saha, Shyamasree; Dobnik, Simon; Batchelor, Colin; Rebholz-Schuhmann, Dietrich
2012-04-01
Scholarly biomedical publications report on the findings of a research investigation. Scientists use a well-established discourse structure to relate their work to the state of the art, express their own motivation and hypotheses and report on their methods, results and conclusions. In previous work, we have proposed ways to explicitly annotate the structure of scientific investigations in scholarly publications. Here we present the means to facilitate automatic access to the scientific discourse of articles by automating the recognition of 11 categories at the sentence level, which we call Core Scientific Concepts (CoreSCs). These include: Hypothesis, Motivation, Goal, Object, Background, Method, Experiment, Model, Observation, Result and Conclusion. CoreSCs provide the structure and context to all statements and relations within an article and their automatic recognition can greatly facilitate biomedical information extraction by characterizing the different types of facts, hypotheses and evidence available in a scientific publication. We have trained and compared machine learning classifiers (support vector machines and conditional random fields) on a corpus of 265 full articles in biochemistry and chemistry to automatically recognize CoreSCs. We have evaluated our automatic classifications against a manually annotated gold standard, and have achieved promising accuracies with 'Experiment', 'Background' and 'Model' being the categories with the highest F1-scores (76%, 62% and 53%, respectively). We have analysed the task of CoreSC annotation both from a sentence classification as well as sequence labelling perspective and we present a detailed feature evaluation. The most discriminative features are local sentence features such as unigrams, bigrams and grammatical dependencies while features encoding the document structure, such as section headings, also play an important role for some of the categories. 
We discuss the usefulness of automatically generated CoreSCs in two biomedical applications as well as work in progress. A web-based tool for the automatic annotation of articles with CoreSCs and corresponding documentation is available online at http://www.sapientaproject.com/software. The site http://www.sapientaproject.com also contains detailed information pertaining to CoreSC annotation and links to annotation guidelines as well as a corpus of manually annotated articles, which served as our training data. Contact: liakata@ebi.ac.uk. Supplementary data are available at Bioinformatics online.
NASA Astrophysics Data System (ADS)
Federer, Meghan Rector
Assessment is a key element in the process of science education teaching and research. Understanding sources of performance bias in science assessment is a major challenge for science education reforms. Prior research has documented several limitations of instrument types on the measurement of students' scientific knowledge (Liu et al., 2011; Messick, 1995; Popham, 2010). Furthermore, a large body of work has been devoted to reducing assessment biases that distort inferences about students' science understanding, particularly in multiple-choice [MC] instruments. Despite the above documented biases, much has yet to be determined for constructed response [CR] assessments in biology and their use for evaluating students' conceptual understanding of scientific practices (such as explanation). Understanding differences in science achievement provides important insights into whether science curricula and/or assessments are valid representations of student abilities. Using the integrative framework put forth by the National Research Council (2012), this dissertation aimed to explore whether assessment biases occur for assessment practices intended to measure students' conceptual understanding and proficiency in scientific practices. Using a large corpus of undergraduate biology students' explanations, three studies were conducted to examine whether known biases of MC instruments were also apparent in a CR instrument designed to assess students' explanatory practice and understanding of evolutionary change (ACORNS: Assessment of COntextual Reasoning about Natural Selection). The first study investigated the challenge of interpreting and scoring lexically ambiguous language in CR answers. The incorporation of 'multivalent' terms into scientific discourse practices often results in statements or explanations that are difficult to interpret and can produce faulty inferences about student knowledge. 
The results of this study indicate that many undergraduate biology majors frequently incorporate multivalent concepts into explanations of change, resulting in explanatory practices that were scientifically non-normative. However, use of follow-up question approaches was found to resolve this source of bias and thereby increase the validity of inferences about student understanding. The second study focused on issues of item and instrument structure, specifically item feature effects and item position effects, which have been shown to influence measures of student performance across assessment tasks. Results indicated that, along the instrument item sequence, items with similar surface features produced greater sequencing effects than sequences of items with dissimilar surface features. This bias could be addressed by use of a counterbalanced design (i.e., Latin Square) at the population level of analysis. Explanation scores were also highly correlated with student verbosity, despite verbosity being an intrinsically trivial aspect of explanation quality. Attempting to standardize student response length was one proposed solution to the verbosity bias. The third study explored gender differences in students' performance on constructed-response explanation tasks using impact (i.e., mean raw scores) and differential item function (i.e., item difficulties) patterns. While prior research in science education has suggested that females tend to perform better on constructed-response items, the results of this study revealed no overall differences in gender achievement. However, evaluation of specific item features patterns suggested that female respondents have a slight advantage on unfamiliar explanation tasks. That is, male students tended to incorporate fewer scientifically normative concepts (i.e., key concepts) than females for unfamiliar taxa. 
Conversely, females tended to incorporate more scientifically non-normative ideas (i.e., naive ideas) than males for familiar taxa. Together these results indicate that gender achievement differences for this CR instrument may be a result of differences in how males and females interpret and respond to combinations of item features. Overall, the results presented in the subsequent chapters suggest that as science education shifts toward the evaluation of fused scientific knowledge and practice (e.g., explanation), it is essential that educators and researchers investigate potential sources of bias inherent to specific assessment practices. This dissertation revealed significant sources of CR assessment bias, and provided solutions to address these problems.
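The counterbalanced (Latin square) design mentioned in this abstract can be illustrated with a short sketch. It assumes a simple cyclic Latin square, which balances the position of each item across orderings but does not balance which item follows which. The item labels and function names are hypothetical and are not taken from the ACORNS instrument.

```python
def latin_square_orders(items):
    """Return len(items) orderings in which every item occupies every position exactly once."""
    n = len(items)
    return [[items[(row + col) % n] for col in range(n)] for row in range(n)]

def assign_order(orders, participant_index):
    """Cycle through the orderings so that they are used equally often across participants."""
    return orders[participant_index % len(orders)]

# Four hypothetical explanation items.
orders = latin_square_orders(["A", "B", "C", "D"])
for row in orders:
    print(row)
```

Averaging scores over participants assigned in this way spreads any item position effect evenly across items, which is why the correction works at the population level rather than for an individual respondent.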
[Aged woman's vulnerability related to AIDS].
Silva, Carla Marins; Lopes, Fernanda Maria do Valle Martins; Vargens, Octavio Muniz da Costa
2010-09-01
This article is a systematic literature review covering the period from 1994 to 2009, whose objective was to discuss the aged woman's vulnerability in relation to Acquired Immunodeficiency Syndrome (Aids). The search for scientific texts was carried out in the following databases: Biblioteca Virtual em Saúde, Scientific Electronic Library Online (SciELO), Literatura Latino-Americana e do Caribe em Ciências da Saúde (LILACS) and Medical Literature Analysis and Retrieval System Online (MEDLINE). The descriptors used were vulnerability, woman and Aids. Eighteen texts were analyzed, including articles in scientific journals, theses and dissertations. In conclusion, aged women and vulnerability to Aids were found to be directly related through gender characteristics, including submission, that were built historically and socially. We consider fundamental the development of studies that may generate publications accessible to women, in order to help them see themselves as persons vulnerable to Aids contagion just for being women.
Performance and Scalability of the NAS Parallel Benchmarks in Java
NASA Technical Reports Server (NTRS)
Frumkin, Michael A.; Schultz, Matthew; Jin, Haoqiang; Yan, Jerry; Biegel, Bryan A. (Technical Monitor)
2002-01-01
Several features make Java an attractive choice for scientific applications. In order to gauge the applicability of Java to Computational Fluid Dynamics (CFD), we have implemented the NAS (NASA Advanced Supercomputing) Parallel Benchmarks in Java. The performance and scalability of the benchmarks point out the areas where improvement in Java compiler technology and in Java thread implementation would position Java closer to Fortran in the competition for scientific applications.
The Poster features the news, local events, and people of the scientific, administrative, and support communities at NCI at Frederick, Frederick, Maryland. It is published by Scientific Publications, Graphics & Media, Leidos Biomedical Research, for NCI at Frederick. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. government.
Darwinism and positivism as methodological influences on the development of psychology.
Mackenzie, B
1976-10-01
The methodological significance of evolutionary theory for psychology may be distinguished from its substantive or theoretical significance. The methodological significance was that evolutionary theory broadened the current conceptions of scientific method and rendered them relatively independent of physics. It thereby made the application of the "scientific method" to psychology much more feasible than it had been previously, and thus established the possibility of a wide-ranging scientific psychology for the first time. The methodological eclecticism that made scientific psychology possible did not, however, remain a feature of psychology for very long. Psychology's methodology rapidly became restricted and codified through the influence of, and in imitation of, the rigorously positivistic orientation of physics around the turn of the twentieth century.
Yi, Chucai; Tian, Yingli
2012-09-01
In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.
In Search of Commonalities: Some Linguistic and Rhetorical Features of Business Reports as a Genre
ERIC Educational Resources Information Center
Yeung, Lorrita
2007-01-01
The present study analyzes 22 authentic business reports in an attempt to identify textual features that are typical of business reports as a genre. The analysis shows that there are certain characteristics which distinguish business reports from other related genres such as scientific reports, of which RAs are a typical example. There are also…
Linguistic positivity in historical texts reflects dynamic environmental and psychological factors.
Iliev, Rumen; Hoover, Joe; Dehghani, Morteza; Axelrod, Robert
2016-12-06
People use more positive words than negative words. Referred to as "linguistic positivity bias" (LPB), this effect has been found across cultures and languages, prompting the conclusion that it is a panhuman tendency. However, although multiple competing explanations of LPB have been proposed, there is still no consensus on what mechanism(s) generate LPB or even on whether it is driven primarily by universal cognitive features or by environmental factors. In this work we propose that LPB has remained unresolved because previous research has neglected an essential dimension of language: time. In four studies conducted with two independent, time-stamped text corpora (Google books Ngrams and the New York Times), we found that LPB in American English has decreased during the last two centuries. We also observed dynamic fluctuations in LPB that were predicted by changes in objective environment, i.e., war and economic hardships, and by changes in national subjective happiness. In addition to providing evidence that LPB is a dynamic phenomenon, these results suggest that cognitive mechanisms alone cannot account for the observed dynamic fluctuations in LPB. At the least, LPB likely arises from multiple interacting mechanisms involving subjective, objective, and societal factors. In addition to having theoretical significance, our results demonstrate the value of newly available data sources in addressing long-standing scientific questions. PMID:27872286
Prioritizing Scientific Data for Transmission
NASA Technical Reports Server (NTRS)
Castano, Rebecca; Anderson, Robert; Estlin, Tara; DeCoste, Dennis; Gaines, Daniel; Mazzoni, Dominic; Fisher, Forest; Judd, Michele
2004-01-01
A software system has been developed for prioritizing newly acquired geological data onboard a planetary rover. The system has been designed to enable efficient use of limited communication resources by transmitting the data likely to have the most scientific value. This software operates onboard a rover by analyzing collected data, identifying potential scientific targets, and then using that information to prioritize data for transmission to Earth. Currently, the system is focused on the analysis of acquired images, although the general techniques are applicable to a wide range of data modalities. Image prioritization is performed using two main steps. In the first step, the software detects features of interest from each image. In its current application, the system is focused on visual properties of rocks. Thus, rocks are located in each image and rock properties, such as shape, texture, and albedo, are extracted from the identified rocks. In the second step, the features extracted from a group of images are used to prioritize the images using three different methods: (1) identification of key target signature (finding specific rock features the scientist has identified as important), (2) novelty detection (finding rocks we haven't seen before), and (3) representative rock sampling (finding the most average sample of each rock type). These methods use techniques such as K-means unsupervised clustering and a discrimination-based kernel classifier to rank images based on their interest level.
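The novelty-detection step described above can be sketched as a ranking problem. The abstract names K-means clustering and a kernel classifier but gives no details, so this sketch swaps in a plain nearest-neighbour distance as the novelty score. The feature triples (shape, texture, albedo), image identifiers, and function names are hypothetical.

```python
import math

def novelty(features, seen):
    """Distance from `features` to its nearest neighbour in `seen` (infinite if nothing was seen)."""
    if not seen:
        return math.inf
    return min(math.dist(features, s) for s in seen)

def rank_by_novelty(candidates, seen):
    """candidates: dict of image id -> feature tuple. Returns ids, most novel first."""
    return sorted(
        candidates,
        key=lambda image_id: (-novelty(candidates[image_id], seen), image_id),
    )

# Hypothetical (shape, texture, albedo) features of rocks already transmitted.
seen_rocks = [(0.20, 0.30, 0.10), (0.25, 0.35, 0.15)]
new_images = {
    "img_a": (0.21, 0.31, 0.11),   # very close to a rock already seen
    "img_b": (0.90, 0.80, 0.70),   # unlike anything seen so far
    "img_c": (0.40, 0.30, 0.20),
}
print(rank_by_novelty(new_images, seen_rocks))
```

A rover with limited downlink would then transmit images from the front of this list first. In practice the features would need to be scaled to comparable ranges before distances are meaningful.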
Unsupervised feature learning for autonomous rock image classification
NASA Astrophysics Data System (ADS)
Shu, Lei; McIsaac, Kenneth; Osinski, Gordon R.; Francis, Raymond
2017-09-01
Autonomous rock image classification can enhance the capability of robots for geological detection and enlarge the scientific returns, both in investigation on Earth and planetary surface exploration on Mars. Since rock textural images are usually inhomogeneous and manually hand-crafting features is not always reliable, we propose an unsupervised feature learning method to autonomously learn the feature representation for rock images. In our tests, rock image classification using the learned features shows that the learned features can outperform manually selected features. Self-taught learning is also proposed to learn the feature representation from a large database of unlabelled rock images of mixed class. The learned features can then be used repeatedly for classification of any subclass. This takes advantage of the large dataset of unlabelled rock images and learns a general feature representation for many kinds of rocks. We show experimental results supporting the feasibility of self-taught learning on rock images.
ERIC Educational Resources Information Center
Hall, Sophie S.; Kowalski, Rebecca; Paterson, Kevin B.; Basran, Jaskaran; Filik, Ruth; Maltby, John
2015-01-01
In response to concerns about the need to improve the scientific skills of school children, this study investigated the influence of text design (in terms of text cohesion) and individual differences, with the aim of identifying pathways to improving science education in early secondary school (Key Stage 3). One hundred and four secondary school…
Supporting Scientific Analysis within Collaborative Problem Solving Environments
NASA Technical Reports Server (NTRS)
Watson, Velvin R.; Kwak, Dochan (Technical Monitor)
2000-01-01
Collaborative problem solving environments for scientists should contain the analysis tools the scientists require in addition to the remote collaboration tools used for general communication. Unfortunately, most scientific analysis tools have been designed for a "stand-alone mode" and cannot be easily modified to work well in a collaborative environment. This paper addresses the questions, "What features are desired in a scientific analysis tool contained within a collaborative environment?", "What are the tool design criteria needed to provide these features?", and "What support is required from the architecture to support these design criteria?" First, the features of scientific analysis tools that are important for effective analysis in collaborative environments are listed. Next, several design criteria for developing analysis tools that will provide these features are presented. Then requirements for the architecture to support these design criteria are listed. Some proposed architectures for collaborative problem solving environments are reviewed and their capabilities to support the specified design criteria are discussed. A deficiency in the most popular architecture for remote application sharing, the ITU T.120 architecture, prevents it from supporting highly interactive, dynamic, high resolution graphics. To illustrate that the specified design criteria can provide a highly effective analysis tool within a collaborative problem solving environment, a scientific analysis tool that contains the specified design criteria has been integrated into a collaborative environment and tested for effectiveness. The tests were conducted in collaborations between remote sites in the US and between remote sites on different continents. The tests showed that the tool (a tool for the visual analysis of computer simulations of physics) was highly effective for both synchronous and asynchronous collaborative analyses. 
The important features provided by the tool (and made possible by the specified design criteria) are: 1. The tool provides highly interactive, dynamic, high resolution, 3D graphics. 2. All remote scientists can view the same dynamic, high resolution, 3D scenes of the analysis as the analysis is being conducted. 3. The responsiveness of the tool is nearly identical to the responsiveness of the tool in a stand-alone mode. 4. The scientists can transfer control of the analysis between themselves. 5. Any analysis session or segment of an analysis session, whether done individually or collaboratively, can be recorded and posted on the Web for other scientists or students to download and play in either a collaborative or individual mode. 6. The scientist or student who downloaded the session can, individually or collaboratively, modify or extend the session with his/her own "what if" analysis of the data and post his/her version of the analysis back onto the Web. 7. The peak network bandwidth used in the collaborative sessions is only 1K bit/second even though the scientists at all sites are viewing high resolution (1280 x 1024 pixels), dynamic, 3D scenes of the analysis. The links between the specified design criteria and these performance features are presented.
Modeling Guru: Knowledge Base for NASA Modelers
NASA Astrophysics Data System (ADS)
Seablom, M. S.; Wojcik, G. S.; van Aartsen, B. H.
2009-05-01
Modeling Guru is an on-line knowledge-sharing resource for anyone involved with or interested in NASA's scientific models or High End Computing (HEC) systems. Developed and maintained by NASA's Software Integration and Visualization Office (SIVO) and the NASA Center for Computational Sciences (NCCS), Modeling Guru's combined forums and knowledge base for research and collaboration is becoming a repository for the accumulated expertise of NASA's scientific modeling and HEC communities. All NASA modelers and associates are encouraged to participate and provide knowledge about the models and systems so that other users may benefit from their experience. Modeling Guru is divided into a hierarchy of communities, each with its own set of forums and knowledge base documents. Current modeling communities include those for space science, land and atmospheric dynamics, atmospheric chemistry, and oceanography. In addition, there are communities focused on NCCS systems, HEC tools and libraries, and programming and scripting languages. Anyone may view most of the content on Modeling Guru (available at http://modelingguru.nasa.gov/), but you must log in to post messages and subscribe to community postings. The site offers a full range of "Web 2.0" features, including discussion forums, "wiki" document generation, document uploading, RSS feeds, search tools, blogs, email notification, and "breadcrumb" links. A discussion (a.k.a. forum "thread") is used to post comments, solicit feedback, or ask questions. If a thread is marked as a question, SIVO will monitor it and normally respond within a day. Discussions can include embedded images, tables, and formatting through the use of the Rich Text Editor. Also, users can add "Tags" to their threads to facilitate later searches. The "knowledge base" consists of documents that are used to capture and share expertise with others. 
The default "wiki" document lets users edit within the browser so others can easily collaborate on the same document, even allowing the author to select those who may edit and approve the document. To maintain knowledge integrity, all documents are moderated before they are visible to the public. Modeling Guru, running on Clearspace by Jive Software, has been an active resource to the NASA modeling and HEC communities for more than a year and currently has more than 100 active users. SIVO will soon install live instant messaging support, as well as a user-customizable homepage with social-networking features. In addition, SIVO plans to implement a large dataset/file storage capability so that users can quickly and easily exchange datasets and files with one another. Continued active community participation combined with periodic software updates and improved features will ensure that Modeling Guru remains a vibrant, effective, easy-to-use tool for the NASA scientific community.
Full Text Journal Subscriptions: An Evolutionary Process.
ERIC Educational Resources Information Center
Luther, Judy
1997-01-01
Provides an overview of companies offering Web accessible subscriptions to full text electronic versions of scientific, technical, and medical journals (Academic Press, Blackwell, EBSCO, Elsevier, Highwire Press, Information Quest, Institute of Physics, Johns Hopkins University Press, OCLC, OVID, Springer, and SWETS). Also lists guidelines for…
ERIC Educational Resources Information Center
Jian, Yu-Cin; Wu, Chao-Jung
2015-01-01
We investigated strategies used by readers when reading a science article with a diagram and assessed whether semantic and spatial representations were constructed while reading the diagram. Seventy-one undergraduate participants read a scientific article while tracking their eye movements and then completed a reading comprehension test. Our…
Religion and Rationality: Quaker Women and Science Education 1790-1850
ERIC Educational Resources Information Center
Leach, Camilla
2006-01-01
This article examines the work of two Quaker women, Priscilla Wakefield (1750-1832) and Maria Hack (1777-1844) as popularizers of science and in the context of the development of scientific literacy. Both women were writers who specialized in scientific educational texts for children and young adults. As Quakers their community and culture played…
Assessing Toxic Risk. Teacher's Guide [and] Student Edition. Cornell Scientific Inquiry Series.
ERIC Educational Resources Information Center
Trautmann, Nancy M.; Carlsen, William S.; Krasny, Marianne E.; Cunningham, Christine M.
The teacher's guide of "Assessing Toxic Risk" aims to help students conduct scientific research on relevant environmental topics. Using the research protocols in this book, students learn to carry out experiments known as bioassays. In this way, the toxicity of substances is evaluated by measuring their effects on living things. The text is…
"To Learn about Science": Real Life Scientific Literacy across Multicultural Communities
ERIC Educational Resources Information Center
Briseño-Garzón, Adriana; Perry, Kristen H.; Purcell-Gates, Victoria
2014-01-01
Much of the current research on scientific literacy focuses on particular text genres read by students within the classroom context. We offer a cross-case analysis of literacy as social practice in multicultural communities around the world, through which we reveal that individuals with no formal education, as well as people with varied levels of…
ERIC Educational Resources Information Center
Scharrer, Lisa; Bromme, Rainer; Britt, M. Anne; Stadtler, Marc
2012-01-01
The present research investigated whether laypeople are inclined to rely on their own evaluations of the acceptability of scientific claims despite their knowledge limitations. Specifically, we tested whether laypeople are more prone to discount their actual dependence on expert knowledge when they are presented with simplified science texts. In…
Code of Federal Regulations, 2010 CFR
2010-01-01
... include, but are not limited to, press releases, media advisories, news features, and Web postings. Not included under this definition are scientific and technical reports, Web postings designed for technical or...
Short text sentiment classification based on feature extension and ensemble classifier
NASA Astrophysics Data System (ADS)
Liu, Yang; Zhu, Xie
2018-05-01
With the rapid development of Internet social media, mining the emotional tendencies of short texts on the Internet to acquire useful information has attracted the attention of researchers. At present, the commonly used approaches can be grouped into rule-based classification and statistical machine learning classification methods. Although micro-blog sentiment analysis has made good progress, shortcomings remain, such as insufficient accuracy and a strong dependence on the sentiment classification effect. Aiming at the characteristics of Chinese short texts, such as limited information, sparse features, and diverse expressions, this paper considers expanding the original text by mining related semantic information from reviews, forwards, and other related information. First, this paper uses Word2vec to compute word similarity in order to extend the feature words. It then uses an ensemble classifier composed of SVM, KNN and HMM to analyze the emotion of micro-blog short texts. The experimental results show that the proposed method can make good use of the comment and forwarding information to extend the original features. Compared with the traditional method, the accuracy, recall and F1 value obtained by this method are improved.
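The two ingredients named in this abstract, feature extension by word similarity and an ensemble decision, can be sketched as follows. The abstract does not say how the SVM, KNN and HMM outputs are combined, so simple majority voting is assumed here. The two-dimensional word vectors are made-up stand-ins for Word2vec embeddings, and all function names are hypothetical.

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity of two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def extend_features(words, vectors, threshold):
    """Append every vocabulary word whose similarity to some input word is at least `threshold`."""
    extended = list(words)
    for word in words:
        if word not in vectors:
            continue
        for candidate, vec in vectors.items():
            if candidate not in extended and cosine(vectors[word], vec) >= threshold:
                extended.append(candidate)
    return extended

def majority_vote(predictions):
    """Return the most frequent label; ties go to the label that appeared first."""
    return Counter(predictions).most_common(1)[0][0]

# Made-up embeddings standing in for trained Word2vec vectors.
toy_vectors = {
    "good": (1.0, 0.1),
    "great": (0.9, 0.2),
    "bad": (-1.0, 0.1),
    "awful": (-0.9, 0.3),
}
print(extend_features(["good"], toy_vectors, threshold=0.9))
print(majority_vote(["pos", "neg", "pos"]))   # e.g. outputs of three base classifiers
```

Extending a sparse short text with near-synonyms gives each base classifier more evidence to work with, and voting reduces the impact of any single classifier's errors.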
NASA Astrophysics Data System (ADS)
Li, Ji; Ren, Fuji
Weblogs have greatly changed the ways in which people communicate. Affective analysis of blog posts is valuable for many applications such as text-to-speech synthesis or computer-assisted recommendation. Traditional emotion recognition in text based on single-label classification cannot satisfy the higher requirements of affective computing. In this paper, the automatic identification of sentence emotion in weblogs is modeled as a multi-label text categorization task. Experiments are carried out on 12273 blog sentences from the Chinese emotion corpus Ren_CECps with 8-dimension emotion annotation. An ensemble algorithm, RAKEL, is used to recognize dominant emotions from the writer's perspective. Our emotion feature, which uses a detailed intensity representation for word emotions, outperforms the other main features such as the word frequency feature and the traditional lexicon-based feature. In order to deal with relatively complex sentences, we integrate grammatical characteristics of punctuation, disjunctive connectives, modification relations and negation into the features. This achieves 13.51% and 12.49% increases in micro-averaged F1 and macro-averaged F1, respectively, compared to the traditional lexicon-based feature. The results show that multiple-dimension emotion representation with grammatical features can efficiently classify sentence emotion in a multi-label problem.
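Because the abstract reports both micro-averaged and macro-averaged F1, a small worked sketch of the difference may help. It uses the standard definitions for multi-label output: micro-averaging pools true positives, false positives and false negatives over all labels, while macro-averaging takes the unweighted mean of per-label F1. The emotion labels and predictions below are invented for illustration and are not drawn from Ren_CECps.

```python
def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when there is nothing to count."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def micro_macro_f1(gold, predicted, labels):
    """gold, predicted: lists of label sets, one set per sentence."""
    counts = {label: [0, 0, 0] for label in labels}   # tp, fp, fn per label
    for g, p in zip(gold, predicted):
        for label in labels:
            if label in p and label in g:
                counts[label][0] += 1
            elif label in p:
                counts[label][1] += 1
            elif label in g:
                counts[label][2] += 1
    macro = sum(f1(*c) for c in counts.values()) / len(labels)
    tp, fp, fn = (sum(c[i] for c in counts.values()) for i in range(3))
    return f1(tp, fp, fn), macro

# Invented example: four sentences, three emotion labels.
toy_labels = ["joy", "anger", "sorrow"]
toy_gold = [{"joy"}, {"joy", "sorrow"}, {"anger"}, {"joy"}]
toy_pred = [{"joy"}, {"joy"}, {"joy"}, {"joy", "anger"}]
micro, macro = micro_macro_f1(toy_gold, toy_pred, toy_labels)
print(round(micro, 4), round(macro, 4))
```

Here the frequent label is predicted well and the rare labels are missed, so the micro average stays much higher than the macro average. Reporting both, as the abstract does, shows whether a gain comes from common or rare emotions.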
2017-11-01
Reports an error in "Replicability and other features of a high-quality science: Toward a balanced and empirical approach" by Eli J. Finkel, Paul W. Eastwick and Harry T. Reis (Journal of Personality and Social Psychology, 2017[Aug], Vol 113[2], 244-253). In the commentary, there was an error in the References list. The publishing year for the 18th article was cited incorrectly as 2016. The in-text acronym associated with this citation should read instead as LCL2017. The correct References list citation should read as follows: LeBel, E. P., Campbell, L., & Loving, T. J. (2017). Benefits of open and high-powered research outweigh costs. Journal of Personality and Social Psychology, 113, 230-243. http://dx.doi.org/10.1037/pspi0000049. The online version of this article has been corrected. (The following abstract of the original article appeared in record 2017-30567-002.) Finkel, Eastwick, and Reis (2015; FER2015) argued that psychological science is better served by responding to apprehensions about replicability rates with contextualized solutions than with one-size-fits-all solutions. Here, we extend FER2015's analysis to suggest that much of the discussion of best research practices since 2011 has focused on a single feature of high-quality science (replicability) with insufficient sensitivity to the implications of recommended practices for other features, like discovery, internal validity, external validity, construct validity, consequentiality, and cumulativeness. Thus, although recommendations for bolstering replicability have been innovative, compelling, and abundant, it is difficult to evaluate their impact on our science as a whole, especially because many research practices that are beneficial for some features of scientific quality are harmful for others.
For example, FER2015 argued that bigger samples are generally better, but also noted that very large samples ("those larger than required for effect sizes to stabilize"; p. 291) could have the downside of commandeering resources that would have been better invested in other studies. In their critique of FER2015, LeBel, Campbell, and Loving (2016) concluded, based on simulated data, that ever-larger samples are better for the efficiency of scientific discovery (i.e., that there are no tradeoffs). As demonstrated here, however, this conclusion holds only when the replicator's resources are considered in isolation. If we widen the assumptions to include the original researcher's resources as well, which is necessary if the goal is to consider resource investment for the field as a whole, the conclusion changes radically and strongly supports a tradeoff-based analysis. In general, as psychologists seek to strengthen our science, we must complement our much-needed work on increasing replicability with careful attention to the other features of a high-quality science. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
PyEEG: An Open Source Python Module for EEG/MEG Feature Extraction
Bao, Forrest Sheng; Liu, Xin; Zhang, Christina
2011-01-01
Computer-aided diagnosis of neural diseases from EEG signals (or other physiological signals that can be treated as time series, e.g., MEG) is an emerging field that has gained much attention in past years. Extracting features is a key component in the analysis of EEG signals. In our previous works, we have implemented many EEG feature extraction functions in the Python programming language. As Python is gaining more ground in scientific computing, an open source Python module for extracting EEG features has the potential to save much time for computational neuroscientists. In this paper, we introduce PyEEG, an open source Python module for EEG feature extraction. PMID:21512582
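As an illustration of what time-series feature extraction for EEG can look like, the sketch below computes the Hjorth mobility and complexity of a signal using only the Python standard library. It is an independent, simplified example; the function names are hypothetical and it should not be read as PyEEG's own API or implementation.

```python
import math
from statistics import pvariance
from typing import List, Sequence


def first_difference(signal: Sequence[float]) -> List[float]:
    """Discrete derivative: differences between consecutive samples."""
    return [b - a for a, b in zip(signal, signal[1:])]


def hjorth_mobility(signal: Sequence[float]) -> float:
    """sqrt(var(dx) / var(x)); a rough measure of the mean frequency."""
    return math.sqrt(pvariance(first_difference(signal)) / pvariance(signal))


def hjorth_complexity(signal: Sequence[float]) -> float:
    """Mobility of the derivative divided by mobility of the signal."""
    return hjorth_mobility(first_difference(signal)) / hjorth_mobility(signal)


if __name__ == "__main__":
    # Toy signal standing in for one EEG channel (assumed, not real data).
    x = [0, 1, 0, -1, 0, 1, 0, -1]
    print(hjorth_mobility(x), hjorth_complexity(x))
```

A constant signal has zero variance and would raise a division error here; a production routine would need to guard against that case.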
Escape Excel: A tool for preventing gene symbol and accession conversion errors
Stewart, Paul A.; Kuenzi, Brent M.; Eschrich, James A.
2017-01-01
Background Microsoft Excel automatically converts certain gene symbols, database accessions, and other alphanumeric text into dates, scientific notation, and other numerical representations. These conversions lead to subsequent, irreversible, corruption of the imported text. A recent survey of popular genomic literature estimates that one-fifth of all papers with supplementary gene lists suffer from this issue. Results Here, we present an open-source tool, Escape Excel, which prevents these erroneous conversions by generating an escaped text file that can be safely imported into Excel. Escape Excel is implemented in a variety of formats (http://www.github.com/pstew/escape_excel), including a command line based Perl script, a Windows-only Excel Add-In, an OS X drag-and-drop application, a simple web-server, and as a Galaxy web environment interface. Test server implementations are accessible as a Galaxy interface (http://apostl.moffitt.org) and simple non-Galaxy web server (http://apostl.moffitt.org:8000/). Conclusions Escape Excel detects and escapes a wide variety of problematic text strings so that they are not erroneously converted into other representations upon importation into Excel. Examples of problematic strings include date-like strings, time-like strings, leading zeroes in front of numbers, and long numeric and alphanumeric identifiers that should not be automatically converted into scientific notation. It is hoped that greater awareness of these potential data corruption issues, together with diligent escaping of text files prior to importation into Excel, will help to reduce the amount of Excel-corrupted data in scientific analyses and publications. PMID:28953918
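The general idea of escaping risky strings before import can be sketched as follows. This is a minimal illustration, not the Escape Excel implementation (which the abstract describes as a Perl script among other formats): the regular expressions and the `="..."` wrapping are assumptions chosen for the example, and the real tool's detection rules and escape format may differ.

```python
import re

# Hypothetical patterns for strings a spreadsheet might silently convert.
_MONTHS = r"(?:JAN|FEB|MAR|MARCH|APR|MAY|JUN|JUL|AUG|SEP|SEPT|OCT|NOV|DEC)"
_RISKY_PATTERNS = [
    re.compile(rf"^{_MONTHS}[-/ ]?\d{{1,2}}$", re.IGNORECASE),  # SEPT2, MARCH1, DEC-1
    re.compile(r"^\d+E\d+$", re.IGNORECASE),  # 2310009E13 reads as scientific notation
    re.compile(r"^0\d+$"),  # leading zeros would be dropped
    re.compile(r"^\d{12,}$"),  # long numeric identifiers lose precision
]


def needs_escape(value: str) -> bool:
    """True if the cell matches one of the illustrative risky patterns."""
    return any(pattern.match(value) for pattern in _RISKY_PATTERNS)


def escape_cell(value: str) -> str:
    """Wrap a risky cell as a text formula so it is kept as a literal string."""
    if needs_escape(value):
        return '="' + value.replace('"', '""') + '"'
    return value


def escape_tsv_line(line: str) -> str:
    """Escape every tab-separated field of one line."""
    return "\t".join(escape_cell(cell) for cell in line.split("\t"))


if __name__ == "__main__":
    print(escape_tsv_line("MARCH1\t00123\tBRCA1"))
```

Escaping only the matching cells leaves ordinary numeric columns untouched, so they still behave as numbers after import.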
Li, Guang-Qing; Liu, Zi; Shen, Hong-Bin; Yu, Dong-Jun
2016-10-01
As one of the most ubiquitous post-transcriptional modifications of RNA, N6-methyladenosine (m6A) plays an essential role in many vital biological processes. The identification of m6A sites in RNAs is significantly important for both basic biomedical research and practical drug development. In this study, we designed a computational method, called TargetM6A, to rapidly and accurately target m6A sites solely from the primary RNA sequences. Two new features, i.e., position-specific nucleotide/dinucleotide propensities (PSNP/PSDP), are introduced and combined with the traditional nucleotide composition (NC) feature to formulate RNA sequences. The extracted features are further optimized to obtain a much more compact and discriminative feature subset by applying an incremental feature selection (IFS) procedure. Based on the optimized feature subset, we trained TargetM6A on the training dataset with a support vector machine (SVM) as the prediction engine. We compared the proposed TargetM6A method with existing methods for predicting m6A sites by performing stringent jackknife tests and independent validation tests on benchmark datasets. The experimental results show that the proposed TargetM6A method outperformed the existing methods for predicting m6A sites and remarkably improved the prediction performances, with MCC = 0.526 and AUC = 0.818. We also provided a user-friendly web server for TargetM6A, which is publicly accessible for academic use at http://csbio.njust.edu.cn/bioinf/TargetM6A.
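The two kinds of sequence features mentioned here can be sketched in a few lines. The nucleotide composition is simply the frequency of each base; the position-specific propensity below is a simplified assumption (frequency in positive samples minus frequency in negative samples at each position) and may not match the exact PSNP definition used by TargetM6A. All names are hypothetical.

```python
from typing import Dict, List, Sequence

ALPHABET = "ACGU"


def nucleotide_composition(seq: str) -> List[float]:
    """Fraction of A, C, G and U in the sequence, in that order."""
    return [seq.count(base) / len(seq) for base in ALPHABET]


def position_frequencies(seqs: Sequence[str]) -> List[Dict[str, float]]:
    """Per-position base frequencies over equal-length sequences."""
    table = []
    for i in range(len(seqs[0])):
        column = [s[i] for s in seqs]
        table.append({base: column.count(base) / len(seqs) for base in ALPHABET})
    return table


def propensity_table(
    positives: Sequence[str], negatives: Sequence[str]
) -> List[Dict[str, float]]:
    """Assumed propensity: positive-set frequency minus negative-set frequency."""
    pos = position_frequencies(positives)
    neg = position_frequencies(negatives)
    return [
        {base: pos[i][base] - neg[i][base] for base in ALPHABET}
        for i in range(len(pos))
    ]


def propensity_features(seq: str, table: List[Dict[str, float]]) -> List[float]:
    """Encode a sequence by looking up the propensity of its base at each position."""
    return [table[i][base] for i, base in enumerate(seq)]


if __name__ == "__main__":
    table = propensity_table(["AC", "AG"], ["CC", "UG"])
    print(nucleotide_composition("AACG"), propensity_features("AC", table))
```

Concatenating the two vectors would give one feature vector per sequence, which could then be passed to a feature selection step and a classifier.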
Constructing a Scientific Explanation—A Narrative Account
NASA Astrophysics Data System (ADS)
Yeo, Jennifer; Gilbert, John K.
2014-07-01
Studies analyzing explanations constructed by science students have found that they were generally weak and lacked necessary features. The goal of this study was to establish the competencies that one needs to construct a scientific explanation. Scientific explanations can be looked at in three ways, in terms of their function, form and level, as being essentially sign-making processes. Taking a case study approach and using Lemke's multimodal framework, we analyzed the scientific explanation of an electromagnetic induction phenomenon constructed by one high school student. We found that such a construction involves the complex coordination of different types of signs, not only to represent the entities in the phenomenon, but also to support thinking and reasoning about it at abstract levels. Scientific conventions and rules, and everyday material and social tools were found to be crucial in shifting from one level of abstraction to another. The findings highlight the importance of developing the skillful use of schemes of scientific representation by students and familiarizing them with commonly encountered contexts.
NASA Astrophysics Data System (ADS)
Svedholm, Annika M.; Lindeman, Marjaana
2013-03-01
Lay conceptions of energy often conflict with scientific knowledge, hinder science learning and scientific literacy, and provide a basis for ungrounded beliefs. In a sample of Finnish upper secondary school students, energy was attributed with features of living and animate beings and thought of as a mental property. These ontologically confused conceptions (OCC) were associated with trust in complementary and alternative medicine (CAM), and independent of scientifically valid conceptions. Substance-based energy conceptions followed the correlational pattern of OCC, rather than scientific conceptions. OCC and CAM decreased both during the regular school physics curriculum and after a lesson targeted at the ontological confusions. OCC and CAM were slightly less common among students with high actively open-minded thinking, low trust in intuition and high need for cognition. The findings are discussed in relation to the goals of scientific education.
Map Feature Content and Text Recall of Good and Poor Readers.
ERIC Educational Resources Information Center
Amlund, Jeanne T.; And Others
1985-01-01
Reports two experiments evaluating the effect of map feature content on text recall by subjects of varying reading skill levels. Finds that both experiments support the conjoint retention hypothesis, in which dual-coding of spatial and verbal information and their interaction in memory enhance recall. (MM)
Ethnobotanical Research at the Kutukú Scientific Station, Morona-Santiago, Ecuador.
Ballesteros, Jose Luis; Bracco, Francesco; Cerna, Marco; Vita Finzi, Paola; Vidari, Giovanni
2016-01-01
This work features the results of an ethnobotanical study on the uses of medicinal plants by the inhabitants of the region near to the Kutukú Scientific Station of Universidad Politécnica Salesiana, located in the Morona-Santiago province, southeast of Ecuador. In the surroundings of the station, one ethnic group, the Shuar, has been identified. The survey hereafter reports a total of 131 plant species, with 73 different therapeutic uses.
ERIC Educational Resources Information Center
Johnson, Janice K.
1973-01-01
Discusses the planning, construction, use, and maintenance of a nature trail. Ideal for demonstrating interrelationships between plants and animals, conservation practices, wildlife management, plant succession, forestry, geologic features and other scientific phenomena. (JR)
Identifying sports videos using replay, text, and camera motion features
NASA Astrophysics Data System (ADS)
Kobla, Vikrant; DeMenthon, Daniel; Doermann, David S.
1999-12-01
Automated classification of digital video is emerging as an important piece of the puzzle in the design of content management systems for digital libraries. The ability to classify videos into various classes such as sports, news, movies, or documentaries increases the efficiency of indexing, browsing, and retrieval of video in large databases. In this paper, we discuss the extraction of features that enable identification of sports videos directly from the compressed domain of MPEG video. These features include detecting the presence of action replays, determining the amount of scene text in video, and calculating various statistics on camera and/or object motion. The features are derived from the macroblock, motion, and bit-rate information that is readily accessible from MPEG video with very minimal decoding, leading to substantial gains in processing speeds. Full decoding of selected frames is required only for text analysis. A decision tree classifier built using these features is able to identify sports clips with an accuracy of about 93 percent.
Nursing professionalism: An evolutionary concept analysis
Ghadirian, Fataneh; Salsali, Mahvash; Cheraghi, Mohammad Ali
2014-01-01
Background: Professionalism is an important feature of professional occupations. The dynamic nature and various interpretations of the term have led to multiple definitions of the concept. The aim of this paper is to identify the core attributes of nursing professionalism. Materials and Methods: We followed Rodgers’ evolutionary method of concept analysis. Texts about nursing professionalism published in scientific databases between 1980 and 2011 were assessed. After the selection criteria were applied, a final sample of 4 books and 213 articles was selected, examined, and analyzed in depth. Two experts monitored and reviewed the analysis process. Results: The analysis showed that nursing professionalism is determined by three attributes: cognitive, attitudinal, and psychomotor. In addition, the most important antecedents were demographic, experiential, educational, environmental, and attitudinal factors. Conclusion: Nursing professionalism is an inevitable, complex, varied, and dynamic process. This study explains and clarifies the importance, scope, and concept of professionalism in nursing as a starting point for further research and development and for expanding nursing knowledge. PMID:24554953
Fleck and the social constitution of scientific objectivity.
Fagan, Melinda B
2009-12-01
Ludwik Fleck's theory of thought-styles has been hailed as a pioneer of constructivist science studies and sociology of scientific knowledge. But this consensus ignores an important feature of Fleck's epistemology. At the core of his account is the ideal of 'objective truth, clarity, and accuracy'. I begin with Fleck's account of modern natural science, locating the ideal of scientific objectivity within his general social epistemology. I then draw on Fleck's view of scientific objectivity to improve upon reflexive accounts of the origin and development of the theory of thought-styles, and reply to objections that Fleck's epistemological stance is self-undermining or inconsistent. Explicating the role of scientific objectivity in Fleck's epistemology reveals his view to be an internally consistent alternative to recent social accounts of scientific objectivity by Harding, Daston and Galison. I use these contrasts to indicate the strengths and weaknesses of Fleck's innovative social epistemology, and propose modifications to address the latter. The result is a renewed version of Fleck's social epistemology, which reconciles commitment to scientific objectivity with integrated sociology, history and philosophy of science.
Linguistic feature analysis for protein interaction extraction
2009-01-01
Background The rapid growth in the number of publicly available reports on biomedical experimental results has recently boosted text mining approaches for protein interaction extraction. Most approaches rely implicitly or explicitly on linguistic, i.e., lexical and syntactic, data extracted from text. However, only a few attempts have been made to evaluate the contribution of the different feature types. In this work, we contribute to this evaluation by studying the relative importance of deep syntactic features, i.e., grammatical relations, shallow syntactic features (part-of-speech information) and lexical features. For this purpose, we use a recently proposed approach that uses support vector machines with structured kernels. Results Our results reveal that the contribution of the different feature types varies for the different data sets on which the experiments were conducted. The smaller the training corpus compared to the test data, the more important the role of grammatical relations becomes. Moreover, classifiers based on deep syntactic information prove to be more robust on heterogeneous texts where no or only limited common vocabulary is shared. Conclusion Our findings suggest that grammatical relations play an important role in the interaction extraction task. Moreover, the net advantage of adding lexical and shallow syntactic features is small relative to the number of added features. This implies that efficient classifiers can be built by using only a small fraction of the features that are typically used in recent approaches. PMID:19909518
[Leonardo da Vinci--a dyslectic genius?].
Røsstad, Anna
2002-12-10
Leonardo da Vinci's texts consist almost exclusively of scientific notes. Working on a book on Leonardo's art, I studied all Leonardo's published texts carefully for any new information. In some prefaces I came to suspect that Leonardo might have suffered from dyslexia. This article considers the question of whether it is possible to find indications of dyslexia in Leonardo's texts and in the accounts of his life.
From the desktop to the grid: scalable bioinformatics via workflow conversion.
de la Garza, Luis; Veit, Johannes; Szolek, Andras; Röttig, Marc; Aiche, Stephan; Gesing, Sandra; Reinert, Knut; Kohlbacher, Oliver
2016-03-12
Reproducibility is one of the tenets of the scientific method. Scientific experiments often comprise complex data flows, selection of adequate parameters, and analysis and visualization of intermediate and end results. Breaking down the complexity of such experiments into the joint collaboration of small, repeatable, well-defined tasks, each with well-defined inputs, parameters, and outputs, offers immediate benefits such as identifying bottlenecks and pinpointing sections that could benefit from parallelization. Workflows rest upon the notion of splitting complex work into the joint effort of several manageable tasks. There are several engines that give users the ability to design and execute workflows. Each engine was created to address certain problems of a specific community; therefore, each one has its advantages and shortcomings. Furthermore, not all features of all workflow engines are royalty-free, an aspect that could potentially drive away members of the scientific community. We have developed a set of tools that enables the scientific community to benefit from workflow interoperability. We developed a platform-free structured representation of the parameters, inputs, and outputs of command-line tools in so-called Common Tool Descriptor documents. We have also overcome the shortcomings and combined the features of two royalty-free workflow engines with a substantial user community: the Konstanz Information Miner, an engine which we see as a formidable workflow editor, and the Grid and User Support Environment, a web-based framework able to interact with several high-performance computing resources. We have thus created a free and highly accessible way to design workflows on a desktop computer and execute them on high-performance computing resources. Our work will not only reduce time spent on designing scientific workflows, but also make executing workflows on remote high-performance computing resources more accessible to technically inexperienced users.
We strongly believe that our efforts not only decrease the turnaround time to obtain scientific results but also have a positive impact on reproducibility, thus elevating the quality of obtained scientific results.
Reference management: A critical element of scientific writing
Kali, Arunava
2016-01-01
With the rapid growth of medical science, the volume of scientific writing contributing to the medical literature has increased significantly in recent years. Owing to considerable variation in formatting among citation styles, strict manual adherence to accurate referencing is labor intensive and challenging. However, the introduction of referencing tools has decreased the complexity to a great extent. These tools have advanced over time to include newer features that support effective reference management. Since scientific writing is an essential component of the medical curriculum, it is imperative for medical graduates to understand various referencing systems in order to make effective use of these tools in their dissertations and future research. PMID:26952149
Understanding as Integration of Heterogeneous Representations
NASA Astrophysics Data System (ADS)
Martínez, Sergio F.
2014-03-01
The search for understanding is a major aim of science. Traditionally, understanding has been undervalued in the philosophy of science because of its psychological underpinnings; nowadays, however, it is widely recognized that epistemology cannot be divorced from psychology as sharply as traditional epistemology required. This eliminates the main obstacle to giving scientific understanding due attention in the philosophy of science. My aim in this paper is to describe an account of scientific understanding as an emergent feature of our mastering of different (causal) explanatory frameworks, which takes place through the mastering of scientific practices. Different practices lead to different kinds of representations. Such representations are often heterogeneous. The integration of such representations constitutes understanding.
[Inheritance and innovation of traditional Chinese medicinal authentication].
Zhao, Zhong-zhen; Chen, Hu-biao; Xiao, Pei-gen; Guo, Ping; Liang, Zhi-tao; Hung, Fanny; Wong, Lai-lai; Brand, Eric; Liu, Jing
2015-09-01
Chinese medicinal authentication is fundamental for the standardization and globalization of Chinese medicine. The discipline of authentication addresses difficult issues that have remained unresolved for thousands of years, and is essential for preserving safety. Chinese medicinal authentication has both scientific and traditional cultural connotations; the use of scientific methods to elucidate traditional experience-based differentiation carries the legacy of Chinese medicine forward, and offers immediate practical significance and long-term scientific value. In this paper, a path of inheritance and innovation is explored through the scientific exposition of Chinese medicinal authentication, featuring a review of specialized publications, the establishment of a Chinese medicine specimen center and Chinese medicinal image databases, the expansion of authentication technologies, and the formation of a cultural project dedicated to the Compendium of Materia Medica.
Alzforum and SWAN: the present and future of scientific web communities.
Clark, Tim; Kinoshita, June
2007-05-01
Scientists drove the early development of the World Wide Web, primarily as a means for rapid communication, document sharing and data access. They have been far slower to adopt the web as a medium for building research communities. Yet, web-based communities hold great potential for accelerating the pace of scientific research. In this article, we will describe the 10-year experience of the Alzheimer Research Forum ('Alzforum'), a unique example of a thriving scientific web community, and explain the features that contributed to its success. We will then outline the SWAN (Semantic Web Applications in Neuromedicine) project, in which Alzforum curators are collaborating with informatics researchers to develop novel approaches that will enable communities to share richly contextualized information about scientific data, claims and hypotheses.
Learnscapes, transforming the world into an Open Air Museum.
NASA Astrophysics Data System (ADS)
Lucía, Ana
2017-04-01
Scientists work everywhere, but scientific knowledge is still not widespread among the public and is largely confined to museums and a few other places. Learnscapes is a new tool for scientists to disseminate their work, making it accessible to people in the right place and at the right moment. This will be possible through a platform (both web and app) that allows tourists to access accurate scientific knowledge related to the place they are visiting and the studied objects they are interested in (rivers, mountains, monuments…); in this way, the visited place will acquire a higher value. Learnscapes will benefit people, science and territory. The objective is to close the current communication gap between science and people. Since the information will be geolocalized, users will receive an alert when passing near a location with scientific information; this way they will better understand it and become more aware of the importance of the research. The intended users of Learnscapes are curious people who go to science museums, are familiar with technology and web apps, and have a high level of education. Since curious people and geolocalized science are common worldwide, the project is clearly scalable. Scientists will be able to feature a summary of their work in Learnscapes with little time investment. All the content will be open and freely available to users and will have a DOI. At the same time, scientists who feature their work in Learnscapes, as well as the research and funding institutions involved in the featured studies, will have their own profile that, even if virtual, will enable interaction between scientists and society.
To guarantee scientific accuracy, two kinds of content are accepted: (1) content related to already published scientific results (in peer-reviewed publications) or (2) content related to ongoing projects that do not yet have published results but have some kind of equipment installed outside laboratories or research institutions that can be seen by passers-by; in this way it could substitute for or complement the usual panels that scientists install at monitoring stations. Learnscapes is not limited to any given discipline; nevertheless, since the scientific information included in the platform has to relate to a certain place, it is likely that the Geosciences will benefit most from it. With Learnscapes, scientists and research institutions will have the chance to disseminate their work with an innovative tool and to obtain visibility and social recognition. www.learnscapes.co
Druzinsky, Robert E; Balhoff, James P; Crompton, Alfred W; Done, James; German, Rebecca Z; Haendel, Melissa A; Herrel, Anthony; Herring, Susan W; Lapp, Hilmar; Mabee, Paula M; Muller, Hans-Michael; Mungall, Christopher J; Sternberg, Paul W; Van Auken, Kimberly; Vinyard, Christopher J; Williams, Susan H; Wall, Christine E
2016-01-01
In recent years large bibliographic databases have made much of the published literature of biology available for searches. However, the capabilities of the search engines integrated into these databases for text-based bibliographic searches are limited. To enable searches that deliver the results expected by comparative anatomists, an underlying logical structure known as an ontology is required. Here we present the Mammalian Feeding Muscle Ontology (MFMO), a multi-species ontology focused on anatomical structures that participate in feeding and other oral/pharyngeal behaviors. A unique feature of the MFMO is that a simple, computable, definition of each muscle, which includes its attachments and innervation, is true across mammals. This construction mirrors the logical foundation of comparative anatomy and permits searches using language familiar to biologists. Further, it provides a template for muscles that will be useful in extending any anatomy ontology. The MFMO is developed to support the Feeding Experiments End-User Database Project (FEED, https://feedexp.org/), a publicly-available, online repository for physiological data collected from in vivo studies of feeding (e.g., mastication, biting, swallowing) in mammals. Currently the MFMO is integrated into FEED and also into two literature-specific implementations of Textpresso, a text-mining system that facilitates powerful searches of a corpus of scientific publications. We evaluate the MFMO by asking questions that test the ability of the ontology to return appropriate answers (competency questions). We compare the results of queries of the MFMO to results from similar searches in PubMed and Google Scholar. Our tests demonstrate that the MFMO is competent to answer queries formed in the common language of comparative anatomy, but PubMed and Google Scholar are not. 
Overall, our results show that by incorporating anatomical ontologies into searches, an expanded and anatomically comprehensive set of results can be obtained. The broader scientific and publishing communities should consider taking up the challenge of semantically enabled search capabilities.
Development of Human Face Literature Database Using Text Mining Approach: Phase I.
Kaur, Paramjit; Krishan, Kewal; Sharma, Suresh K
2018-06-01
The face is an important part of the human body through which an individual communicates in society. Its importance is highlighted by the fact that a person deprived of a face cannot survive in the living world. The number of experiments being performed and research papers being published in the domain of the human face has surged in the past few decades. Scientific disciplines conducting research on the human face include Medical Science, Anthropology, Information Technology (Biometrics, Robotics, Artificial Intelligence, etc.), Psychology, Forensic Science, Neuroscience, etc. This creates a need to collect and manage data concerning the human face so that public and free access to it can be provided to the scientific community. This can be attained by developing databases and tools on the human face using a bioinformatics approach. The current research focuses on creating a database of literature data on the human face. The database can be accessed on the basis of specific keywords, journal name, date of publication, author's name, etc. The collected research papers will be stored in the form of a database. Hence, the database will be beneficial to the research community, as comprehensive information dedicated to the human face can be found in one place. Information related to facial morphologic features, facial disorders, facial asymmetry, facial abnormalities, and many other parameters can be extracted from this database. The front end has been developed using Hyper Text Mark-up Language and Cascading Style Sheets. The back end has been developed using Hypertext Preprocessor (PHP). JavaScript has been used as the scripting language. MySQL (Structured Query Language) is used for database development, as it is the most widely used Relational Database Management System.
XAMPP (X (cross platform), Apache, MySQL, PHP, Perl) open source web application software has been used as the server. The database is still under development, and the paper discusses the initial steps of its creation. The current paper describes the work done to date.
Teaching Text Structure: Examining the Affordances of Children's Informational Texts
ERIC Educational Resources Information Center
Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray
2016-01-01
This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…
Intervention and Revision: Expertise and Interaction in Text Mediation
ERIC Educational Resources Information Center
Luo, Na; Hyland, Ken
2017-01-01
Many EAL (English as an Additional Language) scholars enlist text mediators' support when faced with the challenges of writing for international publication. However, the contributions these individuals are able to make in improving scientific manuscripts remains unclear, especially when language professionals such as English teachers do this…
The Function of Frame in the Comprehension of Scientific Text.
ERIC Educational Resources Information Center
Rossi, Jean Pierre
1990-01-01
One hundred French children in grade five participated in an experiment to determine how the problem frame facilitates comprehension of a problem solution text. Results demonstrate the positive role of frames in macrostructure construction and support the model of T. A. van Dijk and W. Kintsch (1983). (SLD)
Mapping a Space for a Rhetorical-Cultural Analysis: A Case of a Scientific Proposal
ERIC Educational Resources Information Center
Dorpenyo, Isidore Kafui
2015-01-01
This article analyzes a proposal submitted to a funding unit in Michigan Technological University by a PhD Forestry student. A rhetorical-cultural approach of the text provides evidence to argue that scientific writing is rooted in a cultural practice that valorizes certain kinds of thought, practices, rituals, and symbols; that a scientist's work…
ERIC Educational Resources Information Center
Thiebach, Monja; Mayweg-Paus, Elisabeth; Jucks, Regina
2015-01-01
Contemporary school learning typically includes the processing of popular scientific information as found in journals, magazines, and/or the WWW. The German high school curriculum emphasizes that students should have achieved science literacy and have learned to evaluate the substance of text-based learning content by the end of high school.…
1988-04-26
Technology [Text] Today, science and technology are the object of ideological debate worldwide. The article addresses questions concerning the meaning...other areas of cooperation between the two sides, from scientific and technological cooperation to environmental protection. Perhaps the only blot...not so far erased in their mutual relations remains Yugoslavia’s inclusion in the "Eureka" scientific and technological program, which is being
ERIC Educational Resources Information Center
Arya, Diana J.; Maul, Andrew
2012-01-01
In an experimental study (N = 209), the authors compared the effects of exposure to typical middle-school written science content when presented in the context of the scientific discovery narrative and when presented in a more traditional nonnarrative format on 7th and 8th grade students in the United States. The development of texts was…
SciELO, Scientific Electronic Library Online, a Database of Open Access Journals
ERIC Educational Resources Information Center
Meneghini, Rogerio
2013-01-01
This essay discusses SciELO, a scientific journal database operating in 14 countries. It covers over 1000 journals providing open access to full text and table sets of scientometrics data. In Brazil it is responsible for a collection of nearly 300 journals, selected along 15 years as the best Brazilian periodicals in natural and social sciences.…
Liu, Tongtong; Ge, Xifeng; Yu, Jinhua; Guo, Yi; Wang, Yuanyuan; Wang, Wenping; Cui, Ligang
2018-06-21
B-mode ultrasound (B-US) and strain elastography ultrasound (SE-US) images have the potential to distinguish thyroid tumors with different lymph node (LN) status. The purpose of our study is to investigate whether the application of multi-modality images including B-US and SE-US can improve the discriminability of thyroid tumors with LN metastasis based on a radiomics approach. Ultrasound (US) images including B-US and SE-US images of 75 papillary thyroid carcinoma (PTC) cases were retrospectively collected. A radiomics approach was developed in this study to estimate the LN status of PTC patients. The approach included image segmentation, quantitative feature extraction, feature selection and classification. Three feature sets were extracted from B-US, SE-US, and multi-modality containing B-US and SE-US. They were used to evaluate the contribution of different modalities. A total of 684 radiomics features were extracted in our study. We used a sparse representation coefficient-based feature selection method with 10-bootstrap to reduce the dimension of the feature sets. A support vector machine with leave-one-out cross-validation was used to build the model for estimating LN status. Using features extracted from both B-US and SE-US, the radiomics-based model produced an area under the receiver operating characteristic curve (AUC) of 0.90, accuracy (ACC) of 0.85, sensitivity (SENS) of 0.77 and specificity (SPEC) of 0.88, which was better than using features extracted from B-US or SE-US separately. Multi-modality images provided more information in the radiomics study. Combined use of B-US and SE-US could improve the LN metastasis estimation accuracy for PTC patients.
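The evaluation scheme named in this abstract, leave-one-out cross-validation scored by accuracy, sensitivity, and specificity, can be sketched in plain Python. This is a minimal illustration, not the authors' pipeline: a nearest-centroid classifier stands in for the support vector machine, the feature vectors are made-up toy values, and the sketch assumes both classes are present in every training fold.

```python
def nearest_centroid_predict(train_X, train_y, x):
    """Predict the label whose class centroid is closest to x."""
    centroids = {}
    for label in set(train_y):
        members = [row for row, y in zip(train_X, train_y) if y == label]
        centroids[label] = [sum(col) / len(members) for col in zip(*members)]

    def sq_dist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    return min(centroids, key=lambda label: sq_dist(centroids[label], x))


def leave_one_out(X, y):
    """Hold out each case once, train on the rest, and predict the held-out case."""
    preds = []
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        preds.append(nearest_centroid_predict(train_X, train_y, X[i]))
    return preds


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity, with 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),   # true positive rate
        "specificity": tn / (tn + fp),   # true negative rate
    }


# Toy two-feature cases (hypothetical values, not study data).
X = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [1.0, 0.9], [0.8, 1.1], [0.9, 1.0]]
y = [0, 0, 0, 1, 1, 1]
loo_predictions = leave_one_out(X, y)
loo_metrics = binary_metrics(y, loo_predictions)
```

Leave-one-out is a common choice for small cohorts such as the 75 cases here, because every case is used for testing exactly once while the model is always trained on all remaining cases.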
Literacy Practices in Computer-Mediated Communication in Hong Kong.
ERIC Educational Resources Information Center
Lee, Carmen
2002-01-01
Examines linguistic features of text-based computer-mediated communication (CMC) in Hong Kong. The study is based on a 70,000-word corpus of electronic mail and ICQ instant messaging texts, which were collected from students in Hong Kong. Identified language-specific features that may be seen as new literacy practices within the theoretical…
Comparing the Lexical Features of EAP Students' Essays by Prompt and Rating
ERIC Educational Resources Information Center
Lavallée, Maxime; McDonough, Kim
2015-01-01
Previous research has shown that high frequency lexical items, such as AWL words and formulaic expressions, may differentiate between texts written by expert and novice writers (Chen & Baker, 2010; Hancioglu, 2009), and that lexical features related to breadth, depth, and accessibility differentiate among texts from L2 writers of different…
NASA Astrophysics Data System (ADS)
Wallace, Carolyn S.
2004-11-01
This article presents a theoretical framework in the form of a model on which to base research in scientific literacy and language use. The assumption guiding the framework is that scientific literacy is comprised of the abilities to think metacognitively, to read and write scientific texts, and to apply the elements of a scientific argument. The framework is composed of three theoretical constructs: authenticity, multiple discourses, and Bhabha's Third Space. Some of the implications of the framework are that students need opportunities to (a) use scientific language in everyday situations; (b) negotiate readily among the many discourse genres of science; and (c) collaborate with teachers and peers on the meaning of scientific language. These ideas are illustrated with data excerpts from contemporary research studies. A set of potential research issues for the future is posed at the end of the article.
NASA Astrophysics Data System (ADS)
Zhao, P.; Xu, X.; Chen, F.; Guo, X.; Zheng, X.; Liu, L. P.; Hong, Y.; Li, Y.; La, Z.; Peng, H.; Zhong, L. Z.; Ma, Y.; Tang, S. H.; Liu, Y.; Liu, H.; Li, Y. H.; Zhang, Q.; Hu, Z.; Sun, J. H.; Zhang, S.; Dong, L.; Zhang, H.; Zhao, Y.; Yan, X.; Xiao, A.; Wan, W.; Zhou, X.
2016-12-01
The Third Tibetan Plateau atmospheric scientific experiment (TIPEX-III) was initiated jointly by the China Meteorological Administration, the National Natural Scientific Foundation, and the Chinese Academy of Sciences. This paper presents the background, scientific objectives, and overall experimental design of TIPEX-III. It was designed to conduct an integrated observation of the earth-atmosphere coupled system over the Tibetan Plateau (TP) from land surface, planetary boundary layer (PBL), troposphere, and stratosphere for eight to ten years by coordinating ground- and air-based measurement facilities for understanding spatial heterogeneities of complex land-air interactions, cloud-precipitation physical processes, and interactions between troposphere and stratosphere. TIPEX-III originally began in 2014, and is ongoing. It established multiscale land-surface and PBL observation networks over the TP and a tropospheric meteorological radiosonde network over the western TP, and executed an integrated observation mission for cloud-precipitation physical features using ground-based radar systems and aircraft campaigns and an observation task for atmospheric ozone, aerosol, and water vapor. The archive, management, and share policy of the observation data are also introduced herein. Some TIPEX-III data have been preliminarily applied to analyze the features of surface sensible and latent heat fluxes, cloud-precipitation physical processes, and atmospheric water vapor and ozone over the TP, and to improve the local precipitation forecast. Furthermore, TIPEX-III intends to promote greater scientific and technological cooperation with international research communities and broader organizations. Scientists working internationally are invited to participate in the field campaigns and to use the TIPEX-III data for their own research.
Ethnobotanical Research at the Kutukú Scientific Station, Morona-Santiago, Ecuador
Bracco, Francesco; Cerna, Marco; Vita Finzi, Paola; Vidari, Giovanni
2016-01-01
This work features the results of an ethnobotanical study on the uses of medicinal plants by the inhabitants of the region near to the Kutukú Scientific Station of Universidad Politécnica Salesiana, located in the Morona-Santiago province, southeast of Ecuador. In the surroundings of the station, one ethnic group, the Shuar, has been identified. The survey hereafter reports a total of 131 plant species, with 73 different therapeutic uses. PMID:28074189
Debru, Claude
2010-06-01
This paper is based on Canguilhem's text on the concept of scientific ideology, which he introduced in 1969. We describe Canguilhem's attempts at designing a methodological framework for the history of science including the status of kinds of knowledge related to science, like scientific ideologies preceding particular scientific domains (like ideologies about inheritance before Mendel, or Spencer's universal evolutionary laws preceding Darwin). This attempt at picturing the relationships between science and ideology is compared with Jürgen Habermas's book Technology and Science as 'Ideology' in 1968. The philosophical issue of human normativity provides the framework of this discussion.
[The treatment of scientific knowledge in the framework of CITES].
Lanfranchi, Marie-Pierre
2014-03-01
Access to scientific knowledge in the context of CITES is a crucial issue. The effectiveness of the text is indeed largely based on adequate scientific knowledge of CITES species. This is a major challenge: more than 30,000 species and 178 member states are involved. The issue of expertise, however, is not really addressed by the Convention. The question was left to the consideration of the COP. Therefore, the COP has created two ad hoc scientific committees: the Plants Committee and the Animals Committee, conferring upon them an ambitious mandate. The article addresses some important issues at stake which are linked to institutional questions, as well as the mixed record after twenty-five years of practice.
Text-based discovery in biomedicine: the architecture of the DAD-system.
Weeber, M; Klein, H; Aronson, A R; Mork, J G; de Jong-van den Berg, L T; Vos, R
2000-01-01
Current scientific research takes place in highly specialized contexts with poor communication between disciplines as a likely consequence. Knowledge from one discipline may be useful for the other without researchers knowing it. As scientific publications are a condensation of this knowledge, literature-based discovery tools may help the individual scientist to explore new useful domains. We report on the development of the DAD-system, a concept-based Natural Language Processing system for PubMed citations that provides the biomedical researcher such a tool. We describe the general architecture and illustrate its operation by a simulation of a well-known text-based discovery: The favorable effects of fish oil on patients suffering from Raynaud's disease [1].
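The kind of literature-based discovery simulated in this abstract is often explained with a two-step "open discovery" scheme: collect the concepts that co-occur with a starting concept, then rank concepts that co-occur with those intermediates but never with the starting concept itself. The sketch below illustrates that general idea only. It is an assumption, not the DAD-system's architecture, and the concept sets are toy data rather than real PubMed citations.

```python
from collections import Counter


def open_discovery(citations, start):
    """Rank concepts linked to `start` only through intermediate concepts.

    `citations` is a list of sets, each holding the concepts found in one
    citation. The score of a candidate is the number of intermediate-concept
    links that connect it to the starting concept.
    """
    # Step 1: intermediate concepts that co-occur directly with the start.
    direct = set()
    for concepts in citations:
        if start in concepts:
            direct |= concepts
    direct.discard(start)

    # Step 2: concepts that co-occur with an intermediate concept in
    # citations that do not mention the starting concept at all.
    scores = Counter()
    for concepts in citations:
        if start in concepts:
            continue
        shared = concepts & direct
        if shared:
            for candidate in concepts - direct:
                scores[candidate] += len(shared)
    return scores.most_common()


# Toy citation-level concept sets (illustrative only).
toy_citations = [
    {"fish oil", "blood viscosity"},
    {"fish oil", "platelet aggregation"},
    {"blood viscosity", "raynaud disease"},
    {"platelet aggregation", "raynaud disease"},
    {"platelet aggregation", "aspirin"},
]
ranked = open_discovery(toy_citations, "fish oil")
```

In this toy run the highest-ranked candidate is reached through two independent intermediate concepts, which is the sort of indirect evidence a researcher would then check by reading the underlying papers.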
Artificial muscles' enrichment text: Chemical Literacy Profile of pre-service teachers
NASA Astrophysics Data System (ADS)
Hernani, Ulum, Luthfi Lulul; Mudzakir, Ahmad
2017-08-01
This research aims to determine the chemical literacy profile of pre-service teachers, based on the scientific attitudes and scientific competencies in PISA 2015, through individualized learning using an enrichment book based on an artificial muscle context. The research uses a descriptive method, involving 20 participants randomly selected from a population of 90. The instrument consists of multiple-choice questions. The results are as follows: 1) for the attitude aspects of interest in science and technology, valuing scientific approaches to inquiry, and environmental awareness, the results obtained were 90%, 80%, and 30%, respectively. 2) For the scientific competencies of applying appropriate scientific knowledge, identifying models and representations, making appropriate predictions, and explaining the potential implications of scientific knowledge for society, the results obtained were 30%, 50%, 60%, and 55%, respectively. 3) For the scientific competencies of identifying the question explored in a given scientific study and distinguishing questions that could be investigated scientifically, the results obtained were 30% and 50%, respectively. 4) For the scientific competencies of transforming data from one representation to another and drawing appropriate conclusions, the results obtained were 60% and 45%, respectively. Based on these results, the areas that need to be developed in pre-service chemistry teachers are environmental awareness, applying appropriate scientific knowledge, identifying the question explored in a given scientific study, and drawing appropriate conclusions.
NASA Astrophysics Data System (ADS)
Krumhansl, R. A.; Foster, J.; Peach, C. L.; Busey, A.; Baker, I.
2012-12-01
The practice of science and engineering is being revolutionized by the development of cyberinfrastructure for accessing near real-time and archived observatory data. Large cyberinfrastructure projects have the potential to transform the way science is taught in high school classrooms, making enormous quantities of scientific data available, giving students opportunities to analyze and draw conclusions from many kinds of complex data, and providing students with experiences using state-of-the-art resources and techniques for scientific investigations. However, online interfaces to scientific data are built by scientists for scientists, and their design can significantly impede broad use by novices. Knowledge relevant to the design of student interfaces to complex scientific databases is broadly dispersed among disciplines ranging from cognitive science to computer science and cartography and is not easily accessible to designers of educational interfaces. To inform efforts at bridging scientific cyberinfrastructure to the high school classroom, Education Development Center, Inc. and the Scripps Institution of Oceanography conducted an NSF-funded 2-year interdisciplinary review of literature and expert opinion pertinent to making interfaces to large scientific databases accessible to and usable by precollege learners and their teachers. Project findings are grounded in the fundamentals of Cognitive Load Theory, Visual Perception, Schemata formation and Universal Design for Learning. The Knowledge Status Report (KSR) presents cross-cutting and visualization-specific guidelines that highlight how interface design features can address or ameliorate the challenges novice high school students face as they navigate complex databases to find data, and construct and look for patterns in maps, graphs, animations and other data visualizations. 
The guidelines present ways to make scientific databases more broadly accessible by: 1) adjusting the cognitive load imposed by the user interface and visualizations so that it doesn't exceed the amount of information the learner can actively process; 2) drawing attention to important features and patterns; and 3) enabling customization of visualizations and tools to meet the needs of diverse learners.
... page: https://medlineplus.gov/palliativecaretexts.html Palliative Care Texts Free text messages to support you and your family during ...
Specifics on a XML Data Format for Scientific Data
NASA Astrophysics Data System (ADS)
Shaya, E.; Thomas, B.; Cheung, C.
An XML-based data format for interchange and archiving of scientific data would benefit in many ways from the features standardized in XML. Foremost of these features is the world-wide acceptance and adoption of XML. Applications, such as browsers, XQL and XSQL advanced query, XML editing, or CSS or XSLT transformation, that are coming out of industry and academia can be easily adopted and provide startling new benefits and features. We have designed a prototype of a core format for holding, in a very general way, parameters, tables, scalar and vector fields, atlases, animations and complex combinations of these. This eXtensible Data Format (XDF) makes use of XML functionalities such as: self-validation of document structure, default values for attributes, XLink hyperlinks, entity replacements, internal referencing, inheritance, and XSLT transformation. An API is available to aid in detailed assembly, extraction, and manipulation. Conversion tools to and from FITS and other existing data formats are under development. In the future, we hope to provide object oriented interfaces to C++, Java, Python, IDL, Mathematica, Maple, and various databases. http://xml.gsfc.nasa.gov/XDF
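To make the idea of holding parameters and tables in a general XML container concrete, the sketch below parses a small document with Python's standard xml.etree.ElementTree module. The element and attribute names (dataset, parameter, units, table, row) are hypothetical and are not claimed to match the actual XDF schema; the fallback to "unitless" is applied in code and only mimics the default attribute values the abstract mentions.

```python
import xml.etree.ElementTree as ET

# Hypothetical document layout for illustration; not the real XDF schema.
DOC = """<dataset name="demo">
  <parameter name="exposure" units="s">30</parameter>
  <parameter name="filter">V</parameter>
  <table>
    <row>1.0 2.5</row>
    <row>2.0 3.5</row>
  </table>
</dataset>"""


def read_dataset(xml_text):
    """Return (parameters, table) from the illustrative document layout."""
    root = ET.fromstring(xml_text)
    # Map each parameter name to (value, units); a missing units attribute
    # falls back to "unitless".
    params = {
        p.get("name"): (p.text.strip(), p.get("units", "unitless"))
        for p in root.findall("parameter")
    }
    # Each <row> holds whitespace-separated numeric cells.
    table = [[float(v) for v in row.text.split()] for row in root.iter("row")]
    return params, table


params, table = read_dataset(DOC)
```

A self-describing layout like this lets generic XML tooling read the data without format-specific parsers, which is the benefit the abstract attributes to building on XML.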
Li, Jie; Li, Lei; Liu, Rui; Lin, Hong-sheng
2012-10-01
The features and advantages of Chinese medicine (CM) in comprehensive cancer treatment have been in the spotlight of experts both at home and abroad. However, how to evaluate the effect of CM more objectively, scientifically and systematically is still the key problem of clinical trials, and also a limitation on the development and internationalization of CM oncology. The changes in the tumor response evaluation system in conventional medicine are gradually becoming consistent with the features of CM clinical effect, in that both focus on a combination of soft endpoints (i.e. quality of life, clinical benefit, etc.) and hard endpoints (i.e. tumor remission rate, time to progression, etc.). Although experts have proposed protocols of CM tumor response evaluation criteria and have come to a general agreement, divergences still exist on the importance, quantification and CM features of the potential endpoints. Thus, establishing a widely accepted tumor response evaluation system with CM characteristics is the key to promoting the internationalization of CM oncology, and also provides a more convenient and scientific platform for CM international cooperation and communication.
Structural biology at the European X-ray free-electron laser facility
Altarelli, Massimo; Mancuso, Adrian P.
2014-01-01
The European X-ray free-electron laser (XFEL) facility, under construction in the Hamburg region, will provide high-peak brilliance (greater than 10^33 photons s^-1 mm^-2 mrad^-2 per 0.1% BW), ultrashort pulses (approx. 10 fs) of X-rays, with a high repetition rate (up to 27 000 pulses s^-1) from 2016 onwards. The main features of this exceptional X-ray source, and the instrumentation developments necessary to exploit them fully, for application to a variety of scientific disciplines, are briefly summarized. In the case of structural biology, which has a central role in the scientific case of this new facility, the instruments and ancillary laboratories that are being planned and built within the baseline programme of the European XFEL and by consortia of users are also discussed. It is expected that the unique features of the source and the advanced features of the instrumentation will allow operation modes with more efficient use of sample materials, faster acquisition times, and conditions better approaching feasibility of single molecule imaging. PMID:24914145
CheS-Mapper - Chemical Space Mapping and Visualization in 3D.
Gütlein, Martin; Karwath, Andreas; Kramer, Stefan
2012-03-17
Analyzing chemical datasets is a challenging task for scientific researchers in the field of chemoinformatics. It is important, yet difficult, to understand the relationship between the structure of chemical compounds, their physico-chemical properties, and biological or toxic effects. In that respect, visualization tools can help to better comprehend the underlying correlations. Our recently developed 3D molecular viewer CheS-Mapper (Chemical Space Mapper) divides large datasets into clusters of similar compounds and then arranges them in 3D space, such that their spatial proximity reflects their similarity. The user can indirectly determine similarity by selecting which features to employ in the process. The tool can use and calculate different kinds of features, such as structural fragments and quantitative chemical descriptors. These features can be highlighted within CheS-Mapper, which helps the chemist to better understand patterns and regularities and relate the observations to established scientific knowledge. As a final function, the tool can also be used to select and export specific subsets of a given dataset for further analysis.
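Grouping compounds by the similarity of their structural fragments can be illustrated with the Tanimoto coefficient and a simple greedy "leader" clustering. This is a sketch under stated assumptions: the abstract does not say which similarity measure or clustering algorithm CheS-Mapper uses, and the compound names and fragment sets below are hypothetical.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two fragment sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def leader_cluster(fragments, threshold=0.5):
    """Greedy clustering: join the first cluster whose leader is similar enough.

    `fragments` maps a compound name to its set of structural fragments.
    A compound that matches no existing leader starts a new cluster.
    """
    clusters = []  # list of (leader_name, member_names)
    for name, frags in fragments.items():
        for leader, members in clusters:
            if tanimoto(fragments[leader], frags) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((name, [name]))
    return [members for _, members in clusters]


# Hypothetical compounds described by made-up fragment sets.
compounds = {
    "c1": {"benzene", "hydroxyl"},
    "c2": {"benzene", "hydroxyl", "methyl"},
    "c3": {"carboxyl", "amine"},
    "c4": {"carboxyl", "amine", "methyl"},
}
clusters = leader_cluster(compounds, threshold=0.5)
```

Changing which fragments are included in each set changes the similarity values, which mirrors the abstract's point that the user shapes the clustering indirectly through feature selection.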
CheS-Mapper - Chemical Space Mapping and Visualization in 3D
2012-01-01
Analyzing chemical datasets is a challenging task for scientific researchers in the field of chemoinformatics. It is important, yet difficult, to understand the relationship between the structure of chemical compounds, their physico-chemical properties, and biological or toxic effects. In that respect, visualization tools can help to better comprehend the underlying correlations. Our recently developed 3D molecular viewer CheS-Mapper (Chemical Space Mapper) divides large datasets into clusters of similar compounds and then arranges them in 3D space, such that their spatial proximity reflects their similarity. The user can indirectly determine similarity by selecting which features to employ in the process. The tool can use and calculate different kinds of features, such as structural fragments and quantitative chemical descriptors. These features can be highlighted within CheS-Mapper, which helps the chemist to better understand patterns and regularities and relate the observations to established scientific knowledge. As a final function, the tool can also be used to select and export specific subsets of a given dataset for further analysis. PMID:22424447
Structural biology at the European X-ray free-electron laser facility.
Altarelli, Massimo; Mancuso, Adrian P
2014-07-17
The European X-ray free-electron laser (XFEL) facility, under construction in the Hamburg region, will provide high-peak brilliance (greater than 10^33 photons s^-1 mm^-2 mrad^-2 per 0.1% BW), ultrashort pulses (approx. 10 fs) of X-rays, with a high repetition rate (up to 27 000 pulses s^-1) from 2016 onwards. The main features of this exceptional X-ray source, and the instrumentation developments necessary to exploit them fully, for application to a variety of scientific disciplines, are briefly summarized. In the case of structural biology, which has a central role in the scientific case of this new facility, the instruments and ancillary laboratories that are being planned and built within the baseline programme of the European XFEL and by consortia of users are also discussed. It is expected that the unique features of the source and the advanced features of the instrumentation will allow operation modes with more efficient use of sample materials, faster acquisition times, and conditions better approaching feasibility of single molecule imaging. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Fisher Center for Alzheimer's Research Foundation
... We are making a major impact on Alzheimer’s research. Our scientific discoveries are featured in top publications ... is vetted by scientists for accuracy. Explore Our Research Nobel Prize Winner Dr. Paul Greengard leads our ...
NASA Astrophysics Data System (ADS)
Manos, Harry
2003-05-01
Most visitors to Florence, Italy, know about the Galleria dell'Accademia, housing Michelangelo's famous statue of David, or the Galleria degli Uffizi with the famous Medici collection. Few visitors know that only two blocks from the Uffizi on the Arno River is one of the world's finest museums featuring historic scientific instruments, the Museo di Storia della Scienza. In the February issue of TPT, Nickell states that the Museo di Storia della Scienza "is perhaps the best museum on the history of science in the world." [1] This claim is likely true, and the museum is a must for physics teachers visiting Florence. It features a vast collection of authentic "cutting-edge" scientific instruments, including one of Galileo's lenses in a magnificent ebony and ivory frame. One of the tragedies is that this museum goes unmarked on many tourist maps and unmentioned in many guidebooks.
NASA Technical Reports Server (NTRS)
Saini, Subhash; Hood, Robert T.; Chang, Johnny; Baron, John
2016-01-01
We present a performance evaluation conducted on a production supercomputer of the Intel Xeon Processor E5-2680v3, a twelve-core implementation of the fourth-generation Haswell architecture, and compare it with Intel Xeon Processor E5-2680v2, an Ivy Bridge implementation of the third-generation Sandy Bridge architecture. Several new architectural features have been incorporated in Haswell including improvements in all levels of the memory hierarchy as well as improvements to vector instructions and power management. We critically evaluate these new features of Haswell and compare with Ivy Bridge using several low-level benchmarks including a subset of HPCC, HPCG and four full-scale scientific and engineering applications. We also present a model to predict the performance of HPCG and Cart3D within 5%, and Overflow within 10% accuracy.
Earth Observing Scanning Polarimeter (EOSP), phase B
NASA Technical Reports Server (NTRS)
1990-01-01
Evaluations performed during a Phase B study directed towards defining an optimal design for the Earth Observing Scanning Polarimeter (EOSP) instrument are summarized. An overview of the experiment approach is included which provides a summary of the scientific objectives, the background of the measurement approach, and the measurement method. In the instrumentation section, details of the design are discussed starting with the key instrument features required to accomplish the scientific objectives and a system characterization in terms of the Stokes vector/Mueller matrix formalism. This is followed by a detailing of the instrument design concept, the design of the individual elements of the system, the predicted performance, and a summary of appropriate instrument testing and calibration. The selected design makes use of key features of predecessor polarimeters and is fully compatible with the Earth Observing System spacecraft requirements.
NASA Astrophysics Data System (ADS)
Kramer, K.; Shedd, W. W.
2017-12-01
In May, 2017, the U.S. Department of the Interior's Bureau of Ocean Energy Management (BOEM) published a high-resolution seafloor map of the northern Gulf of Mexico region. The new map, derived from 3-D seismic surveys, provides the scientific community with enhanced resolution and reveals previously undiscovered and poorly resolved geologic features of the continental slope, salt minibasin province, abyssal plain, Mississippi Fan, and the Florida Shelf and Escarpment. It becomes an even more powerful scientific tool when paired with BOEM's public database of 35,000 seafloor features, identifying natural hydrocarbon seeps, hard grounds, mud volcanoes, sediment flows, pockmarks, slumps, and many others. BOEM has mapped the Gulf of Mexico seafloor since 1998 in a regulatory mission to identify natural oil and gas seeps and protect the coral and chemosynthetic communities growing at those sites. The nineteen-year mapping effort, still ongoing, resulted in the creation of the 1.4-billion pixel map and the seafloor features database. With these tools and continual collaboration with academia, professional scientific institutions, and the offshore energy industry, BOEM will continue to incorporate new data to update and expand these two resources on a regular basis. They can be downloaded for free from BOEM's website at https://www.boem.gov/Gulf-of-Mexico-Deepwater-Bathymetry/ and https://www.boem.gov/Seismic-Water-Bottom-Anomalies-Map-Gallery/.
High performance geospatial and climate data visualization using GeoJS
NASA Astrophysics Data System (ADS)
Chaudhary, A.; Beezley, J. D.
2015-12-01
GeoJS (https://github.com/OpenGeoscience/geojs) is an open-source library developed to support interactive scientific and geospatial visualization of climate and earth science datasets in a web environment. GeoJS has a convenient application programming interface (API) that enables users to harness the fast performance of WebGL and Canvas 2D APIs with sophisticated Scalable Vector Graphics (SVG) features in a consistent and convenient manner. We started the project in response to the need for an open-source JavaScript library that can combine traditional geographic information systems (GIS) and scientific visualization on the web. Many libraries, some of which are open source, support mapping or other GIS capabilities, but lack the features required to visualize scientific and other geospatial datasets. For instance, such libraries are not capable of rendering climate plots from NetCDF files, and some libraries are limited with regard to geoinformatics (infovis in a geospatial environment). While libraries such as d3.js are extremely powerful for these kinds of plots, in order to integrate them into other GIS libraries, the construction of geoinformatics visualizations must be completed manually and separately, or the code must somehow be mixed in an unintuitive way. We developed GeoJS with the following motivations:
• To create an open-source geovisualization and GIS library that combines scientific visualization with GIS and informatics
• To develop an extensible library that can combine data from multiple sources and render them using multiple backends
• To build a library that works well with existing scientific visualization tools such as VTK
We have successfully deployed GeoJS-based applications for multiple domains across various projects. The ClimatePipes project funded by the Department of Energy, for example, used GeoJS to visualize NetCDF datasets from climate data archives. 
Other projects built visualizations using GeoJS for interactively exploring data and analysis regarding 1) the human trafficking domain, 2) New York City taxi drop-offs and pick-ups, and 3) the Ebola outbreak. GeoJS supports advanced visualization features such as picking and selecting, as well as clustering. It also supports 2D contour plots, vector plots, heat maps, and geospatial graphs.
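Point clustering for map display, mentioned among the visualization features above, is often done by binning points into grid cells and drawing one aggregate marker per cell. The Python sketch below shows that general idea only; it is not GeoJS code, it does not reflect how GeoJS implements clustering, and the coordinates are arbitrary illustrative values.

```python
from collections import defaultdict


def grid_cluster(points, cell_deg):
    """Group (lon, lat) points into square cells of `cell_deg` degrees.

    Returns one summary per non-empty cell, ordered by cell index, holding
    the mean position of the members and the member count.
    """
    cells = defaultdict(list)
    for lon, lat in points:
        # Floor division keeps negative coordinates in the correct cell.
        key = (int(lon // cell_deg), int(lat // cell_deg))
        cells[key].append((lon, lat))

    clusters = []
    for key in sorted(cells):
        members = cells[key]
        n = len(members)
        clusters.append({
            "lon": sum(p[0] for p in members) / n,
            "lat": sum(p[1] for p in members) / n,
            "count": n,
        })
    return clusters


# Arbitrary illustrative coordinates (lon, lat).
sample_points = [(-73.9, 40.7), (-73.8, 40.8), (-71.1, 42.3)]
markers = grid_cluster(sample_points, cell_deg=1.0)
```

A renderer would typically recompute the cells with a smaller cell size as the user zooms in, so that dense markers split apart into their individual points.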
Large-scale feature searches of collections of medical imagery
NASA Astrophysics Data System (ADS)
Hedgcock, Marcus W.; Karshat, Walter B.; Levitt, Tod S.; Vosky, D. N.
1993-09-01
Large scale feature searches of accumulated collections of medical imagery are required for multiple purposes, including clinical studies, administrative planning, epidemiology, teaching, quality improvement, and research. To perform a feature search of large collections of medical imagery, one can either search text descriptors of the imagery in the collection (usually the interpretation), or (if the imagery is in digital format) the imagery itself. At our institution, text interpretations of medical imagery are all available in our VA Hospital Information System. These are downloaded daily into an off-line computer. The text descriptors of most medical imagery are usually formatted as free text, and so require a user friendly database search tool to make searches quick and easy for any user to design and execute. We are tailoring such a database search tool (Liveview), developed by one of the authors (Karshat). To further facilitate search construction, we are constructing (from our accumulated interpretation data) a dictionary of medical and radiological terms and synonyms. If the imagery database is digital, the imagery which the search discovers is easily retrieved from the computer archive. We describe our database search user interface, with examples, and compare the efficacy of computer assisted imagery searches from a clinical text database with manual searches. Our initial work on direct feature searches of digital medical imagery is outlined.
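A free-text search that uses a dictionary of terms and synonyms, as described in this abstract, can be sketched as follows. The synonym dictionary, report texts, and function names are hypothetical illustrations; they are not taken from Liveview or from the authors' system, and the sketch ignores negation ("no pneumothorax"), which a clinical tool would need to handle.

```python
import re

# Hypothetical synonym dictionary keyed by a canonical lowercase term.
SYNONYMS = {
    "pneumothorax": {"pneumothorax", "collapsed lung"},
}


def search_reports(reports, term, synonyms):
    """Return sorted ids of reports mentioning `term` or any listed synonym.

    `reports` maps a report id to its free-text interpretation. Matching is
    case-insensitive and restricted to whole words or phrases.
    """
    variants = synonyms.get(term.lower(), {term.lower()})
    patterns = [
        re.compile(r"\b" + re.escape(v) + r"\b", re.IGNORECASE)
        for v in variants
    ]
    return sorted(
        rid for rid, text in reports.items()
        if any(p.search(text) for p in patterns)
    )


# Placeholder report texts for illustration only.
demo_reports = {
    1: "Small right pneumothorax.",
    2: "Findings consistent with collapsed lung on the left.",
    3: "Healed rib fracture.",
}
pneumothorax_hits = search_reports(demo_reports, "Pneumothorax", SYNONYMS)
fracture_hits = search_reports(demo_reports, "fracture", SYNONYMS)
```

Expanding the query rather than rewriting the stored reports keeps the original interpretations untouched, so the dictionary can grow without reprocessing the archive.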
NASA Astrophysics Data System (ADS)
Yang, Fang-Ying; Chang, Cheng-Chieh; Chen, Li-Ling; Chen, Yi-Chun
2016-07-01
The main purpose of this study was to explore learners' beliefs about science reading and scientific epistemic beliefs, and how these beliefs were associated with their understanding of science texts. About 400 10th graders were involved in the development and validation of the Beliefs about Science Reading Inventory (BSRI). To find the effects of reader beliefs and epistemic beliefs, a new group of 65 10th grade students whose reader and epistemic beliefs were assessed by the newly developed BSRI and an existing SEB questionnaire were invited to take part in a science reading task. Students' text understanding in terms of concept gain and text interpretations was collected and analyzed. By the correlation analysis, it was found that when students had stronger beliefs about meaning construction based on personal goals and experiences (i.e. transaction beliefs), they produced more thematic and critical interpretations of the content of the test article. The regression analysis suggested that students' SEBs could predict concept gain as a result of reading. Moreover, among all beliefs examined in the study, transaction beliefs stood out as the best predictor of overall science-text understanding.
Zhang, L; Price, R; Aweeka, F; Bellibas, S E; Sheiner, L B
2001-02-01
A small-scale clinical investigation was done to quantify the penetration of stavudine (D4T) into cerebrospinal fluid (CSF). A model-based analysis estimates the steady-state ratio of AUCs of CSF and plasma concentrations (R(AUC)) to be 0.270, and the mean residence time of drug in the CSF to be 7.04 h. The analysis illustrates the advantages of a causal (scientific, predictive) model-based approach to analysis over a noncausal (empirical, descriptive) approach when the data, as here, demonstrate certain problematic features commonly encountered in clinical data, namely (i) few subjects, (ii) sparse sampling, (iii) repeated measures, (iv) imbalance, and (v) individual design variation. These features generally require special attention in data analysis. The causal-model-based analysis deals with features (i) and (ii), both of which reduce efficiency, by combining data from different studies and adding subject-matter prior information. It deals with features (iii)-(v), all of which prevent 'averaging' individual data points directly, first, by adjusting in the model for interindividual data differences due to design differences, secondly, by explicitly differentiating between interpatient, interoccasion, and measurement error variation, and lastly, by defining a scientifically meaningful estimand (R(AUC)) that is independent of design.
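The estimand named in the abstract can be written out as a worked definition. This is a sketch of the conventional meaning of an AUC ratio, not the authors' model; the symbols C_CSF, C_plasma, and the dosing interval τ are notation assumed here for illustration.

```latex
% Assumed notation: C_CSF(t) and C_plasma(t) are steady-state
% concentration-time profiles over one dosing interval tau.
\[
  R_{\mathrm{AUC}}
  = \frac{\mathrm{AUC}_{\mathrm{CSF}}}{\mathrm{AUC}_{\mathrm{plasma}}}
  = \frac{\int_{0}^{\tau} C_{\mathrm{CSF}}(t)\,dt}
         {\int_{0}^{\tau} C_{\mathrm{plasma}}(t)\,dt}
\]
```

Read this way, the reported estimate of 0.270 would mean that steady-state drug exposure in CSF is roughly 27% of the exposure in plasma.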
Recommendations for the use of notebooks in upper-division physics lab courses
NASA Astrophysics Data System (ADS)
Stanley, Jacob T.; Lewandowski, H. J.
2018-01-01
The use of lab notebooks for scientific documentation is a ubiquitous part of physics research. However, it is common for undergraduate physics laboratory courses not to emphasize the development of documentation skills, despite the fact that such courses are some of the earliest opportunities for students to start engaging in this practice. One potential impediment to the inclusion of explicit documentation training is that it may be unclear to instructors which features of authentic documentation practice are efficacious to teach and how to incorporate these features into the lab class environment. In this work, we outline some of the salient features of authentic documentation, informed by interviews with physics researchers, and provide recommendations for how these can be incorporated into the lab curriculum. We do not focus on structural details or templates for notebooks. Instead, we address holistic considerations for the purpose of scientific documentation that can guide students to develop their own documentation style. While taking into consideration all the aspects that can help improve students' documentation, it is also important to consider the design of the lab activities themselves. Students should have experience with implementing these authentic features of documentation during lab activities in order for them to find practice with documentation beneficial.
Streambeds Merit Recognition as a Scientific Discipline
NASA Astrophysics Data System (ADS)
Constantz, J. E.
2016-12-01
Streambeds are generally viewed as simply sediments beneath streams, sediments topping alluvial aquifers, or sediments housing aquatic life, rather than as distinct geographic features comparable to soils and surficial geologic formations within watersheds. Streambeds should be viewed as distinct elements within watersheds, e.g., as akin to soils. In this presentation, streambeds are described as central features in watersheds, cycling water between the surface and underlying portions of the watershed. Regarding their kinship to soils, soils are often described as surficial sediments largely created by atmospheric weathering of underlying geologic parent material, and similarly, streambeds should be described as submerged sediments largely created by streamflow modification of underlying geologic parent material. Thus, streambeds are clearly overdue for recognition as their own scientific discipline alongside other well-recognized disciplines within watersheds. Progress in this direction has been slowed, however, by the often-made point that hyporheic zones should be considered comparable to streambeds, which is as misguided as equating unsaturated zones to soils. Streambeds and soils are physical geographic features of relatively constant volume, while hyporheic and unsaturated zones are hydrologic features of varying volume. As expanded upon in this presentation, 'Streambed Science' is proposed as the name for this discipline, which will require both a well-designed protocol to physically characterize streambeds and the development of a streambed taxonomy for suitable recognition as an independent discipline within watersheds.
Tbahriti, Imad; Chichester, Christine; Lisacek, Frédérique; Ruch, Patrick
2006-06-01
The aim of this study is to investigate the relationships between citations and the scientific argumentation found in abstracts. We design a related article search task and observe how the argumentation can affect the search results. We extracted citation lists from a set of 3200 full-text papers originating from a narrow domain. In parallel, we recovered the corresponding MEDLINE records for analysis of the argumentative moves. Our argumentative model is founded on four classes: PURPOSE, METHODS, RESULTS and CONCLUSION. A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates these argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. To appraise the relationship of the moves to the citations, the citation lists were used as the criteria for determining relatedness of articles, establishing a benchmark; that is, two articles are considered "related" if they share a significant set of co-citations. Our results show that the average precision of queries with the PURPOSE and CONCLUSION features is the highest, while the precision of the RESULTS and METHODS features was relatively low. A linear weighting combination of the moves is proposed, which significantly improves retrieval of related articles.
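The linear weighting of argumentative moves can be illustrated with a minimal sketch. The document identifiers, per-move scores, weights, and function names below are hypothetical and not taken from the study. The sketch only shows how per-index retrieval scores might be combined linearly and how average precision could be computed against a set of related articles.

```python
from typing import Dict, List, Set

# The four argumentative classes named in the abstract.
MOVES = ("PURPOSE", "METHODS", "RESULTS", "CONCLUSION")


def combine_scores(
    per_move_scores: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> Dict[str, float]:
    """Linearly combine per-move retrieval scores into one score per document."""
    combined: Dict[str, float] = {}
    for move in MOVES:
        weight = weights.get(move, 0.0)
        for doc_id, score in per_move_scores.get(move, {}).items():
            combined[doc_id] = combined.get(doc_id, 0.0) + weight * score
    return combined


def rank(scores: Dict[str, float]) -> List[str]:
    """Order documents by descending score, breaking ties by identifier."""
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


def average_precision(ranking: List[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant document appears."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for position, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / position
    return total / len(relevant)


# Hypothetical scores for three candidate articles against one query.
example_scores = {
    "PURPOSE": {"a": 0.9, "b": 0.2, "c": 0.1},
    "METHODS": {"a": 0.1, "b": 0.8, "c": 0.3},
    "RESULTS": {"a": 0.2, "b": 0.7, "c": 0.2},
    "CONCLUSION": {"a": 0.8, "b": 0.1, "c": 0.6},
}
# Hypothetical weights that favour PURPOSE and CONCLUSION.
example_weights = {"PURPOSE": 0.4, "METHODS": 0.1, "RESULTS": 0.1, "CONCLUSION": 0.4}

example_ranking = rank(combine_scores(example_scores, example_weights))
```

In practice the weights would be tuned on held-out queries. The values here are chosen only to mirror the abstract's observation that PURPOSE and CONCLUSION were the most precise features.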
Zhang, Ming-Huan; Ma, Jun-Shan; Shen, Ying; Chen, Ying
2016-09-01
This study aimed to investigate the optimal support vector machines (SVM)-based classifier of Duchenne muscular dystrophy (DMD) magnetic resonance imaging (MRI) images. T1-weighted (T1W) and T2-weighted (T2W) images of the 15 boys with DMD and 15 normal controls were obtained. Textural features of the images were extracted and wavelet decomposed, and then, principal features were selected. Scale transform was then performed for MRI images. Afterward, SVM-based classifiers of MRI images were analyzed based on the radial basis function and decomposition levels. The cost (C) parameter and kernel parameter [Formula: see text] were used for classification. Then, the optimal SVM-based classifier, expressed as [Formula: see text]), was identified by performance evaluation (sensitivity, specificity and accuracy). Eight of 12 textural features were selected as principal features (eigenvalues [Formula: see text]). The 16 SVM-based classifiers were obtained using combination of (C, [Formula: see text]), and those with lower C and [Formula: see text] values showed higher performances, especially classifier of [Formula: see text]). The SVM-based classifiers of T1W images showed higher performance than T2W images at the same decomposition level. The T1W images in classifier of [Formula: see text]) at level 2 decomposition showed the highest performance of all, and its overall correct sensitivity, specificity, and accuracy reached 96.9, 97.3, and 97.1 %, respectively. The T1W images in SVM-based classifier [Formula: see text] at level 2 decomposition showed the highest performance of all, demonstrating that it was the optimal classification for the diagnosis of DMD.
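The three performance measures named in the abstract follow standard definitions, sketched below. The function name and the confusion-matrix counts are hypothetical and chosen only for illustration; they are not the study's data.

```python
from typing import Tuple


def diagnostic_metrics(tp: int, fn: int, tn: int, fp: int) -> Tuple[float, float, float]:
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp/fn count patients classified correctly/incorrectly;
    tn/fp count controls classified correctly/incorrectly.
    """
    positives = tp + fn
    negatives = tn + fp
    total = positives + negatives
    # Guard against empty classes so the function never divides by zero.
    sensitivity = tp / positives if positives else 0.0
    specificity = tn / negatives if negatives else 0.0
    accuracy = (tp + tn) / total if total else 0.0
    return sensitivity, specificity, accuracy


# Hypothetical counts for 15 patients and 15 controls.
sens, spec, acc = diagnostic_metrics(tp=14, fn=1, tn=13, fp=2)
```

With these illustrative counts, sensitivity is 14/15, specificity is 13/15, and accuracy is 27/30.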
Early Readers and Electronic Texts: CD-ROM Storybook Features That Influence Reading Behaviors
ERIC Educational Resources Information Center
Lefever-Davis, Shirley; Pearman, Cathy
2005-01-01
This research explores the impact of CD-ROM storybook features on the reading behaviors of 6- and 7-year-old students with limited exposure to CD-ROM storybooks. Six categories of behaviors were identified: tracking, electronic feature dependency, distractibility, spectator stance, electronic feature limitations, and electronic features as tools.…
NASA Astrophysics Data System (ADS)
Neiles, Kelly Y.
There is great concern in the scientific community that students in the United States, when compared with other countries, are falling behind in their scientific achievement. Increasing students' reading comprehension of scientific text may be one of the components involved in students' science achievement. To investigate students' reading comprehension this quantitative study examined the effects of different reader characteristics, namely, students' logical reasoning ability, factual chemistry knowledge, working memory capacity, and schema of the chemistry concepts, on reading comprehension of a chemistry text. Students' reading comprehension was measured through their ability to encode the text, access the meanings of words (lexical access), make bridging and elaborative inferences, and integrate the text with their existing schemas to make a lasting mental representation of the text (situational model). Students completed a series of tasks that measured the reader characteristic and reading comprehension variables. Some of the variables were measured using new technologies and software to investigate different cognitive processes. These technologies and software included eye tracking to investigate students' lexical accessing and a Pathfinder program to investigate students' schema of the chemistry concepts. The results from this study were analyzed using canonical correlation and regression analysis. The canonical correlation analysis allows for the ten variables described previously to be included in one multivariate analysis. Results indicate that the relationship between the reader characteristic variables and the reading comprehension variables is significant. The resulting canonical function accounts for a greater amount of variance in students' responses than any individual variable.
Regression analysis was used to further investigate which reader characteristic variables accounted for the differences in students' responses for each reading comprehension variable. The results from this regression analysis indicated that the two schema measures (measured by the Pathfinder program) accounted for the greatest amount of variance in four of the reading comprehension variables (encoding the text, bridging and elaborative inferences, and delayed recall of a general summary). This research suggests that providing students with background information on chemistry concepts prior to having them read the text may result in better understanding and more effective incorporation of the chemistry concepts into their schema.
Blikstad‐Balas, Marte
2017-01-01
All scientists depend on both reading and writing to do their scientific work. It is of paramount importance to ensure that students have a relevant repertoire of practices they can employ when facing scientific content inside and outside the school context. The present study reports on students in seventh grade acting as researchers. Over an 8‐week collaborative research period, students posed their own research question, attempted to answer it by systematically testing hypotheses, discussing findings, presenting their conclusions, and documenting their process in a written report. Drawing on the perspectives of New Literacy Studies, which sees literacy as socially situated, we analyze the purpose of all the 21 participating students’ texts (n = 344). Video observations and interviews with students are used to contextualize the writing events. We find that the students chose to write multiple kinds of texts for a variety of purposes. Analyzing purpose and context, three stages of socialization into scientific writing are revealed, ranging from what the students write on their own initiative, via texts written through challenges, to demanding research tasks scaffolded through writing instructions given by the teacher. Further, the students emphasized the relevance of both the research experience and the writing to their future adult life. PMID:29540938
Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
Dai, Jin; Liu, Xin
2014-01-01
The similarity between objects is a core research area of data mining. In order to reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently accomplish conversion between qualitative concepts and quantitative data. Through the conversion from text set to text information table based on the VSM model, the qualitative text concepts extracted from the same category are jumped up into a whole-category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. Comparison among different text classifiers on different feature selection sets shows that CCJU-TC not only adapts well to different text features but also achieves better classification performance than traditional classifiers. PMID:24711737
Management of scientific information with Google Drive.
Kubaszewski, Łukasz; Kaczmarczyk, Jacek; Nowakowski, Andrzej
2013-09-20
The amount and diversity of scientific publications requires a modern management system. By "management" we mean the process of gathering interesting information for the purpose of reading and archiving for quick access in future clinical practice and research activity. In the past, such a system required the physical existence of a library, either institutional or private. Nowadays, in an era dominated by electronic information, it is natural to migrate entire systems to a digital form. In the following paper we describe the structure and functions of an individual electronic library system (IELiS) for the management of scientific publications based on the Google Drive service. Architecture of the system. The system architecture consists of a central element and peripheral devices. The central element of the system is the virtual Google Drive provided by Google Inc. Physical elements of the system include a tablet with the Android operating system and a personal computer, both with internet access. Required software includes a program to view and edit files in PDF format for mobile devices and another to synchronize the files. Functioning of the system. The first step in creating the system is the collection of scientific papers in PDF format and their analysis. This step is performed most frequently on a tablet. At this stage, after being read, the papers are cataloged in a system of folders and subfolders, according to individual demands. During this stage, but not exclusively, the PDF files are annotated by the reader. This allows the user to quickly track down interesting information in the review or research process. Modification of the document title is performed at this stage as well. The second element of the system is the creation of a mirror database in the Google Drive virtual memory. Modified and cataloged papers are synchronized with Google Drive. At this stage, a fully functional scientific information electronic library becomes available online.
The third element of the system is a periodic two-way synchronization of data between Google Drive and tablet, as occasional modification of the files with annotation or recataloging may be performed at both locations. The system architecture is designed to gather, catalog and analyze scientific publications. All steps are electronic, eliminating paper forms. Indexed files are available for re-reading and modification. The system allows for fast access to full-text search with additional features making research easier. Team collaboration is also possible with full control of user privileges. Particularly important is the safety of collected data. In our opinion, the system exceeds many commercially available applications in terms of functionality and versatility.
Shared Features of L2 Writing: Intergroup Homogeneity and Text Classification
ERIC Educational Resources Information Center
Crossley, Scott A.; McNamara, Danielle S.
2011-01-01
This study investigates intergroup homogeneity within high intermediate and advanced L2 writers of English from Czech, Finnish, German, and Spanish first language backgrounds. A variety of linguistic features related to lexical sophistication, syntactic complexity, and cohesion were used to compare texts written by L1 speakers of English to L2…
Developing an Approach for Comparing Students' Multimodal Text Creations: A Case Study
ERIC Educational Resources Information Center
Levy, Mike; Kimber, Kay
2009-01-01
Classroom teachers routinely make judgments on the quality of their students' work based on their recognition of how effectively the student has assembled key features of the genre or the medium. Yet how readily can teachers talk about the features of student-created multimodal texts in ways that can improve learning and performance? This article…
An Analysis of English Business Letters from the Perspective of Interpersonal Function
ERIC Educational Resources Information Center
Xu, Bo
2012-01-01
The purpose of the present study is to find out the features of English business letters. Halliday's systemic functional linguistics is used as the theoretical framework, mainly its interpersonal function. The English business letter (EBL) is an important written text used for international business communication and it has its own features of text.…
Estimating Missing Features to Improve Multimedia Information Retrieval
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bagherjeiran, A; Love, N S; Kamath, C
Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features. In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.
OntoMate: a text-mining tool aiding curation at the Rat Genome Database
Liu, Weisong; Laulederkind, Stanley J. F.; Hayman, G. Thomas; Wang, Shur-Jen; Nigam, Rajni; Smith, Jennifer R.; De Pons, Jeff; Dwinell, Melinda R.; Shimoyama, Mary
2015-01-01
The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu PMID:25619558
NASA Astrophysics Data System (ADS)
Cavalli-Sforza, Violetta Laura Maria
Students in science classes hardly ever study scientific controversy, especially in terms of the different types of arguments used to support and criticize theories and hypotheses. Yet, learning the reasons for scientific debate and scientific change is an important part of appreciating the nature of the scientific enterprise and communicating it to the non-scientific world. This dissertation explores the usefulness of graphical representations in teaching students about scientific arguments. Subjects participating in an extended experiment studied instructional materials and used the Belvedere graphical interface to analyze texts drawn from an actual scientific debate. In one experimental condition, subjects used a box-and-arrow representation whose primitive graphical elements had preassigned meanings tailored to the domain of instruction. In the other experimental condition, subjects could use the graphical elements as they wished, thereby creating their own representation. The development of a representation, by forcing a deeper analysis, can potentially yield a greater understanding of the domain under study. The results of the research suggest two conclusions. From the perspective of learning target concepts, asking subjects to develop their own representation may not hurt those subjects who gain a sufficient understanding of the possibilities of abstract representation. The risks are much greater for less able subjects because, if they develop a representation that is inadequate for expressing the target concepts, they will use those concepts less or not at all. From the perspective of coaching subjects as they diagram their analysis of texts, a predefined representation has significant advantages. If it is appropriately expressive for the task, it provides a common language and clearer shared meaning between the subject and the coach. 
It also enables the coach to understand subjects' analysis more easily, and to evaluate it more effectively against the coach's own model of the ideal analysis.
Scientific Literacy in Food Education: Gardening and Cooking in School
NASA Astrophysics Data System (ADS)
Strohl, Carrie A.
Recent attention to socio-scientific issues such as sustainable agriculture, environmental responsibility and nutritional health has spurred a resurgence of public interest in gardening and cooking. Seen as contexts for fostering scientific literacy (the knowledge domains, methodological approaches, habits of mind and discourse practices that reflect one's understanding of the role of science in society), gardening and cooking are under-examined fields in science education, in part, because they are under-utilized pedagogies in school settings. Although learning gardens were used historically to foster many aspects of scientific literacy (e.g., cognitive knowledge, norms and methods of science, attitudes toward science and discourse of science), analysis of contemporary studies suggests that science learning in gardens focuses mainly on science knowledge alone. Using multiple conceptions of scientific literacy, I analyzed qualitative data to demonstrate how exploration, talk and text fostered scientific literacy in a school garden. Exploration prompted students to engage in scientific practices such as making observations and constructing explanations from evidence. Talk and text provided background knowledge and accurate information about agricultural, environmental and nutritional topics under study. Using a similar qualitative approach, I present a case study of a third grade teacher who explicitly taught food literacy through culinary arts instruction. Drawing on numerous contextual resources, this teacher created a classroom community of food practice through hands-on cooking lessons, guest chef demonstrations, and school-wide tasting events. As a result, she promoted six different types of knowledge (conceptual, procedural, dispositional, sensory, social, and communal) through leveraging contextual resources.
This case study highlights how food literacy is largely contingent on often-overlooked mediators of food literacy: the relationships between participants, the activity, and the type of knowledge invoked. Scientific literacy in food education continues to be a topic of interest in the fields of public health and of sustainable agriculture, as well as to proponents of the local food movement. This dissertation begins to map a more cohesive and comprehensive approach to gardening and cooking implementation and research in school settings.
Sarker, Abeed; Gonzalez, Graciela
2015-02-01
Automatic detection of adverse drug reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media, where enormous amounts of user-posted data are available that have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing (NLP) approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user-posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies. One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies. Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively. Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks.
Our experiments illustrate the benefits of incorporating various semantic features such as topics, concepts, sentiments, and polarities. Finally, we show that integration of information from compatible corpora can significantly improve classification performance. This form of multi-corpus training may be particularly useful in cases where data sets are heavily imbalanced (e.g., social media data), and may reduce the time and costs associated with the annotation of data in the future. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Portable Automatic Text Classification for Adverse Drug Reaction Detection via Multi-corpus Training
Gonzalez, Graciela
2014-01-01
Objective Automatic detection of Adverse Drug Reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media, where enormous amounts of user-posted data are available that have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user-posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies. Methods One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies. Results Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively.
Conclusions Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks. Our experiments illustrate the benefits of incorporating various semantic features such as topics, concepts, sentiments, and polarities. Finally, we show that integration of information from compatible corpora can significantly improve classification performance. This form of multi-corpus training may be particularly useful in cases where data sets are heavily imbalanced (e.g., social media data), and may reduce the time and costs associated with the annotation of data in the future. PMID:25451103
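The F-scores reported above follow the standard definition as the harmonic mean of precision and recall, which can be sketched as follows. The function name and the counts in the example are hypothetical and are not the study's data. The "units" of improvement quoted in the abstract appear to be hundredths of F-score, since 0.597 - 0.538 = 0.059 and 0.704 - 0.678 = 0.026.

```python
def f_score(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """Return the F-beta score for one class from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    beta_sq = beta * beta
    # With beta = 1 this reduces to the harmonic mean of precision and recall.
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical counts for the ADR class: 8 correct detections,
# 2 false alarms, 2 missed mentions.
example_f1 = f_score(tp=8, fp=2, fn=2)
```

Because the ADR class is typically the minority class in social media data, a class-specific F-score of this kind is more informative than overall accuracy.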
ERIC Educational Resources Information Center
Dinsmore, Daniel L.; Zoellner, Brian P.; Parkinson, Meghan M.; Rossi, Anthony M.; Monk, Mary J.; Vinnachi, Jenelle
2017-01-01
View change about socio-scientific issues has been well studied in the literature, but the change in the complexity of those views has not. In the current study, the change in the complexity of views about a specific scientific topic (i.e. genetically modified organisms; GMOs) and use of evidence in explaining those views was examined in relation…
Toward a Social Practice Perspective on the Work of Reading Inscriptions in Science Texts
ERIC Educational Resources Information Center
Pozzer-Ardenghi, Lilian; Roth, Wolff-Michael
2010-01-01
In the social studies of science, visuals and graphical representations are theorized by means of the concept of inscription, a term that denotes all representations other than text inscribed in some medium including graphs, tables, photographs, and equations. Inscriptions constitute an intrinsic and integral part of scientific practice; their…
ERIC Educational Resources Information Center
Scharrer, Lisa; Britt, M. Anne; Stadtler, Marc; Bromme, Rainer
2013-01-01
Well-educated laypeople tend to rely on their own ability to evaluate scientific claims when they obtain information from texts with high comprehensibility. The present study investigated whether controversial content reduces this facilitating effect of high text comprehensibility on readers' self-reliance and whether the influence of…
ERIC Educational Resources Information Center
English, Fenwick W.
2002-01-01
Argues that the uncritical citation of Stephen R. Covey's book, "The 7 Habits of Highly Effective People," in educational administration texts undermines the social-scientific foundation of university-based administrator preparation. Asserts that Covey's book is based on Mormon metaphysics, not social science. (Contains 41…
ERIC Educational Resources Information Center
Zaslawsky, D.
1976-01-01
Attempts to determine to what extent a linguistic analysis can contribute to an epistemological study of scientific texts. Considerations are also given on the methodological role of argumentation in linguistic science. (Text is in French.) (CDSH/AM)
Exploring the Boundary Conditions of the Redundancy Principle
ERIC Educational Resources Information Center
McCrudden, Matthew T.; Hushman, Carolyn J.; Marley, Scott C.
2014-01-01
This experiment investigated whether study of a scientific text and a visual display that contained redundant text segments would affect memory and transfer. The authors randomly assigned 42 students from a university in the southwestern United States in equal numbers to 1 of 2 conditions: (a) a redundant condition, in which participants studied a…
ERIC Educational Resources Information Center
Tekkaya, Ceren
2003-01-01
Investigates the effectiveness of combining conceptual change text and concept mapping strategies on students' understanding of diffusion and osmosis. Results indicate that while the average percentage of students in the experimental group holding a scientifically correct view rose, the percentage of correct responses in the control group…
Expert-Novice Differences in Memory, Abstraction, and Reasoning in the Domain of Literature.
ERIC Educational Resources Information Center
Zeitz, Colleen M.
1994-01-01
Explored the information processing abilities associated with expertise in literature in high school and college students. Found that literary experts were superior to novices in gist-level recall, extraction of interpretations, and breadth of aspects addressed of literary texts but not in comprehension of scientific texts. (AA)
Tantra yukti method of theorization in ayurveda.
Singh, Anuradha
2003-01-01
The method of theorization (Tantra Yukti-s given in Ayurvedic texts) is analyzed against the backdrop of the scientific method. Thirty-six methodic devices are singled out from the texts for analysis in terms of truth-specific, theory-specific and discourse-specific issues. The paper also points out exact problems in the conception of method in Ayurveda and Science.
Teaching the History of Technical Communication: A Lesson with Franklin and Hoover.
ERIC Educational Resources Information Center
Todd, Jeff
2003-01-01
Provides and defends four guidelines as a foundation to study ways to incorporate history into classroom lessons: maintain a continued research interest in teaching history; limit to technical rather than scientific discourse; focus on English-language texts; and focus on American texts, authors, and practices. Works within the guidelines to show…
A Feature Selection Method Based on Fisher's Discriminant Ratio for Text Sentiment Classification
NASA Astrophysics Data System (ADS)
Wang, Suge; Li, Deyu; Wei, Yingjie; Li, Hongxia
With the rapid growth of e-commerce, product reviews on the Web have become an important information source for customers deciding whether to buy a product. Because there are often too many reviews for a customer to read, automatically classifying them by sentiment orientation (i.e., positive or negative) has become a research problem. In this paper, an effective feature selection method based on Fisher's discriminant ratio is proposed for sentiment classification of product review text. To validate the proposed method, we compared it with methods based on information gain and mutual information, using a support vector machine as the classifier. Six subexperiments were conducted by combining different feature selection methods with 2 kinds of candidate feature sets. On 1006 car review documents, the experimental results indicate that Fisher's discriminant ratio based on word frequency estimation performs best, with an F value of 83.3%, when the candidate features are the words that appear in both positive and negative texts.
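As a rough illustration of the idea, the sketch below ranks words by one common form of Fisher's discriminant ratio, the squared difference of class means divided by the sum of class variances, computed over per-document term frequencies. The abstract does not give the exact estimator, so this form, the helper names, and the toy reviews are all assumptions.

```python
from statistics import mean, pvariance


def fisher_ratio(pos_values, neg_values, eps=1e-12):
    """(mean_pos - mean_neg)^2 / (var_pos + var_neg) for one feature.

    `eps` only guards against division by zero when both variances vanish.
    """
    numerator = (mean(pos_values) - mean(neg_values)) ** 2
    denominator = pvariance(pos_values) + pvariance(neg_values)
    return numerator / (denominator + eps)


def rank_features(pos_docs, neg_docs, vocabulary):
    """Sort words by Fisher ratio of their per-document term frequency."""
    scores = {}
    for word in vocabulary:
        pos_tf = [doc.count(word) for doc in pos_docs]
        neg_tf = [doc.count(word) for doc in neg_docs]
        scores[word] = fisher_ratio(pos_tf, neg_tf)
    return sorted(scores, key=scores.get, reverse=True)


# Tokenized toy reviews, invented for illustration only.
pos_docs = [["great", "car", "great"], ["smooth", "car"]]
neg_docs = [["noisy", "car"], ["car", "poor", "noisy", "noisy"]]
ranking = rank_features(pos_docs, neg_docs, ["great", "car", "noisy"])
```

A word such as "car", which appears equally often in both classes, scores zero and sinks to the bottom of the ranking, which is the behavior a discriminative feature selector is meant to have.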
Relating interesting quantitative time series patterns with text events and text features
NASA Astrophysics Data System (ADS)
Wanner, Franz; Schreck, Tobias; Jentner, Wolfgang; Sharalieva, Lyubka; Keim, Daniel A.
2013-12-01
In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis work flow. 
The workflow can also be generalized to other application domains such as data analysis of smart grids, cyber physical systems or the security of critical infrastructure, where the data consists of a combination of quantitative and textual time series data.
Teaching Radiology Physics Interactively with Scientific Notebook Software.
Richardson, Michael L; Amini, Behrang
2018-06-01
The goal of this study is to demonstrate how the teaching of radiology physics can be enhanced with the use of interactive scientific notebook software. We used the scientific notebook software known as Project Jupyter, which is free, open-source, and available for the Macintosh, Windows, and Linux operating systems. We have created a scientific notebook that demonstrates multiple interactive teaching modules we have written for our residents using the Jupyter notebook system. Scientific notebook software allows educators to create teaching modules in a form that combines text, graphics, images, data, interactive calculations, and image analysis within a single document. These notebooks can be used to build interactive teaching modules, which can help explain complex topics in imaging physics to residents. Copyright © 2018 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.
Latest NIH Research | NIH MedlinePlus the Magazine
Feature: Quit Smoking Latest NIH Research Past Issues / Winter 2011 Table ... with chest X-rays. Clinical Trials Related to Smoking Clinical trials are scientific studies that try to ...
ERIC Educational Resources Information Center
Risley, John, Ed.
1988-01-01
Compares the features of the sonic rangers available from HRM Software, MICROMEASUREMENTS, NAGAWTIS Software Research, and PASCO Scientific for demonstrations and experiments in mechanics. Presents the advantages of the sonic rangers and the typical graphics displayed by each software package. (YP)
Rachmaninoff in Concert with Recently Named Craters on Mercury
2010-03-30
This NASA MESSENGER 3rd Mercury flyby image was quickly identified as a feature of high scientific interest, because of its fresh appearance, its distinctively colored interior plains, and extensional troughs on its floor.
Minayo, Maria Cecília de Souza; Gomes, Romeu
2015-07-01
The article discusses the role of the Ciência & Saúde Coletiva Journal in the dissemination of knowledge in Brazil and in the international scientific community, its new challenges and role in the consolidation of the national public health field. Its history is outlined, positioning it as a scientific journal and the themes approached in it are analyzed. Among the findings, it is emphasized that the journal features a structured space by the habitus of public health, and creates its own habitus that contributes to structure this field. In addition, the journal contributes to the development of critical mass in the area and is committed to the Brazilian Public Health System.
Mahadevan, Anand; Bucholz, Richard; Gaya, Andrew M; Kresl, John J; Mantz, Constantine; Minnich, Douglas J; Muacevic, Alexander; Medbery, Clinton; Yang, Jun; Caglar, Hale Basak; Davis, Joanne N
2014-12-01
The SRS/SBRT Scientific Meeting 2014, Minneapolis, MN, USA, 7-10 May 2014. The Radiosurgery Society(®), a professional medical society dedicated to advancing the field of stereotactic radiosurgery (SRS) and stereotactic body radiotherapy (SBRT), held the international Radiosurgery Society Scientific Meeting from 7-10 May 2014 in Minneapolis (MN, USA). This year's conference attracted over 400 attendees from around the world and featured over 100 presentations (46 oral) describing the role of SRS/SBRT for the treatment of intracranial and extracranial malignant and nonmalignant lesions. This article summarizes the meeting highlights for SRS/SBRT treatments, both intracranial and extracranial, in a concise review.
Conceptual Tools for Understanding Nature - Proceedings of the 3rd International Symposium
NASA Astrophysics Data System (ADS)
Costa, G.; Calucci, M.
1997-04-01
The Table of Contents for the full book PDF is as follows: * Foreword * Some Limits of Science and Scientists * Three Limits of Scientific Knowledge * On Features and Meaning of Scientific Knowledge * How Science Approaches the World: Risky Truths versus Misleading Certitudes * On Discovery and Justification * Thought Experiments: A Philosophical Analysis * Causality: Epistemological Questions and Cognitive Answers * Scientific Inquiry via Rational Hypothesis Revision * Probabilistic Epistemology * The Transferable Belief Model for Uncertainty Representation * Chemistry and Complexity * The Difficult Epistemology of Medicine * Epidemiology, Causality and Medical Anthropology * Conceptual Tools for Transdisciplinary Unified Theory * Evolution and Learning in Economic Organizations * The Possible Role of Symmetry in Physics and Cosmology * Observational Cosmology and/or other Imaginable Models of the Universe
Full-Text Databases in Medicine.
ERIC Educational Resources Information Center
Sievert, MaryEllen C.; And Others
1995-01-01
Describes types of full-text databases in medicine; discusses features for searching full-text journal databases available through online vendors; reviews research on full-text databases in medicine; and describes the MEDLINE/Full-Text Research Project at the University of Missouri (Columbia) which investigated precision, recall, and relevancy.…
Latch fittings for the scientific instruments on the space telescope
NASA Technical Reports Server (NTRS)
Dozier, J. D.; Kaelber, E.
1983-01-01
Latch fittings which kinematically mount the replaceable scientific instruments onto the Space Telescope must maintain precise alignment and thermal stability for on-orbit observations. Design features which are needed to meet stringent criteria include the use of ceramic isolators for thermal and electrical insulation, materials with different coefficients of thermal expansion for athermalization, precision manufacturing procedures, and extremely tight tolerances. A specific latch fitting to be discussed is a ball-and-socket design. In addition, testing, crew aids, and problems will be covered.
NASA Technical Reports Server (NTRS)
Mathur, F. P.
1972-01-01
Several common higher-level programming languages are described. FORTRAN, ALGOL, COBOL, PL/1, and LISP 1.5 are summarized and compared. FORTRAN is the most widely used scientific programming language. ALGOL is a more powerful language for scientific programming. COBOL is used for most commercial programming applications. LISP 1.5 is primarily a list-processing language. PL/1 attempts to combine the desirable features of FORTRAN, ALGOL, and COBOL into a single language.
MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harrison, Robert J.; Beylkin, Gregory; Bischoff, Florian A.
2016-01-01
MADNESS (multiresolution adaptive numerical environment for scientific simulation) is a high-level software environment for solving integral and differential equations in many dimensions that uses adaptive and fast harmonic analysis methods with guaranteed precision based on multiresolution analysis and separated representations. Underpinning the numerical capabilities is a powerful petascale parallel programming environment that aims to increase both programmer productivity and code scalability. This paper describes the features and capabilities of MADNESS and briefly discusses some current applications in chemistry and several areas of physics.
Colen, Rivka; Foster, Ian; Gatenby, Robert; Giger, Mary Ellen; Gillies, Robert; Gutman, David; Heller, Matthew; Jain, Rajan; Madabhushi, Anant; Madhavan, Subha; Napel, Sandy; Rao, Arvind; Saltz, Joel; Tatum, James; Verhaak, Roeland; Whitman, Gary
2014-10-01
The National Cancer Institute (NCI) Cancer Imaging Program organized two related workshops on June 26-27, 2013, entitled "Correlating Imaging Phenotypes with Genomics Signatures Research" and "Scalable Computational Resources as Required for Imaging-Genomics Decision Support Systems." The first workshop focused on clinical and scientific requirements, exploring our knowledge of phenotypic characteristics of cancer biological properties to determine whether the field is sufficiently advanced to correlate with imaging phenotypes that underpin genomics and clinical outcomes, and exploring new scientific methods to extract phenotypic features from medical images and relate them to genomics analyses. The second workshop focused on computational methods that explore informatics and computational requirements to extract phenotypic features from medical images and relate them to genomics analyses and improve the accessibility and speed of dissemination of existing NIH resources. These workshops linked clinical and scientific requirements of currently known phenotypic and genotypic cancer biology characteristics with imaging phenotypes that underpin genomics and clinical outcomes. The group generated a set of recommendations to NCI leadership and the research community that encourage and support development of the emerging radiogenomics research field to address short- and longer-term goals in cancer research.
Yu, Sheng; Liao, Katherine P; Shaw, Stanley Y; Gainer, Vivian S; Churchill, Susanne E; Szolovits, Peter; Murphy, Shawn N; Kohane, Isaac S; Cai, Tianxi
2015-09-01
Analysis of narrative (text) data from electronic health records (EHRs) can improve population-scale phenotyping for clinical and genetic research. Currently, selection of text features for phenotyping algorithms is slow and laborious, requiring extensive and iterative involvement by domain experts. This paper introduces a method to develop phenotyping algorithms in an unbiased manner by automatically extracting and selecting informative features, which can be comparable to expert-curated ones in classification accuracy. Comprehensive medical concepts were collected from publicly available knowledge sources in an automated, unbiased fashion. Natural language processing (NLP) revealed the occurrence patterns of these concepts in EHR narrative notes, which enabled selection of informative features for phenotype classification. These features were combined with additional codified features to train a penalized logistic regression model that classifies the target phenotype. The authors applied the method to develop algorithms to identify patients with rheumatoid arthritis (RA) and cases of coronary artery disease (CAD) among those with RA in a large multi-institutional EHR. The areas under the receiver operating characteristic curve (AUCs) for classifying RA and CAD using models trained with automated features were 0.951 and 0.929, respectively, compared to the AUCs of 0.938 and 0.929 by models trained with expert-curated features. Models trained with NLP text features selected through an unbiased, automated procedure achieved comparable or slightly higher accuracy than those trained with expert-curated features. The majority of the selected model features were interpretable. The proposed automated feature extraction method, generating highly accurate phenotyping algorithms with improved efficiency, is a significant step toward high-throughput phenotyping. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association.
All rights reserved. For Permissions, please email: journals.permissions@oup.com.
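The AUC values quoted in this record can be read as the probability that a randomly chosen case receives a higher model score than a randomly chosen non-case. A minimal sketch of that pairwise definition follows; it is illustrative only, and the function name `auc` and the toy scores are assumptions rather than anything taken from the paper.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive example has the higher score; ties count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy labels and scores, invented for illustration only.
example_auc = auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

The pairwise form is quadratic in the number of examples, so it suits small checks like this one; rank-based formulas give the same value more efficiently on large cohorts.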
Segmental Rescoring in Text Recognition
2014-02-04
description relates to rescoring text hypotheses in text recognition based on segmental features. Offline printed text and handwriting recognition (OHR) can... Handwriting, College Park, Md., 2006, which is incorporated by reference here. For the set of training images 202, a character modeler 208 receives
Brkić, Silvija
2013-01-01
Scientific and professional papers represent the information basis for scientific research and professional work. References important for the paper should be cited within the text, and listed at the end of the paper. This paper deals with different styles of reference citation. Special emphasis was placed on the Vancouver Style for reference citation in biomedical journals established by the International Committee of Medical Journal Editors. It includes original samples for citing various types of articles, both printed and electronic, as well as recommendations related to reference citation in accordance with the methodology and ethics of scientific research and guidelines for preparing manuscripts for publication.
NASA Astrophysics Data System (ADS)
Dodick, Jeff; Argamon, Shlomo; Chase, Paul
2009-08-01
A key focus of current science education reforms involves developing inquiry-based learning materials. However, without an understanding of how working scientists actually do science, such learning materials cannot be properly developed. Until now, research on scientific reasoning has focused on cognitive studies of individual scientific fields. However, the question remains as to whether scientists in different fields fundamentally rely on different methodologies. Although many philosophers and historians of science do indeed assert that there is no single monolithic scientific method, this has never been tested empirically. We therefore approach this problem by analyzing patterns of language used by scientists in their published work. Our results demonstrate systematic variation in language use between types of science that are thought to differ in their characteristic methodologies. The features of language use that were found correspond closely to a proposed distinction between Experimental Sciences (e.g., chemistry) and Historical Sciences (e.g., paleontology); thus, different underlying rhetorical and conceptual mechanisms likely operate for scientific reasoning and communication in different contexts.
Marafino, Ben J; Boscardin, W John; Dudley, R Adams
2015-04-01
Sparsity is often a desirable property of statistical models, and various feature selection methods exist so as to yield sparser and interpretable models. However, their application to biomedical text classification, particularly to mortality risk stratification among intensive care unit (ICU) patients, has not been thoroughly studied. To develop and characterize sparse classifiers based on the free text of nursing notes in order to predict ICU mortality risk and to discover text features most strongly associated with mortality. We selected nursing notes from the first 24h of ICU admission for 25,826 adult ICU patients from the MIMIC-II database. We then developed a pair of stochastic gradient descent-based classifiers with elastic-net regularization. We also studied the performance-sparsity tradeoffs of both classifiers as their regularization parameters were varied. The best-performing classifier achieved a 10-fold cross-validated AUC of 0.897 under the log loss function and full L2 regularization, while full L1 regularization used just 0.00025% of candidate input features and resulted in an AUC of 0.889. Using the log loss (range of AUCs 0.889-0.897) yielded better performance compared to the hinge loss (0.850-0.876), but the latter yielded even sparser models. Most features selected by both classifiers appear clinically relevant and correspond to predictors already present in existing ICU mortality models. The sparser classifiers were also able to discover a number of informative - albeit nonclinical - features. The elastic-net-regularized classifiers perform reasonably well and are capable of reducing the number of features required by over a thousandfold, with only a modest impact on performance. Copyright © 2015 Elsevier Inc. All rights reserved.
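To make the performance-sparsity tradeoff concrete, here is a minimal sketch of logistic regression trained by stochastic gradient descent with an elastic-net penalty. It is not the authors' implementation: the proximal (soft-thresholding) treatment of the L1 term, the parameter names `alpha` and `l1_ratio`, the learning rate, and the toy data are all assumptions chosen for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(w, t):
    """Shrink w toward zero by t, returning exactly 0.0 inside [-t, t]."""
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def train(X, y, alpha=0.01, l1_ratio=0.5, lr=0.1, epochs=50, seed=0):
    """SGD on the log loss with penalty
    alpha * (l1_ratio * |w|_1 + (1 - l1_ratio) / 2 * |w|_2^2)."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            z = b + sum(wj * xj for wj, xj in zip(w, X[i]))
            g = sigmoid(z) - y[i]  # derivative of the log loss w.r.t. z
            for j in range(len(w)):
                # Gradient step on the loss and the L2 part of the penalty.
                step = w[j] - lr * (g * X[i][j] + alpha * (1 - l1_ratio) * w[j])
                # Proximal step for the L1 part: this is what yields zeros.
                w[j] = soft_threshold(step, lr * alpha * l1_ratio)
            b -= lr * g  # the intercept is left unpenalized
    return w, b


# Toy bag-of-words rows: feature 0 marks positives, feature 1 negatives.
X = [[1, 0], [1, 0], [0, 1], [0, 1]]
y = [1, 1, 0, 0]
weights, bias = train(X, y)
sparse_weights, _ = train(X, y, alpha=10.0, l1_ratio=1.0)
```

Raising `alpha` with `l1_ratio=1.0` drives more weights to exactly zero, which mirrors the abstract's observation that full L1 regularization keeps only a tiny fraction of candidate features.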
Chen, Yifei; Sun, Yuxing; Han, Bing-Qing
2015-01-01
Protein interaction article classification is a text classification task in the biological domain to determine which articles describe protein-protein interactions. Since the feature space in text classification is high-dimensional, feature selection is widely used for reducing the dimensionality of features to speed up computation without sacrificing classification performance. Many existing feature selection methods are based on the statistical measure of document frequency and term frequency. One potential drawback of these methods is that they treat features separately. Hence, first we design a similarity measure between the context information to take word cooccurrences and phrase chunks around the features into account. Then we introduce the similarity of context information to the importance measure of the features to substitute the document and term frequency. Hence we propose new context similarity-based feature selection methods. Their performance is evaluated on two protein interaction article collections and compared against the frequency-based methods. The experimental results reveal that the context similarity-based methods perform better in terms of the F1 measure and the dimension reduction rate. Benefiting from the context information surrounding the features, the proposed methods can select distinctive features effectively for protein interaction article classification.
A graphics approach in the design of the dual air density Explorer satellites
NASA Technical Reports Server (NTRS)
Mcdougal, D. S.
1975-01-01
A computer program was developed to generate a graphics display of the Dual Air Density (DAD) Explorer satellites which aids in the engineering and scientific design. The program displays a two-dimensional view of both spacecraft and their surface features from any direction. The graphics have been an indispensable tool in the design, analysis, and understanding of the critical locations of the various surface features for both satellites.
Student-Accessible Science Texts: Elements of Design
ERIC Educational Resources Information Center
McTigue, Erin M.; Slough, Scott W.
2010-01-01
Within this article, we introduce our conception of text accessibility. First, we synthesize recent research on informational text quality and present key attributes proven to contribute to comprehension of science texts beyond the readability formula. These features include (a) the concreteness of text, (b) the voice of the author, (c) coherent…
Reading without Words: Using the Arrival to Teach Visual Literacy with English Language Learners
ERIC Educational Resources Information Center
Mathews, Sarah A.
2014-01-01
This article highlights the use of Shaun Tan's "The Arrival" to teach literacy to English Language Learners in social studies classrooms. The featured text is a book that displays the complexity of migration within a text that does not feature a single written word. The author describes a variety of mini-lessons geared towards…
Del Fiol, Guilherme; Michelson, Matthew; Iorio, Alfonso; Cotoi, Chris; Haynes, R Brian
2018-06-25
A major barrier to the practice of evidence-based medicine is efficiently finding scientifically sound studies on a given clinical topic. To investigate a deep learning approach to retrieve scientifically sound treatment studies from the biomedical literature. We trained a Convolutional Neural Network using a noisy dataset of 403,216 PubMed citations with title and abstract as features. The deep learning model was compared with state-of-the-art search filters, such as PubMed's Clinical Query Broad treatment filter, McMaster's textword search strategy (no Medical Subject Heading, MeSH, terms), and Clinical Query Balanced treatment filter. A previously annotated dataset (Clinical Hedges) was used as the gold standard. The deep learning model obtained significantly lower recall than the Clinical Queries Broad treatment filter (96.9% vs 98.4%; P<.001); and equivalent recall to McMaster's textword search (96.9% vs 97.1%; P=.57) and Clinical Queries Balanced filter (96.9% vs 97.0%; P=.63). Deep learning obtained significantly higher precision than the Clinical Queries Broad filter (34.6% vs 22.4%; P<.001) and McMaster's textword search (34.6% vs 11.8%; P<.001), but was significantly lower than the Clinical Queries Balanced filter (34.6% vs 40.9%; P<.001). Deep learning performed well compared to state-of-the-art search filters, especially when citations were not indexed. Unlike previous machine learning approaches, the proposed deep learning model does not require feature engineering, or time-sensitive or proprietary features, such as MeSH terms and bibliometrics. Deep learning is a promising approach to identifying reports of scientifically rigorous clinical research. Further work is needed to optimize the deep learning model and to assess generalizability to other areas, such as diagnosis, etiology, and prognosis. ©Guilherme Del Fiol, Matthew Michelson, Alfonso Iorio, Chris Cotoi, R Brian Haynes. 
Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 25.06.2018.
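The recall and precision figures in this record compare what a filter retrieves against a gold-standard set of sound studies. A minimal sketch of that comparison is given below; the function name and the citation identifiers are hypothetical and not drawn from the study.

```python
def precision_recall(retrieved, relevant):
    """Precision and recall of a retrieved set against a gold-standard set.

    precision = hits / number retrieved; recall = hits / number relevant.
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical citation IDs, invented for illustration only.
retrieved_ids = {1, 2, 3, 4}
gold_standard_ids = {2, 4, 5}
precision, recall = precision_recall(retrieved_ids, gold_standard_ids)
```

A broad filter tends to trade precision for recall, which matches the pattern the abstract reports for the Clinical Queries Broad filter relative to the deep learning model.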
Patterns of text reuse in a scientific corpus
Citron, Daniel T.; Ginsparg, Paul
2015-01-01
We consider the incidence of text “reuse” by researchers via a systematic pairwise comparison of the text content of all articles deposited to arXiv.org from 1991 to 2012. We measure the global frequencies of three classes of text reuse and measure how chronic text reuse is distributed among authors in the dataset. We infer a baseline for accepted practice, perhaps surprisingly permissive compared with other societal contexts, and a clearly delineated set of aberrant authors. We find a negative correlation between the amount of reused text in an article and its influence, as measured by subsequent citations. Finally, we consider the distribution of countries of origin of articles containing large amounts of reused text. PMID:25489072
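One simple way to approximate a pairwise comparison of text content is to measure shared word n-grams between two documents. The sketch below does that; the abstract does not say which detection method or n-gram length the study used, so the approach, the default `n=3`, and the example sentences are assumptions.

```python
def ngrams(text, n=3):
    """Set of lowercase word n-grams in `text`."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def reuse_fraction(doc, earlier, n=3):
    """Fraction of `doc`'s n-grams that also occur in `earlier`."""
    doc_grams = ngrams(doc, n)
    if not doc_grams:
        return 0.0  # document shorter than n words
    return len(doc_grams & ngrams(earlier, n)) / len(doc_grams)


# Toy sentences, invented for illustration only.
later = "we measure the global frequencies of text reuse"
earlier = "they measure the global frequencies of citations"
fraction = reuse_fraction(later, earlier)
```

Comparing every pair of articles this way is quadratic in corpus size, so a corpus-scale study would presumably need hashing or indexing of the n-grams; that engineering is outside this sketch.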
A microwave applicator for uniform irradiation by circularly polarized waves in an anechoic chamber
NASA Astrophysics Data System (ADS)
Chiang, W. Y.; Wu, M. H.; Wu, K. L.; Lin, M. H.; Teng, H. H.; Tsai, Y. F.; Ko, C. C.; Yang, E. C.; Jiang, J. A.; Barnett, L. R.; Chu, K. R.
2014-08-01
Microwave applicators are widely employed for materials heating in scientific research and industrial applications, such as food processing, wood drying, ceramic sintering, chemical synthesis, waste treatment, and insect control. For the majority of microwave applicators, materials are heated in the standing waves of a resonant cavity, which can be highly efficient in energy consumption, but often lacks the field uniformity and controllability required for a scientific study. Here, we report a microwave applicator for rapid heating of small samples by highly uniform irradiation. It features an anechoic chamber, a 24-GHz microwave source, and a linear-to-circular polarization converter. With a rather low energy efficiency, such an applicator functions mainly as a research tool. This paper discusses the significance of its special features and describes the structure, in situ diagnostic tools, calculated and measured field patterns, and a preliminary heating test of the overall system.
The status of the concept of 'phoneme' in psycholinguistics.
Uppstad, Per Henning; Tønnessen, Finn Egil
2010-10-01
The notion of the phoneme counts as a breakthrough of modern theoretical linguistics in the early twentieth century. It paved the way for descriptions of distinctive features at different levels in linguistics. Although it has since had a turbulent existence across changing theoretical positions, it remains a powerful concept of a fundamental unit in spoken language. At the same time, its conceptual status remains highly unclear. The present article aims to clarify the status of the concept of 'phoneme' in psycholinguistics, based on the scientific concepts of description, understanding and explanation. Theoretical linguistics has provided mainly descriptions. The ideas underlying this article are, first, that these descriptions may not be directly relevant to psycholinguistics and, second, that psycholinguistics in this sense is not a sub-discipline of theoretical linguistics. Rather, these two disciplines operate with different sets of features and with different orientations when it comes to the scientific concepts of description, understanding and explanation.
Drake, Phillip
2016-04-01
The Lapindo mudflow is one of the most controversial disasters in Indonesian history. Despite its unique biophysical features, most consider the mudflow a social disaster as scientific conflicts about its main trigger have evolved into legal disputes over accountability and rights. This paper examines this 'trigger debate', the stakes of scientific contention and the broader social and natural dynamics that shape the terms of this debate. A Latourian impulse drives this analysis, which aims to improve both understandings of, and responses to, complex disasters. This paper also notes that the stakes of representation extend to constructions of its stakeholders, especially to victims. As socionatural disasters become an increasingly common feature of the contemporary world, from mud volcanoes to extreme weather events caused by global warming, it is more important than ever to understand the dynamics of representing disasters and stakeholders. © 2016 The Author(s). Disasters © Overseas Development Institute, 2016.
Representing nested semantic information in a linear string of text using XML.
Krauthammer, Michael; Johnson, Stephen B.; Hripcsak, George; Campbell, David A.; Friedman, Carol
2002-01-01
XML has been widely adopted as an important data interchange language. The structure of XML enables sharing of data elements with variable degrees of nesting as long as the elements are grouped in a strict tree-like fashion. This requirement potentially restricts the usefulness of XML for marking up written text, which often includes features that do not properly nest within other features. We encountered this problem while marking up medical text with structured semantic information from a Natural Language Processor. Traditional approaches to this problem separate the structured information from the actual text mark up. This paper introduces an alternative solution, which tightly integrates the semantic structure with the text. The resulting XML markup preserves the linearity of the medical texts and can therefore be easily expanded with additional types of information. PMID:12463856
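The nesting constraint described in this abstract can be illustrated with a small check. The sketch below is not the authors' method; it only tests whether a set of annotation spans could be encoded as a strict XML tree. The function name `properly_nested`, the half-open character offsets, and the sample spans are all assumptions made for illustration.

```python
def properly_nested(spans):
    """Return True if every pair of (start, end) spans is disjoint or nested.

    Spans are half-open character offsets [start, end). Spans that cross
    (partially overlap) cannot be written as well-formed nested XML elements.
    """
    # Sort by start offset; for equal starts, put the longer (outer) span first.
    ordered = sorted(spans, key=lambda s: (s[0], -s[1]))
    open_ends = []  # end offsets of the spans that are currently "open"
    for start, end in ordered:
        # Close every span that ends at or before the new span starts.
        while open_ends and open_ends[-1] <= start:
            open_ends.pop()
        # If the new span runs past the innermost open span, the two cross.
        if open_ends and end > open_ends[-1]:
            return False
        open_ends.append(end)
    return True


# Hypothetical annotation offsets over a 24-character phrase.
tree_like = [(0, 24), (0, 8), (12, 24)]   # one outer span, two inner spans
crossing = [(0, 16), (8, 24)]             # two spans that partially overlap

print(properly_nested(tree_like))
print(properly_nested(crossing))
```

When the check fails, the annotations need some encoding other than plain element nesting, which is the situation the abstract says arises in marked-up written text.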
Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.
Zerrouki, Taha; Balla, Amar
2017-04-01
Arabic diacritics are often omitted in Arabic script. This is a handicap for new learners reading Arabic, for text-to-speech conversion systems, and for the reading and semantic analysis of Arabic texts. Automatic diacritization systems are the best solution to this issue, but such automation needs resources such as diacritized texts to train and evaluate the systems. In this paper, we describe our corpus of Arabic diacritized texts, called Tashkeela. It can be used as a linguistic resource for natural language processing tasks such as automatic diacritization, disambiguation, and feature and data extraction. The corpus is freely available; it contains 75 million fully vocalized words, mainly from 97 books of classical and modern Arabic. The corpus was collected from manually vocalized texts using a web crawling process.
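One way a vocalized corpus can serve as training data for automatic diacritization is by stripping the diacritics to obtain an (input, target) pair. The sketch below illustrates that idea only; it is not the authors' tooling. The function names are hypothetical, and the diacritic set is assumed to be the eight basic marks in the Unicode range U+064B to U+0652.

```python
# Basic Arabic diacritics (tanwin forms, fatha, damma, kasra, shadda, sukun).
# Assumption: only this Unicode range is treated as diacritics here.
TASHKEEL = {chr(code) for code in range(0x064B, 0x0653)}


def strip_diacritics(text):
    """Remove the basic Arabic diacritics, leaving the base letters intact."""
    return "".join(ch for ch in text if ch not in TASHKEEL)


def training_pair(vocalized):
    """Build an (undiacritized input, diacritized target) pair."""
    return strip_diacritics(vocalized), vocalized


# The word "kataba" written with a fatha after each of its three letters.
vocalized_word = "\u0643\u064E\u062A\u064E\u0628\u064E"
source, target = training_pair(vocalized_word)
print(len(target), len(source))
```

A model trained on such pairs learns to restore the marks that `strip_diacritics` removed; evaluation can compare its output with the original vocalized text.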
ERIC Educational Resources Information Center
Simon, Uwe K.; Steindl, Hanna; Larcher, Nicole; Kulac, Helga; Hotter, Annelies
2016-01-01
Far too few high-school students choose subjects from the natural sciences (NaSc) for their majors in many countries. Even fewer study biology, chemistry or physics at university. Those that do often lack training to present and discuss scientific results and ideas in texts. To meet these challenges the center for didactics of biology of Graz…
Learning Semantic Tags from Big Data for Clinical Text Representation.
Li, Yanpeng; Liu, Hongfang
2015-01-01
In clinical text mining, one of the biggest challenges is representing medical terminologies and n-gram terms in sparse medical reports using either supervised or unsupervised methods. Addressing this issue, we propose a novel method for word and n-gram representation at the semantic level. We first represent each word by its distance to a set of reference features, calculated by a reference distance estimator (RDE) learned from labeled and unlabeled data, and then generate new features using simple techniques of discretization, random sampling and merging. The new features are a set of binary rules that can be interpreted as semantic tags derived from words and n-grams. We show that the new features significantly outperform classical bag-of-words and n-grams in the task of heart disease risk factor extraction in the i2b2 2014 challenge. It is promising to see that semantic tags can be used to replace the original text entirely with even better prediction performance, as well as to derive new rules beyond the lexical level.
Discovering body site and severity modifiers in clinical texts
Dligach, Dmitriy; Bethard, Steven; Becker, Lee; Miller, Timothy; Savova, Guergana K
2014-01-01
Objective: To research computational methods for discovering body site and severity modifiers in clinical texts. Methods: We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. Results: The performance of our method for discovering body site modifiers achieves F1 of 0.740–0.908 and our method for discovering severity modifiers achieves F1 of 0.905–0.929. Discussion: Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. Conclusions: We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES). PMID:24091648
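The F1 figures reported in this abstract are the harmonic mean of precision and recall. As a minimal sketch of how such a score could be computed for a relation extraction task, the snippet below compares a set of predicted (entity, modifier) pairs with a gold set. The function name and the example pairs are hypothetical and are not taken from the paper or from cTAKES.

```python
def f1_score(gold, predicted):
    """Compute F1 = 2PR / (P + R) over two collections of relation pairs."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical gold and predicted (entity, body-site modifier) pairs.
gold_pairs = {
    ("pain", "left arm"),
    ("fracture", "wrist"),
    ("rash", "back"),
    ("swelling", "ankle"),
}
predicted_pairs = {
    ("pain", "left arm"),
    ("fracture", "wrist"),
    ("rash", "back"),
    ("swelling", "knee"),  # a wrong prediction
}

print(f1_score(gold_pairs, predicted_pairs))
```

In this toy case three of four predictions are correct and three of four gold pairs are found, so precision and recall are both 0.75.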
ERIC Educational Resources Information Center
Safadi, Rafi; Safadi, Ekhlass; Meidav, Meir
2017-01-01
This study compared students' learning in troubleshooting and problem solving activities. The troubleshooting activities provided students with solutions to conceptual problems in the form of refutation texts; namely, solutions that portray common misconceptions, refute them, and then present the accepted scientific ideas. They required students…