umls refined semantic: Topics by Science.gov

Sample records for umls refined semantic

Sculpting the UMLS Refined Semantic Network.

PubMed

He, Zhe; Morrey, C Paul; Perl, Yehoshua; Elhanan, Gai; Chen, Ling; Chen, Yan; Geller, James

2014-01-01

The Refined Semantic Network (RSN) for the UMLS was previously introduced to complement the UMLS Semantic Network (SN). The RSN partitions the UMLS Metathesaurus (META) into disjoint groups of concepts. Each such group is semantically uniform. However, the RSN was initially an order of magnitude larger than the SN, which is undesirable since to be useful, a semantic network should be compact. Most semantic types in the RSN represent combinations of semantic types in the UMLS SN. Such a "combination semantic type" is called Intersection Semantic Type (IST). Many ISTs are assigned to very few concepts. Moreover, when reviewing those concepts, many semantic type assignment inconsistencies were found. After correcting those inconsistencies many ISTs, among them some that contradicted UMLS rules, disappeared, which made the RSN smaller. The authors performed a longitudinal study with the goal of reducing the size of the RSN to become compact. This goal was achieved by correcting inconsistencies and errors in the IST assignments in the UMLS, which additionally helped identify and correct ambiguities, inconsistencies, and errors in source terminologies widely used in the realm of public health. In this paper, we discuss the process and steps employed in this longitudinal study and the intermediate results for different stages. The sculpting process includes removing redundant semantic type assignments, expanding semantic type assignments, and removing illegitimate ISTs by auditing ISTs of small extents. However, the emphasis of this paper is not on the auditing methodologies employed during the process, since they were introduced in earlier publications, but on the strategy of employing them in order to transform the RSN into a compact network. For this paper we also performed a comprehensive audit of 168 "small ISTs" in the 2013AA version of the UMLS to finalize the longitudinal study. Over the years it was found that the editors of the UMLS introduced some new inconsistencies that resulted in the reintroduction of unwarranted ISTs that had already been eliminated as a result of their previous corrections. Because of that, the transformation of the RSN into a compact network covering all necessary categories for the UMLS was slowed down. The corrections suggested by an audit of the 2013AA version of the UMLS achieve a compact RSN of equal magnitude as the UMLS SN. The number of ISTs has been reduced to 336. We also demonstrate how auditing the semantic type assignments of UMLS concepts can expose other modeling errors in the UMLS source terminologies, e.g., SNOMED CT, LOINC, and RxNORM that are important for health informatics. Such errors would otherwise stay hidden. It is hoped that the UMLS curators will implement all required corrections and use the RSN along with the SN when maintaining and extending the UMLS. When used correctly, the RSN will support the prevention of the accidental introduction of inconsistent semantic type assignments into the UMLS. Furthermore, this way the RSN will support the exposure of other hidden errors and inconsistencies in health informatics terminologies, which are sources of the UMLS. Notably, the development of the RSN materializes the deeper, more refined Semantic Network for the UMLS that its designers envisioned originally but had not implemented.
Sculpting the UMLS Refined Semantic Network

PubMed Central

Morrey, C. Paul; Perl, Yehoshua; Elhanan, Gai; Chen, Ling; Chen, Yan; Geller, James

2014-01-01

Background The Refined Semantic Network (RSN) for the UMLS was previously introduced to complement the UMLS Semantic Network (SN). The RSN partitions the UMLS Metathesaurus (META) into disjoint groups of concepts. Each such group is semantically uniform. However, the RSN was initially an order of magnitude larger than the SN, which is undesirable since to be useful, a semantic network should be compact. Most semantic types in the RSN represent combinations of semantic types in the UMLS SN. Such a “combination semantic type” is called Intersection Semantic Type (IST). Many ISTs are assigned to very few concepts. Moreover, when reviewing those concepts, many semantic type assignment inconsistencies were found. After correcting those inconsistencies many ISTs, among them some that contradicted UMLS rules, disappeared, which made the RSN smaller. Objective The authors performed a longitudinal study with the goal of reducing the size of the RSN to become compact. This goal was achieved by correcting inconsistencies and errors in the IST assignments in the UMLS, which additionally helped identify and correct ambiguities, inconsistencies, and errors in source terminologies widely used in the realm of public health. Methods In this paper, we discuss the process and steps employed in this longitudinal study and the intermediate results for different stages. The sculpting process includes removing redundant semantic type assignments, expanding semantic type assignments, and removing illegitimate ISTs by auditing ISTs of small extents. However, the emphasis of this paper is not on the auditing methodologies employed during the process, since they were introduced in earlier publications, but on the strategy of employing them in order to transform the RSN into a compact network. For this paper we also performed a comprehensive audit of 168 “small ISTs” in the 2013AA version of the UMLS to finalize the longitudinal study. Results Over the years it was found that the editors of the UMLS introduced some new inconsistencies that resulted in the reintroduction of unwarranted ISTs that had already been eliminated as a result of their previous corrections. Because of that, the transformation of the RSN into a compact network covering all necessary categories for the UMLS was slowed down. The corrections suggested by an audit of the 2013AA version of the UMLS achieve a compact RSN of equal magnitude as the UMLS SN. The number of ISTs has been reduced to 336. We also demonstrate how auditing the semantic type assignments of UMLS concepts can expose other modeling errors in the UMLS source terminologies, e.g., SNOMED CT, LOINC, and RxNORM that are important for health informatics. Such errors would otherwise stay hidden. Conclusions It is hoped that the UMLS curators will implement all required corrections and use the RSN along with the SN when maintaining and extending the UMLS. When used correctly, the RSN will support the prevention of the accidental introduction of inconsistent semantic type assignments into the UMLS. Furthermore, this way the RSN will support the exposure of other hidden errors and inconsistencies in health informatics terminologies, which are sources of the UMLS. Notably, the development of the RSN materializes the deeper, more refined Semantic Network for the UMLS that its designers envisioned originally but had not implemented. PMID:25422719
Auditing Associative Relations across Two Knowledge Sources

PubMed Central

Vizenor, Lowell T.; Bodenreider, Olivier; McCray, Alexa T.

2009-01-01

Objectives This paper proposes a novel semantic method for auditing associative relations in biomedical terminologies. We tested our methodology on two Unified Medical Language System (UMLS) knowledge sources. Methods We use the UMLS semantic groups as high-level representations of the domain and range of relationships in the Metathesaurus and in the Semantic Network. A mapping created between Metathesaurus relationships and Semantic Network relationships forms the basis for comparing the signatures of a given Metathesaurus relationship to the signatures of the semantic relationship to which it is mapped. The consistency of Metathesaurus relations is studied for each relationship. Results Of the 177 associative relationships in the Metathesaurus, 84 (48%) exhibit a high degree of consistency with the corresponding Semantic Network relationships. Overall, 63% of the 1.8M associative relations in the Metathesaurus are consistent with relations in the Semantic Network. Conclusion The semantics of associative relationships in biomedical terminologies should be defined explicitly by their developers. The Semantic Network would benefit from being extended with new relationships and with new relations for some existing relationships. The UMLS editing environment could take advantage of the correspondence established between relationships in the Metathesaurus and the Semantic Network. Finally, the auditing method also yielded useful information for refining the mapping of associative relationships between the two sources. PMID:19475724
Auditing the Assignments of Top-Level Semantic Types in the UMLS Semantic Network to UMLS Concepts

PubMed Central

He, Zhe; Perl, Yehoshua; Elhanan, Gai; Chen, Yan; Geller, James; Bian, Jiang

2018-01-01

The Unified Medical Language System (UMLS) is an important terminological system. By the policy of its curators, each concept of the UMLS should be assigned the most specific Semantic Types (STs) in the UMLS Semantic Network (SN). Hence, the Semantic Types of most UMLS concepts are assigned at or near the bottom (leaves) of the UMLS Semantic Network. While most ST assignments are correct, some errors do occur. Therefore, Quality Assurance efforts of UMLS curators for ST assignments should concentrate on automatically detected sets of UMLS concepts with higher error rates than random sets. In this paper, we investigate the assignments of top-level semantic types in the UMLS semantic network to concepts, identify potential erroneous assignments, define four categories of errors, and thus provide assistance to curators of the UMLS to avoid these assignments errors. Human experts analyzed samples of concepts assigned 10 of the top-level semantic types and categorized the erroneous ST assignments into these four logical categories. Two thirds of the concepts assigned these 10 top-level semantic types are erroneous. Our results demonstrate that reviewing top-level semantic type assignments to concepts provides an effective way for UMLS quality assurance, comparing to reviewing a random selection of semantic type assignments. PMID:29375930
Auditing the Assignments of Top-Level Semantic Types in the UMLS Semantic Network to UMLS Concepts.

PubMed

He, Zhe; Perl, Yehoshua; Elhanan, Gai; Chen, Yan; Geller, James; Bian, Jiang

2017-11-01

The Unified Medical Language System (UMLS) is an important terminological system. By the policy of its curators, each concept of the UMLS should be assigned the most specific Semantic Types (STs) in the UMLS Semantic Network (SN). Hence, the Semantic Types of most UMLS concepts are assigned at or near the bottom (leaves) of the UMLS Semantic Network. While most ST assignments are correct, some errors do occur. Therefore, Quality Assurance efforts of UMLS curators for ST assignments should concentrate on automatically detected sets of UMLS concepts with higher error rates than random sets. In this paper, we investigate the assignments of top-level semantic types in the UMLS semantic network to concepts, identify potential erroneous assignments, define four categories of errors, and thus provide assistance to curators of the UMLS to avoid these assignments errors. Human experts analyzed samples of concepts assigned 10 of the top-level semantic types and categorized the erroneous ST assignments into these four logical categories. Two thirds of the concepts assigned these 10 top-level semantic types are erroneous. Our results demonstrate that reviewing top-level semantic type assignments to concepts provides an effective way for UMLS quality assurance, comparing to reviewing a random selection of semantic type assignments.
Rule-based support system for multiple UMLS semantic type assignments

PubMed Central

Geller, James; He, Zhe; Perl, Yehoshua; Morrey, C. Paul; Xu, Julia

2012-01-01

Background When new concepts are inserted into the UMLS, they are assigned one or several semantic types from the UMLS Semantic Network by the UMLS editors. However, not every combination of semantic types is permissible. It was observed that many concepts with rare combinations of semantic types have erroneous semantic type assignments or prohibited combinations of semantic types. The correction of such errors is resource-intensive. Objective We design a computational system to inform UMLS editors as to whether a specific combination of two, three, four, or five semantic types is permissible or prohibited or questionable. Methods We identify a set of inclusion and exclusion instructions in the UMLS Semantic Network documentation and derive corresponding rule-categories as well as rule-categories from the UMLS concept content. We then design an algorithm adviseEditor based on these rule-categories. The algorithm specifies rules for an editor how to proceed when considering a tuple (pair, triple, quadruple, quintuple) of semantic types to be assigned to a concept. Results Eight rule-categories were identified. A Web-based system was developed to implement the adviseEditor algorithm, which returns for an input combination of semantic types whether it is permitted, prohibited or (in a few cases) requires more research. The numbers of semantic type pairs assigned to each rule-category are reported. Interesting examples for each rule-category are illustrated. Cases of semantic type assignments that contradict rules are listed, including recently introduced ones. Conclusion The adviseEditor system implements explicit and implicit knowledge available in the UMLS in a system that informs UMLS editors about the permissibility of a desired combination of semantic types. Using adviseEditor might help accelerate the work of the UMLS editors and prevent erroneous semantic type assignments. PMID:23041716
Quality Assurance of UMLS Semantic Type Assignments Using SNOMED CT Hierarchies.

PubMed

Gu, H; Chen, Y; He, Z; Halper, M; Chen, L

2016-01-01

The Unified Medical Language System (UMLS) is one of the largest biomedical terminological systems, with over 2.5 million concepts in its Metathesaurus repository. The UMLS's Semantic Network (SN) with its collection of 133 high-level semantic types serves as an abstraction layer on top of the Metathesaurus. In particular, the SN elaborates an aspect of the Metathesaurus's concepts via the assignment of one or more types to each concept. Due to the scope and complexity of the Metathesaurus, errors are all but inevitable in this semantic-type assignment process. To develop a semi-automated methodology to help assure the quality of semantic-type assignments within the UMLS. The methodology uses a cross-validation strategy involving SNOMED CT's hierarchies in combination with UMLS semantic types. Semantically uniform, disjoint concept groups are generated programmatically by partitioning the collection of all concepts in the same SNOMED CT hierarchy according to their respective semantic-type assignments in the UMLS. Domain experts are then called upon to review the concepts in any group having a small number of concepts. It is our hypothesis that a semantic-type assignment combination applicable only to a very small number of concepts in a SNOMED CT hierarchy is an indicator of potential problems. The methodology was applied to the UMLS 2013AA release along with the SNOMED CT from January 2013. An overall error rate of 33% was found for concepts proposed by the quality-assurance methodology. Supporting our hypothesis, that number was four times higher than the error rate found in control samples. The results show that the quality-assurance methodology can aid in effective and efficient identification of UMLS semantic-type assignment errors.
Semi-Supervised Learning to Identify UMLS Semantic Relations.

PubMed

Luo, Yuan; Uzuner, Ozlem

2014-01-01

The UMLS Semantic Network is constructed by experts and requires periodic expert review to update. We propose and implement a semi-supervised approach for automatically identifying UMLS semantic relations from narrative text in PubMed. Our method analyzes biomedical narrative text to collect semantic entity pairs, and extracts multiple semantic, syntactic and orthographic features for the collected pairs. We experiment with seeded k-means clustering with various distance metrics. We create and annotate a ground truth corpus according to the top two levels of the UMLS semantic relation hierarchy. We evaluate our system on this corpus and characterize the learning curves of different clustering configuration. Using KL divergence consistently performs the best on the held-out test data. With full seeding, we obtain macro-averaged F-measures above 70% for clustering the top level UMLS relations (2-way), and above 50% for clustering the second level relations (7-way).
Expanding the Extent of a UMLS Semantic Type via Group Neighborhood Auditing

PubMed Central

Chen, Yan; Gu, Huanying; Perl, Yehoshua; Halper, Michael; Xu, Junchuan

2009-01-01

Objective Each Unified Medical Language System (UMLS) concept is assigned one or more semantic types (ST). A dynamic methodology for aiding an auditor in finding concepts that are missing the assignment of a given ST, S is presented. Design The first part of the methodology exploits the previously introduced Refined Semantic Network and accompanying refined semantic types (RST) to help narrow the search space for offending concepts. The auditing is focused in a neighborhood surrounding the extent of an RST, T (of S) called an envelope, consisting of parents and children of concepts in the extent. The audit moves outward as long as missing assignments are discovered. In the second part, concepts not reached previously are processed and reassigned T as needed during the processing of S's other RSTs. The set of such concepts is expanded in a similar way to that in the first part. Measurements The number of errors discovered is reported. To measure the methodology's efficiency, “error hit rates” (i.e., errors found in concepts examined) are computed. Results The methodology was applied to three STs: Experimental Model of Disease (EMD), Environmental Effect of Humans, and Governmental or Regulatory Activity. The EMD experienced the most drastic change. For its RST “EMD ∩ Neoplastic Process” (RST “EMD”) with only 33 (31) original concepts, 915 (134) concepts were found by the first (second) part to be missing the EMD assignment. Changes to the other two STs were smaller. Conclusion The results show that the proposed auditing methodology can help to effectively and efficiently identify concepts lacking the assignment of a particular semantic type. PMID:19567802
Evaluation of a UMLS Auditing Process of Semantic Type Assignments

PubMed Central

Gu, Huanying; Hripcsak, George; Chen, Yan; Morrey, C. Paul; Elhanan, Gai; Cimino, James J.; Geller, James; Perl, Yehoshua

2007-01-01

The UMLS is a terminological system that integrates many source terminologies. Each concept in the UMLS is assigned one or more semantic types from the Semantic Network, an upper level ontology for biomedicine. Due to the complexity of the UMLS, errors exist in the semantic type assignments. Finding assignment errors may unearth modeling errors. Even with sophisticated tools, discovering assignment errors requires manual review. In this paper we describe the evaluation of an auditing project of UMLS semantic type assignments. We studied the performance of the auditors who reviewed potential errors. We found that four auditors, interacting according to a multi-step protocol, identified a high rate of errors (one or more errors in 81% of concepts studied) and that results were sufficiently reliable (0.67 to 0.70) for the two most common types of errors. However, reliability was low for each individual auditor, suggesting that review of potential errors is resource-intensive. PMID:18693845
Semantic Mappings and Locality of Nursing Diagnostic Concepts in UMLS

PubMed Central

Kim, Tae Youn; Coenen, Amy; Hardiker, Nicholas

2011-01-01

One solution for enhancing the interoperability between nursing information systems, given the availability of multiple nursing terminologies, is to cross-map existing nursing concepts. The Unified Medical Language System (UMLS) developed and distributed by the National Library of Medicine (NLM) is a knowledge resource containing cross-mappings of various terminologies in a unified framework. While the knowledge resource has been available for the last two decades, little research on the representation of nursing terminologies in UMLS has been conducted. As a first step, UMLS semantic mappings and concept locality were examined for nursing diagnostic concepts or problems selected from three terminologies (i.e., CCC, ICNP, and NANDA-I) along with corresponding SNOMED CT concepts. The evaluation of UMLS semantic mappings was conducted by measuring the proportion of concordance between UMLS and human expert mappings. The semantic locality of nursing diagnostic concepts was assessed by examining the associations of select concepts and the placement of the nursing concepts on the Semantic Network and Group. The study found that the UMLS mappings of CCC and NANDA-I concepts to SNOMED CT were highly concordant to expert mappings. The level of concordance in mappings of ICNP to SNOMED CT, CCC and NANDA-I within UMLS was relatively low, indicating the need for further research and development. Likewise, the semantic locality of ICNP concepts could be further improved. Various stakeholders need to collaborate to enhance the NLM knowledge resource and the interoperability of nursing data within the discipline as well as across health-related disciplines. PMID:21951759
EMSE at TREC 2015 Clinical Decision Support Track

DTIC Science & Technology

2015-11-20

pseudo relevant documents, semantic ressources of UMLS , and a hybrid approach called SMERA that combines LSI and UMLS based approaches. Only three of...approach to query expansion uses ontologies ( UMLS ) and a lo- cal approach based on pseudo relevant feedback documents using LSI. A brief description of...pseudo relevance feedback documents, and a semantic method based on UMLS concepts. The LSI-based method was used only to expand summary terms that can’t
Quality evaluation of value sets from cancer study common data elements using the UMLS semantic groups

PubMed Central

Solbrig, Harold R; Chute, Christopher G

2012-01-01

Objective The objective of this study is to develop an approach to evaluate the quality of terminological annotations on the value set (ie, enumerated value domain) components of the common data elements (CDEs) in the context of clinical research using both unified medical language system (UMLS) semantic types and groups. Materials and methods The CDEs of the National Cancer Institute (NCI) Cancer Data Standards Repository, the NCI Thesaurus (NCIt) concepts and the UMLS semantic network were integrated using a semantic web-based framework for a SPARQL-enabled evaluation. First, the set of CDE-permissible values with corresponding meanings in external controlled terminologies were isolated. The corresponding value meanings were then evaluated against their NCI- or UMLS-generated semantic network mapping to determine whether all of the meanings fell within the same semantic group. Results Of the enumerated CDEs in the Cancer Data Standards Repository, 3093 (26.2%) had elements drawn from more than one UMLS semantic group. A random sample (n=100) of this set of elements indicated that 17% of them were likely to have been misclassified. Discussion The use of existing semantic web tools can support a high-throughput mechanism for evaluating the quality of large CDE collections. This study demonstrates that the involvement of multiple semantic groups in an enumerated value domain of a CDE is an effective anchor to trigger an auditing point for quality evaluation activities. Conclusion This approach produces a useful quality assurance mechanism for a clinical study CDE repository. PMID:22511016
Alignment of the UMLS semantic network with BioTop: methodology and assessment.

PubMed

Schulz, Stefan; Beisswanger, Elena; van den Hoek, László; Bodenreider, Olivier; van Mulligen, Erik M

2009-06-15

For many years, the Unified Medical Language System (UMLS) semantic network (SN) has been used as an upper-level semantic framework for the categorization of terms from terminological resources in biomedicine. BioTop has recently been developed as an upper-level ontology for the biomedical domain. In contrast to the SN, it is founded upon strict ontological principles, using OWL DL as a formal representation language, which has become standard in the semantic Web. In order to make logic-based reasoning available for the resources annotated or categorized with the SN, a mapping ontology was developed aligning the SN with BioTop. The theoretical foundations and the practical realization of the alignment are being described, with a focus on the design decisions taken, the problems encountered and the adaptations of BioTop that became necessary. For evaluation purposes, UMLS concept pairs obtained from MEDLINE abstracts by a named entity recognition system were tested for possible semantic relationships. Furthermore, all semantic-type combinations that occur in the UMLS Metathesaurus were checked for satisfiability. The effort-intensive alignment process required major design changes and enhancements of BioTop and brought up several design errors that could be fixed. A comparison between a human curator and the ontology yielded only a low agreement. Ontology reasoning was also used to successfully identify 133 inconsistent semantic-type combinations. BioTop, the OWL DL representation of the UMLS SN, and the mapping ontology are available at http://www.purl.org/biotop/.
Towards comprehensive syntactic and semantic annotations of the clinical narrative

PubMed Central

Albright, Daniel; Lanfranchi, Arrick; Fredriksen, Anwen; Styler, William F; Warner, Colin; Hwang, Jena D; Choi, Jinho D; Dligach, Dmitriy; Nielsen, Rodney D; Martin, James; Ward, Wayne; Palmer, Martha; Savova, Guergana K

2013-01-01

Objective To create annotated clinical narratives with layers of syntactic and semantic labels to facilitate advances in clinical natural language processing (NLP). To develop NLP algorithms and open source components. Methods Manual annotation of a clinical narrative corpus of 127 606 tokens following the Treebank schema for syntactic information, PropBank schema for predicate-argument structures, and the Unified Medical Language System (UMLS) schema for semantic information. NLP components were developed. Results The final corpus consists of 13 091 sentences containing 1772 distinct predicate lemmas. Of the 766 newly created PropBank frames, 74 are verbs. There are 28 539 named entity (NE) annotations spread over 15 UMLS semantic groups, one UMLS semantic type, and the Person semantic category. The most frequent annotations belong to the UMLS semantic groups of Procedures (15.71%), Disorders (14.74%), Concepts and Ideas (15.10%), Anatomy (12.80%), Chemicals and Drugs (7.49%), and the UMLS semantic type of Sign or Symptom (12.46%). Inter-annotator agreement results: Treebank (0.926), PropBank (0.891–0.931), NE (0.697–0.750). The part-of-speech tagger, constituency parser, dependency parser, and semantic role labeler are built from the corpus and released open source. A significant limitation uncovered by this project is the need for the NLP community to develop a widely agreed-upon schema for the annotation of clinical concepts and their relations. Conclusions This project takes a foundational step towards bringing the field of clinical NLP up to par with NLP in the general domain. The corpus creation and NLP components provide a resource for research and application development that would have been previously impossible. PMID:23355458
Logic-based assessment of the compatibility of UMLS ontology sources

PubMed Central

2011-01-01

Background The UMLS Metathesaurus (UMLS-Meta) is currently the most comprehensive effort for integrating independently-developed medical thesauri and ontologies. UMLS-Meta is being used in many applications, including PubMed and ClinicalTrials.gov. The integration of new sources combines automatic techniques, expert assessment, and auditing protocols. The automatic techniques currently in use, however, are mostly based on lexical algorithms and often disregard the semantics of the sources being integrated. Results In this paper, we argue that UMLS-Meta’s current design and auditing methodologies could be significantly enhanced by taking into account the logic-based semantics of the ontology sources. We provide empirical evidence suggesting that UMLS-Meta in its 2009AA version contains a significant number of errors; these errors become immediately apparent if the rich semantics of the ontology sources is taken into account, manifesting themselves as unintended logical consequences that follow from the ontology sources together with the information in UMLS-Meta. We then propose general principles and specific logic-based techniques to effectively detect and repair such errors. Conclusions Our results suggest that the methodologies employed in the design of UMLS-Meta are not only very costly in terms of human effort, but also error-prone. The techniques presented here can be useful for both reducing human effort in the design and maintenance of UMLS-Meta and improving the quality of its contents. PMID:21388571
Unambiguous UML Composite Structures: The OMEGA2 Experience

NASA Astrophysics Data System (ADS)

Ober, Iulian; Dragomir, Iulia

Starting from version 2.0, UML introduced hierarchical composite structures, which are a very expressive way of defining complex software architectures, but which have a very loosely defined semantics in the standard. In this paper we propose a set of consistency rules that ensure UML composite structures are unambiguous and can be given a precise semantics. Our primary application of the static consistency rules defined in this paper is within the OMEGA UML profile [6], but these rules are general and applicable to other hierarchical component models based on the same concepts, such as MARTE GCM or SysML. The rule set has been formalized in OCL and is currently used in the OMEGA UML compiler.
Access to Biomedical Information: The Unified Medical Language System.

ERIC Educational Resources Information Center

Squires, Steven J.

1993-01-01

Describes the development of a Unified Medical Language System (UMLS) by the National Library of Medicine that will retrieve and integrate information from a variety of information resources. Highlights include the metathesaurus; the UMLS semantic network; semantic locality; information sources map; evaluation of the metathesaurus; future…
Overcoming an obstacle in expanding a UMLS semantic type extent.

PubMed

Chen, Yan; Gu, Huanying; Perl, Yehoshua; Geller, James

2012-02-01

This paper strives to overcome a major problem encountered by a previous expansion methodology for discovering concepts highly likely to be missing a specific semantic type assignment in the UMLS. This methodology is the basis for an algorithm that presents the discovered concepts to a human auditor for review and possible correction. We analyzed the problem of the previous expansion methodology and discovered that it was due to an obstacle constituted by one or more concepts assigned the UMLS Semantic Network semantic type Classification. A new methodology was designed that bypasses such an obstacle without a combinatorial explosion in the number of concepts presented to the human auditor for review. The new expansion methodology with obstacle avoidance was tested with the semantic type Experimental Model of Disease and found over 500 concepts missed by the previous methodology that are in need of this semantic type assignment. Furthermore, other semantic types suffering from the same major problem were discovered, indicating that the methodology is of more general applicability. The algorithmic discovery of concepts that are likely missing a semantic type assignment is possible even in the face of obstacles, without an explosion in the number of processed concepts. Copyright © 2011 Elsevier Inc. All rights reserved.
Overcoming an Obstacle in Expanding a UMLS Semantic Type Extent

PubMed Central

Chen, Yan; Gu, Huanying; Perl, Yehoshua; Geller, James

2011-01-01

This paper strives to overcome a major problem encountered by a previous expansion methodology for discovering concepts highly likely to be missing a specific semantic type assignment in the UMLS. This methodology is the basis for an algorithm that presents the discovered concepts to a human auditor for review and possible correction. We analyzed the problem of the previous expansion methodology and discovered that it was due to an obstacle constituted by one or more concepts assigned the UMLS Semantic Network semantic type Classification. A new methodology was designed that bypasses such an obstacle without a combinatorial explosion in the number of concepts presented to the human auditor for review. The new expansion methodology with obstacle avoidance was tested with the semantic type Experimental Model of Disease and found over 500 concepts missed by the previous methodology that are in need of this semantic type assignment. Furthermore, other semantic types suffering from the same major problem were discovered, indicating that the methodology is of more general applicability. The algorithmic discovery of concepts that are likely missing a semantic type assignment is possible even in the face of obstacles, without an explosion in the number of processed concepts. PMID:21925287

Analyzing polysemous concepts from a clinical perspective: Application to auditing concept categorization in the UMLS

PubMed Central

Mougin, Fleur; Bodenreider, Olivier; Burgun, Anita

2015-01-01

Objectives Polysemy is a frequent issue in biomedical terminologies. In the Unified Medical Language System (UMLS), polysemous terms are either represented as several independent concepts, or clustered into a single, multiply-categorized concept. The objective of this study is to analyze polysemous concepts in the UMLS through their categorization and hierarchical relations for auditing purposes. Methods We used the association of a concept with multiple Semantic Groups (SGs) as a surrogate for polysemy. We first extracted multi-SG (MSG) concepts from the UMLS Metathesaurus and characterized them in terms of the combinations of SGs with which they are associated. We then clustered MSG concepts in order to identify major types of polysemy. We also analyzed the inheritance of SGs in MSG concepts. Finally, we manually reviewed the categorization of the MSG concepts for auditing purposes. Results The 1208 MSG concepts in the Metathesaurus are associated with 30 distinct pairs of SGs. We created 75 semantically homogeneous clusters of MSG concepts, and 276 MSG concepts could not be clustered for lack of hierarchical relations. The clusters were characterized by the most frequent pairs of semantic types of their constituent MSG concepts. MSG concepts exhibit limited semantic compatibility with their parent and child concepts. A large majority of MSG concepts (92%) are adequately categorized. Examples of miscategorized concepts are presented. Conclusion This work is a systematic analysis and manual review of all concepts categorized by multiple SGs in the UMLS. The correctly-categorized MSG concepts do reflect polysemy in the UMLS Metathesaurus. The analysis of inheritance of SGs proved useful for auditing concept categorization in the UMLS. PMID:19303057
A unified representation of findings in clinical radiology using the UMLS and DICOM.

PubMed

Bertaud, Valérie; Lasbleiz, Jérémy; Mougin, Fleur; Burgun, Anita; Duvauferrier, Régis

2008-09-01

Collecting and analyzing findings constitute the basis of medical activity. Computer assisted medical activity raises the problem of modelling findings. We propose a unified representation of findings integrating the representations of findings in the GAMUTS in Radiology [M.M. Reeder, B. Felson, GAMUTS in radiology Comprehensive lists of roentgen differential diagnosis, fourth ed., 2003], the Unified Medical Language System (UMLS), and the Digital Imaging and Communication in Medicine Structured Report (DICOM-SR). Starting from a corpus of findings in bone and joint radiology [M.M. Reeder, B. Felson, GAMUTS in Radiology comprehensive lists of roentgen differential diagnosis, fourth ed., 2003] (3481 words), an automated mapping to the UMLS was performed with the Metamap Program. The resulting UMLS terms and Semantic Types were analyzed in order to find a generic template in accordance with DICOM-SR structure. UMLS Concepts were missing for 45% of the GAMUTS findings. Three kinds of regularities were observed in the way the Semantic Types were combined: "pathological findings", "physiological findings" and "anatomical findings". A generic and original DICOM-SR template modelling finding was proposed. It was evaluated for representing GAMUTS jaws findings. 21% missing terms had to be picked up from Radlex (5%) or created (16%). This article shows that it is possible to represent findings using the UMLS and the DICOM SR formalism with a semi-automated method. The Metamap program helped to find a model to represent the semantic structure of free texts with standardized terms (UMLS Concepts). Nevertheless, the coverage of the UMLS is not comprehensive. This study shows that the UMLS should include more technical concepts and more concepts regarding findings, signs and symptoms to be suitable for radiology representation. The semi-automated translation of the whole GAMUTS using the UMLS concepts and the DICOM SR relations could help to create or supplement the DCMR Templates and Context Groups pertaining to the description of imaging findings.
Using ontology-based semantic similarity to facilitate the article screening process for systematic reviews.

PubMed

Ji, Xiaonan; Ritter, Alan; Yen, Po-Yin

2017-05-01

Systematic Reviews (SRs) are utilized to summarize evidence from high quality studies and are considered the preferred source of evidence-based practice (EBP). However, conducting SRs can be time and labor intensive due to the high cost of article screening. In previous studies, we demonstrated utilizing established (lexical) article relationships to facilitate the identification of relevant articles in an efficient and effective manner. Here we propose to enhance article relationships with background semantic knowledge derived from Unified Medical Language System (UMLS) concepts and ontologies. We developed a pipelined semantic concepts representation process to represent articles from an SR into an optimized and enriched semantic space of UMLS concepts. Throughout the process, we leveraged concepts and concept relations encoded in biomedical ontologies (SNOMED-CT and MeSH) within the UMLS framework to prompt concept features of each article. Article relationships (similarities) were established and represented as a semantic article network, which was readily applied to assist with the article screening process. We incorporated the concept of active learning to simulate an interactive article recommendation process, and evaluated the performance on 15 completed SRs. We used work saved over sampling at 95% recall (WSS95) as the performance measure. We compared the WSS95 performance of our ontology-based semantic approach to existing lexical feature approaches and corpus-based semantic approaches, and found that we had better WSS95 in most SRs. We also had the highest average WSS95 of 43.81% and the highest total WSS95 of 657.18%. We demonstrated using ontology-based semantics to facilitate the identification of relevant articles for SRs. Effective concepts and concept relations derived from UMLS ontologies can be utilized to establish article semantic relationships. Our approach provided a promising performance and can easily apply to any SR topics in the biomedical domain with generalizability. Copyright © 2017 Elsevier Inc. All rights reserved.
Relating UMLS semantic types and task-based ontology to computer-interpretable clinical practice guidelines.

PubMed

Kumar, Anand; Ciccarese, Paolo; Quaglini, Silvana; Stefanelli, Mario; Caffi, Ezio; Boiocchi, Lorenzo

2003-01-01

Medical knowledge in clinical practice guideline (GL) texts is the source of task-based computer-interpretable clinical guideline models (CIGMs). We have used Unified Medical Language System (UMLS) semantic types (STs) to understand the percentage of GL text which belongs to a particular ST. We also use UMLS semantic network together with the CIGM-specific ontology to derive a semantic meaning behind the GL text. In order to achieve this objective, we took nine GL texts from the National Guideline Clearinghouse (NGC) and marked up the text dealing with a particular ST. The STs we took into consideration were restricted taking into account the requirements of a task-based CIGM. We used DARPA Agent Markup Language and Ontology Inference Layer (DAML + OIL) to create the UMLS and CIGM specific semantic network. For the latter, as a bench test, we used the 1999 WHO-International Society of Hypertension Guidelines for the Management of Hypertension. We took into consideration the UMLS STs closest to the clinical tasks. The percentage of the GL text dealing with the ST "Health Care Activity" and subtypes "Laboratory Procedure", "Diagnostic Procedure" and "Therapeutic or Preventive Procedure" were measured. The parts of text belonging to other STs or comments were separated. A mapping of terms belonging to other STs was done to the STs under "HCA" for representation in DAML + OIL. As a result, we found that the three STs under "HCA" were the predominant STs present in the GL text. In cases where the terms of related STs existed, they were mapped into one of the three STs. The DAML + OIL representation was able to describe the hierarchy in task-based CIGMs. To conclude, we understood that the three STs could be used to represent the semantic network of the task-bases CIGMs. We identified some mapping operators which could be used for the mapping of other STs into these.
Enhanced semantic interoperability by profiling health informatics standards.

PubMed

López, Diego M; Blobel, Bernd

2009-01-01

Several standards applied to the healthcare domain support semantic interoperability. These standards are far from being completely adopted in health information system development, however. The objective of this paper is to provide a method and suggest the necessary tooling for reusing standard health information models, by that way supporting the development of semantically interoperable systems and components. The approach is based on the definition of UML Profiles. UML profiling is a formal modeling mechanism to specialize reference meta-models in such a way that it is possible to adapt those meta-models to specific platforms or domains. A health information model can be considered as such a meta-model. The first step of the introduced method identifies the standard health information models and tasks in the software development process in which healthcare information models can be reused. Then, the selected information model is formalized as a UML Profile. That Profile is finally applied to system models, annotating them with the semantics of the information model. The approach is supported on Eclipse-based UML modeling tools. The method is integrated into a comprehensive framework for health information systems development, and the feasibility of the approach is demonstrated in the analysis, design, and implementation of a public health surveillance system, reusing HL7 RIM and DIMs specifications. The paper describes a method and the necessary tooling for reusing standard healthcare information models. UML offers several advantages such as tooling support, graphical notation, exchangeability, extensibility, semi-automatic code generation, etc. The approach presented is also applicable for harmonizing different standard specifications.
Towards a semantic medical Web: HealthCyberMap's tool for building an RDF metadata base of health information resources based on the Qualified Dublin Core Metadata Set.

PubMed

Boulos, Maged N; Roudsari, Abdul V; Carson, Ewart R

2002-07-01

HealthCyberMap (http://healthcybermap.semanticweb.org/) aims at mapping Internet health information resources in novel ways for enhanced retrieval and navigation. This is achieved by collecting appropriate resource metadata in an unambiguous form that preserves semantics. We modelled a qualified Dublin Core (DC) metadata set ontology with extra elements for resource quality and geographical provenance in Prot g -2000. A metadata collection form helps acquiring resource instance data within Prot g . The DC subject field is populated with UMLS terms directly imported from UMLS Knowledge Source Server using UMLS tab, a Prot g -2000 plug-in. The project is saved in RDFS/RDF. The ontology and associated form serve as a free tool for building and maintaining an RDF medical resource metadata base. The UMLS tab enables browsing and searching for concepts that best describe a resource, and importing them to DC subject fields. The resultant metadata base can be used with a search and inference engine, and have textual and/or visual navigation interface(s) applied to it, to ultimately build a medical Semantic Web portal. Different ways of exploiting Prot g -2000 RDF output are discussed. By making the context and semantics of resources, not merely their raw text and formatting, amenable to computer 'understanding,' we can build a Semantic Web that is more useful to humans than the current Web. This requires proper use of metadata and ontologies. Clinical codes can reliably describe the subjects of medical resources, establish the semantic relationships (as defined by underlying coding scheme) between related resources, and automate their topical categorisation.
Towards a Semantic Lexicon for Biological Language Processing

DOE PAGES

Verspoor, Karin

2005-01-01

This paper explores the use of the resources in the National Library of Medicine's Unified Medical Language System (UMLS) for the construction of a lexicon useful for processing texts in the field of molecular biology. A lexicon is constructed from overlapping terms in the UMLS SPECIALIST lexicon and the UMLS Metathesaurus to obtain both morphosyntactic and semantic information for terms, and the coverage of a domain corpus is assessed. Over 77% of tokens in the domain corpus are found in the constructed lexicon, validating the lexicon's coverage of the most frequent terms in the domain and indicating that the constructedmore » lexicon is potentially an important resource for biological text processing.« less
Consumers' Use of UMLS Concepts on Social Media: Diabetes-Related Textual Data Analysis in Blog and Social Q&A Sites.

PubMed

Park, Min Sook; He, Zhe; Chen, Zhiwei; Oh, Sanghee; Bian, Jiang

2016-11-24

The widely known terminology gap between health professionals and health consumers hinders effective information seeking for consumers. The aim of this study was to better understand consumers' usage of medical concepts by evaluating the coverage of concepts and semantic types of the Unified Medical Language System (UMLS) on diabetes-related postings in 2 types of social media: blogs and social question and answer (Q&A). We collected 2 types of social media data: (1) a total of 3711 blogs tagged with "diabetes" on Tumblr posted between February and October 2015; and (2) a total of 58,422 questions and associated answers posted between 2009 and 2014 in the diabetes category of Yahoo! Answers. We analyzed the datasets using a widely adopted biomedical text processing framework Apache cTAKES and its extension YTEX. First, we applied the named entity recognition (NER) method implemented in YTEX to identify UMLS concepts in the datasets. We then analyzed the coverage and the popularity of concepts in the UMLS source vocabularies across the 2 datasets (ie, blogs and social Q&A). Further, we conducted a concept-level comparative coverage analysis between SNOMED Clinical Terms (SNOMED CT) and Open-Access Collaborative Consumer Health Vocabulary (OAC CHV)-the top 2 UMLS source vocabularies that have the most coverage on our datasets. We also analyzed the UMLS semantic types that were frequently observed in our datasets. We identified 2415 UMLS concepts from blog postings, 6452 UMLS concepts from social Q&A questions, and 10,378 UMLS concepts from the answers. The medical concepts identified in the blogs can be covered by 56 source vocabularies in the UMLS, while those in questions and answers can be covered by 58 source vocabularies. SNOMED CT was the dominant vocabulary in terms of coverage across all the datasets, ranging from 84.9% to 95.9%. It was followed by OAC CHV (between 73.5% and 80.0%) and Metathesaurus Names (MTH) (between 55.7% and 73.5%). All of the social media datasets shared frequent semantic types such as "Amino Acid, Peptide, or Protein," "Body Part, Organ, or Organ Component," and "Disease or Syndrome." Although the 3 social media datasets vary greatly in size, they exhibited similar conceptual coverage among UMLS source vocabularies and the identified concepts showed similar semantic type distributions. As such, concepts that are both frequently used by consumers and also found in professional vocabularies such as SNOMED CT can be suggested to OAC CHV to improve its coverage. ©Min Sook Park, Zhe He, Zhiwei Chen, Sanghee Oh, Jiang Bian. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 24.11.2016.
Consumers’ Use of UMLS Concepts on Social Media: Diabetes-Related Textual Data Analysis in Blog and Social Q&A Sites

PubMed Central

Chen, Zhiwei; Oh, Sanghee; Bian, Jiang

2016-01-01

Background The widely known terminology gap between health professionals and health consumers hinders effective information seeking for consumers. Objective The aim of this study was to better understand consumers’ usage of medical concepts by evaluating the coverage of concepts and semantic types of the Unified Medical Language System (UMLS) on diabetes-related postings in 2 types of social media: blogs and social question and answer (Q&A). Methods We collected 2 types of social media data: (1) a total of 3711 blogs tagged with “diabetes” on Tumblr posted between February and October 2015; and (2) a total of 58,422 questions and associated answers posted between 2009 and 2014 in the diabetes category of Yahoo! Answers. We analyzed the datasets using a widely adopted biomedical text processing framework Apache cTAKES and its extension YTEX. First, we applied the named entity recognition (NER) method implemented in YTEX to identify UMLS concepts in the datasets. We then analyzed the coverage and the popularity of concepts in the UMLS source vocabularies across the 2 datasets (ie, blogs and social Q&A). Further, we conducted a concept-level comparative coverage analysis between SNOMED Clinical Terms (SNOMED CT) and Open-Access Collaborative Consumer Health Vocabulary (OAC CHV)—the top 2 UMLS source vocabularies that have the most coverage on our datasets. We also analyzed the UMLS semantic types that were frequently observed in our datasets. Results We identified 2415 UMLS concepts from blog postings, 6452 UMLS concepts from social Q&A questions, and 10,378 UMLS concepts from the answers. The medical concepts identified in the blogs can be covered by 56 source vocabularies in the UMLS, while those in questions and answers can be covered by 58 source vocabularies. SNOMED CT was the dominant vocabulary in terms of coverage across all the datasets, ranging from 84.9% to 95.9%. It was followed by OAC CHV (between 73.5% and 80.0%) and Metathesaurus Names (MTH) (between 55.7% and 73.5%). All of the social media datasets shared frequent semantic types such as “Amino Acid, Peptide, or Protein,” “Body Part, Organ, or Organ Component,” and “Disease or Syndrome.” Conclusions Although the 3 social media datasets vary greatly in size, they exhibited similar conceptual coverage among UMLS source vocabularies and the identified concepts showed similar semantic type distributions. As such, concepts that are both frequently used by consumers and also found in professional vocabularies such as SNOMED CT can be suggested to OAC CHV to improve its coverage. PMID:27884812
Scrutinizing UML Activity Diagrams

NASA Astrophysics Data System (ADS)

Al-Fedaghi, Sabah

Building an information system involves two processes: conceptual modeling of the “real world domain” and designing the software system. Object-oriented methods and languages (e.g., UML) are typically used for describing the software system. For the system analysis process that produces the conceptual description, object-oriented techniques or semantics extensions are utilized. Specifically, UML activity diagrams are the “flow charts” of object-oriented conceptualization tools. This chapter proposes an alternative to UML activity diagrams through the development of a conceptual modeling methodology based on the notion of flow.
Enhancing acronym/abbreviation knowledge bases with semantic information.

PubMed

Torii, Manabu; Liu, Hongfang

2007-10-11

In the biomedical domain, a terminology knowledge base that associates acronyms/abbreviations (denoted as SFs) with the definitions (denoted as LFs) is highly needed. For the construction such terminology knowledge base, we investigate the feasibility to build a system automatically assigning semantic categories to LFs extracted from text. Given a collection of pairs (SF,LF) derived from text, we i) assess the coverage of LFs and pairs (SF,LF) in the UMLS and justify the need of a semantic category assignment system; and ii) automatically derive name phrases annotated with semantic category and construct a system using machine learning. Utilizing ADAM, an existing collection of (SF,LF) pairs extracted from MEDLINE, our system achieved an f-measure of 87% when assigning eight UMLS-based semantic groups to LFs. The system has been incorporated into a web interface which integrates SF knowledge from multiple SF knowledge bases. Web site: http://gauss.dbb.georgetown.edu/liblab/SFThesurus.
Meeting medical terminology needs--the Ontology-Enhanced Medical Concept Mapper.

PubMed

Leroy, G; Chen, H

2001-12-01

This paper describes the development and testing of the Medical Concept Mapper, a tool designed to facilitate access to online medical information sources by providing users with appropriate medical search terms for their personal queries. Our system is valuable for patients whose knowledge of medical vocabularies is inadequate to find the desired information, and for medical experts who search for information outside their field of expertise. The Medical Concept Mapper maps synonyms and semantically related concepts to a user's query. The system is unique because it integrates our natural language processing tool, i.e., the Arizona (AZ) Noun Phraser, with human-created ontologies, the Unified Medical Language System (UMLS) and WordNet, and our computer generated Concept Space, into one system. Our unique contribution results from combining the UMLS Semantic Net with Concept Space in our deep semantic parsing (DSP) algorithm. This algorithm establishes a medical query context based on the UMLS Semantic Net, which allows Concept Space terms to be filtered so as to isolate related terms relevant to the query. We performed two user studies in which Medical Concept Mapper terms were compared against human experts' terms. We conclude that the AZ Noun Phraser is well suited to extract medical phrases from user queries, that WordNet is not well suited to provide strictly medical synonyms, that the UMLS Metathesaurus is well suited to provide medical synonyms, and that Concept Space is well suited to provide related medical terms, especially when these terms are limited by our DSP algorithm.
Using the Unified Modelling Language (UML) to guide the systemic description of biological processes and systems.

PubMed

Roux-Rouquié, Magali; Caritey, Nicolas; Gaubert, Laurent; Rosenthal-Sabroux, Camille

2004-07-01

One of the main issues in Systems Biology is to deal with semantic data integration. Previously, we examined the requirements for a reference conceptual model to guide semantic integration based on the systemic principles. In the present paper, we examine the usefulness of the Unified Modelling Language (UML) to describe and specify biological systems and processes. This makes unambiguous representations of biological systems, which would be suitable for translation into mathematical and computational formalisms, enabling analysis, simulation and prediction of these systems behaviours.
Dynamic generation of a table of contents with consumer-friendly labels.

PubMed

Miller, Trudi; Leroy, Gondy; Wood, Elizabeth

2006-01-01

Consumers increasingly look to the Internet for health information, but available resources are too difficult for the majority to understand. Interactive tables of contents (TOC) can help consumers access health information by providing an easy to understand structure. Using natural language processing and the Unified Medical Language System (UMLS), we have automatically generated TOCs for consumer health information. The TOC are categorized according to consumer-friendly labels for the UMLS semantic types and semantic groups. Categorizing phrases by semantic types is significantly more correct and relevant. Greater correctness and relevance was achieved with documents that are difficult to read than those at an easier reading level. Pruning TOCs to use categories that consumers favor further increases relevancy and correctness while reducing structural complexity.
The Neighborhood Auditing Tool: a hybrid interface for auditing the UMLS.

PubMed

Morrey, C Paul; Geller, James; Halper, Michael; Perl, Yehoshua

2009-06-01

The UMLS's integration of more than 100 source vocabularies, not necessarily consistent with one another, causes some inconsistencies. The purpose of auditing the UMLS is to detect such inconsistencies and to suggest how to resolve them while observing the requirement of fully representing the content of each source in the UMLS. A software tool, called the Neighborhood Auditing Tool (NAT), that facilitates UMLS auditing is presented. The NAT supports "neighborhood-based" auditing, where, at any given time, an auditor concentrates on a single-focus concept and one of a variety of neighborhoods of its closely related concepts. Typical diagrammatic displays of concept networks have a number of shortcomings, so the NAT utilizes a hybrid diagram/text interface that features stylized neighborhood views which retain some of the best features of both the diagrammatic layouts and text windows while avoiding the shortcomings. The NAT allows an auditor to display knowledge from both the Metathesaurus (concept) level and the Semantic Network (semantic type) level. Various additional features of the NAT that support the auditing process are described. The usefulness of the NAT is demonstrated through a group of case studies. Its impact is tested with a study involving a select group of auditors.
Structural Group-based Auditing of Missing Hierarchical Relationships in UMLS

PubMed Central

Chen, Yan; Gu, Huanying(Helen); Perl, Yehoshua; Geller, James

2009-01-01

The Metathesaurus of the UMLS was created by integrating various source terminologies. The inter-concept relationships were either integrated into the UMLS from the source terminologies or specially generated. Due to the extensive size and inherent complexity of the Metathesaurus, the accidental omission of some hierarchical relationships was inevitable. We present a recursive procedure which allows a human expert, with the support of an algorithm, to locate missing hierarchical relationships. The procedure starts with a group of concepts with exactly the same (correct) semantic type assignments. It then partitions the concepts, based on child-of hierarchical relationships, into smaller, singly rooted, hierarchically connected subgroups. The auditor only needs to focus on the subgroups with very few concepts and their concepts with semantic type reassignments. The procedure was evaluated by comparing it with a comprehensive manual audit and it exhibits a perfect error recall. PMID:18824248
Towards a semantic lexicon for biological language processing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Verspoor, K.

It is well understood that natural language processing (NLP) applications require sophisticated lexical resources to support their processing goals. In the biomedical domain, we are privileged to have access to extensive terminological resources in the form of controlled vocabularies and ontologies, which have been integrated into the framework of the National Library of Medicine's Unified Medical Language System's (UMLS) Metathesaurus. However, the existence of such terminological resources does not guarantee their utility for NLP. In particular, we have two core requirements for lexical resources for NLP in addition to the basic enumeration of important domain terms: representation of morphosyntactic informationmore » about those terms, specifically part of speech information and inflectional patterns to support parsing and lemma assignment, and representation of semantic information indicating general categorical information about terms, and significant relations between terms to support text understanding and inference (Hahn et at, 1999). Biomedical vocabularies by and large commonly leave out morphosyntactic information, and where they address semantic considerations, they often do so in an unprincipled manner, for instance by indicating a relation between two concepts without indicating the type of that relation. But all is not lost. The UMLS knowledge sources include two additional resources which are relevant - the SPECIALIST lexicon, a lexicon addressing our morphosyntactic requirements, and the Semantic Network, a representation of core conceptual categories in the biomedical domain. The coverage of these two knowledge sources with respect to the full coverage of the Metathesaurus is, however, not entirely clear. Furthermore, when our goals are specifically to process biological text - and often more specifically, text in the molecular biology domain - it is difficult to say whether the coverage of these resources is meaningful. The utility of the UMLS knowledge sources for medical language processing (MLP) has been explored (Johnson, 1999; Friedman et al 2001); the time has now come to repeat these experiments with respect to biological language processing (BLP). To that end, this paper presents an analysis of ihe UMLS resources, specifically with an eye towards constructing lexical resources suitable for BLP. We follow the paradigm presented in Johnson (1999) for medical language, exploring overlap between the UMLS Metathesaurus and SPECIALIST lexicon to construct a morphosyntactic and semantically-specified lexicon, and then further explore the overlap with a relevant domain corpus for molecular biology.« less
Augmenting Oracle Text with the UMLS for enhanced searching of free-text medical reports.

PubMed

Ding, Jing; Erdal, Selnur; Dhaval, Rakesh; Kamal, Jyoti

2007-10-11

The intrinsic complexity of free-text medical reports imposes great challenges for information retrieval systems. We have developed a prototype search engine for retrieving clinical reports that leverages the powerful indexing and querying capabilities of Oracle Text, and the rich biomedical domain knowledge and semantic structures that are captured in the UMLS Metathesaurus.
Dual deep modeling: multi-level modeling with dual potencies and its formalization in F-Logic.

PubMed

Neumayr, Bernd; Schuetz, Christoph G; Jeusfeld, Manfred A; Schrefl, Michael

2018-01-01

An enterprise database contains a global, integrated, and consistent representation of a company's data. Multi-level modeling facilitates the definition and maintenance of such an integrated conceptual data model in a dynamic environment of changing data requirements of diverse applications. Multi-level models transcend the traditional separation of class and object with clabjects as the central modeling primitive, which allows for a more flexible and natural representation of many real-world use cases. In deep instantiation, the number of instantiation levels of a clabject or property is indicated by a single potency. Dual deep modeling (DDM) differentiates between source potency and target potency of a property or association and supports the flexible instantiation and refinement of the property by statements connecting clabjects at different modeling levels. DDM comes with multiple generalization of clabjects, subsetting/specialization of properties, and multi-level cardinality constraints. Examples are presented using a UML-style notation for DDM together with UML class and object diagrams for the representation of two-level user views derived from the multi-level model. Syntax and semantics of DDM are formalized and implemented in F-Logic, supporting the modeler with integrity checks and rich query facilities.
Indexing Anatomical Phrases in Neuro-Radiology Reports to the UMLS 2005AA

PubMed Central

Bashyam, Vijayaraghavan; Taira, Ricky K.

2005-01-01

This work describes a methodology to index anatomical phrases to the 2005AA release of the Unified Medical Language System (UMLS). A phrase chunking tool based on Natural Language Processing (NLP) was developed to identify semantically coherent phrases within medical reports. Using this phrase chunker, a set of 2,551 unique anatomical phrases was extracted from brain radiology reports. These phrases were mapped to the 2005AA release of the UMLS using a vector space model. Precision for the task of indexing unique phrases was 0.87. PMID:16778995

Representing Thoughts, Words, and Things in the UMLS

PubMed Central

Campbell, Keith E.; Oliver, Diane E.; Spackman, Kent A.; Shortliffe, Edward H.

1998-01-01

The authors describe a framework, based on the Ogden-Richards semiotic triangle, for understanding the relationship between the Unified Medical Language System (UMLS) and the source terminologies from which the UMLS derives its content. They pay particular attention to UMLS's Concept Unique Identifier (CUI) and the sense of “meaning” it represents as contrasted with the sense of “meaning” represented by the source terminologies. The CUI takes on emergent meaning through linkage to terms in different terminology systems. In some cases, a CUI's emergent meaning can differ significantly from the original sources' intended meanings of terms linked by that CUI. Identification of these different senses of meaning within the UMLS is consistent with historical themes of semantic interpretation of language. Examination of the UMLS within such a historical framework makes it possible to better understand the strengths and limitations of the UMLS approach for integrating disparate terminologic systems and to provide a model, or theoretic foundation, for evaluating the UMLS as a Possible World—that is, as a mathematical formalism that represents propositions about some perspective or interpretation of the physical world. PMID:9760390
The UMLS Knowledge Sources: Tools for Building Better User Interfaces

PubMed Central

Lindberg, Donald A. B.; Humphreys, Betsy L.

1990-01-01

The current focus of the National Library of Medicine's Unified Medical Language System (UMLS) project is the development, testing, and evaluation of the first versions of three new knowledge sources: the Metathesaurus, the Semantic Network, and the Information Sources Map. These three knowledge sources can be used by interface programs to conduct an intelligent interaction with the user and to make the conceptual link between the user's question and relevant machine-readable information. NLM is providing experimental copies of the initial versions of the UMLS knowledge sources in exchange for feedback on ways they can and should be improved. The hope is that the results of such experimentation will provide both immediate improvements in biomedical information service and useful suggestions for enhancements to the UMLS.
The MP (Materialization Pattern) Model for Representing Math Educational Standards

NASA Astrophysics Data System (ADS)

Choi, Namyoun; Song, Il-Yeol; An, Yuan

Representing natural languages with UML has been an important research issue for various reasons. Little work has been done for modeling imperative mood sentences which are the sentence structure of math educational standard statements. In this paper, we propose the MP (Materialization Pattern) model that captures the semantics of English sentences used in math educational standards. The MP model is based on the Reed-Kellogg sentence diagrams and creates MP schemas with the UML notation. The MP model explicitly represents the semantics of the sentences by extracting math concepts and the cognitive process of math concepts from math educational standard statements, and simplifies modeling. This MP model is also developed to be used for aligning math educational standard statements via schema matching.
The Neighborhood Auditing Tool: A Hybrid Interface for Auditing the UMLS

PubMed Central

Morrey, C. Paul; Geller, James; Halper, Michael; Perl, Yehoshua

2009-01-01

The UMLS’s integration of more than 100 source vocabularies, not necessarily consistent with one another, causes some inconsistencies. The purpose of auditing the UMLS is to detect such inconsistencies and to suggest how to resolve them while observing the requirement of fully representing the content of each source in the UMLS. A software tool, called the Neighborhood Auditing Tool (NAT), that facilitates UMLS auditing is presented. The NAT supports “neighborhood-based” auditing, where, at any given time, an auditor concentrates on a single focus concept and one of a variety of neighborhoods of its closely related concepts. Typical diagrammatic displays of concept networks have a number of shortcomings, so the NAT utilizes a hybrid diagram/text interface that features stylized neighborhood views which retain some of the best features of both the diagrammatic layouts and text windows while avoiding the shortcomings. The NAT allows an auditor to display knowledge from both the Metathesaurus (concept) level and the Semantic Network (semantic type) level. Various additional features of the NAT that support the auditing process are described. The usefulness of the NAT is demonstrated through a group of case studies. Its impact is tested with a study involving a select group of auditors. PMID:19475725
Graphical tool for navigation within the semantic network of the UMLS metathesaurus on a locally installed database.

PubMed

Frankewitsch, T; Prokosch, H U

2000-01-01

Knowledge in the environment of information technologies is bound to structured vocabularies. Medical data dictionaries are necessary for uniquely describing findings like diagnoses, procedures or functions. Therefore we decided to locally install a version of the Unified Medical Language System (UMLS) of the U.S. National Library of Medicine as a repository for defining entries of a medical multimedia database. Because of the requirement to extend the vocabulary in concepts and relations between existing concepts a graphical tool for appending new items to the database has been developed: Although the database is an instance of a semantic network the focus on single entries offers the opportunity of reducing the net to a tree within this detail. Based on the graph theorem, there are definitions of nodes of concepts and nodes of knowledge. The UMLS additionally offers the specification of sub-relations, which can be represented, too. Using this view it is possible to manage these 1:n-Relations in a simple tree view. On this background an explorer like graphical user interface has been realised to add new concepts and define new relationships between those and existing entries for adapting the UMLS for specific purposes such as describing medical multimedia objects.
Representation of Nursing Terminologies in UMLS

PubMed Central

Kim, Tae Youn; Coenen, Amy; Hardiker, Nicholas; Bartz, Claudia C.

2011-01-01

There are seven nursing terminologies or classifications that are considered a standard to support nursing practice in the U.S. Harmonizing these terminologies will enhance the interoperability of clinical data documented across nursing practice. As a first step to harmonize the nursing terminologies, the purpose of this study was to examine how nursing problems or diagnostic concepts from select terminologies were cross-mapped in Unified Medical Language System (UMLS). A comparison analysis was conducted by examining whether cross-mappings available in UMLS through concept unique identifiers were consistent with cross-mappings conducted by human experts. Of 423 concepts from three terminologies, 411 (97%) were manually cross-mapped by experts to the International Classification for Nursing Practice. The UMLS semantic mapping among the 411 nursing concepts presented 33.6% accuracy (i.e., 138 of 411 concepts) when compared to expert cross-mappings. Further research and collaboration among experts in this field are needed for future enhancement of UMLS. PMID:22195127
An Enriched Unified Medical Language System Semantic Network with a Multiple Subsumption Hierarchy

PubMed Central

Zhang, Li; Perl, Yehoshua; Halper, Michael; Geller, James; Cimino, James J.

2004-01-01

Objective: The Unified Medical Language System's (UMLS's) Semantic Network's (SN's) two-tree structure is restrictive because it does not allow a semantic type to be a specialization of several other semantic types. In this article, the SN is expanded into a multiple subsumption structure with a directed acyclic graph (DAG) IS-A hierarchy, allowing a semantic type to have multiple parents. New viable IS-A links are added as warranted. Design: Two methodologies are presented to identify and add new viable IS-A links. The first methodology is based on imposing the characteristic of connectivity on a previously presented partition of the SN. Four transformations are provided to find viable IS-A links in the process of converting the partition's disconnected groups into connected ones. The second methodology identifies new IS-A links through a string matching process involving names and definitions of various semantic types in the SN. A domain expert is needed to review all the results to determine the validity of the new IS-A links. Results: Nineteen new IS-A links are added to the SN, and four new semantic types are also created to support the multiple subsumption framework. The resulting network, called the Enriched Semantic Network (ESN), exhibits a DAG-structured hierarchy. A partition of the ESN containing 19 connected groups is also derived. Conclusion: The ESN is an expanded abstraction of the UMLS compared with the original SN. Its multiple subsumption hierarchy can accommodate semantic types with multiple parents. Its representation thus provides direct access to a broader range of subsumption knowledge. PMID:14764611
Comprehensive Aspectual UML approach to support AspectJ.

PubMed

Magableh, Aws; Shukur, Zarina; Ali, Noorazean Mohd

2014-01-01

Unified Modeling Language is the most popular and widely used Object-Oriented modelling language in the IT industry. This study focuses on investigating the ability to expand UML to some extent to model crosscutting concerns (Aspects) to support AspectJ. Through a comprehensive literature review, we identify and extensively examine all the available Aspect-Oriented UML modelling approaches and find that the existing Aspect-Oriented Design Modelling approaches using UML cannot be considered to provide a framework for a comprehensive Aspectual UML modelling approach and also that there is a lack of adequate Aspect-Oriented tool support. This study also proposes a set of Aspectual UML semantic rules and attempts to generate AspectJ pseudocode from UML diagrams. The proposed Aspectual UML modelling approach is formally evaluated using a focus group to test six hypotheses regarding performance; a "good design" criteria-based evaluation to assess the quality of the design; and an AspectJ-based evaluation as a reference measurement-based evaluation. The results of the focus group evaluation confirm all the hypotheses put forward regarding the proposed approach. The proposed approach provides a comprehensive set of Aspectual UML structural and behavioral diagrams, which are designed and implemented based on a comprehensive and detailed set of AspectJ programming constructs.
Comprehensive Aspectual UML Approach to Support AspectJ

PubMed Central

Magableh, Aws; Shukur, Zarina; Mohd. Ali, Noorazean

2014-01-01

Unified Modeling Language is the most popular and widely used Object-Oriented modelling language in the IT industry. This study focuses on investigating the ability to expand UML to some extent to model crosscutting concerns (Aspects) to support AspectJ. Through a comprehensive literature review, we identify and extensively examine all the available Aspect-Oriented UML modelling approaches and find that the existing Aspect-Oriented Design Modelling approaches using UML cannot be considered to provide a framework for a comprehensive Aspectual UML modelling approach and also that there is a lack of adequate Aspect-Oriented tool support. This study also proposes a set of Aspectual UML semantic rules and attempts to generate AspectJ pseudocode from UML diagrams. The proposed Aspectual UML modelling approach is formally evaluated using a focus group to test six hypotheses regarding performance; a “good design” criteria-based evaluation to assess the quality of the design; and an AspectJ-based evaluation as a reference measurement-based evaluation. The results of the focus group evaluation confirm all the hypotheses put forward regarding the proposed approach. The proposed approach provides a comprehensive set of Aspectual UML structural and behavioral diagrams, which are designed and implemented based on a comprehensive and detailed set of AspectJ programming constructs. PMID:25136656
Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods

PubMed Central

Chasin, Rachel; Rumshisky, Anna; Uzuner, Ozlem; Szolovits, Peter

2014-01-01

Objective To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graph-based approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to the topic-modeling techniques that use domain-specific knowledge sources. Materials and methods The graph-based methods use variations of PageRank and distance-based similarity metrics, operating over the Unified Medical Language System (UMLS). Topic-modeling methods use unlabeled data from the Multiparameter Intelligent Monitoring in Intensive Care (MIMIC II) database to derive models for each ambiguous word. We investigate the impact of using different linguistic features for topic models, including UMLS-based and syntactic features. We use a sense-tagged clinical dataset from the Mayo Clinic for evaluation. Results The topic-modeling methods achieve 66.9% accuracy on a subset of the Mayo Clinic's data, while the graph-based methods only reach the 40–50% range, with a most-frequent-sense baseline of 56.5%. Features derived from the UMLS semantic type and concept hierarchies do not produce a gain over bag-of-words features in the topic models, but identifying phrases from UMLS and using syntax does help. Discussion Although topic models outperform graph-based methods, semantic features derived from the UMLS prove too noisy to improve performance beyond bag-of-words. Conclusions Topic modeling for WSD provides superior results in the clinical domain; however, integration of knowledge remains to be effectively exploited. PMID:24441986
The caCORE Software Development Kit: streamlining construction of interoperable biomedical information services.

PubMed

Phillips, Joshua; Chilukuri, Ram; Fragoso, Gilberto; Warzel, Denise; Covitz, Peter A

2006-01-06

Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented application programming interfaces (APIs). The National Cancer Institute (NCI) developed the cancer common ontologic representation environment (caCORE) to provide the infrastructure necessary to achieve interoperability across the systems it develops or sponsors. The caCORE Software Development Kit (SDK) was designed to provide developers both within and outside the NCI with the tools needed to construct such interoperable software systems. The caCORE SDK requires a Unified Modeling Language (UML) tool to begin the development workflow with the construction of a domain information model in the form of a UML Class Diagram. Models are annotated with concepts and definitions from a description logic terminology source using the Semantic Connector component. The annotated model is registered in the Cancer Data Standards Repository (caDSR) using the UML Loader component. System software is automatically generated using the Codegen component, which produces middleware that runs on an application server. The caCORE SDK was initially tested and validated using a seven-class UML model, and has been used to generate the caCORE production system, which includes models with dozens of classes. The deployed system supports access through object-oriented APIs with consistent syntax for retrieval of any type of data object across all classes in the original UML model. The caCORE SDK is currently being used by several development teams, including by participants in the cancer biomedical informatics grid (caBIG) program, to create compatible data services. caBIG compatibility standards are based upon caCORE resources, and thus the caCORE SDK has emerged as a key enabling technology for caBIG. The caCORE SDK substantially lowers the barrier to implementing systems that are syntactically and semantically interoperable by providing workflow and automation tools that standardize and expedite modeling, development, and deployment. It has gained acceptance among developers in the caBIG program, and is expected to provide a common mechanism for creating data service nodes on the data grid that is under development.
Knowledge requirements for automated inference of medical textbook markup.

PubMed Central

Berrios, D. C.; Kehler, A.; Fagan, L. M.

1999-01-01

Indexing medical text in journals or textbooks requires a tremendous amount of resources. We tested two algorithms for automatically indexing nouns, noun-modifiers, and noun phrases, and inferring selected binary relations between UMLS concepts in a textbook of infectious disease. Sixty-six percent of nouns and noun-modifiers and 81% of noun phrases were correctly matched to UMLS concepts. Semantic relations were identified with 100% specificity and 94% sensitivity. For some medical sub-domains, these algorithms could permit expeditious generation of more complex indexing. PMID:10566445
Consistent model driven architecture

NASA Astrophysics Data System (ADS)

Niepostyn, Stanisław J.

2015-09-01

The goal of the MDA is to produce software systems from abstract models in a way where human interaction is restricted to a minimum. These abstract models are based on the UML language. However, the semantics of UML models is defined in a natural language. Subsequently the verification of consistency of these diagrams is needed in order to identify errors in requirements at the early stage of the development process. The verification of consistency is difficult due to a semi-formal nature of UML diagrams. We propose automatic verification of consistency of the series of UML diagrams originating from abstract models implemented with our consistency rules. This Consistent Model Driven Architecture approach enables us to generate automatically complete workflow applications from consistent and complete models developed from abstract models (e.g. Business Context Diagram). Therefore, our method can be used to check practicability (feasibility) of software architecture models.
Ontology Matching with Semantic Verification.

PubMed

Jean-Mary, Yves R; Shironoshita, E Patrick; Kabuka, Mansur R

2009-09-01

ASMOV (Automated Semantic Matching of Ontologies with Verification) is a novel algorithm that uses lexical and structural characteristics of two ontologies to iteratively calculate a similarity measure between them, derives an alignment, and then verifies it to ensure that it does not contain semantic inconsistencies. In this paper, we describe the ASMOV algorithm, and then present experimental results that measure its accuracy using the OAEI 2008 tests, and that evaluate its use with two different thesauri: WordNet, and the Unified Medical Language System (UMLS). These results show the increased accuracy obtained by combining lexical, structural and extensional matchers with semantic verification, and demonstrate the advantage of using a domain-specific thesaurus for the alignment of specialized ontologies.
Auditing the NCI Thesaurus with Semantic Web Technologies

PubMed Central

Mougin, Fleur; Bodenreider, Olivier

2008-01-01

Auditing biomedical terminologies often results in the identification of inconsistencies and thus helps to improve their quality. In this paper, we present a method based on Semantic Web technologies for auditing biomedical terminologies and apply it to the NCI thesaurus. We stored the NCI thesaurus concepts and their properties in an RDF triple store. By querying this store, we assessed the consistency of both hierarchical and associative relations from the NCI thesaurus among themselves and with corresponding relations in the UMLS Semantic Network. We show that the consistency is better for associative relations than for hierarchical relations. Causes for inconsistency and benefits from using Semantic Web technologies for auditing purposes are discussed. PMID:18999265
Auditing the NCI thesaurus with semantic web technologies.

PubMed

Mougin, Fleur; Bodenreider, Olivier

2008-11-06

Auditing biomedical terminologies often results in the identification of inconsistencies and thus helps to improve their quality. In this paper, we present a method based on Semantic Web technologies for auditing biomedical terminologies and apply it to the NCI thesaurus. We stored the NCI thesaurus concepts and their properties in an RDF triple store. By querying this store, we assessed the consistency of both hierarchical and associative relations from the NCI thesaurus among themselves and with corresponding relations in the UMLS Semantic Network. We show that the consistency is better for associative relations than for hierarchical relations. Causes for inconsistency and benefits from using Semantic Web technologies for auditing purposes are discussed.
The caCORE Software Development Kit: Streamlining construction of interoperable biomedical information services

PubMed Central

Phillips, Joshua; Chilukuri, Ram; Fragoso, Gilberto; Warzel, Denise; Covitz, Peter A

2006-01-01

Background Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented application programming interfaces (APIs). The National Cancer Institute (NCI) developed the cancer common ontologic representation environment (caCORE) to provide the infrastructure necessary to achieve interoperability across the systems it develops or sponsors. The caCORE Software Development Kit (SDK) was designed to provide developers both within and outside the NCI with the tools needed to construct such interoperable software systems. Results The caCORE SDK requires a Unified Modeling Language (UML) tool to begin the development workflow with the construction of a domain information model in the form of a UML Class Diagram. Models are annotated with concepts and definitions from a description logic terminology source using the Semantic Connector component. The annotated model is registered in the Cancer Data Standards Repository (caDSR) using the UML Loader component. System software is automatically generated using the Codegen component, which produces middleware that runs on an application server. The caCORE SDK was initially tested and validated using a seven-class UML model, and has been used to generate the caCORE production system, which includes models with dozens of classes. The deployed system supports access through object-oriented APIs with consistent syntax for retrieval of any type of data object across all classes in the original UML model. The caCORE SDK is currently being used by several development teams, including by participants in the cancer biomedical informatics grid (caBIG) program, to create compatible data services. caBIG compatibility standards are based upon caCORE resources, and thus the caCORE SDK has emerged as a key enabling technology for caBIG. Conclusion The caCORE SDK substantially lowers the barrier to implementing systems that are syntactically and semantically interoperable by providing workflow and automation tools that standardize and expedite modeling, development, and deployment. It has gained acceptance among developers in the caBIG program, and is expected to provide a common mechanism for creating data service nodes on the data grid that is under development. PMID:16398930
Design of a web portal for interdisciplinary image retrieval from multiple online image resources.

PubMed

Kammerer, F J; Frankewitsch, T; Prokosch, H-U

2009-01-01

Images play an important role in medicine. Finding the desired images within the multitude of online image databases is a time-consuming and frustrating process. Existing websites do not meet all the requirements for an ideal learning environment for medical students. This work intends to establish a new web portal providing a centralized access point to a selected number of online image databases. A back-end system locates images on given websites and extracts relevant metadata. The images are indexed using UMLS and the MetaMap system provided by the US National Library of Medicine. Specially developed functions allow to create individual navigation structures. The front-end system suits the specific needs of medical students. A navigation structure consisting of several medical fields, university curricula and the ICD-10 was created. The images may be accessed via the given navigation structure or using different search functions. Cross-references are provided by the semantic relations of the UMLS. Over 25,000 images were identified and indexed. A pilot evaluation among medical students showed good first results concerning the acceptance of the developed navigation structures and search features. The integration of the images from different sources into the UMLS semantic network offers a quick and an easy-to-use learning environment.
Medical Concept Normalization in Social Media Posts with Recurrent Neural Networks.

PubMed

Tutubalina, Elena; Miftahutdinov, Zulfat; Nikolenko, Sergey; Malykh, Valentin

2018-06-12

Text mining of scientific libraries and social media has already proven itself as a reliable tool for drug repurposing and hypothesis generation. The task of mapping a disease mention to a concept in a controlled vocabulary, typically to the standard thesaurus in the Unified Medical Language System (UMLS), is known as medical concept normalization. This task is challenging due to the differences in the use of medical terminology between health care professionals and social media texts coming from the lay public. To bridge this gap, we use sequence learning with recurrent neural networks and semantic representation of one- or multi-word expressions: we develop end-to-end architectures directly tailored to the task, including bidirectional Long Short-Term Memory, Gated Recurrent Units with an attention mechanism, and additional semantic similarity features based on UMLS. Our evaluation against a standard benchmark shows that recurrent neural networks improve results over an effective baseline for classification based on convolutional neural networks. A qualitative examination of mentions discovered in a dataset of user reviews collected from popular online health information platforms as well as a quantitative evaluation both show improvements in the semantic representation of health-related expressions in social media. Copyright © 2018. Published by Elsevier Inc.
Semantically-Rigorous Systems Engineering Modeling Using Sysml and OWL

NASA Technical Reports Server (NTRS)

Jenkins, J. Steven; Rouquette, Nicolas F.

2012-01-01

The Systems Modeling Language (SysML) has found wide acceptance as a standard graphical notation for the domain of systems engineering. SysML subsets and extends the Unified Modeling Language (UML) to define conventions for expressing structural, behavioral, and analytical elements, and relationships among them. SysML-enabled modeling tools are available from multiple providers, and have been used for diverse projects in military aerospace, scientific exploration, and civil engineering. The Web Ontology Language (OWL) has found wide acceptance as a standard notation for knowledge representation. OWL-enabled modeling tools are available from multiple providers, as well as auxiliary assets such as reasoners and application programming interface libraries, etc. OWL has been applied to diverse projects in a wide array of fields. While the emphasis in SysML is on notation, SysML inherits (from UML) a semantic foundation that provides for limited reasoning and analysis. UML's partial formalization (FUML), however, does not cover the full semantics of SysML, which is a substantial impediment to developing high confidence in the soundness of any conclusions drawn therefrom. OWL, by contrast, was developed from the beginning on formal logical principles, and consequently provides strong support for verification of consistency and satisfiability, extraction of entailments, conjunctive query answering, etc. This emphasis on formal logic is counterbalanced by the absence of any graphical notation conventions in the OWL standards. Consequently, OWL has had only limited adoption in systems engineering. The complementary strengths and weaknesses of SysML and OWL motivate an interest in combining them in such a way that we can benefit from the attractive graphical notation of SysML and the formal reasoning of OWL. This paper describes an approach to achieving that combination.

SemanticFind: Locating What You Want in a Patient Record, Not Just What You Ask For

PubMed Central

Prager, John M.; Liang, Jennifer J.; Devarakonda, Murthy V.

2017-01-01

We present a new model of patient record search, called SemanticFind, which goes beyond traditional textual and medical synonym matches by locating patient data that a clinician would want to see rather than just what they ask for. The new model is implemented by making extensive use of the UMLS semantic network, distributional semantics, and NLP, to match query terms along several dimensions in a patient record with the returned matches organized accordingly. The new approach finds all clinically related concepts without the user having to ask for them. An evaluation of the accuracy of SemanticFind shows that it found twice as many relevant matches compared to those found by literal (traditional) search alone, along with very high precision and recall. These results suggest potential uses for SemanticFind in clinical practice, retrospective chart reviews, and in automated extraction of quality metrics. PMID:28815139
Leveraging the UML Metamodel: Expressing ORM Semantics Using a UML Profile

DOE Office of Scientific and Technical Information (OSTI.GOV)

CUYLER,DAVID S.

2000-11-01

Object Role Modeling (ORM) techniques produce a detailed domain model from the perspective of the business owner/customer. The typical process begins with a set of simple sentences reflecting facts about the business. The output of the process is a single model representing primarily the persistent information needs of the business. This type of model contains little, if any reference to a targeted computerized implementation. It is a model of business entities not of software classes. Through well-defined procedures, an ORM model can be transformed into a high quality objector relational schema.
HRT-UML: a design method for hard real-time systems based on the UML notation

NASA Astrophysics Data System (ADS)

D'Alessandro, Massimo; Mazzini, Silvia; di Natale, Marco; Lipari, Giuseppe

2002-07-01

The Hard Real-Time-Unified Modelling Language (HRT-UML) method aims at providing a comprehensive solution to the modeling of Hard Real Time systems. The experience shows that the design of Hard Real-Time systems needs methodologies suitable for the modeling and analysis of aspects related to time, schedulability and performance. In the context of the European Aerospace community a reference method for design is Hierarchical Object Oriented Design (HOOD) and in particular its extension for the modeling of hard real time systems, Hard Real-Time-Hierarchical Object Oriented Design (HRT-HOOD), recommended by the European Space Agency (ESA) for the development of on-board systems. On the other hand in recent years the Unified Modelling Language (UML) has been gaining a very large acceptance in a wide range of domains, all over the world, becoming a de-facto international standard. Tool vendors are very active in this potentially big market. In the Aerospace domain the common opinion is that UML, as a general notation, is not suitable for Hard Real Time systems, even if its importance is recognized as a standard and as a technological trend in the near future. These considerations suggest the possibility of replacing the HRT-HOOD method with a customized version of UML, that incorporates the advantages of both standards and complements the weak points. This approach has the clear advantage of making HRT-HOOD converge on a more powerful and expressive modeling notation. The paper identifies a mapping of the HRT-HOOD semantics into the UML one, and proposes a UML extension profile, that we call HRT-UML, based on the UML standard extension mechanisms, to fully represent HRT-HOOD design concepts. Finally it discusses the relationships between our profile and the UML profile for schedulability, performance and time, adopted by OMG in November 2001.
Extraction of UMLS® Concepts Using Apache cTAKES™ for German Language.

PubMed

Becker, Matthias; Böckmann, Britta

2016-01-01

Automatic information extraction of medical concepts and classification with semantic standards from medical reports is useful for standardization and for clinical research. This paper presents an approach for an UMLS concept extraction with a customized natural language processing pipeline for German clinical notes using Apache cTAKES. The objectives are, to test the natural language processing tool for German language if it is suitable to identify UMLS concepts and map these with SNOMED-CT. The German UMLS database and German OpenNLP models extended the natural language processing pipeline, so the pipeline can normalize to domain ontologies such as SNOMED-CT using the German concepts. For testing, the ShARe/CLEF eHealth 2013 training dataset translated into German was used. The implemented algorithms are tested with a set of 199 German reports, obtaining a result of average 0.36 F1 measure without German stemming, pre- and post-processing of the reports.
Information Retrieval Using UMLS-based Structured Queries

PubMed Central

Fagan, Lawrence M.; Berrios, Daniel C.; Chan, Albert; Cucina, Russell; Datta, Anupam; Shah, Maulik; Surendran, Sujith

2001-01-01

During the last three years, we have developed and described components of ELBook, a semantically based information-retrieval system [1-4]. Using these components, domain experts can specify a query model, indexers can use the query model to index documents, and end-users can search these documents for instances of indexed queries.
Using a Combination of UML, C2RM, XML, and Metadata Registries to Support Long-Term Development/Engineering

DTIC Science & Technology

2003-01-01

Authenticat’n (XCBF) Authorizat’n (XACML) (SAML) Privacy (P3P) Digital Rights Management (XrML) Content Mngmnt (DASL) (WebDAV) Content Syndicat’n...Registry/ Repository BPSS eCommerce XML/EDI Universal Business Language (UBL) Internet & Computing Human Resources (HR-XML) Semantic KEY XML SPECIFICATIONS
CUILESS2016: a clinical corpus applying compositional normalization of text mentions.

PubMed

Osborne, John D; Neu, Matthew B; Danila, Maria I; Solorio, Thamar; Bethard, Steven J

2018-01-10

Traditionally text mention normalization corpora have normalized concepts to single ontology identifiers ("pre-coordinated concepts"). Less frequently, normalization corpora have used concepts with multiple identifiers ("post-coordinated concepts") but the additional identifiers have been restricted to a defined set of relationships to the core concept. This approach limits the ability of the normalization process to express semantic meaning. We generated a freely available corpus using post-coordinated concepts without a defined set of relationships that we term "compositional concepts" to evaluate their use in clinical text. We annotated 5397 disorder mentions from the ShARe corpus to SNOMED CT that were previously normalized as "CUI-less" in the "SemEval-2015 Task 14" shared task because they lacked a pre-coordinated mapping. Unlike the previous normalization method, we do not restrict concept mappings to a particular set of the Unified Medical Language System (UMLS) semantic types and allow normalization to occur to multiple UMLS Concept Unique Identifiers (CUIs). We computed annotator agreement and assessed semantic coverage with this method. We generated the largest clinical text normalization corpus to date with mappings to multiple identifiers and made it freely available. All but 8 of the 5397 disorder mentions were normalized using this methodology. Annotator agreement ranged from 52.4% using the strictest metric (exact matching) to 78.2% using a hierarchical agreement that measures the overlap of shared ancestral nodes. Our results provide evidence that compositional concepts can increase semantic coverage in clinical text. To our knowledge we provide the first freely available corpus of compositional concept annotation in clinical text.
Semantic web data warehousing for caGrid.

PubMed

McCusker, James P; Phillips, Joshua A; González Beltrán, Alejandra; Finkelstein, Anthony; Krauthammer, Michael

2009-10-01

The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges.
Evaluation Methodology for UML and GML Application Schemas Quality

NASA Astrophysics Data System (ADS)

Chojka, Agnieszka

2014-05-01

INSPIRE Directive implementation in Poland has caused the significant increase of interest in making spatial data and services available, particularly among public administration and private institutions. This entailed a series of initiatives that aim to harmonise different spatial data sets, so to ensure their internal logical and semantic coherence. Harmonisation lets to reach the interoperability of spatial databases, then among other things enables joining them together. The process of harmonisation requires either working out new data structures or adjusting existing data structures of spatial databases to INSPIRE guidelines and recommendations. Data structures are described with the use of UML and GML application schemas. Although working out accurate and correct application schemas isn't an easy task. There should be considered many issues, for instance recommendations of ISO 19100 series of Geographic Information Standards, appropriate regulations for given problem or topic, production opportunities and limitations (software, tools). In addition, GML application schema is deeply connected with UML application schema, it should be its translation. Not everything that can be expressed in UML, though can be directly expressed in GML, and this can have significant influence on the spatial data sets interoperability, and thereby the ability to valid data exchange. For these reasons, the capability to examine and estimate UML and GML application schemas quality, therein also the capability to explore their entropy, would be very important. The principal subject of this research is to propose an evaluation methodology for UML and GML application schemas quality prepared in the Head Office of Geodesy and Cartography in Poland within the INSPIRE Directive implementation works.
Modelling biological behaviours with the unified modelling language: an immunological case study and critique.

PubMed

Read, Mark; Andrews, Paul S; Timmis, Jon; Kumar, Vipin

2014-10-06

We present a framework to assist the diagrammatic modelling of complex biological systems using the unified modelling language (UML). The framework comprises three levels of modelling, ranging in scope from the dynamics of individual model entities to system-level emergent properties. By way of an immunological case study of the mouse disease experimental autoimmune encephalomyelitis, we show how the framework can be used to produce models that capture and communicate the biological system, detailing how biological entities, interactions and behaviours lead to higher-level emergent properties observed in the real world. We demonstrate how the UML can be successfully applied within our framework, and provide a critique of UML's ability to capture concepts fundamental to immunology and biology more generally. We show how specialized, well-explained diagrams with less formal semantics can be used where no suitable UML formalism exists. We highlight UML's lack of expressive ability concerning cyclic feedbacks in cellular networks, and the compounding concurrency arising from huge numbers of stochastic, interacting agents. To compensate for this, we propose several additional relationships for expressing these concepts in UML's activity diagram. We also demonstrate the ambiguous nature of class diagrams when applied to complex biology, and question their utility in modelling such dynamic systems. Models created through our framework are non-executable, and expressly free of simulation implementation concerns. They are a valuable complement and precursor to simulation specifications and implementations, focusing purely on thoroughly exploring the biology, recording hypotheses and assumptions, and serve as a communication medium detailing exactly how a simulation relates to the real biology.
Modelling biological behaviours with the unified modelling language: an immunological case study and critique

PubMed Central

Read, Mark; Andrews, Paul S.; Timmis, Jon; Kumar, Vipin

2014-01-01

We present a framework to assist the diagrammatic modelling of complex biological systems using the unified modelling language (UML). The framework comprises three levels of modelling, ranging in scope from the dynamics of individual model entities to system-level emergent properties. By way of an immunological case study of the mouse disease experimental autoimmune encephalomyelitis, we show how the framework can be used to produce models that capture and communicate the biological system, detailing how biological entities, interactions and behaviours lead to higher-level emergent properties observed in the real world. We demonstrate how the UML can be successfully applied within our framework, and provide a critique of UML's ability to capture concepts fundamental to immunology and biology more generally. We show how specialized, well-explained diagrams with less formal semantics can be used where no suitable UML formalism exists. We highlight UML's lack of expressive ability concerning cyclic feedbacks in cellular networks, and the compounding concurrency arising from huge numbers of stochastic, interacting agents. To compensate for this, we propose several additional relationships for expressing these concepts in UML's activity diagram. We also demonstrate the ambiguous nature of class diagrams when applied to complex biology, and question their utility in modelling such dynamic systems. Models created through our framework are non-executable, and expressly free of simulation implementation concerns. They are a valuable complement and precursor to simulation specifications and implementations, focusing purely on thoroughly exploring the biology, recording hypotheses and assumptions, and serve as a communication medium detailing exactly how a simulation relates to the real biology. PMID:25142524
OBO to UML: Support for the development of conceptual models in the biomedical domain.

PubMed

Waldemarin, Ricardo C; de Farias, Cléver R G

2018-04-01

A conceptual model abstractly defines a number of concepts and their relationships for the purposes of understanding and communication. Once a conceptual model is available, it can also be used as a starting point for the development of a software system. The development of conceptual models using the Unified Modeling Language (UML) facilitates the representation of modeled concepts and allows software developers to directly reuse these concepts in the design of a software system. The OBO Foundry represents the most relevant collaborative effort towards the development of ontologies in the biomedical domain. The development of UML conceptual models in the biomedical domain may benefit from the use of domain-specific semantics and notation. Further, the development of these models may also benefit from the reuse of knowledge contained in OBO ontologies. This paper investigates the support for the development of conceptual models in the biomedical domain using UML as a conceptual modeling language and using the support provided by the OBO Foundry for the development of biomedical ontologies, namely entity kind and relationship types definitions provided by the Basic Formal Ontology (BFO) and the OBO Core Relations Ontology (OBO Core), respectively. Further, the paper investigates the support for the reuse of biomedical knowledge currently available in OBOFFF ontologies in the development these conceptual models. The paper describes a UML profile for the OBO Core Relations Ontology, which basically defines a number of stereotypes to represent BFO entity kinds and OBO Core relationship types definitions. The paper also presents a support toolset consisting of a graphical editor named OBO-RO Editor, which directly supports the development of UML models using the extensions defined by our profile, and a command-line tool named OBO2UML, which directly converts an OBOFFF ontology into a UML model. Copyright © 2018 Elsevier Inc. All rights reserved.
A UML profile for the OBO relation ontology

PubMed Central

2012-01-01

Background Ontologies have increasingly been used in the biomedical domain, which has prompted the emergence of different initiatives to facilitate their development and integration. The Open Biological and Biomedical Ontologies (OBO) Foundry consortium provides a repository of life-science ontologies, which are developed according to a set of shared principles. This consortium has developed an ontology called OBO Relation Ontology aiming at standardizing the different types of biological entity classes and associated relationships. Since ontologies are primarily intended to be used by humans, the use of graphical notations for ontology development facilitates the capture, comprehension and communication of knowledge between its users. However, OBO Foundry ontologies are captured and represented basically using text-based notations. The Unified Modeling Language (UML) provides a standard and widely-used graphical notation for modeling computer systems. UML provides a well-defined set of modeling elements, which can be extended using a built-in extension mechanism named Profile. Thus, this work aims at developing a UML profile for the OBO Relation Ontology to provide a domain-specific set of modeling elements that can be used to create standard UML-based ontologies in the biomedical domain. Results We have studied the OBO Relation Ontology, the UML metamodel and the UML profiling mechanism. Based on these studies, we have proposed an extension to the UML metamodel in conformance with the OBO Relation Ontology and we have defined a profile that implements the extended metamodel. Finally, we have applied the proposed UML profile in the development of a number of fragments from different ontologies. Particularly, we have considered the Gene Ontology (GO), the PRotein Ontology (PRO) and the Xenopus Anatomy and Development Ontology (XAO). Conclusions The use of an established and well-known graphical language in the development of biomedical ontologies provides a more intuitive form of capturing and representing knowledge than using only text-based notations. The use of the profile requires the domain expert to reason about the underlying semantics of the concepts and relationships being modeled, which helps preventing the introduction of inconsistencies in an ontology under development and facilitates the identification and correction of errors in an already defined ontology. PMID:23095840
Semantic web data warehousing for caGrid

PubMed Central

McCusker, James P; Phillips, Joshua A; Beltrán, Alejandra González; Finkelstein, Anthony; Krauthammer, Michael

2009-01-01

The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG® Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges. PMID:19796399
Enabling online studies of conceptual relationships between medical terms: developing an efficient web platform.

PubMed

Albin, Aaron; Ji, Xiaonan; Borlawsky, Tara B; Ye, Zhan; Lin, Simon; Payne, Philip Ro; Huang, Kun; Xiang, Yang

2014-10-07

The Unified Medical Language System (UMLS) contains many important ontologies in which terms are connected by semantic relations. For many studies on the relationships between biomedical concepts, the use of transitively associated information from ontologies and the UMLS has been shown to be effective. Although there are a few tools and methods available for extracting transitive relationships from the UMLS, they usually have major restrictions on the length of transitive relations or on the number of data sources. Our goal was to design an efficient online platform that enables efficient studies on the conceptual relationships between any medical terms. To overcome the restrictions of available methods and to facilitate studies on the conceptual relationships between medical terms, we developed a Web platform, onGrid, that supports efficient transitive queries and conceptual relationship studies using the UMLS. This framework uses the latest technique in converting natural language queries into UMLS concepts, performs efficient transitive queries, and visualizes the result paths. It also dynamically builds a relationship matrix for two sets of input biomedical terms. We are thus able to perform effective studies on conceptual relationships between medical terms based on their relationship matrix. The advantage of onGrid is that it can be applied to study any two sets of biomedical concept relations and the relations within one set of biomedical concepts. We use onGrid to study the disease-disease relationships in the Online Mendelian Inheritance in Man (OMIM). By crossvalidating our results with an external database, the Comparative Toxicogenomics Database (CTD), we demonstrated that onGrid is effective for the study of conceptual relationships between medical terms. onGrid is an efficient tool for querying the UMLS for transitive relations, studying the relationship between medical terms, and generating hypotheses.
Tailoring vocabularies for NLP in sub-domains: a method to detect unused word sense.

PubMed

Figueroa, Rosa L; Zeng-Treitler, Qing; Goryachev, Sergey; Wiechmann, Eduardo P

2009-11-14

We developed a method to help tailor a comprehensive vocabulary system (e.g. the UMLS) for a sub-domain (e.g. clinical reports) in support of natural language processing (NLP). The method detects unused sense in a sub-domain by comparing the relational neighborhood of a word/term in the vocabulary with the semantic neighborhood of the word/term in the sub-domain. The semantic neighborhood of the word/term in the sub-domain is determined using latent semantic analysis (LSA). We trained and tested the unused sense detection on two clinical text corpora: one contains discharge summaries and the other outpatient visit notes. We were able to detect unused senses with precision from 79% to 87%, recall from 48% to 74%, and an area under receiver operation curve (AUC) of 72% to 87%.
Semantic annotation in biomedicine: the current landscape.

PubMed

Jovanović, Jelena; Bagheri, Ebrahim

2017-09-22

The abundance and unstructured nature of biomedical texts, be it clinical or research content, impose significant challenges for the effective and efficient use of information and knowledge stored in such texts. Annotation of biomedical documents with machine intelligible semantics facilitates advanced, semantics-based text management, curation, indexing, and search. This paper focuses on annotation of biomedical entity mentions with concepts from relevant biomedical knowledge bases such as UMLS. As a result, the meaning of those mentions is unambiguously and explicitly defined, and thus made readily available for automated processing. This process is widely known as semantic annotation, and the tools that perform it are known as semantic annotators.Over the last dozen years, the biomedical research community has invested significant efforts in the development of biomedical semantic annotation technology. Aiming to establish grounds for further developments in this area, we review a selected set of state of the art biomedical semantic annotators, focusing particularly on general purpose annotators, that is, semantic annotation tools that can be customized to work with texts from any area of biomedicine. We also examine potential directions for further improvements of today's annotators which could make them even more capable of meeting the needs of real-world applications. To motivate and encourage further developments in this area, along the suggested and/or related directions, we review existing and potential practical applications and benefits of semantic annotators.
Image segmentation via foreground and background semantic descriptors

NASA Astrophysics Data System (ADS)

Yuan, Ding; Qiang, Jingjing; Yin, Jihao

2017-09-01

In the field of image processing, it has been a challenging task to obtain a complete foreground that is not uniform in color or texture. Unlike other methods, which segment the image by only using low-level features, we present a segmentation framework, in which high-level visual features, such as semantic information, are used. First, the initial semantic labels were obtained by using the nonparametric method. Then, a subset of the training images, with a similar foreground to the input image, was selected. Consequently, the semantic labels could be further refined according to the subset. Finally, the input image was segmented by integrating the object affinity and refined semantic labels. State-of-the-art performance was achieved in experiments with the challenging MSRC 21 dataset.
Automated UMLS-Based Comparison of Medical Forms

PubMed Central

Dugas, Martin; Fritz, Fleur; Krumm, Rainer; Breil, Bernhard

2013-01-01

Medical forms are very heterogeneous: on a European scale there are thousands of data items in several hundred different systems. To enable data exchange for clinical care and research purposes there is a need to develop interoperable documentation systems with harmonized forms for data capture. A prerequisite in this harmonization process is comparison of forms. So far – to our knowledge – an automated method for comparison of medical forms is not available. A form contains a list of data items with corresponding medical concepts. An automatic comparison needs data types, item names and especially item with these unique concept codes from medical terminologies. The scope of the proposed method is a comparison of these items by comparing their concept codes (coded in UMLS). Each data item is represented by item name, concept code and value domain. Two items are called identical, if item name, concept code and value domain are the same. Two items are called matching, if only concept code and value domain are the same. Two items are called similar, if their concept codes are the same, but the value domains are different. Based on these definitions an open-source implementation for automated comparison of medical forms in ODM format with UMLS-based semantic annotations was developed. It is available as package compareODM from http://cran.r-project.org. To evaluate this method, it was applied to a set of 7 real medical forms with 285 data items from a large public ODM repository with forms for different medical purposes (research, quality management, routine care). Comparison results were visualized with grid images and dendrograms. Automated comparison of semantically annotated medical forms is feasible. Dendrograms allow a view on clustered similar forms. The approach is scalable for a large set of real medical forms. PMID:23861827
The UMLS Knowledge Source Server: an experience in Web 2.0 technologies.

PubMed

Thorn, Karen E; Bangalore, Anantha K; Browne, Allen C

2007-10-11

The UMLS Knowledge Source Server (UMLSKS), developed at the National Library of Medicine (NLM), makes the knowledge sources of the Unified Medical Language System (UMLS) available to the research community over the Internet. In 2003, the UMLSKS was redesigned utilizing state-of-the-art technologies available at that time. That design offered a significant improvement over the prior version but presented a set of technology-dependent issues that limited its functionality and usability. Four areas of desired improvement were identified: software interfaces, web interface content, system maintenance/deployment, and user authentication. By employing next generation web technologies, newer authentication paradigms and further refinements in modular design methods, these areas could be addressed and corrected to meet the ever increasing needs of UMLSKS developers. In this paper we detail the issues present with the existing system and describe the new system's design using new technologies considered entrants in the Web 2.0 development era.

A HyperCard Implementation of Meta-1: The First Version of the UMLS Metathesaurus*

PubMed Central

Sherertz, David; Tuttle, Mark; Cole, William; Erlbaum, Mark; Olson, Nels; Nelson, Stuart

1989-01-01

The Unified Medical Language System (UMLS) is being designed to provide uniform access to computer-based resources in biomedicine. For the foreseeable future, the foundation of the UMLS will be a metathesaurus of concepts, synthesized from existing biomedical nomenclatures. Meta-1, the first version of the Metathesaurus, will contain all of MeSH, a selection of terms from primary care, clinical medicine, and other domains, and all terms from SNOMED, ICD-9-CM, and CPT-4 which “match” them -- about 30,000 terms. In addition, Meta-1 will contain information about the occurrence and co-occurrence of its terms in selected resources, such as MEDLINE. As Meta-1 will contain about 100MB of terms and relationships, it is unlikely that it will be “printed.” Instead, some UMLS applications will support Metathesaurus browsing. One way of browsing Meta-1 will be via the Apple Macintosh® application called HyperCard®. A demonstration of a HyperCard interface, called Meta-Card™ will first acquaint viewers with the contents of the pre-human-review version of Meta-1, and second, illustrate how an object-oriented interface can be programmed to support various visual metaphors, e.g. “click-to-get-more-information,” and “click-to-follow-a-semantic-link,” and the notion of a Metathesaurus esthetic.
An introduction to the Semantic Web for health sciences librarians.

PubMed

Robu, Ioana; Robu, Valentin; Thirion, Benoit

2006-04-01

The paper (1) introduces health sciences librarians to the main concepts and principles of the Semantic Web (SW) and (2) briefly reviews a number of projects on the handling of biomedical information that uses SW technology. The paper is structured into two main parts. "Semantic Web Technology" provides a high-level description, with examples, of the main standards and concepts: extensible markup language (XML), Resource Description Framework (RDF), RDF Schema (RDFS), ontologies, and their utility in information retrieval, concluding with mention of more advanced SW languages and their characteristics. "Semantic Web Applications and Research Projects in the Biomedical Field" is a brief review of the Unified Medical Language System (UMLS), Generalised Architecture for Languages, Encyclopedias and Nomenclatures in Medicine (GALEN), HealthCyberMap, LinkBase, and the thesaurus of the National Cancer Institute (NCI). The paper also mentions other benefits and by-products of the SW, citing projects related to them. Some of the problems facing the SW vision are presented, especially the ways in which the librarians' expertise in organizing knowledge and in structuring information may contribute to SW projects.
Toward a Bio-Medical Thesaurus: Building the Foundation of the UMLS

PubMed Central

Tuttle, Mark S.; Blois, Marsden S.; Erlbaum, Mark S.; Nelson, Stuart J.; Sherertz, David D.

1988-01-01

The Unified Medical Language System (UMLS) is being designed to provide a uniform user interface to heterogeneous machine-readable bio-medical information resources, such as bibliographic databases, genetic databases, expert systems and patient records.1 Such an interface will have to recognize different ways of saying the same thing, and provide links to ways of saying related things. One way to represent the necessary associations is via a domain thesaurus. As no such thesaurus exists, and because, once built, it will be both sizable and in need of continuous maintenance, its design should include a methodology for building and maintaining it. We propose a methodology, utilizing lexically expanded schema inversion, and a design, called T. Lex, which together form one approach to the problem of defining and building a bio-medical thesaurus. We argue that the semantic locality implicit in such a thesaurus will support model-based reasoning in bio-medicine.2
The ranking algorithm of the Coach browser for the UMLS metathesaurus.

PubMed Central

Harbourt, A. M.; Syed, E. J.; Hole, W. T.; Kingsland, L. C.

1993-01-01

This paper presents the novel ranking algorithm of the Coach Metathesaurus browser which is a major module of the Coach expert search refinement program. An example shows how the ranking algorithm can assist in creating a list of candidate terms useful in augmenting a suboptimal Grateful Med search of MEDLINE. PMID:8130570
Ontology-based vector space model and fuzzy query expansion to retrieve knowledge on medical computational problem solutions.

PubMed

Bratsas, Charalampos; Koutkias, Vassilis; Kaimakamis, Evangelos; Bamidis, Panagiotis; Maglaveras, Nicos

2007-01-01

Medical Computational Problem (MCP) solving is related to medical problems and their computerized algorithmic solutions. In this paper, an extension of an ontology-based model to fuzzy logic is presented, as a means to enhance the information retrieval (IR) procedure in semantic management of MCPs. We present herein the methodology followed for the fuzzy expansion of the ontology model, the fuzzy query expansion procedure, as well as an appropriate ontology-based Vector Space Model (VSM) that was constructed for efficient mapping of user-defined MCP search criteria and MCP acquired knowledge. The relevant fuzzy thesaurus is constructed by calculating the simultaneous occurrences of terms and the term-to-term similarities derived from the ontology that utilizes UMLS (Unified Medical Language System) concepts by using Concept Unique Identifiers (CUI), synonyms, semantic types, and broader-narrower relationships for fuzzy query expansion. The current approach constitutes a sophisticated advance for effective, semantics-based MCP-related IR.
A study of actions in operative notes.

PubMed

Wang, Yan; Pakhomov, Serguei; Burkart, Nora E; Ryan, James O; Melton, Genevieve B

2012-01-01

Operative notes contain rich information about techniques, instruments, and materials used in procedures. To assist development of effective information extraction (IE) techniques for operative notes, we investigated the sublanguage used to describe actions within the operative report 'procedure description' section. Deep parsing results of 362,310 operative notes with an expanded Stanford parser using the SPECIALIST Lexicon resulted in 200 verbs (92% coverage) including 147 action verbs. Nominal action predicates for each action verb were gathered from WordNet, SPECIALIST Lexicon, New Oxford American Dictionary and Stedman's Medical Dictionary. Coverage gaps were seen in existing lexical, domain, and semantic resources (Unified Medical Language System (UMLS) Metathesaurus, SPECIALIST Lexicon, WordNet and FrameNet). Our findings demonstrate the need to construct surgical domain-specific semantic resources for IE from operative notes.
Integration of the MIP Command and Control Information Exchange Data Model into National Systems

DTIC Science & Technology

2005-06-01

Solutions for the Java programming language include Hibernate ( Hibernate , 2005), Java Data Objects (JDO, 2005), J2EE Container Managed Persistence (CMP) and... Java , C++, or UML classes in a first step. The semantical gap between the relational and the object-oriented world, also called O-R impedance, is a...cannot be achieved at the interfaces – it needs to be established in the core of national systems! References Hibernate (2005). www.hibernate.org. JDO
An introduction to the Semantic Web for health sciences librarians*

PubMed Central

Robu, Ioana; Robu, Valentin; Thirion, Benoit

2006-01-01

Objectives: The paper (1) introduces health sciences librarians to the main concepts and principles of the Semantic Web (SW) and (2) briefly reviews a number of projects on the handling of biomedical information that uses SW technology. Methodology: The paper is structured into two main parts. “Semantic Web Technology” provides a high-level description, with examples, of the main standards and concepts: extensible markup language (XML), Resource Description Framework (RDF), RDF Schema (RDFS), ontologies, and their utility in information retrieval, concluding with mention of more advanced SW languages and their characteristics. “Semantic Web Applications and Research Projects in the Biomedical Field” is a brief review of the Unified Medical Language System (UMLS), Generalised Architecture for Languages, Encyclopedias and Nomenclatures in Medicine (GALEN), HealthCyberMap, LinkBase, and the thesaurus of the National Cancer Institute (NCI). The paper also mentions other benefits and by-products of the SW, citing projects related to them. Discussion and Conclusions: Some of the problems facing the SW vision are presented, especially the ways in which the librarians' expertise in organizing knowledge and in structuring information may contribute to SW projects. PMID:16636713
ODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository.

PubMed

Dugas, Martin; Meidt, Alexandra; Neuhaus, Philipp; Storck, Michael; Varghese, Julian

2016-06-01

The volume and complexity of patient data - especially in personalised medicine - is steadily increasing, both regarding clinical data and genomic profiles: Typically more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests etc.) are collected per patient in clinical trials. In oncology hundreds of mutations can potentially be detected for each patient by genomic profiling. Therefore data integration from multiple sources constitutes a key challenge for medical research and healthcare. Semantic annotation of data elements can facilitate to identify matching data elements in different sources and thereby supports data integration. Millions of different annotations are required due to the semantic richness of patient data. These annotations should be uniform, i.e., two matching data elements shall contain the same annotations. However, large terminologies like SNOMED CT or UMLS don't provide uniform coding. It is proposed to develop semantic annotations of medical data elements based on a large-scale public metadata repository. To achieve uniform codes, semantic annotations shall be re-used if a matching data element is available in the metadata repository. A web-based tool called ODMedit ( https://odmeditor.uni-muenster.de/ ) was developed to create data models with uniform semantic annotations. It contains ~800,000 terms with semantic annotations which were derived from ~5,800 models from the portal of medical data models (MDM). The tool was successfully applied to manually annotate 22 forms with 292 data items from CDISC and to update 1,495 data models of the MDM portal. Uniform manual semantic annotation of data models is feasible in principle, but requires a large-scale collaborative effort due to the semantic richness of patient data. A web-based tool for these annotations is available, which is linked to a public metadata repository.
Automatic Debugging Support for UML Designs

NASA Technical Reports Server (NTRS)

Schumann, Johann; Swanson, Keith (Technical Monitor)

2001-01-01

Design of large software systems requires rigorous application of software engineering methods covering all phases of the software process. Debugging during the early design phases is extremely important, because late bug-fixes are expensive. In this paper, we describe an approach which facilitates debugging of UML requirements and designs. The Unified Modeling Language (UML) is a set of notations for object-orient design of a software system. We have developed an algorithm which translates requirement specifications in the form of annotated sequence diagrams into structured statecharts. This algorithm detects conflicts between sequence diagrams and inconsistencies in the domain knowledge. After synthesizing statecharts from sequence diagrams, these statecharts usually are subject to manual modification and refinement. By using the "backward" direction of our synthesis algorithm. we are able to map modifications made to the statechart back into the requirements (sequence diagrams) and check for conflicts there. Fed back to the user conflicts detected by our algorithm are the basis for deductive-based debugging of requirements and domain theory in very early development stages. Our approach allows to generate explanations oil why there is a conflict and which parts of the specifications are affected.
A methodology for extending domain coverage in SemRep.

PubMed

Rosemblat, Graciela; Shin, Dongwook; Kilicoglu, Halil; Sneiderman, Charles; Rindflesch, Thomas C

2013-12-01

We describe a domain-independent methodology to extend SemRep coverage beyond the biomedical domain. SemRep, a natural language processing application originally designed for biomedical texts, uses the knowledge sources provided by the Unified Medical Language System (UMLS©). Ontological and terminological extensions to the system are needed in order to support other areas of knowledge. We extended SemRep's application by developing a semantic representation of a previously unsupported domain. This was achieved by adapting well-known ontology engineering phases and integrating them with the UMLS knowledge sources on which SemRep crucially depends. While the process to extend SemRep coverage has been successfully applied in earlier projects, this paper presents in detail the step-wise approach we followed and the mechanisms implemented. A case study in the field of medical informatics illustrates how the ontology engineering phases have been adapted for optimal integration with the UMLS. We provide qualitative and quantitative results, which indicate the validity and usefulness of our methodology. Published by Elsevier Inc.
Modelling expertise at different levels of granularity using semantic similarity measures in the context of collaborative knowledge-curation platforms.

PubMed

Ziaimatin, Hasti; Groza, Tudor; Tudorache, Tania; Hunter, Jane

2016-12-01

Collaboration platforms provide a dynamic environment where the content is subject to ongoing evolution through expert contributions. The knowledge embedded in such platforms is not static as it evolves through incremental refinements - or micro-contributions. Such refinements provide vast resources of tacit knowledge and experience. In our previous work, we proposed and evaluated a Semantic and Time-dependent Expertise Profiling (STEP) approach for capturing expertise from micro-contributions. In this paper we extend our investigation to structured micro-contributions that emerge from an ontology engineering environment, such as the one built for developing the International Classification of Diseases (ICD) revision 11. We take advantage of the semantically related nature of these structured micro-contributions to showcase two major aspects: (i) a novel semantic similarity metric, in addition to an approach for creating bottom-up baseline expertise profiles using expertise centroids; and (ii) the application of STEP in this new environment combined with the use of the same semantic similarity measure to both compare STEP against baseline profiles, as well as to investigate the coverage of these baseline profiles by STEP.
Exploring supervised and unsupervised methods to detect topics in biomedical text

PubMed Central

Lee, Minsuk; Wang, Weiqing; Yu, Hong

2006-01-01

Background Topic detection is a task that automatically identifies topics (e.g., "biochemistry" and "protein structure") in scientific articles based on information content. Topic detection will benefit many other natural language processing tasks including information retrieval, text summarization and question answering; and is a necessary step towards the building of an information system that provides an efficient way for biologists to seek information from an ocean of literature. Results We have explored the methods of Topic Spotting, a task of text categorization that applies the supervised machine-learning technique naïve Bayes to assign automatically a document into one or more predefined topics; and Topic Clustering, which apply unsupervised hierarchical clustering algorithms to aggregate documents into clusters such that each cluster represents a topic. We have applied our methods to detect topics of more than fifteen thousand of articles that represent over sixteen thousand entries in the Online Mendelian Inheritance in Man (OMIM) database. We have explored bag of words as the features. Additionally, we have explored semantic features; namely, the Medical Subject Headings (MeSH) that are assigned to the MEDLINE records, and the Unified Medical Language System (UMLS) semantic types that correspond to the MeSH terms, in addition to bag of words, to facilitate the tasks of topic detection. Our results indicate that incorporating the MeSH terms and the UMLS semantic types as additional features enhances the performance of topic detection and the naïve Bayes has the highest accuracy, 66.4%, for predicting the topic of an OMIM article as one of the total twenty-five topics. Conclusion Our results indicate that the supervised topic spotting methods outperformed the unsupervised topic clustering; on the other hand, the unsupervised topic clustering methods have the advantages of being robust and applicable in real world settings. PMID:16539745
Semantic transference for enriching multilingual biomedical knowledge resources.

PubMed

Pérez, María; Berlanga, Rafael

2015-12-01

Biomedical knowledge resources (KRs) are mainly expressed in English, and many applications using them suffer from the scarcity of knowledge in non-English languages. The goal of the present work is to take maximum profit from existing multilingual biomedical KRs lexicons to enrich their non-English counterparts. We propose to combine different automatic methods to generate pair-wise language alignments. More specifically, we use two well-known translation methods (GIZA++ and Moses), and we propose a new ad hoc method specially devised for multilingual KRs. Then, resulting alignments are used to transfer semantics between KRs across their languages. Transference quality is ensured by checking the semantic coherence of the generated alignments. Experiments have been carried out over the Spanish, French and German UMLS Metathesaurus counterparts. As a result, the enriched Spanish KR can grow up to 1,514,217 concepts (originally 286,659), the French KR up to 1,104,968 concepts (originally 83,119), and the German KR up to 1,136,020 concepts (originally 86,842). Copyright © 2015 Elsevier Inc. All rights reserved.
Building a knowledge base of severe adverse drug events based on AERS reporting data using semantic web technologies.

PubMed

Jiang, Guoqian; Wang, Liwei; Liu, Hongfang; Solbrig, Harold R; Chute, Christopher G

2013-01-01

A semantically coded knowledge base of adverse drug events (ADEs) with severity information is critical for clinical decision support systems and translational research applications. However it remains challenging to measure and identify the severity information of ADEs. The objective of the study is to develop and evaluate a semantic web based approach for building a knowledge base of severe ADEs based on the FDA Adverse Event Reporting System (AERS) reporting data. We utilized a normalized AERS reporting dataset and extracted putative drug-ADE pairs and their associated outcome codes in the domain of cardiac disorders. We validated the drug-ADE associations using ADE datasets from SIDe Effect Resource (SIDER) and the UMLS. We leveraged the Common Terminology Criteria for Adverse Event (CTCAE) grading system and classified the ADEs into the CTCAE in the Web Ontology Language (OWL). We identified and validated 2,444 unique Drug-ADE pairs in the domain of cardiac disorders, of which 760 pairs are in Grade 5, 775 pairs in Grade 4 and 2,196 pairs in Grade 3.
Standardized mappings--a framework to combine different semantic mappers into a standardized web-API.

PubMed

Neuhaus, Philipp; Doods, Justin; Dugas, Martin

2015-01-01

Automatic coding of medical terms is an important, but highly complicated and laborious task. To compare and evaluate different strategies a framework with a standardized web-interface was created. Two UMLS mapping strategies are compared to demonstrate the interface. The framework is a Java Spring application running on a Tomcat application server. It accepts different parameters and returns results in JSON format. To demonstrate the framework, a list of medical data items was mapped by two different methods: similarity search in a large table of terminology codes versus search in a manually curated repository. These mappings were reviewed by a specialist. The evaluation shows that the framework is flexible (due to standardized interfaces like HTTP and JSON), performant and reliable. Accuracy of automatically assigned codes is limited (up to 40%). Combining different semantic mappers into a standardized Web-API is feasible. This framework can be easily enhanced due to its modular design.
Extracting similar terms from multiple EMR-based semantic embeddings to support chart reviews.

PubMed

Cheng Ye, M S; Fabbri, Daniel

2018-05-21

Word embeddings project semantically similar terms into nearby points in a vector space. When trained on clinical text, these embeddings can be leveraged to improve keyword search and text highlighting. In this paper, we present methods to refine the selection process of similar terms from multiple EMR-based word embeddings, and evaluate their performance quantitatively and qualitatively across multiple chart review tasks. Word embeddings were trained on each clinical note type in an EMR. These embeddings were then combined, weighted, and truncated to select a refined set of similar terms to be used in keyword search and text highlighting. To evaluate their quality, we measured the similar terms' information retrieval (IR) performance using precision-at-K (P@5, P@10). Additionally a user study evaluated users' search term preferences, while a timing study measured the time to answer a question from a clinical chart. The refined terms outperformed the baseline method's information retrieval performance (e.g., increasing the average P@5 from 0.48 to 0.60). Additionally, the refined terms were preferred by most users, and reduced the average time to answer a question. Clinical information can be more quickly retrieved and synthesized when using semantically similar term from multiple embeddings. Copyright © 2018. Published by Elsevier Inc.
Deriving a probabilistic syntacto-semantic grammar for biomedicine based on domain-specific terminologies

PubMed Central

Fan, Jung-Wei; Friedman, Carol

2011-01-01

Biomedical natural language processing (BioNLP) is a useful technique that unlocks valuable information stored in textual data for practice and/or research. Syntactic parsing is a critical component of BioNLP applications that rely on correctly determining the sentence and phrase structure of free text. In addition to dealing with the vast amount of domain-specific terms, a robust biomedical parser needs to model the semantic grammar to obtain viable syntactic structures. With either a rule-based or corpus-based approach, the grammar engineering process requires substantial time and knowledge from experts, and does not always yield a semantically transferable grammar. To reduce the human effort and to promote semantic transferability, we propose an automated method for deriving a probabilistic grammar based on a training corpus consisting of concept strings and semantic classes from the Unified Medical Language System (UMLS), a comprehensive terminology resource widely used by the community. The grammar is designed to specify noun phrases only due to the nominal nature of the majority of biomedical terminological concepts. Evaluated on manually parsed clinical notes, the derived grammar achieved a recall of 0.644, precision of 0.737, and average cross-bracketing of 0.61, which demonstrated better performance than a control grammar with the semantic information removed. Error analysis revealed shortcomings that could be addressed to improve performance. The results indicated the feasibility of an approach which automatically incorporates terminology semantics in the building of an operational grammar. Although the current performance of the unsupervised solution does not adequately replace manual engineering, we believe once the performance issues are addressed, it could serve as an aide in a semi-supervised solution. PMID:21549857
Modelling expertise at different levels of granularity using semantic similarity measures in the context of collaborative knowledge-curation platforms

PubMed Central

Groza, Tudor; Tudorache, Tania; Hunter, Jane

2015-01-01

Collaboration platforms provide a dynamic environment where the content is subject to ongoing evolution through expert contributions. The knowledge embedded in such platforms is not static as it evolves through incremental refinements – or micro-contributions. Such refinements provide vast resources of tacit knowledge and experience. In our previous work, we proposed and evaluated a Semantic and Time-dependent Expertise Profiling (STEP) approach for capturing expertise from micro-contributions. In this paper we extend our investigation to structured micro-contributions that emerge from an ontology engineering environment, such as the one built for developing the International Classification of Diseases (ICD) revision 11. We take advantage of the semantically related nature of these structured micro-contributions to showcase two major aspects: (i) a novel semantic similarity metric, in addition to an approach for creating bottom-up baseline expertise profiles using expertise centroids; and (ii) the application of STEP in this new environment combined with the use of the same semantic similarity measure to both compare STEP against baseline profiles, as well as to investigate the coverage of these baseline profiles by STEP. PMID:28077914
Computer Programs for the Semantic Differential: Further Modifications.

ERIC Educational Resources Information Center

Lawson, Edwin D.; And Others

The original nine programs for semantic differential analysis have been condensed into three programs which have been further refined and augmented. They yield: (1) means, standard deviations, and standard errors for each subscale on each concept; (2) Evaluation, Potency, and Activity (EPA) means, standard deviations, and standard errors; (3)…

Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation.

PubMed

Jimeno-Yepes, Antonio J; McInnes, Bridget T; Aronson, Alan R

2011-06-02

Evaluation of Word Sense Disambiguation (WSD) methods in the biomedical domain is difficult because the available resources are either too small or too focused on specific types of entities (e.g. diseases or genes). We present a method that can be used to automatically develop a WSD test collection using the Unified Medical Language System (UMLS) Metathesaurus and the manual MeSH indexing of MEDLINE. We demonstrate the use of this method by developing such a data set, called MSH WSD. In our method, the Metathesaurus is first screened to identify ambiguous terms whose possible senses consist of two or more MeSH headings. We then use each ambiguous term and its corresponding MeSH heading to extract MEDLINE citations where the term and only one of the MeSH headings co-occur. The term found in the MEDLINE citation is automatically assigned the UMLS CUI linked to the MeSH heading. Each instance has been assigned a UMLS Concept Unique Identifier (CUI). We compare the characteristics of the MSH WSD data set to the previously existing NLM WSD data set. The resulting MSH WSD data set consists of 106 ambiguous abbreviations, 88 ambiguous terms and 9 which are a combination of both, for a total of 203 ambiguous entities. For each ambiguous term/abbreviation, the data set contains a maximum of 100 instances per sense obtained from MEDLINE.We evaluated the reliability of the MSH WSD data set using existing knowledge-based methods and compared their performance to that of the results previously obtained by these algorithms on the pre-existing data set, NLM WSD. We show that the knowledge-based methods achieve different results but keep their relative performance except for the Journal Descriptor Indexing (JDI) method, whose performance is below the other methods. The MSH WSD data set allows the evaluation of WSD algorithms in the biomedical domain. Compared to previously existing data sets, MSH WSD contains a larger number of biomedical terms/abbreviations and covers the largest set of UMLS Semantic Types. Furthermore, the MSH WSD data set has been generated automatically reusing already existing annotations and, therefore, can be regenerated from subsequent UMLS versions.
Analysis of Online Information Searching for Cardiovascular Diseases on a Consumer Health Information Portal

PubMed Central

Jadhav, Ashutosh; Sheth, Amit; Pathak, Jyotishman

2014-01-01

Since the early 2000’s, Internet usage for health information searching has increased significantly. Studying search queries can help us to understand users “information need” and how do they formulate search queries (“expression of information need”). Although cardiovascular diseases (CVD) affect a large percentage of the population, few studies have investigated how and what users search for CVD. We address this knowledge gap in the community by analyzing a large corpus of 10 million CVD related search queries from MayoClinic.com. Using UMLS MetaMap and UMLS semantic types/concepts, we developed a rule-based approach to categorize the queries into 14 health categories. We analyzed structural properties, types (keyword-based/Wh-questions/Yes-No questions) and linguistic structure of the queries. Our results show that the most searched health categories are ‘Diseases/Conditions’, ‘Vital-Sings’, ‘Symptoms’ and ‘Living-with’. CVD queries are longer and are predominantly keyword-based. This study extends our knowledge about online health information searching and provides useful insights for Web search engines and health websites. PMID:25954380
A Flexible Statechart-to-Model-Checker Translator

NASA Technical Reports Server (NTRS)

Rouquette, Nicolas; Dunphy, Julia; Feather, Martin S.

2000-01-01

Many current-day software design tools offer some variant of statechart notation for system specification. We, like others, have built an automatic translator from (a subset of) statecharts to a model checker, for use to validate behavioral requirements. Our translator is designed to be flexible. This allows us to quickly adjust the translator to variants of statechart semantics, including problem-specific notational conventions that designers employ. Our system demonstration will be of interest to the following two communities: (1) Potential end-users: Our demonstration will show translation from statecharts created in a commercial UML tool (Rational Rose) to Promela, the input language of Holzmann's model checker SPIN. The translation is accomplished automatically. To accommodate the major variants of statechart semantics, our tool offers user-selectable choices among semantic alternatives. Options for customized semantic variants are also made available. The net result is an easy-to-use tool that operates on a wide range of statechart diagrams to automate the pathway to model-checking input. (2) Other researchers: Our translator embodies, in one tool, ideas and approaches drawn from several sources. Solutions to the major challenges of statechart-to-model-checker translation (e.g., determining which transition(s) will fire, handling of concurrent activities) are retired in a uniform, fully mechanized, setting. The way in which the underlying architecture of the translator itself facilitates flexible and customizable translation will also be evident.
Sharing behavioral data through a grid infrastructure using data standards

PubMed Central

Min, Hua; Ohira, Riki; Collins, Michael A; Bondy, Jessica; Avis, Nancy E; Tchuvatkina, Olga; Courtney, Paul K; Moser, Richard P; Shaikh, Abdul R; Hesse, Bradford W; Cooper, Mary; Reeves, Dianne; Lanese, Bob; Helba, Cindy; Miller, Suzanne M; Ross, Eric A

2014-01-01

Objective In an effort to standardize behavioral measures and their data representation, the present study develops a methodology for incorporating measures found in the National Cancer Institute's (NCI) grid-enabled measures (GEM) portal, a repository for behavioral and social measures, into the cancer data standards registry and repository (caDSR). Methods The methodology consists of four parts for curating GEM measures into the caDSR: (1) develop unified modeling language (UML) models for behavioral measures; (2) create common data elements (CDE) for UML components; (3) bind CDE with concepts from the NCI thesaurus; and (4) register CDE in the caDSR. Results UML models have been developed for four GEM measures, which have been registered in the caDSR as CDE. New behavioral concepts related to these measures have been created and incorporated into the NCI thesaurus. Best practices for representing measures using UML models have been utilized in the practice (eg, caDSR). One dataset based on a GEM-curated measure is available for use by other systems and users connected to the grid. Conclusions Behavioral and population science data can be standardized by using and extending current standards. A new branch of CDE for behavioral science was developed for the caDSR. It expands the caDSR domain coverage beyond the clinical and biological areas. In addition, missing terms and concepts specific to the behavioral measures addressed in this paper were added to the NCI thesaurus. A methodology was developed and refined for curation of behavioral and population science data. PMID:24076749
A Framework to Manage Information Models

NASA Astrophysics Data System (ADS)

Hughes, J. S.; King, T.; Crichton, D.; Walker, R.; Roberts, A.; Thieman, J.

2008-05-01

The Information Model is the foundation on which an Information System is built. It defines the entities to be processed, their attributes, and the relationships that add meaning. The development and subsequent management of the Information Model is the single most significant factor for the development of a successful information system. A framework of tools has been developed that supports the management of an information model with the rigor typically afforded to software development. This framework provides for evolutionary and collaborative development independent of system implementation choices. Once captured, the modeling information can be exported to common languages for the generation of documentation, application databases, and software code that supports both traditional and semantic web applications. This framework is being successfully used for several science information modeling projects including those for the Planetary Data System (PDS), the International Planetary Data Alliance (IPDA), the National Cancer Institute's Early Detection Research Network (EDRN), and several Consultative Committee for Space Data Systems (CCSDS) projects. The objective of the Space Physics Archive Search and Exchange (SPASE) program is to promote collaboration and coordination of archiving activity for the Space Plasma Physics community and ensure the compatibility of the architectures used for a global distributed system and the individual data centers. Over the past several years, the SPASE data model working group has made great progress in developing the SPASE Data Model and supporting artifacts including a data dictionary, XML Schema, and two ontologies. The authors have captured the SPASE Information Model in this framework. This allows the generation of documentation that presents the SPASE Information Model in object-oriented notation including UML class diagrams and class hierarchies. The modeling information can also be exported to semantic web languages such as OWL and RDF and written to XML Metadata Interchange (XMI) files for import into UML tools.
Refining Automatically Extracted Knowledge Bases Using Crowdsourcing.

PubMed

Li, Chunhua; Zhao, Pengpeng; Sheng, Victor S; Xian, Xuefeng; Wu, Jian; Cui, Zhiming

2017-01-01

Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost.
Exchange of Computable Patient Data between the Department of Veterans Affairs (VA) and the Department of Defense (DoD): Terminology Mediation Strategy

PubMed Central

Bouhaddou, Omar; Warnekar, Pradnya; Parrish, Fola; Do, Nhan; Mandel, Jack; Kilbourne, John; Lincoln, Michael J.

2008-01-01

Complete patient health information that is available where and when it is needed is essential to providers and patients and improves healthcare quality and patient safety. VA and DoD have built on their previous experience in patient data exchange to establish data standards and terminology services to enable real-time bi-directional computable (i.e., encoded) data exchange and achieve semantic interoperability in compliance with recommended national standards and the eGov initiative. The project uses RxNorm, UMLS, and SNOMED CT terminology standards to mediate codified pharmacy and allergy data with greater than 92 and 60 percent success rates respectively. Implementation of the project has been well received by users and is being expanded to multiple joint care sites. Stable and mature standards, mediation strategies, and a close relationship between healthcare institutions and Standards Development Organizations are recommended to achieve and maintain semantic interoperability in a clinical setting. PMID:18096911
Portal of medical data models: information infrastructure for medical research and healthcare.

PubMed

Dugas, Martin; Neuhaus, Philipp; Meidt, Alexandra; Doods, Justin; Storck, Michael; Bruland, Philipp; Varghese, Julian

2016-01-01

Information systems are a key success factor for medical research and healthcare. Currently, most of these systems apply heterogeneous and proprietary data models, which impede data exchange and integrated data analysis for scientific purposes. Due to the complexity of medical terminology, the overall number of medical data models is very high. At present, the vast majority of these models are not available to the scientific community. The objective of the Portal of Medical Data Models (MDM, https://medical-data-models.org) is to foster sharing of medical data models. MDM is a registered European information infrastructure. It provides a multilingual platform for exchange and discussion of data models in medicine, both for medical research and healthcare. The system is developed in collaboration with the University Library of Münster to ensure sustainability. A web front-end enables users to search, view, download and discuss data models. Eleven different export formats are available (ODM, PDF, CDA, CSV, MACRO-XML, REDCap, SQL, SPSS, ADL, R, XLSX). MDM contents were analysed with descriptive statistics. MDM contains 4387 current versions of data models (in total 10,963 versions). 2475 of these models belong to oncology trials. The most common keyword (n = 3826) is 'Clinical Trial'; most frequent diseases are breast cancer, leukemia, lung and colorectal neoplasms. Most common languages of data elements are English (n = 328,557) and German (n = 68,738). Semantic annotations (UMLS codes) are available for 108,412 data items, 2453 item groups and 35,361 code list items. Overall 335,087 UMLS codes are assigned with 21,847 unique codes. Few UMLS codes are used several thousand times, but there is a long tail of rarely used codes in the frequency distribution. Expected benefits of the MDM portal are improved and accelerated design of medical data models by sharing best practice, more standardised data models with semantic annotation and better information exchange between information systems, in particular Electronic Data Capture (EDC) and Electronic Health Records (EHR) systems. Contents of the MDM portal need to be further expanded to reach broad coverage of all relevant medical domains. Database URL: https://medical-data-models.org. © The Author(s) 2016. Published by Oxford University Press.
A comparative analysis of the density of the SNOMED CT conceptual content for semantic harmonization

PubMed Central

He, Zhe; Geller, James; Chen, Yan

2015-01-01

Objectives Medical terminologies vary in the amount of concept information (the “density”) represented, even in the same sub-domains. This causes problems in terminology mapping, semantic harmonization and terminology integration. Moreover, complex clinical scenarios need to be encoded by a medical terminology with comprehensive content. SNOMED Clinical Terms (SNOMED CT), a leading clinical terminology, was reported to lack concepts and synonyms, problems that cannot be fully alleviated by using post-coordination. Therefore, a scalable solution is needed to enrich the conceptual content of SNOMED CT. We are developing a structure-based, algorithmic method to identify potential concepts for enriching the conceptual content of SNOMED CT and to support semantic harmonization of SNOMED CT with selected other Unified Medical Language System (UMLS) terminologies. Methods We first identified a subset of English terminologies in the UMLS that have ‘PAR’ relationship labeled with ‘IS_A’ and over 10% overlap with one or more of the 19 hierarchies of SNOMED CT. We call these “reference terminologies” and we note that our use of this name is different from the standard use. Next, we defined a set of topological patterns across pairs of terminologies, with SNOMED CT being one terminology in each pair and the other being one of the reference terminologies. We then explored how often these topological patterns appear between SNOMED CT and each reference terminology, and how to interpret them. Results Four viable reference terminologies were identified. Large density differences between terminologies were found. Expected interpretations of these differences were indeed observed, as follows. A random sample of 299 instances of special topological patterns (“2:3 and 3:2 trapezoids”) showed that 39.1% and 59.5% of analyzed concepts in SNOMED CT and in a reference terminology, respectively, were deemed to be alternative classifications of the same conceptual content. In 30.5% and 17.6% of the cases, it was found that intermediate concepts could be imported into SNOMED CT or into the reference terminology, respectively, to enhance their conceptual content, if approved by a human curator. Other cases included synonymy and errors in one of the terminologies. Conclusion These results show that structure-based algorithmic methods can be used to identify potential concepts to enrich SNOMED CT and the four reference terminologies. The comparative analysis has the future potential of supporting terminology authoring by suggesting new content to improve content coverage and semantic harmonization between terminologies. PMID:25890688
Sharing behavioral data through a grid infrastructure using data standards.

PubMed

Min, Hua; Ohira, Riki; Collins, Michael A; Bondy, Jessica; Avis, Nancy E; Tchuvatkina, Olga; Courtney, Paul K; Moser, Richard P; Shaikh, Abdul R; Hesse, Bradford W; Cooper, Mary; Reeves, Dianne; Lanese, Bob; Helba, Cindy; Miller, Suzanne M; Ross, Eric A

2014-01-01

In an effort to standardize behavioral measures and their data representation, the present study develops a methodology for incorporating measures found in the National Cancer Institute's (NCI) grid-enabled measures (GEM) portal, a repository for behavioral and social measures, into the cancer data standards registry and repository (caDSR). The methodology consists of four parts for curating GEM measures into the caDSR: (1) develop unified modeling language (UML) models for behavioral measures; (2) create common data elements (CDE) for UML components; (3) bind CDE with concepts from the NCI thesaurus; and (4) register CDE in the caDSR. UML models have been developed for four GEM measures, which have been registered in the caDSR as CDE. New behavioral concepts related to these measures have been created and incorporated into the NCI thesaurus. Best practices for representing measures using UML models have been utilized in the practice (eg, caDSR). One dataset based on a GEM-curated measure is available for use by other systems and users connected to the grid. Behavioral and population science data can be standardized by using and extending current standards. A new branch of CDE for behavioral science was developed for the caDSR. It expands the caDSR domain coverage beyond the clinical and biological areas. In addition, missing terms and concepts specific to the behavioral measures addressed in this paper were added to the NCI thesaurus. A methodology was developed and refined for curation of behavioral and population science data. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation

PubMed Central

2011-01-01

Background Evaluation of Word Sense Disambiguation (WSD) methods in the biomedical domain is difficult because the available resources are either too small or too focused on specific types of entities (e.g. diseases or genes). We present a method that can be used to automatically develop a WSD test collection using the Unified Medical Language System (UMLS) Metathesaurus and the manual MeSH indexing of MEDLINE. We demonstrate the use of this method by developing such a data set, called MSH WSD. Methods In our method, the Metathesaurus is first screened to identify ambiguous terms whose possible senses consist of two or more MeSH headings. We then use each ambiguous term and its corresponding MeSH heading to extract MEDLINE citations where the term and only one of the MeSH headings co-occur. The term found in the MEDLINE citation is automatically assigned the UMLS CUI linked to the MeSH heading. Each instance has been assigned a UMLS Concept Unique Identifier (CUI). We compare the characteristics of the MSH WSD data set to the previously existing NLM WSD data set. Results The resulting MSH WSD data set consists of 106 ambiguous abbreviations, 88 ambiguous terms and 9 which are a combination of both, for a total of 203 ambiguous entities. For each ambiguous term/abbreviation, the data set contains a maximum of 100 instances per sense obtained from MEDLINE. We evaluated the reliability of the MSH WSD data set using existing knowledge-based methods and compared their performance to that of the results previously obtained by these algorithms on the pre-existing data set, NLM WSD. We show that the knowledge-based methods achieve different results but keep their relative performance except for the Journal Descriptor Indexing (JDI) method, whose performance is below the other methods. Conclusions The MSH WSD data set allows the evaluation of WSD algorithms in the biomedical domain. Compared to previously existing data sets, MSH WSD contains a larger number of biomedical terms/abbreviations and covers the largest set of UMLS Semantic Types. Furthermore, the MSH WSD data set has been generated automatically reusing already existing annotations and, therefore, can be regenerated from subsequent UMLS versions. PMID:21635749
Refining Automatically Extracted Knowledge Bases Using Crowdsourcing

PubMed Central

Xian, Xuefeng; Cui, Zhiming

2017-01-01

Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost. PMID:28588611
Modeling stroke rehabilitation processes using the Unified Modeling Language (UML).

PubMed

Ferrante, Simona; Bonacina, Stefano; Pinciroli, Francesco

2013-10-01

In organising and providing rehabilitation procedures for stroke patients, the usual need for many refinements makes it inappropriate to attempt rigid standardisation, but greater detail is required concerning workflow. The aim of this study was to build a model of the post-stroke rehabilitation process. The model, implemented in the Unified Modeling Language, was grounded on international guidelines and refined following the clinical pathway adopted at local level by a specialized rehabilitation centre. The model describes the organisation of the rehabilitation delivery and it facilitates the monitoring of recovery during the process. Indeed, a system software was developed and tested to support clinicians in the digital administration of clinical scales. The model flexibility assures easy updating after process evolution. Copyright © 2013 Elsevier Ltd. All rights reserved.
Customer-experienced rapid prototyping

NASA Astrophysics Data System (ADS)

Zhang, Lijuan; Zhang, Fu; Li, Anbo

2008-12-01

In order to describe accurately and comprehend quickly the perfect GIS requirements, this article will integrate the ideas of QFD (Quality Function Deployment) and UML (Unified Modeling Language), and analyze the deficiency of prototype development model, and will propose the idea of the Customer-Experienced Rapid Prototyping (CE-RP) and describe in detail the process and framework of the CE-RP, from the angle of the characteristics of Modern-GIS. The CE-RP is mainly composed of Customer Tool-Sets (CTS), Developer Tool-Sets (DTS) and Barrier-Free Semantic Interpreter (BF-SI) and performed by two roles of customer and developer. The main purpose of the CE-RP is to produce the unified and authorized requirements data models between customer and software developer.
Semantator: semantic annotator for converting biomedical text to linked data.

PubMed

Tao, Cui; Song, Dezhao; Sharma, Deepak; Chute, Christopher G

2013-10-01

More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it challenging to easily browse and query the data of interest in them. One approach to facilitate browsing and querying biomedical text is to convert the plain text to a linked web of data, i.e., converting data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be either annotated manually or semi-automatically using plug-in information extraction tools. The annotated results will be stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be directly applied to the annotated data for consistency checking and knowledge inference. Semantator has been released online and was used by the biomedical ontology community who provided positive feedbacks. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the annotated results using Semantator can be easily used in Semantic-web-based reasoning tools for further inference. Copyright © 2013 Elsevier Inc. All rights reserved.
A Pilot Study on Modeling of Diagnostic Criteria Using OWL and SWRL.

PubMed

Hong, Na; Jiang, Guoqian; Pathak, Jyotishiman; Chute, Christopher G

2015-01-01

The objective of this study is to describe our efforts in a pilot study on modeling diagnostic criteria using a Semantic Web-based approach. We reused the basic framework of the ICD-11 content model and refined it into an operational model in the Web Ontology Language (OWL). The refinement is based on a bottom-up analysis method, in which we analyzed data elements (including value sets) in a collection (n=20) of randomly selected diagnostic criteria. We also performed a case study to formalize rule logic in the diagnostic criteria of metabolic syndrome using the Semantic Web Rule Language (SWRL). The results demonstrated that it is feasible to use OWL and SWRL to formalize the diagnostic criteria knowledge, and to execute the rules through reasoning.
The User Knows What to Call It: Incorporating Patient Voice Through User-Contributed Tags on a Participatory Platform About Health Management

PubMed Central

Carriere, Rachel M; Kaplan, Samantha Jan

2017-01-01

Background Body listening, described as the act of paying attention to the body’s signals and cues, can be an important component of long-term health management. Objective The aim of this study was to introduce and evaluate the Body Listening Project, an innovative effort to engage the public in the creation of a public resource—to leverage collective wisdom in the health domain. This project involved a website where people could contribute their experiences of and dialogue with others concerning body listening and self-management. This article presents an analysis of the tags contributed, with a focus on the value of these tags for knowledge organization and incorporation into consumer-friendly health information retrieval systems. Methods First, we performed content analysis of the tags contributed, identifying a set of categories and refining the relational structure of the categories to develop a preliminary classification scheme, the Body Listening and Self-Management Taxonomy. Second, we compared the concepts in the Body Listening and Self-Management Taxonomy with concepts that were automatically identified from an extant health knowledge resource, the Unified Medical Language System (UMLS), to better characterize the information that participants contributed. Third, we employed visualization techniques to explore the concept space of the tags. A correlation matrix, based on the extent to which categories tended to be assigned to the same tags, was used to study the interrelatedness of the taxonomy categories. Then a network visualization was used to investigate structural relationships among the categories in the taxonomy. Results First, we proposed a taxonomy called the Body Listening and Self-Management Taxonomy, with four meta-level categories: (1) health management strategies, (2) concepts and states, (3) influencers, and (4) health-related information behavior. This taxonomy could inform future efforts to organize knowledge and content of this subject matter. Second, we compared the categories from this taxonomy with the UMLS concepts that were identified. Though the UMLS offers benefits such as speed and breadth of coverage, the Body Listening and Self-Management Taxonomy is more consumer-centric. Third, the correlation matrix and network visualization demonstrated that there are natural areas of ambiguity and semantic relatedness in the meanings of the concepts in the Body Listening and Self-Management Taxonomy. Use of these visualizations can be helpful in practice settings, to help library and information science practitioners understand and resolve potential challenges in classification; in research, to characterize the structure of the conceptual space of health management; and in the development of consumer-centric health information retrieval systems. Conclusions A participatory platform can be employed to collect data concerning patient experiences of health management, which can in turn be used to develop new health knowledge resources or augment existing ones, as well as be incorporated into consumer-centric health information systems. PMID:28882809
The User Knows What to Call It: Incorporating Patient Voice Through User-Contributed Tags on a Participatory Platform About Health Management.

PubMed

Chen, Annie T; Carriere, Rachel M; Kaplan, Samantha Jan

2017-09-07

Body listening, described as the act of paying attention to the body's signals and cues, can be an important component of long-term health management. The aim of this study was to introduce and evaluate the Body Listening Project, an innovative effort to engage the public in the creation of a public resource-to leverage collective wisdom in the health domain. This project involved a website where people could contribute their experiences of and dialogue with others concerning body listening and self-management. This article presents an analysis of the tags contributed, with a focus on the value of these tags for knowledge organization and incorporation into consumer-friendly health information retrieval systems. First, we performed content analysis of the tags contributed, identifying a set of categories and refining the relational structure of the categories to develop a preliminary classification scheme, the Body Listening and Self-Management Taxonomy. Second, we compared the concepts in the Body Listening and Self-Management Taxonomy with concepts that were automatically identified from an extant health knowledge resource, the Unified Medical Language System (UMLS), to better characterize the information that participants contributed. Third, we employed visualization techniques to explore the concept space of the tags. A correlation matrix, based on the extent to which categories tended to be assigned to the same tags, was used to study the interrelatedness of the taxonomy categories. Then a network visualization was used to investigate structural relationships among the categories in the taxonomy. First, we proposed a taxonomy called the Body Listening and Self-Management Taxonomy, with four meta-level categories: (1) health management strategies, (2) concepts and states, (3) influencers, and (4) health-related information behavior. This taxonomy could inform future efforts to organize knowledge and content of this subject matter. Second, we compared the categories from this taxonomy with the UMLS concepts that were identified. Though the UMLS offers benefits such as speed and breadth of coverage, the Body Listening and Self-Management Taxonomy is more consumer-centric. Third, the correlation matrix and network visualization demonstrated that there are natural areas of ambiguity and semantic relatedness in the meanings of the concepts in the Body Listening and Self-Management Taxonomy. Use of these visualizations can be helpful in practice settings, to help library and information science practitioners understand and resolve potential challenges in classification; in research, to characterize the structure of the conceptual space of health management; and in the development of consumer-centric health information retrieval systems. A participatory platform can be employed to collect data concerning patient experiences of health management, which can in turn be used to develop new health knowledge resources or augment existing ones, as well as be incorporated into consumer-centric health information systems. ©Annie T Chen, Rachel M Carriere, Samantha Jan Kaplan. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 07.09.2017.
eClims: An Extensible and Dynamic Integration Framework for Biomedical Information Systems.

PubMed

Savonnet, Marinette; Leclercq, Eric; Naubourg, Pierre

2016-11-01

Biomedical information systems (BIS) require consideration of three types of variability: data variability induced by new high throughput technologies, schema or model variability induced by large scale studies or new fields of research, and knowledge variability resulting from new discoveries. Beyond data heterogeneity, managing variabilities in the context of BIS requires extensible and dynamic integration process. In this paper, we focus on data and schema variabilities and we propose an integration framework based on ontologies, master data, and semantic annotations. The framework addresses issues related to: 1) collaborative work through a dynamic integration process; 2) variability among studies using an annotation mechanism; and 3) quality control over data and semantic annotations. Our approach relies on two levels of knowledge: BIS-related knowledge is modeled using an application ontology coupled with UML models that allow controlling data completeness and consistency, and domain knowledge is described by a domain ontology, which ensures data coherence. A system build with the eClims framework has been implemented and evaluated in the context of a proteomic platform.
Knowledge-based approaches to the maintenance of a large controlled medical terminology.

PubMed Central

Cimino, J J; Clayton, P D; Hripcsak, G; Johnson, S B

1994-01-01

OBJECTIVE: Develop a knowledge-based representation for a controlled terminology of clinical information to facilitate creation, maintenance, and use of the terminology. DESIGN: The Medical Entities Dictionary (MED) is a semantic network, based on the Unified Medical Language System (UMLS), with a directed acyclic graph to represent multiple hierarchies. Terms from four hospital systems (laboratory, electrocardiography, medical records coding, and pharmacy) were added as nodes in the network. Additional knowledge about terms, added as semantic links, was used to assist in integration, harmonization, and automated classification of disparate terminologies. RESULTS: The MED contains 32,767 terms and is in active clinical use. Automated classification was successfully applied to terms for laboratory specimens, laboratory tests, and medications. One benefit of the approach has been the automated inclusion of medications into multiple pharmacologic and allergenic classes that were not present in the pharmacy system. Another benefit has been the reduction of maintenance efforts by 90%. CONCLUSION: The MED is a hybrid of terminology and knowledge. It provides domain coverage, synonymy, consistency of views, explicit relationships, and multiple classification while preventing redundancy, ambiguity (homonymy) and misclassification. PMID:7719786

Classification of clinically useful sentences in clinical evidence resources.

PubMed

Morid, Mohammad Amin; Fiszman, Marcelo; Raja, Kalpana; Jonnalagadda, Siddhartha R; Del Fiol, Guilherme

2016-04-01

Most patient care questions raised by clinicians can be answered by online clinical knowledge resources. However, important barriers still challenge the use of these resources at the point of care. To design and assess a method for extracting clinically useful sentences from synthesized online clinical resources that represent the most clinically useful information for directly answering clinicians' information needs. We developed a Kernel-based Bayesian Network classification model based on different domain-specific feature types extracted from sentences in a gold standard composed of 18 UpToDate documents. These features included UMLS concepts and their semantic groups, semantic predications extracted by SemRep, patient population identified by a pattern-based natural language processing (NLP) algorithm, and cue words extracted by a feature selection technique. Algorithm performance was measured in terms of precision, recall, and F-measure. The feature-rich approach yielded an F-measure of 74% versus 37% for a feature co-occurrence method (p<0.001). Excluding predication, population, semantic concept or text-based features reduced the F-measure to 62%, 66%, 58% and 69% respectively (p<0.01). The classifier applied to Medline sentences reached an F-measure of 73%, which is equivalent to the performance of the classifier on UpToDate sentences (p=0.62). The feature-rich approach significantly outperformed general baseline methods. This approach significantly outperformed classifiers based on a single type of feature. Different types of semantic features provided a unique contribution to overall classification performance. The classifier's model and features used for UpToDate generalized well to Medline abstracts. Copyright © 2016 Elsevier Inc. All rights reserved.
Conceptual mapping of user's queries to medical subject headings.

PubMed Central

Zieman, Y. L.; Bleich, H. L.

1997-01-01

This paper describes a way to map users' queries to relevant Medical Subject Headings (MeSH terms) used by the National Library of Medicine to index the biomedical literature. The method, called SENSE (SEarch with New SEmantics), transforms words and phrases in the users' queries into primary conceptual components and compares these components with those of the MeSH vocabulary. Similar to the way in which most numbers can be split into numerical factors and expressed as their product--for example, 42 can be expressed as 2*21, 6*7, 3*14, 2*3*7,--so most medical concepts can be split into "semantic factors" and expressed as their juxtaposition. Note that if we split 42 into its primary factors, the breakdown is unique: 2*3*7. Similarly, when we split medical concepts into their "primary semantic factors" the breakdown is also unique. For example, the MeSH term 'renovascular hypertension' can be split morphologically into reno, vascular, hyper, and tension--morphemes that can then be translated into their primary semantic factors--kidney, blood vessel, high, and pressure. By "factoring" each MeSH term in this way, and by similarly factoring the user's query, we can match query to MeSH term by searching for combinations of common factors. Unlike UMLS and other methods that match at the level of words or phrases, SENSE matches at the level of concepts; in this way, a wide variety of words and phrases that have the same meaning produce the same match. Now used in PaperChase, the method is surprisingly powerful in matching users' queries to Medical Subject Headings. PMID:9357680
Object-oriented integrated approach for the design of scalable ECG systems.

PubMed

Boskovic, Dusanka; Besic, Ingmar; Avdagic, Zikrija

2009-01-01

The paper presents the implementation of Object-Oriented (OO) integrated approaches to the design of scalable Electro-Cardio-Graph (ECG) Systems. The purpose of this methodology is to preserve real-world structure and relations with the aim to minimize the information loss during the process of modeling, especially for Real-Time (RT) systems. We report on a case study of the design that uses the integration of OO and RT methods and the Unified Modeling Language (UML) standard notation. OO methods identify objects in the real-world domain and use them as fundamental building blocks for the software system. The gained experience based on the strongly defined semantics of the object model is discussed and related problems are analyzed.
Semantics driven approach for knowledge acquisition from EMRs.

PubMed

Perera, Sujan; Henson, Cory; Thirunarayan, Krishnaprasad; Sheth, Amit; Nair, Suhas

2014-03-01

Semantic computing technologies have matured to be applicable to many critical domains such as national security, life sciences, and health care. However, the key to their success is the availability of a rich domain knowledge base. The creation and refinement of domain knowledge bases pose difficult challenges. The existing knowledge bases in the health care domain are rich in taxonomic relationships, but they lack nontaxonomic (domain) relationships. In this paper, we describe a semiautomatic technique for enriching existing domain knowledge bases with causal relationships gleaned from Electronic Medical Records (EMR) data. We determine missing causal relationships between domain concepts by validating domain knowledge against EMR data sources and leveraging semantic-based techniques to derive plausible relationships that can rectify knowledge gaps. Our evaluation demonstrates that semantic techniques can be employed to improve the efficiency of knowledge acquisition.
Cognitive search model and a new query paradigm

NASA Astrophysics Data System (ADS)

Xu, Zhonghui

2001-06-01

This paper proposes a cognitive model in which people begin to search pictures by using semantic content and find a right picture by judging whether its visual content is a proper visualization of the semantics desired. It is essential that human search is not just a process of matching computation on visual feature but rather a process of visualization of the semantic content known. For people to search electronic images in the way as they manually do in the model, we suggest that querying be a semantic-driven process like design. A query-by-design paradigm is prosed in the sense that what you design is what you find. Unlike query-by-example, query-by-design allows users to specify the semantic content through an iterative and incremental interaction process so that a retrieval can start with association and identification of the given semantic content and get refined while further visual cues are available. An experimental image retrieval system, Kuafu, has been under development using the query-by-design paradigm and an iconic language is adopted.
Assigning clinical codes with data-driven concept representation on Dutch clinical free text.

PubMed

Scheurwegs, Elyne; Luyckx, Kim; Luyten, Léon; Goethals, Bart; Daelemans, Walter

2017-05-01

Clinical codes are used for public reporting purposes, are fundamental to determining public financing for hospitals, and form the basis for reimbursement claims to insurance providers. They are assigned to a patient stay to reflect the diagnosis and performed procedures during that stay. This paper aims to enrich algorithms for automated clinical coding by taking a data-driven approach and by using unsupervised and semi-supervised techniques for the extraction of multi-word expressions that convey a generalisable medical meaning (referred to as concepts). Several methods for extracting concepts from text are compared, two of which are constructed from a large unannotated corpus of clinical free text. A distributional semantic model (i.c. the word2vec skip-gram model) is used to generalize over concepts and retrieve relations between them. These methods are validated on three sets of patient stay data, in the disease areas of urology, cardiology, and gastroenterology. The datasets are in Dutch, which introduces a limitation on available concept definitions from expert-based ontologies (e.g. UMLS). The results show that when expert-based knowledge in ontologies is unavailable, concepts derived from raw clinical texts are a reliable alternative. Both concepts derived from raw clinical texts perform and concepts derived from expert-created dictionaries outperform a bag-of-words approach in clinical code assignment. Adding features based on tokens that appear in a semantically similar context has a positive influence for predicting diagnostic codes. Furthermore, the experiments indicate that a distributional semantics model can find relations between semantically related concepts in texts but also introduces erroneous and redundant relations, which can undermine clinical coding performance. Copyright © 2017. Published by Elsevier Inc.
Supporting Collaborative Learning and Problem-Solving in a Constraint-Based CSCL Environment for UML Class Diagrams

ERIC Educational Resources Information Center

Baghaei, Nilufar; Mitrovic, Antonija; Irwin, Warwick

2007-01-01

We present COLLECT-UML, a constraint-based intelligent tutoring system (ITS) that teaches object-oriented analysis and design using Unified Modelling Language (UML). UML is easily the most popular object-oriented modelling technology in current practice. While teaching how to design UML class diagrams, COLLECT-UML also provides feedback on…
Using knowledge for indexing health web resources in a quality-controlled gateway.

PubMed

Joubert, Michel; Darmoni, Stefan J; Avillach, Paul; Dahamna, Badisse; Fieschi, Marius

2008-01-01

The aim of this study is to provide to indexers MeSH terms to be considered as major ones in a list of terms automatically extracted from a document. We propose a method combining symbolic knowledge - the UMLS Metathesaurus and Semantic Network - and statistical knowledge drawn from co-occurrences of terms in the CISMeF database (a French-language quality-controlled health gateway) using data mining measures. The method was tested on CISMeF corpus of 293 resources. There was a proportion of 0.37+/-0.26 major terms in the processed records. The method produced lists of terms with a proportion of terms initially pointed out as major of 0.54+/-0.31. The method we propose reduces the number of terms, which seem not useful for content description of resources, such as "check tags", but retains the most descriptive ones. Discarding these terms is accounted for by: 1) the removal by using semantic knowledge of associations of concepts bearing no real medical significance, 2) the removal by using statistical knowledge of nonstatistically significant associations of terms. This method can assist effectively indexers in their daily work and will be soon applied in the CISMeF system.
Semantics-informed geological maps: Conceptual modeling and knowledge encoding

NASA Astrophysics Data System (ADS)

Lombardo, Vincenzo; Piana, Fabrizio; Mimmo, Dario

2018-07-01

This paper introduces a novel, semantics-informed geologic mapping process, whose application domain is the production of a synthetic geologic map of a large administrative region. A number of approaches concerning the expression of geologic knowledge through UML schemata and ontologies have been around for more than a decade. These approaches have yielded resources that concern specific domains, such as, e.g., lithology. We develop a conceptual model that aims at building a digital encoding of several domains of geologic knowledge, in order to support the interoperability of the sources. We apply the devised terminological base to the classification of the elements of a geologic map of the Italian Western Alps and northern Apennines (Piemonte region). The digitally encoded knowledge base is a merged set of ontologies, called OntoGeonous. The encoding process identifies the objects of the semantic encoding, the geologic units, gathers the relevant information about such objects from authoritative resources, such as GeoSciML (giving priority to the application schemata reported in the INSPIRE Encoding Cookbook), and expresses the statements by means of axioms encoded in the Web Ontology Language (OWL). To support interoperability, OntoGeonous interlinks the general concepts by referring to the upper part level of ontology SWEET (developed by NASA), and imports knowledge that is already encoded in ontological format (e.g., ontology Simple Lithology). Machine-readable knowledge allows for consistency checking and for classification of the geological map data through algorithms of automatic reasoning.
[Modeling the requirements on routine data of general practitioners from the health-care researcher's point of view with the help of unified modeling langauge (UML)].

PubMed

Kersting, M; Hauswaldt, J; Lingner, H

2012-08-01

Health-care research is, besides primary acquired study data, based on data from widely differing secondary sources. In order to link, compare and analyze data sources uniform models and methods are needed. This could be facilitated by a more structured description of requirements, models and methods of health-care research than those currently used. Suitable methods of presentation were sought in an approach to this target and the unified modeling language (UML) identified as a possible alternative. Using different tools 3 UML diagrams were created to represent some individual aspects of a scientific use file (SUF): A use case diagram as well as an activity and a class diagram. In the use case diagram we attempted to represent the general use cases of an SUF based on general practitioners routine data. Secondly a class diagram was constructed to visualize the contents and structure of a SUF. Thirdly an activity diagram was developed to graphically represent the concept of a general practitioner's episode of care. The creation of the UML diagrams was possible without any technical difficulties. Regarding the content the 3 diagrams must still be considered as prototypes. The use case diagram shows possible uses and users of an SUF, e. g. a research worker, industry but also the general practitioner who supplies the data. The class diagram reveals a general data structure that can serve information processes in practice and research. Besides aggregation, possibilities for specialization and generalization are essential elements of the class diagram that can be used meaningfully. The activity diagram for the schematic representation of a general practitioner's episode of care reveals the existence of multiple endpoints of an episode and the possibility to form relationships by means of episodes (diagnosis>therapy). The constructed diagrams are preliminary results and should be refined in future steps. Use case diagrams enable a rapid overview of the meaning and purpose of a system, in this case an SUF. Class diagrams can help at a professional level to describe relationships between entities (classes/objects) more clearly than with the existing methods of representation. Activity diagrams are successors to classic flow charts. They are complemented appropriately by status diagrams. UML is suitable to uniformly and graphically describe a system (here an SUF) from various points of view. In future, validated UML models will help us to present scientific concepts and results in a more structured form than before and to promote the technological use of these concepts in practice. © Georg Thieme Verlag KG Stuttgart · New York.
Fully convolutional network with cluster for semantic segmentation

NASA Astrophysics Data System (ADS)

Ma, Xiao; Chen, Zhongbi; Zhang, Jianlin

2018-04-01

At present, image semantic segmentation technology has been an active research topic for scientists in the field of computer vision and artificial intelligence. Especially, the extensive research of deep neural network in image recognition greatly promotes the development of semantic segmentation. This paper puts forward a method based on fully convolutional network, by cluster algorithm k-means. The cluster algorithm using the image's low-level features and initializing the cluster centers by the super-pixel segmentation is proposed to correct the set of points with low reliability, which are mistakenly classified in great probability, by the set of points with high reliability in each clustering regions. This method refines the segmentation of the target contour and improves the accuracy of the image segmentation.
Business Process Modelling is an Essential Part of a Requirements Analysis. Contribution of EFMI Primary Care Working Group.

PubMed

de Lusignan, S; Krause, P; Michalakidis, G; Vicente, M Tristan; Thompson, S; McGilchrist, M; Sullivan, F; van Royen, P; Agreus, L; Desombre, T; Taweel, A; Delaney, B

2012-01-01

To perform a requirements analysis of the barriers to conducting research linking of primary care, genetic and cancer data. We extended our initial data-centric approach to include socio-cultural and business requirements. We created reference models of core data requirements common to most studies using unified modelling language (UML), dataflow diagrams (DFD) and business process modelling notation (BPMN). We conducted a stakeholder analysis and constructed DFD and UML diagrams for use cases based on simulated research studies. We used research output as a sensitivity analysis. Differences between the reference model and use cases identified study specific data requirements. The stakeholder analysis identified: tensions, changes in specification, some indifference from data providers and enthusiastic informaticians urging inclusion of socio-cultural context. We identified requirements to collect information at three levels: micro- data items, which need to be semantically interoperable, meso- the medical record and data extraction, and macro- the health system and socio-cultural issues. BPMN clarified complex business requirements among data providers and vendors; and additional geographical requirements for patients to be represented in both linked datasets. High quality research output was the norm for most repositories. Reference models provide high-level schemata of the core data requirements. However, business requirements' modelling identifies stakeholder issues and identifies what needs to be addressed to enable participation.
Assessment of disease named entity recognition on a corpus of annotated sentences.

PubMed

Jimeno, Antonio; Jimenez-Ruiz, Ernesto; Lee, Vivian; Gaudan, Sylvain; Berlanga, Rafael; Rebholz-Schuhmann, Dietrich

2008-04-11

In recent years, the recognition of semantic types from the biomedical scientific literature has been focused on named entities like protein and gene names (PGNs) and gene ontology terms (GO terms). Other semantic types like diseases have not received the same level of attention. Different solutions have been proposed to identify disease named entities in the scientific literature. While matching the terminology with language patterns suffers from low recall (e.g., Whatizit) other solutions make use of morpho-syntactic features to better cover the full scope of terminological variability (e.g., MetaMap). Currently, MetaMap that is provided from the National Library of Medicine (NLM) is the state of the art solution for the annotation of concepts from UMLS (Unified Medical Language System) in the literature. Nonetheless, its performance has not yet been assessed on an annotated corpus. In addition, little effort has been invested so far to generate an annotated dataset that links disease entities in text to disease entries in a database, thesaurus or ontology and that could serve as a gold standard to benchmark text mining solutions. As part of our research work, we have taken a corpus that has been delivered in the past for the identification of associations of genes to diseases based on the UMLS Metathesaurus and we have reprocessed and re-annotated the corpus. We have gathered annotations for disease entities from two curators, analyzed their disagreement (0.51 in the kappa-statistic) and composed a single annotated corpus for public use. Thereafter, three solutions for disease named entity recognition including MetaMap have been applied to the corpus to automatically annotate it with UMLS Metathesaurus concepts. The resulting annotations have been benchmarked to compare their performance. The annotated corpus is publicly available at ftp://ftp.ebi.ac.uk/pub/software/textmining/corpora/diseases and can serve as a benchmark to other systems. In addition, we found that dictionary look-up already provides competitive results indicating that the use of disease terminology is highly standardized throughout the terminologies and the literature. MetaMap generates precise results at the expense of insufficient recall while our statistical method obtains better recall at a lower precision rate. Even better results in terms of precision are achieved by combining at least two of the three methods leading, but this approach again lowers recall. Altogether, our analysis gives a better understanding of the complexity of disease annotations in the literature. MetaMap and the dictionary based approach are available through the Whatizit web service infrastructure (Rebholz-Schuhmann D, Arregui M, Gaudan S, Kirsch H, Jimeno A: Text processing through Web services: Calling Whatizit. Bioinformatics 2008, 24:296-298).
Programming with process groups: Group and multicast semantics

NASA Technical Reports Server (NTRS)

Birman, Kenneth P.; Cooper, Robert; Gleeson, Barry

1991-01-01

Process groups are a natural tool for distributed programming and are increasingly important in distributed computing environments. Discussed here is a new architecture that arose from an effort to simplify Isis process group semantics. The findings include a refined notion of how the clients of a group should be treated, what the properties of a multicast primitive should be when systems contain large numbers of overlapping groups, and a new construct called the causality domain. A system based on this architecture is now being implemented in collaboration with the Chorus and Mach projects.
Qualitative dynamics semantics for SBGN process description.

PubMed

Rougny, Adrien; Froidevaux, Christine; Calzone, Laurence; Paulevé, Loïc

2016-06-16

Qualitative dynamics semantics provide a coarse-grain modeling of networks dynamics by abstracting away kinetic parameters. They allow to capture general features of systems dynamics, such as attractors or reachability properties, for which scalable analyses exist. The Systems Biology Graphical Notation Process Description language (SBGN-PD) has become a standard to represent reaction networks. However, no qualitative dynamics semantics taking into account all the main features available in SBGN-PD had been proposed so far. We propose two qualitative dynamics semantics for SBGN-PD reaction networks, namely the general semantics and the stories semantics, that we formalize using asynchronous automata networks. While the general semantics extends standard Boolean semantics of reaction networks by taking into account all the main features of SBGN-PD, the stories semantics allows to model several molecules of a network by a unique variable. The obtained qualitative models can be checked against dynamical properties and therefore validated with respect to biological knowledge. We apply our framework to reason on the qualitative dynamics of a large network (more than 200 nodes) modeling the regulation of the cell cycle by RB/E2F. The proposed semantics provide a direct formalization of SBGN-PD networks in dynamical qualitative models that can be further analyzed using standard tools for discrete models. The dynamics in stories semantics have a lower dimension than the general one and prune multiple behaviors (which can be considered as spurious) by enforcing the mutual exclusiveness between the activity of different nodes of a same story. Overall, the qualitative semantics for SBGN-PD allow to capture efficiently important dynamical features of reaction network models and can be exploited to further refine them.
Noesis: Ontology based Scoped Search Engine and Resource Aggregator for Atmospheric Science

NASA Astrophysics Data System (ADS)

Ramachandran, R.; Movva, S.; Li, X.; Cherukuri, P.; Graves, S.

2006-12-01

The goal for search engines is to return results that are both accurate and complete. The search engines should find only what you really want and find everything you really want. Search engines (even meta search engines) lack semantics. The basis for search is simply based on string matching between the user's query term and the resource database and the semantics associated with the search string is not captured. For example, if an atmospheric scientist is searching for "pressure" related web resources, most search engines return inaccurate results such as web resources related to blood pressure. In this presentation Noesis, which is a meta-search engine and a resource aggregator that uses domain ontologies to provide scoped search capabilities will be described. Noesis uses domain ontologies to help the user scope the search query to ensure that the search results are both accurate and complete. The domain ontologies guide the user to refine their search query and thereby reduce the user's burden of experimenting with different search strings. Semantics are captured by refining the query terms to cover synonyms, specializations, generalizations and related concepts. Noesis also serves as a resource aggregator. It categorizes the search results from different online resources such as education materials, publications, datasets, web search engines that might be of interest to the user.
Clinical documentation variations and NLP system portability: a case study in asthma birth cohorts across institutions.

PubMed

Sohn, Sunghwan; Wang, Yanshan; Wi, Chung-Il; Krusemark, Elizabeth A; Ryu, Euijung; Ali, Mir H; Juhn, Young J; Liu, Hongfang

2017-11-30

To assess clinical documentation variations across health care institutions using different electronic medical record systems and investigate how they affect natural language processing (NLP) system portability. Birth cohorts from Mayo Clinic and Sanford Children's Hospital (SCH) were used in this study (n = 298 for each). Documentation variations regarding asthma between the 2 cohorts were examined in various aspects: (1) overall corpus at the word level (ie, lexical variation), (2) topics and asthma-related concepts (ie, semantic variation), and (3) clinical note types (ie, process variation). We compared those statistics and explored NLP system portability for asthma ascertainment in 2 stages: prototype and refinement. There exist notable lexical variations (word-level similarity = 0.669) and process variations (differences in major note types containing asthma-related concepts). However, semantic-level corpora were relatively homogeneous (topic similarity = 0.944, asthma-related concept similarity = 0.971). The NLP system for asthma ascertainment had an F-score of 0.937 at Mayo, and produced 0.813 (prototype) and 0.908 (refinement) when applied at SCH. The criteria for asthma ascertainment are largely dependent on asthma-related concepts. Therefore, we believe that semantic similarity is important to estimate NLP system portability. As the Mayo Clinic and SCH corpora were relatively homogeneous at a semantic level, the NLP system, developed at Mayo Clinic, was imported to SCH successfully with proper adjustments to deal with the intrinsic corpus heterogeneity. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com
The Unified Medical Language System (UMLS): integrating biomedical terminology

PubMed Central

Bodenreider, Olivier

2004-01-01

The Unified Medical Language System (http://umlsks.nlm.nih.gov) is a repository of biomedical vocabularies developed by the US National Library of Medicine. The UMLS integrates over 2 million names for some 900 000 concepts from more than 60 families of biomedical vocabularies, as well as 12 million relations among these concepts. Vocabularies integrated in the UMLS Metathesaurus include the NCBI taxonomy, Gene Ontology, the Medical Subject Headings (MeSH), OMIM and the Digital Anatomist Symbolic Knowledge Base. UMLS concepts are not only inter-related, but may also be linked to external resources such as GenBank. In addition to data, the UMLS includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap). The UMLS knowledge sources are updated quarterly. All vocabularies are available at no fee for research purposes within an institution, but UMLS users are required to sign a license agreement. The UMLS knowledge sources are distributed on CD-ROM and by FTP. PMID:14681409
The Unified Medical Language System (UMLS): integrating biomedical terminology.

PubMed

Bodenreider, Olivier

2004-01-01

The Unified Medical Language System (http://umlsks.nlm.nih.gov) is a repository of biomedical vocabularies developed by the US National Library of Medicine. The UMLS integrates over 2 million names for some 900,000 concepts from more than 60 families of biomedical vocabularies, as well as 12 million relations among these concepts. Vocabularies integrated in the UMLS Metathesaurus include the NCBI taxonomy, Gene Ontology, the Medical Subject Headings (MeSH), OMIM and the Digital Anatomist Symbolic Knowledge Base. UMLS concepts are not only inter-related, but may also be linked to external resources such as GenBank. In addition to data, the UMLS includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap). The UMLS knowledge sources are updated quarterly. All vocabularies are available at no fee for research purposes within an institution, but UMLS users are required to sign a license agreement. The UMLS knowledge sources are distributed on CD-ROM and by FTP.
Model-based testing with UML applied to a roaming algorithm for bluetooth devices.

PubMed

Dai, Zhen Ru; Grabowski, Jens; Neukirchen, Helmut; Pals, Holger

2004-11-01

In late 2001, the Object Management Group issued a Request for Proposal to develop a testing profile for UML 2.0. In June 2003, the work on the UML 2.0 Testing Profile was finally adopted by the OMG. Since March 2004, it has become an official standard of the OMG. The UML 2.0 Testing Profile provides support for UML based model-driven testing. This paper introduces a methodology on how to use the testing profile in order to modify and extend an existing UML design model for test issues. The application of the methodology will be explained by applying it to an existing UML Model for a Bluetooth device.

Enhanced biodegradation of hydrocarbons in petroleum tank bottom oil sludge and characterization of biocatalysts and biosurfactants.

PubMed

Suganthi, S Hepziba; Murshid, Shabnam; Sriram, Sriswarna; Ramani, K

2018-08-15

Petroleum hydrocarbon removal from tank bottom oil sludge is a major issue due to its properties. Conventional physicochemical treatment techniques are less effective. Though the bioremediation is considered for the hydrocarbon removal from tank bottom oil sludge, the efficiency is low and time taking due to the low yield of biocatalysts and biosurfactants. The focal theme of the present investigation is to modify the process by introducing the intermittent inoculation for the enhanced biodegradation of hydrocarbons in the tank bottom oil sludge by maintaining a constant level of biocatalysts such as oxidoreductase, catalase, and lipase as well as biosurfactants. In addition, the heavy metal removal was also addressed. The microbial consortia comprising Shewanalla chilikensis, Bacillus firmus, and Halomonas hamiltonii was used for the biodegradation of oil sludge. One variable at a time approach was used for the optimum of culture conditions. The bacterial consortia degraded the oil sludge by producing biocatalysts such as lipase (80 U/ml), catalase (46 U/ml), oxidoreductase (68 U/ml) along with the production of lipoprotein biosurfactant (152 mg/g of oil sludge) constantly and achieved 96% reduction of total petroleum hydrocarbon. The crude enzymes were characterized by FT-IR and the biosurfactant was characterized by surface tension reduction, emulsification index, FT-IR, TLC, and SDS-PAGE. GC-MS and NMR also revealed that the hydrocarbons present in the oil sludge were effectively degraded by the microbial consortia. The ICP-OES result indicated that the microbial consortium is also effective in removing the heavy metals. Hence, bioremediation using the hydrocarbonoclastic microbial consortium can be considered as an environmentally friendly process for disposal of tank bottom oil sludge from petroleum oil refining industry. Copyright © 2018 Elsevier Ltd. All rights reserved.
Towards a semantic PACS: Using Semantic Web technology to represent imaging data.

PubMed

Van Soest, Johan; Lustberg, Tim; Grittner, Detlef; Marshall, M Scott; Persoon, Lucas; Nijsten, Bas; Feltens, Peter; Dekker, Andre

2014-01-01

The DICOM standard is ubiquitous within medicine. However, improved DICOM semantics would significantly enhance search operations. Furthermore, databases of current PACS systems are not flexible enough for the demands within image analysis research. In this paper, we investigated if we can use Semantic Web technology, to store and represent metadata of DICOM image files, as well as linking additional computational results to image metadata. Therefore, we developed a proof of concept containing two applications: one to store commonly used DICOM metadata in an RDF repository, and one to calculate imaging biomarkers based on DICOM images, and store the biomarker values in an RDF repository. This enabled us to search for all patients with a gross tumor volume calculated to be larger than 50 cc. We have shown that we can successfully store the DICOM metadata in an RDF repository and are refining our proof of concept with regards to volume naming, value representation, and the applications themselves.
Indexing method of digital audiovisual medical resources with semantic Web integration.

PubMed

Cuggia, Marc; Mougin, Fleur; Le Beux, Pierre

2005-03-01

Digitalization of audiovisual resources and network capability offer many possibilities which are the subject of intensive work in scientific and industrial sectors. Indexing such resources is a major challenge. Recently, the Motion Pictures Expert Group (MPEG) has developed MPEG-7, a standard for describing multimedia content. The goal of this standard is to develop a rich set of standardized tools to enable efficient retrieval from digital archives or the filtering of audiovisual broadcasts on the Internet. How could this kind of technology be used in the medical context? In this paper, we propose a simpler indexing system, based on the Dublin Core standard and compliant to MPEG-7. We use MeSH and the UMLS to introduce conceptual navigation. We also present a video-platform which enables encoding and gives access to audiovisual resources in streaming mode.
The Unified Medical Language System

PubMed Central

Humphreys, Betsy L.; Lindberg, Donald A. B.; Schoolman, Harold M.; Barnett, G. Octo

1998-01-01

In 1986, the National Library of Medicine (NLM) assembled a large multidisciplinary, multisite team to work on the Unified Medical Language System (UMLS), a collaborative research project aimed at reducing fundamental barriers to the application of computers to medicine. Beyond its tangible products, the UMLS Knowledge Sources, and its influence on the field of informatics, the UMLS project is an interesting case study in collaborative research and development. It illustrates the strengths and challenges of substantive collaboration among widely distributed research groups. Over the past decade, advances in computing and communications have minimized the technical difficulties associated with UMLS collaboration and also facilitated the development, dissemination, and use of the UMLS Knowledge Sources. The spread of the World Wide Web has increased the visibility of the information access problems caused by multiple vocabularies and many information sources which are the focus of UMLS work. The time is propitious for building on UMLS accomplishments and making more progress on the informatics research issues first highlighted by the UMLS project more than 10 years ago. PMID:9452981
The Unified Medical Language System: an informatics research collaboration.

PubMed

Humphreys, B L; Lindberg, D A; Schoolman, H M; Barnett, G O

1998-01-01

In 1986, the National Library of Medicine (NLM) assembled a large multidisciplinary, multisite team to work on the Unified Medical Language System (UMLS), a collaborative research project aimed at reducing fundamental barriers to the application of computers to medicine. Beyond its tangible products, the UMLS Knowledge Sources, and its influence on the field of informatics, the UMLS project is an interesting case study in collaborative research and development. It illustrates the strengths and challenges of substantive collaboration among widely distributed research groups. Over the past decade, advances in computing and communications have minimized the technical difficulties associated with UMLS collaboration and also facilitated the development, dissemination, and use of the UMLS Knowledge Sources. The spread of the World Wide Web has increased the visibility of the information access problems caused by multiple vocabularies and many information sources which are the focus of UMLS work. The time is propitious for building on UMLS accomplishments and making more progress on the informatics research issues first highlighted by the UMLS project more than 10 years ago.
Effects of perceptual and semantic cues on ERP modulations associated with prospective memory.

PubMed

Cousens, Ross; Cutmore, Timothy; Wang, Ya; Wilson, Jennifer; Chan, Raymond C K; Shum, David H K

2015-10-01

Prospective memory involves the formation and execution of intended actions and is essential for autonomous living. In this study (N=32), the effect of the nature of PM cues (semantic versus perceptual) on established event-related potentials (ERPs) elicited in PM tasks (N300 and prospective positivity) was investigated. PM cues defined by their perceptual features clearly elicited the N300 and prospective positivity whereas PM cues defined by semantic relatedness elicited prospective positivity. This calls into question the view that the N300 is a marker of general processes underlying detection of PM cues, but supports existing research showing that prospective positivity represents general post-retrieval processes that follow detection of PM cues. Continued refinement of ERP paradigms for understanding the neural correlates of PM is needed. Copyright © 2015 Elsevier B.V. All rights reserved.
In the pursuit of a semantic similarity metric based on UMLS annotations for articles in PubMed Central Open Access.

PubMed

Garcia Castro, Leyla Jael; Berlanga, Rafael; Garcia, Alexander

2015-10-01

Although full-text articles are provided by the publishers in electronic formats, it remains a challenge to find related work beyond the title and abstract context. Identifying related articles based on their abstract is indeed a good starting point; this process is straightforward and does not consume as many resources as full-text based similarity would require. However, further analyses may require in-depth understanding of the full content. Two articles with highly related abstracts can be substantially different regarding the full content. How similarity differs when considering title-and-abstract versus full-text and which semantic similarity metric provides better results when dealing with full-text articles are the main issues addressed in this manuscript. We have benchmarked three similarity metrics - BM25, PMRA, and Cosine, in order to determine which one performs best when using concept-based annotations on full-text documents. We also evaluated variations in similarity values based on title-and-abstract against those relying on full-text. Our test dataset comprises the Genomics track article collection from the 2005 Text Retrieval Conference. Initially, we used an entity recognition software to semantically annotate titles and abstracts as well as full-text with concepts defined in the Unified Medical Language System (UMLS®). For each article, we created a document profile, i.e., a set of identified concepts, term frequency, and inverse document frequency; we then applied various similarity metrics to those document profiles. We considered correlation, precision, recall, and F1 in order to determine which similarity metric performs best with concept-based annotations. For those full-text articles available in PubMed Central Open Access (PMC-OA), we also performed dispersion analyses in order to understand how similarity varies when considering full-text articles. We have found that the PubMed Related Articles similarity metric is the most suitable for full-text articles annotated with UMLS concepts. For similarity values above 0.8, all metrics exhibited an F1 around 0.2 and a recall around 0.1; BM25 showed the highest precision close to 1; in all cases the concept-based metrics performed better than the word-stem-based one. Our experiments show that similarity values vary when considering only title-and-abstract versus full-text similarity. Therefore, analyses based on full-text become useful when a given research requires going beyond title and abstract, particularly regarding connectivity across articles. Visualization available at ljgarcia.github.io/semsim.benchmark/, data available at http://dx.doi.org/10.5281/zenodo.13323. Copyright © 2015 Elsevier Inc. All rights reserved.
Preface to FP-UML 2009

NASA Astrophysics Data System (ADS)

Trujillo, Juan; Kim, Dae-Kyoo

The Unified Modeling Language (UML) has been widely accepted as the standard object-oriented (OO) modeling language for modeling various aspects of software and information systems. The UML is an extensible language, in the sense that it provides mechanisms to introduce new elements for specific domains if necessary, such as web applications, database applications, business modeling, software development processes, data warehouses. Furthermore, the latest version of UML 2.0 got even bigger and more complicated with more diagrams for some good reasons. Although UML provides different diagrams for modeling different aspects of a software system, not all of them need to be applied in most cases. Therefore, heuristics, design guidelines, lessons learned from experiences are extremely important for the effective use of UML 2.0 and to avoid unnecessary complication. Also, approaches are needed to better manage UML 2.0 and its extensions so they do not become too complex too manage in the end.
k-neighborhood Decentralization: A Comprehensive Solution to Index the UMLS for Large Scale Knowledge Discovery

PubMed Central

Xiang, Yang; Lu, Kewei; James, Stephen L.; Borlawsky, Tara B.; Huang, Kun; Payne, Philip R.O.

2011-01-01

The Unified Medical Language System (UMLS) is the largest thesaurus in the biomedical informatics domain. Previous works have shown that knowledge constructs comprised of transitively-associated UMLS concepts are effective for discovering potentially novel biomedical hypotheses. However, the extremely large size of the UMLS becomes a major challenge for these applications. To address this problem, we designed a k-neighborhood Decentralization Labeling Scheme (kDLS) for the UMLS, and the corresponding method to effectively evaluate the kDLS indexing results. kDLS provides a comprehensive solution for indexing the UMLS for very efficient large scale knowledge discovery. We demonstrated that it is highly effective to use kDLS paths to prioritize disease-gene relations across the whole genome, with extremely high fold-enrichment values. To our knowledge, this is the first indexing scheme capable of supporting efficient large scale knowledge discovery on the UMLS as a whole. Our expectation is that kDLS will become a vital engine for retrieving information and generating hypotheses from the UMLS for future medical informatics applications. PMID:22154838
k-Neighborhood decentralization: a comprehensive solution to index the UMLS for large scale knowledge discovery.

PubMed

Xiang, Yang; Lu, Kewei; James, Stephen L; Borlawsky, Tara B; Huang, Kun; Payne, Philip R O

2012-04-01

The Unified Medical Language System (UMLS) is the largest thesaurus in the biomedical informatics domain. Previous works have shown that knowledge constructs comprised of transitively-associated UMLS concepts are effective for discovering potentially novel biomedical hypotheses. However, the extremely large size of the UMLS becomes a major challenge for these applications. To address this problem, we designed a k-neighborhood Decentralization Labeling Scheme (kDLS) for the UMLS, and the corresponding method to effectively evaluate the kDLS indexing results. kDLS provides a comprehensive solution for indexing the UMLS for very efficient large scale knowledge discovery. We demonstrated that it is highly effective to use kDLS paths to prioritize disease-gene relations across the whole genome, with extremely high fold-enrichment values. To our knowledge, this is the first indexing scheme capable of supporting efficient large scale knowledge discovery on the UMLS as a whole. Our expectation is that kDLS will become a vital engine for retrieving information and generating hypotheses from the UMLS for future medical informatics applications. Copyright Â© 2011 Elsevier Inc. All rights reserved.
Standardizing clinical trials workflow representation in UML for international site comparison.

PubMed

de Carvalho, Elias Cesar Araujo; Jayanti, Madhav Kishore; Batilana, Adelia Portero; Kozan, Andreia M O; Rodrigues, Maria J; Shah, Jatin; Loures, Marco R; Patil, Sunita; Payne, Philip; Pietrobon, Ricardo

2010-11-09

With the globalization of clinical trials, a growing emphasis has been placed on the standardization of the workflow in order to ensure the reproducibility and reliability of the overall trial. Despite the importance of workflow evaluation, to our knowledge no previous studies have attempted to adapt existing modeling languages to standardize the representation of clinical trials. Unified Modeling Language (UML) is a computational language that can be used to model operational workflow, and a UML profile can be developed to standardize UML models within a given domain. This paper's objective is to develop a UML profile to extend the UML Activity Diagram schema into the clinical trials domain, defining a standard representation for clinical trial workflow diagrams in UML. Two Brazilian clinical trial sites in rheumatology and oncology were examined to model their workflow and collect time-motion data. UML modeling was conducted in Eclipse, and a UML profile was developed to incorporate information used in discrete event simulation software. Ethnographic observation revealed bottlenecks in workflow: these included tasks requiring full commitment of CRCs, transferring notes from paper to computers, deviations from standard operating procedures, and conflicts between different IT systems. Time-motion analysis revealed that nurses' activities took up the most time in the workflow and contained a high frequency of shorter duration activities. Administrative assistants performed more activities near the beginning and end of the workflow. Overall, clinical trial tasks had a greater frequency than clinic routines or other general activities. This paper describes a method for modeling clinical trial workflow in UML and standardizing these workflow diagrams through a UML profile. In the increasingly global environment of clinical trials, the standardization of workflow modeling is a necessary precursor to conducting a comparative analysis of international clinical trials workflows.
Standardizing Clinical Trials Workflow Representation in UML for International Site Comparison

PubMed Central

de Carvalho, Elias Cesar Araujo; Jayanti, Madhav Kishore; Batilana, Adelia Portero; Kozan, Andreia M. O.; Rodrigues, Maria J.; Shah, Jatin; Loures, Marco R.; Patil, Sunita; Payne, Philip; Pietrobon, Ricardo

2010-01-01

Background With the globalization of clinical trials, a growing emphasis has been placed on the standardization of the workflow in order to ensure the reproducibility and reliability of the overall trial. Despite the importance of workflow evaluation, to our knowledge no previous studies have attempted to adapt existing modeling languages to standardize the representation of clinical trials. Unified Modeling Language (UML) is a computational language that can be used to model operational workflow, and a UML profile can be developed to standardize UML models within a given domain. This paper's objective is to develop a UML profile to extend the UML Activity Diagram schema into the clinical trials domain, defining a standard representation for clinical trial workflow diagrams in UML. Methods Two Brazilian clinical trial sites in rheumatology and oncology were examined to model their workflow and collect time-motion data. UML modeling was conducted in Eclipse, and a UML profile was developed to incorporate information used in discrete event simulation software. Results Ethnographic observation revealed bottlenecks in workflow: these included tasks requiring full commitment of CRCs, transferring notes from paper to computers, deviations from standard operating procedures, and conflicts between different IT systems. Time-motion analysis revealed that nurses' activities took up the most time in the workflow and contained a high frequency of shorter duration activities. Administrative assistants performed more activities near the beginning and end of the workflow. Overall, clinical trial tasks had a greater frequency than clinic routines or other general activities. Conclusions This paper describes a method for modeling clinical trial workflow in UML and standardizing these workflow diagrams through a UML profile. In the increasingly global environment of clinical trials, the standardization of workflow modeling is a necessary precursor to conducting a comparative analysis of international clinical trials workflows. PMID:21085484
Building validation tools for knowledge-based systems

NASA Technical Reports Server (NTRS)

Stachowitz, R. A.; Chang, C. L.; Stock, T. S.; Combs, J. B.

1987-01-01

The Expert Systems Validation Associate (EVA), a validation system under development at the Lockheed Artificial Intelligence Center for more than a year, provides a wide range of validation tools to check the correctness, consistency and completeness of a knowledge-based system. A declarative meta-language (higher-order language), is used to create a generic version of EVA to validate applications written in arbitrary expert system shells. The architecture and functionality of EVA are presented. The functionality includes Structure Check, Logic Check, Extended Structure Check (using semantic information), Extended Logic Check, Semantic Check, Omission Check, Rule Refinement, Control Check, Test Case Generation, Error Localization, and Behavior Verification.
minimUML: A Minimalist Approach to UML Diagramming for Early Computer Science Education

ERIC Educational Resources Information Center

Turner, Scott A.; Perez-Quinones, Manuel A.; Edwards, Stephen H.

2005-01-01

In introductory computer science courses, the Unified Modeling Language (UML) is commonly used to teach basic object-oriented design. However, there appears to be a lack of suitable software to support this task. Many of the available programs that support UML focus on developing code and not on enhancing learning. Programs designed for…
Some Aspects of Language Development in Middle Childhood.

ERIC Educational Resources Information Center

Hoar, Nancy

The middle childhood years are a period of refinement of the semantics and syntax acquired in the early years, of substantial metalinguistic development, and of subtle changes in actual processing strategies. In a study undertaken to determine how these three factors interact, children aged 6 to 11 were asked to produce and recognize paraphrases.…
Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies

PubMed Central

Köhler, Sebastian; Schulz, Marcel H.; Krawitz, Peter; Bauer, Sebastian; Dölken, Sandra; Ott, Claus E.; Mundlos, Christine; Horn, Denise; Mundlos, Stefan; Robinson, Peter N.

2009-01-01

The differential diagnostic process attempts to identify candidate diseases that best explain a set of clinical features. This process can be complicated by the fact that the features can have varying degrees of specificity, as well as by the presence of features unrelated to the disease itself. Depending on the experience of the physician and the availability of laboratory tests, clinical abnormalities may be described in greater or lesser detail. We have adapted semantic similarity metrics to measure phenotypic similarity between queries and hereditary diseases annotated with the use of the Human Phenotype Ontology (HPO) and have developed a statistical model to assign p values to the resulting similarity scores, which can be used to rank the candidate diseases. We show that our approach outperforms simpler term-matching approaches that do not take the semantic interrelationships between terms into account. The advantage of our approach was greater for queries containing phenotypic noise or imprecise clinical descriptions. The semantic network defined by the HPO can be used to refine the differential diagnosis by suggesting clinical features that, if present, best differentiate among the candidate diagnoses. Thus, semantic similarity searches in ontologies represent a useful way of harnessing the semantic structure of human phenotypic abnormalities to help with the differential diagnosis. We have implemented our methods in a freely available web application for the field of human Mendelian disorders. PMID:19800049
Development and empirical user-centered evaluation of semantically-based query recommendation for an electronic health record search engine.

PubMed

Hanauer, David A; Wu, Danny T Y; Yang, Lei; Mei, Qiaozhu; Murkowski-Steffy, Katherine B; Vydiswaran, V G Vinod; Zheng, Kai

2017-03-01

The utility of biomedical information retrieval environments can be severely limited when users lack expertise in constructing effective search queries. To address this issue, we developed a computer-based query recommendation algorithm that suggests semantically interchangeable terms based on an initial user-entered query. In this study, we assessed the value of this approach, which has broad applicability in biomedical information retrieval, by demonstrating its application as part of a search engine that facilitates retrieval of information from electronic health records (EHRs). The query recommendation algorithm utilizes MetaMap to identify medical concepts from search queries and indexed EHR documents. Synonym variants from UMLS are used to expand the concepts along with a synonym set curated from historical EHR search logs. The empirical study involved 33 clinicians and staff who evaluated the system through a set of simulated EHR search tasks. User acceptance was assessed using the widely used technology acceptance model. The search engine's performance was rated consistently higher with the query recommendation feature turned on vs. off. The relevance of computer-recommended search terms was also rated high, and in most cases the participants had not thought of these terms on their own. The questions on perceived usefulness and perceived ease of use received overwhelmingly positive responses. A vast majority of the participants wanted the query recommendation feature to be available to assist in their day-to-day EHR search tasks. Challenges persist for users to construct effective search queries when retrieving information from biomedical documents including those from EHRs. This study demonstrates that semantically-based query recommendation is a viable solution to addressing this challenge. Published by Elsevier Inc.
Recommending Education Materials for Diabetic Questions Using Information Retrieval Approaches

PubMed Central

Wang, Yanshan; Shen, Feichen; Liu, Sijia; Rastegar-Mojarad, Majid; Wang, Liwei

2017-01-01

Background Self-management is crucial to diabetes care and providing expert-vetted content for answering patients’ questions is crucial in facilitating patient self-management. Objective The aim is to investigate the use of information retrieval techniques in recommending patient education materials for diabetic questions of patients. Methods We compared two retrieval algorithms, one based on Latent Dirichlet Allocation topic modeling (topic modeling-based model) and one based on semantic group (semantic group-based model), with the baseline retrieval models, vector space model (VSM), in recommending diabetic patient education materials to diabetic questions posted on the TuDiabetes forum. The evaluation was based on a gold standard dataset consisting of 50 randomly selected diabetic questions where the relevancy of diabetic education materials to the questions was manually assigned by two experts. The performance was assessed using precision of top-ranked documents. Results We retrieved 7510 diabetic questions on the forum and 144 diabetic patient educational materials from the patient education database at Mayo Clinic. The mapping rate of words in each corpus mapped to the Unified Medical Language System (UMLS) was significantly different (P<.001). The topic modeling-based model outperformed the other retrieval algorithms. For example, for the top-retrieved document, the precision of the topic modeling-based, semantic group-based, and VSM models was 67.0%, 62.8%, and 54.3%, respectively. Conclusions This study demonstrated that topic modeling can mitigate the vocabulary difference and it achieved the best performance in recommending education materials for answering patients’ questions. One direction for future work is to assess the generalizability of our findings and to extend our study to other disease areas, other patient education material resources, and online forums. PMID:29038097
Creating Shareable Clinical Decision Support Rules for a Pharmacogenomics Clinical Guideline Using Structured Knowledge Representation.

PubMed

Linan, Margaret K; Sottara, Davide; Freimuth, Robert R

2015-01-01

Pharmacogenomics (PGx) guidelines contain drug-gene relationships, therapeutic and clinical recommendations from which clinical decision support (CDS) rules can be extracted, rendered and then delivered through clinical decision support systems (CDSS) to provide clinicians with just-in-time information at the point of care. Several tools exist that can be used to generate CDS rules that are based on computer interpretable guidelines (CIG), but none have been previously applied to the PGx domain. We utilized the Unified Modeling Language (UML), the Health Level 7 virtual medical record (HL7 vMR) model, and standard terminologies to represent the semantics and decision logic derived from a PGx guideline, which were then mapped to the Health eDecisions (HeD) schema. The modeling and extraction processes developed here demonstrate how structured knowledge representations can be used to support the creation of shareable CDS rules from PGx guidelines.
Tracking changes in search behaviour at a health web site.

PubMed

Eklund, Ann-Marie

2012-01-01

Nowadays, the internet is used as a means to provide the public with official information on many different topics, including health related matters and care providers. In this work we have studied a search log from the official Swedish health web site 1177.se for patterns of search behaviour over time. To improve the analysis, we mapped the queries to UMLS semantic types and MeSH categories. Our analysis shows that, as expected, diseases and health care activities are the ones of most interest, but also a clear increased interest in geographical locations in the setting of health care providers. We also note a change over time in which kinds of diseases are of interest. Finally, we conclude that this type of analysis may be useful in studies of what health related topics matter to the public, but also for design and follow-up of public information campaigns.

Enhanced LOD Concepts for Virtual 3d City Models

NASA Astrophysics Data System (ADS)

Benner, J.; Geiger, A.; Gröger, G.; Häfele, K.-H.; Löwner, M.-O.

2013-09-01

Virtual 3D city models contain digital three dimensional representations of city objects like buildings, streets or technical infrastructure. Because size and complexity of these models continuously grow, a Level of Detail (LoD) concept effectively supporting the partitioning of a complete model into alternative models of different complexity and providing metadata, addressing informational content, complexity and quality of each alternative model is indispensable. After a short overview on various LoD concepts, this paper discusses the existing LoD concept of the CityGML standard for 3D city models and identifies a number of deficits. Based on this analysis, an alternative concept is developed and illustrated with several examples. It differentiates between first, a Geometric Level of Detail (GLoD) and a Semantic Level of Detail (SLoD), and second between the interior building and its exterior shell. Finally, a possible implementation of the new concept is demonstrated by means of an UML model.
A model for indexing medical documents combining statistical and symbolic knowledge.

PubMed

Avillach, Paul; Joubert, Michel; Fieschi, Marius

2007-10-11

To develop and evaluate an information processing method based on terminologies, in order to index medical documents in any given documentary context. We designed a model using both symbolic general knowledge extracted from the Unified Medical Language System (UMLS) and statistical knowledge extracted from a domain of application. Using statistical knowledge allowed us to contextualize the general knowledge for every particular situation. For each document studied, the extracted terms are ranked to highlight the most significant ones. The model was tested on a set of 17,079 French standardized discharge summaries (SDSs). The most important ICD-10 term of each SDS was ranked 1st or 2nd by the method in nearly 90% of the cases. The use of several terminologies leads to more precise indexing. The improvement achieved in the models implementation performances as a result of using semantic relationships is encouraging.
A Model for Indexing Medical Documents Combining Statistical and Symbolic Knowledge.

PubMed Central

Avillach, Paul; Joubert, Michel; Fieschi, Marius

2007-01-01

OBJECTIVES: To develop and evaluate an information processing method based on terminologies, in order to index medical documents in any given documentary context. METHODS: We designed a model using both symbolic general knowledge extracted from the Unified Medical Language System (UMLS) and statistical knowledge extracted from a domain of application. Using statistical knowledge allowed us to contextualize the general knowledge for every particular situation. For each document studied, the extracted terms are ranked to highlight the most significant ones. The model was tested on a set of 17,079 French standardized discharge summaries (SDSs). RESULTS: The most important ICD-10 term of each SDS was ranked 1st or 2nd by the method in nearly 90% of the cases. CONCLUSIONS: The use of several terminologies leads to more precise indexing. The improvement achieved in the model’s implementation performances as a result of using semantic relationships is encouraging. PMID:18693792
Longitudinal Analysis of New Information Types in Clinical Notes

PubMed Central

Zhang, Rui; Pakhomov, Serguei; Melton, Genevieve B.

2014-01-01

It is increasingly recognized that redundant information in clinical notes within electronic health record (EHR) systems is ubiquitous, significant, and may negatively impact the secondary use of these notes for research and patient care. We investigated several automated methods to identify redundant versus relevant new information in clinical reports. These methods may provide a valuable approach to extract clinically pertinent information and further improve the accuracy of clinical information extraction systems. In this study, we used UMLS semantic types to extract several types of new information, including problems, medications, and laboratory information. Automatically identified new information highly correlated with manual reference standard annotations. Methods to identify different types of new information can potentially help to build up more robust information extraction systems for clinical researchers as well as aid clinicians and researchers in navigating clinical notes more effectively and quickly identify information pertaining to changes in health states. PMID:25717418
Indexing method of digital audiovisual medical resources with semantic Web integration.

PubMed

Cuggia, Marc; Mougin, Fleur; Le Beux, Pierre

2003-01-01

Digitalization of audio-visual resources combined with the performances of the networks offer many possibilities which are the subject of intensive work in the scientific and industrial sectors. Indexing such resources is a major challenge. Recently, the Motion Pictures Expert Group (MPEG) has been developing MPEG-7, a standard for describing multimedia content. The good of this standard is to develop a rich set of standardized tools to enable fast efficient retrieval from digital archives or filtering audiovisual broadcasts on the internet. How this kind of technologies could be used in the medical context? In this paper, we propose a simpler indexing system, based on Dublin Core standard and complaint to MPEG-7. We use MeSH and UMLS to introduce conceptual navigation. We also present a video-platform with enables to encode and give access to audio-visual resources in streaming mode.
Data-Flow Based Model Analysis

NASA Technical Reports Server (NTRS)

Saad, Christian; Bauer, Bernhard

2010-01-01

The concept of (meta) modeling combines an intuitive way of formalizing the structure of an application domain with a high expressiveness that makes it suitable for a wide variety of use cases and has therefore become an integral part of many areas in computer science. While the definition of modeling languages through the use of meta models, e.g. in Unified Modeling Language (UML), is a well-understood process, their validation and the extraction of behavioral information is still a challenge. In this paper we present a novel approach for dynamic model analysis along with several fields of application. Examining the propagation of information along the edges and nodes of the model graph allows to extend and simplify the definition of semantic constraints in comparison to the capabilities offered by e.g. the Object Constraint Language. Performing a flow-based analysis also enables the simulation of dynamic behavior, thus providing an "abstract interpretation"-like analysis method for the modeling domain.
A UML profile for framework modeling.

PubMed

Xu, Xiao-liang; Wang, Le-yu; Zhou, Hong

2004-01-01

The current standard Unified Modeling Language(UML) could not model framework flexibility and extendability adequately due to lack of appropriate constructs to distinguish framework hot-spots from kernel elements. A new UML profile that may customize UML for framework modeling was presented using the extension mechanisms of UML, providing a group of UML extensions to meet the needs of framework modeling. In this profile, the extended class diagrams and sequence diagrams were defined to straightforwardly identify the hot-spots and describe their instantiation restrictions. A transformation model based on design patterns was also put forward, such that the profile based framework design diagrams could be automatically mapped to the corresponding implementation diagrams. It was proved that the presented profile makes framework modeling more straightforwardly and therefore easier to understand and instantiate.
Biomedical Terminology Mapper for UML projects.

PubMed

Thibault, Julien C; Frey, Lewis

2013-01-01

As the biomedical community collects and generates more and more data, the need to describe these datasets for exchange and interoperability becomes crucial. This paper presents a mapping algorithm that can help developers expose local implementations described with UML through standard terminologies. The input UML class or attribute name is first normalized and tokenized, then lookups in a UMLS-based dictionary are performed. For the evaluation of the algorithm 142 UML projects were extracted from caGrid and automatically mapped to National Cancer Institute (NCI) terminology concepts. Resulting mappings at the UML class and attribute levels were compared to the manually curated annotations provided in caGrid. Results are promising and show that this type of algorithm could speed-up the tedious process of mapping local implementations to standard biomedical terminologies.
Biomedical Terminology Mapper for UML projects

PubMed Central

Thibault, Julien C.; Frey, Lewis

As the biomedical community collects and generates more and more data, the need to describe these datasets for exchange and interoperability becomes crucial. This paper presents a mapping algorithm that can help developers expose local implementations described with UML through standard terminologies. The input UML class or attribute name is first normalized and tokenized, then lookups in a UMLS-based dictionary are performed. For the evaluation of the algorithm 142 UML projects were extracted from caGrid and automatically mapped to National Cancer Institute (NCI) terminology concepts. Resulting mappings at the UML class and attribute levels were compared to the manually curated annotations provided in caGrid. Results are promising and show that this type of algorithm could speed-up the tedious process of mapping local implementations to standard biomedical terminologies. PMID:24303278
SSBRP Communication & Data System Development using the Unified Modeling Language (UML)

NASA Technical Reports Server (NTRS)

Windrem, May; Picinich, Lou; Givens, John J. (Technical Monitor)

1998-01-01

The Unified Modeling Language (UML) is the standard method for specifying, visualizing, and documenting the artifacts of an object-oriented system under development. UML is the unification of the object-oriented methods developed by Grady Booch and James Rumbaugh, and of the Use Case Model developed by Ivar Jacobson. This paper discusses the application of UML by the Communications and Data Systems (CDS) team to model the ground control and command of the Space Station Biological Research Project (SSBRP) User Operations Facility (UOF). UML is used to define the context of the system, the logical static structure, the life history of objects, and the interactions among objects.
The Development of Clinical Document Standards for Semantic Interoperability in China

PubMed Central

Yang, Peng; Pan, Feng; Wan, Yi; Tu, Haibo; Tang, Xuejun; Hu, Jianping

2011-01-01

Objectives This study is aimed at developing a set of data groups (DGs) to be employed as reusable building blocks for the construction of the eight most common clinical documents used in China's general hospitals in order to achieve their structural and semantic standardization. Methods The Diagnostics knowledge framework, the related approaches taken from the Health Level Seven (HL7), the Integrating the Healthcare Enterprise (IHE), and the Healthcare Information Technology Standards Panel (HITSP) and 1,487 original clinical records were considered together to form the DG architecture and data sets. The internal structure, content, and semantics of each DG were then defined by mapping each DG data set to a corresponding Clinical Document Architecture data element and matching each DG data set to the metadata in the Chinese National Health Data Dictionary. By using the DGs as reusable building blocks, standardized structures and semantics regarding the clinical documents for semantic interoperability were able to be constructed. Results Altogether, 5 header DGs, 48 section DGs, and 17 entry DGs were developed. Several issues regarding the DGs, including their internal structure, identifiers, data set names, definitions, length and format, data types, and value sets, were further defined. Standardized structures and semantics regarding the eight clinical documents were structured by the DGs. Conclusions This approach of constructing clinical document standards using DGs is a feasible standard-driven solution useful in preparing documents possessing semantic interoperability among the disparate information systems in China. These standards need to be validated and refined through further study. PMID:22259722
Decomposing Animacy Reversals between Agents and Experiencers: An ERP Study

ERIC Educational Resources Information Center

Bourguignon, Nicolas; Drury, John E.; Valois, Daniel; Steinhauer, Karsten

2012-01-01

The present study aimed to refine current hypotheses regarding thematic reversal anomalies, which have been found to elicit either N400 or--more frequently--"semantic-P600" (sP600) effects. Our goal was to investigate whether distinct ERP profiles reflect aspectual-thematic differences between Agent-Subject Verbs (ASVs; e.g., "to eat") and…
Proving refinement transformations using extended denotational semantics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Winter, V.L.; Boyle, J.M.

1996-04-01

TAMPR is a fully automatic transformation system based on syntactic rewrites. Our approach in a correctness proof is to map the transformation into an axiomatized mathematical domain where formal (and automated) reasoning can be performed. This mapping is accomplished via an extended denotational semantic paradigm. In this approach, the abstract notion of a program state is distributed between an environment function and a store function. Such a distribution introduces properties that go beyond the abstract state that is being modeled. The reasoning framework needs to be aware of these properties in order to successfully complete a correctness proof. This papermore » discusses some of our experiences in proving the correctness of TAMPR transformations.« less
Piecewise synonyms for enhanced UMLS source terminology integration.

PubMed

Huang, Kuo-Chuan; Geller, James; Halper, Michael; Cimino, James J

2007-10-11

The UMLS contains more than 100 source vocabularies and is growing via the integration of others. When integrating a new source, the source terms already in the UMLS must first be found. The easiest approach to this is simple string matching. However, string matching usually does not find all concepts that should be found. A new methodology, based on the notion of piecewise synonyms, for enhancing the process of concept discovery in the UMLS is presented. This methodology is supported by first creating a general synonym dictionary based on the UMLS. Each multi-word source term is decomposed into its component words, allowing for the generation of separate synonyms for each word from the general synonym dictionary. The recombination of these synonyms into new terms creates an expanded pool of matching candidates for terms from the source. The methodology is demonstrated with respect to an existing UMLS source. It shows a 34% improvement over simple string matching.
Does serum CA125 have clinical value for follow-up monitoring of postoperative patients with epithelial ovarian cancer? Results of a 12-year study.

PubMed

Guo, Na; Peng, Zhilan

2017-03-11

The detection of CA125 has been used in the follow up of ovarian cancer. At present, some scholars believe that serum CA125 has no clinical value for the follow-up monitoring the recurrence for postoperative patients with epithelial ovarian cancer, but in our clinical follow-up found that when the serum CA125 value is <35 U/ml, postoperative patients of epithelial ovarian carcinoma had already showed recurrent lesions in some ecological and imaging examinations or in laparotomy exploration and biopsy, and we given the patients timely treatment, the prognosis were improved. Retrospective analysis the values of serum CA125 of 342 postoperative patients of epithelial ovarian carcinoma, consisting of 296 non-recurrent and 46 recurrent cases, as well as 3175 cases of menopausal women and 603 cases of postoperative patients of gynecological malignant tumor for the follow-up from January 2005 to December 2016. The median value of CA125 for non-recurrent patients of epithelial ovarian carcinoma is 8.9 U/ml, the median value of CA125 for non-recurrent patients of epithelial ovarian carcinoma is 29.7 U/ml; for menopausal women, 8.1 U/ml; and for postoperative patients of gynecological malignant tumor, 7.2 U/ml, whereas the mean ± standard deviation is 9.0 ± 1.9 U/ml, 31.3 ± 16.2U/ml, 8.0 ± 1.1 U/ml, and 6.8 ± 2.1 U/ml, respectively. If the value of the CA125 for postoperative patients of epithelial ovarian carcinoma between 10 and 35 U/ml indicates a relative risk of recurrence. When the value of CA125 is higher than 10 U/ml and continuously increased, need to be vigilant and should be combined with imaging examination (PET-CT). This result may improve the prognosis for recurrent patients because of the early detection of recurrent lesions and early retreatment.
Elevated soluble MUC1 levels and decreased anti-MUC1 antibody levels in patients with multiple myeloma.

PubMed

Treon, S P; Maimonis, P; Bua, D; Young, G; Raje, N; Mollick, J; Chauhan, D; Tai, Y T; Hideshima, T; Shima, Y; Hilgers, J; von Mensdorff-Pouilly, S; Belch, A R; Pilarski, L M; Anderson, K C

2000-11-01

Soluble MUC1 (sMUC1) levels are elevated in many MUC1(+) cancers. We and others have shown that MUC1 is expressed on multiple myeloma (MM) plasma cells and B cells. In this study, we measured sMUC1 levels in bone marrow (BM) plasma from 71 MM patients and 21 healthy donors (HDs), and in peripheral blood (PB) plasma from 42 MM patients and 13 HDs using an immunoassay that detects the CA27.29 epitope of MUC1. sMUC1 levels were found to be significantly greater (mean 31.76 U/mL, range 5.69 to 142.48 U/mL) in MM patient BM plasma versus HD BM plasma (mean 9.68 U/mL, range 0.65 to 39.83 U/mL) (P <. 001). Importantly, BM plasma sMUC1 levels were related to tumor burden because sMUC1 levels were significantly higher for MM patients with active disease (34.62 U/mL, range 5.69 to 142.48 U/mL) versus MM patients with minimal residual disease (16.16 U/mL, range 5.7 to 56.68 U/mL) (P =.0026). sMUC1 levels were also elevated in the PB plasma of MM patients (32.79 U/mL, range 4.15 to 148.84 U/mL) versus HDs (18.47 U/mL, range 8.84 to 42.49) (P =.0052). Lastly, circulating immunglobulin M (IgM) and IgG antibodies to MUC1 were measured in 114 MM patients and 31 HDs, because natural antibodies to MUC1 have been detected in patients with other MUC1-bearing malignancies. These studies demonstrated lower levels of circulating IgM (P <.001) and IgG (P =.078) antibodies to MUC1 in MM patients compared with HDs. Our data therefore show that in MM patients, sMUC1 levels are elevated and correlate with disease burden, whereas anti-MUC1 antibody levels are decreased.
Representing clinical guidelines in UMl: a comparative study.

PubMed

Hederman, Lucy; Smutek, Daniel; Wade, Vincent; Knape, Thomas

2002-01-01

Clinical guidelines can be represented using models, such as GLIF, specifically designed for healthcare guidelines. This paper demonstrates that they can also be modelled using a mainstream business modelling language such as UML. The paper presents a guideline in GLIF and as UML activity diagrams, and then presents a mapping of GLIF primitives to UML. The potential benefits of using a mainstream modelling language are outlined. These include availability of advanced modelling tools, transfer between modelling tools, and automation via business workflow technology.
UML activity diagram swimlanes in logic controller design

NASA Astrophysics Data System (ADS)

Grobelny, Michał; Grobelna, Iwona

2015-12-01

Logic controller behavior can be specified using various techniques, including UML activity diagrams and control Petri nets. Each technique has its advantages and disadvantages. Application of both specification types in one project allows to take benefits from both of them. Additional elements of UML models make it possible to divide a specification into some parts, considered from other point of view (logic controller, user or system). The paper introduces an idea to use UML activity diagrams with swimlanes to increase the understandability of design models.
Use of Unified Modeling Language (UML) in Model-Based Development (MBD) For Safety-Critical Applications

DTIC Science & Technology

2014-12-01

appears that UML is becoming the de facto MBD language. OMG® states the following on the MDA® FAQ page: “Although not formally required [for MBD], UML...a known limitation [42], so UML users should plan accordingly, especially for safety-critical programs. For example, “models are not used to...description of the MBD tool chain can be produced. That description could be resident in a Plan for Software Aspects of Certification (PSAC) or Software
Characterizing semantic mappings adaptation via biomedical KOS evolution: a case study investigating SNOMED CT and ICD.

PubMed

Dos Reis, Julio Cesar; Pruski, Cédric; Da Silveira, Marcos; Reynaud-Delaître, Chantal

2013-01-01

Mappings established between Knowledge Organization Systems (KOS) increase semantic interoperability between biomedical information systems. However, biomedical knowledge is highly dynamic and changes affecting KOS entities can potentially invalidate part or the totality of existing mappings. Understanding how mappings evolve and what the impacts of KOS evolution on mappings are is therefore crucial for the definition of an automatic approach to maintain mappings valid and up-to-date over time. In this article, we study variations of a specific KOS complex change (split) for two biomedical KOS (SNOMED CT and ICD-9-CM) through a rigorous method of investigation for identifying and refining complex changes, and for selecting representative cases. We empirically analyze and explain their influence on the evolution of associated mappings. Results point out the importance of considering various dimensions of the information described in KOS, like the semantic structure of concepts, the set of relevant information used to define the mappings and the change operations interfering with this set of information.

Characterizing Semantic Mappings Adaptation via Biomedical KOS Evolution: A Case Study Investigating SNOMED CT and ICD

PubMed Central

Reis, Julio Cesar Dos; Pruski, Cédric; Da Silveira, Marcos; Reynaud-Delaître, Chantal

2013-01-01

Mappings established between Knowledge Organization Systems (KOS) increase semantic interoperability between biomedical information systems. However, biomedical knowledge is highly dynamic and changes affecting KOS entities can potentially invalidate part or the totality of existing mappings. Understanding how mappings evolve and what the impacts of KOS evolution on mappings are is therefore crucial for the definition of an automatic approach to maintain mappings valid and up-to-date over time. In this article, we study variations of a specific KOS complex change (split) for two biomedical KOS (SNOMED CT and ICD-9-CM) through a rigorous method of investigation for identifying and refining complex changes, and for selecting representative cases. We empirically analyze and explain their influence on the evolution of associated mappings. Results point out the importance of considering various dimensions of the information described in KOS, like the semantic structure of concepts, the set of relevant information used to define the mappings and the change operations interfering with this set of information. PMID:24551341
Uncompacted myelin lamellae in peripheral nerve biopsy.

PubMed

Vital, Claude; Vital, Anne; Bouillot, Sandrine; Favereaux, Alexandre; Lagueny, Alain; Ferrer, Xavier; Brechenmacher, Christiane; Petry, Klaus G

2003-01-01

Since 1979, the authors have studied 49 peripheral nerve biopsies presenting uncompacted myelin lamellae (UML). Based on the ultrastructural pattern of UML they propose a 3-category classification. The first category includes cases displaying regular UML, which was observed in 43 cases; it was more frequent in 9 cases with polyneuropathy organomegaly endocrinopathy m-protein skin changes (POEMS) syndrome as well as in 1 case of Charcot-Marie-Tooth 1B with a novel point mutation in the P0 gene. The second category consists of cases showing irregular UML, observed in 4 cases with IgM monoclonal gammopathy and anti-myelin-associated glycoprotein (MAG) activity. This group included 1 benign case and 3 B-cell malignant lymphomas. The third category is complex UML, which was present in 2 unrelated patients with an Arg 98 His missense mutation in the P0 protein gene. Irregular and complex UML are respectively related to MAG and P0, which play a crucial role in myelin lamellae compaction and adhesion.
Hierarchical layered and semantic-based image segmentation using ergodicity map

NASA Astrophysics Data System (ADS)

Yadegar, Jacob; Liu, Xiaoqing

2010-04-01

Image segmentation plays a foundational role in image understanding and computer vision. Although great strides have been made and progress achieved on automatic/semi-automatic image segmentation algorithms, designing a generic, robust, and efficient image segmentation algorithm is still challenging. Human vision is still far superior compared to computer vision, especially in interpreting semantic meanings/objects in images. We present a hierarchical/layered semantic image segmentation algorithm that can automatically and efficiently segment images into hierarchical layered/multi-scaled semantic regions/objects with contextual topological relationships. The proposed algorithm bridges the gap between high-level semantics and low-level visual features/cues (such as color, intensity, edge, etc.) through utilizing a layered/hierarchical ergodicity map, where ergodicity is computed based on a space filling fractal concept and used as a region dissimilarity measurement. The algorithm applies a highly scalable, efficient, and adaptive Peano- Cesaro triangulation/tiling technique to decompose the given image into a set of similar/homogenous regions based on low-level visual cues in a top-down manner. The layered/hierarchical ergodicity map is built through a bottom-up region dissimilarity analysis. The recursive fractal sweep associated with the Peano-Cesaro triangulation provides efficient local multi-resolution refinement to any level of detail. The generated binary decomposition tree also provides efficient neighbor retrieval mechanisms for contextual topological object/region relationship generation. Experiments have been conducted within the maritime image environment where the segmented layered semantic objects include the basic level objects (i.e. sky/land/water) and deeper level objects in the sky/land/water surfaces. Experimental results demonstrate the proposed algorithm has the capability to robustly and efficiently segment images into layered semantic objects/regions with contextual topological relationships.
The Semantic Web: From Representation to Realization

NASA Astrophysics Data System (ADS)

Thórisson, Kristinn R.; Spivack, Nova; Wissner, James M.

A semantically-linked web of electronic information - the Semantic Web - promises numerous benefits including increased precision in automated information sorting, searching, organizing and summarizing. Realizing this requires significantly more reliable meta-information than is readily available today. It also requires a better way to represent information that supports unified management of diverse data and diverse Manipulation methods: from basic keywords to various types of artificial intelligence, to the highest level of intelligent manipulation - the human mind. How this is best done is far from obvious. Relying solely on hand-crafted annotation and ontologies, or solely on artificial intelligence techniques, seems less likely for success than a combination of the two. In this paper describe an integrated, complete solution to these challenges that has already been implemented and tested with hundreds of thousands of users. It is based on an ontological representational level we call SemCards that combines ontological rigour with flexible user interface constructs. SemCards are machine- and human-readable digital entities that allow non-experts to create and use semantic content, while empowering machines to better assist and participate in the process. SemCards enable users to easily create semantically-grounded data that in turn acts as examples for automation processes, creating a positive iterative feedback loop of metadata creation and refinement between user and machine. They provide a holistic solution to the Semantic Web, supporting powerful management of the full lifecycle of data, including its creation, retrieval, classification, sorting and sharing. We have implemented the SemCard technology on the semantic Web site Twine.com, showing that the technology is indeed versatile and scalable. Here we present the key ideas behind SemCards and describe the initial implementation of the technology.
Improved production of an enzyme that hydrolyses raw yam starch by Penicillium sp. S-22 using fed-batch fermentation.

PubMed

Sun, Hai-Yan; Ge, Xiang-Yang; Zhang, Wei-Guo

2006-11-01

A newly isolated strain, Penicillium sp. S-22, was used to produce an enzyme that hydrolyses raw yam starch [raw yam starch digesting enzyme (RYSDE)]. The enzyme activity and overall enzyme productivity were respectively 16 U/ml and 0.19 U/ml h in the batch culture. The enzyme activity increased to 85 U/ml by feeding of partially hydrolyzed raw yam starch. When a mixture containing partially hydrolyzed raw yam starch and peptone was fed by a pH-stat strategy, the enzyme activity reached 366 U/ml, 23-fold of that obtained in the batch culture, and the overall productivity reached 3.4 U/ml h, which was 18-fold of that in the batch culture.
The application of the unified modeling language in object-oriented analysis of healthcare information systems.

PubMed

Aggarwal, Vinod

2002-10-01

This paper concerns itself with the beneficial effects of the Unified Modeling Language (UML), a nonproprietary object modeling standard, in specifying, visualizing, constructing, documenting, and communicating the model of a healthcare information system from the user's perspective. The author outlines the process of object-oriented analysis (OOA) using the UML and illustrates this with healthcare examples to demonstrate the practicality of application of the UML by healthcare personnel to real-world information system problems. The UML will accelerate advanced uses of object-orientation such as reuse technology, resulting in significantly higher software productivity. The UML is also applicable in the context of a component paradigm that promises to enhance the capabilities of healthcare information systems and simplify their management and maintenance.
An Infrastructure for UML-Based Code Generation Tools

NASA Astrophysics Data System (ADS)

Wehrmeister, Marco A.; Freitas, Edison P.; Pereira, Carlos E.

The use of Model-Driven Engineering (MDE) techniques in the domain of distributed embedded real-time systems are gain importance in order to cope with the increasing design complexity of such systems. This paper discusses an infrastructure created to build GenERTiCA, a flexible tool that supports a MDE approach, which uses aspect-oriented concepts to handle non-functional requirements from embedded and real-time systems domain. GenERTiCA generates source code from UML models, and also performs weaving of aspects, which have been specified within the UML model. Additionally, this paper discusses the Distributed Embedded Real-Time Compact Specification (DERCS), a PIM created to support UML-based code generation tools. Some heuristics to transform UML models into DERCS, which have been implemented in GenERTiCA, are also discussed.
Tracing the Rationale Behind UML Model Change Through Argumentation

NASA Astrophysics Data System (ADS)

Jureta, Ivan J.; Faulkner, Stéphane

Neglecting traceability—i.e., the ability to describe and follow the life of a requirement—is known to entail misunderstanding and miscommunication, leading to the engineering of poor quality systems. Following the simple principles that (a) changes to UML model instances ought be justified to the stakeholders, (b) justification should proceed in a structured manner to ensure rigor in discussions, critique, and revisions of model instances, and (c) the concept of argument instantiated in a justification process ought to be well defined and understood, the present paper introduces the UML Traceability through Argumentation Method (UML-TAM) to enable the traceability of design rationale in UML while allowing the appropriateness of model changes to be checked by analysis of the structure of the arguments provided to justify such changes.
Authoring and verification of clinical guidelines: a model driven approach.

PubMed

Pérez, Beatriz; Porres, Ivan

2010-08-01

The goal of this research is to provide a framework to enable authoring and verification of clinical guidelines. The framework is part of a larger research project aimed at improving the representation, quality and application of clinical guidelines in daily clinical practice. The verification process of a guideline is based on (1) model checking techniques to verify guidelines against semantic errors and inconsistencies in their definition, (2) combined with Model Driven Development (MDD) techniques, which enable us to automatically process manually created guideline specifications and temporal-logic statements to be checked and verified regarding these specifications, making the verification process faster and cost-effective. Particularly, we use UML statecharts to represent the dynamics of guidelines and, based on this manually defined guideline specifications, we use a MDD-based tool chain to automatically process them to generate the input model of a model checker. The model checker takes the resulted model together with the specific guideline requirements, and verifies whether the guideline fulfils such properties. The overall framework has been implemented as an Eclipse plug-in named GBDSSGenerator which, particularly, starting from the UML statechart representing a guideline, allows the verification of the guideline against specific requirements. Additionally, we have established a pattern-based approach for defining commonly occurring types of requirements in guidelines. We have successfully validated our overall approach by verifying properties in different clinical guidelines resulting in the detection of some inconsistencies in their definition. The proposed framework allows (1) the authoring and (2) the verification of clinical guidelines against specific requirements defined based on a set of property specification patterns, enabling non-experts to easily write formal specifications and thus easing the verification process. Copyright 2010 Elsevier Inc. All rights reserved.
A Game-Theoretic Approach to Branching Time Abstract-Check-Refine Process

NASA Technical Reports Server (NTRS)

Wang, Yi; Tamai, Tetsuo

2009-01-01

Since the complexity of software systems continues to grow, most engineers face two serious problems: the state space explosion problem and the problem of how to debug systems. In this paper, we propose a game-theoretic approach to full branching time model checking on three-valued semantics. The three-valued models and logics provide successful abstraction that overcomes the state space explosion problem. The game style model checking that generates counter-examples can guide refinement or identify validated formulas, which solves the system debugging problem. Furthermore, output of our game style method will give significant information to engineers in detecting where errors have occurred and what the causes of the errors are.
Semantic Importance Sampling for Statistical Model Checking

DTIC Science & Technology

2014-10-18

we implement SIS in a tool called osmosis and use it to verify a number of stochastic systems with rare events. Our results indicate that SIS reduces...background definitions and concepts. Section 4 presents SIS, and Section 5 presents our tool osmosis . In Section 6, we present our experiments and results...Syntactic Extraction ∗( ) dReal + Refinement ∗ |∗| , Monte-Carlo , Fig. 5. Architecture of osmosis
Coverage criteria for test case generation using UML state chart diagram

NASA Astrophysics Data System (ADS)

Salman, Yasir Dawood; Hashim, Nor Laily; Rejab, Mawarny Md; Romli, Rohaida; Mohd, Haslina

2017-10-01

To improve the effectiveness of test data generation during the software test, many studies have focused on the automation of test data generation from UML diagrams. One of these diagrams is the UML state chart diagram. Test cases are generally evaluated according to coverage criteria. However, combinations of multiple criteria are required to achieve better coverage. Different studies used various number and types of coverage criteria in their methods and approaches. The objective of this paper to propose suitable coverage criteria for test case generation using UML state chart diagram especially in handling loops. In order to achieve this objective, this work reviewed previous studies to present the most practical coverage criteria combinations, including all-states, all-transitions, all-transition-pairs, and all-loop-free-paths coverage. Calculation to determine the coverage percentage of the proposed coverage criteria were presented together with an example has they are applied on a UML state chart diagram. This finding would be beneficial in the area of test case generating especially in handling loops in UML state chart diagram.
Characterization of protease from bacillus sp. on medium containing FeCl3 exposed to magnetic field 0.2 mt

NASA Astrophysics Data System (ADS)

Sumardi; Agustrina, Rochmah; Nugroho Ekowati, Christina; Selvie Pasaribu, Yovita

2018-03-01

This purpose of this research is to determine the character of the protease enzymes from Bacillus sp. on media content of FeCl3 exposed to 0.2 mT magnetic field. The data obtained were analyzed descriptively. The result showed that protease enzyme without Fe resulted in the highest activity at pH 8, temperature. 30°C with the addition of activator Mn2+, and Vmax of 0.28 U/ml, and Km of 4.60 U/ml. The protease enzyme on media without magnetic field exposure and containing Fe yielded the highest activity at pH 8, temperature 30°C with the addition of activator Mn2+, and Vmax of 0.33 U/ml, and Km of 5.64 U/ml. The protease enzyme on medium with magnetic field exposure and use Fe as inductors have the highest activity at pH 9, the temperature of 55° C with the addition of activator Mn2+, and Vmax of 0.35 U/ml, and Km 10.04 U/ml.
A Collaborative Support Approach on UML Sequence Diagrams for Aspect-Oriented Software

NASA Astrophysics Data System (ADS)

de Almeida Naufal, Rafael; Silveira, Fábio F.; Guerra, Eduardo M.

AOP and its broader application on software projects brings the importance to provide the separation between aspects and OO components at design time, to leverage the understanding of AO systems, promote aspects' reuse and obtain the benefits of AO modularization. Since the UML is a standard for modeling OO systems, it can be applied to model the decoupling between aspects and OO components. The application of UML to this area is the subject of constant study and is the focus of this paper. In this paper it is presented an extension based on the default UML meta-model, named MIMECORA-DS, to show object-object, object-aspect and aspect-aspect interactions applying the UML's sequence diagram. This research also presents the application of MIMECORA-DS in a case example, to assess its applicability.
The Use of UML for Software Requirements Expression and Management

NASA Technical Reports Server (NTRS)

Murray, Alex; Clark, Ken

2015-01-01

It is common practice to write English-language "shall" statements to embody detailed software requirements in aerospace software applications. This paper explores the use of the UML language as a replacement for the English language for this purpose. Among the advantages offered by the Unified Modeling Language (UML) is a high degree of clarity and precision in the expression of domain concepts as well as architecture and design. Can this quality of UML be exploited for the definition of software requirements? While expressing logical behavior, interface characteristics, timeliness constraints, and other constraints on software using UML is commonly done and relatively straight-forward, achieving the additional aspects of the expression and management of software requirements that stakeholders expect, especially traceability, is far less so. These other characteristics, concerned with auditing and quality control, include the ability to trace a requirement to a parent requirement (which may well be an English "shall" statement), to trace a requirement to verification activities or scenarios which verify that requirement, and to trace a requirement to elements of the software design which implement that requirement. UML Use Cases, designed for capturing requirements, have not always been satisfactory. Some applications of them simply use the Use Case model element as a repository for English requirement statements. Other applications of Use Cases, in which Use Cases are incorporated into behavioral diagrams that successfully communicate the behaviors and constraints required of the software, do indeed take advantage of UML's clarity, but not in ways that support the traceability features mentioned above. Our approach uses the Stereotype construct of UML to precisely identify elements of UML constructs, especially behaviors such as State Machines and Activities, as requirements, and also to achieve the necessary mapping capabilities. We describe this approach in the context of a space-based software application currently under development at the Jet Propulsion Laboratory.
[Analysis of health terminologies for use as ontologies in healthcare information systems].

PubMed

Romá-Ferri, Maria Teresa; Palomar, Manuel

2008-01-01

Ontologies are a resource that allow the concept of meaning to be represented informatically, thus avoiding the limitations imposed by standardized terms. The objective of this study was to establish the extent to which terminologies could be used for the design of ontologies, which could be serve as an aid to resolve problems such as semantic interoperability and knowledge reusability in healthcare information systems. To determine the extent to which terminologies could be used as ontologies, six of the most important terminologies in clinical, epidemiologic, documentation and administrative-economic contexts were analyzed. The following characteristics were verified: conceptual coverage, hierarchical structure, conceptual granularity of the categories, conceptual relations, and the language used for conceptual representation. MeSH, DeCS and UMLS ontologies were considered lightweight. The main differences among these ontologies concern conceptual specification, the types of relation and the restrictions among the associated concepts. SNOMED and GALEN ontologies have declaratory formalism, based on logical descriptions. These ontologies include explicit qualities and show greater restrictions among associated concepts and rule combinations and were consequently considered as heavyweight. Analysis of the declared representation of the terminologies shows the extent to which they could be reused as ontologies. Their degree of usability depends on whether the aim is for healthcare information systems to solve problems of semantic interoperability (lightweight ontologies) or to reuse the systems' knowledge as an aid to decision making (heavyweight ontologies) and for non-structured information retrieval, extraction, and classification.
Conceptual Modeling in the Time of the Revolution: Part II

NASA Astrophysics Data System (ADS)

Mylopoulos, John

Conceptual Modeling was a marginal research topic at the very fringes of Computer Science in the 60s and 70s, when the discipline was dominated by topics focusing on programs, systems and hardware architectures. Over the years, however, the field has moved to centre stage and has come to claim a central role both in Computer Science research and practice in diverse areas, such as Software Engineering, Databases, Information Systems, the Semantic Web, Business Process Management, Service-Oriented Computing, Multi-Agent Systems, Knowledge Management, and more. The transformation was greatly aided by the adoption of standards in modeling languages (e.g., UML), and model-based methodologies (e.g., Model-Driven Architectures) by the Object Management Group (OMG) and other standards organizations. We briefly review the history of the field over the past 40 years, focusing on the evolution of key ideas. We then note some open challenges and report on-going research, covering topics such as the representation of variability in conceptual models, capturing model intentions, and models of laws.
[Research on tumor information grid framework].

PubMed

Zhang, Haowei; Qin, Zhu; Liu, Ying; Tan, Jianghao; Cao, Haitao; Chen, Youping; Zhang, Ke; Ding, Yuqing

2013-10-01

In order to realize tumor disease information sharing and unified management, we utilized grid technology to make the data and software resources which distributed in various medical institutions for effective integration so that we could make the heterogeneous resources consistent and interoperable in both semantics and syntax aspects. This article describes the tumor grid framework, the type of the service being packaged in Web Service Description Language (WSDL) and extensible markup language schemas definition (XSD), the client use the serialized document to operate the distributed resources. The service objects could be built by Unified Modeling Language (UML) as middle ware to create application programming interface. All of the grid resources are registered in the index and released in the form of Web Services based on Web Services Resource Framework (WSRF). Using the system we can build a multi-center, large sample and networking tumor disease resource sharing framework to improve the level of development in medical scientific research institutions and the patient's quality of life.
Toward a Model-Based Approach for Flight System Fault Protection

NASA Technical Reports Server (NTRS)

Day, John; Meakin, Peter; Murray, Alex

2012-01-01

Use SysML/UML to describe the physical structure of the system This part of the model would be shared with other teams - FS Systems Engineering, Planning & Execution, V&V, Operations, etc., in an integrated model-based engineering environment Use the UML Profile mechanism, defining Stereotypes to precisely express the concepts of the FP domain This extends the UML/SysML languages to contain our FP concepts Use UML/SysML, along with our profile, to capture FP concepts and relationships in the model Generate typical FP engineering products (the FMECA, Fault Tree, MRD, V&V Matrices)
Development of the Plate Tectonics and Seismology markup languages with XML

NASA Astrophysics Data System (ADS)

Babaie, H.; Babaei, A.

2003-04-01

The Extensible Markup Language (XML) and its specifications such as the XSD Schema, allow geologists to design discipline-specific vocabularies such as Seismology Markup Language (SeismML) or Plate Tectonics Markup Language (TectML). These languages make it possible to store and interchange structured geological information over the Web. Development of a geological markup language requires mapping geological concepts, such as "Earthquake" or "Plate" into a UML object model, applying a modeling and design environment. We have selected four inter-related geological concepts: earthquake, fault, plate, and orogeny, and developed four XML Schema Definitions (XSD), that define the relationships, cardinalities, hierarchies, and semantics of these concepts. In such a geological concept model, the UML object "Earthquake" is related to one or more "Wave" objects, each arriving to a seismic station at a specific "DateTime", and relating to a specific "Epicenter" object that lies at a unique "Location". The "Earthquake" object occurs along a "Segment" of a "Fault" object, which is related to a specific "Plate" object. The "Fault" has its own associations with such things as "Bend", "Step", and "Segment", and could be of any kind (e.g., "Thrust", "Transform'). The "Plate" is related to many other objects such as "MOR", "Subduction", and "Forearc", and is associated with an "Orogeny" object that relates to "Deformation" and "Strain" and several other objects. These UML objects were mapped into XML Metadata Interchange (XMI) formats, which were then converted into four XSD Schemas. The schemas were used to create and validate the XML instance documents, and to create a relational database hosting the plate tectonics and seismological data in the Microsoft Access format. The SeismML and TectML allow seismologists and structural geologists, among others, to submit and retrieve structured geological data on the Internet. A seismologist, for example, can submit peer-reviewed and reliable data about a specific earthquake to a Java Server Page on our web site hosting the XML application. Other geologists can readily retrieve the submitted data, saved in files or special tables of the designed database, through a search engine designed with J2EE (JSP, servlet, Java Bean) and XML specifications such as XPath, XPointer, and XSLT. When extended to include all the important concepts of seismology and plate tectonics, the two markup languages will make global interchange of geological data a reality.

Nutrient utilisation and particulate organic matter changes during summer in the upper mixed layer (Ross Sea, Antarctica)

NASA Astrophysics Data System (ADS)

Catalano, G.; Povero, P.; Fabiano, M.; Benedetti, F.; Goffart, A.

1997-01-01

The relationships among vertical stability, estimated nutrient utilisation and particulate organic matter in the Ross Sea are analysed from data collected during two cruises in the summers of 1987-1988 and 1989-1990. In the upper mixed layer (UML), identified through the vertical stability E( Z(UML)), nutrient consumption is calculated as the difference between the "diluted" nutrient value and the mean calculated from the integrated value in the UML. The nutrient utilisation ratio and E( Z(UML)) are linearly related for E( Z(UML))≤25, whereas for values > 25, the distribution pattern is more scattered and independent of E( Z(UML)). For E( Z(UML))≥25, utilisation values were ≥4, 0.4 and 10 mmol m -3 for nitrate, phosphate and silicate, respectively. Significant relationships between nutrient depletion and both particulate organic carbon (POC) and particulate protein/particulate carbohydrate ratios (PPRT/PCHO) are found. The analysis of particulate matter distribution vs nutrient utilisation shows that the stations could be divided into two groups having different characteristics. The first group includes coastal stations, where high nutrient utilisation, POC and PPRT/PCHO are typical of areas with high production. In the second group (pelagic stations), nutrient utilisation, POC and PPRT/PCHO are lower. The vertical stability can be used to discriminate among the factors that influence primary production.
CoLeMo: A Collaborative Learning Environment for UML Modelling

ERIC Educational Resources Information Center

Chen, Weiqin; Pedersen, Roger Heggernes; Pettersen, Oystein

2006-01-01

This paper presents the design, implementation, and evaluation of a distributed collaborative UML modelling environment, CoLeMo. CoLeMo is designed for students studying UML modelling. It can also be used as a platform for collaborative design of software. We conducted formative evaluations and a summative evaluation to improve the environment and…
Musical emotions: Functions, origins, evolution

NASA Astrophysics Data System (ADS)

Perlovsky, Leonid

2010-03-01

Theories of music origins and the role of musical emotions in the mind are reviewed. Most existing theories contradict each other, and cannot explain mechanisms or roles of musical emotions in workings of the mind, nor evolutionary reasons for music origins. Music seems to be an enigma. Nevertheless, a synthesis of cognitive science and mathematical models of the mind has been proposed describing a fundamental role of music in the functioning and evolution of the mind, consciousness, and cultures. The review considers ancient theories of music as well as contemporary theories advanced by leading authors in this field. It addresses one hypothesis that promises to unify the field and proposes a theory of musical origin based on a fundamental role of music in cognition and evolution of consciousness and culture. We consider a split in the vocalizations of proto-humans into two types: one less emotional and more concretely-semantic, evolving into language, and the other preserving emotional connections along with semantic ambiguity, evolving into music. The proposed hypothesis departs from other theories in considering specific mechanisms of the mind-brain, which required the evolution of music parallel with the evolution of cultures and languages. Arguments are reviewed that the evolution of language toward becoming the semantically powerful tool of today required emancipation from emotional encumbrances. The opposite, no less powerful mechanisms required a compensatory evolution of music toward more differentiated and refined emotionality. The need for refined music in the process of cultural evolution is grounded in fundamental mechanisms of the mind. This is why today's human mind and cultures cannot exist without today's music. The reviewed hypothesis gives a basis for future analysis of why different evolutionary paths of languages were paralleled by different evolutionary paths of music. Approaches toward experimental verification of this hypothesis in psychological and neuroimaging research are reviewed.
A UML approach to process modelling of clinical practice guidelines for enactment.

PubMed

Knape, T; Hederman, L; Wade, V P; Gargan, M; Harris, C; Rahman, Y

2003-01-01

Although clinical practice guidelines (CPGs) have been suggested as a means of encapsulating best practice in evidence-based medical treatment, their usage in clinical environments has been disappointing. Criticisms of guideline representations have been that they are predominantly narrative and are difficult to incorporate into clinical information systems. This paper analyses the use of UML process modelling techniques for guideline representation and proposes the automated generation of executable guidelines using XMI. This hybrid UML-XMI approach provides flexible authoring of guideline decision and control structures whilst integrating appropriate data flow. It also uses an open XMI standard interface to allow the use of authoring tools and process control systems from multiple vendors. The paper first surveys CPG modelling formalisms followed by a brief introduction to process modelling in UMI. Furthermore, the modelling of CPGs in UML is presented leading to a case study of encoding a diabetes mellitus CPG using UML.
A UML Profile for Developing Databases that Conform to the Third Manifesto

NASA Astrophysics Data System (ADS)

Eessaar, Erki

The Third Manifesto (TTM) presents the principles of a relational database language that is free of deficiencies and ambiguities of SQL. There are database management systems that are created according to TTM. Developers need tools that support the development of databases by using these database management systems. UML is a widely used visual modeling language. It provides built-in extension mechanism that makes it possible to extend UML by creating profiles. In this paper, we introduce a UML profile for designing databases that correspond to the rules of TTM. We created the first version of the profile by translating existing profiles of SQL database design. After that, we extended and improved the profile. We implemented the profile by using UML CASE system StarUML™. We present an example of using the new profile. In addition, we describe problems that occurred during the profile development.
A Survey of UML Based Regression Testing

NASA Astrophysics Data System (ADS)

Fahad, Muhammad; Nadeem, Aamer

Regression testing is the process of ensuring software quality by analyzing whether changed parts behave as intended, and unchanged parts are not affected by the modifications. Since it is a costly process, a lot of techniques are proposed in the research literature that suggest testers how to build regression test suite from existing test suite with minimum cost. In this paper, we discuss the advantages and drawbacks of using UML diagrams for regression testing and analyze that UML model helps in identifying changes for regression test selection effectively. We survey the existing UML based regression testing techniques and provide an analysis matrix to give a quick insight into prominent features of the literature work. We discuss the open research issues like managing and reducing the size of regression test suite, prioritization of the test cases that would be helpful during strict schedule and resources that remain to be addressed for UML based regression testing.
Recommending Education Materials for Diabetic Questions Using Information Retrieval Approaches.

PubMed

Zeng, Yuqun; Liu, Xusheng; Wang, Yanshan; Shen, Feichen; Liu, Sijia; Rastegar-Mojarad, Majid; Wang, Liwei; Liu, Hongfang

2017-10-16

Self-management is crucial to diabetes care and providing expert-vetted content for answering patients' questions is crucial in facilitating patient self-management. The aim is to investigate the use of information retrieval techniques in recommending patient education materials for diabetic questions of patients. We compared two retrieval algorithms, one based on Latent Dirichlet Allocation topic modeling (topic modeling-based model) and one based on semantic group (semantic group-based model), with the baseline retrieval models, vector space model (VSM), in recommending diabetic patient education materials to diabetic questions posted on the TuDiabetes forum. The evaluation was based on a gold standard dataset consisting of 50 randomly selected diabetic questions where the relevancy of diabetic education materials to the questions was manually assigned by two experts. The performance was assessed using precision of top-ranked documents. We retrieved 7510 diabetic questions on the forum and 144 diabetic patient educational materials from the patient education database at Mayo Clinic. The mapping rate of words in each corpus mapped to the Unified Medical Language System (UMLS) was significantly different (P<.001). The topic modeling-based model outperformed the other retrieval algorithms. For example, for the top-retrieved document, the precision of the topic modeling-based, semantic group-based, and VSM models was 67.0%, 62.8%, and 54.3%, respectively. This study demonstrated that topic modeling can mitigate the vocabulary difference and it achieved the best performance in recommending education materials for answering patients' questions. One direction for future work is to assess the generalizability of our findings and to extend our study to other disease areas, other patient education material resources, and online forums. ©Yuqun Zeng, Xusheng Liu, Yanshan Wang, Feichen Shen, Sijia Liu, Majid Rastegar Mojarad, Liwei Wang, Hongfang Liu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.10.2017.
Applying Real-Time UML: Real-World Experiences

NASA Astrophysics Data System (ADS)

Cooling, Niall; Pachschwoell, Stefan

2004-06-01

This paper presents Austrian Aerospace's experiences of applying UML for the design of an embedded real-time avionics system based on Feabhas' "Pragma Process". It describes the complete lifecycle from adoption of UML, through training, CASE-tool selection, system analysis, and software design and development of the project itself. It concludes by reflecting on the experiences obtained and some lessons learnt.
No grammatical gender effect on affective ratings: evidence from Italian and German languages.

PubMed

Montefinese, Maria; Ambrosini, Ettore; Roivainen, Eka

2018-06-06

In this study, we tested the linguistic relativity hypothesis by studying the effect of grammatical gender (feminine vs. masculine) on affective judgments of conceptual representation in Italian and German. In particular, we examined the within- and cross-language grammatical gender effect and its interaction with participants' demographic characteristics (such as, the raters' age and sex) on semantic differential scales (affective ratings of valence, arousal and dominance) in Italian and German speakers. We selected the stimuli and the relative affective measures from Italian and German adaptations of the ANEW (Affective Norms for English Words). Bayesian and frequentist analyses yielded evidence for the absence of within- and cross-languages effects of grammatical gender and sex- and age-dependent interactions. These results suggest that grammatical gender does not affect judgments of affective features of semantic representation in Italian and German speakers, since an overt coding of word grammar is not required. Although further research is recommended to refine the impact of the grammatical gender on properties of semantic representation, these results have implications for any strong view of the linguistic relativity hypothesis.
SAFOD Brittle Microstructure and Mechanics Knowledge Base (SAFOD BM2KB)

NASA Astrophysics Data System (ADS)

Babaie, H. A.; Hadizadeh, J.; di Toro, G.; Mair, K.; Kumar, A.

2008-12-01

We have developed a knowledge base to store and present the data collected by a group of investigators studying the microstructures and mechanics of brittle faulting using core samples from the SAFOD (San Andreas Fault Observatory at Depth) project. The investigations are carried out with a variety of analytical and experimental methods primarily to better understand the physics of strain localization in fault gouge. The knowledge base instantiates an specially-designed brittle rock deformation ontology developed at Georgia State University. The inference rules embedded in the semantic web languages, such as OWL, RDF, and RDFS, which are used in our ontology, allow the Pellet reasoner used in this application to derive additional truths about the ontology and knowledge of this domain. Access to the knowledge base is via a public website, which is designed to provide the knowledge acquired by all the investigators involved in the project. The stored data will be products of studies such as: experiments (e.g., high-velocity friction experiment), analyses (e.g., microstructural, chemical, mass transfer, mineralogical, surface, image, texture), microscopy (optical, HRSEM, FESEM, HRTEM]), tomography, porosity measurement, microprobe, and cathodoluminesence. Data about laboratories, experimental conditions, methods, assumptions, equipments, and mechanical properties and lithology of the studied samples will also be presented on the website per investigation. The ontology was modeled applying the UML (Unified Modeling Language) in Rational Rose, and implemented in OWL-DL (Ontology Web Language) using the Protégé ontology editor. The UML model was converted to OWL-DL by first mapping it to Ecore (.ecore) and Generator model (.genmodel) with the help of the EMF (Eclipse Modeling Framework) plugin in Eclipse. The Ecore model was then mapped to a .uml file, which later was converted into an .owl file and subsequently imported into the Protégé ontology editing environment. The web-interface was developed in java using eclipse as the IDE. The web interfaces to query and submit data were implemented applying JSP, servlets, javascript, and AJAX. The Jena API, a Java framework for building Semantic Web applications, was used to develop the web-interface. Jena provided a programmatic environment for RDF, RDFS, OWL, and SPARQL query engine. Building web applications with AJAX helps retrieving data from the server asynchronously in the background without interfering with the display and behavior of the existing page. The application was deployed on an apache tomcat server at GSU. The SAFOD BM2KB website provides user-friendly search, submit, feedback, and other services. The General Search option allows users to search the knowledge base by selecting the classes (e.g., Experiment, Surface Analysis), their respective attributes (e.g., apparatus, date performed), and the relationships to other classes (e.g., Sample, Laboratory). The Search by Sample option allows users to search the knowledge base based on sample number. The Search by Investigator lets users to search the knowledge base by choosing an investigator who is involved in this project. The website also allows users to submit new data. The Submit Data option opens a page where users can submit the SAFOD data to our knowledge base by selecting specific classes and attributes. The submitted data then become available for query as part of the knowledge base. The SAFOD BM2KB can be accessed from the main SAFOD website.
Program Synthesizes UML Sequence Diagrams

NASA Technical Reports Server (NTRS)

Barry, Matthew R.; Osborne, Richard N.

2006-01-01

A computer program called "Rational Sequence" generates Universal Modeling Language (UML) sequence diagrams of a target Java program running on a Java virtual machine (JVM). Rational Sequence thereby performs a reverse engineering function that aids in the design documentation of the target Java program. Whereas previously, the construction of sequence diagrams was a tedious manual process, Rational Sequence generates UML sequence diagrams automatically from the running Java code.
Five Year Results of US Intergroup/RTOG 9704 With Postoperative CA 19-9 {<=}90 U/mL and Comparison to the CONKO-001 Trial

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berger, Adam C., E-mail: adam.berger@jefferson.edu; Winter, Kathryn; Hoffman, John P.

2012-11-01

Purpose: Radiation Therapy Oncology Group (RTOG) trial 9704 was the largest randomized trial to use adjuvant chemoradiation therapy for patients with pancreatic cancer. This report analyzes 5-year survival by serum level of tumor marker CA 19-9 of {<=}90 vs >90 U/mL and compares results to the those of the CONKO-001 trial. Methods and Materials: CA 19-9 expression was analyzed as a dichotomized variable ({<=}90 vs >90 U/mL). Cox proportional hazard models were used to identify the impact of the CA 19-9 value on overall survival (OS). Actuarial estimates of OS were calculated using the Kaplan-Meier method. Results: Both univariate (hazardmore » ratio [HR] = 3.2; 95% confidence interval [CI], 2.3-4.3, P<.0001) and multivariate (HR = 3.1; 95% CI, 2.2-4.2, P<.0001) analyses demonstrated a statistically significant decrease in OS for CA 19-9 serum level of {>=}90 U/mL. For patients in the gemcitabine (Gem) treatment arm with CA 19-9 <90 U/mL, median survival was 21 months. For patients with CA 19-9 {>=}90 U/mL, this number dropped to 10 months. In patients with pancreatic head tumors in the Gem treatment arm with RT quality assurance per protocol and CA 19-9 of <90 U/mL, median survival and 5-year rate were 24 months and 34%. In comparison, the median survival and 5-year OS rate for patients in the Gem arm of the CONKO trial were 22 months and 21%. Conclusions: This analysis demonstrates that patients with postresection CA 19-9 values {>=}90 U/mL had a significantly worse survival. Patients with pancreatic head tumors treated with Gem with CA 19-9 serum level of <90 U/mL and per protocol RT had favorable survival compared to that seen in the CONKO trial. CA 19-9 is a stratification factor for the current RTOG adjuvant pancreas trial (0848).« less
Novel marine actinobacteria from emerald Andaman & Nicobar Islands: a prospective source for industrial and pharmaceutical byproducts

PubMed Central

2013-01-01

Background Andaman and Nicobar Islands situated in the eastern part of Bay of Bengal are one of the distinguished biodiversity hotspot. Even though number of studies carried out on the marine flora and fauna, the studies on actinobacteria from Andaman and Nicobar Islands are meager. The aim of the present study was to screen the actinobacteria for their characterization and identify the potential sources for industrial and pharmaceutical byproducts. Results A total of 26 actinobacterial strains were isolated from the marine sediments collected from various sites of Port Blair Bay where no collection has been characterized previously. Isolates were categorized under the genera: Saccharopolyspora, Streptomyces, Nocardiopsis, Streptoverticillium, Microtetraspora, Actinopolyspora, Actinokineospora and Dactylosporangium. Majority of the isolates were found to produce industrially important enzymes such as amylase, protease, gelatinase, lipase, DNase, cellulase, urease and phosphatase, and also exhibited substantial antibacterial activity against human pathogens. 77% of isolates exhibited significant hemolytic activity. Among 26 isolates, three strains (NIOT-VKKMA02, NIOT-VKKMA22 and NIOT-VKKMA26) were found to generate appreciable extent of surfactant, amylase, cellulase and protease enzyme. NIOT-VKKMA02 produced surfactant using kerosene as carbon source and emulsified upto E24–63.6%. Moreover, NIOT-VKKMA02, NIOT-VKKMA22 and NIOT-VKKMA26 synthesized 13.27 U/ml, 9.85 U/ml and 8.03 U/ml amylase; 7.75 U/ml, 5.01 U/ml and 2.08 U/ml of cellulase and 11.34 U/ml, 6.89 U/ml and 3.51 U/ml of protease enzyme, respectively. Conclusions High diversity of marine actinobacteria was isolated and characterized in this work including undescribed species and species not previously reported from emerald Andaman and Nicobar Islands, including Streptomyces griseus, Streptomyces venezuelae and Saccharopolyspora salina. The enhanced salt, pH and temperature tolerance of the actinobacterial isolates along with their capacity to secrete commercially valuable primary and secondary metabolites emerges as an attractive feature of these organisms. These results are reported for the first time from these emerald Islands and expand the scope to functionally characterize novel marine actinobacteria and their metabolites for the potential novel molecules of commercial interest. PMID:23800234
Novel marine actinobacteria from emerald Andaman & Nicobar Islands: a prospective source for industrial and pharmaceutical byproducts.

PubMed

Meena, Balakrishnan; Rajan, Lawrance Anbu; Vinithkumar, Nambali Valsalan; Kirubagaran, Ramalingam

2013-06-22

Andaman and Nicobar Islands situated in the eastern part of Bay of Bengal are one of the distinguished biodiversity hotspot. Even though number of studies carried out on the marine flora and fauna, the studies on actinobacteria from Andaman and Nicobar Islands are meager. The aim of the present study was to screen the actinobacteria for their characterization and identify the potential sources for industrial and pharmaceutical byproducts. A total of 26 actinobacterial strains were isolated from the marine sediments collected from various sites of Port Blair Bay where no collection has been characterized previously. Isolates were categorized under the genera: Saccharopolyspora, Streptomyces, Nocardiopsis, Streptoverticillium, Microtetraspora, Actinopolyspora, Actinokineospora and Dactylosporangium. Majority of the isolates were found to produce industrially important enzymes such as amylase, protease, gelatinase, lipase, DNase, cellulase, urease and phosphatase, and also exhibited substantial antibacterial activity against human pathogens. 77% of isolates exhibited significant hemolytic activity. Among 26 isolates, three strains (NIOT-VKKMA02, NIOT-VKKMA22 and NIOT-VKKMA26) were found to generate appreciable extent of surfactant, amylase, cellulase and protease enzyme. NIOT-VKKMA02 produced surfactant using kerosene as carbon source and emulsified upto E(24)-63.6%. Moreover, NIOT-VKKMA02, NIOT-VKKMA22 and NIOT-VKKMA26 synthesized 13.27 U/ml, 9.85 U/ml and 8.03 U/ml amylase; 7.75 U/ml, 5.01 U/ml and 2.08 U/ml of cellulase and 11.34 U/ml, 6.89 U/ml and 3.51 U/ml of protease enzyme, respectively. High diversity of marine actinobacteria was isolated and characterized in this work including undescribed species and species not previously reported from emerald Andaman and Nicobar Islands, including Streptomyces griseus, Streptomyces venezuelae and Saccharopolyspora salina. The enhanced salt, pH and temperature tolerance of the actinobacterial isolates along with their capacity to secrete commercially valuable primary and secondary metabolites emerges as an attractive feature of these organisms. These results are reported for the first time from these emerald Islands and expand the scope to functionally characterize novel marine actinobacteria and their metabolites for the potential novel molecules of commercial interest.
Immunogenicity and safety of the Vi-CRM197 conjugate vaccine against typhoid fever in adults, children, and infants in south and southeast Asia: results from two randomised, observer-blind, age de-escalation, phase 2 trials.

PubMed

Bhutta, Zulfiqar A; Capeding, Maria Rosario; Bavdekar, Ashish; Marchetti, Elisa; Ariff, Shabina; Soofi, Sajid B; Anemona, Alessandra; Habib, Muhammad A; Alberto, Edison; Juvekar, Sanjay; Khan, Rana M Qasim; Marhaba, Rachid; Ali, Noshad; Malubay, Nelia; Kawade, Anand; Saul, Allan; Martin, Laura B; Podda, Audino

2014-02-01

Typhoid vaccination is a public health priority in developing countries where young children are greatly affected by typhoid fever. Because present vaccines are not recommended for children younger than 2 years, the Novartis Vaccines Institute for Global Health developed a conjugate vaccine (Vi-CRM197) for infant immunisation. We aimed to assess the immunogenicity and safety of Vi-CRM197 in participants of various ages in endemic countries in south and southeast Asia. We did two randomised, observer-blind, age de-escalation, phase 2 trials at two sites in Pakistan and India (study A), and at one site in the Philippines (study B), between March 2, 2011, and Aug 9, 2012. Adults aged 18-45 years, children aged 24-59 months, older infants aged 9-12 months, and infants aged 6-8 weeks were randomly assigned (1:1) with a computer-generated randomisation list (block size of four) to receive either 5 μg Vi-CRM197 or 25 μg Vi-polysaccharide vaccine (or 13-valent pneumococcal conjugate vaccine in children younger than 2 years). Both infant populations received Vi-CRM197 concomitantly with vaccines of the Expanded Programme on Immunization (EPI), according to WHO schedule. With the exception of designated study site personnel responsible for vaccine preparation, study investigators, those assessing outcomes, and data analysts were masked to treatment allocation. We specified no a-priori null hypothesis for the immunogenicity or safety objectives and all analyses were descriptive. Analyses were by modified intention-to-treat. These studies are registered with ClinicalTrials.gov, numbers NCT01229176 and NCT01437267. 320 participants were enrolled and vaccinated in the two trials: 200 in study A (all age groups) and 120 in study B (children and infants only), of whom 317 (99%) were included in the modified intention-to-treat analysis. One dose of Vi-CRM197 significantly increased concentrations of anti-Vi antibody in adults (from 113 U/mL [95% CI 67-190] to 208 U/mL [117-369]), children (201 U/mL [138-294] to 368 U/mL [234-580]), and older infants (179 U/mL [129-250] to 249 U/mL [130-477]). However, in children and older infants, a second dose of conjugate vaccine had no incremental effect on antibody titres and, at all ages, concentrations of antibodies increased substantially 6 months after vaccination (from 55 U/mL [33-94] to 63 U/mL [35-114] in adults, from 23 U/mL [15-34] to 51 U/mL [34-76] in children, and from 21 U/mL [14-31] to 22 U/mL [14-33] in older infants). Immune response in infants aged 6-8 weeks was lower than that in older participants and, 6 months after third vaccination, antibody concentrations were significantly higher than pre-vaccination concentrations in Filipino (21 U/mL [16-28] vs 2.88 U/mL [1.95-4.25]), but not Pakistani (3.76 U/mL [2.77-5.08] vs 2.77 U/mL [2.1-3.66]), infants. Vi-CRM197 was safe and well tolerated and did not induce any significant interference with EPI vaccines. No deaths or vaccine-related serious adverse events were reported throughout the studies. Vi-CRM197 is safe and immunogenic in endemic populations of all ages. Given at 9 months of age, concomitantly with measles vaccine, Vi-CRM197 shows a promise for potential inclusion in EPI schedules of countries endemic for typhoid. An apparent absence of booster response and a reduction in antibody titres 6 months after immunisation should be further investigated, but data show that an immunogenic typhoid vaccine can be safely delivered to infants during EPI visits recommended by WHO. Sclavo Vaccines Association and Regione Toscana. Copyright © 2014 Elsevier Ltd. All rights reserved.
UML activity diagrams in requirements specification of logic controllers

NASA Astrophysics Data System (ADS)

Grobelna, Iwona; Grobelny, Michał

2015-12-01

Logic controller specification can be prepared using various techniques. One of them is the wide understandable and user-friendly UML language and its activity diagrams. Using formal methods during the design phase increases the assurance that implemented system meets the project requirements. In the approach we use the model checking technique to formally verify a specification against user-defined behavioral requirements. The properties are usually defined as temporal logic formulas. In the paper we propose to use UML activity diagrams in requirements definition and then to formalize them as temporal logic formulas. As a result, UML activity diagrams can be used both for logic controller specification and for requirements definition, what simplifies the specification and verification process.
Query Expansion Using SNOMED-CT and Weighing Schemes

DTIC Science & Technology

2014-11-01

For this research, we have used SNOMED-CT along with UMLS Methathesaurus as our ontology in medical domain to expand the queries. General Terms...CT along with UMLS Methathesaurus as our ontology in medical domain to expand the queries. 15. SUBJECT TERMS 16. SECURITY CLASSIFICATION OF: 17...University of the Basque country discuss their finding on query expansion using external sources headlined by Unified Medical Language System ( UMLS
ADEpedia 2.0: Integration of Normalized Adverse Drug Events (ADEs) Knowledge from the UMLS.

PubMed

Jiang, Guoqian; Liu, Hongfang; Solbrig, Harold R; Chute, Christopher G

2013-01-01

A standardized Adverse Drug Events (ADEs) knowledge base that encodes known ADE knowledge can be very useful in improving ADE detection for drug safety surveillance. In our previous study, we developed the ADEpedia that is a standardized knowledge base of ADEs based on drug product labels. The objectives of the present study are 1) to integrate normalized ADE knowledge from the Unified Medical Language System (UMLS) into the ADEpedia; and 2) to enrich the knowledge base with the drug-disorder co-occurrence data from a 51-million-document electronic medical records (EMRs) system. We extracted 266,832 drug-disorder concept pairs from the UMLS, covering 14,256 (1.69%) distinct drug concepts and 19,006 (3.53%) distinct disorder concepts. Of them, 71,626 (26.8%) concept pairs from UMLS co-occurred in the EMRs. We performed a preliminary evaluation on the utility of the UMLS ADE data. In conclusion, we have built an ADEpedia 2.0 framework that intends to integrate known ADE knowledge from disparate sources. The UMLS is a useful source for providing standardized ADE knowledge relevant to indications, contraindications and adverse effects, and complementary to the ADE data from drug product labels. The statistics from EMRs would enable the meaningful use of ADE data for drug safety surveillance.
Automatic Synthesis of UML Designs from Requirements in an Iterative Process

NASA Technical Reports Server (NTRS)

Schumann, Johann; Whittle, Jon; Clancy, Daniel (Technical Monitor)

2001-01-01

The Unified Modeling Language (UML) is gaining wide popularity for the design of object-oriented systems. UML combines various object-oriented graphical design notations under one common framework. A major factor for the broad acceptance of UML is that it can be conveniently used in a highly iterative, Use Case (or scenario-based) process (although the process is not a part of UML). Here, the (pre-) requirements for the software are specified rather informally as Use Cases and a set of scenarios. A scenario can be seen as an individual trace of a software artifact. Besides first sketches of a class diagram to illustrate the static system breakdown, scenarios are a favorite way of communication with the customer, because scenarios describe concrete interactions between entities and are thus easy to understand. Scenarios with a high level of detail are often expressed as sequence diagrams. Later in the design and implementation stage (elaboration and implementation phases), a design of the system's behavior is often developed as a set of statecharts. From there (and the full-fledged class diagram), actual code development is started. Current commercial UML tools support this phase by providing code generators for class diagrams and statecharts. In practice, it can be observed that the transition from requirements to design to code is a highly iterative process. In this talk, a set of algorithms is presented which perform reasonable synthesis and transformations between different UML notations (sequence diagrams, Object Constraint Language (OCL) constraints, statecharts). More specifically, we will discuss the following transformations: Statechart synthesis, introduction of hierarchy, consistency of modifications, and "design-debugging".
A Pilot Study of Contextual UMLS Indexing to Improve the Precision of Concept-based Representation in XML-structured Clinical Radiology Reports

PubMed Central

Huang, Yang; Lowe, Henry J.; Hersh, William R.

2003-01-01

Objective: Despite the advantages of structured data entry, much of the patient record is still stored as unstructured or semistructured narrative text. The issue of representing clinical document content remains problematic. The authors' prior work using an automated UMLS document indexing system has been encouraging but has been affected by the generally low indexing precision of such systems. In an effort to improve precision, the authors have developed a context-sensitive document indexing model to calculate the optimal subset of UMLS source vocabularies used to index each document section. This pilot study was performed to evaluate the utility of this indexing approach on a set of clinical radiology reports. Design: A set of clinical radiology reports that had been indexed manually using UMLS concept descriptors was indexed automatically by the SAPHIRE indexing engine. Using the data generated by this process the authors developed a system that simulated indexing, at the document section level, of the same document set using many permutations of a subset of the UMLS constituent vocabularies. Measurements: The precision and recall scores generated by simulated indexing for each permutation of two or three UMLS constituent vocabularies were determined. Results: While there was considerable variation in precision and recall values across the different subtypes of radiology reports, the overall effect of this indexing strategy using the best combination of two or three UMLS constituent vocabularies was an improvement in precision without significant impact of recall. Conclusion: In this pilot study a contextual indexing strategy improved overall precision in a set of clinical radiology reports. PMID:12925544

A pilot study of contextual UMLS indexing to improve the precision of concept-based representation in XML-structured clinical radiology reports.

PubMed

Huang, Yang; Lowe, Henry J; Hersh, William R

2003-01-01

Despite the advantages of structured data entry, much of the patient record is still stored as unstructured or semistructured narrative text. The issue of representing clinical document content remains problematic. The authors' prior work using an automated UMLS document indexing system has been encouraging but has been affected by the generally low indexing precision of such systems. In an effort to improve precision, the authors have developed a context-sensitive document indexing model to calculate the optimal subset of UMLS source vocabularies used to index each document section. This pilot study was performed to evaluate the utility of this indexing approach on a set of clinical radiology reports. A set of clinical radiology reports that had been indexed manually using UMLS concept descriptors was indexed automatically by the SAPHIRE indexing engine. Using the data generated by this process the authors developed a system that simulated indexing, at the document section level, of the same document set using many permutations of a subset of the UMLS constituent vocabularies. The precision and recall scores generated by simulated indexing for each permutation of two or three UMLS constituent vocabularies were determined. While there was considerable variation in precision and recall values across the different subtypes of radiology reports, the overall effect of this indexing strategy using the best combination of two or three UMLS constituent vocabularies was an improvement in precision without significant impact of recall. In this pilot study a contextual indexing strategy improved overall precision in a set of clinical radiology reports.
Bacillus sp. Acting as Dual Role for Corrosion Induction and Corrosion Inhibition with Carbon Steel (CS)

PubMed Central

Karn, Santosh K.; Fang, Guan; Duan, Jizhou

2017-01-01

Present work investigated the role of five different bacteria species as a corrosion inducer as well as corrosion inhibitor with carbon steel (CS). We observed the ability of different bacteria species on the metal surface attachment, biofilm formation, and determined Peroxidase, Catalase enzyme activity in the detached biofilm from the CS surface. We found that each strain has diverse conduct for surface attachment like DS1 3.3, DS2 2.5, DS3 4.3, DS4 4.0, and DS5 4.71 log cfu/cm2 and for biofilm 8.3 log cfu/cm2. The enzyme Peroxidase, Catalase was found in huge concentration inside the biofilm Peroxidase was maximum for DS4 36.0 U/ml and least for DS3 19.54 U/ml. Whereas, Catalase was highest for DS4, DS5 70.14 U/ml and least 57.2 U/ml for DS2. Scanning electron microscopy (SEM) was conducted to examine the biofilm and electrochemical impedance spectroscopy (EIS) were utilized to observe corrosion in the presence of bacteria. The electrochemical results confirmed that DS1, DS3, DS4, and DS5 strains have statistically significant MIC-factors (Microbially Influenced Corrosion) of 5.46, 8.51, 2.36, and 1.04, while DS2 protective effect factor of 0.89. Weight reduction results with carbon steel likewise supports that corrosion was initiated by DS1 and DS3, while DS2 and DS5 have no any impact though with DS4 we watched less weight reduction however assumed no role in the corrosion. We established the relation of Peroxidase enzyme activity of the isolates. DS1, DS3 and having Peroxidase in the range 22.18, 19.54 U/ml which induce the corrosion whereas DS2 and DS5 having 28.57 and 27.0 U/ml has no any effect and DS4 36 U/ml has inhibitory effect, increasing concentration inhibiting the corrosion. For Catalase DS1, DS3 have 67.28, 61.57 U/ml which induce corrosion while DS2 and DS5 57.71 and 59.14 U/ml also has no effect whereas DS4 70.14 U/ml can inhibit corrosion. Results clearly express that in a specific range both enzymes can induce the corrosion. Our goals are to pursuit and locate the potential role of the enzyme in corrosion induction and inhibition. There is still further work is proceeded for the more profound perception. PMID:29114242
A core observational data model for enhancing the interoperability of ontologically annotated environmental data

NASA Astrophysics Data System (ADS)

Schildhauer, M.; Bermudez, L. E.; Bowers, S.; Dibner, P. C.; Gries, C.; Jones, M. B.; McGuinness, D. L.; Cao, H.; Cox, S. J.; Kelling, S.; Lagoze, C.; Lapp, H.; Madin, J.

2010-12-01

Research in the environmental sciences often requires accessing diverse data, collected by numerous data providers over varying spatiotemporal scales, incorporating specialized measurements from a range of instruments. These measurements are typically documented using idiosyncratic, disciplinary specific terms, and stored in management systems ranging from desktop spreadsheets to the Cloud, where the information is often further decomposed or stylized in unpredictable ways. This situation creates major informatics challenges for broadly discovering, interpreting, and merging the data necessary for integrative earth science research. A number of scientific disciplines have recognized these issues, and been developing semantically enhanced data storage frameworks, typically based on ontologies, to enable communities to better circumscribe and clarify the content of data objects within their domain of practice. There is concern, however, that cross-domain compatibility of these semantic solutions could become problematic. We describe here our efforts to address this issue by developing a core, unified Observational Data Model, that should greatly facilitate interoperability among the semantic solutions growing organically within diverse scientific domains. Observational Data Models have emerged independently from several distinct scientific communities, including the biodiversity sciences, ecology, evolution, geospatial sciences, and hydrology, to name a few. Informatics projects striving for data integration within each of these domains had converged on identifying "observations" and "measurements" as fundamental abstractions that provide useful "templates" through which scientific data can be linked— at the structural, composited, or even cell value levels— to domain terms stored in ontologies or other forms of controlled vocabularies. The Scientific Observations Network, SONet (http://sonet.ecoinformatics.org) brings together a number of these observational data efforts, and is harmonizing their models. The specific observational data models currently under consideration include the OGC's Observations and Measurements Encoding Standard, O&M; the ecological community's Extensible Observation Ontology, OBOE'; the evolutionary community's Entity-Quality model, EQ; and the VSTO core classes, intended for describing atmospheric and solar-terrestrial phenomena, VSTO.OWL. These models all share high structural similarities, expressed in different languages (e.g. UML or OWL), and are intended for use with very different forms of data. The main focus of this talk will be describing these Observational Data Models, and more importantly, how harmonizing these will catalyze semantically enhanced access to large additional data resources across the earth and life sciences.
Memory Dysfunction

PubMed Central

Matthews, Brandy R.

2015-01-01

Purpose of Review: This article highlights the dissociable human memory systems of episodic, semantic, and procedural memory in the context of neurologic illnesses known to adversely affect specific neuroanatomic structures relevant to each memory system. Recent Findings: Advances in functional neuroimaging and refinement of neuropsychological and bedside assessment tools continue to support a model of multiple memory systems that are distinct yet complementary and to support the potential for one system to be engaged as a compensatory strategy when a counterpart system fails. Summary: Episodic memory, the ability to recall personal episodes, is the subtype of memory most often perceived as dysfunctional by patients and informants. Medial temporal lobe structures, especially the hippocampal formation and associated cortical and subcortical structures, are most often associated with episodic memory loss. Episodic memory dysfunction may present acutely, as in concussion; transiently, as in transient global amnesia (TGA); subacutely, as in thiamine deficiency; or chronically, as in Alzheimer disease. Semantic memory refers to acquired knowledge about the world. Anterior and inferior temporal lobe structures are most often associated with semantic memory loss. The semantic variant of primary progressive aphasia (svPPA) is the paradigmatic disorder resulting in predominant semantic memory dysfunction. Working memory, associated with frontal lobe function, is the active maintenance of information in the mind that can be potentially manipulated to complete goal-directed tasks. Procedural memory, the ability to learn skills that become automatic, involves the basal ganglia, cerebellum, and supplementary motor cortex. Parkinson disease and related disorders result in procedural memory deficits. Most memory concerns warrant bedside cognitive or neuropsychological evaluation and neuroimaging to assess for specific neuropathologies and guide treatment. PMID:26039844
A semantic data dictionary method for database schema integration in CIESIN

NASA Astrophysics Data System (ADS)

Hinds, N.; Huang, Y.; Ravishankar, C.

1993-08-01

CIESIN (Consortium for International Earth Science Information Network) is funded by NASA to investigate the technology necessary to integrate and facilitate the interdisciplinary use of Global Change information. A clear of this mission includes providing a link between the various global change data sets, in particular the physical sciences and the human (social) sciences. The typical scientist using the CIESIN system will want to know how phenomena in an outside field affects his/her work. For example, a medical researcher might ask: how does air-quality effect emphysema? This and many similar questions will require sophisticated semantic data integration. The researcher who raised the question may be familiar with medical data sets containing emphysema occurrences. But this same investigator may know little, if anything, about the existance or location of air-quality data. It is easy to envision a system which would allow that investigator to locate and perform a ``join'' on two data sets, one containing emphysema cases and the other containing air-quality levels. No such system exists today. One major obstacle to providing such a system will be overcoming the heterogeneity which falls into two broad categories. ``Database system'' heterogeneity involves differences in data models and packages. ``Data semantic'' heterogeneity involves differences in terminology between disciplines which translates into data semantic issues, and varying levels of data refinement, from raw to summary. Our work investigates a global data dictionary mechanism to facilitate a merged data service. Specially, we propose using a semantic tree during schema definition to aid in locating and integrating heterogeneous databases.
Production of xylanase and protease by Penicillium janthinellum CRC 87M-115 from different agricultural wastes.

PubMed

Oliveira, Luciana A; Porto, Ana L F; Tambourgi, Elias B

2006-04-01

Five agricultural wastes were evaluated in submerged fermentation for xylanolytic enzymes production by Penicillium janthinellum. The wastes were hydrolyzed in acid medium and the liquid fraction was used for cultivation. Corn cob (55.3 U/mL) and oat husk (54.8 U/mL) were the best inducers of xylanase. Sugar cane bagasse (23.0 U/mL) and corn husk (23.8 U/mL) were moderately good, while cassava peel was negligible. Protease production was very low in all agro-industrial residues. The maximum biomass yields were 1.30 and 1.17 g/L for cassava peel and corn husk after 180 h, respectively. Xylanolytic activity showed a cell growth associated profile.
Model-Driven Theme/UML

NASA Astrophysics Data System (ADS)

Carton, Andrew; Driver, Cormac; Jackson, Andrew; Clarke, Siobhán

Theme/UML is an existing approach to aspect-oriented modelling that supports the modularisation and composition of concerns, including crosscutting ones, in design. To date, its lack of integration with model-driven engineering (MDE) techniques has limited its benefits across the development lifecycle. Here, we describe our work on facilitating the use of Theme/UML as part of an MDE process. We have developed a transformation tool that adopts model-driven architecture (MDA) standards. It defines a concern composition mechanism, implemented as a model transformation, to support the enhanced modularisation features of Theme/UML. We evaluate our approach by applying it to the development of mobile, context-aware applications-an application area characterised by many non-functional requirements that manifest themselves as crosscutting concerns.
Soluble CD30 levels in recipients undergoing heart transplantation do not predict post-transplant outcome.

PubMed

Ypsilantis, Efthymios; Key, Timothy; Bradley, J Andrew; Morgan, C Helen; Tsui, Stephen; Parameshwar, Jayan; Taylor, Craig J

2009-11-01

The pre-transplant serum level of soluble CD30 (sCD30), a proteolytic derivative of the lymphocyte surface receptor CD30, has been suggested as a biomarker for immunologic risk after organ transplantation. Pre-transplant serum sCD30 levels were determined in 200 consecutive adult heart transplant recipients undertaken at a single center. Transplant outcome (acute rejection in the first 12 months and patient survival up to 5 years post-transplant) was determined. Patients treated with a left ventricular assist device (LVAD) prior to transplantation (n = 28) had higher levels of sCD30 (median 64 U/ml, range 12 to 112 U/ml) than those (n = 172) with no LVAD (median 36 U/ml, range 1 to 158 U/ml, p < 0.0001). Recipients were categorized according to whether sCD30 levels were "low" (lower quartile, <24 U/ml, n = 50), "intermediate" (24 to 58 U/ml, n = 100) or "high" (upper quartile, >58 U/ml, n = 50). Neither acute rejection nor recipient survival differed according to sCD30 level, with values (mean +/- SEM) of 0.30 +/- 0.04, 0.23 +/- 0.03 and 0.30 +/- 0.05 acute rejection episodes per 100 days in the low, intermediate and high groups, respectively, with recipient survival rates at 1 year of 77.7%, 84.9% and 86% and at 5 years of 73.6%, 67.9% and 75.8%, respectively. Pre-transplant serum sCD30 level does not predict acute allograft rejection or recipient survival after heart transplantation, although sCD30 levels are increased by LVAD, possibly as a result of biomaterial-host immune interaction.
Value of soluble CD30 in liver transplantation.

PubMed

Fábrega, E; Unzueta, M G; Cobo, M; Casafont, F; Amado, J A; Romero, F P

2007-09-01

CD30 is a membrane glycoprotein that belongs to the tumor necrosis factor superfamily. It is expressed on activated T cells. After activation of CD30(+) T cells, a soluble form of CD30 (sCD30) released into the bloodstream, can be measured in the serum. The aim of our study was to investigate the time course of serum levels of sCD30 during hepatic allograft rejection. Serum levels of sCD30 were determined in 30 healthy subjects and 50 hepatic transplant recipients. These patients were divided into two groups: group I, 35 patients without rejection; and group II, 15 patients with acute rejection. Samples were collected on day 1 and 7 after transplantation and on the day of liver biopsy. The concentrations of sCD30 were similar in the rejection (40.4 +/- 16.5 U/mL) and nonrejection groups (43.0 +/- 18.2 U/mL) on postoperative day 1. We observed a significant increase in sCD30 levels in the rejection group on postoperative day 7 (76.3 +/- 61.8 U/mL vs 46.8 +/- 20.5 U/mL; P = .01). The difference increased when a diagnosis of acute rejection had been established: namely 133.0 +/- 113.5 U/mL versus 40.1 +/- 22.0 U/mL; (P = .001). These levels were also significantly higher during the entire postoperative period in all the patients, with or without rejection, than those observed in healthy controls (26.6 +/- 5.3 U/mL; P = .005). The release of circulating sCD30 is a prominent feature coinciding with the first episode of hepatic allograft rejection. So, monitoring of sCD30 levels may be useful for the early diagnosis of an acute rejection episode.
Mining continuous activity patterns from animal trajectory data

USGS Publications Warehouse

Wang, Y.; Luo, Ze; Baoping, Yan; Takekawa, John Y.; Prosser, Diann J.; Newman, Scott H.

2014-01-01

The increasing availability of animal tracking data brings us opportunities and challenges to intuitively understand the mechanisms of animal activities. In this paper, we aim to discover animal movement patterns from animal trajectory data. In particular, we propose a notion of continuous activity pattern as the concise representation of underlying similar spatio-temporal movements, and develop an extension and refinement framework to discover the patterns. We first preprocess the trajectories into significant semantic locations with time property. Then, we apply a projection-based approach to generate candidate patterns and refine them to generate true patterns. A sequence graph structure and a simple and effective processing strategy is further developed to reduce the computational overhead. The proposed approaches are extensively validated on both real GPS datasets and large synthetic datasets.
A UML profile for the OBO relation ontology.

PubMed

Guardia, Gabriela D A; Vêncio, Ricardo Z N; de Farias, Cléver R G

2012-01-01

Ontologies have increasingly been used in the biomedical domain, which has prompted the emergence of different initiatives to facilitate their development and integration. The Open Biological and Biomedical Ontologies (OBO) Foundry consortium provides a repository of life-science ontologies, which are developed according to a set of shared principles. This consortium has developed an ontology called OBO Relation Ontology aiming at standardizing the different types of biological entity classes and associated relationships. Since ontologies are primarily intended to be used by humans, the use of graphical notations for ontology development facilitates the capture, comprehension and communication of knowledge between its users. However, OBO Foundry ontologies are captured and represented basically using text-based notations. The Unified Modeling Language (UML) provides a standard and widely-used graphical notation for modeling computer systems. UML provides a well-defined set of modeling elements, which can be extended using a built-in extension mechanism named Profile. Thus, this work aims at developing a UML profile for the OBO Relation Ontology to provide a domain-specific set of modeling elements that can be used to create standard UML-based ontologies in the biomedical domain.
Evaluation of secretome of highly efficient lignocellulolytic Penicillium sp. Dal 5 isolated from rhizosphere of conifers.

PubMed

Rai, Rohit; Kaur, Baljit; Singh, Surender; Di Falco, Macros; Tsang, Adrian; Chadha, B S

2016-09-01

Penicillium sp. (Dal 5) isolated from rhizosphere of conifers from Dalhousie (Himachal Pradesh, India) was found to be an efficient cellulolytic strain. The culture under shake flask on CWR (cellulose, wheat bran and rice straw) medium produced appreciably higher levels of endoglucanase (35.69U/ml), β-glucosidase (4.20U/ml), cellobiohydrolase (2.86U/ml), FPase (1.2U/ml) and xylanase (115U/ml) compared to other Penicillium strains reported in literature. The mass spectroscopy analysis of Penicillium sp. Dal 5 secretome identified 108 proteins constituting an array of CAZymes including glycosyl hydrolases (GH) belonging to 24 different families, polysaccharide lyases (PL), carbohydrate esterases (CE), lytic polysaccharide mono-oxygenases (LPMO) in addition to swollenin and a variety of carbohydrate binding modules (CBM) indicating an elaborate genetic potential of this strain for hydrolysis of lignocellulosics. Further, the culture extract was evaluated for hydrolysis of alkali treated rice straw, wheat straw, bagasse and corn cob at 10% substrate loading rate. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mission Assurance in a Distributed Environment

DTIC Science & Technology

2009-06-01

Notation ( BPMN ) – Graphical representation of business processes in a workflow • Unified Modeling Language (UML) – Use standard UML diagrams to model the system – Component, sequence, activity diagrams
Mapping Japanese medical terms to UMLS Metathesaurus.

PubMed

Onogi, Yuzo; Ohe, Kazuhiko; Tanaka, Masaaki; Nozoe, Atsutake; Sasaki, Tetsuro; Sato, Megumi; Kikuchi, Yuko; Shinohara, Tsuneki; Suzuki, Hiromichi; Kaihara, Shigekoto; Seyama, Yousuke

2004-01-01

This paper introduces and reports the results for a project to map Japanese medical terms to the UMLS Metathesaurus. The "Thesaurus for Medical and Health related Terms version 5" published in 2003 by the Japan Medical Abstracts Society and UMLS version 2002AC provided by NLM were used in this study. The goal was to judge the validity of the correlation between the Japanese and English terms that belong to the same MeSH concept. Fifteen medicine, nursing, and library science professionals, excluding JAMAS, used a custom designed Web interface to perform this task. About 10% of the concepts were judged as invalid, and the reasoning behind these failures were analyzed. Experience from this project can be used to estimate the manpower required to revise the Japanese thesaurus after future revisions to UMLS or MeSH.
Epstein-Barr virus is related with 5-aminosalicylic acid, tonsillectomy, and CD19(+) cells in Crohn's disease.

PubMed

Andreu-Ballester, Juan C; Gil-Borrás, Rafael; García-Ballesteros, Carlos; Catalán-Serra, Ignacio; Amigo, Victoria; Fernández-Fígares, Virgina; Cuéllar, Carmen

2015-04-21

To study anti-Epstein-Barr virus (EBV) IgG antibodies in Crohn's disease in relation to treatment, immune cells, and prior tonsillectomy/appendectomy. This study included 36 CD patients and 36 healthy individuals (controls), and evaluated different clinical scenarios (new patient, remission and active disease), previous mucosa-associated lymphoid tissue removal (tonsillectomy and appendectomy) and therapeutic regimens (5-aminosalicylic acid, azathioprine, anti-tumor necrosis factor, antibiotics, and corticosteroids). T and B cells subsets in peripheral blood were analyzed by flow cytometry (markers included: CD45, CD4, CD8, CD3, CD19, CD56, CD2, CD3, TCRαβ and TCRγδ) to relate with the levels of anti-EBV IgG antibodies, determined by enzyme-linked immunosorbent assay. The lowest anti-EBV IgG levels were observed in the group of patients that were not in a specific treatment (95.4 ± 53.9 U/mL vs 131.5 ± 46.2 U/mL, P = 0.038). The patients that were treated with 5-aminosalicylic acid showed the highest anti-EBV IgG values (144.3 U/mL vs 102.6 U/mL, P = 0.045). CD19(+) cells had the largest decrease in the group of CD patients that received treatment (138.6 vs 223.9, P = 0.022). The analysis of anti-EBV IgG with respect to the presence or absence of tonsillectomy showed the highest values in the tonsillectomy group of CD patients (169.2 ± 20.7 U/mL vs 106.1 ± 50.3 U/mL, P = 0.002). However, in the group of healthy controls, no differences were seen between those who had been tonsillectomized and subjects who had not been operated on (134.0 ± 52.5 U/mL vs 127.7 ± 48.1 U/mL, P = 0.523). High anti-EBV IgG levels in CD are associated with 5-aminosalicylic acid treatment, tonsillectomy, and decrease of CD19(+) cells.
Study of a High-Yield Cellulase System Created by Heavy-Ion Irradiation-Induced Mutagenesis of Aspergillus niger and Mixed Fermentation with Trichoderma reesei

PubMed Central

Chen, Ji-Hong; Li, Wen-Jian; Liu, Jing; Hu, Wei; Xiao, Guo-Qing; Dong, Miao-Yin; Wang, Yu-Chen

2015-01-01

The aim of this study was to evaluate and validate the efficiency of 12C6+ irradiation of Aspergillus niger (A. niger) or mutagenesis via mixed Trichoderma viride (T. viride) culturing as well as a liquid cultivation method for cellulase production via mixed Trichoderma reesei (T. reesei) and A. niger culture fermentation. The first mutagenesis approach was employed to optimize yield from a cellulase-producing strain via heavy-ion mutagenesis and high-throughput screening, and the second was to effectively achieve enzymatic hydrolysis of cellulase from a mixed culture of mutant T. viride and A. niger. We found that 12C6+-ion irradiation induced changes in cellulase biosynthesis in A. niger but had no effect on the time course of the synthesis. It is notable that the exoglucanases (CBH) activities of A. niger strains H11-1 and H differed (6.71 U/mL vs. 6.01 U/mL) and were significantly higher than that of A. niger mutant H3-1. Compared with strain H, the filter paper assay (FPA), endoglucanase (EG) and β-glucosidase (BGL) activities of mutant strain H11-1 were increased by 250.26%, 30.26% and 34.91%, respectively. A mixed culture system was successfully optimized, and the best ratio of T. reesei to A. niger was 5:1 for 96 h with simultaneous inoculation. The BGL activity of the mixed culture increased after 72 h. At 96 h, the FPA and BGL activities of the mixed culture were 689.00 and 797.15 U/mL, respectively, significantly higher than those of monocultures, which were 408.70 and 646.98 U/mL for T. reesei and 447.29 and 658.89 U/mL for A. niger, respectively. The EG activity of the mixed culture was 2342.81 U/mL, a value that was significantly higher than that of monocultures at 2206.57 U/mL for T. reesei and 1727.62 U/mL for A. niger. In summary, cellulose production and hydrolysis yields were significantly enhanced by the proposed combination scheme. PMID:26656155
Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions.

PubMed

Mrabet, Yassine; Kilicoglu, Halil; Roberts, Kirk; Demner-Fushman, Dina

2016-01-01

Determining the main topics in consumer health questions is a crucial step in their processing as it allows narrowing the search space to a specific semantic context. In this paper we propose a topic recognition approach based on biomedical and open-domain knowledge bases. In the first step of our method, we recognize named entities in consumer health questions using an unsupervised method that relies on a biomedical knowledge base, UMLS, and an open-domain knowledge base, DBpedia. In the next step, we cast topic recognition as a binary classification problem of deciding whether a named entity is the question topic or not. We evaluated our approach on a dataset from the National Library of Medicine (NLM), introduced in this paper, and another from the Genetic and Rare Disease Information Center (GARD). The combination of knowledge bases outperformed the results obtained by individual knowledge bases by up to 16.5% F1 and achieved state-of-the-art performance. Our results demonstrate that combining open-domain knowledge bases with biomedical knowledge bases can lead to a substantial improvement in understanding user-generated health content.
Proper name retrieval in temporal lobe epilepsy: naming of famous faces and landmarks.

PubMed

Benke, Thomas; Kuen, Eva; Schwarz, Michael; Walser, Gerald

2013-05-01

The objective of this study was to further explore proper name (PN) retrieval and conceptual knowledge in patients with left and right temporal lobe epilepsy (69 patients with LTLE and 62 patients with RTLE) using a refined assessment procedure. Based on the performance of a large group of age- and education-matched normals, a new test of famous faces and famous landmarks was designed. Recognition, naming, and semantic knowledge were assessed consecutively, allowing for a better characterization of deficient levels in the naming system. Impairment in PN retrieval was common in the cohort with TLE. Furthermore, side of seizure onset impaired stages of name retrieval differently: LTLE impaired the lexico-phonological processing, whereas RTLE mainly impaired the perceptual-semantic stage of object recognition. In addition to deficient PN retrieval, patients with TLE had reduced conceptual knowledge regarding famous persons and landmarks. Copyright © 2013 Elsevier Inc. All rights reserved.
Semantic Maps Capturing Organization Knowledge in e-Learning

NASA Astrophysics Data System (ADS)

Mavridis, Androklis; Koumpis, Adamantios; Demetriadis, Stavros N.

e-learning, shows much promise in accessibility and opportunity to learn, due to its asynchronous nature and its ability to transmit knowledge fast and effectively. However without a universal standard for online learning and teaching, many systems are proclaimed as “e-learning-compliant”, offering nothing more than automated services for delivering courses online, providing no additional enhancement to reusability and learner personalization. Hence, the focus is not on providing reusable and learner-centered content, but on developing the technology aspects of e-learning. This current trend has made it crucial to find a more refined definition of what constitutes knowledge in the e-learning context. We propose an e-learning system architecture that makes use of a knowledge model to facilitate continuous dialogue and inquiry-based knowledge learning, by exploiting the full benefits of the semantic web as a medium capable for supplying the web with formalized knowledge.
Applied and implied semantics in crystallographic publishing

PubMed Central

2012-01-01

Background Crystallography is a data-rich, software-intensive scientific discipline with a community that has undertaken direct responsibility for publishing its own scientific journals. That community has worked actively to develop information exchange standards allowing readers of structure reports to access directly, and interact with, the scientific content of the articles. Results Structure reports submitted to some journals of the International Union of Crystallography (IUCr) can be automatically validated and published through an efficient and cost-effective workflow. Readers can view and interact with the structures in three-dimensional visualization applications, and can access the experimental data should they wish to perform their own independent structure solution and refinement. The journals also layer on top of this facility a number of automated annotations and interpretations to add further scientific value. Conclusions The benefits of semantically rich information exchange standards have revolutionised the scholarly publishing process for crystallography, and establish a model relevant to many other physical science disciplines. PMID:22932420

Linking the Long Tail of Data: A Bottoms-up Approach to Connecting Scientific Research

NASA Astrophysics Data System (ADS)

Jacob, B.; Arctur, D. K.

2016-12-01

Highly curated ontologies are often developed for big scientific data, but the long tail of research data rarely receives the same treatment. The learning curve for Semantic Web technology is steep, and the value of linking each long-tail data set to known taxonomies and ontologies in isolation rarely justifies the level of effort required to bring a Knowledge Engineer into the project. We present an approach that takes a bottoms-up approach of producing a Linked Data model of datasets mechanically, inferring the shape and structure of the data from the original format, and adding derived variables and semantic linkages via iterative, interactive refinements of that model. In this way, the vast corpus of small but rich scientific data becomes part of the greater linked web of knowledge, and the connectivity of that data can be iteratively improved over time.
UML and Model Checking

NASA Technical Reports Server (NTRS)

Schneider, F.

1999-01-01

UML use cases conceptually identify function points or major requirements that a software system must satisfy. Sequence diagrams expand each use case to show in temporal sequence a more detailed notion of intended system behavior.
UML Profiles for Design Decisions and Non-Functional Requirements

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhu, Liming; Gorton, Ian

2007-06-30

A software architecture is composed of a collection of design decisions. Each design decision helps or hinders certain Non-Functional Requirements (NFR). Current software architecture views focus on expressing components and connectors in the system. Design decisions and their relationships with non-functional requirements are often captured in separate design documentation, not explicitly expressed in any views. This disassociation makes architecture comprehension and architecture evolution harder. In this paper, we propose a UML profile for modeling design decisions and an associated UML profile for modeling non-functional requirements in a generic way. The two UML profiles treat design decisions and nonfunctional requirements asmore » first-class elements. Modeled design decisions always refer to existing architectural elements and thus maintain traceability between the two. We provide a mechanism for checking consistency over this traceability. An exemplar is given as« less
Crude cellulase from oil palm empty fruit bunch by Trichoderma asperellum UPM1 and Aspergillus fumigatus UPM2 for fermentable sugars production.

PubMed

Ibrahim, M F; Razak, M N A; Phang, L Y; Hassan, M A; Abd-Aziz, S

2013-07-01

Cellulase is an enzyme that converts the polymer structure of polysaccharides into fermentable sugars. The high market demand for this enzyme together with the variety of applications in the industry has brought the research on cellulase into focus. In this study, crude cellulase was produced from oil palm empty fruit bunch (OPEFB) pretreated with 2% NaOH with autoclave, which was composed of 59.7% cellulose, 21.6% hemicellulose, and 12.3% lignin using Trichoderma asperellum UPM1 and Aspergillus fumigatus UPM2. Approximately 0.8 U/ml of FPase, 24.7 U/ml of CMCase and 5.0 U/ml of β-glucosidase were produced by T. asperellum UPM1 at a temperature of 35 °C and at an initial pH of 7.0. A 1.7 U/ml of FPase, 24.2 U/ml of CMCase, and 1.1 U/ml of β-glucosidase were produced by A. fumigatus UPM2 at a temperature of 45 °C and at initial pH of 6.0. The crude cellulase was best produced at 1% of substrate concentration for both T. asperellum UPM1 and A. fumigatus UPM2. The hydrolysis percentage of pretreated OPEFB using 5% of crude cellulase concentration from T. asperellum UPM1 and A. fumigatus UPM2 were 3.33% and 19.11%, with the reducing sugars concentration of 1.47 and 8.63 g/l, respectively.
Formalization and Transformation of Informal Analysis Models into Executive REFINE (trademark) Specifications

DTIC Science & Technology

1992-12-01

describing how. 5. EDDA . EDDA is an attempt to add mathematical formalism to SADT. Because it is based on SADT, it cannot easily represent any other...design methodology. EDDA has two forms: G- EDDA , the standard graphical version of SADT, and S- EDDA , a textual language that partially represents the...used. "* EDDA only supports the SADT methodology and is too limited in scope to be useful in our research. "* SAMM lacks the semantic richness of
Left and right basal ganglia and frontal activity during language generation: contributions to lexical, semantic, and phonological processes.

PubMed

Crosson, Bruce; Benefield, Hope; Cato, M Allison; Sadek, Joseph R; Moore, Anna Bacon; Wierenga, Christina E; Gopinath, Kaundinya; Soltysik, David; Bauer, Russell M; Auerbach, Edward J; Gökçay, Didem; Leonard, Christiana M; Briggs, Richard W

2003-11-01

fMRI was used to determine the frontal, basal ganglia, and thalamic structures engaged by three facets of language generation: lexical status of generated items, the use of semantic vs. phonological information during language generation, and rate of generation. During fMRI, 21 neurologically normal subjects performed four tasks: generation of nonsense syllables given beginning and ending consonant blends, generation of words given a rhyming word, generation of words given a semantic category at a fast rate (matched to the rate of nonsense syllable generation), and generation of words given a semantic category at a slow rate (matched to the rate of generating of rhyming words). Components of a left pre-SMA-dorsal caudate nucleus-ventral anterior thalamic loop were active during word generation from rhyming or category cues but not during nonsense syllable generation. Findings indicate that this loop is involved in retrieving words from pre-existing lexical stores. Relatively diffuse activity in the right basal ganglia (caudate nucleus and putamen) also was found during word-generation tasks but not during nonsense syllable generation. Given the relative absence of right frontal activity during the word generation tasks, we suggest that the right basal ganglia activity serves to suppress right frontal activity, preventing right frontal structures from interfering with language production. Current findings establish roles for the left and the right basal ganglia in word generation. Hypotheses are discussed for future research to help refine our understanding of basal ganglia functions in language generation.
A MATLAB toolbox for the efficient estimation of the psychometric function using the updated maximum-likelihood adaptive procedure.

PubMed

Shen, Yi; Dai, Wei; Richards, Virginia M

2015-03-01

A MATLAB toolbox for the efficient estimation of the threshold, slope, and lapse rate of the psychometric function is described. The toolbox enables the efficient implementation of the updated maximum-likelihood (UML) procedure. The toolbox uses an object-oriented architecture for organizing the experimental variables and computational algorithms, which provides experimenters with flexibility in experimental design and data management. Descriptions of the UML procedure and the UML Toolbox are provided, followed by toolbox use examples. Finally, guidelines and recommendations of parameter configurations are given.
PREDOSE: A Semantic Web Platform for Drug Abuse Epidemiology using Social Media

PubMed Central

Cameron, Delroy; Smith, Gary A.; Daniulaityte, Raminta; Sheth, Amit P.; Dave, Drashti; Chen, Lu; Anand, Gaurish; Carlson, Robert; Watkins, Kera Z.; Falck, Russel

2013-01-01

Objectives The role of social media in biomedical knowledge mining, including clinical, medical and healthcare informatics, prescription drug abuse epidemiology and drug pharmacology, has become increasingly significant in recent years. Social media offers opportunities for people to share opinions and experiences freely in online communities, which may contribute information beyond the knowledge of domain professionals. This paper describes the development of a novel Semantic Web platform called PREDOSE (PREscription Drug abuse Online Surveillance and Epidemiology), which is designed to facilitate the epidemiologic study of prescription (and related) drug abuse practices using social media. PREDOSE uses web forum posts and domain knowledge, modeled in a manually created Drug Abuse Ontology (DAO) (pronounced dow), to facilitate the extraction of semantic information from User Generated Content (UGC). A combination of lexical, pattern-based and semantics-based techniques is used together with the domain knowledge to extract fine-grained semantic information from UGC. In a previous study, PREDOSE was used to obtain the datasets from which new knowledge in drug abuse research was derived. Here, we report on various platform enhancements, including an updated DAO, new components for relationship and triple extraction, and tools for content analysis, trend detection and emerging patterns exploration, which enhance the capabilities of the PREDOSE platform. Given these enhancements, PREDOSE is now more equipped to impact drug abuse research by alleviating traditional labor-intensive content analysis tasks. Methods Using custom web crawlers that scrape UGC from publicly available web forums, PREDOSE first automates the collection of web-based social media content for subsequent semantic annotation. The annotation scheme is modeled in the DAO, and includes domain specific knowledge such as prescription (and related) drugs, methods of preparation, side effects, routes of administration, etc. The DAO is also used to help recognize three types of data, namely: 1) entities, 2) relationships and 3) triples. PREDOSE then uses a combination of lexical and semantic-based techniques to extract entities and relationships from the scraped content, and a top-down approach for triple extraction that uses patterns expressed in the DAO. In addition, PREDOSE uses publicly available lexicons to identify initial sentiment expressions in text, and then a probabilistic optimization algorithm (from related research) to extract the final sentiment expressions. Together, these techniques enable the capture of fine-grained semantic information from UGC, and querying, search, trend analysis and overall content analysis of social media related to prescription drug abuse. Moreover, extracted data are also made available to domain experts for the creation of training and test sets for use in evaluation and refinements in information extraction techniques. Results A recent evaluation of the information extraction techniques applied in the PREDOSE platform indicates 85% precision and 72% recall in entity identification, on a manually created gold standard dataset. In another study, PREDOSE achieved 36% precision in relationship identification and 33% precision in triple extraction, through manual evaluation by domain experts. Given the complexity of the relationship and triple extraction tasks and the abstruse nature of social media texts, we interpret these as favorable initial results. Extracted semantic information is currently in use in an online discovery support system, by prescription drug abuse researchers at the Center for Interventions, Treatment and Addictions Research (CITAR) at Wright State University. Conclusion A comprehensive platform for entity, relationship, triple and sentiment extraction from such abstruse texts has never been developed for drug abuse research. PREDOSE has already demonstrated the importance of mining social media by providing data from which new findings in drug abuse research were uncovered. Given the recent platform enhancements, including the refined DAO, components for relationship and triple extraction, and tools for content, trend and emerging pattern analysis, it is expected that PREDOSE will play a significant role in advancing drug abuse epidemiology in future. PMID:23892295
14-3-3η Autoantibodies: Diagnostic Use in Early Rheumatoid Arthritis.

PubMed

Maksymowych, Walter P; Boire, Gilles; van Schaardenburg, Dirkjan; Wichuk, Stephanie; Turk, Samina; Boers, Maarten; Siminovitch, Katherine A; Bykerk, Vivian; Keystone, Ed; Tak, Paul Peter; van Kuijk, Arno W; Landewé, Robert; van der Heijde, Desiree; Murphy, Mairead; Marotta, Anthony

2015-09-01

To describe the expression and diagnostic use of 14-3-3η autoantibodies in early rheumatoid arthritis (RA). 14-3-3η autoantibody levels were measured using an electrochemiluminescent multiplexed assay in 500 subjects (114 disease-modifying antirheumatic drug-naive patients with early RA, 135 with established RA, 55 healthy, 70 autoimmune, and 126 other non-RA arthropathy controls). 14-3-3η protein levels were determined in an earlier analysis. Two-tailed Student t tests and Mann-Whitney U tests compared differences among groups. Receiver-operator characteristic (ROC) curves were generated and diagnostic performance was estimated by area under the curve (AUC), as well as specificity, sensitivity, and likelihood ratios (LR) for optimal cutoffs. Median serum 14-3-3η autoantibody concentrations were significantly higher (p < 0.0001) in patients with early RA (525 U/ml) when compared with healthy controls (235 U/ml), disease controls (274 U/ml), autoimmune disease controls (274 U/ml), patients with osteoarthritis (259 U/ml), and all controls (265 U/ml). ROC curve analysis comparing early RA with healthy controls demonstrated a significant (p < 0.0001) AUC of 0.90 (95% CI 0.85-0.95). At an optimal cutoff of ≥ 380 U/ml, the ROC curve yielded a sensitivity of 73%, a specificity of 91%, and a positive LR of 8.0. Adding 14-3-3η autoantibodies to 14-3-3η protein positivity enhanced the identification of patients with early RA from 59% to 90%; addition of 14-3-3η autoantibodies to anticitrullinated protein antibodies (ACPA) and/or rheumatoid factor (RF) increased identification from 72% to 92%. Seventy-two percent of RF- and ACPA-seronegative patients were positive for 14-3-3η autoantibodies. 14-3-3η autoantibodies, alone and in combination with the 14-3-3η protein, RF, and/or ACPA identified most patients with early RA.
High variation of individual soluble serum CD30 levels of pre-transplantation patients: sCD30 a feasible marker for prediction of kidney allograft rejection?

PubMed

Altermann, Wolfgang; Schlaf, Gerald; Rothhoff, Anita; Seliger, Barbara

2007-10-01

Previous studies have suggested that the pre-transplant levels of the soluble CD30 molecule (sCD30) represent a non-invasive tool which can be used as a biomarker for the prediction of kidney allograft rejections. In order to evaluate the feasibility of sCD30 for pre-transplantation monitoring the sera of potential kidney recipients (n = 652) were collected four times in a 3 months interval. Serum from healthy blood donors (n = 203) served as controls. The sCD30 concentrations of all samples were determined using a commercially available ELISA. This strategy allowed the detection of possible variations of individual sCD30 levels over time. Heterogeneous sCD30 concentrations were found in the samples obtained from individual putative kidney transplant recipients when quarterly measured over 1 year. Total 95% of serum samples obtained from healthy controls exhibited sCD30 values <30 U/ml, whereas most recipients displayed higher serum levels (>30 U/ml). Total 524 patients (80.4%) constantly exhibited serum concentrations of <100 U/ml during the period investigated, whereas 109 patients (16.7%) showed variations by exceeding the proposed 'cut off' of 100 U/ml for one to three times. The frequency of samples exhibiting sCD30 values >100 U/ml was significantly lower than that previously reported. The high degree of variation does not allow the stratification of patients into high and low immunological risk groups based on a single sCD30 value > 100 U/ml. Due to the heterogeneity of sCD30 levels during time course and the high values of SD, its implementation as a pre-transplant marker cannot be justified to generate special provisions for the organ allocation to patients with single sCD30 values > 100 U/ml.
Representing nursing guideline with unified modeling language to facilitate development of a computer system: a case study.

PubMed

Choi, Jeeyae; Choi, Jeungok E

2014-01-01

To provide best recommendations at the point of care, guidelines have been implemented in computer systems. As a prerequisite, guidelines are translated into a computer-interpretable guideline format. Since there are no specific tools to translate nursing guidelines, only a few nursing guidelines are translated and implemented in computer systems. Unified modeling language (UML) is a software writing language and is known to well and accurately represent end-users' perspective, due to the expressive characteristics of the UML. In order to facilitate the development of computer systems for nurses' use, the UML was used to translate a paper-based nursing guideline, and its ease of use and the usefulness were tested through a case study of a genetic counseling guideline. The UML was found to be a useful tool to nurse informaticians and a sufficient tool to model a guideline in a computer program.
An Evaluation of the UMLS in Representing Corpus Derived Clinical Concepts

PubMed Central

Friedlin, Jeff; Overhage, Marc

2011-01-01

We performed an evaluation of the Unified Medical Language System (UMLS) in representing concepts derived from medical narrative documents from three domains: chest x-ray reports, discharge summaries and admission notes. We detected concepts in these documents by identifying noun phrases (NPs) and N-grams, including unigrams (single words), bigrams (word pairs) and trigrams (word triples). After removing NPs and N-grams that did not represent discrete clinical concepts, we processed the remaining with the UMLS MetaMap program. We manually reviewed the results of MetaMap processing to determine whether MetaMap found full, partial or no representation of the concept. For full representations, we determined whether post-coordination was required. Our results showed that a large portion of concepts found in clinical narrative documents are either unrepresented or poorly represented in the current version of the UMLS Metathesaurus and that post-coordination was often required in order to fully represent a concept. PMID:22195097
Prediction of acute renal allograft rejection in early post-transplantation period by soluble CD30.

PubMed

Dong, Wang; Shunliang, Yang; Weizhen, Wu; Qinghua, Wang; Zhangxin, Zeng; Jianming, Tan; He, Wang

2006-06-01

To evaluate the feasibility of serum sCD30 for prediction of acute graft rejection, we analyzed clinical data of 231 patients, whose serum levels of sCD30 were detected by ELISA before and after transplantation. They were divided into three groups: acute rejection group (AR, n = 49), uncomplicated course group (UC, n = 171) and delayed graft function group (DGF, n = 11). Preoperative sCD30 levels of three groups were 183 +/- 74, 177 +/- 82 and 168 +/- 53 U/ml, respectively (P = 0.82). Significant decrease of sCD30 was detected in three groups on day 5 and 10 post-transplantation respectively (52 +/- 30 and 9 +/- 5 U/ml respectively, P < 0.001). Compared with Group UC and DGF, patients of Group AR had higher sCD30 values on day 5 post-transplantation (92 +/- 27 U/ml vs. 41 +/- 20 U/ml and 48 +/- 18 U/ml, P < 0.001). However, sCD30 levels on day 10 post-transplantation were virtually similar in patients of three groups (P = 0.43). Receiver operating characteristic (ROC) curve demonstrated that sCD30 level on day 5 post-transplantation could differentiate patients who subsequently suffered acute allograft rejection from others (area under ROC curve 0.95). According to ROC curve, 65 U/ml may be the optimal operational cut-off level to predict impending graft rejection (specificity 91.8%, sensitivity 87.1%). Measurement of soluble CD30 on day 5 post-transplantation might offer a noninvasive means to recognize patients at risk of impending acute graft rejection during early post-transplantation period.
A semantic graph-based approach to biomedical summarisation.

PubMed

Plaza, Laura; Díaz, Alberto; Gervás, Pablo

2011-09-01

Access to the vast body of research literature that is available in biomedicine and related fields may be improved by automatic summarisation. This paper presents a method for summarising biomedical scientific literature that takes into consideration the characteristics of the domain and the type of documents. To address the problem of identifying salient sentences in biomedical texts, concepts and relations derived from the Unified Medical Language System (UMLS) are arranged to construct a semantic graph that represents the document. A degree-based clustering algorithm is then used to identify different themes or topics within the text. Different heuristics for sentence selection, intended to generate different types of summaries, are tested. A real document case is drawn up to illustrate how the method works. A large-scale evaluation is performed using the recall-oriented understudy for gisting-evaluation (ROUGE) metrics. The results are compared with those achieved by three well-known summarisers (two research prototypes and a commercial application) and two baselines. Our method significantly outperforms all summarisers and baselines. The best of our heuristics achieves an improvement in performance of almost 7.7 percentage units in the ROUGE-1 score over the LexRank summariser (0.7862 versus 0.7302). A qualitative analysis of the summaries also shows that our method succeeds in identifying sentences that cover the main topic of the document and also considers other secondary or "satellite" information that might be relevant to the user. The method proposed is proved to be an efficient approach to biomedical literature summarisation, which confirms that the use of concepts rather than terms can be very useful in automatic summarisation, especially when dealing with highly specialised domains. Copyright © 2011 Elsevier B.V. All rights reserved.
A MATLAB toolbox for the efficient estimation of the psychometric function using the updated maximum-likelihood adaptive procedure

PubMed Central

Richards, V. M.; Dai, W.

2014-01-01

A MATLAB toolbox for the efficient estimation of the threshold, slope, and lapse rate of the psychometric function is described. The toolbox enables the efficient implementation of the updated maximum-likelihood (UML) procedure. The toolbox uses an object-oriented architecture for organizing the experimental variables and computational algorithms, which provides experimenters with flexibility in experimental design and data management. Descriptions of the UML procedure and the UML Toolbox are provided, followed by toolbox use examples. Finally, guidelines and recommendations of parameter configurations are given. PMID:24671826
CASE tools and UML: state of the ART.

PubMed

Agarwal, S

2001-05-01

With increasing need for automated tools to assist complex systems development, software design methods are becoming popular. This article analyzes the state of art in computer-aided software engineering (CASE) tools and unified modeling language (UML), focusing on their evolution, merits, and industry usage. It identifies managerial issues for the tools' adoption and recommends an action plan to select and implement them. While CASE and UML offer inherent advantages like cheaper, shorter, and efficient development cycles, they suffer from poor user satisfaction. The critical success factors for their implementation include, among others, management and staff commitment, proper corporate infrastructure, and user training.
Vehicle System Management Modeling in UML for Ares I

NASA Technical Reports Server (NTRS)

Pearson, Newton W.; Biehn, Bradley A.; Curry, Tristan D.; Martinez, Mario R.

2011-01-01

The Spacecraft & Vehicle Systems Department of Marshall Space Flight Center is responsible for modeling the Vehicle System Management for the Ares I vehicle which was a part of the now canceled Constellation Program. An approach to generating the requirements for the Vehicle System Management was to use the Unified Modeling Language technique to build and test a model that would fulfill the Vehicle System Management requirements. UML has been used on past projects (flight software) in the design phase of the effort but this was the first attempt to use the UML technique from a top down requirements perspective.
Uncompacted Myelin Lamellae and Nodal Ion Channel Disruption in POEMS Syndrome.

PubMed

Hashimoto, Rina; Koike, Haruki; Takahashi, Mie; Ohyama, Ken; Kawagashira, Yuichi; Iijima, Masahiro; Sobue, Gen

2015-12-01

To elucidate the significance of uncompacted myelin lamellae (UML) and ion channel disruption at the nodes of Ranvier in the polyneuropathy, organomegaly, endocrinopathy, monoclonal gammopathy, and skin changes (POEMS) syndrome, we evaluated sural nerve biopsy specimens from 33 patients with POEMS syndrome and from 7 control patients. Uncompacted myelin lamellae distribution was assessed by electron microscopy and immunofluorescence microscopy. In the POEMS patient biopsies, UML were seen more frequently in small versus large myelinated fibers. Paranodes and Schmidt-Lanterman incisures, where normal physiologic UM is located, were frequently associated with UM. Widening of the nodes of Ranvier (i.e. segmental demyelination) was not associated with UML. There was axonal hollowing with neurofilament condensation at Schmidt-Lanterman incisures with abnormal UML, suggesting axonal damage at those sites in the POEMS patient biopsies. Myelin sheath irregularity was conspicuous in large myelinated fibers and was associated with abnormally widened bizarrely shaped Schmidt-Lanterman incisures. Indirect immunofluorescent studies revealed abnormalities of sodium (pan sodium) and potassium (KCNQ2) channels, even at nonwidened nodes of Ranvier. Thus, UML was not apparently associated with segmental demyelination but seemed to be associated with axonal damage. These observations suggest that nodal ion channel disruption may be associated with functional deficits in POEMS syndrome patient nerves.
Evaluation of the heat balance constituents of the upper mixed layer in the North Atlantic

NASA Astrophysics Data System (ADS)

Polonsky, A. B.; Sukhonos, P. A.

2016-11-01

Different physical mechanisms which cause interannual and interdecadal temperature anomalies in the upper mixed layer (UML) of the North Atlantic are investigated using the data of ORA-S3 reanalysis for the period of 1959-2011. It is shown that the annual mean heat budget in UML is mainly caused by the balance between advective heat transfer and horizontal turbulent mixing (estimated as a residual term in the equation of thermal balance). The local UML temperature change and contribution from the heat fluxes on the lower boundary of the UML to the heat budget of the upper layer are insignificant for the time scale under consideration. The contribution of the heat fluxes on the upper UML boundary to the low-frequency variability of the upper layer temperature in the whole North Atlantic area is substantially less than 30%. Areas like the northwestern part of the Northern Subtropical Anticyclonic Gyre (NSAG), where their contribution exceeds 30-60%, are exceptions. The typical time scales of advective heat transfer variability are revealed. In the NSAG area, an interannual variability associated with the North Atlantic Oscillation dominates, while in the North Atlantic subpolar gyre, an interdecadal variability of advective transfers with periods of more than 30 years prevails.
Translation from UML to Markov Model: A Performance Modeling Framework

NASA Astrophysics Data System (ADS)

Khan, Razib Hayat; Heegaard, Poul E.

Performance engineering focuses on the quantitative investigation of the behavior of a system during the early phase of the system development life cycle. Bearing this on mind, we delineate a performance modeling framework of the application for communication system that proposes a translation process from high level UML notation to Continuous Time Markov Chain model (CTMC) and solves the model for relevant performance metrics. The framework utilizes UML collaborations, activity diagrams and deployment diagrams to be used for generating performance model for a communication system. The system dynamics will be captured by UML collaboration and activity diagram as reusable specification building blocks, while deployment diagram highlights the components of the system. The collaboration and activity show how reusable building blocks in the form of collaboration can compose together the service components through input and output pin by highlighting the behavior of the components and later a mapping between collaboration and system component identified by deployment diagram will be delineated. Moreover the UML models are annotated to associate performance related quality of service (QoS) information which is necessary for solving the performance model for relevant performance metrics through our proposed framework. The applicability of our proposed performance modeling framework in performance evaluation is delineated in the context of modeling a communication system.

From the Bench to the Bedside: The Role of Semantic Web and Translational Medicine for Enabling the Next Generation Healthcare Enterprise

NASA Astrophysics Data System (ADS)

Kashyap, Vipul

The success of new innovations and technologies are very often disruptive in nature. At the same time, they enable novel next generation infrastructures and solutions. These solutions introduce great efficiencies in the form of efficient processes and the ability to create, organize, share and manage knowledge effectively; and the same time provide crucial enablers for proposing and realizing new visions. In this paper, we propose a new vision of the next generation healthcare enterprise and discuss how Translational Medicine, which aims to improve communication between the basic and clinical sciences, is a key requirement for achieving this vision. This will lead therapeutic insights may be derived from new scientific ideas - and vice versa. Translation research goes from bench to bedside, where theories emerging from preclinical experimentation are tested on disease-affected human subjects, and from bedside to bench, where information obtained from preliminary human experimentation can be used to refine our understanding of the biological principles underpinning the heterogeneity of human disease and polymorphism(s). Informatics and semantic technologies in particular, has a big role to play in making this a reality. We identify critical requirements, viz., data integration, clinical decision support and knowledge maintenance and provenance; and illustrate semantics-based solutions wrt example scenarios and use cases.
Vertical distribution and composition of phytoplankton under the influence of an upper mixed layer.

PubMed

Ryabov, Alexei B; Rudolf, Lars; Blasius, Bernd

2010-03-07

The vertical distribution of phytoplankton is of fundamental importance for the dynamics and structure of aquatic communities. Here, using an advection-reaction-diffusion model, we investigate the distribution and competition of phytoplankton species in a water column, in which inverse resource gradients of light and a nutrient can limit growth of the biomass. This problem poses a challenge for ecologists, as the location of a production layer is not fixed, but rather depends on many internal parameters and environmental factors. In particular, we study the influence of an upper mixed layer (UML) in this system and show that it leads to a variety of dynamic effects: (i) Our model predicts alternative density profiles with a maximum of biomass either within or below the UML, thereby the system may be bistable or the relaxation from an unstable state may require a long-lasting transition. (ii) Reduced mixing in the deep layer can induce oscillations of the biomass; we show that a UML can sustain these oscillations even if the diffusivity is less than the critical mixing for a sinking phytoplankton population. (iii) A UML can strongly modify the outcome of competition between different phytoplankton species, yielding bistability both in the spatial distribution and in the species composition. (iv) A light limited species can obtain a competitive advantage if the diffusivity in the deep layers is reduced below a critical value. This yields a subtle competitive exclusion effect, where the oscillatory states in the deep layers are displaced by steady solutions in the UML. Finally, we present a novel graphical approach for deducing the competition outcome and for the analysis of the role of a UML in aquatic systems. 2009 Elsevier Ltd. All rights reserved.
Improved identification of noun phrases in clinical radiology reports using a high-performance statistical natural language parser augmented with the UMLS specialist lexicon.

PubMed

Huang, Yang; Lowe, Henry J; Klein, Dan; Cucina, Russell J

2005-01-01

The aim of this study was to develop and evaluate a method of extracting noun phrases with full phrase structures from a set of clinical radiology reports using natural language processing (NLP) and to investigate the effects of using the UMLS(R) Specialist Lexicon to improve noun phrase identification within clinical radiology documents. The noun phrase identification (NPI) module is composed of a sentence boundary detector, a statistical natural language parser trained on a nonmedical domain, and a noun phrase (NP) tagger. The NPI module processed a set of 100 XML-represented clinical radiology reports in Health Level 7 (HL7)(R) Clinical Document Architecture (CDA)-compatible format. Computed output was compared with manual markups made by four physicians and one author for maximal (longest) NP and those made by one author for base (simple) NP, respectively. An extended lexicon of biomedical terms was created from the UMLS Specialist Lexicon and used to improve NPI performance. The test set was 50 randomly selected reports. The sentence boundary detector achieved 99.0% precision and 98.6% recall. The overall maximal NPI precision and recall were 78.9% and 81.5% before using the UMLS Specialist Lexicon and 82.1% and 84.6% after. The overall base NPI precision and recall were 88.2% and 86.8% before using the UMLS Specialist Lexicon and 93.1% and 92.6% after, reducing false-positives by 31.1% and false-negatives by 34.3%. The sentence boundary detector performs excellently. After the adaptation using the UMLS Specialist Lexicon, the statistical parser's NPI performance on radiology reports increased to levels comparable to the parser's native performance in its newswire training domain and to that reported by other researchers in the general nonmedical domain.
[Changes in levels of chorionic gonadotrophin (hCG) and its subunit alpha and beta in pregnancy complicated by diabetes (GDM)].

PubMed

Olszewski, J; Szczurowicz, A; Wójcikowski, C

1995-02-01

The aim of the study was estimation of endocrinological function of placenta in pregnancy complicated by GDM. The study were performed on a group 13 women with GDM and 14 women in normal pregnancy. All women with GDM were treat by diet and intensive insulinotherapy with self monitoring levels of glucose. In women with GDM level of fructosamine and HbAlc were significant higher but in normal range. In 28 and 36 week of pregnancy were determined levels of hCG, alpha hCG, beta hCG, in serum. Level of hCG in control group and in women with GDM were respectively 97.29 U/ml vs. 29.29 U/ml, p < 0.01 in 28 week of pregnancy and 77.23 U/ml vs. 37.93 U/ml, p < 0.05 in 36 week. Level of alpha hCG was lower and beta hCG was higher in group with GDM.
Transforming Functional Requirements from UML into BPEL to Efficiently Develop SOA-Based Systems

NASA Astrophysics Data System (ADS)

Vemulapalli, Anisha; Subramanian, Nary

The intended behavior of any system such as services, tasks or functions can be captured by functional requirements of the system. As our dependence on online services has grown steadily, the web applications are being developed employing the SOA. BPEL4WS provides a means for expressing functional requirements of an SOA-based system by providing constructs to capture business goals and objectives for the system. In this paper we propose an approach for transforming user-centered requirements captured using UML into a corresponding BPEL specification, where the business processes are captured by means of use-cases from which UML sequence diagrams and activity diagrams are extracted. Subsequently these UML models are mapped to BPEL specifications that capture the essence of the initial business requirements to develop the SOA-based system by employing CASE tools. A student housing system is used as a case study to illustrate this approach and the system is validated using NetBeans.
An Information System Development Method Connecting Business Process Modeling and its Experimental Evaluation

NASA Astrophysics Data System (ADS)

Okawa, Tsutomu; Kaminishi, Tsukasa; Kojima, Yoshiyuki; Hirabayashi, Syuichi; Koizumi, Hisao

Business process modeling (BPM) is gaining attention as a measure of analysis and improvement of the business process. BPM analyses the current business process as an AS-IS model and solves problems to improve the current business and moreover it aims to create a business process, which produces values, as a TO-BE model. However, researches of techniques that connect the business process improvement acquired by BPM to the implementation of the information system seamlessly are rarely reported. If the business model obtained by BPM is converted into UML, and the implementation can be carried out by the technique of UML, we can expect the improvement in efficiency of information system implementation. In this paper, we describe a method of the system development, which converts the process model obtained by BPM into UML and the method is evaluated by modeling a prototype of a parts procurement system. In the evaluation, comparison with the case where the system is implemented by the conventional UML technique without going via BPM is performed.
Natural language acquisition in large scale neural semantic networks

NASA Astrophysics Data System (ADS)

Ealey, Douglas

This thesis puts forward the view that a purely signal- based approach to natural language processing is both plausible and desirable. By questioning the veracity of symbolic representations of meaning, it argues for a unified, non-symbolic model of knowledge representation that is both biologically plausible and, potentially, highly efficient. Processes to generate a grounded, neural form of this model-dubbed the semantic filter-are discussed. The combined effects of local neural organisation, coincident with perceptual maturation, are used to hypothesise its nature. This theoretical model is then validated in light of a number of fundamental neurological constraints and milestones. The mechanisms of semantic and episodic development that the model predicts are then used to explain linguistic properties, such as propositions and verbs, syntax and scripting. To mimic the growth of locally densely connected structures upon an unbounded neural substrate, a system is developed that can grow arbitrarily large, data- dependant structures composed of individual self- organising neural networks. The maturational nature of the data used results in a structure in which the perception of concepts is refined by the networks, but demarcated by subsequent structure. As a consequence, the overall structure shows significant memory and computational benefits, as predicted by the cognitive and neural models. Furthermore, the localised nature of the neural architecture also avoids the increasing error sensitivity and redundancy of traditional systems as the training domain grows. The semantic and episodic filters have been demonstrated to perform as well, or better, than more specialist networks, whilst using significantly larger vocabularies, more complex sentence forms and more natural corpora.
Administration of human recombinant activated protein C is not associated with pancreatic parenchymal haemorrhage in L-arginine-induced experimental acute pancreatitis.

PubMed

Jamdar, Saurabh; Babu, Benoy I; Nirmalan, Mahesh; Jeziorska, Maria; McMahon, Raymond F T; Siriwardena, Ajith K

2013-11-10

Microvascular thrombosis is a critical event in severe acute pancreatitis. Human recombinant activated protein C (Xigris®, Eli Lilly, Indianapolis, IN, USA) modulates the interplay between pro-inflammatory and pro-coagulant pathways and maintains microvascular patency. However, the anticoagulant properties of Xigris® may precipitate bleeding from the inflamed pancreas. This study tests the hypothesis that Xigris® can ameliorate experimental acute pancreatitis without causing pancreatic haemorrhage. Sprague Dawley rats were allocated as follows: Group 1: control (n=7); Group 2: acute pancreatitis (n=6); Group 3: administration of Xigris® 500 µg/kg body weight before induction of acute pancreatitis (n=6); and Group 4: Administration of Xigris® 500 µg/kg body weight 30 minutes after induction of acute pancreatitis (n=6). Acute pancreatitis was induced by intraperitoneal administration of L-arginine 300 mg/100 g body weight. Animals were sacrificed at 48 hours and biochemical, haematological, and histological markers of pancreatic haemorrhage and inflammation assessed. Median lipase in animals with acute pancreatitis was 10 U/mL (range: 7-16 U/mL) compared to 5.5 (range: 3-8 U/mL) in controls (P=0.028). Lipase was also elevated in animals given Xigris® both before (12 U/mL, range: 8-22 U/mL; P=0.031 vs. control group) and after (46 U/mL, range: 9-71 U/mL; P=0.015 vs. control group) induction of acute pancreatitis). Haemoglobin levels were similar among all groups (P=0.323). There was no histological evidence of pancreatic haemorrhage in animals treated with Xigris®. Pre-treatment with Xigris® was associated with a significant reduction in pancreatic injury. This effect was absent when Xigris® was administered after induction of acute pancreatitis. Xigris® did not lead to pancreatic haemorrhage in experimental acute pancreatitis. Administration of Xigris® prior to induction of acute pancreatitis was associated with amelioration of injury. This effect was not seen with administration of Xigris® after induction of acute pancreatitis.
Plasma levels of soluble CD30 in kidney graft recipients as predictors of acute allograft rejection.

PubMed

Ayed, K; Abdallah, T B; Bardi, R; Abderrahim, E; Kheder, A

2006-09-01

In renal transplant recipients elevated soluble serum CD30 levels are associated with increased rejection and graft loss. We sought to determine the sCD30 plasma levels before and after kidney transplantation and to assess whether sCD30 was a predictive factor of immunological risk. sCD30 plasma levels were determined by an enzyme-linked immunosorbent assay assay in 52 kidney graft recipients before as well as 7, 15, and 21 days after transplantation. Eighteen patients developed acute allograft rejection (group I) and 34 patients showed uneventful courses (group II). Before transplantation sCD30 plasma levels were elevated in both groups (mean: 162.6 +/- 89.5 U/mL). After transplantation, group I recipients with acute rejection showed higher relative levels of plasma sCD30 on days 7 and 15 (120.8 +/- 74.6 U/mL and 210.6 +/- 108.7 U/mL respectively) compared with group II patients without rejection (95 +/- 45 U/mL and 59.4 +/- 31.6 U/mL), a difference that was significant for group I (P = .0003) and not significant for group II (P = .09). On day 21, sCD30 decreased in the two groups but remained higher among group I patients (120.6 +/- 92.7 U/mL). HLA antibodies were positive in 18 patients (34.6%) with 9 (50%) experiencing at last one episode of acute rejection. Among 34 patients negative for anti-HLA antibodies, nine displayed acute rejection only (26.4%), a difference that was not significant (P > .05). If we consider 100 U/mL as the minimum predictive level for allograft rejection, our results suggested that levels of sCD30 should be taken into consideration with the presence of HLA-antibodies detectable before and after transplantation, especially in patients with more than three HLA mismatches [RR = 3.20 (0.94 < RR < 10.91)]. These data suggested that measurement of plasma sCD30 is a useful procedure for the recognition of rejection in its earliest stages.
A rule-based approach to model checking of UML state machines

NASA Astrophysics Data System (ADS)

Grobelna, Iwona; Grobelny, Michał; Stefanowicz, Łukasz

2016-12-01

In the paper a new approach to formal verification of control process specification expressed by means of UML state machines in version 2.x is proposed. In contrast to other approaches from the literature, we use the abstract and universal rule-based logical model suitable both for model checking (using the nuXmv model checker), but also for logical synthesis in form of rapid prototyping. Hence, a prototype implementation in hardware description language VHDL can be obtained that fully reflects the primary, already formally verified specification in form of UML state machines. Presented approach allows to increase the assurance that implemented system meets the user-defined requirements.
What language is the language-ready brain ready for?. Comment on "Towards a Computational Comparative Neuroprimatology: Framing the language-ready brain" by Michael A. Arbib

NASA Astrophysics Data System (ADS)

Croft, William

2016-03-01

Arbib's computational comparative neuroprimatology [1] is a welcome model for cognitive linguists, that is, linguists who ground their models of language in human cognition and language use in social interaction. Arbib argues that language emerged via biological and cultural coevolution [1]; linguistic knowledge is represented by constructions, and semantic representations of linguistic constructions are grounded in embodied perceptual-motor schemas (the mirror system hypothesis). My comments offer some refinements from a linguistic point of view.
Informative Top-k Retrieval for Advanced Skill Management

NASA Astrophysics Data System (ADS)

Colucci, Simona; di Noia, Tommaso; Ragone, Azzurra; Ruta, Michele; Straccia, Umberto; Tinelli, Eufemia

The paper presents a knowledge-based framework for skills and talent management based on an advanced matchmaking between profiles of candidates and available job positions. Interestingly, informative content of top-k retrieval is enriched through semantic capabilities. The proposed approach allows to: (1) express a requested profile in terms of both hard constraints and soft ones; (2) provide a ranking function based also on qualitative attributes of a profile; (3) explain the resulting outcomes (given a job request, a motivation for the obtained score of each selected profile is provided). Top-k retrieval allows to select most promising candidates according to an ontology formalizing the domain knowledge. Such a knowledge is further exploited to provide a semantic-based explanation of missing or conflicting features in retrieved profiles. They also indicate additional profile characteristics emerging by the retrieval procedure for a further request refinement. A concrete case study followed by an exhaustive experimental campaign is reported to prove the approach effectiveness.
Effect of different fermentation strategies on β-mannanase production in fed-batch bioreactor system.

PubMed

Germec, Mustafa; Yatmaz, Ercan; Karahalil, Ercan; Turhan, İrfan

2017-05-01

Mannanases, one of the important enzyme group for industry, are produced by numerous filamentous fungi, especially Aspergillus species with different fermentation methods. The aim of this study was to show the best fermentation method of β-mannanase production for fungal growth in fermenter. Therefore, different fermentation strategies in fed-batch fermentation (suspended, immobilized cell, biofilm and microparticle-enhanced bioreactor) were applied for β-mannanase production from glucose medium (GM) and carob extract medium (CEM) by using recombinant Aspergillus sojae. The highest β-mannanase activities were obtained from microparticle-enhanced bioreactor strategy. It was found to be 347.47 U/mL by adding 10 g/L of Al 2 O 3 to GM and 439.13 U/mL by adding 1 g/L of talcum into CEM. The maximum β-mannanase activities for suspended, immobilization, and biofilm reactor remained at 72.55 U/mL in GM, 148.81 U/mL in CEM, and 194.09 U/mL in GM, respectively. The reason for that is the excessive, and irregular shaped growth and bulk formation, inadequate oxygen transfer or substrate diffusion in bioreactor. Consequently, the enzyme activity was significantly enhanced by addition of microparticles compared to other fed-batch fermentation strategies. Also, repeatable β-mannanase activities were obtained by controlling of the cell morphology by adding microparticle inside the fermenter.
Using Unified Modelling Language (UML) as a process-modelling technique for clinical-research process improvement.

PubMed

Kumarapeli, P; De Lusignan, S; Ellis, T; Jones, B

2007-03-01

The Primary Care Data Quality programme (PCDQ) is a quality-improvement programme which processes routinely collected general practice computer data. Patient data collected from a wide range of different brands of clinical computer systems are aggregated, processed, and fed back to practices in an educational context to improve the quality of care. Process modelling is a well-established approach used to gain understanding and systematic appraisal, and identify areas of improvement of a business process. Unified modelling language (UML) is a general purpose modelling technique used for this purpose. We used UML to appraise the PCDQ process to see if the efficiency and predictability of the process could be improved. Activity analysis and thinking-aloud sessions were used to collect data to generate UML diagrams. The UML model highlighted the sequential nature of the current process as a barrier for efficiency gains. It also identified the uneven distribution of process controls, lack of symmetric communication channels, critical dependencies among processing stages, and failure to implement all the lessons learned in the piloting phase. It also suggested that improved structured reporting at each stage - especially from the pilot phase, parallel processing of data and correctly positioned process controls - should improve the efficiency and predictability of research projects. Process modelling provided a rational basis for the critical appraisal of a clinical data processing system; its potential maybe underutilized within health care.
PREDOSE: a semantic web platform for drug abuse epidemiology using social media.

PubMed

Cameron, Delroy; Smith, Gary A; Daniulaityte, Raminta; Sheth, Amit P; Dave, Drashti; Chen, Lu; Anand, Gaurish; Carlson, Robert; Watkins, Kera Z; Falck, Russel

2013-12-01

The role of social media in biomedical knowledge mining, including clinical, medical and healthcare informatics, prescription drug abuse epidemiology and drug pharmacology, has become increasingly significant in recent years. Social media offers opportunities for people to share opinions and experiences freely in online communities, which may contribute information beyond the knowledge of domain professionals. This paper describes the development of a novel semantic web platform called PREDOSE (PREscription Drug abuse Online Surveillance and Epidemiology), which is designed to facilitate the epidemiologic study of prescription (and related) drug abuse practices using social media. PREDOSE uses web forum posts and domain knowledge, modeled in a manually created Drug Abuse Ontology (DAO--pronounced dow), to facilitate the extraction of semantic information from User Generated Content (UGC), through combination of lexical, pattern-based and semantics-based techniques. In a previous study, PREDOSE was used to obtain the datasets from which new knowledge in drug abuse research was derived. Here, we report on various platform enhancements, including an updated DAO, new components for relationship and triple extraction, and tools for content analysis, trend detection and emerging patterns exploration, which enhance the capabilities of the PREDOSE platform. Given these enhancements, PREDOSE is now more equipped to impact drug abuse research by alleviating traditional labor-intensive content analysis tasks. Using custom web crawlers that scrape UGC from publicly available web forums, PREDOSE first automates the collection of web-based social media content for subsequent semantic annotation. The annotation scheme is modeled in the DAO, and includes domain specific knowledge such as prescription (and related) drugs, methods of preparation, side effects, and routes of administration. The DAO is also used to help recognize three types of data, namely: (1) entities, (2) relationships and (3) triples. PREDOSE then uses a combination of lexical and semantic-based techniques to extract entities and relationships from the scraped content, and a top-down approach for triple extraction that uses patterns expressed in the DAO. In addition, PREDOSE uses publicly available lexicons to identify initial sentiment expressions in text, and then a probabilistic optimization algorithm (from related research) to extract the final sentiment expressions. Together, these techniques enable the capture of fine-grained semantic information, which facilitate search, trend analysis and overall content analysis using social media on prescription drug abuse. Moreover, extracted data are also made available to domain experts for the creation of training and test sets for use in evaluation and refinements in information extraction techniques. A recent evaluation of the information extraction techniques applied in the PREDOSE platform indicates 85% precision and 72% recall in entity identification, on a manually created gold standard dataset. In another study, PREDOSE achieved 36% precision in relationship identification and 33% precision in triple extraction, through manual evaluation by domain experts. Given the complexity of the relationship and triple extraction tasks and the abstruse nature of social media texts, we interpret these as favorable initial results. Extracted semantic information is currently in use in an online discovery support system, by prescription drug abuse researchers at the Center for Interventions, Treatment and Addictions Research (CITAR) at Wright State University. A comprehensive platform for entity, relationship, triple and sentiment extraction from such abstruse texts has never been developed for drug abuse research. PREDOSE has already demonstrated the importance of mining social media by providing data from which new findings in drug abuse research were uncovered. Given the recent platform enhancements, including the refined DAO, components for relationship and triple extraction, and tools for content, trend and emerging pattern analysis, it is expected that PREDOSE will play a significant role in advancing drug abuse epidemiology in future. Copyright © 2013 Elsevier Inc. All rights reserved.
Supporting Development of Satellite's Guidance Navigation and Control Software: A Product Line Approach

NASA Technical Reports Server (NTRS)

McComas, David; Stark, Michael; Leake, Stephen; White, Michael; Morisio, Maurizio; Travassos, Guilherme H.; Powers, Edward I. (Technical Monitor)

2000-01-01

The NASA Goddard Space Flight Center Flight Software Branch (FSB) is developing a Guidance, Navigation, and Control (GNC) Flight Software (FSW) product line. The demand for increasingly more complex flight software in less time while maintaining the same level of quality has motivated us to look for better FSW development strategies. The GNC FSW product line has been planned to address the core GNC FSW functionality very similar on many recent low/near Earth missions in the last ten years. Unfortunately these missions have not accomplished significant drops in development cost since a systematic approach towards reuse has not been adopted. In addition, new demands are continually being placed upon the FSW which means the FSB must become more adept at providing GNC FSW functionality's core so it can accommodate additional requirements. These domain features together with engineering concepts are influencing the specification, description and evaluation of FSW product line. Domain engineering is the foundation for emerging product line software development approaches. A product line is 'A family of products designed to take advantage of their common aspects and predicted variabilities'. In our product line approach, domain engineering includes the engineering activities needed to produce reusable artifacts for a domain. Application engineering refers to developing an application in the domain starting from reusable artifacts. The focus of this paper is regarding the software process, lessons learned and on how the GNC FSW product line manages variability. Existing domain engineering approaches do not enforce any specific notation for domain analysis or commonality and variability analysis. Usually, natural language text is the preferred tool. The advantage is the flexibility and adapt ability of natural language. However, one has to be ready to accept also its well-known drawbacks, such as ambiguity, inconsistency, and contradictions. While most domain analysis approaches are functionally oriented, the idea of applying the object-oriented approach in domain analysis is not new. Some authors propose to use UML as the notation underlying domain analysis. Our work is based on the same idea of merging UML and domain analysis. Further, we propose a few extensions to UML in order to express variability, and we define precisely their semantics so that a tool can support them. The extensions are designed to be implemented on the API of a popular industrial CASE tool, with obvious advantages in cost and availability of tool support. The paper outlines the product line processes and identifies where variability must be addressed. Then it describes the product line products with respect to how they accommodate variability. The Celestial Body subdomain is used as a working example. Our results to date are summarized and plans for the future are described.
Consistency of Rasch Model Parameter Estimation: A Simulation Study.

ERIC Educational Resources Information Center

van den Wollenberg, Arnold L.; And Others

1988-01-01

The unconditional--simultaneous--maximum likelihood (UML) estimation procedure for the one-parameter logistic model produces biased estimators. The UML method is inconsistent and is not a good alternative to conditional maximum likelihood method, at least with small numbers of items. The minimum Chi-square estimation procedure produces unbiased…
Cross-Language Information Retrieval: An Analysis of Errors.

ERIC Educational Resources Information Center

Ruiz, Miguel E.; Srinivasan, Padmini

1998-01-01

Investigates an automatic method for Cross Language Information Retrieval (CLIR) that utilizes the multilingual Unified Medical Language System (UMLS) Metathesaurus to translate Spanish natural-language queries into English. Results indicate that for Spanish, the UMLS Metathesaurus-based CLIR method is at least equivalent to if not better than…
Visualization of Learning Scenarios with UML4LD

ERIC Educational Resources Information Center

Laforcade, Pierre

2007-01-01

Present Educational Modelling Languages are used to formally specify abstract learning scenarios in a machine-interpretable format. Current tooling does not provide teachers/designers with some graphical facilities to help them in reusing existent scenarios. They need human-readable representations. This paper discusses the UML4LD experimental…
Enhancement of the Acquisition Process for a Combat System-A Case Study to Model the Workflow Processes for an Air Defense System Acquisition

DTIC Science & Technology

2009-12-01

Business Process Modeling BPMN Business Process Modeling Notation SoA Service-oriented Architecture UML Unified Modeling Language CSP...system developers. Supporting technologies include Business Process Modeling Notation ( BPMN ), Unified Modeling Language (UML), model-driven architecture

Net-centric ACT-R-Based Cognitive Architecture with DEVS Unified Process

DTIC Science & Technology

2011-04-01

effort has been spent in analyzing various forms of requirement specifications, viz, state-based, Natural Language based, UML-based, Rule- based, BPMN ...requirement specifications in one of the chosen formats such as BPMN , DoDAF, Natural Language Processing (NLP) based, UML- based, DSL or simply
Designing Control System Application Software for Change

NASA Technical Reports Server (NTRS)

Boulanger, Richard

2001-01-01

The Unified Modeling Language (UML) was used to design the Environmental Systems Test Stand (ESTS) control system software. The UML was chosen for its ability to facilitate a clear dialog between software designer and customer, from which requirements are discovered and documented in a manner which transposes directly to program objects. Applying the UML to control system software design has resulted in a baseline set of documents from which change and effort of that change can be accurately measured. As the Environmental Systems Test Stand evolves, accurate estimates of the time and effort required to change the control system software will be made. Accurate quantification of the cost of software change can be before implementation, improving schedule and budget accuracy.
Arranging ISO 13606 archetypes into a knowledge base using UML connectors.

PubMed

Kopanitsa, Georgy

2014-01-01

To enable the efficient reuse of standard based medical data we propose to develop a higher-level information model that will complement the archetype model of ISO 13606. This model will make use of the relationships that are specified in UML to connect medical archetypes into a knowledge base within a repository. UML connectors were analysed for their ability to be applied in the implementation of a higher-level model that will establish relationships between archetypes. An information model was developed using XML Schema notation. The model allows linking different archetypes of one repository into a knowledge base. Presently it supports several relationships and will be advanced in future.
Facilitating Semantic Interoperability Among Ocean Data Systems: ODIP-R2R Student Outcomes

NASA Astrophysics Data System (ADS)

Stocks, K. I.; Chen, Y.; Shepherd, A.; Chandler, C. L.; Dockery, N.; Elya, J. L.; Smith, S. R.; Ferreira, R.; Fu, L.; Arko, R. A.

2014-12-01

With informatics providing an increasingly important set of tools for geoscientists, it is critical to train the next generation of scientists in information and data techniques. The NSF-supported Rolling Deck to Repository (R2R) Program works with the academic fleet community to routinely document, assess, and preserve the underway sensor data from U.S. research vessels. The Ocean Data Interoperability Platform (ODIP) is an EU-US-Australian collaboration fostering interoperability among regional e-infrastructures through workshops and joint prototype development. The need to align terminology between systems is a common challenge across all of the ODIP prototypes. Five R2R students were supported to address aspects of semantic interoperability within ODIP. Developing a vocabulary matching service that links terms from different vocabularies with similar concept. The service implements Google Refine reconciliation service interface such that users can leverage Google Refine application as a friendly user interface while linking different vocabulary terms. Developing Resource Description Framework (RDF) resources that map Shipboard Automated Meteorological Oceanographic System (SAMOS) vocabularies to internationally served vocabularies. Each SAMOS vocabulary term (data parameter and quality control flag) will be described as an RDF resource page. These RDF resources allow for enhanced discoverability and retrieval of SAMOS data by enabling data searches based on parameter. Improving data retrieval and interoperability by exposing data and mapped vocabularies using Semantic Web technologies. We have collaborated with ODIP participating organizations in order to build a generalized data model that will be used to populate a SPARQL endpoint in order to provide expressive querying over our data files. Mapping local and regional vocabularies used by R2R to those used by ODIP partners. This work is described more fully in a companion poster. Making published Linked Data Web developer-friendly with a RESTful service. This goal was achieved by defining a proxy layer on top of the existing SPARQL endpoint that 1) translates HTTP requests into SPARQL queries, and 2) renders the returned results as required by the request sender using content negotiation, suffixes and parameters.
A Semantic Approach for Knowledge Discovery to Help Mitigate Habitat Loss in the Gulf of Mexico

NASA Astrophysics Data System (ADS)

Ramachandran, R.; Maskey, M.; Graves, S.; Hardin, D.

2008-12-01

Noesis is a meta-search engine and a resource aggregator that uses domain ontologies to provide scoped search capabilities. Ontologies enable Noesis to help users refine their searches for information on the open web and in hidden web locations such as data catalogues with standardized, but discipline specific vocabularies. Through its ontologies Noesis provides a guided refinement of search queries which produces complete and accurate searches while reducing the user's burden to experiment with different search strings. All search results are organized by categories (e. g. all results from Google are grouped together) which may be selected or omitted according to the desire of the user. During the past two years ontologies were developed for sea grasses in the Gulf of Mexico and were used to support a habitat restoration demonstration project. Currently these ontologies are being augmented to address the special characteristics of mangroves. These new ontologies will extend the demonstration project to broader regions of the Gulf including protected mangrove locations in coastal Mexico. Noesis contributes to the decision making process by producing a comprehensive list of relevant resources based on the semantic information contained in the ontologies. Ontologies are organized in a tree like taxonomies, where the child nodes represent the Specializations and the parent nodes represent the Generalizations of a node or concept. Specializations can be used to provide more detailed search, while generalizations are used to make the search broader. Ontologies are also used to link two syntactically different terms to one semantic concept (synonyms). Appending a synonym to the query expands the search, thus providing better search coverage. Every concept has a set of properties that are neither in the same inheritance hierarchy (Specializations / Generalizations) nor equivalent (synonyms). These are called Related Concepts and they are captured in the ontology through property relationships. By using Related Concepts users can search for resources with respect to a particular property. Noesis automatically generates searches that include all of these capabilities, removing the burden from the user and producing broader and more accurate search results. This presentation will demonstrate the features of Noesis and describe its application to habitat studies in the Gulf of Mexico.
The Equivalence of Two Methods of Parameter Estimation for the Rasch Model.

ERIC Educational Resources Information Center

Blackwood, Larry G.; Bradley, Edwin L.

1989-01-01

Two methods of estimating parameters in the Rasch model are compared. The equivalence of likelihood estimations from the model of G. J. Mellenbergh and P. Vijn (1981) and from usual unconditional maximum likelihood (UML) estimation is demonstrated. Mellenbergh and Vijn's model is a convenient method of calculating UML estimates. (SLD)
The Impact of an Automated Learning Component against a Traditional Lecturing Environment

ERIC Educational Resources Information Center

Maycock, Keith W.; Keating, J. G.

2017-01-01

This experimental study investigates the effect on the examination performance of a cohort of first-year undergraduate learners undertaking a Unified Modelling Language (UML) course using an adaptive learning system against a control group of learners undertaking the same UML course through a traditional lecturing environment. The adaptive…
Modified UMS, Modified SemRep and SemMedDB-UTH | Informatics Technology for Cancer Research (ITCR)

Cancer.gov

Modified UMLS, modified SemRep and SemMedDB-UTH – these are resources (UMLS, SemMedDB-UT) and tools (SemRep) created and maintained by National Library of Medicine that we have modified for personalized cancer therapy and returned to the NLM.
A Software Hub for High Assurance Model-Driven Development and Analysis

DTIC Science & Technology

2007-01-23

verification of UML models in TLPVS. In Thomas Baar, Alfred Strohmeier, Ana Moreira, and Stephen J. Mellor, editors, UML 2004 - The Unified Modeling...volume 3785 of Lecture Notes in Computer Science, pages 52–65, Manchester, UK, Nov 2005. Springer. [GH04] Günter Graw and Peter Herrmann. Transformation
Modeling Value Chain Analysis of Distance Education using UML

NASA Astrophysics Data System (ADS)

Acharya, Anal; Mukherjee, Soumen

2010-10-01

Distance education continues to grow as a methodology for the delivery of course content in higher education in India as well as abroad. To manage this growing demand and to provide certain flexibility, there must be certain strategic planning about the use of ICT tools. Value chain analysis is a framework for breaking down the sequence of business functions into a set of activities through which utility could be added to service. Thus it can help to determine the competitive advantage that is enjoyed by an institute. To implement these business functions certain visual representation is required. UML allows for this representation by using a set of structural and behavioral diagrams. In this paper, the first section defines a framework for value chain analysis and highlights its advantages. The second section gives a brief overview of related work in this field. The third section gives a brief discussion on distance education. The fourth section very briefly introduces UML. The fifth section models value chain of distance education using UML. Finally we discuss the limitations and the problems posed in this domain.
Celiac disease: Serologic prevalence in patients with irritable bowel syndrome.

PubMed

Mehdi, Zobeiri; Sakineh, Ebrahimi; Mohammad, Farahvash; Mansour, Rezaei; Alireza, Abdollahi

2012-09-01

The prevalence of irritable bowel syndrome (IBS) in the community is 10%-20% and have symptom based diagnostic criteria. Many symptoms of celiac disease (CD) with 1% prevalence in some communities can mimic IBS. Sensitive and specific serologic tests of CD can detect asymptomatic cases. The purpose of this study was to compare the level of anti-tissue-transglutaminase (tTG) IgA in IBS patients and controls group. This case-control study was performed at a University hospital in which 107 patients with IBS who met the Rome II criteria for their diagnosis were compared with 126 healthy age and sex-matched controls. Both groups were investigated for CD by analysis of their serum tTG IgA antibody with human recombinant antigen. Titers were positive containing over 10u/ml and borderline if they were between 4 and 10 u/ml. 86 percent of IBS patients were female. The mean antibody level was 0.837 u/ml in IBS group and 0.933 u/ml in control group without any significant difference. Results of this study may intensify disagreement on the situation of CD in IBS patients.
Graph-based word sense disambiguation of biomedical documents.

PubMed

Agirre, Eneko; Soroa, Aitor; Stevenson, Mark

2010-11-15

Word Sense Disambiguation (WSD), automatically identifying the meaning of ambiguous words in context, is an important stage of text processing. This article presents a graph-based approach to WSD in the biomedical domain. The method is unsupervised and does not require any labeled training data. It makes use of knowledge from the Unified Medical Language System (UMLS) Metathesaurus which is represented as a graph. A state-of-the-art algorithm, Personalized PageRank, is used to perform WSD. When evaluated on the NLM-WSD dataset, the algorithm outperforms other methods that rely on the UMLS Metathesaurus alone. The WSD system is open source licensed and available from http://ixa2.si.ehu.es/ukb/. The UMLS, MetaMap program and NLM-WSD corpus are available from the National Library of Medicine https://www.nlm.nih.gov/research/umls/, http://mmtx.nlm.nih.gov and http://wsd.nlm.nih.gov. Software to convert the NLM-WSD corpus into a format that can be used by our WSD system is available from http://www.dcs.shef.ac.uk/∼marks/biomedical_wsd under open source license.
Alteration of white-rot basidiomycetes cellulase and xylanase activities in the submerged co-cultivation and optimization of enzyme production by Irpex lacteus and Schizophyllum commune.

PubMed

Metreveli, Eka; Kachlishvili, Eva; Singer, Steven W; Elisashvili, Vladimir

2017-10-01

Mono and dual cultures of four white-rot basidiomycete species were evaluated for cellulase and xylanase activity under submerged fermentation conditions. Co-cultivation of Pycnoporus coccineus or Trametes hirsuta with Schizophyllum commune displayed antagonistic interactions resulting in the decrease of endoglucanase and total cellulase activities. In contrast, increases in cellulase and xylanase activity were revealed through the compatible interactions of Irpex lacteus with S. commune. Co-cultivation conditions were optimized for maximum enzyme production by I. lacteus and S. commune, the best producers of cellulase/xylanase and β-glucosidase, respectively. An optimized medium for the target enzyme production by the mixed culture was established in a laboratory fermenter yielding 7U/mL total cellulase, 142U/mL endoglucanase, 104U/mL xylanase, and 5.2U/mL β-glucosidase. The dual culture approach resulted in an enzymatic mixture with 11% improved lignocellulose saccharification potential compared to enzymes from a monoculture of I. lacteus. Copyright © 2017 Elsevier Ltd. All rights reserved.
Co-production of tannase and pectinase by free and immobilized cells of the yeast Rhodotorula glutinis MP-10 isolated from tannin-rich persimmon (Diospyros kaki L.) fruits.

PubMed

Taskin, Mesut

2013-02-01

Hyper tannase and pectinase-producing yeast Rhodotorula glutinis MP-10 was isolated from persimmon (Diospyros kaki L.) fruits. The main pectinase activity of yeast was exo-polygalacturonase. No pectin methyl esterase and too low pectin lyase activities were detected for this yeast. The maximum exo-activities of tannase and polygalacturonase were determined as 15.2 and 26.9 U/mL for free cells and 19.8 and 28.6 U/mL for immobilized cells, respectively. Immobilized cells could be reused in 13 successive reaction cycles without any loss in the maximum tannase and polygalacturonase activities. Besides, too little decreases in activities of these enzymes were recorded between 14 and 18 cycles. At the end of 18 successive reaction cycles, total 503.1 U/mL of polygalacturonase and 349.6 U/mL of tannase could be produced using the same immobilized cells. This is the first report on the use of free and/or immobilized cells of a microorganism for the co-production of tannase and pectinase.
Evaluation of serum sCD30 in renal transplantation patients with and without acute rejection.

PubMed

Cervelli, C; Fontecchio, G; Scimitarra, M; Azzarone, R; Famulari, A; Pisani, F; Battistoni, C; Di Iulio, B; Fracassi, D; Scarnecchia, M A; Papola, F

2009-05-01

Despite new immunosuppressive approaches, acute rejection episodes (ARE) are still a major cause of early kidney dysfunction with a negative impact on long-term allograft survival. Noninvasive markers able to identify renal ARE earlier than creatinine measurement include sCD30. We sought to establish whether circulating levels of sCD30 in pretransplantation and posttransplantation periods were of clinical relevance to avoid graft damage. Quantitative detection of serum sCD30 was performed using an enzyme-linked immunosorbent assay. Our results demonstrated that the mean concentrations of sCD30 were significantly higher in the sera of renal transplant recipients with ARE (30.04 U/mL) and in uremic patients on the waiting list (37.7 U/mL) compared with healthy controls (HC; 9.44 U/mL), but not nonrejecting patients (12.01 U/mL). Statistical analysis revealed a strong association between high sCD30 levels in posttransplantation sera and ARE risk. This study suggested that sCD30 levels were a reliable predictor of ARE among deceased-donor kidney recipients.
Auditing the multiply-related concepts within the UMLS

PubMed Central

Mougin, Fleur; Grabar, Natalia

2014-01-01

Objective This work focuses on multiply-related Unified Medical Language System (UMLS) concepts, that is, concepts associated through multiple relations. The relations involved in such situations are audited to determine whether they are provided by source vocabularies or result from the integration of these vocabularies within the UMLS. Methods We study the compatibility of the multiple relations which associate the concepts under investigation and try to explain the reason why they co-occur. Towards this end, we analyze the relations both at the concept and term levels. In addition, we randomly select 288 concepts associated through contradictory relations and manually analyze them. Results At the UMLS scale, only 0.7% of combinations of relations are contradictory, while homogeneous combinations are observed in one-third of situations. At the scale of source vocabularies, one-third do not contain more than one relation between the concepts under investigation. Among the remaining source vocabularies, seven of them mainly present multiple non-homogeneous relations between terms. Analysis at the term level also shows that only in a quarter of cases are the source vocabularies responsible for the presence of multiply-related concepts in the UMLS. These results are available at: http://www.isped.u-bordeaux2.fr/ArticleJAMIA/results_multiply_related_concepts.aspx. Discussion Manual analysis was useful to explain the conceptualization difference in relations between terms across source vocabularies. The exploitation of source relations was helpful for understanding why some source vocabularies describe multiple relations between a given pair of terms. PMID:24464853
High pre-transplant soluble CD30 levels are predictive of the grade of rejection.

PubMed

Rajakariar, Ravindra; Jivanji, Naina; Varagunam, Mira; Rafiq, Mohammad; Gupta, Arun; Sheaff, Michael; Sinnott, Paul; Yaqoob, M M

2005-08-01

In renal transplantation, serum soluble CD30 (sCD30) levels in graft recipients are associated with increased rejection and graft loss. We investigated whether pre-transplant sCD30 concentrations are predictive of the grade of rejection. Pre-transplant sera of 51 patients with tubulointerstitial rejection (TIR), 16 patients with vascular rejection (VR) and an age-matched control group of 41 patients with no rejection (NR) were analyzed for sCD30. The transplant biopsies were immunostained for C4d. The median sCD30 level was significantly elevated in the group with VR (248 Units (U)/mL, range: 92-802) when compared with TIR (103 U/mL, range: 36-309, p<0.001) and NR (179 U/mL, range: 70-343, p<0.03). Moreover, patients with TIR had significantly lower sCD30 levels compared to NR. Based on C4d staining, a TH2 driven process, the median sCD30 levels were significantly raised in C4d+ patients compared with C4d- group (177 U/mL vs. 120 U/mL, p<0.05). sCD30 levels measured at time of transplantation correlate with the grade of rejection. High pre-transplant levels are associated with antibody-mediated rejection which carries a poorer prognosis. sCD30 could be another tool to assess immunological risk prior to transplantation and enable a patient centered approach to immunosuppression.
White organic light-emitting diodes with ultra-thin mixed emitting layer

NASA Astrophysics Data System (ADS)

Jeon, T.; Forget, S.; Chenais, S.; Geffroy, B.; Tondelier, D.; Bonnassieux, Y.; Ishow, E.

2012-02-01

White light can be obtained from Organic Light Emitting Diodes by mixing three primary colors, (i.e. red, green and blue) or two complementary colors in the emissive layer. In order to improve the efficiency and stability of the devices, a host-guest system is generally used as an emitting layer. However, the color balance to obtain white light is difficult to control and optimize because the spectrum is very sensitive to doping concentration (especially when a small amount of material is used). We use here an ultra-thin mixed emitting layer (UML) deposited by thermal evaporation to fabricate white organic light emitting diodes (WOLEDs) without co-evaporation. The UML was inserted in the hole-transporting layer consisting of 4, 4'-bis[N-(1-naphtyl)-N-phenylamino]biphenyl (α-NPB) instead of using a conventional doping process. The UML was formed from a single evaporation boat containing a mixture of two dipolar starbust triarylamine molecules (fvin and fcho) presenting very similar structures and thermal properties and emitting in complementary spectral regions (orange and blue respectively) and mixed according to their weight ratio. The composition of the UML specifically allows for fine tuning of the emission color despite its very thin thickness down to 1 nm. Competitive energy transfer processes from fcho and the host interface toward fvin are key parameters to control the relative intensity between red and blue emission. White light with very good CIE 1931 color coordinate (0.34, 0.34) was obtained by simply adjusting the UML film composition.
Scale-up of an alkaline protease from Bacillus pumilus MTCC 7514 utilizing fish meal as a sole source of nutrients.

PubMed

Gupta, Rishikesh Kumar; Prasad, Dinesh; Sathesh, Jaykumar; Naidu, Ramachandra Boopathy; Kamini, Numbi Ramudu; Palanivel, Saravanan; Gowthaman, Marichetti Kuppuswami

2012-09-01

Fish meal grades SL1 and SL2 from Sardine (Sardinella longiceps) and NJ from Pink Perch (Nemipterus japonicas) were evaluated as a sole source of carbon and nitrogen in the medium for alkaline protease production by Bacillus pumilus MTCC 7514. The analysis of the fish meal suggests that the carbon and nitrogen contents in fish meal are sufficient to justify its choice as replacement for other nutrients. Protease production increased significantly (4,914 U/ml) in medium containing only fish meal, compared with the basal medium (2,646 U/ml). However, the elimination of inorganic salts from media reduced the protease productivity. In addition, all the three grades of fish meal yielded almost the same amounts of protease when employed as the sole source of carbon and nitrogen. Nevertheless, the best results were observed in fish meal SL1 medium. Furthermore, protease production was enhanced to 6,966 U/ml and 7,047 U/ml on scaling up from flask (4,914 U/ml) to 3.7 and 20 L fermenters, respectively, using fish meal (10 g/l). Similarly, the corresponding improvement in productivities over flask (102.38 U/ml/h) was 193.5 and 195.75 U/ml/h in 3.7 and 20 L fermenters, respectively. The crude protease was found to have dehairing ability in leather processing, which is bound to have great environmental benefits.
Formal verification of software-based medical devices considering medical guidelines.

PubMed

Daw, Zamira; Cleaveland, Rance; Vetter, Marcus

2014-01-01

Software-based devices have increasingly become an important part of several clinical scenarios. Due to their critical impact on human life, medical devices have very strict safety requirements. It is therefore necessary to apply verification methods to ensure that the safety requirements are met. Verification of software-based devices is commonly limited to the verification of their internal elements without considering the interaction that these elements have with other devices as well as the application environment in which they are used. Medical guidelines define clinical procedures, which contain the necessary information to completely verify medical devices. The objective of this work was to incorporate medical guidelines into the verification process in order to increase the reliability of the software-based medical devices. Medical devices are developed using the model-driven method deterministic models for signal processing of embedded systems (DMOSES). This method uses unified modeling language (UML) models as a basis for the development of medical devices. The UML activity diagram is used to describe medical guidelines as workflows. The functionality of the medical devices is abstracted as a set of actions that is modeled within these workflows. In this paper, the UML models are verified using the UPPAAL model-checker. For this purpose, a formalization approach for the UML models using timed automaton (TA) is presented. A set of requirements is verified by the proposed approach for the navigation-guided biopsy. This shows the capability for identifying errors or optimization points both in the workflow and in the system design of the navigation device. In addition to the above, an open source eclipse plug-in was developed for the automated transformation of UML models into TA models that are automatically verified using UPPAAL. The proposed method enables developers to model medical devices and their clinical environment using clinical workflows as one UML diagram. Additionally, the system design can be formally verified automatically.

Downregulation of cell survival signalling pathways and increased cell damage in hydrogen peroxide-treated human renal proximal tubular cells by alpha-erythropoietin.

PubMed

Andreucci, M; Fuiano, G; Presta, P; Lucisano, G; Leone, F; Fuiano, L; Bisesti, V; Esposito, P; Russo, D; Memoli, B; Faga, T; Michael, A

2009-08-01

Erythropoietin has been shown to have a protective effect in certain models of ischaemia-reperfusion, and in some cases the protection has been correlated with activation of signalling pathways known to play a role in cell survival and proliferation. We have studied whether erythropoietin would overcome direct toxic effects of hydrogen peroxide (H(2)O(2)) treatment to human renal proximal tubular (HK-2) cells. HK-2 cells were incubated with H(2)O(2) (2 mm) for 2 h with or without erythropoietin at concentrations of 100 and 400 U/ml, and cell viability/proliferation was assessed by chemical reduction of MTT. Changes in phosphorylation state of the kinases Akt, glycogen synthase kinase-3beta (GSK-3beta), mammalian target of rapamycin (mTOR) and extracellular signal-regulated kinase 1 and 2 (ERK1/ERK2) were also analysed. Cells incubated with H(2)O(2) alone showed a significant decrease in viability, which did not significantly change by addition of erythropoietin at concentration of 100 U/ml, but was further reduced when concentration of erythropoietin was increased to 400 U/ml. Phosphorylation state of the kinases Akt, GSK-3beta, mTOR and ERK1/ERK2 of H(2)O(2)-treated HK-2 cells was slightly altered in the presence of erythropoietin at concentration of 100 U/ml, but was significantly less in the presence of erythropoietin at a concentration of 400 U/ml. Phosphorylation of forkhead transcription factor FKHRL1 was diminished in cells incubated with H(2)O(2) and erythropoietin at a concentration of 400 U/ml. Erythropoietin, at high concentrations, may significantly increase cellular damage in HK-2 cells subjected to oxidative stress, which may be due in part to decrease in activation of important signalling pathways involved in cell survival and/or cell proliferation.
INSULIN GLARGINE 300 U/ML IS ASSOCIATED WITH LESS WEIGHT GAIN WHILE MAINTAINING GLYCEMIC CONTROL AND LOW RISK OF HYPOGLYCEMIA COMPARED WITH INSULIN GLARGINE 100 U/ML IN AN AGING POPULATION WITH TYPE 2 DIABETES.

PubMed

Munshi, Medha N; Gill, Jasvinder; Chao, Jason; Nikonova, Elena V; Patel, Meenakshi

2018-02-01

Assess efficacy, hypoglycemia, and weight gain in patients with type 2 diabetes (T2D) treated with insulin glargine 300 U/mL (Gla-300) or 100 U/mL (Gla-100) across different age groups. Pooled data were generated for patients randomized to Gla-300 or Gla-100 in the EDITION 2 (NCT01499095) and 3 (NCT01676220) studies. In 4 age groups (<55, ≥55 to <60, ≥60 to <65, ≥65 years), glycated hemoglobin A1C (A1C), percentage of patients reaching A1C <7.5% (58 mmol/mol), weight change, confirmed hypoglycemia (blood glucose ≤70 mg/dL), and/or severe hypoglycemia (events requiring third-party assistance) were analyzed with descriptive statistics and logistic, binomial, and analysis of covariance regression modeling. A1C reductions from baseline and proportions of patients at target were similar for Gla-300 and Gla-100 across all age groups at 6 and 12 months, but hypoglycemia incidence and event rate were lower with Gla-300 at 6 (both P<.001) and 12 months ( P<.001 and P = .005, respectively). Patients on Gla-300 gained less weight than those on Gla-100 at 6 ( P = .027) and 12 months ( P = .021). Changes in weight and daily weight-adjusted insulin dose decreased with increasing age at 6 ( P<.001 and P = .017, respectively) and 12 months ( P<.001 and P = .011, respectively). Older patients with T2D may benefit from treatment with Gla-300, which is associated with a lower hypoglycemia rate and less weight gain with similar efficacy compared with Gla-100. A1C = glycated hemoglobin A1C BMI = body mass index Gla-100 = insulin glargine 100 U/mL Gla-300 = insulin glargine 300 U/mL OAD = oral antidiabetes drug T2D = type 2 diabetes.
Soluble CD30 in patients with antibody-mediated rejection of the kidney allograft.

PubMed

Slavcev, Antonij; Honsova, Eva; Lodererova, Alena; Pavlova, Yelena; Sajdlova, Helena; Vitko, Stefan; Skibova, Jelena; Striz, Ilja; Viklicky, Ondrej

2007-07-01

The aim of our retrospective study was to evaluate the clinical significance of measurement of the soluble CD30 (sCD30) molecule for the prediction of antibody-mediated (humoral) rejection (HR). Sixty-two kidney transplant recipients (thirty-one C4d-positive and thirty-one C4d-negative patients) were included into the study. Soluble CD30 levels were evaluated before transplantation and during periods of graft function deterioration. The median concentrations of the sCD30 molecule were identical in C4d-positive and C4d-negative patients before and after transplantation (65.5 vs. 65.0 and 28.2 vs. 36.0 U/ml, respectively). C4d+ patients who developed DSA de novo had a tendency to have higher sCD30 levels before transplantation (80.7+/-53.6 U/ml, n=8) compared with C4d-negative patients (65.0+/-33.4 U/ml, n=15). Soluble CD30 levels were evaluated as positive and negative (>or=100 U/ml and <100 U/ml respectively) and the sensitivity, specificity and accuracy of sCD30 estimation with regard to finding C4d deposits in peritubular capillaries were determined. The sensitivity of sCD30+ testing was generally below 40%, while the specificity of the test, i.e. the likelihood that if sCD30 testing is negative, C4d deposits would be absent, was 82%. C4d+ patients who developed DSA de novo were evaluated separately; the specificity of sCD30 testing for the incidence of HR in this cohort was 86%. We could not confirm in our study that high sCD30 levels (>or=100 U/ml) might be predictive for the incidence of HR. Negative sCD30 values might be however helpful for identifying patients with a low risk for development of DSA and antibody-mediated rejection.
Old torsion Balance Observations - too old for modern Exploration?

NASA Astrophysics Data System (ADS)

Götze, H.-J.

2003-04-01

Gravity gradiometry is a new gravity measurement technology that could fundamentally change the game of subsurface modelling and enhance geological interpretations: at fully inertial stabilized platforms they provide observed components of the E&{uml;o}tv&{uml;o}s tensor for 3D interpretations in mining and oil exploration and other fields of pure and applied geophysics. Although gravity gradiometry was among the first geophysical methods used successfully in applied Geophysics (E&{uml;o}tv&{uml;o}s torsion balance), the technology fell from favour in the 1930s. From this time measurements, done by torsion balances (Drehwaagen), are presented here which were observed to detect salt domes in the Northwest German basin. The data were digitized from old copies, then reprocessed and recalculated to draw Bouguer anomaly maps. However, the second derivatives of the gravity potential provide also independent data which can be used to constrain forward modelling. 3D modelling of Vxz, Vyz and other components of the E&{uml;o}tv&{uml;o}s tensor provide better insight into the geometry of the salt dome structure than modelling of the Bouguer gravity field. In addition to this first example results from gravity data processing by applying curvature techniques and again 3D forward modelling of second derivatives of the potential of density domains in the uppermost crust in the area of the Dead Sea Transform (Jordan) is presented here. The 3D modelling is conducted by the program package IGMAS which supply possibilities to calculate potential, gravity, its components and the Eötvös tensor components. Based on results so far one can conclude that the knowledge of the "second derivatives of the potential" could fundamentally change the role of gravity field measurements in the process of underground investigations not only for resource exploration but for investigations along large faults systems.
Evaluation of serum CA27.29, CA15-3 and CEA in patients with breast cancer.

PubMed

Hou, M F; Chen, Y L; Tseng, T F; Lin, C M; Chen, M S; Huang, C J; Huang, Y S; Hsieh, J S; Huang, T J; Jong, S B; Huang, Y F

1999-09-01

The Truquant BR radioimmunoassay (RIA) using monoclonal antibody BR 27.29 to recognize a peptide sequence on the MUC-1 gene product for quantification of the CA 27.29 antigen in serum was used in this report to evaluate in 145 patients with breast cancer and compared the other conventional serum markers such as CA15-3 and CEA. The upper limit of normal (25 u/ml) was determined from CA27.29 values 12.4 +/- 4.1 u/ml (mean +/- 3 S.D.) for 112 female subjects apparently free of disease. The CA15-3 levels above 25 u/ml and CEA levels above 5 ng/ml were considered positive values. Thirty-seven cases of 145 patients studied had elevated CA 27.29 levels (sensitivity: 25.5%), 35 of 145 had positive CA15-3 levels (sensitivity 24.1%) and 27 of 145 patients had positive CEA levels (sensitivity: 18.6%) (p < 0.05). One hundred and ten cases of the breast cancer patients (75.8%) did not have metastatic disease. In this group CA 27.29 sensitivity was 6.4%, while CA15-3 sensitivity was 5.5% and CEA sensitivity was 4.5% (p > 0.05). Mean values were 10.2 +/- 9.2 u/ml for CA 27.29, 14.1 +/- 5.6 u/ml for CA 15-3 and 1.7 +/- 1.5 ng/ml for CEA. Thirty-five patients (24.2%) had metastatic disease. In this group CA 27.29 sensitivity was 85.7%, CA15-3 sensitivity was 82.8% and CEA sensitivity was 62.8% (p < 0.05). Mean values for CA27.29 was 152.6 +/- 131.6 u/ml, CA15-3 was 123.1 +/- 107.6 u/ml and 21.8 +/- 36.9 ng/ml of CEA. With regard to the correlation of three tumor markers with clinical stages, patients had significantly higher levels of CA27.29 than CEA, but they were similar to CA 15-3 in metastatic breast cancer. These results suggest CA27.29 to be more sensitive and specific than CEA, but that it is similar to CA15-3 for metastatic breast cancer detection and monitoring.
Physician nurse care: A new use of UMLS to measure professional contribution: Are we talking about the same patient a new graph matching algorithm?

PubMed

Boyd, Andrew D; Dunn Lopez, Karen; Lugaresi, Camillo; Macieira, Tamara; Sousa, Vanessa; Acharya, Sabita; Balasubramanian, Abhinaya; Roussi, Khawllah; Keenan, Gail M; Lussier, Yves A; Li, Jianrong 'John'; Burton, Michel; Di Eugenio, Barbara

2018-05-01

Physician and nurses have worked together for generations; however, their language and training are vastly different; comparing and contrasting their work and their joint impact on patient outcomes is difficult in light of this difference. At the same time, the EHR only includes the physician perspective via the physician-authored discharge summary, but not nurse documentation. Prior research in this area has focused on collaboration and the usage of similar terminology. The objective of the study is to gain insight into interprofessional care by developing a computational metric to identify similarities, related concepts and differences in physician and nurse work. 58 physician discharge summaries and the corresponding nurse plans of care were transformed into Unified Medical Language System (UMLS) Concept Unique Identifiers (CUIs). MedLEE, a Natural Language Processing (NLP) program, extracted "physician terms" from free-text physician summaries. The nursing plans of care were constructed using the HANDS © nursing documentation software. HANDS © utilizes structured terminologies: nursing diagnosis (NANDA-I), outcomes (NOC), and interventions (NIC) to create "nursing terms". The physician's and nurse's terms were compared using the UMLS network for relatedness, overlaying the physician and nurse terms for comparison. Our overarching goal is to provide insight into the care, by innovatively applying graph algorithms to the UMLS network. We reveal the relationships between the care provided by each professional that is specific to the patient level. We found that only 26% of patients had synonyms (identical UMLS CUIs) between the two professions' documentation. On average, physicians' discharge summaries contain 27 terms and nurses' documentation, 18. Traversing the UMLS network, we found an average of 4 terms related (distance less than 2) between the professions, leaving most concepts as unrelated between nurse and physician care. Our hypothesis that physician's and nurse's practice domains are markedly different is supported by the preliminary, quantitative evidence we found. Leveraging the UMLS network and graph traversal algorithms, allows us to compare and contrast nursing and physician care on a single patient, enabling a more complete picture of patient care. We can differentiate professional contributions to patient outcomes and related and divergent concepts by each profession. Copyright © 2018 The Author(s). Published by Elsevier B.V. All rights reserved.
Physician nurse care: A new use of UMLS to measure professional contribution

PubMed Central

Boyd, Andrew D.; Lopez, Karen Dunn; Lugaresi, Camillo; Macieira, Tamara; Sousa, Vanessa; Acharya, Sabita; Balasubramanian, Abhinaya; Roussi, Khawllah; Keenan, Gail M.; Lussier, Yves A.; ‘John’ Li, Jianrong; Burton, Michel; Di Eugenio, Barbara

2018-01-01

Background Physician and nurses have worked together for generations; however, their language and training are vastly different; comparing and contrasting their work and their joint impact on patient outcomes is difficult in light of this difference. At the same time, the EHR only includes the physician perspective via the physician-authored discharge summary, but not nurse documentation. Prior research in this area has focused on collaboration and the usage of similar terminology. Objective The objective of the study is to gain insight into interprofessional care by developing a computational metric to identify similarities, related concepts and differences in physician and nurse work. Methods 58 physician discharge summaries and the corresponding nurse plans of care were transformed into Unified Medical Language System (UMLS) Concept Unique Identifiers (CUIs). MedLEE, a Natural Language Processing (NLP) program, extracted “physician terms” from free-text physician summaries. The nursing plans of care were constructed using the HANDS© nursing documentation software. HANDS© utilizes structured terminologies: nursing diagnosis (NANDA-I), outcomes (NOC), and interventions (NIC) to create “nursing terms”. The physician’s and nurse’s terms were compared using the UMLS network for relatedness, overlaying the physician and nurse terms for comparison. Our overarching goal is to provide insight into the care, by innovatively applying graph algorithms to the UMLS network. We reveal the relationships between the care provided by each professional that is specific to the patient level. Results We found that only 26% of patients had synonyms (identical UMLS CUIs) between the two professions’ documentation. On average, physicians’ discharge summaries contain 27 terms and nurses’ documentation, 18. Traversing the UMLS network, we found an average of 4 terms related (distance less than 2) between the professions, leaving most concepts as unrelated between nurse and physician care. Conclusion Our hypothesis that physician’s and nurse’s practice domains are markedly different is supported by the preliminary, quantitative evidence we found. Leveraging the UMLS network and graph traversal algorithms, allows us to compare and contrast nursing and physician care on a single patient, enabling a more complete picture of patient care. We can differentiate professional contributions to patient outcomes and related and divergent concepts by each profession. PMID:29602435
Impaired semantic inhibition during lexical ambiguity repetition in Parkinson's disease.

PubMed

Copland, David A; Sefe, Gameli; Ashley, Jane; Hudson, Carrie; Chenery, Helen J

2009-09-01

Impairments of semantic processing and inhibition have been observed in Parkinson's disease (PD), however, the consequences of faulty meaning selection and suppression have not been considered in terms of subsequent lexical processing. The present study employed a lexical ambiguity repetition paradigm where the first presentation of an ambiguity paired with a target biasing its dominant or subordinate meaning (e.g., bank - money or bank - river) was followed after several intervening trials by a presentation of the same ambiguity paired with a different target that biases the same (congruent) or a different (incongruent) meaning to that biased on the first presentation. Meaning dominance (dominant or subordinate weaker meanings) and interstimulus interval (ISI) were manipulated. Analyses conducted on the second presentation indicated priming of congruent meanings and no priming for the incongruent meanings at both short and long ISIs in the healthy controls, consistent with suppression of meanings competing with the representation biased in the first presentation. In contrast, the PD group failed to dampen activation for the incongruent meaning at the long ISI when the first presentation was subordinate. This pattern is consistent with an impairment of meaning suppression which is observed under controlled processing conditions and varies as a function of meaning dominance of the first presentation. These findings further refine our understanding of lexical-semantic impairments in PD and suggest a mechanism that may contribute to discourse comprehension impairments in this population.
Phase II evaluation of clinical coding schemes: completeness, taxonomy, mapping, definitions, and clarity. CPRI Work Group on Codes and Structures.

PubMed

Campbell, J R; Carpenter, P; Sneiderman, C; Cohn, S; Chute, C G; Warren, J

1997-01-01

To compare three potential sources of controlled clinical terminology (READ codes version 3.1, SNOMED International, and Unified Medical Language System (UMLS) version 1.6) relative to attributes of completeness, clinical taxonomy, administrative mapping, term definitions and clarity (duplicate coding rate). The authors assembled 1929 source concept records from a variety of clinical information taken from four medical centers across the United States. The source data included medical as well as ample nursing terminology. The source records were coded in each scheme by an investigator and checked by the coding scheme owner. The codings were then scored by an independent panel of clinicians for acceptability. Codes were checked for definitions provided with the scheme. Codes for a random sample of source records were analyzed by an investigator for "parent" and "child" codes within the scheme. Parent and child pairs were scored by an independent panel of medical informatics specialists for clinical acceptability. Administrative and billing code mapping from the published scheme were reviewed for all coded records and analyzed by independent reviewers for accuracy. The investigator for each scheme exhaustively searched a sample of coded records for duplications. SNOMED was judged to be significantly more complete in coding the source material than the other schemes (SNOMED* 70%; READ 57%; UMLS 50%; *p < .00001). SNOMED also had a richer clinical taxonomy judged by the number of acceptable first-degree relatives per coded concept (SNOMED* 4.56, UMLS 3.17; READ 2.14, *p < .005). Only the UMLS provided any definitions; these were found for 49% of records which had a coding assignment. READ and UMLS had better administrative mappings (composite score: READ* 40.6%; UMLS* 36.1%; SNOMED 20.7%, *p < .00001), and SNOMED had substantially more duplications of coding assignments (duplication rate: READ 0%; UMLS 4.2%; SNOMED* 13.9%, *p < .004) associated with a loss of clarity. No major terminology source can lay claim to being the ideal resource for a computer-based patient record. However, based upon this analysis of releases for April 1995, SNOMED International is considerably more complete, has a compositional nature and a richer taxonomy. Is suffers from less clarity, resulting from a lack of syntax and evolutionary changes in its coding scheme. READ has greater clarity and better mapping to administrative schemes (ICD-10 and OPCS-4), is rapidly changing and is less complete. UMLS is a rich lexical resource, with mappings to many source vocabularies. It provides definitions for many of its terms. However, due to the varying granularities and purposes of its source schemes, it has limitations for representation of clinical concepts within a computer-based patient record.
Phase II Evaluation of Clinical Coding Schemes

PubMed Central

Campbell, James R.; Carpenter, Paul; Sneiderman, Charles; Cohn, Simon; Chute, Christopher G.; Warren, Judith

1997-01-01

Abstract Objective: To compare three potential sources of controlled clinical terminology (READ codes version 3.1, SNOMED International, and Unified Medical Language System (UMLS) version 1.6) relative to attributes of completeness, clinical taxonomy, administrative mapping, term definitions and clarity (duplicate coding rate). Methods: The authors assembled 1929 source concept records from a variety of clinical information taken from four medical centers across the United States. The source data included medical as well as ample nursing terminology. The source records were coded in each scheme by an investigator and checked by the coding scheme owner. The codings were then scored by an independent panel of clinicians for acceptability. Codes were checked for definitions provided with the scheme. Codes for a random sample of source records were analyzed by an investigator for “parent” and “child” codes within the scheme. Parent and child pairs were scored by an independent panel of medical informatics specialists for clinical acceptability. Administrative and billing code mapping from the published scheme were reviewed for all coded records and analyzed by independent reviewers for accuracy. The investigator for each scheme exhaustively searched a sample of coded records for duplications. Results: SNOMED was judged to be significantly more complete in coding the source material than the other schemes (SNOMED* 70%; READ 57%; UMLS 50%; *p <.00001). SNOMED also had a richer clinical taxonomy judged by the number of acceptable first-degree relatives per coded concept (SNOMED* 4.56; UMLS 3.17; READ 2.14, *p <.005). Only the UMLS provided any definitions; these were found for 49% of records which had a coding assignment. READ and UMLS had better administrative mappings (composite score: READ* 40.6%; UMLS* 36.1%; SNOMED 20.7%, *p <. 00001), and SNOMED had substantially more duplications of coding assignments (duplication rate: READ 0%; UMLS 4.2%; SNOMED* 13.9%, *p <. 004) associated with a loss of clarity. Conclusion: No major terminology source can lay claim to being the ideal resource for a computer-based patient record. However, based upon this analysis of releases for April 1995, SNOMED International is considerably more complete, has a compositional nature and a richer taxonomy. It suffers from less clarity, resulting from a lack of syntax and evolutionary changes in its coding scheme. READ has greater clarity and better mapping to administrative schemes (ICD-10 and OPCS-4), is rapidly changing and is less complete. UMLS is a rich lexical resource, with mappings to many source vocabularies. It provides definitions for many of its terms. However, due to the varying granularities and purposes of its source schemes, it has limitations for representation of clinical concepts within a computer-based patient record. PMID:9147343
"UML Quiz": Automatic Conversion of Web-Based E-Learning Content in Mobile Applications

ERIC Educational Resources Information Center

von Franqué, Alexander; Tellioglu, Hilda

2014-01-01

Many educational institutions use Learning Management Systems to provide e-learning content to their students. This often includes quizzes that can help students to prepare for exams. However, the content is usually web-optimized and not very usable on mobile devices. In this work a native mobile application ("UML Quiz") that imports…
Diagram, a Learning Environment for Initiation to Object-Oriented Modeling with UML Class Diagrams

ERIC Educational Resources Information Center

Py, Dominique; Auxepaules, Ludovic; Alonso, Mathilde

2013-01-01

This paper presents Diagram, a learning environment for object-oriented modelling (OOM) with UML class diagrams. Diagram an open environment, in which the teacher can add new exercises without constraints on the vocabulary or the size of the diagram. The interface includes methodological help, encourages self-correcting and self-monitoring, and…
The Future of Architecture Collaborative Information Sharing: DoDAF Version 2.03 Updates

DTIC Science & Technology

2012-04-30

Salamander x Select Solution Factory Select Business Solutions BPMN , UML x SimonTool Simon Labs x SimProcess CACI BPMN x System Architecture Management...for DoDAF Mega UML x Metastorm ProVision Metastorm BPMN x Naval Simulation System - 4 Aces METRON x NetViz CA x OPNET OPNET x Tool Name Vendor Primary
Production of Cellulolytic and Hemicellulolytic Enzymes From Aureobasidium pulluans on Solid State Fermentation

NASA Astrophysics Data System (ADS)

Leite, Rodrigo Simões Ribeiro; Bocchini, Daniela Alonso; da Silva Martins, Eduardo; Silva, Dênis; Gomes, Eleni; da Silva, Roberto

This article investigates a strain of the yeast Aureobasidium pullulans for cellulase and hemicellulase production in solid state fermentation. Among the substrates analyzed, the wheat bran culture presented the highest enzymatic production (1.05 U/mL endoglucanase, 1.3 U/mL β-glucosidase, and 5.0 U/mL xylanase). Avicelase activity was not detected. The optimum pH and temperature for xylanase, endoglucanase and β-glucosidase were 5.0 and 50, 4.5 and 60, 4.0 and 75°C, respectively. These enzymes remained stable between a wide range of pH. The β-glucosidase was the most thermostable enzyme remaining 100% active when incubated at 75°C for 1 h.
Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.

PubMed

Bakal, Gokhan; Talari, Preetham; Kakani, Elijah V; Kavuluru, Ramakanth

2018-06-01

Identifying new potential treatment options for medical conditions that cause human disease burden is a central task of biomedical research. Since all candidate drugs cannot be tested with animal and clinical trials, in vitro approaches are first attempted to identify promising candidates. Likewise, identifying different causal relations between biomedical entities is also critical to understand biomedical processes. Generally, natural language processing (NLP) and machine learning are used to predict specific relations between any given pair of entities using the distant supervision approach. To build high accuracy supervised predictive models to predict previously unknown treatment and causative relations between biomedical entities based only on semantic graph pattern features extracted from biomedical knowledge graphs. We used 7000 treats and 2918 causes hand-curated relations from the UMLS Metathesaurus to train and test our models. Our graph pattern features are extracted from simple paths connecting biomedical entities in the SemMedDB graph (based on the well-known SemMedDB database made available by the U.S. National Library of Medicine). Using these graph patterns connecting biomedical entities as features of logistic regression and decision tree models, we computed mean performance measures (precision, recall, F-score) over 100 distinct 80-20% train-test splits of the datasets. For all experiments, we used a positive:negative class imbalance of 1:10 in the test set to model relatively more realistic scenarios. Our models predict treats and causes relations with high F-scores of 99% and 90% respectively. Logistic regression model coefficients also help us identify highly discriminative patterns that have an intuitive interpretation. We are also able to predict some new plausible relations based on false positives that our models scored highly based on our collaborations with two physician co-authors. Finally, our decision tree models are able to retrieve over 50% of treatment relations from a recently created external dataset. We employed semantic graph patterns connecting pairs of candidate biomedical entities in a knowledge graph as features to predict treatment/causative relations between them. We provide what we believe is the first evidence in direct prediction of biomedical relations based on graph features. Our work complements lexical pattern based approaches in that the graph patterns can be used as additional features for weakly supervised relation prediction. Copyright © 2018 Elsevier Inc. All rights reserved.
A View from Above Without Leaving the Ground

NASA Technical Reports Server (NTRS)

2004-01-01

In order to deliver accurate geospatial data and imagery to the remote sensing community, NASA is constantly developing new image-processing algorithms while refining existing ones for technical improvement. For 8 years, the NASA Regional Applications Center at Florida International University has served as a test bed for implementing and validating many of these algorithms, helping the Space Program to fulfill its strategic and educational goals in the area of remote sensing. The algorithms in return have helped the NASA Regional Applications Center develop comprehensive semantic database systems for data management, as well as new tools for disseminating geospatial information via the Internet.
A future Outlook: Web based Simulation of Hydrodynamic models

NASA Astrophysics Data System (ADS)

Islam, A. S.; Piasecki, M.

2003-12-01

Despite recent advances to present simulation results as 3D graphs or animation contours, the modeling user community still faces some shortcomings when trying to move around and analyze data. Typical problems include the lack of common platforms with standard vocabulary to exchange simulation results from different numerical models, insufficient descriptions about data (metadata), lack of robust search and retrieval tools for data, and difficulties to reuse simulation domain knowledge. This research demonstrates how to create a shared simulation domain in the WWW and run a number of models through multi-user interfaces. Firstly, meta-datasets have been developed to describe hydrodynamic model data based on geographic metadata standard (ISO 19115) that has been extended to satisfy the need of the hydrodynamic modeling community. The Extended Markup Language (XML) is used to publish this metadata by the Resource Description Framework (RDF). Specific domain ontology for Web Based Simulation (WBS) has been developed to explicitly define vocabulary for the knowledge based simulation system. Subsequently, this knowledge based system is converted into an object model using Meta Object Family (MOF). The knowledge based system acts as a Meta model for the object oriented system, which aids in reusing the domain knowledge. Specific simulation software has been developed based on the object oriented model. Finally, all model data is stored in an object relational database. Database back-ends help store, retrieve and query information efficiently. This research uses open source software and technology such as Java Servlet and JSP, Apache web server, Tomcat Servlet Engine, PostgresSQL databases, Protégé ontology editor, RDQL and RQL for querying RDF in semantic level, Jena Java API for RDF. Also, we use international standards such as the ISO 19115 metadata standard, and specifications such as XML, RDF, OWL, XMI, and UML. The final web based simulation product is deployed as Web Archive (WAR) files which is platform and OS independent and can be used by Windows, UNIX, or Linux. Keywords: Apache, ISO 19115, Java Servlet, Jena, JSP, Metadata, MOF, Linux, Ontology, OWL, PostgresSQL, Protégé, RDF, RDQL, RQL, Tomcat, UML, UNIX, Windows, WAR, XML
From Informal Safety-Critical Requirements to Property-Driven Formal Validation

NASA Technical Reports Server (NTRS)

Cimatti, Alessandro; Roveri, Marco; Susi, Angelo; Tonetta, Stefano

2008-01-01

Most of the efforts in formal methods have historically been devoted to comparing a design against a set of requirements. The validation of the requirements themselves, however, has often been disregarded, and it can be considered a largely open problem, which poses several challenges. The first challenge is given by the fact that requirements are often written in natural language, and may thus contain a high degree of ambiguity. Despite the progresses in Natural Language Processing techniques, the task of understanding a set of requirements cannot be automatized, and must be carried out by domain experts, who are typically not familiar with formal languages. Furthermore, in order to retain a direct connection with the informal requirements, the formalization cannot follow standard model-based approaches. The second challenge lies in the formal validation of requirements. On one hand, it is not even clear which are the correctness criteria or the high-level properties that the requirements must fulfill. On the other hand, the expressivity of the language used in the formalization may go beyond the theoretical and/or practical capacity of state-of-the-art formal verification. In order to solve these issues, we propose a new methodology that comprises of a chain of steps, each supported by a specific tool. The main steps are the following. First, the informal requirements are split into basic fragments, which are classified into categories, and dependency and generalization relationships among them are identified. Second, the fragments are modeled using a visual language such as UML. The UML diagrams are both syntactically restricted (in order to guarantee a formal semantics), and enriched with a highly controlled natural language (to allow for modeling static and temporal constraints). Third, an automatic formal analysis phase iterates over the modeled requirements, by combining several, complementary techniques: checking consistency; verifying whether the requirements entail some desirable properties; verify whether the requirements are consistent with selected scenarios; diagnosing inconsistencies by identifying inconsistent cores; identifying vacuous requirements; constructing multiple explanations by enabling the fault-tree analysis related to particular fault models; verifying whether the specification is realizable.
An Experiment Comparing Lexical and Statistical Methods for Extracting MeSH Terms from Clinical Free Text

PubMed Central

Cooper, Gregory F.; Miller, Randolph A.

1998-01-01

Abstract Objective: A primary goal of the University of Pittsburgh's 1990-94 UMLS-sponsored effort was to develop and evaluate PostDoc (a lexical indexing system) and Pindex (a statistical indexing system) comparatively, and then in combination as a hybrid system. Each system takes as input a portion of the free text from a narrative part of a patient's electronic medical record and returns a list of suggested MeSH terms to use in formulating a Medline search that includes concepts in the text. This paper describes the systems and reports an evaluation. The intent is for this evaluation to serve as a step toward the eventual realization of systems that assist healthcare personnel in using the electronic medical record to construct patient-specific searches of Medline. Design: The authors tested the performances of PostDoc, Pindex, and a hybrid system, using text taken from randomly selected clinical records, which were stratified to include six radiology reports, six pathology reports, and six discharge summaries. They identified concepts in the clinical records that might conceivably be used in performing a patient-specific Medline search. Each system was given the free text of each record as an input. The extent to which a system-derived list of MeSH terms captured the relevant concepts in these documents was determined based on blinded assessments by the authors. Results: PostDoc output a mean of approximately 19 MeSH terms per report, which included about 40% of the relevant report concepts. Pindex output a mean of approximately 57 terms per report and captured about 45% of the relevant report concepts. A hybrid system captured approximately 66% of the relevant concepts and output about 71 terms per report. Conclusion: The outputs of PostDoc and Pindex are complementary in capturing MeSH terms from clinical free text. The results suggest possible approaches to reduce the number of terms output while maintaining the percentage of terms captured, including the use of UMLS semantic types to constrain the output list to contain only clinically relevant MeSH terms. PMID:9452986
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.

PubMed

Weng, Wei-Hung; Wagholikar, Kavishwar B; McCray, Alexa T; Szolovits, Peter; Chueh, Henry C

2017-12-01

The medical subdomain of a clinical note, such as cardiology or neurology, is useful content-derived metadata for developing machine learning downstream applications. To classify the medical subdomain of a note accurately, we have constructed a machine learning-based natural language processing (NLP) pipeline and developed medical subdomain classifiers based on the content of the note. We constructed the pipeline using the clinical NLP system, clinical Text Analysis and Knowledge Extraction System (cTAKES), the Unified Medical Language System (UMLS) Metathesaurus, Semantic Network, and learning algorithms to extract features from two datasets - clinical notes from Integrating Data for Analysis, Anonymization, and Sharing (iDASH) data repository (n = 431) and Massachusetts General Hospital (MGH) (n = 91,237), and built medical subdomain classifiers with different combinations of data representation methods and supervised learning algorithms. We evaluated the performance of classifiers and their portability across the two datasets. The convolutional recurrent neural network with neural word embeddings trained-medical subdomain classifier yielded the best performance measurement on iDASH and MGH datasets with area under receiver operating characteristic curve (AUC) of 0.975 and 0.991, and F1 scores of 0.845 and 0.870, respectively. Considering better clinical interpretability, linear support vector machine-trained medical subdomain classifier using hybrid bag-of-words and clinically relevant UMLS concepts as the feature representation, with term frequency-inverse document frequency (tf-idf)-weighting, outperformed other shallow learning classifiers on iDASH and MGH datasets with AUC of 0.957 and 0.964, and F1 scores of 0.932 and 0.934 respectively. We trained classifiers on one dataset, applied to the other dataset and yielded the threshold of F1 score of 0.7 in classifiers for half of the medical subdomains we studied. Our study shows that a supervised learning-based NLP approach is useful to develop medical subdomain classifiers. The deep learning algorithm with distributed word representation yields better performance yet shallow learning algorithms with the word and concept representation achieves comparable performance with better clinical interpretability. Portable classifiers may also be used across datasets from different institutions.

Towards refactoring the Molecular Function Ontology with a UML profile for function modeling.

PubMed

Burek, Patryk; Loebe, Frank; Herre, Heinrich

2017-10-04

Gene Ontology (GO) is the largest resource for cataloging gene products. This resource grows steadily and, naturally, this growth raises issues regarding the structure of the ontology. Moreover, modeling and refactoring large ontologies such as GO is generally far from being simple, as a whole as well as when focusing on certain aspects or fragments. It seems that human-friendly graphical modeling languages such as the Unified Modeling Language (UML) could be helpful in connection with these tasks. We investigate the use of UML for making the structural organization of the Molecular Function Ontology (MFO), a sub-ontology of GO, more explicit. More precisely, we present a UML dialect, called the Function Modeling Language (FueL), which is suited for capturing functions in an ontologically founded way. FueL is equipped, among other features, with language elements that arise from studying patterns of subsumption between functions. We show how to use this UML dialect for capturing the structure of molecular functions. Furthermore, we propose and discuss some refactoring options concerning fragments of MFO. FueL enables the systematic, graphical representation of functions and their interrelations, including making information explicit that is currently either implicit in MFO or is mainly captured in textual descriptions. Moreover, the considered subsumption patterns lend themselves to the methodical analysis of refactoring options with respect to MFO. On this basis we argue that the approach can increase the comprehensibility of the structure of MFO for humans and can support communication, for example, during revision and further development.
The feasibility of using UML to compare the impact of different brands of computer system on the clinical consultation.

PubMed

Kumarapeli, Pushpa; de Lusignan, Simon; Koczan, Phil; Jones, Beryl; Sheeler, Ian

2007-01-01

UK general practice is universally computerised, with computers used in the consulting room at the point of care. Practices use a range of different brands of computer system, which have developed organically to meet the needs of general practitioners and health service managers. Unified Modelling Language (UML) is a standard modelling and specification notation widely used in software engineering. To examine the feasibility of UML notation to compare the impact of different brands of general practice computer system on the clinical consultation. Multi-channel video recordings of simulated consultation sessions were recorded on three different clinical computer systems in common use (EMIS, iSOFT Synergy and IPS Vision). User action recorder software recorded time logs of keyboard and mouse use, and pattern recognition software captured non-verbal communication. The outputs of these were used to create UML class and sequence diagrams for each consultation. We compared 'definition of the presenting problem' and 'prescribing', as these tasks were present in all the consultations analysed. Class diagrams identified the entities involved in the clinical consultation. Sequence diagrams identified common elements of the consultation (such as prescribing) and enabled comparisons to be made between the different brands of computer system. The clinician and computer system interaction varied greatly between the different brands. UML sequence diagrams are useful in identifying common tasks in the clinical consultation, and for contrasting the impact of the different brands of computer system on the clinical consultation. Further research is needed to see if patterns demonstrated in this pilot study are consistently displayed.
Enzymatic comparison and mortality of Beauveria bassiana against cabbage caterpillar Pieris brassicae LINN.

PubMed

Dhawan, Manish; Joshi, Neelam

Beauveria bassiana, an entomopathogenic fungus, is the alternative biocontrol agent exploited against major economic crop pests. Pieris brassicae L. is an emerging pest of the Brassicaceae family. Therefore, in the present study, fungal isolates of Beauveria bassiana, viz. MTCC 2028, MTCC 4495, MTCC 6291, and NBAII-11, were evaluated for their virulence against third instar larvae of P. brassicae. Among all these fungal isolates, maximum mortality (86.66%) was recorded in B. bassiana MTCC 4495 at higher concentration of spores (10 9 conidia/ml), and the minimum mortality (30.00%) was recorded in B. bassiana MTCC 6291 at a lower concentration (10 7 conidia/ml) after ten days of treatment. The extracellular cuticle-degrading enzyme activities of fungal isolates were measured. Variability was observed both in the pattern of enzyme secretion and the level of enzyme activities among various fungal isolates. B. bassiana MTCC 4495 recorded the maximum mean chitinase (0.51U/ml), protease (1.12U/ml), and lipase activities (1.36U/ml). The minimum mean chitinase and protease activities (0.37 and 0.91U/ml, respectively) were recorded in B. bassiana MTCC 6291. The minimum mean lipase activity (1.04U/ml) was recorded in B. bassiana NBAII-11. Our studies revealed B. bassiana MTCC 4495 as the most pathogenic isolate against P. brassicae, which also recorded maximum extracellular enzyme activities, suggesting the possible roles of extracellular enzymes in the pathogenicity of B. bassiana against P. brassicae. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Automated encoding of clinical documents based on natural language processing.

PubMed

Friedman, Carol; Shagina, Lyudmila; Lussier, Yves; Hripcsak, George

2004-01-01

The aim of this study was to develop a method based on natural language processing (NLP) that automatically maps an entire clinical document to codes with modifiers and to quantitatively evaluate the method. An existing NLP system, MedLEE, was adapted to automatically generate codes. The method involves matching of structured output generated by MedLEE consisting of findings and modifiers to obtain the most specific code. Recall and precision applied to Unified Medical Language System (UMLS) coding were evaluated in two separate studies. Recall was measured using a test set of 150 randomly selected sentences, which were processed using MedLEE. Results were compared with a reference standard determined manually by seven experts. Precision was measured using a second test set of 150 randomly selected sentences from which UMLS codes were automatically generated by the method and then validated by experts. Recall of the system for UMLS coding of all terms was .77 (95% CI.72-.81), and for coding terms that had corresponding UMLS codes recall was .83 (.79-.87). Recall of the system for extracting all terms was .84 (.81-.88). Recall of the experts ranged from .69 to .91 for extracting terms. The precision of the system was .89 (.87-.91), and precision of the experts ranged from .61 to .91. Extraction of relevant clinical information and UMLS coding were accomplished using a method based on NLP. The method appeared to be comparable to or better than six experts. The advantage of the method is that it maps text to codes along with other related information, rendering the coded output suitable for effective retrieval.
A Proposed Pattern of Enterprise Architecture

DTIC Science & Technology

2013-02-01

consistent architecture descriptions. UPDM comprises extensions to both OMG’s Unified Modelling Language (UML) and Systems Modelling Language ( SysML ...those who use UML and SysML . These represent significant advancements that enable architecture trade-off analyses, architecture model execution...Language ( SysML ), and thus provides for architectural descriptions that contain a rich set of (formally) connected DoDAF/MoDAF viewpoints expressed
Doclet To Synthesize UML

NASA Technical Reports Server (NTRS)

Barry, Matthew R.; Osborne, Richard N.

2005-01-01

The RoseDoclet computer program extends the capability of Java doclet software to automatically synthesize Unified Modeling Language (UML) content from Java language source code. [Doclets are Java-language programs that use the doclet application programming interface (API) to specify the content and format of the output of Javadoc. Javadoc is a program, originally designed to generate API documentation from Java source code, now also useful as an extensible engine for processing Java source code.] RoseDoclet takes advantage of Javadoc comments and tags already in the source code to produce a UML model of that code. RoseDoclet applies the doclet API to create a doclet passed to Javadoc. The Javadoc engine applies the doclet to the source code, emitting the output format specified by the doclet. RoseDoclet emits a Rose model file and populates it with fully documented packages, classes, methods, variables, and class diagrams identified in the source code. The way in which UML models are generated can be controlled by use of new Javadoc comment tags that RoseDoclet provides. The advantage of using RoseDoclet is that Javadoc documentation becomes leveraged for two purposes: documenting the as-built API and keeping the design documentation up to date.
Celiac disease: Serologic prevalence in patients with irritable bowel syndrome

PubMed Central

Mehdi, Zobeiri; Sakineh, Ebrahimi; Mohammad, Farahvash; Mansour, Rezaei; Alireza, Abdollahi

2012-01-01

Background: The prevalence of irritable bowel syndrome (IBS) in the community is 10%–20% and have symptom based diagnostic criteria. Many symptoms of celiac disease (CD) with 1% prevalence in some communities can mimic IBS. Sensitive and specific serologic tests of CD can detect asymptomatic cases. The purpose of this study was to compare the level of anti-tissue-transglutaminase (tTG) IgA in IBS patients and controls group. Materials and Methods: This case-control study was performed at a University hospital in which 107 patients with IBS who met the Rome II criteria for their diagnosis were compared with 126 healthy age and sex-matched controls. Both groups were investigated for CD by analysis of their serum tTG IgA antibody with human recombinant antigen. Titers were positive containing over 10u/ml and borderline if they were between 4 and 10 u/ml. Result: 86 percent of IBS patients were female. The mean antibody level was 0.837 u/ml in IBS group and 0.933 u/ml in control group without any significant difference. Discussion and Conclusion: Results of this study may intensify disagreement on the situation of CD in IBS patients. PMID:23826010
Design alternatives for process group membership and multicast

NASA Technical Reports Server (NTRS)

Birman, Kenneth P.; Cooper, Robert; Gleeson, Barry

1991-01-01

Process groups are a natural tool for distributed programming, and are increasingly important in distributed computing environments. However, there is little agreement on the most appropriate semantics for process group membership and group communication. These issues are of special importance in the Isis system, a toolkit for distributed programming. Isis supports several styles of process group, and a collection of group communication protocols spanning a range of atomicity and ordering properties. This flexibility makes Isis adaptable to a variety of applications, but is also a source of complexity that limits performance. This paper reports on a new architecture that arose from an effort to simplify Isis process group semantics. Our findings include a refined notion of how the clients of a group should be treated, what the properties of a multicast primitive should be when systems contain large numbers of overlapping groups, and a new construct called the casuality domain. As an illustration, we apply the architecture to the problem of converting processes into fault-tolerant process groups in a manner that is 'transparent' to other processes in the system.
Behavioural and electrophysiological effects related to semantic violations during braille reading.

PubMed

Glyn, Vania; Lim, Vanessa K; Hamm, Jeff P; Mathur, Ashwin; Hughes, Barry

2015-10-01

This study investigated the potential to detect event related potentials (ERPs) occurring in response to a specific task in braille reading. This would expand current methodologies for studying the cognitive processes underlying braille reading. An N400 effect paradigm was utilised, whereby proficient blind braille readers read congruent- and incongruent-ending braille sentences. Kinematic and electroencephalography (EEG) data were obtained simultaneously and synchronised. The ERPs differed between the incongruent and congruent sentences in a manner consistent with the N400 effect found with a previous sighted reading paradigm, demonstrating that ERPs can be obtained during braille reading. The frequency of finger reversals and the degree of intermittency in the finger velocity were significantly higher when reading incongruent versus congruent sentence endings. Both reversals and the potential N400 effect may reflect processes involved in semantic unification. These findings have significant implications for the modelling of braille reading. The refinement of the technique will enable other ERPs to be identified and related to behavioural responses, to further our understanding of the braille reading process. Copyright © 2015 Elsevier Ltd. All rights reserved.
Association of Anti-glycan Antibodies and Inflammatory Bowel Disease Course.

PubMed

Paul, S; Boschetti, G; Rinaudo-Gaujous, M; Moreau, A; Del Tedesco, E; Bonneau, J; Presles, E; Mounsef, F; Clavel, L; Genin, C; Flourié, B; Phelip, J-M; Nancey, S; Roblin, X

2015-06-01

The usefulness of anti-glycan antibodies alone or combined with anti-Saccharomyces cerevisiae [ASCA] or perinuclear antineutrophil cytoplasmic [pANCA] antibodies for diagnosis of inflammatory bowel disease [IBD], differentiation between Crohn's disease [CD] and ulcerative colitis [UC], disease stratification including IBD phenotype, and also for determination of the course of the disease, remain unclear. A large panel of serological anti-glycan carbohydrate antibodies, including anti-mannobioside IgG antibodies [AMCA], anti-chitobioside IgA [ACCA], anti-laminaribioside IgG antibodies [ALCA], anti-laminarin [anti-L] and anti-chitine [anti-C] were measured in the serum from a cohort of 195 patients with IBD] [107 CD and 88 UC]. The respective accuracy of isolated or combined markers for diagnosis, disease differentiation, stratification disease phenotype, and severity of the disease course, defined by a wide panel of criteria obtained from the past medical history, was assessed. The positivity of at least one anti-glycan antibody was detected in a significant higher proportion of CD and UC compared with healthy controls [p < 0.0001 and p < 0.0007, respectively]. Whereas ASCA and ANCA antibody status had the highest efficacy to be associated with CD in comparison with UC (area under receiver operating characteristic curve [AUROC] = 0.70 for each], the adjunction of anti-laminarin antibody substantially improved the differentiation between CD and UC [AUROC = 0.77]. Titres of ACCA [> 51U/ml] and anti-laminarin [> 31U/ml] were significantly linked with a higher association with steroid dependency (odds ratio [OR] =2.0 [1.0-4.0], p = 0.03 and OR = 2.4 [1.1-5.2], p = 0.02, respectively]. We further defined the respective performance of anti-glycan antibodies to discriminate between patients with severe or not severe CD and UC course and determined the associated optimal cut-off values: severe CD course was significantly more likely in case of AMCA > 77U/ml [OR = 4.3; p = 0.002], ASCA > 63U/ml [OR = 3.5; p < 0.009] and at a lesser degree ACCA > 50U/ml [OR = 2.8; p < 0.02] and severe UC course was significantly associated with AMCA > 52U/ml [OR = 3.4; p = 0.04] and ACCA > 25U/ml [OR = 3.0; p < 0.04]. Anti-glycan antibodies are valuable serological markers, especially AMCA antibodies that may help clinicians to promptly classify patients into high risk for severe disease. Copyright © 2015 European Crohn’s and Colitis Organisation (ECCO). Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Developing a Domain Ontology: the Case of Water Cycle and Hydrology

NASA Astrophysics Data System (ADS)

Gupta, H.; Pozzi, W.; Piasecki, M.; Imam, B.; Houser, P.; Raskin, R.; Ramachandran, R.; Martinez Baquero, G.

2008-12-01

A semantic web ontology enables semantic data integration and semantic smart searching. Several organizations have attempted to implement smart registration and integration or searching using ontologies. These are the NOESIS (NSF project: LEAD) and HydroSeek (NSF project: CUAHS HIS) data discovery engines and the NSF project GEON. All three applications use ontologies to discover data from multiple sources and projects. The NASA WaterNet project was established to identify creative, innovative ways to bridge NASA research results to real world applications, linking decision support needs to available data, observations, and modeling capability. WaterNet (NASA project) utilized the smart query tool Noesis as a testbed to test whether different ontologies (and different catalog searches) could be combined to match resources with user needs. NOESIS contains the upper level SWEET ontology that accepts plug in domain ontologies to refine user search queries, reducing the burden of multiple keyword searches. Another smart search interface was that developed for CUAHSI, HydroSeek, that uses a multi-layered concept search ontology, tagging variables names from any number of data sources to specific leaf and higher level concepts on which the search is executed. This approach has proven to be quite successful in mitigating semantic heterogeneity as the user does not need to know the semantic specifics of each data source system but just uses a set of common keywords to discover the data for a specific temporal and geospatial domain. This presentation will show tests with Noesis and Hydroseek lead to the conclusion that the construction of a complex, and highly heterogeneous water cycle ontology requires multiple ontology modules. To illustrate the complexity and heterogeneity of a water cycle ontology, Hydroseek successfully utilizes WaterOneFlow to integrate data across multiple different data collections, such as USGS NWIS. However,different methodologies are employed by the Earth Science, the Hydrological, and Hydraulic Engineering Communities, and each community employs models that require different input data. If a sub-domain ontology is created for each of these,describing water balance calculations, then the resulting structure of the semantic network describing these various terms can be rather complex, heterogeneous, and overlapping, and will require "mapping" between equivalent terms in the ontologies, along with the development of an upper level conceptual or domain ontology to utilize and link to those already in existence.
The Best of all Possible Worlds: Applying the Model Driven Architecture Approach to a JC3IEDM OWL Ontology Modeled in UML

DTIC Science & Technology

2014-04-25

EA’s Java application programming interface (API), the team built a tool called OWL2EA that can ingest an OWL file and generate the corresponding UML...ObjectItemStructure specification shown in Figure 10. Running this script in the relational database server MySQL creates the physical schema that
Vertical Mixing in the Dead Sea

NASA Astrophysics Data System (ADS)

Gertman, Isaac; Ozer, Tal; Katsenelson, Boris; Lensky, Nadav

2015-04-01

For hundreds of years, the Dead Sea was characterized by a stable haline stratification, supported by runoff. The penetration of the winter convection was limited to an upper mixed layer (UML) of about 30-50 m. Below the UML, a stable halocline prevented the mixing. As a result of the runoff reduction, the UML salinity increased and the gravitational stability diminished. During the winter of 1978-1979, the sea water overturned, ending the long-term stable hydrological regime. Since 1979, the haline stratification structure reoccurred twice after extremely rainy winters, in 1980-82 and 1992-1995. In other years, the sea was entirely mixed by winter thermal convection ( which occurs from November to March ) and had a seasonal pycnocline beneath the UML during summer. Profiles of temperature and quasi-salinity (density anomaly from 1000 kg/m3 for the chosen reference temperature of 32° C) during the last 19 years, show the formation of summer ``overturning halocline'' beneath the UML, and the thermocline that supports the stable stratification. Another warm and saline layer is formed also during the summer period near the bottom. This layer spreads from the southern part of the sea, where end-brine is discharged to the sea from the Israeli and Jordanian salt plants' evaporation ponds. The end-brine has extremely high salinity (˜ 350 g/kg) and, in spite of the high temperatures ( ˜ 45° C), high density (1350 kg/m^3), it therefore spreads as a gravitational current in the Dead Sea deep basin. Estimation of the density ratio (Rρ) for the Dead Sea water (where measurements of water salinity is quite difficult) was done using quasi-salinity (σ32) and potential temperature (θ): Rρ= [α(partialθ/partial z)]/[β(partial σ32/partial z)], where α and β are temperature expansion and quasi-salinity contraction coefficients respectively. The values of α and β for the Dead Sea water were defined from water samples collected during 2008. The Rρ values confirm that the summer Dead Sea thermohaline structure is appropriate for double diffusion mixing. A salt fingers regime beneath the UML (1.3< Rρ
Implementation of UML Schema to RDBM

NASA Astrophysics Data System (ADS)

Nagni, M.; Ventouras, S.; Parton, G.

2012-04-01

Multiple disciplines - especially those within the earth and physical sciences, and increasingly those within social science and medical fields - require Geographic Information (GI) i.e. information concerning phenomena implicitly or explicitly associated with a location relative to the Earth [1]. Therefore geographic datasets are increasingly being shared, exchanged and frequently used for purposes other than those for which they were originally intended. The ISO Technical Committee 211 (ISO/TC 211) together with Open Geospatial Consortium (OGC) provide a series of standards and guidelines for developing application schemas which should: a) capture relevant conceptual aspects of the data involved; and b) be sufficient to satisfy previously defined use-cases of a specific or cross-domain concerns. In addition, the Hollow World technology offers an accessible and industry-standardised methodology for creating and editing Application Schema UML models which conform to international standards for interoperable GI [2]. We present a technology which seamlessly transforms an Application Schema UML model to a relational database model (RDBM). This technology, using the same UML information model, complements the XML transformation of an information model produced by the FullMoon tool [2]. In preparation for the generation of a RDBM the UML model is first mapped to a collection of OO classes and relationships. Any external dependencies that exist are then resolved through the same mechanism. However, a RDBM does not support a hierarchical (relational) data structure - a function that may be required by UML models. Previous approaches have addressed this problem through use of nested sets or an adjacent list to represent such structure. Our unique strategy addresses the hierarchical data structure issue, whether singular or multiple inheritance, by hiding a delegation pattern within an OO class. This permits the object-relational mapping (ORM) software used to generate the RDBM to easily map the class into the RDBM. In other words the particular structure of the resulting OO class may expose a "composition-like aspect" to the ORM whilst maintaining an "inherited-like aspect" for use within an OO program. This methodology has been used to implement a software application to manages the new CEDA metadata model which is based on MOLES 3.4, Python, Django and SQLAlchemy.
Serum Paraoxonase Activity and Malondialdehyde Serum Concentrations Remain Unaffected in Response to Hydroxyurea Therapy in β-Thalassemia Patients.

PubMed

Zohaib, Muhammad; Ansari, Saqib H; Hashim, Zehra; Shamsi, Tahir S; Zarina, Shamshad

2016-07-01

β-Thalassemia is the most common hereditary disorder characterized by reduced production of β-globin chains of hemoglobin A (HbA). In recent years, hydroxyurea (HU) has shown promising therapeutic benefits in patients with β-thalassemia by fetal hemoglobin augmentation. We have analyzed effects of hydroxyurea treatment on oxidative stress in β-thalassemia patients by assessing activities of paraoxonase (PON) and arylesterase along with malondialdehyde (MDA) and total reactive oxygen species (ROS) concentrations. Blood samples from 159 individuals including 56 HU-treated and 58 untreated β-thalassemia patients and 45 healthy controls were analyzed. PON activity was found to be highest in healthy individuals (177.76 ± 4.44 U/mL) as compared to treated (52.67 ± 3.65 U/mL) and untreated (55.11 ± 3.26 U/mL) patients. A similar trend was observed in the case of arylesterase activity in normal, β-thalassemia-treated, and untreated (210.0 ± 11.25 U/mL, 163.03 ± 9.04 U/mL, 139.77 ± 10.10 U/mL) subjects. Serum MDA concentrations (2.59 ± 0.09 nmol/mL, 2.45 ± 0.08 nmol/mL, and 1.15 ± 0.05 nmol/mL) and total ROS concentrations (3.73 ± 0.20 nmol/mL, 3.54 ± 0.23 nmol/mL, and 2.45 ± 0.14 nmol/mL) were significantly elevated in both groups (untreated and treated) as compared to healthy individuals (P < .01). Oxidative stress was found to be markedly elevated in β-thalassemia patients as compared to healthy controls. Insignificant differences were, however, observed in mean concentrations of PON1 paraoxonase and arylesterase activities, serum MDA concentration and total ROS concentrations between HU-treated and untreated patients. We propose that HU therapy alone seems to be ineffective in managing oxidative stress and is likely to offer a better clinical outcome when supplemented with efficient iron chelation therapy and antioxidants. © 2015, The American College of Clinical Pharmacology.
Immunological monitoring after organ transplantation: potential role of soluble CD30 blood level measurement.

PubMed

Truong, Dinh Quang; Darwish, Ahmed A; Gras, Jérémie; Wieërs, Grégoire; Cornet, Anne; Robert, Annie; Mourad, Michel; Malaise, Jacques; de Ville de Goyet, Jean; Reding, Raymond; Latinne, Dominique

2007-06-01

Analysing the relevance of soluble CD30 (sCD30) in the bloodstream before and after transplantation may be important for the monitoring of transplant recipients. In this study, 27 patients (15 pediatric liver and 12 adult kidney graft recipients) were investigated. In the liver graft group, the patients who developed acute rejection during the first month (n=9) had a slightly higher sCD30 value on pre-transplantation baseline (day 0) and post-transplantation day 7, when compared to patients with normal graft function (n=6) (day 0: 102(1.6) U/ml versus 118(1.5) U/ml, p=0.52) and (day 7: 69(1.5) U/ml versus 83(1.6) U/ml, p=0.47). Increased serum sCD30 was shown to correlate with increased interleukin-10 circulating levels between day 0 and day 7 (r=0.53; p=0.04), whereas, no correlation could be evidenced between interferon-gamma (IFN-gamma) and sCD30 (r=0.02; p=0.47). Similarly, in the kidney transplantation group, no significant difference was found in sCD30 levels at day 0 in both groups with graft rejection or normal graft function (n=6) (85(1.3) U/ml versus 77(1.6) U/ml, p=0.66), but sCD30 decreased significantly at day 7 post-transplantation from baseline value in the rejection group (n=6) (77(1.6) versus 35(1.4); p=0.02). We conclude that increased serum sCD30 was correlated with increased IL-10 (interleukin-10) circulating levels, but not with IFN-gamma levels in the post-transplantation period. Neither pre-transplantation sCD30 nor sCD30 at day 7 post-transplantation could be correlated with acute rejection in liver graft recipient. The monitoring of sCD30 might constitute a tool to assess the risk of acute rejection in renal transplant but did not appear as a valuable mean for early immunological monitoring in the small group of liver allograft recipients patients analysed in this study.
Feature engineering for MEDLINE citation categorization with MeSH.

PubMed

Jimeno Yepes, Antonio Jose; Plaza, Laura; Carrillo-de-Albornoz, Jorge; Mork, James G; Aronson, Alan R

2015-04-08

Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations. Traditional features like unigrams and bigrams exhibit strong performance compared to other feature sets. Little or no improvement is obtained when using meta-data or citation structure. Noun phrases are too sparse and thus have lower performance compared to more traditional features. Conceptual annotation of the texts by MetaMap shows similar performance compared to unigrams, but adding concepts from the UMLS taxonomy does not improve the performance of using only mapped concepts. The combination of all the features performs largely better than any individual feature set considered. In addition, this combination improves the performance of a state-of-the-art MeSH indexer. Concerning the machine learning algorithms, we find that those that are more resilient to class imbalance largely obtain better performance. We conclude that even though traditional features such as unigrams and bigrams have strong performance compared to other features, it is possible to combine them to effectively improve the performance of the bag-of-words representation. We have also found that the combination of the learning algorithm and feature sets has an influence in the overall performance of the system. Moreover, using learning algorithms resilient to class imbalance largely improves performance. However, when using a large set of features, consideration needs to be taken with algorithms due to the risk of over-fitting. Specific combinations of learning algorithms and features for individual MeSH headings could further increase the performance of an indexing system.
Mainstream web standards now support science data too

NASA Astrophysics Data System (ADS)

Richard, S. M.; Cox, S. J. D.; Janowicz, K.; Fox, P. A.

2017-12-01

The science community has developed many models and ontologies for representation of scientific data and knowledge. In some cases these have been built as part of coordinated frameworks. For example, the biomedical communities OBO Foundry federates applications covering various aspects of life sciences, which are united through reference to a common foundational ontology (BFO). The SWEET ontology, originally developed at NASA and now governed through ESIP, is a single large unified ontology for earth and environmental sciences. On a smaller scale, GeoSciML provides a UML and corresponding XML representation of geological mapping and observation data. Some of the key concepts related to scientific data and observations have recently been incorporated into domain-neutral mainstream ontologies developed by the World Wide Web consortium through their Spatial Data on the Web working group (SDWWG). OWL-Time has been enhanced to support temporal reference systems needed for science, and has been deployed in a linked data representation of the International Chronostratigraphic Chart. The Semantic Sensor Network ontology has been extended to cover samples and sampling, including relationships between samples. Gridded data and time-series is supported by applications of the statistical data-cube ontology (QB) for earth observations (the EO-QB profile) and spatio-temporal data (QB4ST). These standard ontologies and encodings can be used directly for science data, or can provide a bridge to specialized domain ontologies. There are a number of advantages in alignment with the W3C standards. The W3C vocabularies use discipline-neutral language and thus support cross-disciplinary applications directly without complex mappings. The W3C vocabularies are already aligned with the core ontologies that are the building blocks of the semantic web. The W3C vocabularies are each tightly scoped thus encouraging good practices in the combination of complementary small ontologies. The W3C vocabularies are hosted on well known, reliable infrastructure. The W3C SDWWG outputs are being selectively adopted by the general schema.org discovery framework.
Prognostic significance of preimmunotherapy serum CA27.29 (MUC-1) mucin level after active specific immunotherapy of metastatic adenocarcinoma patients.

PubMed

MacLean, G D; Reddish, M A; Longenecker, B M

1997-01-01

The TRUQUANT BR radioimmunoassay, which uses monoclonal antibody B27.29 to quantitate CA27.29 mucin antigen (MUC-1 gene product) in serum, has recently received Food and Drug Administration approval for predicting recurrent breast cancer in patients with stage II and III disease. The purpose of this study was to determine whether the new radioimmunoassay for serum MUC-1 has prognostic significance for patients with metastatic adenocarcinoma receiving active specific immunotherapy (ASI). Using 40 U/ml as the upper limit of "normal," patients with metastatic breast and ovarian cancer with a preimmunotherapy serum CA27.29 mucin > 40 U/ml (CA27.29 Hi patients) had a poorer survival than CA27.29 Lo patients (< or = 40 U/ml) after ASI. There was no significant correlation between preimmunotherapy CA27.29 serum levels and measurable tumor burden. The preimmunotherapy CA27.29 serum level was a predictor of poor survival of metastatic colorectal and pancreatic cancer patients independent of other prognostic factors. There seemed to be two populations of pancreatic cancer patients, separated at 60 U/ml serum CA27.29 (CA27.29 Hi versus Lo patients). A CA27.29 serum level of 22 U/ml separated patients with CA27.29 Hi vs. Lo colorectal cancer. Patients with CA27.29 Lo colorectal and pancreatic cancer survived longer after ASI compared with patients with CA27.29 Hi colorectal and pancreatic cancer, respectively. We suggest that various CA27.29 serum levels define poor prognosis patients (CA27.29 Hi secretors) versus good prognosis patients (CA27.29 Lo secretors) for different cancer types.
Evaluation of emergency medical text processor, a system for cleaning chief complaint text data.

PubMed

Travers, Debbie A; Haas, Stephanie W

2004-11-01

Emergency Medical Text Processor (EMT-P) version 1, a natural language processing system that cleans emergency department text (e.g., chst pn, chest pai), was developed to maximize extraction of standard terms (e.g., chest pain). The authors compared the number of standard terms extracted from raw chief complaint (CC) data with that for CC data cleaned with EMT-P and evaluated the accuracy of EMT-P. This cross-sectional observation study included CC text entries for all emergency department visits to three tertiary care centers in 2001. Terms were extracted from CC entries before and after cleaning with EMT-P. Descriptive statistics included number and percentage of all entries (tokens) and all unique entries (types) that matched a standard term from the Unified Medical Language System (UMLS). An expert panel rated the accuracy of the CC-UMLS term matches; inter-rater reliability was measured with kappa. The authors collected 203,509 CC entry tokens, of which 63,946 were unique entry types. For the raw data, 89,337 tokens (44%) and 5,081 types (8%) matched a standard term. After EMT-P cleaning, 168,050 tokens (83%) and 44,430 types (69%) matched a standard term. The expert panel reached consensus on 201 of the 222 CC-UMLS term matches reviewed (kappa=0.69-0.72). Ninety-six percent of the 201 matches were rated equivalent or related. Thirty-eight percent of the nonmatches were found to match UMLS concepts. EMT-P version 1 is relatively accurate, and cleaning with EMT-P improved the CC-UMLS term match rate over raw data. The authors identified areas for improvement in future EMT-P versions and issues to be resolved in developing a standard CC terminology.

Preliminary design of a universal Martian lander

NASA Astrophysics Data System (ADS)

Norman, Timothy L.; Gaskin, David E.; Adkins, Sean; Gunawan, Mary; Johnson, Raquel; Macdonnell, David; Parlock, Andrew; Sarick, John; Bodwell, Charles; Hashimoto, Kouichi

In the next 25 years, mankind will be undertaking yet another giant leap forward in the exploration of the solar system: a manned mission to Mars. This journey will provide important information on the composition and history of both Mars and the Solar System. A manned mission will also provide the opportunity to study how humans can adapt to long term space flight conditions and the Martian environment. As part of the NASA/USRA program, nineteen West Virginia University students conducted a preliminary design of a manned Universal Martian Lander (UML). The UML's design will provide a 'universal' platform, consisting of four modules for living and laboratory experiments and a liquid-fuel propelled Manned Ascent Return Vehicle (MARV). The distinguishing feature of the UML is the 'universal' design of the modules which can be connected to form a network of laboratories and living quarters for future missions thereby reducing development and production costs. The WVU design considers descent to Mars from polar orbit, a six month surface stay, and ascent for rendezvous. The design begins with an unmanned UML landing at Elysium Mons followed by the manned UML landing nearby. During the six month surface stay, the eight modules will be assembled to form a Martian base where scientific experiments will be performed. The mission will also incorporate hydroponic plant growth into a Controlled Ecological Life Support System (CELSS) for water recycling, food production, and to counteract psychological effects of living on Mars. In situ fuel production for the MARV will be produced from gases in the Martian atmosphere. Following surface operations, the eight member crew will use the MARV to return to the Martian Transfer Vehicle (MTV) for the journey home to Earth.
COHeRE: Cross-Ontology Hierarchical Relation Examination for Ontology Quality Assurance.

PubMed

Cui, Licong

Biomedical ontologies play a vital role in healthcare information management, data integration, and decision support. Ontology quality assurance (OQA) is an indispensable part of the ontology engineering cycle. Most existing OQA methods are based on the knowledge provided within the targeted ontology. This paper proposes a novel cross-ontology analysis method, Cross-Ontology Hierarchical Relation Examination (COHeRE), to detect inconsistencies and possible errors in hierarchical relations across multiple ontologies. COHeRE leverages the Unified Medical Language System (UMLS) knowledge source and the MapReduce cloud computing technique for systematic, large-scale ontology quality assurance work. COHeRE consists of three main steps with the UMLS concepts and relations as the input. First, the relations claimed in source vocabularies are filtered and aggregated for each pair of concepts. Second, inconsistent relations are detected if a concept pair is related by different types of relations in different source vocabularies. Finally, the uncovered inconsistent relations are voted according to their number of occurrences across different source vocabularies. The voting result together with the inconsistent relations serve as the output of COHeRE for possible ontological change. The highest votes provide initial suggestion on how such inconsistencies might be fixed. In UMLS, 138,987 concept pairs were found to have inconsistent relationships across multiple source vocabularies. 40 inconsistent concept pairs involving hierarchical relationships were randomly selected and manually reviewed by a human expert. 95.8% of the inconsistent relations involved in these concept pairs indeed exist in their source vocabularies rather than being introduced by mistake in the UMLS integration process. 73.7% of the concept pairs with suggested relationship were agreed by the human expert. The effectiveness of COHeRE indicates that UMLS provides a promising environment to enhance qualities of biomedical ontologies by performing cross-ontology examination.
Auditing the multiply-related concepts within the UMLS.

PubMed

Mougin, Fleur; Grabar, Natalia

2014-10-01

This work focuses on multiply-related Unified Medical Language System (UMLS) concepts, that is, concepts associated through multiple relations. The relations involved in such situations are audited to determine whether they are provided by source vocabularies or result from the integration of these vocabularies within the UMLS. We study the compatibility of the multiple relations which associate the concepts under investigation and try to explain the reason why they co-occur. Towards this end, we analyze the relations both at the concept and term levels. In addition, we randomly select 288 concepts associated through contradictory relations and manually analyze them. At the UMLS scale, only 0.7% of combinations of relations are contradictory, while homogeneous combinations are observed in one-third of situations. At the scale of source vocabularies, one-third do not contain more than one relation between the concepts under investigation. Among the remaining source vocabularies, seven of them mainly present multiple non-homogeneous relations between terms. Analysis at the term level also shows that only in a quarter of cases are the source vocabularies responsible for the presence of multiply-related concepts in the UMLS. These results are available at: http://www.isped.u-bordeaux2.fr/ArticleJAMIA/results_multiply_related_concepts.aspx. Manual analysis was useful to explain the conceptualization difference in relations between terms across source vocabularies. The exploitation of source relations was helpful for understanding why some source vocabularies describe multiple relations between a given pair of terms. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Preliminary design of a universal Martian lander

NASA Technical Reports Server (NTRS)

Norman, Timothy L.; Gaskin, David E.; Adkins, Sean; Gunawan, Mary; Johnson, Raquel; Macdonnell, David; Parlock, Andrew; Sarick, John; Bodwell, Charles; Hashimoto, Kouichi

1993-01-01

In the next 25 years, mankind will be undertaking yet another giant leap forward in the exploration of the solar system: a manned mission to Mars. This journey will provide important information on the composition and history of both Mars and the Solar System. A manned mission will also provide the opportunity to study how humans can adapt to long term space flight conditions and the Martian environment. As part of the NASA/USRA program, nineteen West Virginia University students conducted a preliminary design of a manned Universal Martian Lander (UML). The UML's design will provide a 'universal' platform, consisting of four modules for living and laboratory experiments and a liquid-fuel propelled Manned Ascent Return Vehicle (MARV). The distinguishing feature of the UML is the 'universal' design of the modules which can be connected to form a network of laboratories and living quarters for future missions thereby reducing development and production costs. The WVU design considers descent to Mars from polar orbit, a six month surface stay, and ascent for rendezvous. The design begins with an unmanned UML landing at Elysium Mons followed by the manned UML landing nearby. During the six month surface stay, the eight modules will be assembled to form a Martian base where scientific experiments will be performed. The mission will also incorporate hydroponic plant growth into a Controlled Ecological Life Support System (CELSS) for water recycling, food production, and to counteract psychological effects of living on Mars. In situ fuel production for the MARV will be produced from gases in the Martian atmosphere. Following surface operations, the eight member crew will use the MARV to return to the Martian Transfer Vehicle (MTV) for the journey home to Earth.
Production, purification and application of extracellular chitinase from Cellulosimicrobium cellulans 191

PubMed Central

Fleuri, Luciana F.; Kawaguti, Haroldo Y.; Sato, Hélia H.

2009-01-01

This study concerned the production, purification and application of extracellular chitinase from Cellulosimicrobium cellulans strain 191. In shaken flasks the maximum yield of chitinase was 6.9 U/mL after 72 h of cultivation at 25°C and 200 rpm. In a 5 L fermenter with 1.5 vvm aeration, the highest yield obtained was 4.19 U/mL after 168 h of fermentation at 25°C and 200 rpm, and using 3 vvm, it was 4.38 U/mL after 144 h of fermentation. The chitinase (61 KDa) was purified about 6.65 times by Sepharose CL 4B 200 gel filtration with a yield of 46.61%. The purified enzyme was able to lyse the cell walls of some fungi and to form protoplasts. PMID:24031407
Connecting Architecture and Implementation

NASA Astrophysics Data System (ADS)

Buchgeher, Georg; Weinreich, Rainer

Software architectures are still typically defined and described independently from implementation. To avoid architectural erosion and drift, architectural representation needs to be continuously updated and synchronized with system implementation. Existing approaches for architecture representation like informal architecture documentation, UML diagrams, and Architecture Description Languages (ADLs) provide only limited support for connecting architecture descriptions and implementations. Architecture management tools like Lattix, SonarJ, and Sotoarc and UML-tools tackle this problem by extracting architecture information directly from code. This approach works for low-level architectural abstractions like classes and interfaces in object-oriented systems but fails to support architectural abstractions not found in programming languages. In this paper we present an approach for linking and continuously synchronizing a formalized architecture representation to an implementation. The approach is a synthesis of functionality provided by code-centric architecture management and UML tools and higher-level architecture analysis approaches like ADLs.
Allergy and parasites: the measurement of total and specific IgE levels in urban and rural communities in Rhodesia.

PubMed

Merrett, T G; Merrett, J; Cookson, J B

1976-03-01

Eighty adult asthmatics living in an African city had a significantly higher serum IgE level (799 u/ml) than the control group (350 u/ml). A high proportion (78.7%) of the asthmatics had demonstrable circulating mite-specific IgE antibodies. The rural population of a filariasis endemic region was investigated and although no allergic subjects were identified, the group had a significantly higher IgE level (1613 u/ml) than the asthmatics and also showed a relatively high incidence of grass pollen-specific IgE antibodies (35%). The discrepancy between clinical history and laboratory results supports the mast cell saturation hypothesis and suggests: (a) an explanation for the susceptibility to allergy of African and Asian immigrants to Great Britain, and (b) a practical approach for preventing allergic reactions in vivo.
Polygalacturonase production by calcium alginate immobilized Enterobacter aerogenes NBO2 cells.

PubMed

Darah, I; Nisha, M; Lim, Sheh-Hong

2015-03-01

Bacterial cells of Enterobacter aerogenes NBO2 were entrapped in calcium alginate beads in order to enhance polygalacturonase production compared to free cells. The optimized condition of 5 % (w/v) sodium alginate concentration, agitation speed of 250 rpm, and 15 beads of calcium alginate with inoculum size of 4 % (v/v; 5.4 × 10(7) cells/ml) produced 23.48 U/mL of polygalacturonase compared to free cells of 18.54 U/ml. There was about 26.6 % increment in polygalaturonase production. However, in this study, there was 296.6 % of increment in polygalacturonase production after improvement parameters compared to before improvement parameters of calcium alginate bead immobilization cells (5.92 U/ml). This research has indicated that optimized physical parameters of calcium alginate bead immobilization cells have significantly enhanced the production of polygalacturonase.
UMLS content views appropriate for NLP processing of the biomedical literature vs. clinical text.

PubMed

Demner-Fushman, Dina; Mork, James G; Shooshan, Sonya E; Aronson, Alan R

2010-08-01

Identification of medical terms in free text is a first step in such Natural Language Processing (NLP) tasks as automatic indexing of biomedical literature and extraction of patients' problem lists from the text of clinical notes. Many tools developed to perform these tasks use biomedical knowledge encoded in the Unified Medical Language System (UMLS) Metathesaurus. We continue our exploration of automatic approaches to creation of subsets (UMLS content views) which can support NLP processing of either the biomedical literature or clinical text. We found that suppression of highly ambiguous terms in the conservative AutoFilter content view can partially replace manual filtering for literature applications, and suppression of two character mappings in the same content view achieves 89.5% precision at 78.6% recall for clinical applications. Published by Elsevier Inc.
Path generation algorithm for UML graphic modeling of aerospace test software

NASA Astrophysics Data System (ADS)

Qu, MingCheng; Wu, XiangHu; Tao, YongChao; Chen, Chao

2018-03-01

Aerospace traditional software testing engineers are based on their own work experience and communication with software development personnel to complete the description of the test software, manual writing test cases, time-consuming, inefficient, loopholes and more. Using the high reliability MBT tools developed by our company, the one-time modeling can automatically generate test case documents, which is efficient and accurate. UML model to describe the process accurately express the need to rely on the path is reached, the existing path generation algorithm are too simple, cannot be combined into a path and branch path with loop, or too cumbersome, too complicated arrangement generates a path is meaningless, for aerospace software testing is superfluous, I rely on our experience of ten load space, tailor developed a description of aerospace software UML graphics path generation algorithm.
Automated semantic indexing of figure captions to improve radiology image retrieval.

PubMed

Kahn, Charles E; Rubin, Daniel L

2009-01-01

We explored automated concept-based indexing of unstructured figure captions to improve retrieval of images from radiology journals. The MetaMap Transfer program (MMTx) was used to map the text of 84,846 figure captions from 9,004 peer-reviewed, English-language articles to concepts in three controlled vocabularies from the UMLS Metathesaurus, version 2006AA. Sampling procedures were used to estimate the standard information-retrieval metrics of precision and recall, and to evaluate the degree to which concept-based retrieval improved image retrieval. Precision was estimated based on a sample of 250 concepts. Recall was estimated based on a sample of 40 concepts. The authors measured the impact of concept-based retrieval to improve upon keyword-based retrieval in a random sample of 10,000 search queries issued by users of a radiology image search engine. Estimated precision was 0.897 (95% confidence interval, 0.857-0.937). Estimated recall was 0.930 (95% confidence interval, 0.838-1.000). In 5,535 of 10,000 search queries (55%), concept-based retrieval found results not identified by simple keyword matching; in 2,086 searches (21%), more than 75% of the results were found by concept-based search alone. Concept-based indexing of radiology journal figure captions achieved very high precision and recall, and significantly improved image retrieval.
Biomedical Ontologies in Action: Role in Knowledge Management, Data Integration and Decision Support

PubMed Central

Bodenreider, O.

2008-01-01

Summary Objectives To provide typical examples of biomedical ontologies in action, emphasizing the role played by biomedical ontologies in knowledge management, data integration and decision support. Methods Biomedical ontologies selected for their practical impact are examined from a functional perspective. Examples of applications are taken from operational systems and the biomedical literature, with a bias towards recent journal articles. Results The ontologies under investigation in this survey include SNOMED CT, the Logical Observation Identifiers, Names, and Codes (LOINC), the Foundational Model of Anatomy, the Gene Ontology, RxNorm, the National Cancer Institute Thesaurus, the International Classification of Diseases, the Medical Subject Headings (MeSH) and the Unified Medical Language System (UMLS). The roles played by biomedical ontologies are classified into three major categories: knowledge management (indexing and retrieval of data and information, access to information, mapping among ontologies); data integration, exchange and semantic interoperability; and decision support and reasoning (data selection and aggregation, decision support, natural language processing applications, knowledge discovery). Conclusions Ontologies play an important role in biomedical research through a variety of applications. While ontologies are used primarily as a source of vocabulary for standardization and integration purposes, many applications also use them as a source of computable knowledge. Barriers to the use of ontologies in biomedical applications are discussed. PMID:18660879
What Four Million Mappings Can Tell You about Two Hundred Ontologies

NASA Astrophysics Data System (ADS)

Ghazvinian, Amir; Noy, Natalya F.; Jonquet, Clement; Shah, Nigam; Musen, Mark A.

The field of biomedicine has embraced the Semantic Web probably more than any other field. As a result, there is a large number of biomedical ontologies covering overlapping areas of the field. We have developed BioPortal—an open community-based repository of biomedical ontologies. We analyzed ontologies and terminologies in BioPortal and the Unified Medical Language System (UMLS), creating more than 4 million mappings between concepts in these ontologies and terminologies based on the lexical similarity of concept names and synonyms. We then analyzed the mappings and what they tell us about the ontologies themselves, the structure of the ontology repository, and the ways in which the mappings can help in the process of ontology design and evaluation. For example, we can use the mappings to guide users who are new to a field to the most pertinent ontologies in that field, to identify areas of the domain that are not covered sufficiently by the ontologies in the repository, and to identify which ontologies will serve well as background knowledge in domain-specific tools. While we used a specific (but large) ontology repository for the study, we believe that the lessons we learned about the value of a large-scale set of mappings to ontology users and developers are general and apply in many other domains.
Michaelis kinetic analysis of extracellular cellulase and amylase excreted by Lactobacillus plantarum during cassava fermentation

NASA Astrophysics Data System (ADS)

Frediansyah, Andri; Kurniadi, Muhamad

2017-01-01

Our previous study reveal that single culture of Lactobacillus plantarum has ability to ferment cassava tuber in relation to produce modified cassava flour (mocaf). It was used to accelerate a fermentation process. L. plantarum grow well and produce some extracellular enzymes i.e. cellulase to change the structure and breakdown the cell wall of cassava tuber. Then, the starchy materials will be hydrolyzed by i.e. amylase into simple sugar and convert to organic acid. All of these process will give new characteristic of cassava i.e. lower fiber content, good flavor, taste, aroma and texture and the amount of cyanide acid is lower. Therefore this present study was to analyze Michaelis kinetics of extracellular carboxymethyl cellulase and amylase production by L. plantarum during cassava fermentation. The maximum carboxymethyl cellulase and amylase activity of 8.60 U/ml and 14.07 U/ml, respectively, were obtained from filtrate which has been incubated at 37°C for 18 h under stationary conditions. The Vmax and Km of CMCase were 0.8506 × 10-3 U/ml and 0.9594 × 10-3 g/mL, respectively. For amylase were 9.291 × 10-3 U/ml and 0.9163 × 10-3 g/ml, respectively.
Rosen's (M,R) system in Unified Modelling Language.

PubMed

Zhang, Ling; Williams, Richard A; Gatherer, Derek

2016-01-01

Robert Rosen's (M,R) system is an abstract biological network architecture that is allegedly non-computable on a Turing machine. If (M,R) is truly non-computable, there are serious implications for the modelling of large biological networks in computer software. A body of work has now accumulated addressing Rosen's claim concerning (M,R) by attempting to instantiate it in various software systems. However, a conclusive refutation has remained elusive, principally since none of the attempts to date have unambiguously avoided the critique that they have altered the properties of (M,R) in the coding process, producing merely approximate simulations of (M,R) rather than true computational models. In this paper, we use the Unified Modelling Language (UML), a diagrammatic notation standard, to express (M,R) as a system of objects having attributes, functions and relations. We believe that this instantiates (M,R) in such a way than none of the original properties of the system are corrupted in the process. Crucially, we demonstrate that (M,R) as classically represented in the relational biology literature is implicitly a UML communication diagram. Furthermore, since UML is formally compatible with object-oriented computing languages, instantiation of (M,R) in UML strongly implies its computability in object-oriented coding languages. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Effect of immobile isolated enzymes from rumen liquid by using alginate matrices on the bay leaf extraction

NASA Astrophysics Data System (ADS)

Paramita, Vita; Yulianto, Mohammad Endy; Yohana, Eflita; Arifan, Fahmi; Hanifah, Amjad, Muhammad Taqiyuddin

2015-12-01

This research aims to develop the enzymatically of bay leaves phytochemical extraction process. The novelty and the main innovations of this research is the development of extraction process by using enzymatic extractor and isolate the enzymes from rumen liquid to shift the equilibrium phase, increase the extraction rate and increase the extraction yield. The activity of rumen liquid enzyme was represented by the activity of cellulase and protease. The analyze of total flavonoid content was performed by using UV-Vis Spectrofometry. The activity of immobilized enzyme of cellulase (0.08±0.00 U/ml) was lower than the un-immobilized one (0.23±0.00 U/ml). However, there was no difference activity of the immobilized (0.75±0.00 U/ml) and un-immobilized (0.76±0.01 U/ml) of protease. The model of mass transfer of un-immobilized enzyme can be fitted on the experimental data, however the model of mass transfer of immobilized enzyme did not match with the experimental data. The mass transfer coefficient of enzymatic extraction flavonoids bay leaf without immobilization was 0.17167 s-1 which greater than the reported value of obtained KLa from extraction by using electric heating.
Requirements' Role in Mobilizing and Enabling Design Conversation

NASA Astrophysics Data System (ADS)

Bergman, Mark

Requirements play a critical role in a design conversation of systems and products. Product and system design exists at the crossroads of problems, solutions and requirements. Requirements contextualize problems and solutions, pointing the way to feasible outcomes. These are captured with models and detailed specifications. Still, stakeholders need to be able to understand one-another using shared design representations in order to mobilize bias and transform knowledge towards legitimized, desired results. Many modern modeling languages, including UML, as well as detailed, logic-based specifications are beyond the comprehension of key stakeholders. Hence, they inhibit, rather than promote design conversation. Improved design boundary objects (DBO), especially design requirements boundary objects (DRBO), need to be created and refined to improve the communications between principals. Four key features of design boundary objects that improve and promote design conversation are discussed in detail. A systems analysis and design case study is presented which demonstrates these features in action. It describes how a small team of analysts worked with key stakeholders to mobilize and guide a complex system design discussion towards an unexpected, yet desired outcome within a short time frame.
Auto-Coding UML Statecharts for Flight Software

NASA Technical Reports Server (NTRS)

Benowitz, Edward G; Clark, Ken; Watney, Garth J.

2006-01-01

Statecharts have been used as a means to communicate behaviors in a precise manner between system engineers and software engineers. Hand-translating a statechart to code, as done on some previous space missions, introduces the possibility of errors in the transformation from chart to code. To improve auto-coding, we have developed a process that generates flight code from UML statecharts. Our process is being used for the flight software on the Space Interferometer Mission (SIM).
The relation between cerebral serotonin levels and conditioned behaviour in the rat following the administration of LSD-25 and UML.

PubMed

Torre, M; Torre, E; Bogetto, F

1975-01-01

Successive daily injections of LSD-25 and UML (1-methyl-d-lysergic acid butanolamide) caused progressive depression of brain 5-HT levels in the rat. On the fourth day, the decrease was significant with respect to the highly significant fall observed after a single administration, whereas it had been shown earlier that conditioned behaviour is no longer affected by LSD-25 after 3 days and that simultaneous administration of a single dose of LSD-25 and UML is equally ineffective in this respect. Its depression of 5-HT levels, however, has now been shown to be equal to that of LSD-25 alone at doses that influence conditioned behaviour. The findings indicate that changes in such behaviour are not dependent on brain 5-HT levels and that no link exists between such levels and the psychotomimetic effect of LSD-25 in man.
A UML-based meta-framework for system design in public health informatics.

PubMed

Orlova, Anna O; Lehmann, Harold

2002-01-01

The National Agenda for Public Health Informatics calls for standards in data and knowledge representation within public health, which requires a multi-level framework that links all aspects of public health. The literature of public health informatics and public health informatics application were reviewed. A UML-based systems analysis was performed. Face validity of results was evaluated in analyzing the public health domain of lead poisoning. The core class of the UML-based system of public health is the Public Health Domain, which is associated with multiple Problems, for which Actors provide Perspectives. Actors take Actions that define, generate, utilize and/or evaluate Data Sources. The life cycle of the domain is a sequence of activities attributed to its problems that spirals through multiple iterations and realizations within a domain. The proposed Public Health Informatics Meta-Framework broadens efforts in applying informatics principles to the field of public health

Modeling a Nursing Guideline with Standard Terminology and Unified Modeling Language for a Nursing Decision Support System: A Case Study.

PubMed

Choi, Jeeyae; Jansen, Kay; Coenen, Amy

In recent years, Decision Support Systems (DSSs) have been developed and used to achieve "meaningful use". One approach to developing DSSs is to translate clinical guidelines into a computer-interpretable format. However, there is no specific guideline modeling approach to translate nursing guidelines to computer-interpretable guidelines. This results in limited use of DSSs in nursing. Unified modeling language (UML) is a software writing language known to accurately represent the end-users' perspective, due to its expressive characteristics. Furthermore, standard terminology enabled DSSs have been shown to smoothly integrate into existing health information systems. In order to facilitate development of nursing DSSs, the UML was used to represent a guideline for medication management for older adults encode with the International Classification for Nursing Practice (ICNP®). The UML was found to be a useful and sufficient tool to model a nursing guideline for a DSS.
The Preliminary Design of a Universal Martian Lander

NASA Technical Reports Server (NTRS)

Norman, Timothy L.; Gaskin, David; Adkins, Sean; MacDonnell, David; Ross, Enoch; Hashimoto, Kouichi; Miller, Loran; Sarick, John; Hicks, Jonathan; Parlock, Andrew;

1993-01-01

As part of the NASA/USRA program, nineteen West Virginia University students conducted a preliminary design of a manned Universal Martian Lander (UML). The WVU design considers descent to Mars from polar orbit, a six month surface stay, and ascent for rendezvous. The design begins with an unmanned UML landing at Elysium Mons followed by the manned UML landing nearby. During the six month surface stay, the eight modules are assembled to form a Martian base where scientific experiments are performed. The mission also incorporates hydroponic plant growth into a Controlled Ecological Life Support System (CELSS) for water recycling, food production, and to counteract psycho-logical effects of living on Mars. In situ fuel production for the Martian Ascent and Rendezvous Vehicle (MARV) is produced From gases in the Martian atmosphere. Following surface operations, the eight member crew uses the MARV to return to the Martian Transfer Vehicle (MTV) for the journey home to Earth.

Protoplast fusion enhances lignocellulolytic enzyme activities in Trichoderma reesei.

PubMed

Cui, Yu-xiao; Liu, Jia-jing; Liu, Yan; Cheng, Qi-yue; Yu, Qun; Chen, Xin; Ren, Xiao-dong

2014-12-01

Protoplast fusion was used to obtain a higher production of lignocellulolytic enzymes with protoplast fusion in Trichoderma reesei. The fusant strain T. reesei JL6 was obtained from protoplast fusion from T. reesei strains QM9414, MCG77, and Rut C-30. Filter paper activity of T. reesei JL6 increased by 18% compared with that of Rut C-30. β-Glucosidase, hemicellulase and pectinase activities of T. reesei JL6 were also higher. The former activity was 0.39 Uml(-1), while those of QM9414, MCG77, and Rut C-30 were 0.13, 0.11, and 0.16 Uml(-1), respectively. Pectinase and hemicellulase activities of JL6 were 5.4 and 15.6 Uml(-1), respectively, which were slightly higher than those of the parents. The effects of corn stover and wheat bran carbon sources on the cellulase production and growth curve of T. reesei JL6 were also investigated.
BGen: A UML Behavior Network Generator Tool

NASA Technical Reports Server (NTRS)

Huntsberger, Terry; Reder, Leonard J.; Balian, Harry

2010-01-01

BGen software was designed for autogeneration of code based on a graphical representation of a behavior network used for controlling automatic vehicles. A common format used for describing a behavior network, such as that used in the JPL-developed behavior-based control system, CARACaS ["Control Architecture for Robotic Agent Command and Sensing" (NPO-43635), NASA Tech Briefs, Vol. 32, No. 10 (October 2008), page 40] includes a graph with sensory inputs flowing through the behaviors in order to generate the signals for the actuators that drive and steer the vehicle. A computer program to translate Unified Modeling Language (UML) Freeform Implementation Diagrams into a legacy C implementation of Behavior Network has been developed in order to simplify the development of C-code for behavior-based control systems. UML is a popular standard developed by the Object Management Group (OMG) to model software architectures graphically. The C implementation of a Behavior Network is functioning as a decision tree.
Leveraging terminological resources for mapping between rare disease information sources.

PubMed

Rance, Bastien; Snyder, Michelle; Lewis, Janine; Bodenreider, Olivier

2013-01-01

Rare disease information sources are incompletely and inconsistently cross-referenced to one another, making it difficult for information seekers to navigate across them. The development of such cross-references established manually by experts is generally labor intensive and costly. To develop an automatic mapping between two of the major rare diseases information sources, GARD and Orphanet, by leveraging terminological resources, especially the UMLS. We map the rare disease terms from Orphanet and ORDR to the UMLS. We use the UMLS as a pivot to bridge between the rare disease terminologies. We compare our results to a mapping obtained through manually established cross-references to OMIM. Our mapping has a precision of 94%, a recall of 63% and an F1-score of 76%. Our automatic mapping should help facilitate the development of more complete and consistent cross-references between GARD and Orphanet, and is applicable to other rare disease information sources as well.
Identifying UMLS concepts from ECG Impressions using KnowledgeMap

PubMed Central

Denny, Joshua C.; Spickard, Anderson; Miller, Randolph A; Schildcrout, Jonathan; Darbar, Dawood; Rosenbloom, S. Trent; Peterson, Josh F.

2005-01-01

Electrocardiogram (ECG) impressions represent a wealth of medical information for potential decision support and drug-effect discovery. Much of this information is inaccessible to automated methods in the free-text portion of the ECG report. We studied the application of the KnowledgeMap concept identifier (KMCI) to map Unified Medical Language System (UMLS) concepts from ECG impressions. ECGs were processed by KMCI and the results scored for accuracy by multiple raters. Reviewers also recorded unidentified concepts through the scoring interface. Overall, KMCI correctly identified 1059 out of 1171 concepts for a recall of 0.90. Precision, indicating the proportion of ECG concepts correctly identified, was 0.94. KMCI was particularly effective at identifying ECG rhythms (330/333), perfusion changes (65/66), and noncardiac medical concepts (11/11). In conclusion, KMCI is an effective method for mapping ECG impressions to UMLS concepts. PMID:16779029
Modeling a Nursing Guideline with Standard Terminology and Unified Modeling Language for a Nursing Decision Support System: A Case Study

PubMed Central

Choi, Jeeyae; Jansen, Kay; Coenen, Amy

2015-01-01

In recent years, Decision Support Systems (DSSs) have been developed and used to achieve “meaningful use”. One approach to developing DSSs is to translate clinical guidelines into a computer-interpretable format. However, there is no specific guideline modeling approach to translate nursing guidelines to computer-interpretable guidelines. This results in limited use of DSSs in nursing. Unified modeling language (UML) is a software writing language known to accurately represent the end-users’ perspective, due to its expressive characteristics. Furthermore, standard terminology enabled DSSs have been shown to smoothly integrate into existing health information systems. In order to facilitate development of nursing DSSs, the UML was used to represent a guideline for medication management for older adults encode with the International Classification for Nursing Practice (ICNP®). The UML was found to be a useful and sufficient tool to model a nursing guideline for a DSS. PMID:26958174
Feasibility and Utility of Lexical Analysis for Occupational Health Text.

PubMed

Harber, Philip; Leroy, Gondy

2017-06-01

Assess feasibility and potential utility of natural language processing (NLP) for storing and analyzing occupational health data. Basic NLP lexical analysis methods were applied to 89,000 Mine Safety and Health Administration (MSHA) free text records. Steps included tokenization, term and co-occurrence counts, term annotation, and identifying exposure-health effect relationships. Presence of terms in the Unified Medical Language System (UMLS) was assessed. The methods efficiently demonstrated common exposures, health effects, and exposure-injury relationships. Many workplace terms are not present in UMLS or map inaccurately. Use of free text rather than narrowly defined numerically coded fields is feasible, flexible, and efficient. It has potential to encourage workers and clinicians to provide more data and to support automated knowledge creation. The lexical method used is easily generalizable to other areas. The UMLS vocabularies should be enhanced to be relevant to occupational health.
The OMG Modelling Language (SYSML)

NASA Astrophysics Data System (ADS)

Hause, M.

2007-08-01

On July 6th 2006, the Object Management Group (OMG) announced the adoption of the OMG Systems Modeling Language (OMG SysML). The SysML specification was in response to the joint Request for Proposal issued by the OMG and INCOSE (the International Council on Systems Engineering) for a customized version of UML 2, designed to address the specific needs of system engineers. SysML is a visual modeling language that extends UML 2 in order to support the specification, analysis, design, verification and validation of complex systems. This paper will look at the background of SysML and summarize the SysML specification including the modifications to UML 2.0, along with the new requirement and parametric diagrams. It will also show how SysML artifacts can be used to specify the requirements for other solution spaces such as software and hardware to provide handover to other disciplines.
Levels of soluble CD30 in cord blood and peripheral blood during childhood are not correlated with the development of atopic disease or a family history of atopy.

PubMed

Holmlund, U; Bengtsson, A; Nilsson, C; Kusoffsky, E; Lilja, G; Scheynius, A; Sverremark-Ekström, E

2003-11-01

The CD30 molecule has been linked to Th2 responses. Furthermore, elevated levels of the soluble form of CD30 (sCD30) in blood as well as of the expression of CD30 on the plasma membrane of T cells are associated with atopic disease. To assess the potential usefulness of sCD30 levels as a prognostic indicator of and/or diagnostic marker for the development of atopic disease in children. sCD30 levels in cord blood and peripheral blood from 36 2-year-old (10 atopic and 26 non-atopic) and 74 7-year-old (35 atopic and 39 non-atopic) children were determined employing an ELISA procedure. Atopy was diagnosed on the basis of clinical evaluation in combination with a positive skin prick test. No significant correlation between sCD30 levels in cord blood and the development of atopic disease at 2 or 7 years of age was observed. At 7 years of age, the circulating sCD30 levels in children with atopic disease (median 41 U/mL, range 6-503 U/mL) did not differ from the corresponding values for non-atopic subjects (median 41 U/mL, range 8-402 U/mL). The same was true for children at 2 years of age. Furthermore, the sCD30 levels of children who had developed atopic eczema/dermatitis syndrome by the age of 7 years (median 49 U/mL, range 14-503 U/mL) were not significantly elevated in comparison with those of the non-atopic children. Finally, neither sCD30 levels in cord blood nor peripheral blood at 2 or 7 years of age could be linked to a family history of atopy. These findings indicate that the sCD30 concentration in cord blood is not a reliable prognostic indicator of, nor a useful diagnostic marker for, atopic disease in children up to 7 years of age. If such correlations do exist, they might be masked by age-dependent variations in the circulating levels of sCD30, which may reflect individual differences in the maturation of children's immunological responses.
Gauge equivalence of two different IAnsaaumlItze Rfor non-Abelian charged vortices

DOE Office of Scientific and Technical Information (OSTI.GOV)

Paul, S.K.

1987-05-15

Recently the existence of non-Abelian charged vortices has been established by taking two different Ansa$uml: tze in SU(2) gauge theories. We point out that these two Ansa$uml: tze are in two topologically equivalent prescriptions. We show that they are gauge equivalent only at infinity. We also show that this gauge equivalence is not possible for Z/sub N/ vortices in SU(N) gauge theories for Ngreater than or equal to3.
Defining a Technical Basis for Comparing and Contrasting Emerging Dynamic Discovery Protocols

DTIC Science & Technology

2001-05-02

UPnP, SLP, Bluetooth , and HAVi • Projected specific UML models for Jini, UPnP, and SLP • Completed a Rapide Model of Jini structure, function, and...narrow application focus but targeting a different application domain. (e.g., HAVi, Salutation Consortium, and Bluetooth Service Discovery) • Sun has...Our General Approach? 1/31/2002 7 Particulars of Our Approach Define a Generic UML Model that Encompasses Jini, UPnP, SLP, HAVi, and Bluetooth
Statistical Techniques Complement UML When Developing Domain Models of Complex Dynamical Biosystems.

PubMed

Williams, Richard A; Timmis, Jon; Qwarnstrom, Eva E

2016-01-01

Computational modelling and simulation is increasingly being used to complement traditional wet-lab techniques when investigating the mechanistic behaviours of complex biological systems. In order to ensure computational models are fit for purpose, it is essential that the abstracted view of biology captured in the computational model, is clearly and unambiguously defined within a conceptual model of the biological domain (a domain model), that acts to accurately represent the biological system and to document the functional requirements for the resultant computational model. We present a domain model of the IL-1 stimulated NF-κB signalling pathway, which unambiguously defines the spatial, temporal and stochastic requirements for our future computational model. Through the development of this model, we observe that, in isolation, UML is not sufficient for the purpose of creating a domain model, and that a number of descriptive and multivariate statistical techniques provide complementary perspectives, in particular when modelling the heterogeneity of dynamics at the single-cell level. We believe this approach of using UML to define the structure and interactions within a complex system, along with statistics to define the stochastic and dynamic nature of complex systems, is crucial for ensuring that conceptual models of complex dynamical biosystems, which are developed using UML, are fit for purpose, and unambiguously define the functional requirements for the resultant computational model.
Statistical Techniques Complement UML When Developing Domain Models of Complex Dynamical Biosystems

PubMed Central

Timmis, Jon; Qwarnstrom, Eva E.

2016-01-01

Computational modelling and simulation is increasingly being used to complement traditional wet-lab techniques when investigating the mechanistic behaviours of complex biological systems. In order to ensure computational models are fit for purpose, it is essential that the abstracted view of biology captured in the computational model, is clearly and unambiguously defined within a conceptual model of the biological domain (a domain model), that acts to accurately represent the biological system and to document the functional requirements for the resultant computational model. We present a domain model of the IL-1 stimulated NF-κB signalling pathway, which unambiguously defines the spatial, temporal and stochastic requirements for our future computational model. Through the development of this model, we observe that, in isolation, UML is not sufficient for the purpose of creating a domain model, and that a number of descriptive and multivariate statistical techniques provide complementary perspectives, in particular when modelling the heterogeneity of dynamics at the single-cell level. We believe this approach of using UML to define the structure and interactions within a complex system, along with statistics to define the stochastic and dynamic nature of complex systems, is crucial for ensuring that conceptual models of complex dynamical biosystems, which are developed using UML, are fit for purpose, and unambiguously define the functional requirements for the resultant computational model. PMID:27571414
Production of a biodegradable plastic-degrading enzyme from cheese whey by the phyllosphere yeast Pseudozyma antarctica GB-4(1)W.

PubMed

Watanabe, Takashi; Shinozaki, Yukiko; Suzuki, Ken; Koitabashi, Motoo; Yoshida, Shigenobu; Sameshima-Yamashita, Yuka; Kuze Kitamoto, Hiroko

2014-08-01

Cheese whey is a by-product of cheese production and has high concentrations of lactose (about 5%) and other nutrients. Pseudozyma antarctica produces a unique cutinase-like enzyme, named PaE, that efficiently degrades biodegradable plastics. A previous study showed that a combination of 1% oil and 0.5% lactose increased cutinase-like enzyme production by another species of yeast. In this study, to produce PaE from cheese whey, we investigated the effects of soybean oil on PaE production (expressed as biodegradable plastic-degrading activity) by P. antarctica growing on lactose or cheese whey. In flask cultures, the final PaE activity was only 0.03 U/ml when soybean oil was used as the sole carbon source, but increased to 1.79 U/ml when a limited amount of soybean oil (under 0.5%) was combined with a relatively high concentration of lactose (6%). Using a 5-L jar fermentor with lactose fed-batch cultivation and periodic soybean oil addition, about 14.6 U/ml of PaE was obtained after 5 days of cultivation. When the lactose was replaced with cheese whey, PaE production was 10.8 U/ml after 3 days of cultivation. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Prognostic value of preoperative serum CA 242 in Esophageal squamous cell carcinoma cases.

PubMed

Feng, Ji-Feng; Huang, Ying; Chen, Qi-Xun

2013-01-01

Carbohydrate antigen (CA) 242 is inversely related to prognosis in many cancers. However, few data regarding CA 242 in esophageal cancer (EC) are available. The aim of this study was to determine the prognostic value of CA 242 and propose an optimum cut-off point in predicting survival difference in patients with esophageal squamous cell carcinoma (ESCC). A retrospective analysis was conducted of 192 cases. A receiver operating characteristic (ROC) curve for survival prediction was plotted to verify the optimum cuf- off point. Univariate and multivariate analyses were performed to evaluate prognostic parameters for survival. The positive rate for CA 242 was 7.3% (14/192). The ROC curve for survival prediction gave an optimum cut-off of 2.15 (U/ml). Patients with CA 242 ≤ 2.15 U/ml had significantly better 5-year survival than patients with CA 242 >2.15 U/ml (45.4% versus 22.6%; P=0.003). Multivariate analysis showed that differentiation (P=0.033), CA 242 (P=0.017), T grade (P=0.004) and N staging (P<0.001) were independent prognostic factors. Preoperative CA 242 is a predictive factor for long-term survival in ESCC, especially in nodal-negative patients. We conclude that 2.15 U/ml may be the optimum cuf-off point for CA 242 in predicting survival in ESCC.
Semantic labeling of high-resolution aerial images using an ensemble of fully convolutional networks

NASA Astrophysics Data System (ADS)

Sun, Xiaofeng; Shen, Shuhan; Lin, Xiangguo; Hu, Zhanyi

2017-10-01

High-resolution remote sensing data classification has been a challenging and promising research topic in the community of remote sensing. In recent years, with the rapid advances of deep learning, remarkable progress has been made in this field, which facilitates a transition from hand-crafted features designing to an automatic end-to-end learning. A deep fully convolutional networks (FCNs) based ensemble learning method is proposed to label the high-resolution aerial images. To fully tap the potentials of FCNs, both the Visual Geometry Group network and a deeper residual network, ResNet, are employed. Furthermore, to enlarge training samples with diversity and gain better generalization, in addition to the commonly used data augmentation methods (e.g., rotation, multiscale, and aspect ratio) in the literature, aerial images from other datasets are also collected for cross-scene learning. Finally, we combine these learned models to form an effective FCN ensemble and refine the results using a fully connected conditional random field graph model. Experiments on the ISPRS 2-D Semantic Labeling Contest dataset show that our proposed end-to-end classification method achieves an overall accuracy of 90.7%, a state-of-the-art in the field.
Collaborative Filtering Recommendation on Users' Interest Sequences.

PubMed

Cheng, Weijie; Yin, Guisheng; Dong, Yuxin; Dong, Hongbin; Zhang, Wansong

2016-01-01

As an important factor for improving recommendations, time information has been introduced to model users' dynamic preferences in many papers. However, the sequence of users' behaviour is rarely studied in recommender systems. Due to the users' unique behavior evolution patterns and personalized interest transitions among items, users' similarity in sequential dimension should be introduced to further distinguish users' preferences and interests. In this paper, we propose a new collaborative filtering recommendation method based on users' interest sequences (IS) that rank users' ratings or other online behaviors according to the timestamps when they occurred. This method extracts the semantics hidden in the interest sequences by the length of users' longest common sub-IS (LCSIS) and the count of users' total common sub-IS (ACSIS). Then, these semantics are utilized to obtain users' IS-based similarities and, further, to refine the similarities acquired from traditional collaborative filtering approaches. With these updated similarities, transition characteristics and dynamic evolution patterns of users' preferences are considered. Our new proposed method was compared with state-of-the-art time-aware collaborative filtering algorithms on datasets MovieLens, Flixster and Ciao. The experimental results validate that the proposed recommendation method is effective and outperforms several existing algorithms in the accuracy of rating prediction.
Collaborative Filtering Recommendation on Users’ Interest Sequences

PubMed Central

Cheng, Weijie; Yin, Guisheng; Dong, Yuxin; Dong, Hongbin; Zhang, Wansong

2016-01-01

As an important factor for improving recommendations, time information has been introduced to model users’ dynamic preferences in many papers. However, the sequence of users’ behaviour is rarely studied in recommender systems. Due to the users’ unique behavior evolution patterns and personalized interest transitions among items, users’ similarity in sequential dimension should be introduced to further distinguish users’ preferences and interests. In this paper, we propose a new collaborative filtering recommendation method based on users’ interest sequences (IS) that rank users’ ratings or other online behaviors according to the timestamps when they occurred. This method extracts the semantics hidden in the interest sequences by the length of users’ longest common sub-IS (LCSIS) and the count of users’ total common sub-IS (ACSIS). Then, these semantics are utilized to obtain users’ IS-based similarities and, further, to refine the similarities acquired from traditional collaborative filtering approaches. With these updated similarities, transition characteristics and dynamic evolution patterns of users’ preferences are considered. Our new proposed method was compared with state-of-the-art time-aware collaborative filtering algorithms on datasets MovieLens, Flixster and Ciao. The experimental results validate that the proposed recommendation method is effective and outperforms several existing algorithms in the accuracy of rating prediction. PMID:27195787
Automatic registration of panoramic image sequence and mobile laser scanning data using semantic features

NASA Astrophysics Data System (ADS)

Li, Jianping; Yang, Bisheng; Chen, Chi; Huang, Ronggang; Dong, Zhen; Xiao, Wen

2018-02-01

Inaccurate exterior orientation parameters (EoPs) between sensors obtained by pre-calibration leads to failure of registration between panoramic image sequence and mobile laser scanning data. To address this challenge, this paper proposes an automatic registration method based on semantic features extracted from panoramic images and point clouds. Firstly, accurate rotation parameters between the panoramic camera and the laser scanner are estimated using GPS and IMU aided structure from motion (SfM). The initial EoPs of panoramic images are obtained at the same time. Secondly, vehicles in panoramic images are extracted by the Faster-RCNN as candidate primitives to be matched with potential corresponding primitives in point clouds according to the initial EoPs. Finally, translation between the panoramic camera and the laser scanner is refined by maximizing the overlapping area of corresponding primitive pairs based on the Particle Swarm Optimization (PSO), resulting in a finer registration between panoramic image sequences and point clouds. Two challenging urban scenes were experimented to assess the proposed method, and the final registration errors of these two scenes were both less than three pixels, which demonstrates a high level of automation, robustness and accuracy.

Enhancing clinical concept extraction with distributional semantics

PubMed Central

Cohen, Trevor; Wu, Stephen; Gonzalez, Graciela

2011-01-01

Extracting concepts (such as drugs, symptoms, and diagnoses) from clinical narratives constitutes a basic enabling technology to unlock the knowledge within and support more advanced reasoning applications such as diagnosis explanation, disease progression modeling, and intelligent analysis of the effectiveness of treatment. The recent release of annotated training sets of de-identified clinical narratives has contributed to the development and refinement of concept extraction methods. However, as the annotation process is labor-intensive, training data are necessarily limited in the concepts and concept patterns covered, which impacts the performance of supervised machine learning applications trained with these data. This paper proposes an approach to minimize this limitation by combining supervised machine learning with empirical learning of semantic relatedness from the distribution of the relevant words in additional unannotated text. The approach uses a sequential discriminative classifier (Conditional Random Fields) to extract the mentions of medical problems, treatments and tests from clinical narratives. It takes advantage of all Medline abstracts indexed as being of the publication type “clinical trials” to estimate the relatedness between words in the i2b2/VA training and testing corpora. In addition to the traditional features such as dictionary matching, pattern matching and part-of-speech tags, we also used as a feature words that appear in similar contexts to the word in question (that is, words that have a similar vector representation measured with the commonly used cosine metric, where vector representations are derived using methods of distributional semantics). To the best of our knowledge, this is the first effort exploring the use of distributional semantics, the semantics derived empirically from unannotated text often using vector space models, for a sequence classification task such as concept extraction. Therefore, we first experimented with different sliding window models and found the model with parameters that led to best performance in a preliminary sequence labeling task. The evaluation of this approach, performed against the i2b2/VA concept extraction corpus, showed that incorporating features based on the distribution of words across a large unannotated corpus significantly aids concept extraction. Compared to a supervised-only approach as a baseline, the micro-averaged f-measure for exact match increased from 80.3% to 82.3% and the micro-averaged f-measure based on inexact match increased from 89.7% to 91.3%. These improvements are highly significant according to the bootstrap resampling method and also considering the performance of other systems. Thus, distributional semantic features significantly improve the performance of concept extraction from clinical narratives by taking advantage of word distribution information obtained from unannotated data. PMID:22085698
Post-transplant monitoring of soluble CD30 level as predictor of graft outcome: a single center experience from China.

PubMed

Wang, Dong; Wu, Weizhen; Yang, Shunliang; Wang, Qinghua; Tan, Jianming

2012-12-01

There are no reliable parameters for post-transplantation immunological monitoring, which might enable recipient-tailored immunosuppressive therapy. 250 renal graft recipients were enrolled and detected for sCD30 level pre-transplantation, and on days 5 and 14, and on months 1, 3, 6, 12, 24, 36, 48 and 60 post-transplantation. Analysis was performed on correlation between sCD30 level and acute rejection, lung infection, or graft loss respectively. sCD30 levels descended to a nadir with a mean of 10.2 ± 3.8 U/mL on day 30 post-transplantation, then rose gradually, and approached 21.8 ± 10.1 U/mL on month 3, 34.2 ± 16.5 U/mL on month 6, and 42.9 ± 29.5 U/mL on month 12, then presented a stable level. Recipients with AR had significantly higher sCD30 levels than those without AR on days 5 and 14 post-transplantation. Recipients with pneumonia had significantly lower sCD30 levels within 3 months post-transplantation than those without pneumonia. Significantly higher sCD30 levels were recorded in recipients who suffered graft loss than those with normal graft function on days 5 and 14, and on months 6, 12, and 24. High sCD30 level (≥ 48.3 U/mL) at month 12 post-transplantation has an obvious detrimental effect on renal graft survival (p=0.000, HR=9.075). Serum sCD30 level might reflect immune state of renal graft recipients. Post-transplantation sequential monitoring of sCD30 level is necessary, which might not only identify recipients at the risk of acute rejection and graft loss, but also chosen as an independent predictor of pneumonia in renal transplant recipients. Copyright © 2012 Elsevier B.V. All rights reserved.
High soluble CD30 levels and associated anti-HLA antibodies in patients with failed renal allografts.

PubMed

Karahan, Gonca E; Caliskan, Yasar; Ozdilli, Kursat; Kekik, Cigdem; Bakkaloglu, Huseyin; Caliskan, Bahar; Turkmen, Aydin; Sever, Mehmet S; Oguz, Fatma S

2017-01-13

Serum soluble CD30 (sCD30), a 120-kD glycoprotein that belongs to the tumor necrosis factor receptor family, has been suggested as a marker of rejection in kidney transplant patients. The aim of this study was to evaluate the relationship between sCD30 levels and anti-HLA antibodies, and to compare sCD30 levels in patients undergoing hemodialysis (HD) with and without failed renal allografts and transplant recipients with functioning grafts. 100 patients undergoing HD with failed grafts (group 1), 100 patients undergoing HD who had never undergone transplantation (group 2), and 100 kidney transplant recipients (group 3) were included in this study. Associations of serum sCD30 levels and anti-HLA antibody status were analyzed in these groups. The sCD30 levels of group 1 and group 2 (154 ± 71 U/mL and 103 ± 55 U/mL, respectively) were significantly higher than those of the transplant recipients (group 3) (39 ± 21 U/mL) (p<0.001 and p<0.001). The serum sCD30 levels in group 1 (154 ± 71 U/mL) were also significantly higher than group 2 (103 ± 55 U/mL) (p<0.001). Anti-HLA antibodies were detected in 81 (81%) and 5 (5%) of patients in groups 1 and 2, respectively (p<0.001). When multiple regression analysis was performed to predict sCD30 levels, the independent variables in group 1 were the presence of class I anti-HLA antibodies (β = 0.295; p = 0.003) and age (β = -0.272; p = 0.005), and serum creatinine (β = 0.218; p = 0.027) and presence of class II anti-HLA antibodies (standardized β = 0.194; p = 0.046) in group 3. Higher sCD30 levels and anti-HLA antibodies in patients undergoing HD with failed renal allografts may be related to higher inflammatory status in these patients.
Improved Identification of Noun Phrases in Clinical Radiology Reports Using a High-Performance Statistical Natural Language Parser Augmented with the UMLS Specialist Lexicon

PubMed Central

Huang, Yang; Lowe, Henry J.; Klein, Dan; Cucina, Russell J.

2005-01-01

Objective: The aim of this study was to develop and evaluate a method of extracting noun phrases with full phrase structures from a set of clinical radiology reports using natural language processing (NLP) and to investigate the effects of using the UMLS® Specialist Lexicon to improve noun phrase identification within clinical radiology documents. Design: The noun phrase identification (NPI) module is composed of a sentence boundary detector, a statistical natural language parser trained on a nonmedical domain, and a noun phrase (NP) tagger. The NPI module processed a set of 100 XML-represented clinical radiology reports in Health Level 7 (HL7)® Clinical Document Architecture (CDA)–compatible format. Computed output was compared with manual markups made by four physicians and one author for maximal (longest) NP and those made by one author for base (simple) NP, respectively. An extended lexicon of biomedical terms was created from the UMLS Specialist Lexicon and used to improve NPI performance. Results: The test set was 50 randomly selected reports. The sentence boundary detector achieved 99.0% precision and 98.6% recall. The overall maximal NPI precision and recall were 78.9% and 81.5% before using the UMLS Specialist Lexicon and 82.1% and 84.6% after. The overall base NPI precision and recall were 88.2% and 86.8% before using the UMLS Specialist Lexicon and 93.1% and 92.6% after, reducing false-positives by 31.1% and false-negatives by 34.3%. Conclusion: The sentence boundary detector performs excellently. After the adaptation using the UMLS Specialist Lexicon, the statistical parser's NPI performance on radiology reports increased to levels comparable to the parser's native performance in its newswire training domain and to that reported by other researchers in the general nonmedical domain. PMID:15684131
Increased serum advanced glycation end-products is a distinct finding in lean women with polycystic ovary syndrome (PCOS).

PubMed

Diamanti-Kandarakis, Evanthia; Katsikis, Ilias; Piperi, Christina; Kandaraki, Eleni; Piouka, Athanasia; Papavassiliou, Athanasios G; Panidis, Dimitrios

2008-10-01

Nonenzymatic advanced glycation and oxidation end-products, advanced glycation end-products (AGEs), impart a potent impact on vessels and other tissues in diabetic state and in euglycaemic conditions with increased oxidative stress. Insulin resistant (IR) polycystic ovary syndrome (PCOS) women, have elevated serum AGEs, increased receptor (RAGE) expression, and increased deposition with differential localization in the polycystic ovarian tissue (theca and granulosa) compared to normal. To determine whether the raised AGE levels in noninsulin resistant women with PCOS is a distinct finding compared with those presenting the isolated components of the syndrome and among PCOS subphenotypes. Noninsulin resistant women were selected in order to show that serum AGEs are elevated in PCOS independently of the presence of IR. Clinical trial. One hundred and ninety-three age- and BMI-matched young lean noninsulin resistant women were studied. Among them, 100 women were diagnosed with PCOS according to Rotterdam criteria, and divided to subphenotypes (hyperandrogenaemia with or without PCO morphology and with or without anovulation). Sixty-eight women with the isolated components of the PCOS phenotype were also studied along with 25 healthy women. Serum AGE levels, metabolic, hormonal profiles and intravaginal ultrasound were determined in all subjects. The studied population did not differ in BMI, fasting insulin concentration, waist : hip and glucose : insulin ratios. PCOS women exhibited statistically higher AGEs levels (7.96 +/- 1.87 U/ml, P < 0.001) compared with those with isolated hyperandrogenaemia (5.61 +/- 0.61 U/ml), anovulation (5.53 +/- 1.06 U/ml), US-PCO morphology (5.26 +/- 0.25 U/ml) and controls (5.86 +/- 0.89 U/ml). In PCOS, serum AGEs are distinctly elevated compared with women having the isolated characteristics of the syndrome. No difference was observed between PCOS subphenotypes. As chronic inflammation and increased oxidant stress have been incriminated in the pathophysiology of PCOS, the role of AGEs as inflammatory and oxidant mediators, may be linked with the metabolic and reproductive abnormalities of the syndrome.
Graphical User Interface for an Observing Control System for the UK Infrared Telescope

NASA Astrophysics Data System (ADS)

Tan, M.; Bridger, A.; Wright, G. S.; Adamson, A. J.; Currie, M. J.; Economou, F.

A Graphical user interface for the observing control system of UK Infrared Telescope has been developed as a part of the ORAC (Observatory Reduction and Acquisition Control) Project. We analyzed and designed the system using the Unified Modelling Language (UML) with the CASE tool Rational Rose 98. The system has been implemented in a modular way with Java packages using Swing and RMI. This system is component-based with pluggability. Object orientation concepts and UML notations have been applied throughout the development.
A Formal Modelling Language Extending SysML for Simulation of Continuous and Discrete System

DTIC Science & Technology

2012-11-01

UNCLASSIFIED DSTO-GD-0734 16. A Formal Modelling Language Extending SysML for Simulation of Continuous and Discrete System – Mark Hodson1 and...be conceptual at some level because a one to one mapping with the real system will never exist. SysML is an extension and modification of UML that...simulation, which can provide great insights into the behaviour of complex systems. Although UML and SysML primarily support conceptual modelling they
Investigating the Role of Cyclin D1 in the Promotion of Genomic Instability and Breast Cancer

DTIC Science & Technology

2011-09-01

Tween-20, and protease/phosphatase inhibitors (1mM PMSF, 20U/mL aprotinin, 5mg/mL 12 leupeptin, 1mM DTT, 0.4mM NaF, and 10mM β- glycerophosphate ). Whole...inhibitors (1 mM PMSF, 20 U/ml aprotinin, 5 mg/ml leupeptin, 1 mM DTT, 0.4 mM NaF, and 10 mM b- glycerophosphate ), and protein concentration of samples was
ESIP Documentation Cluster Session: GCMD Keyword Update

NASA Technical Reports Server (NTRS)

Stevens, Tyler

2018-01-01

The Global Change Master Directory (GCMD) Keywords are a hierarchical set of controlled Earth Science vocabularies that help ensure Earth science data and services are described in a consistent and comprehensive manner and allow for the precise searching of collection-level metadata and subsequent retrieval of data and services. Initiated over twenty years ago, the GCMD Keywords are periodically analyzed for relevancy and will continue to be refined and expanded in response to user needs. This talk explores the current status of the GCMD keywords, the value and usage that the keywords bring to different tools/agencies as it relates to data discovery, and how the keywords relate to SWEET (Semantic Web for Earth and Environmental Terminology) Ontologies.
Learning to segment mouse embryo cells

NASA Astrophysics Data System (ADS)

León, Juan; Pardo, Alejandro; Arbeláez, Pablo

2017-11-01

Recent advances in microscopy enable the capture of temporal sequences during cell development stages. However, the study of such sequences is a complex task and time consuming task. In this paper we propose an automatic strategy to adders the problem of semantic and instance segmentation of mouse embryos using NYU's Mouse Embryo Tracking Database. We obtain our instance proposals as refined predictions from the generalized hough transform, using prior knowledge of the embryo's locations and their current cell stage. We use two main approaches to learn the priors: Hand crafted features and automatic learned features. Our strategy increases the baseline jaccard index from 0.12 up to 0.24 using hand crafted features and 0.28 by using automatic learned ones.
Alternative method for quantification of alfa-amylase activity.

PubMed

Farias, D F; Carvalho, A F U; Oliveira, C C; Sousa, N M; Rocha-Bezerrra, L C B; Ferreira, P M P; Lima, G P G; Hissa, D C

2010-05-01

A modification of the sensitive agar diffusion method was developed for macro-scale determination of alfa-amylase. The proposed modifications lower costs with the utilisation of starch as substrate and agar as supporting medium. Thus, a standard curve was built using alfa-amylase solution from Aspergillus oryzae, with concentrations ranging from 2.4 to 7,500 U.mL-1. Clear radial diffusion zones were measured after 4 hours of incubation at 20 A degrees C. A linear relationship between the logarithm of enzyme activities and the area of clear zones was obtained. The method was validated by testing alpha-amylase from barley at the concentrations of 2.4; 60; 300 and 1,500 U.mL-1. The proposed method turned out to be simpler, faster, less expensive and able to determine on a macro-scale alpha-amylase over a wide range (2.4 to 7,500 U.mL-1) in scientific investigation as well as in teaching laboratory activities.
Investigation of Marine-Derived Fungal Diversity and Their Exploitable Biological Activities

PubMed Central

Hong, Joo-Hyun; Jang, Seokyoon; Heo, Young Mok; Min, Mihee; Lee, Hwanhwi; Lee, Young Min; Lee, Hanbyul; Kim, Jae-Jin

2015-01-01

Marine fungi are potential producers of bioactive compounds that may have pharmacological and medicinal applications. Fungi were cultured from marine brown algae and identified using multiple target genes to confirm phylogenetic placement. These target genes included the internal transcribed spacer (ITS), the nuclear large subunit (LSU), and the β-tubulin region. Various biological activities of marine-derived fungi were evaluated, including their antifungal, antioxidant and cellulolytic enzyme activities. As a result, a total of 50 fungi was isolated from the brown algae Sargassum sp. Among the 50 isolated fungi, Corollospora angusta was the dominant species in this study. The genus Arthrinium showed a relatively strong antifungal activity to all of the target plant pathogenic fungi. In particular, Arthrinium saccharicola KUC21221 showed high radical scavenging activity and the highest activities in terms of filter paper units (0.39 U/mL), endoglucanase activity (0.38 U/mL), and β-glucosidase activity (1.04 U/mL). PMID:26133554
Leveraging Terminological Resources for Mapping between Rare Disease Information Sources

PubMed Central

Rance, Bastien; Snyder, Michelle; Lewis, Janine; Bodenreider, Olivier

2015-01-01

Background Rare disease information sources are incompletely and inconsistently cross-referenced to one another, making it difficult for information seekers to navigate across them. The development of such cross-references established manually by experts is generally labor intensive and costly. Objectives To develop an automatic mapping between two of the major rare diseases information sources, GARD and Orphanet, by leveraging terminological resources, especially the UMLS. Methods We map the rare disease terms from Orphanet and ORDR to the UMLS. We use the UMLS as a pivot to bridge between the rare disease terminologies. We compare our results to a mapping obtained through manually established cross-references to OMIM. Results Our mapping has a precision of 94%, a recall of 63% and an F1-score of 76%. Our automatic mapping should help facilitate the development of more complete and consistent cross-references between GARD and Orphanet, and is applicable to other rare disease information sources as well. PMID:23920611
A remote sensing computer-assisted learning tool developed using the unified modeling language

NASA Astrophysics Data System (ADS)

Friedrich, J.; Karslioglu, M. O.

The goal of this work has been to create an easy-to-use and simple-to-make learning tool for remote sensing at an introductory level. Many students struggle to comprehend what seems to be a very basic knowledge of digital images, image processing and image arithmetic, for example. Because professional programs are generally too complex and overwhelming for beginners and often not tailored to the specific needs of a course regarding functionality, a computer-assisted learning (CAL) program was developed based on the unified modeling language (UML), the present standard for object-oriented (OO) system development. A major advantage of this approach is an easier transition from modeling to coding of such an application, if modern UML tools are being used. After introducing the constructed UML model, its implementation is briefly described followed by a series of learning exercises. They illustrate how the resulting CAL tool supports students taking an introductory course in remote sensing at the author's institution.
Enzymatic production of α-ketoglutaric acid from l-glutamic acid via l-glutamate oxidase.

PubMed

Niu, Panqing; Dong, Xiaoxiang; Wang, Yuancai; Liu, Liming

2014-06-10

In this study, a novel strategy for α-ketoglutaric acid (α-KG) production from l-glutamic acid using recombinant l-glutamate oxidase (LGOX) was developed. First, by analyzing the molecular structure characteristics of l-glutamic acid and α-KG, LGOX was found to be the best catalyst for oxidizing the amino group of l-glutamic acid to a ketonic group without the need for exogenous cofactor. Then the LGOX gene was expressed in Escherichia coli BL21 (DE3) in a soluble and active form, and the recombinant LGOX activity reached to a maximum value of 0.59U/mL at pH 6.5, 30°C. Finally, the maximum α-KG concentration reached 104.7g/L from 110g/L l-glutamic acid in 24h, under the following optimum conditions: 1.5U/mL LGOX, 250U/mL catalase, 3mM MnCl2, 30°C, and pH 6.5. Copyright © 2014. Published by Elsevier B.V.
Dependability modeling and assessment in UML-based software development.

PubMed

Bernardi, Simona; Merseguer, José; Petriu, Dorina C

2012-01-01

Assessment of software nonfunctional properties (NFP) is an important problem in software development. In the context of model-driven development, an emerging approach for the analysis of different NFPs consists of the following steps: (a) to extend the software models with annotations describing the NFP of interest; (b) to transform automatically the annotated software model to the formalism chosen for NFP analysis; (c) to analyze the formal model using existing solvers; (d) to assess the software based on the results and give feedback to designers. Such a modeling→analysis→assessment approach can be applied to any software modeling language, be it general purpose or domain specific. In this paper, we focus on UML-based development and on the dependability NFP, which encompasses reliability, availability, safety, integrity, and maintainability. The paper presents the profile used to extend UML with dependability information, the model transformation to generate a DSPN formal model, and the assessment of the system properties based on the DSPN results.
Comparison of BrainTool to other UML modeling and model transformation tools

NASA Astrophysics Data System (ADS)

Nikiforova, Oksana; Gusarovs, Konstantins

2017-07-01

In the last 30 years there were numerous model generated software systems offered targeting problems with the development productivity and the resulting software quality. CASE tools developed due today's date are being advertised as having "complete code-generation capabilities". Nowadays the Object Management Group (OMG) is calling similar arguments in regards to the Unified Modeling Language (UML) models at different levels of abstraction. It is being said that software development automation using CASE tools enables significant level of automation. Actual today's CASE tools are usually offering a combination of several features starting with a model editor and a model repository for a traditional ones and ending with code generator (that could be using a scripting or domain-specific (DSL) language), transformation tool to produce the new artifacts from the manually created and transformation definition editor to define new transformations for the most advanced ones. Present paper contains the results of CASE tool (mainly UML editors) comparison against the level of the automation they are offering.
Performance of the commercially available SERION ELISA classic Echinococcus IgG test for the detection of cystic echinococcosis in clinical practice.

PubMed

Sarink, M J; Koelewijn, R; Slingerland, B C G C; Tielens, A G M; van Genderen, P J J; van Hellemond, J J

2018-06-28

Diagnosis of cystic echinococcosis (CE) is at present mainly based on imaging techniques. Serology has a complementary role, partly due to the small number of standardized and commercially available assays. Therefore we examined the clinical performance of the SERION ELISA classic Echinococcus IgG test. Using 10 U/ml as a cut-off point, and serum samples from 50 CE patients and 105 healthy controls, the sensitivity and specificity were 98.0% and 96.2%, respectively. If patients with other infectious diseases were used as negative controls, the specificity decreased to 76.9%, which causes poor positive predictive values. However, if results between 10 and 15 U/ml are classified as indecisive, the specificity of positive results (≥15 U/ml) increased to 92.5% without greatly affecting the sensitivity (92.0%). Using this approach in combination with imaging studies, the SERION ELISA classic Echinococcosis IgG test can be a useful aid in the diagnosis of CE.
Pharmacokinetics and pharmacodynamics of insulin glargine 300 U/mL in the treatment of diabetes and their clinical relevance.

PubMed

Owens, David R

2016-08-01

A more concentrated insulin glargine formulation, containing 300 U/mL (Gla-300) was approved in 2015 in the US and Europe for the treatment of diabetes mellitus in adults. This drug evaluation focuses on the pharmacokinetics (PK) and pharmacodynamics (PD) of Gla-300 from studies published up to May 2016. The clinical relevance of this new formulation will be addressed. Gla-300 was developed to produce a flatter and more prolonged PK/PD profile compared with insulin glargine 100 U/mL (Gla-100) in order to maintain effective glycemic control and reduce the risk of hypoglycemia. Compared to Gla-100, Gla-300 achieves lower and delayed peak concentrations with a PK exposure that is more stable and evenly distributed across a 24-h dosing interval. As a consequence, Gla-300 results in a consistent glucose-lowering effect with less variability over a 24-h dosing interval, which translates to a reduction in the rate of hypoglycemia (particularly nocturnal events).
Enhanced production of alkaline thermostable keratinolytic protease from calcium alginate immobilized cells of thermoalkalophilic Bacillus halodurans JB 99 exhibiting dehairing activity.

PubMed

Shrinivas, Dengeti; Kumar, Raghwendra; Naik, G R

2012-01-01

The thermoalkalophilic Bacillus halodurans JB 99 cells known for production of novel thermostable alkaline keratinolytic protease were immobilized in calcium alginate matrix. Batch and repeated batch cultivation using calcium alginate immobilized cells were studied for alkaline protease production in submerged fermentation. Immobilized cells with 2.5% alginate and 350 beads/flask of initial cell loading showed enhanced production of alkaline protease by 23.2% (5,275 ± 39.4 U/ml) as compared to free cells (4,280 ± 35.4 U/ml) after 24 h. In the semicontinuous mode of cultivation, immobilized cells under optimized conditions produced an appreciable level of alkaline protease in up to nine cycles and reached a maximal value of 5,975 U/ml after the seventh cycle. The enzyme produced from immobilized cells efficiently degraded chicken feathers in the presence of a reducing agent which can help the poultry industry in the management of keratin-rich waste and obtaining value-added products.

Building an automated problem list based on natural language processing: lessons learned in the early phase of development.

PubMed

Solti, Imre; Aaronson, Barry; Fletcher, Grant; Solti, Magdolna; Gennari, John H; Cooper, Melissa; Payne, Tom

2008-11-06

Detailed problem lists that comply with JCAHO requirements are important components of electronic health records. Besides improving continuity of care electronic problem lists could serve as foundation infrastructure for clinical trial recruitment, research, biosurveillance and billing informatics modules. However, physicians rarely maintain problem lists. Our team is building a system using MetaMap and UMLS to automatically populate the problem list. We report our early results evaluating the application. Three physicians generated gold standard problem lists for 100 cardiology ambulatory progress notes. Our application had 88% sensitivity and 66% precision using a non-modified UMLS dataset. The systemâs misses concentrated in the group of ambiguous problem list entries (Chi-square=27.12 p<0.0001). In addition to the explicit entries, the notes included 10% implicit entry candidates. MetaMap and UMLS are readily applicable to automate the problem list. Ambiguity in medical documents has consequences for performance evaluation of automated systems.
Dependability Modeling and Assessment in UML-Based Software Development

PubMed Central

Bernardi, Simona; Merseguer, José; Petriu, Dorina C.

2012-01-01

Assessment of software nonfunctional properties (NFP) is an important problem in software development. In the context of model-driven development, an emerging approach for the analysis of different NFPs consists of the following steps: (a) to extend the software models with annotations describing the NFP of interest; (b) to transform automatically the annotated software model to the formalism chosen for NFP analysis; (c) to analyze the formal model using existing solvers; (d) to assess the software based on the results and give feedback to designers. Such a modeling→analysis→assessment approach can be applied to any software modeling language, be it general purpose or domain specific. In this paper, we focus on UML-based development and on the dependability NFP, which encompasses reliability, availability, safety, integrity, and maintainability. The paper presents the profile used to extend UML with dependability information, the model transformation to generate a DSPN formal model, and the assessment of the system properties based on the DSPN results. PMID:22988428
Bioinformatics for transporter pharmacogenomics and systems biology: data integration and modeling with UML.

PubMed

Yan, Qing

2010-01-01

Bioinformatics is the rational study at an abstract level that can influence the way we understand biomedical facts and the way we apply the biomedical knowledge. Bioinformatics is facing challenges in helping with finding the relationships between genetic structures and functions, analyzing genotype-phenotype associations, and understanding gene-environment interactions at the systems level. One of the most important issues in bioinformatics is data integration. The data integration methods introduced here can be used to organize and integrate both public and in-house data. With the volume of data and the high complexity, computational decision support is essential for integrative transporter studies in pharmacogenomics, nutrigenomics, epigenetics, and systems biology. For the development of such a decision support system, object-oriented (OO) models can be constructed using the Unified Modeling Language (UML). A methodology is developed to build biomedical models at different system levels and construct corresponding UML diagrams, including use case diagrams, class diagrams, and sequence diagrams. By OO modeling using UML, the problems of transporter pharmacogenomics and systems biology can be approached from different angles with a more complete view, which may greatly enhance the efforts in effective drug discovery and development. Bioinformatics resources of membrane transporters and general bioinformatics databases and tools that are frequently used in transporter studies are also collected here. An informatics decision support system based on the models presented here is available at http://www.pharmtao.com/transporter . The methodology developed here can also be used for other biomedical fields.
The effects of different levels of superoxide dismutase in Modena on boar semen quality during liquid preservation at 17°C.

PubMed

Zhang, Xiao-Gang; Li, Hao; Wang, Le; Hao, Yang-Yi; Liang, Guo-Dong; Ma, Yun-Hui; Yang, Gong-She; Hu, Jian-Hong

2017-01-01

This study was conducted to investigate the influence of superoxide dismutase (SOD) on the quality of boar semen during liquid preservation at 17°C. Semen samples from 10 Duroc boars were collected and pooled, divided into five equal parts and diluted with Modena containing different concentrations (0, 100, 200, 300 and 400 U/mL) of SOD. During the process of liquid preservation at 17°C, sperm motility, acrosome integrity, membrane integrity, total antioxidant capacity (T-AOC) activity, malondialdehyde (MDA) content and hydrogen peroxide (H 2 O 2 ) content were measured and analyzed every 24 h. Meanwhile, effective survival time of boar semen during preservation was evaluated and analyzed. The results indicated that different concentrations of SOD in Modena showed different protective effects on boar sperm quality. Modena supplemented with SOD decreased the effects on reactive oxygen species on boar sperm quality during liquid preservation compared with that of the control group. The added 200 U/mL SOD group showed higher sperm motility, membrane integrity, acrosome integrity, effective survival time and T-AOC activity. Meanwhile, the added 200 U/mL SOD group showed lower MDA content and H 2 O 2 content. In conclusion, addition of SOD to Modena improved the boar sperm quality by reducing oxidative stress during liquid preservation at 17°C and the optimum concentration was 200 U/mL. © 2016 Japanese Society of Animal Science.
Culture Condition Optimization and Pilot Scale Production of the M12 Metalloprotease Myroilysin Produced by the Deep-Sea Bacterium Myroides profundi D25.

PubMed

Shao, Xuan; Ran, Li-Yuan; Liu, Chang; Chen, Xiu-Lan; Zhang, Xi-Ying; Qin, Qi-Long; Zhou, Bai-Cheng; Zhang, Yu-Zhong

2015-06-29

The protease myroilysin is the most abundant protease secreted by marine sedimental bacterium Myroides profundi D25. As a novel elastase of the M12 family, myroilysin has high elastin-degrading activity and strong collagen-swelling ability, suggesting its promising biotechnological potential. Because myroilysin cannot be maturely expressed in Escherichia coli, it is important to be able to improve the production of myroilysin in the wild strain D25. We optimized the culture conditions of strain D25 for protease production by using single factor experiments. Under the optimized conditions, the protease activity of strain D25 reached 1137 ± 53.29 U/mL, i.e., 174% of that before optimization (652 ± 23.78 U/mL). We then conducted small scale fermentations of D25 in a 7.5 L fermentor. The protease activity of strain D25 in small scale fermentations reached 1546.4 ± 82.65 U/mL after parameter optimization. Based on the small scale fermentation results, we further conducted pilot scale fermentations of D25 in a 200 L fermentor, in which the protease production of D25 reached approximately 1100 U/mL. These results indicate that we successfully set up the small and pilot scale fermentation processes of strain D25 for myroilysin production, which should be helpful for the industrial production of myroilysin and the development of its biotechnological potential.
Integrated catchment modelling within a strategic planning and decision making process: Werra case study

NASA Astrophysics Data System (ADS)

Dietrich, Jörg; Funke, Markus

Integrated water resources management (IWRM) redefines conventional water management approaches through a closer cross-linkage between environment and society. The role of public participation and socio-economic considerations becomes more important within the planning and decision making process. In this paper we address aspects of the integration of catchment models into such a process taking the implementation of the European Water Framework Directive (WFD) as an example. Within a case study situated in the Werra river basin (Central Germany), a systems analytic decision process model was developed. This model uses the semantics of the Unified Modeling Language (UML) activity model. As an example application, the catchment model SWAT and the water quality model RWQM1 were applied to simulate the effect of phosphorus emissions from non-point and point sources on water quality. The decision process model was able to guide the participants of the case study through the interdisciplinary planning and negotiation of actions. Further improvements of the integration framework include tools for quantitative uncertainty analyses, which are crucial for real life application of models within an IWRM decision making toolbox. For the case study, the multi-criteria assessment of actions indicates that the polluter pays principle can be met at larger scales (sub-catchment or river basin) without significantly compromising cost efficiency for the local situation.
Automated Semantic Indexing of Figure Captions to Improve Radiology Image Retrieval

PubMed Central

Kahn, Charles E.; Rubin, Daniel L.

2009-01-01

Objective We explored automated concept-based indexing of unstructured figure captions to improve retrieval of images from radiology journals. Design The MetaMap Transfer program (MMTx) was used to map the text of 84,846 figure captions from 9,004 peer-reviewed, English-language articles to concepts in three controlled vocabularies from the UMLS Metathesaurus, version 2006AA. Sampling procedures were used to estimate the standard information-retrieval metrics of precision and recall, and to evaluate the degree to which concept-based retrieval improved image retrieval. Measurements Precision was estimated based on a sample of 250 concepts. Recall was estimated based on a sample of 40 concepts. The authors measured the impact of concept-based retrieval to improve upon keyword-based retrieval in a random sample of 10,000 search queries issued by users of a radiology image search engine. Results Estimated precision was 0.897 (95% confidence interval, 0.857–0.937). Estimated recall was 0.930 (95% confidence interval, 0.838–1.000). In 5,535 of 10,000 search queries (55%), concept-based retrieval found results not identified by simple keyword matching; in 2,086 searches (21%), more than 75% of the results were found by concept-based search alone. Conclusion Concept-based indexing of radiology journal figure captions achieved very high precision and recall, and significantly improved image retrieval. PMID:19261938
Maternal serum soluble CD30 is increased in normal pregnancy, but decreased in preeclampsia and small for gestational age pregnancies.

PubMed

Kusanovic, Juan Pedro; Romero, Roberto; Hassan, Sonia S; Gotsch, Francesca; Edwin, Samuel; Chaiworapongsa, Tinnakorn; Erez, Offer; Mittal, Pooja; Mazaki-Tovi, Shali; Soto, Eleazar; Than, Nandor Gabor; Friel, Lara A; Yoon, Bo Hyun; Espinoza, Jimmy

2007-12-01

Women with preeclampsia and those who deliver small for gestational age (SGA) neonates are characterized by intravascular inflammation (T helper 1 (Th1)-biased immune response). There is controversy about the T helper 2 (Th2) response in preeclampsia and SGA. CD30, a member of the tumor necrosis factor receptor superfamily, is preferentially expressed in vitro and in vivo by activated T cells producing Th2-type cytokines. Its soluble form (sCD30) has been proposed to be an index of Th2 immune response. The objective of this study was to determine whether the maternal serum concentration of sCD30 changes with normal pregnancy, as well as in mothers with preeclampsia and those who deliver SGA neonates. This cross-sectional study included patients in the following groups: (1) non-pregnant women (N = 49); (2) patients with a normal pregnancy (N = 89); (3) patients with preeclampsia (N = 100); and (4) patients who delivered an SGA neonate (N = 78). Maternal serum concentration of sCD30 was measured by a specific and sensitive enzyme-linked immunoassay. Non-parametric tests with post-hoc analysis were used for comparisons. A p value <0.05 was considered statistically significant. (1) The median sCD30 serum concentration of pregnant women was significantly higher than that of non-pregnant women (median 29.7 U/mL, range 12.2-313.2 vs. median 23.2 U/mL, range 14.6-195.1, respectively; p = 0.01). (2) Patients with preeclampsia had a significantly lower median serum concentration of sCD30 than normal pregnant women (median 24.7 U/mL, range 7.6-71.2 vs. median 29.7 U/mL, range 12.2-313.2, respectively; p < 0.05). (3) Mothers with SGA neonates had a lower median concentration of sCD30 than normal pregnant women (median 23.4 U/mL, range 7.1-105.3 vs. median 29.7 U/mL, range 12.2-313.2, respectively; p < 0.05). (4) There was no significant correlation (r = -0.059, p = 0.5) between maternal serum sCD30 concentration and gestational age (19-38 weeks) in normal pregnant women. (1) Patients with preeclampsia and those who deliver an SGA neonate had a significantly lower serum concentration of sCD30 than normal pregnant women. (2) This finding is consistent with the view that preeclampsia and SGA are associated with a polarized Th1 immune response and, perhaps, a reduced Th2 response.
Estimation of Somatomedin-C Levels in Normals and Patients with Pituitary Disease by Radioimmunoassay

PubMed Central

Furlanetto, Richard W.; Underwood, Louis E.; Van Wyk, Judson J.; D'Ercole, A. Joseph

1977-01-01

The development of a radioimmunoassay for somatomedin-C has for the first time made it possible to discriminate between serum concentrations of a single peptide or closely related group of peptides and the net somatomedin activity measured by less specific bioassay and radioreceptor techniques. Antibodies to human somatomedin-C were raised in rabbits using a somatomedin-C ovalbumin complex as the antigen. A variety of peptide hormones at concentrations up to 1 μM are not recognized by the antibody. Insulin at concentrations >0.1 μM cross reacts in a non-parallel fashion; purified somatomedin-A is only 3% as active as somatomedin-C; and radiolabeled cloned rat liver multiplication stimulating activity does not bind to the antibody. Immunoreactive somatomedin-C can also be quantitated in the sera of a variety of subhuman species. Unusual assay kinetics, which are manifest when reactants are incubated under classic “equilibrium” assay conditions, appear to result from the failure of 125I-somatomedin-C to readily equilibrate with the somatomedin-C serum binding protein complex. It is, therefore, necessary to use nonequilibrium assay conditions to quantitate somatomedin-C in serum. With this assay it is possible to detect somatomedin-C in normal subjects using as little as 0.25 μl of unextracted serum. Serum somatomedin-C concentrations in normal subjects were lowest in cord blood and rose rapidly during the first 4 yr of life to near adult levels. In 23 normal adult volunteers, the mean serum somatomedin-C concentration was 1.50±0.10 U/ml (SEM) when compared to a pooled adult serum standard. 19 children with hypopituitary dwarfism had concentrations below 0.20 U/ml. 17 of these were below 0.1 U/ml, the lower limit of sensitivity of the assay. The mean concentration in 14 adults with active acromegaly was 6.28±0.37 U/ml (SEM), five times greater than the normal volunteers. Significant increases in serum somatomedin-C concentrations were observed in 8 of 10 hypopituitary children within 72 h after the parenteral administration of human growth hormone. Three patients with Cushing's disease had elevated serum somatomedin-C concentrations (2.61±0.14 U/ml [SEM]). Three patients with hyperprolactinemia had normal concentrations (1.74±0.11 U/ml [SEM]). The important new discovery brought to light by quantitation of immunoassayable somatomedin in patient sera is that all previously used assays detect, in addition to somatomedin-C, serum substances that are not under as stringent growth hormone control. PMID:893668
Associating clinical archetypes through UMLS Metathesaurus term clusters.

PubMed

Lezcano, Leonardo; Sánchez-Alonso, Salvador; Sicilia, Miguel-Angel

2012-06-01

Clinical archetypes are modular definitions of clinical data, expressed using standard or open constraint-based data models as the CEN EN13606 and openEHR. There is an increasing archetype specification activity that raises the need for techniques to associate archetypes to support better management and user navigation in archetype repositories. This paper reports on a computational technique to generate tentative archetype associations by mapping them through term clusters obtained from the UMLS Metathesaurus. The terms are used to build a bipartite graph model and graph connectivity measures can be used for deriving associations.
Exploring the possibility of modeling a genetic counseling guideline using agile methodology.

PubMed

Choi, Jeeyae

2013-01-01

Increased demand of genetic counseling services heightened the necessity of a computerized genetic counseling decision support system. In order to develop an effective and efficient computerized system, modeling of genetic counseling guideline is an essential step. Throughout this pilot study, Agile methodology with United Modeling Language (UML) was utilized to model a guideline. 13 tasks and 14 associated elements were extracted. Successfully constructed conceptual class and activity diagrams revealed that Agile methodology with UML was a suitable tool to modeling a genetic counseling guideline.
Production, purification, and characterization of lipase from thermophilic and alkaliphilic Bacillus coagulans BTS-3.

PubMed

Kumar, Satyendra; Kikon, Khyodano; Upadhyay, Ashutosh; Kanwar, Shamsher S; Gupta, Reena

2005-05-01

A thermophilic isolate Bacillus coagulans BTS-3 produced an extracellular alkaline lipase, the production of which was substantially enhanced when the type of carbon source, nitrogen source, and the initial pH of culture medium were consecutively optimized. Lipase activity 1.16 U/ml of culture medium was obtained in 48 h at 55 degrees C and pH 8.5 with refined mustard oil as carbon source and a combination of peptone and yeast extract (1:1) as nitrogen sources. The enzyme was purified 40-fold to homogeneity by ammonium sulfate precipitation and DEAE-Sepharose column chromatography. Its molecular weight was 31 kDa on SDS-PAGE. The enzyme showed maximum activity at 55 degrees C and pH 8.5, and was stable between pH 8.0 and 10.5 and at temperatures up to 70 degrees C. The enzyme was found to be inhibited by Al3+, Co2+, Mn2+, and Zn2+ ions while K+, Fe3+, Hg2+, and Mg2+ ions enhanced the enzyme activity; Na+ ions have no effect on enzyme activity. The purified lipase showed a variable specificity/hydrolytic activity towards various 4-nitrophenyl esters.
MediaNet: a multimedia information network for knowledge representation

NASA Astrophysics Data System (ADS)

Benitez, Ana B.; Smith, John R.; Chang, Shih-Fu

2000-10-01

In this paper, we present MediaNet, which is a knowledge representation framework that uses multimedia content for representing semantic and perceptual information. The main components of MediaNet include conceptual entities, which correspond to real world objects, and relationships among concepts. MediaNet allows the concepts and relationships to be defined or exemplified by multimedia content such as images, video, audio, graphics, and text. MediaNet models the traditional relationship types such as generalization and aggregation but adds additional functionality by modeling perceptual relationships based on feature similarity. For example, MediaNet allows a concept such as car to be defined as a type of a transportation vehicle, but which is further defined and illustrated through example images, videos and sounds of cars. In constructing the MediaNet framework, we have built on the basic principles of semiotics and semantic networks in addition to utilizing the audio-visual content description framework being developed as part of the MPEG-7 multimedia content description standard. By integrating both conceptual and perceptual representations of knowledge, MediaNet has potential to impact a broad range of applications that deal with multimedia content at the semantic and perceptual levels. In particular, we have found that MediaNet can improve the performance of multimedia retrieval applications by using query expansion, refinement and translation across multiple content modalities. In this paper, we report on experiments that use MediaNet in searching for images. We construct the MediaNet knowledge base using both WordNet and an image network built from multiple example images and extracted color and texture descriptors. Initial experimental results demonstrate improved retrieval effectiveness using MediaNet in a content-based retrieval system.
An Iterative Inference Procedure Applying Conditional Random Fields for Simultaneous Classification of Land Cover and Land Use

NASA Astrophysics Data System (ADS)

Albert, L.; Rottensteiner, F.; Heipke, C.

2015-08-01

Land cover and land use exhibit strong contextual dependencies. We propose a novel approach for the simultaneous classification of land cover and land use, where semantic and spatial context is considered. The image sites for land cover and land use classification form a hierarchy consisting of two layers: a land cover layer and a land use layer. We apply Conditional Random Fields (CRF) at both layers. The layers differ with respect to the image entities corresponding to the nodes, the employed features and the classes to be distinguished. In the land cover layer, the nodes represent super-pixels; in the land use layer, the nodes correspond to objects from a geospatial database. Both CRFs model spatial dependencies between neighbouring image sites. The complex semantic relations between land cover and land use are integrated in the classification process by using contextual features. We propose a new iterative inference procedure for the simultaneous classification of land cover and land use, in which the two classification tasks mutually influence each other. This helps to improve the classification accuracy for certain classes. The main idea of this approach is that semantic context helps to refine the class predictions, which, in turn, leads to more expressive context information. Thus, potentially wrong decisions can be reversed at later stages. The approach is designed for input data based on aerial images. Experiments are carried out on a test site to evaluate the performance of the proposed method. We show the effectiveness of the iterative inference procedure and demonstrate that a smaller size of the super-pixels has a positive influence on the classification result.
Landmark Image Retrieval by Jointing Feature Refinement and Multimodal Classifier Learning.

PubMed

Zhang, Xiaoming; Wang, Senzhang; Li, Zhoujun; Ma, Shuai; Xiaoming Zhang; Senzhang Wang; Zhoujun Li; Shuai Ma; Ma, Shuai; Zhang, Xiaoming; Wang, Senzhang; Li, Zhoujun

2018-06-01

Landmark retrieval is to return a set of images with their landmarks similar to those of the query images. Existing studies on landmark retrieval focus on exploiting the geometries of landmarks for visual similarity matches. However, the visual content of social images is of large diversity in many landmarks, and also some images share common patterns over different landmarks. On the other side, it has been observed that social images usually contain multimodal contents, i.e., visual content and text tags, and each landmark has the unique characteristic of both visual content and text content. Therefore, the approaches based on similarity matching may not be effective in this environment. In this paper, we investigate whether the geographical correlation among the visual content and the text content could be exploited for landmark retrieval. In particular, we propose an effective multimodal landmark classification paradigm to leverage the multimodal contents of social image for landmark retrieval, which integrates feature refinement and landmark classifier with multimodal contents by a joint model. The geo-tagged images are automatically labeled for classifier learning. Visual features are refined based on low rank matrix recovery, and multimodal classification combined with group sparse is learned from the automatically labeled images. Finally, candidate images are ranked by combining classification result and semantic consistence measuring between the visual content and text content. Experiments on real-world datasets demonstrate the superiority of the proposed approach as compared to existing methods.
Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents

PubMed Central

2010-01-01

Background Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful instrument to manage this knowledge. Nevertheless, IE at the sentence level has a limited effect because of the frequent references to previous entities in the discourse, a phenomenon known as 'anaphora'. DrugNerAR, a drug anaphora resolution system is presented to address the problem of co-referring expressions in pharmacological literature. This development is part of a larger and innovative study about automatic drug-drug interaction extraction. Methods The system uses a set of linguistic rules drawn by Centering Theory over the analysis provided by a biomedical syntactic parser. Semantic information provided by the Unified Medical Language System (UMLS) is also integrated in order to improve the recognition and the resolution of nominal drug anaphors. Besides, a corpus has been developed in order to analyze the phenomena and evaluate the current approach. Each possible case of anaphoric expression was looked into to determine the most effective way of resolution. Results An F-score of 0.76 in anaphora resolution was achieved, outperforming significantly the baseline by almost 73%. This ad-hoc reference line was developed to check the results as there is no previous work on anaphora resolution in pharmalogical documents. The obtained results resemble those found in related-semantic domains. Conclusions The present approach shows very promising results in the challenge of accounting for anaphoric expressions in pharmacological texts. DrugNerAr obtains similar results to other approaches dealing with anaphora resolution in the biomedical domain, but, unlike these approaches, it focuses on documents reflecting drug interactions. The Centering Theory has proved being effective at the selection of antecedents in anaphora resolution. A key component in the success of this framework is the analysis provided by the MMTx program and the DrugNer system that allows to deal with the complexity of the pharmacological language. It is expected that the positive results of the resolver increases performance of our future drug-drug interaction extraction system. PMID:20406499
A UMLS-based spell checker for natural language processing in vaccine safety.

PubMed

Tolentino, Herman D; Matters, Michael D; Walop, Wikke; Law, Barbara; Tong, Wesley; Liu, Fang; Fontelo, Paul; Kohl, Katrin; Payne, Daniel C

2007-02-12

The Institute of Medicine has identified patient safety as a key goal for health care in the United States. Detecting vaccine adverse events is an important public health activity that contributes to patient safety. Reports about adverse events following immunization (AEFI) from surveillance systems contain free-text components that can be analyzed using natural language processing. To extract Unified Medical Language System (UMLS) concepts from free text and classify AEFI reports based on concepts they contain, we first needed to clean the text by expanding abbreviations and shortcuts and correcting spelling errors. Our objective in this paper was to create a UMLS-based spelling error correction tool as a first step in the natural language processing (NLP) pipeline for AEFI reports. We developed spell checking algorithms using open source tools. We used de-identified AEFI surveillance reports to create free-text data sets for analysis. After expansion of abbreviated clinical terms and shortcuts, we performed spelling correction in four steps: (1) error detection, (2) word list generation, (3) word list disambiguation and (4) error correction. We then measured the performance of the resulting spell checker by comparing it to manual correction. We used 12,056 words to train the spell checker and tested its performance on 8,131 words. During testing, sensitivity, specificity, and positive predictive value (PPV) for the spell checker were 74% (95% CI: 74-75), 100% (95% CI: 100-100), and 47% (95% CI: 46%-48%), respectively. We created a prototype spell checker that can be used to process AEFI reports. We used the UMLS Specialist Lexicon as the primary source of dictionary terms and the WordNet lexicon as a secondary source. We used the UMLS as a domain-specific source of dictionary terms to compare potentially misspelled words in the corpus. The prototype sensitivity was comparable to currently available tools, but the specificity was much superior. The slow processing speed may be improved by trimming it down to the most useful component algorithms. Other investigators may find the methods we developed useful for cleaning text using lexicons specific to their area of interest.
A UMLS-based spell checker for natural language processing in vaccine safety

PubMed Central

Tolentino, Herman D; Matters, Michael D; Walop, Wikke; Law, Barbara; Tong, Wesley; Liu, Fang; Fontelo, Paul; Kohl, Katrin; Payne, Daniel C

2007-01-01

Background The Institute of Medicine has identified patient safety as a key goal for health care in the United States. Detecting vaccine adverse events is an important public health activity that contributes to patient safety. Reports about adverse events following immunization (AEFI) from surveillance systems contain free-text components that can be analyzed using natural language processing. To extract Unified Medical Language System (UMLS) concepts from free text and classify AEFI reports based on concepts they contain, we first needed to clean the text by expanding abbreviations and shortcuts and correcting spelling errors. Our objective in this paper was to create a UMLS-based spelling error correction tool as a first step in the natural language processing (NLP) pipeline for AEFI reports. Methods We developed spell checking algorithms using open source tools. We used de-identified AEFI surveillance reports to create free-text data sets for analysis. After expansion of abbreviated clinical terms and shortcuts, we performed spelling correction in four steps: (1) error detection, (2) word list generation, (3) word list disambiguation and (4) error correction. We then measured the performance of the resulting spell checker by comparing it to manual correction. Results We used 12,056 words to train the spell checker and tested its performance on 8,131 words. During testing, sensitivity, specificity, and positive predictive value (PPV) for the spell checker were 74% (95% CI: 74–75), 100% (95% CI: 100–100), and 47% (95% CI: 46%–48%), respectively. Conclusion We created a prototype spell checker that can be used to process AEFI reports. We used the UMLS Specialist Lexicon as the primary source of dictionary terms and the WordNet lexicon as a secondary source. We used the UMLS as a domain-specific source of dictionary terms to compare potentially misspelled words in the corpus. The prototype sensitivity was comparable to currently available tools, but the specificity was much superior. The slow processing speed may be improved by trimming it down to the most useful component algorithms. Other investigators may find the methods we developed useful for cleaning text using lexicons specific to their area of interest. PMID:17295907
Instantaneous Coastline Extraction from LIDAR Point Cloud and High Resolution Remote Sensing Imagery

NASA Astrophysics Data System (ADS)

Li, Y.; Zhoing, L.; Lai, Z.; Gan, Z.

2018-04-01

A new method was proposed for instantaneous waterline extraction in this paper, which combines point cloud geometry features and image spectral characteristics of the coastal zone. The proposed method consists of follow steps: Mean Shift algorithm is used to segment the coastal zone of high resolution remote sensing images into small regions containing semantic information;Region features are extracted by integrating LiDAR data and the surface area of the image; initial waterlines are extracted by α-shape algorithm; a region growing algorithm with is taking into coastline refinement, with a growth rule integrating the intensity and topography of LiDAR data; moothing the coastline. Experiments are conducted to demonstrate the efficiency of the proposed method.
Applying a semantic information Petri Net modeling method to AUV systems design

NASA Astrophysics Data System (ADS)

Feng, Xiao-Ning; Wang, Shuo; Wang, Zhuo; Liu, Qun

2008-12-01

This paper informally introduces colored object-oriented Petri Nets(COOPN) with the application of the AUV system. According to the characteristic of the AUV system’s running environment, the object-oriented method is used in this paper not only to dispart system modules but also construct the refined running model of AUV system, then the colored Petri Net method is used to establish hierarchically detailed model in order to get the performance analyzing information of the system. After analyzing the model implementation, the errors of architecture designing and function realization can be found. If the errors can be modified on time, the experiment time in the pool can be reduced and the cost can be saved.

Constructing a Pre-Emptive System Based on a Multidimentional Matrix and Autocompletion to Improve Diagnostic Coding in Acute Care Hospitals.

PubMed

Noussa-Yao, Joseph; Heudes, Didier; Escudie, Jean-Baptiste; Degoulet, Patrice

2016-01-01

Short-stay MSO (Medicine, Surgery, Obstetrics) hospitalization activities in public and private hospitals providing public services are funded through charges for the services provided (T2A in French). Coding must be well matched to the severity of the patient's condition, to ensure that appropriate funding is provided to the hospital. We propose the use of an autocompletion process and multidimensional matrix, to help physicians to improve the expression of information and to optimize clinical coding. With this approach, physicians without knowledge of the encoding rules begin from a rough concept, which is gradually refined through semantic proximity and uses information on the associated codes stemming of optimized knowledge bases of diagnosis code.
Elevated serum levels of soluble CD30 in patients with atopic dermatitis (AD).

PubMed

Bengtsson, A; Holm, L; Bäck, O; Fransson, J; Scheynius, A

1997-09-01

The immunopathology of AD is still unclear, but evidence for an immune response polarized towards Th2 activity has been provided. The CD30 molecule belongs to the tumour necrosis factor (TNF) receptor family and is expressed on activated T cells with a sustained expression in Th2 cells. This molecule also exists in a soluble form (sCD30). Elevated serum levels of sCD30 have been found in patients with Hodgkin's disease, chronic hepatitis B infection and HIV infection. Studies were undertaken to compare the serum levels of sCD30 in patients with AD (n=49) and healthy non-atopic controls (n=94). The presence of sCD30 was analysed with ELISA. A significantly higher concentration of sCD30 was noted in AD patients, median sCD30 level 29 U/ml (range 1-708 U/ml), compared with healthy non-atopic controls (P<0.001), where the median level was 11 U/ml with a range of 1-1042 U/ml. No correlation was found between sCD30 levels and total serum IgE, or between the AD patients' SCORAD values and concentration of sCD30. sCD30 levels were also analysed in 20 AD patients, which during ketoconazole treatment had improved their clinical scores and reduced their serum IgE and eosinophil cationic protein levels. However, no significant decrease in sCD30 levels was noted after treatment. The results show that patients with AD have elevated levels of sCD30, but without correlation to total serum IgE or disease activity.
A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.

PubMed

Sarrouti, Mourad; Ouatik El Alaoui, Said

2017-04-01

Passage retrieval, the identification of top-ranked passages that may contain the answer for a given biomedical question, is a crucial component for any biomedical question answering (QA) system. Passage retrieval in open-domain QA is a longstanding challenge widely studied over the last decades. However, it still requires further efforts in biomedical QA. In this paper, we present a new biomedical passage retrieval method based on Stanford CoreNLP sentence/passage length, probabilistic information retrieval (IR) model and UMLS concepts. In the proposed method, we first use our document retrieval system based on PubMed search engine and UMLS similarity to retrieve relevant documents to a given biomedical question. We then take the abstracts from the retrieved documents and use Stanford CoreNLP for sentence splitter to make a set of sentences, i.e., candidate passages. Using stemmed words and UMLS concepts as features for the BM25 model, we finally compute the similarity scores between the biomedical question and each of the candidate passages and keep the N top-ranked ones. Experimental evaluations performed on large standard datasets, provided by the BioASQ challenge, show that the proposed method achieves good performances compared with the current state-of-the-art methods. The proposed method significantly outperforms the current state-of-the-art methods by an average of 6.84% in terms of mean average precision (MAP). We have proposed an efficient passage retrieval method which can be used to retrieve relevant passages in biomedical QA systems with high mean average precision. Copyright © 2017 Elsevier Inc. All rights reserved.
Time process study with UML.

PubMed

Shiki, N; Ohno, Y; Fujii, A; Murata, T; Matsumura, Y

2009-01-01

We propose a new business-process analysis approach, Time Process Study (TPS), which comprises process analysis and time and motion studies (TMS). TPS offsets weaknesses of TMS; the cost of field studies and the difficulties in applying them to tasks whose time span differs from those of usual tasks. In TPS, the job procedures are first displayed using a unified modeling language (UML). Next, time and manpower for each procedure are studied through interviews and TMS, and the information is appended to the UML diagram. We applied TPS in the case of a hospital-based cancer registry (HCR) of a university hospital to clarify the work procedure and the time required, and investigated TPS's availability. Meetings for the study were held once a month from July to September in 2008, and one inquirer committed a total of eight hours to the hospital survey. TPS revealed that HCR consisted of three tasks and 14 functions. The registration required 123 hours/month/person, the quality control required 6.5 hours/ 6 months/person and filing data into the population-based cancer registry required 0.5 hours/6 months/person. Of the total tasks involved in registration, 116.5 hours/month/person were undertaken by a registration worker, which shows the necessity of employing one full-time staff. With TPS, it is straightforward to share the concept among the study-team because the job procedure is first displayed using UML. Therefore, it requires a few workload to conduct TMS and interview. The obtained results were adopted for the review of staff assignment of HCR by Japanese government.
Comparative one-factor-at-a-time, response surface (statistical) and bench-scale bioreactor level optimization of thermoalkaline protease production from a psychrotrophic Pseudomonas putida SKG-1 isolate.

PubMed

Singh, Santosh K; Singh, Sanjay K; Tripathi, Vinayak R; Khare, Sunil K; Garg, Satyendra K

2011-12-28

Production of alkaline protease from various bacterial strains using statistical methods is customary now-a-days. The present work is first attempt for the production optimization of a solvent stable thermoalkaline protease by a psychrotrophic Pseudomonas putida isolate using conventional, response surface methods, and fermentor level optimization. The pre-screening medium amended with optimized (w/v) 1.0% glucose, 2.0% gelatin and 0.5% yeast extract, produced 278 U protease ml(-1) at 72 h incubation. Enzyme production increased to 431 Uml(-1) when Mg2+ (0.01%, w/v) was supplemented. Optimization of physical factors further enhanced protease to 514 Uml(-1) at pH 9.0, 25°C and 200 rpm within 60 h. The combined effect of conventionally optimized variables (glucose, yeast extract, MgSO4 and pH), thereafter predicted by response surface methodology yielded 617 U protease ml(-1) at glucose 1.25% (w/v), yeast extract 0.5% (w/v), MgSO4 0.01% (w/v) and pH 8.8. Bench-scale bioreactor level optimization resulted in enhanced production of 882 U protease ml(-1) at 0.8 vvm aeration and 150 rpm agitation during only 48 h incubation. The optimization of fermentation variables using conventional, statistical approaches and aeration/agitation at fermentor level resulted in ~13.5 folds increase (882 Uml(-1)) in protease production compared to un-optimized conditions (65 Uml(-1)). This is the highest level of thermoalkaline protease reported so far by any psychrotrophic bacterium.
Tumor type M2 pyruvate kinase expression in advanced breast cancer.

PubMed

Lüftner, D; Mesterharm, J; Akrivakis, C; Geppert, R; Petrides, P E; Wernecke, K D; Possinger, K

2000-01-01

Recently, a high validity correlation of the tumor M2 pyruvate kinase (Tu M2-PK) isoenzyme in comparison to standard tumor markers has been demonstrated in solid tumors. We investigated this marker in 67 patients with advanced breast cancer (ABC) in comparison to healthy controls. Plasma Tu M2-PK was measured using an ELISA assay (ScheBo Tech, Giessen, Germany) while serum CA27.29 was determined using a chemiluminescent immunoassay (Bayer Diagnostics, Tarrytown, USA). In a ROC analysis, the cut-off to discriminate patients from controls was established at 15 U/ml for Tu M2-PK (specificity 85%; positive predictive value 81%) and 30 U/ml for CA27.29 (specificity 91%; positive predictive value 92%). Median ABC baseline levels (ranges) in patients with ABC for Tu M2-PK and CA27.29 were 12.8 U/ml (4.8-252,495) and 130 U/ml (13.3-8130), respectively. Response assessment was done in 45 chemotherapy courses of 38 pts. In 13 out of 19 blocks (68.4%) with PD (progressive disease), an elevated level of Tu M2-PK at baseline or in the follow-up was found. In 17 out of 20 blocks (85%) with SD (stable disease), the Tu M2-PK level was normal at baseline or normalised within 4 weeks of treatment. All 6 patients with disease remission had a normal baseline Tu M2-PK level or the levels decreased promptly. Tu M2-PK gives additional information about ABC, indicating disease activity and sensitivity to chemotherapy while CA27.29 reflects tumor burden.
Factor VIII inhibitor in a patient with mild haemophilia A and an Asn618-->Ser mutation responsive to immune tolerance induction and cyclophosphamide.

PubMed

Vlot, André J; Wittebol, Shulamiet; Strengers, Paul F W; Turenhout, Ellen A M; Voorberg, Jan; van den Berg, H Marijke; Mauser-Bunschoten, Eveline P

2002-04-01

We describe a patient with mild haemophilia A (original value of factor VIII activity 0.30 U/ml) who developed an inhibitor (36.1 Bethesda U/ml) which cross-reacted with his endogenous factor VIII. This caused a decline in basal factor VIII level (< 0.01 U/ml) and severe haemorrhagic events. Treatment to induce immune tolerance was started with factor VIII/von Willebrand factor (VWF) concentrates, but inhibitor levels increased progressively and the patient suffered serious bleeding. Cyclophosphamide was administered and, after 8 months treatment, factor VIII levels increased to 0.20 U/ml and the inhibitor could no longer be detected. Screening of his factor VIII gene revealed a missense mutation in exon 13 that predicts substitution of Asn618-->Ser in the A2 domain of factor VIII. Immunoprecipitation analysis showed that the antibodies present in the patient's plasma reacted with metabolically labelled A2 domain and, to a lesser extent, with factor VIII light chain. Inhibitory antibodies were completely neutralized by recombinant A2 domain, whereas no neutralization was observed after the addition of factor VIII light chain (A3-C1-C2) and C2 domain. More detailed analysis showed that the majority of inhibitory antibodies were directed against residues Arg484-Ile508, a previously identified binding site for factor VIII inhibitors. Our findings suggest that immune tolerance therapy and cyclophosphamide were successful in eradicating inhibitory antibodies against a common epitope on factor VIII.
Students' different understandings of class diagrams

NASA Astrophysics Data System (ADS)

Boustedt, Jonas

2012-03-01

The software industry needs well-trained software designers and one important aspect of software design is the ability to model software designs visually and understand what visual models represent. However, previous research indicates that software design is a difficult task to many students. This article reports empirical findings from a phenomenographic investigation on how students understand class diagrams, Unified Modeling Language (UML) symbols, and relations to object-oriented (OO) concepts. The informants were 20 Computer Science students from four different universities in Sweden. The results show qualitatively different ways to understand and describe UML class diagrams and the "diamond symbols" representing aggregation and composition. The purpose of class diagrams was understood in a varied way, from describing it as a documentation to a more advanced view related to communication. The descriptions of class diagrams varied from seeing them as a specification of classes to a more advanced view, where they were described to show hierarchic structures of classes and relations. The diamond symbols were seen as "relations" and a more advanced way was seeing the white and the black diamonds as different symbols for aggregation and composition. As a consequence of the results, it is recommended that UML should be adopted in courses. It is briefly indicated how the phenomenographic results in combination with variation theory can be used by teachers to enhance students' possibilities to reach advanced understanding of phenomena related to UML class diagrams. Moreover, it is recommended that teachers should put more effort in assessing skills in proper usage of the basic symbols and models and students should be provided with opportunities to practise collaborative design, e.g. using whiteboards.
Glycaemic control and hypoglycaemia in people with type 2 diabetes switching from twice‐daily basal insulin to once‐daily insulin glargine 300 U/mL or insulin glargine 100 U/mL (EDITION 1 and EDITION 2 subgroup analysis)

PubMed Central

d'Emden, Michael C.; Fisher, Miles; Ampudia‐Blasco, F. Javier; Stella, Peter; Bizet, Florence; Cali, Anna M. G.; Wysham, Carol H.

2017-01-01

In this post hoc analysis we compared glycaemic control and hypoglycaemia between insulin glargine 300 U/mL (Gla‐300) and glargine 100 U/mL (Gla‐100) administered once daily in people with type 2 diabetes (T2DM) from the EDITION 1 (basal plus mealtime insulin) and EDITION 2 (basal insulin plus oral antihyperglycaemic drugs) trials who were previously receiving twice‐daily insulin. At randomization, 16.9% and 20.0% of people in EDITION 1 and 2, respectively, were receiving twice‐daily basal insulin. Glycated haemoglobin change from baseline to Month 6 was similar over 6 months with Gla‐300 or Gla‐100 (least squares mean difference −0.01%; 95% confidence interval [CI] −0.27 to 0.24] in EDITION 1 and 0.16%; 95% CI −0.25 to 0.57, in EDITION 2). Participants previously receiving twice‐daily insulin in EDITION 1 had a lower risk of confirmed (≤3.9 mmol/L [≤70 mg/dL]) or severe hypoglycaemia with Gla‐300 vs Gla‐100 at night (00:00–05:59 hours), but not at any time (24 hours); in EDITION 2 the risk was reduced at night and any time (24 hours). In conclusion, Gla‐300 provided similar glycaemic control with less hypoglycaemia compared with Gla‐100 in people with T2DM switching from twice‐daily to once‐daily basal insulin. PMID:28736942
Glycaemic control and hypoglycaemia with insulin glargine 300 U/mL compared with glargine 100 U/mL in Japanese adults with type 2 diabetes using basal insulin plus oral anti-hyperglycaemic drugs (EDITION JP 2 randomised 12-month trial including 6-month extension).

PubMed

Terauchi, Y; Koyama, M; Cheng, X; Sumi, M; Riddle, M C; Bolli, G B; Hirose, T

2017-10-01

To compare insulin glargine 300 U/mL (Gla-300) with glargine 100 U/mL (Gla-100) in Japanese adults with uncontrolled type 2 diabetes on basal insulin and oral anti-hyperglycaemic drugs over 12 months. EDITION JP 2 was a randomised, open-label, phase 3 study. Following a 6-month treatment period, participants continued receiving previously assigned once daily Gla-300 or Gla-100, plus oral anti-hyperglycaemic drugs, in a 6-month extension period. Glycaemic control, hypoglycaemia and adverse events were assessed. The 12-month completion rate was 88% for Gla-300 and 96% for Gla-100, with comparable reasons for discontinuation. Mean HbA 1c decrease from baseline to month 12 was 0.3% in both groups. Annualised rates of confirmed (≤3.9mmol/L [≤70mg/dL]) or severe hypoglycaemia were lower with Gla-300 than Gla-100 (nocturnal [00:00-05:59h]: rate ratio 0.41; 95% confidence interval: 0.18 to 0.92; anytime [24h]: rate ratio 0.64; 95% confidence interval: 0.44 to 0.94). Cumulative number of hypoglycaemic events was lower with Gla-300 than Gla-100. Adverse event profiles were comparable between treatments. Over 12 months, Gla-300-treated participants achieved sustained glycaemic control and experienced less hypoglycaemia, particularly at night, versus Gla-100, supporting 6-month results. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Case Studies in Describing Scientific Research Efforts as Linked Data

NASA Astrophysics Data System (ADS)

Gandara, A.; Villanueva-Rosales, N.; Gates, A.

2013-12-01

The Web is growing with numerous scientific resources, prompting increased efforts in information management to consider integration and exchange of scientific resources. Scientists have many options to share scientific resources on the Web; however, existing options provide limited support to scientists in annotating and relating research resources resulting from a scientific research effort. Moreover, there is no systematic approach to documenting scientific research and sharing it on the Web. This research proposes the Collect-Annotate-Refine-Publish (CARP) Methodology as an approach for guiding documentation of scientific research on the Semantic Web as scientific collections. Scientific collections are structured descriptions about scientific research that make scientific results accessible based on context. In addition, scientific collections enhance the Linked Data data space and can be queried by machines. Three case studies were conducted on research efforts at the Cyber-ShARE Research Center of Excellence in order to assess the effectiveness of the methodology to create scientific collections. The case studies exposed the challenges and benefits of leveraging the Semantic Web and Linked Data data space to facilitate access, integration and processing of Web-accessible scientific resources and research documentation. As such, we present the case study findings and lessons learned in documenting scientific research using CARP.
Partial biochemical characterization of crude extract extracellular chitinase enzyme from Bacillus subtilis B 298

NASA Astrophysics Data System (ADS)

Lestari, P.; Prihatiningsih, N.; Djatmiko, H. A.

2017-02-01

Extraction and characterization of extracellular chitinase from Bacillus subtilis B 298 have been done. Growth curve determination of B. subtilis B 298, production curve determination of crude extract chitinase from B. subtilis B 298, and partial biochemical characterization of crude extract chitinase have been achieved in this study. Optimum growth of B. subtilis B 298 was achieved at logarithmic phase within 9 hours incubation time, so it was used as inoculum for enzyme production. According to production curve of the enzyme, it was known that incubation time which gave the highest chitinase activity of 15 hours with activity of 6.937 U/mL respectively. Effect of various temperatures on chitinase activity showed that optimum activity was achieved at 40°C with an activity of 5.764 U/mL respectively. Meanwhile, the optimum pH for chitinase activity was achieved at pH of 5.0 with an activity of 6.813 U/mL respectively. This enzyme was then classified as metalloenzyme due to the decline of the activity by EDTA addition. All divalent cations tested acted as inhibitors.
Novel tryptophan metabolic pathways in auxin biosynthesis in silkworm.

PubMed

Yokoyama, Chiaki; Takei, Mami; Kouzuma, Yoshiaki; Nagata, Shinji; Suzuki, Yoshihito

2017-08-01

In the course of our study of the biosynthetic pathway of auxin, a class of phytohormones, in insects, we proposed the biosynthetic pathway tryptophan (Trp)→indole-3-acetaldoxime (IAOx)→indole-3-acetadehyde (IAAld)→indole-3-acetic acid (IAA). In this study, we identified two branches in the metabolic pathways in the silkworm, possibly affecting the efficiency of IAA production: Trp→indole-3-pyruvic acid→indole-3-lactic acid and IAAld→indole-3-ethanol. We also determined the apparent conversion activities (2.05×10 -7 UmL -1 for Trp→IAA, 1.30×10 -5 UmL -1 for IAOx→IAA, and 3.91×10 -1 UmL -1 for IAAld→IAA), which explain why IAOx and IAAld are barely detectable as either endogenous compounds or metabolites of their precursors. The failure to detect IAAld, even in the presence of an inhibitor of the conversion IAAld→IAA, is explained by a switch in the conversion from IAAld→IAA to IAAld→IEtOH. Copyright © 2017 Elsevier Ltd. All rights reserved.
Planned NLM/AHCPR large-scale vocabulary test: using UMLS technology to determine the extent to which controlled vocabularies cover terminology needed for health care and public health.

PubMed Central

Humphreys, B L; Hole, W T; McCray, A T; Fitzmaurice, J M

1996-01-01

The National Library of Medicine (NLM) and the Agency for Health Care Policy and Research (AHCPR) are sponsoring a test to determine the extent to which a combination of existing health-related terminologies covers vocabulary needed in health information systems. The test vocabularies are the 30 that are fully or partially represented in the 1996 edition of the Unified Medical Language System (UMLS) Metathesaurus, plus three planned additions: the portions of SNOMED International not in the 1996 Metathesaurus Read Clinical Classification, and the Logical Observations Identifiers, Names, and Codes (LOINC) system. These vocabularies are available to testers through a special interface to the Internet-based UMLS Knowledge Source Server. The test will determine the ability of the test vocabularies to serve as a source of controlled vocabulary for health data systems and applications. It should provide the basis for realistic resource estimates for developing and maintaining a comprehensive "standard" health vocabulary that is based on existing terminologies. PMID:8816351
Colaborated Architechture Framework for Composition UML 2.0 in Zachman Framework

NASA Astrophysics Data System (ADS)

Hermawan; Hastarista, Fika

2016-01-01

Zachman Framework (ZF) is the framework of enterprise architechture that most widely adopted in the Enterprise Information System (EIS) development. In this study, has been developed Colaborated Architechture Framework (CAF) to collaborate ZF with Unified Modeling Language (UML) 2.0 modeling. The CAF provides the composition of ZF matrix that each cell is consist of the Model Driven architechture (MDA) from the various UML models and many Software Requirement Specification (SRS) documents. Implementation of this modeling is used to develops Enterprise Resource Planning (ERP). Because ERP have a coverage of applications in large numbers and complexly relations, it is necessary to use Agile Model Driven Design (AMDD) approach as an advanced method to transforms MDA into components of application modules with efficiently and accurately. Finally, through the using of the CAF, give good achievement in fullfilment the needs from all stakeholders that are involved in the overall process stage of Rational Unified Process (RUP), and also obtaining a high satisfaction to fullfiled the functionality features of the ERP software in PT. Iglas (Persero) Gresik.
Unified modeling language and design of a case-based retrieval system in medical imaging.

PubMed Central

LeBozec, C.; Jaulent, M. C.; Zapletal, E.; Degoulet, P.

1998-01-01

One goal of artificial intelligence research into case-based reasoning (CBR) systems is to develop approaches for designing useful and practical interactive case-based environments. Explaining each step of the design of the case-base and of the retrieval process is critical for the application of case-based systems to the real world. We describe herein our approach to the design of IDEM--Images and Diagnosis from Examples in Medicine--a medical image case-based retrieval system for pathologists. Our approach is based on the expressiveness of an object-oriented modeling language standard: the Unified Modeling Language (UML). We created a set of diagrams in UML notation illustrating the steps of the CBR methodology we used. The key aspect of this approach was selecting the relevant objects of the system according to user requirements and making visualization of cases and of the components of the case retrieval process. Further evaluation of the expressiveness of the design document is required but UML seems to be a promising formalism, improving the communication between the developers and users. Images Figure 6 Figure 7 PMID:9929346
Unified modeling language and design of a case-based retrieval system in medical imaging.

PubMed

LeBozec, C; Jaulent, M C; Zapletal, E; Degoulet, P

1998-01-01

One goal of artificial intelligence research into case-based reasoning (CBR) systems is to develop approaches for designing useful and practical interactive case-based environments. Explaining each step of the design of the case-base and of the retrieval process is critical for the application of case-based systems to the real world. We describe herein our approach to the design of IDEM--Images and Diagnosis from Examples in Medicine--a medical image case-based retrieval system for pathologists. Our approach is based on the expressiveness of an object-oriented modeling language standard: the Unified Modeling Language (UML). We created a set of diagrams in UML notation illustrating the steps of the CBR methodology we used. The key aspect of this approach was selecting the relevant objects of the system according to user requirements and making visualization of cases and of the components of the case retrieval process. Further evaluation of the expressiveness of the design document is required but UML seems to be a promising formalism, improving the communication between the developers and users.
XMI2USE: A Tool for Transforming XMI to USE Specifications

NASA Astrophysics Data System (ADS)

Sun, Wuliang; Song, Eunjee; Grabow, Paul C.; Simmonds, Devon M.

The UML-based Specification Environment (USE) tool supports syntactic analysis, type checking, consistency checking, and dynamic validation of invariants and pre-/post conditions specified in the Object Constraint Language (OCL). Due to its animation and analysis power, it is useful when checking critical non-functional properties such as security policies. However, the USE tool requires one to specify (i.e., "write") a model using its own textual language and does not allow one to import any model specification files created by other UML modeling tools. Hence, to make the best use of existing UML tools, we often create a model with OCL constraints using a modeling tool such as the IBM Rational Software Architect (RSA) and then use the USE tool for model validation. This approach, however, requires a manual transformation between the specifications of two different tool formats, which is error-prone and diminishes the benefit of automated model-level validations. In this paper, we describe our own implementation of a specification transformation engine that is based on the Model Driven Architecture (MDA) framework and currently supports automatic tool-level transformations from RSA to USE.
Relationship between serum TNF activity and insulin resistance in dairy cows affected with naturally occurring fatty liver.

PubMed

Ohtsuka, H; Koiwa, M; Hatsugaya, A; Kudo, K; Hoshi, F; Itoh, N; Yokota, H; Okada, H; Kawamura, S

2001-09-01

To clarity the relationship between tumor necrosis factor (TNF) and insulin resistance in dairy cows affected with fatty liver, naturally occurring cases were investigated. The affected cows were classified into following three groups according to histopathologic findings of the liver: mild fat droplet deposition (group 1; n=11), severe fat droplet deposition (group 2; n=10), and cloudy swelling (group 3; n=8). Serum TNF activities in Group 2 (8.67 +/- 2.16 U/ml) and Group 3 (11.65 +/- 1.92 U/ml) were significantly higher than that in Group 1 (3.57 +/- 0.81 U/ml) (p<0.05). The insulin-tolerance tests showed that the insulin-stimulated glucose disposal rates (GDR) in Group 2 (27.6 +/- 7.8%) and Group 3 (15.8 +/- 9.1%) were significantly lower than that in Group 1 (41.7 +/- 9.8%). There was a significant negative correlation between serum TNF activity and GDR in affected cows (r=-0.56, p<0.01). These results indicate that serum TNF activity is correlated with insulin resistance in cows with fatty liver.
Random-Forest Classification of High-Resolution Remote Sensing Images and Ndsm Over Urban Areas

NASA Astrophysics Data System (ADS)

Sun, X. F.; Lin, X. G.

2017-09-01

As an intermediate step between raw remote sensing data and digital urban maps, remote sensing data classification has been a challenging and long-standing research problem in the community of remote sensing. In this work, an effective classification method is proposed for classifying high-resolution remote sensing data over urban areas. Starting from high resolution multi-spectral images and 3D geometry data, our method proceeds in three main stages: feature extraction, classification, and classified result refinement. First, we extract color, vegetation index and texture features from the multi-spectral image and compute the height, elevation texture and differential morphological profile (DMP) features from the 3D geometry data. Then in the classification stage, multiple random forest (RF) classifiers are trained separately, then combined to form a RF ensemble to estimate each sample's category probabilities. Finally the probabilities along with the feature importance indicator outputted by RF ensemble are used to construct a fully connected conditional random field (FCCRF) graph model, by which the classification results are refined through mean-field based statistical inference. Experiments on the ISPRS Semantic Labeling Contest dataset show that our proposed 3-stage method achieves 86.9% overall accuracy on the test data.

The Role of Ontologies in Schema-based Program Synthesis

NASA Technical Reports Server (NTRS)

Bures, Tomas; Denney, Ewen; Fischer, Bernd; Nistor, Eugen C.

2004-01-01

Program synthesis is the process of automatically deriving executable code from (non-executable) high-level specifications. It is more flexible and powerful than conventional code generation techniques that simply translate algorithmic specifications into lower-level code or only create code skeletons from structural specifications (such as UML class diagrams). Key to building a successful synthesis system is specializing to an appropriate application domain. The AUTOBAYES and AUTOFILTER systems, under development at NASA Ames, operate in the two domains of data analysis and state estimation, respectively. The central concept of both systems is the schema, a representation of reusable computational knowledge. This can take various forms, including high-level algorithm templates, code optimizations, datatype refinements, or architectural information. A schema also contains applicability conditions that are used to determine when it can be applied safely. These conditions can refer to the initial specification, to intermediate results, or to elements of the partially-instantiated code. Schema-based synthesis uses AI technology to recursively apply schemas to gradually refine a specification into executable code. This process proceeds in two main phases. A front-end gradually transforms the problem specification into a program represented in an abstract intermediate code. A backend then compiles this further down into a concrete target programming language of choice. A core engine applies schemas on the initial problem specification, then uses the output of those schemas as the input for other schemas, until the full implementation is generated. Since there might be different schemas that implement different solutions to the same problem this process can generate an entire solution tree. AUTOBAYES and AUTOFILTER have reached the level of maturity where they enable users to solve interesting application problems, e.g., the analysis of Hubble Space Telescope images. They are large (in total around 100kLoC Prolog), knowledge intensive systems that employ complex symbolic reasoning to generate a wide range of non-trivial programs for complex application do- mains. Their schemas can have complex interactions, which make it hard to change them in isolation or even understand what an existing schema actually does. Adding more capabilities by increasing the number of schemas will only worsen this situation, ultimately leading to the entropy death of the synthesis system. The root came of this problem is that the domain knowledge is scattered throughout the entire system and only represented implicitly in the schema implementations. In our current work, we are addressing this problem by making explicit the knowledge from Merent parts of the synthesis system. Here; we discuss how Gruber's definition of an ontology as an explicit specification of a conceptualization matches our efforts in identifying and explicating the domain-specific concepts. We outline the dual role ontologies play in schema-based synthesis and argue that they address different audiences and serve different purposes. Their first role is descriptive: they serve as explicit documentation, and help to understand the internal structure of the system. Their second role is prescriptive: they provide the formal basis against which the other parts of the system (e.g., schemas) can be checked. Their final role is referential: ontologies also provide semantically meaningful "hooks" which allow schemas and tools to access the internal state of the program derivation process (e.g., fragments of the generated code) in domain-specific rather than language-specific terms, and thus to modify it in a controlled fashion. For discussion purposes we use AUTOLINEAR, a small synthesis system we are currently experimenting with, which can generate code for solving a system of linear equations, Az = b.
A UML-based ontology for describing hospital information system architectures.

PubMed

Winter, A; Brigl, B; Wendt, T

2001-01-01

To control the heterogeneity inherent to hospital information systems the information management needs appropriate hospital information systems modeling methods or techniques. This paper shows that, for several reasons, available modeling approaches are not able to answer relevant questions of information management. To overcome this major deficiency we offer an UML-based ontology for describing hospital information systems architectures. This ontology views at three layers: the domain layer, the logical tool layer, and the physical tool layer, and defines the relevant components. The relations between these components, especially between components of different layers make the answering of our information management questions possible.
Structuration and acquisition of medical knowledge. Using UMLS in the conceptual graph formalism.

PubMed Central

Volot, F.; Zweigenbaum, P.; Bachimont, B.; Ben Said, M.; Bouaud, J.; Fieschi, M.; Boisvieux, J. F.

1993-01-01

The use of a taxonomy, such as the concept type lattice (CTL) of Conceptual Graphs, is a central structuring piece in a knowledge-based system. The knowledge it contains is constantly used by the system, and its structure provides a guide for the acquisition of other pieces of knowledge. We show how UMLS can be used as a knowledge resource to build a CTL and how the CTL can help the process of acquisition for other kinds of knowledge. We illustrate this method in the context of the MENELAS natural language understanding project. PMID:8130568
Getting Smarter at Managing Avionic Software: The Results of a Two-Day Requirements Elicitation Workshop With DTAES

DTIC Science & Technology

2007-10-01

Visualisation – Diagrammes et graphes : Il y a un manque de bonnes visualisations propres aux systèmes embarqués et en temps réel. Les structures et les...propriétés dynamiques devraient être visualisées. Les diagrammes d’état sont sous-utilisés et plusieurs des diagrammes d’UML et de SysML devraient être...evolution of ObjectTime ∗ (RMC) There is currently a project that is funded by AERAC to study the impact of MDD on avionic software ∗ (RMC) UML is fast
Statechart-based design controllers for FPGA partial reconfiguration

NASA Astrophysics Data System (ADS)

Łabiak, Grzegorz; Wegrzyn, Marek; Rosado Muñoz, Alfredo

2015-09-01

Statechart diagram and UML technique can be a vital part of early conceptual modeling. At the present time there is no much support in hardware design methodologies for reconfiguration features of reprogrammable devices. Authors try to bridge the gap between imprecise UML model and formal HDL description. The key concept in author's proposal is to describe the behavior of the digital controller by statechart diagrams and to map some parts of the behavior into reprogrammable logic by means of group of states which forms sequential automaton. The whole process is illustrated by the example with experimental results.
Maternal Serum Soluble CD30 Is Increased in Normal Pregnancy, but Decreased in Preeclampsia and Small for Gestational Age Pregnancies

PubMed Central

Kusanovic, Juan Pedro; Romero, Roberto; Hassan, Sonia S.; Gotsch, Francesca; Edwin, Samuel; Erez, Offer; Mittal, Pooja; Mazaki-Tovi, Shali; Soto, Eleazar; Than, Nandor Gabor; Friel, Lara A.; Chaiworapongsa, Tinnakorn; Yoon, Bo Hyun; Espinoza, Jimmy

2008-01-01

Objective Women with preeclampsia and those who deliver small for gestational age (SGA) neonates are characterized by intravascular inflammation (T helper 1 (Th1)-biased immune response). There is controversy about the T helper 2 (Th2) response in preeclampsia and SGA. CD30, a member of the tumor necrosis factor receptor superfamily, is preferentially expressed in vitro and in vivo by activated T cells producing Th2-type cytokines. Its soluble form (sCD30) has been proposed to be an index of Th2 immune response. The objective of this study was to determine whether maternal serum concentration of sCD30 changes with normal pregnancy, as well as in mothers with preeclampsia and those who deliver SGA neonates. Methods This cross-sectional study included patients in the following groups: (1) non-pregnant women (N=49); (2) patients with a normal pregnancy (N=89); (3) patients with preeclampsia (N=100); and (4) patients who delivered an SGA neonates (N=78). Maternal serum concentration of sCD30 was measured by a specific and sensitive enzyme-linked immunoassay. Non-parametric tests with post-hoc analysis were used for comparisons. A p value <0.05 was considered statistically significant. Results (1) The median sCD30 serum concentration of pregnant women was significantly higher than that of non-pregnant women (median: 29.7 U/mL, range: 12.2-313.2 vs. median: 23.2 U/mL, range: 14.6-195.1, respectively; p=0.01); (2) Patients with preeclampsia had a significantly lower median serum concentration of sCD30 than normal pregnant women (median: 24.7 U/mL, range: 7.6-71.2 vs. median: 29.7 U/mL, range: 12.2-313.2, respectively; p<0.05); (3) Mothers with SGA neonates had a lower median concentration of sCD30 than normal pregnant women (median: 23.4 U/mL, range: 7.1-105.3 vs. median: 29.7 U/mL, range: 12.2-313.2, respectively; p<0.05); and (4) There was no significant correlation (r=-0.059, p=0.5) between maternal serum sCD30 concentration and gestational age (19-38 weeks) in normal pregnant women. Conclusions (1) Patients with preeclampsia and those who deliver a SGA neonate had a significantly lower serum concentration of sCD30 than normal pregnant women; (2) This finding is consistent with the view that preeclampsia and SGA are associated with a polarized Th1 immune response and, perhaps, a reduced Th2 response. PMID:17853188
A novel pyrogallol red-based assay to assess catalase activity: Optimization by response surface methodology.

PubMed

Abderrahim, Mohamed; Arribas, Silvia M; Condezo-Hoyos, Luis

2017-05-01

Pyrogallol red (PGR) was identified as a novel optical probe for the detection of hydrogen peroxide (H 2 O 2 ) based on horseradish peroxidase (HRP)-catalyzed oxidation. Response surface methodology (RSM) was applied as a tool to optimize the concentrations of PGR (100µmolL -1 ), HRP (1UmL -1 ) and H 2 O 2 (250µmolL -1 ) and used to develop a sensitive PGR-based catalase (CAT) activity assay (PGR-CAT assay). N-ethylmaleimide -NEM- (102mmolL -1 ) was used to avoid interference produced by thiol groups while protecting CAT activity. Incubation time (30min) for samples or CAT used as standard and H 2 O 2 as well as signal stability (stable between 5 and 60min) were also evaluated. PGR-CAT assay was linear within the range of 0-4UmL -1 (R 2 =0.993) and very sensitive with limits of detection (LOD) of 0.005UmL -1 and quantitation (LOQ) of 0.01UmL -1 . PGR-CAT assay showed an adequate intra-day RSD=0.6-9.5% and inter-day RSD=2.4-8.9%. Bland-Altman analysis and Passing-Bablok and Pearson correlation analysis showed good agreement between CAT activity as measured by the PRG-CAT assay and the Amplex Red assay. The PGR-CAT assay is more sensitive than all the other colorimetric assays reported, particularly the Amplex Red assay, and the cost of PGR is a small fraction (about 1/1000) of that of an Amplex Red probe, so it can be expected to find wide use among scientists studying CAT activity in biological samples. Copyright Â© 2017 Elsevier B.V. All rights reserved.
Sustained glycaemic control and less nocturnal hypoglycaemia with insulin glargine 300U/mL compared with glargine 100U/mL in Japanese adults with type 1 diabetes (EDITION JP 1 randomised 12-month trial including 6-month extension).

PubMed

Matsuhisa, Munehide; Koyama, Masayoshi; Cheng, Xi; Sumi, Mariko; Riddle, Matthew C; Bolli, Geremia B; Hirose, Takahisa

2016-12-01

To evaluate the efficacy and safety of insulin glargine 300U/mL (Gla-300) versus glargine 100U/mL (Gla-100) in adults with type 1 diabetes in Japan over 12months. EDITION JP 1 was a multicentre, randomised, open-label phase 3 study. Following a 6-month on-treatment period, participants continued to receive Gla-300 or Gla-100 once daily, plus mealtime insulin, over a 6-month open-label extension phase. HbA1c, hypoglycaemia, body weight and adverse events were assessed. Overall, 114/122 (93%) and 114/121 (94%) of participants in the Gla-300 and Gla-100 group, respectively, completed the 6-month extension phase. Glycaemic control was sustained in both groups up to month 12 (mean HbA1c: Gla-300, 7.9% [62mmol/mol]; Gla-100, 7.8% [62mmol/mol]). Annualised rates of hypoglycaemia were lower with Gla-300 versus Gla-100; significantly for nocturnal confirmed (<3.0mmol/L [<54mg/dL]) or severe hypoglycaemia (2.39 and 3.85 events per participant-year; rate ratio: 0.62 [0.39-0.97]). No between-treatment differences in mean body weight change or adverse events were observed. Over 12months' treatment, participants with type 1 diabetes receiving Gla-300 achieved sustained glycaemic control and experienced less nocturnal hypoglycaemia that was confirmed (<3.0mmol/L [<54mg/dL]) or severe compared with Gla-100, supporting the 6-month results. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Effects of Inteferons on Human B-cell Differentiation in vitro

PubMed Central

Kim, Samyong; Stoetter, Hans; Heimpel, Herrman

1987-01-01

The effects of interferons (IFN) on in vitro differentiation of B-lymphocytes were studied. Peripheral lymphocytes from normal subjects were cultivated under polyclonal activator pokeweed mitogen (PWN) or Epstein-Barr virus (EBV) stimulation. The secreted Ig in the culture supernatants were measured for IgM by ELISA method. To determine the cellular level of IFN action T-cell enriched fraction (Te) or B-cell enriched fraction (Be) were preincubated with IFN prior to recombination culture. IFN had modulatory activities on Ig production; at low to moderately high doses (10–1000 U/ml of IFN-alpha or 12–120 U/ml of IFN-gamma) stimulating when IFN was added until 48 hr after the start of the culture, while after 72 hr from culture start IFN suppressed Ig production. Preincubation of Be-cells with moderately high doses of IFN (120 U/ml of IFN-gamma or 1000 U/ml of IFN-alpha) prior to PWM-stimulation suppressed Ig production. Likewise, in EBV-stimulated culture, high dose IFN suppressed Ig production. But low dose of IFN enhanced ig production in EBV-stimulated culture. Preincubation of Te-cells with IFN prior to PWM-stimulation with Be-cells enhanced the Ig production. The T-cell subset analysis at the end of these culture showed enhanced ratio of T-helper cell relative to T-suppressor cells, suggesting increased T-helper cell proliferation after incubation with IFN. Thus, it is concluded that IFNs have modulatory activities on B-cell differentiation. The mechanism seems to be direct effects on B-cells (in PWM and EBV system) as well as through T-helper cell mediation (PWM system). The IFN-gamma showed more potent (2-to 6-fold) stimulatory activities than IFN-alpha. PMID:2484953
Screening of agricultural wastes as a medium production of catalase for enzymatic fuel cell by Neurospora crassa InaCC F226

NASA Astrophysics Data System (ADS)

Santoso, Pugoh; Yopi

2017-12-01

Explorations of local microorganisms from Indonesia that can produce of catalase are still limited. Neurospora crassa is a fungus which resulting of two kinds of catalase, namely catalase-1 and catalase-3. We studied the production of catalase by Neurospora crassa (no. F226) from Indonesia Culture Collection (InaCC) in Solid State Fermentation (SSF). Among four screened agro wastes (corn cob, rice straw, oil palm empty fruit bunches, and bagasse), rice straw and oil palm empty fruit bunches (OPEFB) were remarked as the most promising substrate suited for the excellent growth and adequate production of catalase. Based on the result, the method of solid state fermentation was suitable to production of catalase. It is caused that the medium served to maintain microbial growth and metabolism. The filamentous filament is more suitable for living on solid media because it has a high tolerance to low water activity, and it has a high potential to excrete hydrolytic enzymes that caused of its morphology. The filamentous filament morphology allows the fungus to form colonies and penetrate the solid substrates in order to obtain nutrients. The results showed that the highest catalase activity was obtained on rice straw and oil palm empty fruit bunches medium with catalase activity of 39.1 U/mL and 37,7 U/mL in 50% moisture content medium, respectively. Optimization of humidity and pH medium in the rice straw were investigated which is the highest activity obtained in 30% moisture content and pH medium of 6. The catalase activity was reached in the value of 53.761 U/mL and 56.903 U/mL by incubated 48 hours and 96 hours, respectively.
Can preoperative and postoperative CA19-9 levels predict survival and early recurrence in patients with resectable hilar cholangiocarcinoma?

PubMed

Wang, Jun-Ke; Hu, Hai-Jie; Shrestha, Anuj; Ma, Wen-Jie; Yang, Qin; Liu, Fei; Cheng, Nan-Sheng; Li, Fu-Yu

2017-07-11

To investigate the predictive values of preoperative and postoperative serum CA19-9 levels on survival and other prognostic factors including early recurrence in patients with resectable hilar cholangiocarcinoma. In univariate analysis, increased preoperative and postoperative CA19-9 levels in the light of different cut-off points (37, 100, 150, 200, 400, 1000 U/ml) were significantly associated with poor survival outcomes, of which the cut-off point of 150 U/ml showed the strongest predictive value (both P < 0.001). Preoperative to postoperative increase in CA19-9 level was also correlated with poor survival outcome (P < 0.001). In multivariate analysis, preoperative CA19-9 level > 150 U/ml was significantly associated with lymph node metastasis (OR = 3.471, 95% CI 1.216-9.905; P = 0.020) and early recurrence (OR = 8.280, 95% CI 2.391-28.674; P = 0.001). Meanwhile, postoperative CA19-9 level > 150 U/ml was also correlated with early recurrence (OR = 4.006, 95% CI 1.107-14.459; P = 0.034). Ninety-eight patients who had undergone curative surgery for hilar cholangiocarcinoma between 1995 and 2014 in our institution were selected for the study. The correlations of preoperative and postoperative serum CA19-9 levels on the basis of different cut-off points with survival and various tumor factors were retrospectively analyzed with univariate and multivariate methods. In patients with resectable hilar cholangiocarcinoma, serum CA19-9 predict survival and early recurrence. Patients with increased preoperative and postoperative CA19-9 levels have poor survival outcomes and higher tendency of early recurrence.
Prediagnostic plasma antibody levels to periodontopathic bacteria and risk of coronary heart disease.

PubMed

Ueno, Masayuki; Izumi, Yuichi; Kawaguchi, Yoko; Ikeda, Ai; Iso, Hiroyasu; Inoue, Manami; Tsugane, Shoichiro

2012-01-01

Many epidemiological studies have indicated that periodontitis is an important risk factor for coronary heart disease (CHD). We examined whether plasma antibody levels to 3 major periodontal pathogens, Aggregatibacter actinomycetemcomitans, Porphyromonas gingivalis, and Prevotella intermedia predicted the risk of CHD events. A nested case-control research design (case: n = 191, control: n = 382), by matching gender, age, study area, date of blood collection, and time since last meal at blood collection, was employed in a large cohort of Japanese community residents.Antibody levels of periodontopathic bacteria were associated with risk of CHD after adjusting for BMI, smoking status, alcohol intake, history of hypertension, history of diabetes mellitus, exercise during leisure time, and perceived mental stress. The association was different by age subgroup. For subjects aged 40-55 years, the medium (31.7-184.9 U/mL) or high tertile plasma antibody level (> 184.9 U/mL) of A. actinomycetemcomitans showed higher risk of CHD (medium: OR = 3.72; 95% CI = 1.20-11.56, high: OR = 4.64; 95% CI = 1.52-14.18) than the low tertile level (< 31.7 U/mL). The ORs of CHD incidence became higher with an increase in IgG level of A. actinomycetemcomitans (P for trend = 0.007). For subjects aged 56-69 years, the high tertile level (> 414.1 U/mL) of P. intermedia was associated with higher risk of CHD (OR = 2.65; 95% CI = 1.18-5.94) in a dose-response fashion (P for trend = 0.007). The possible role of periodontopathic bacteria as a risk factor for CHD incidence was suggested by the results of this study by the elevated antibody level to these bacteria with the increased risk of CHD.
Effects of interleukins on connective tissue type mast cells co-cultured with fibroblasts.

PubMed Central

Levi-Schaffer, F; Segal, V; Shalit, M

1991-01-01

We investigated the effects of interleukin-2 (IL-2), interleukin-3 (IL-3) and interleukin-4 (IL-4) on mouse and rat peritoneal mast cells (MC) co-cultured with 3T3 fibroblasts (MC/3T3). The continuous presence of these cytokines for 7-9 days in the culture media was neither toxic nor caused proliferation of MC, as determined by the stability of MC numbers in culture. Long-term incubation of mouse MC/3T3 with IL-2 (100 U/ml), IL-3 (50 U/ml), IL-4 (50 U/ml) or a mixture of IL-3 and IL-4 (25 U/ml) induced an increase in basal histamine release of 79.3 +/- 19.0%, 41.0 +/- 17.3%, 25.2 +/- 10.4% and 30.2 +/- 3.2%, respectively, over control cells incubated with medium alone. When rat MC/3T3 were incubated for 7 days with the various interleukins an enhancement in histamine release similar to that observed with mouse MC/3T3 was found. Preincubation (1 hr) of rat MC/3T3 with interleukins prior to immunological activation with anti-IgE antibodies enhanced histamine release. The highest effect was observed with IL-3 + IL-4 (60.4 +/- 10.8% increase) followed by IL-2 (51.5 +/- 4.5%), IL-4 (28.6 +/- 10.3%) and IL-3 (13.2 +/- 4.2%). This study demonstrates that when mouse and rat peritoneal MC are cultured with fibroblasts in the presence of interleukins they do not proliferate, suggesting that they preserve their connective tissue type MC phenotype. Moreover, interleukins display a pro-inflammatory effect on these cells by enhancing both basal and anti-IgE-mediated histamine release. PMID:2016117
Processing of poultry feathers by alkaline keratin hydrolyzing enzyme from Serratia sp. HPC 1383.

PubMed

Khardenavis, Anshuman A; Kapley, Atya; Purohit, Hemant J

2009-04-01

The present study describes the production and characterization of a feather hydrolyzing enzyme by Serratia sp. HPC 1383 isolated from tannery sludge, which was identified by the ability to form clear zones around colonies on milk agar plates. The proteolytic activity was expressed in terms of the micromoles of tyrosine released from substrate casein per ml per min (U/mL min). Induction of the inoculum with protein was essential to stimulate higher activity of the enzyme, with 0.03% feathermeal in the inoculum resulting in increased enzyme activity (45U/mL) that further increased to 90U/mL when 3d old inoculum was used. The highest enzyme activity, 130U/mL, was observed in the presence of 0.2% yeast extract. The optimum assay temperature and pH for the enzyme were found to be 60 degrees C and 10.0, respectively. The enzyme had a half-life of 10min at 60 degrees C, which improved slightly to 18min in presence of 1mM Ca(2+). Inhibition of the enzyme by phenylmethyl sulfonyl fluoride (PMSF) indicated that the enzyme was a serine protease. The enzyme was also partially inhibited (39%) by the reducing agent beta-mercaptoethanol and by divalent metal ions such as Zn(2+) (41% inhibition). However, Ca(2+) and Fe(2+) resulted in increases in enzyme activity of 15% and 26%, respectively. The kinetic constants of the keratinase were found to be 3.84 microM (K(m)) and 108.7 microM/mLmin (V(max)). These results suggest that this extracellular keratinase may be a useful alternative and eco-friendly route for handling the abundant amount of waste feathers or for applications in other industrial processes.
Glycaemic control and hypoglycaemia in people with type 2 diabetes switching from twice-daily basal insulin to once-daily insulin glargine 300 U/mL or insulin glargine 100 U/mL (EDITION 1 and EDITION 2 subgroup analysis).

PubMed

Roussel, Ronan; d'Emden, Michael C; Fisher, Miles; Ampudia-Blasco, F Javier; Stella, Peter; Bizet, Florence; Cali, Anna M G; Wysham, Carol H

2018-02-01

In this post hoc analysis we compared glycaemic control and hypoglycaemia between insulin glargine 300 U/mL (Gla-300) and glargine 100 U/mL (Gla-100) administered once daily in people with type 2 diabetes (T2DM) from the EDITION 1 (basal plus mealtime insulin) and EDITION 2 (basal insulin plus oral antihyperglycaemic drugs) trials who were previously receiving twice-daily insulin. At randomization, 16.9% and 20.0% of people in EDITION 1 and 2, respectively, were receiving twice-daily basal insulin. Glycated haemoglobin change from baseline to Month 6 was similar over 6 months with Gla-300 or Gla-100 (least squares mean difference -0.01%; 95% confidence interval [CI] -0.27 to 0.24] in EDITION 1 and 0.16%; 95% CI -0.25 to 0.57, in EDITION 2). Participants previously receiving twice-daily insulin in EDITION 1 had a lower risk of confirmed (≤3.9 mmol/L [≤70 mg/dL]) or severe hypoglycaemia with Gla-300 vs Gla-100 at night (00:00-05:59 hours), but not at any time (24 hours); in EDITION 2 the risk was reduced at night and any time (24 hours). In conclusion, Gla-300 provided similar glycaemic control with less hypoglycaemia compared with Gla-100 in people with T2DM switching from twice-daily to once-daily basal insulin. © 2017 The Authors. Diabetes, Obesity and Metabolism published by John Wiley & Sons Ltd.
Expression of food-grade phytase in Lactococcus lactis from optimized conditions in milk broth.

PubMed

Miao, Yuzhi; Xu, Hui; Fei, Baojin; Qiao, Dairong; Cao, Yi

2013-07-01

The major objective of this study was to engineer lactic acid bacteria to produce the enzyme phytase from a gene native to Bacillus subtilis GYPB04. The phytase gene (phyC) of B. subtilis GYPB04 was cloned into the plasmid pMG36e for expression in Lactococcus lactis. The enzyme activity in L. lactis cultured in GM17 broth was 20.25 U/mL at 36°C. The expressed phytase was characterized as active in a pH range of 2.0-9.0 at a temperature range of 20-80°C, with an optimum pH of 5.5-6.5 and temperature of 60°C. When cultured in food-grade milk broth, the transformed L. lactis grew to an OD(600 nm) value of 1.05 and had a phytase yield of 13.58 U/mL. In same broth under optimized conditions for cell growth and phytase production, the transformant reached an OD(600 nm) value of 1.68 and a phytase yield of 42.12 U/mL, representing approximately 1.6-fold and 3.1-fold increases, respectively, compared to growth in natural milk broth. Fermentation was scaled to 5 L under optimized conditions, and product analysis revealed a final OD(600 nm) value of 1.89 and an extracellular enzyme activity of 24.23 U/mL. The results of this study may be used in the dairy fermentation industry for the development of functional, healthy yogurts and other fermented dairy foods that provide both active phytase and viable probiotics to the consumer. Copyright © 2013 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Soluble CD30 and HLA antibodies as potential risk factors for kidney transplant rejection.

PubMed

Slavcev, Antonij; Lácha, Jiri; Honsová, Eva; Sajdlová, Helena; Lodererová, Alena; Vitko, Stefan; Skibová, Jelena; Striz, Ilja

2005-06-01

Recent literary data suggest that high pre- and post-transplant serum levels of the soluble CD30 (sCD30) molecule may be a risk factor for acute rejection and worse prognosis of the transplanted kidney. The aim of our study was to correlate the concentrations of sCD30 and the presence of HLA antibodies as defined by flow cytometry and ELISA with the clinical course and graft prognosis after transplantation. One hundred and seventeen kidney transplant patients were included into the study. The incidence of rejection episodes, graft function and graft survival for up to 1 year post-transplant were evaluated. Soluble CD30 levels before transplantation were virtually the same in patients who experienced rejection and in non-rejecting patients. In both patient groups, a significant decrease of sCD30 was detected 2 weeks after transplantation (104.4 U/ml before vs. 37.0 U/ml post-transplant, P < 0.001). However, there was a substantial difference in the level of decrease of sCD30 between rejecting and non-rejecting patients. Patients without rejection had lower sCD30 values (31.2 U/ml post-transplant) compared to patients who experienced rejection episodes (62.9 U/ml), P < 0.04. Multifactorial analysis showed that antibodies to HLA class II antigens and elevated concentrations of sCD30 shortly after transplantation were associated with increased risk for acute rejection in the first post-transplant year. Measurement of soluble CD30 after transplantation, taken into consideration with the presence of HLA class II antibodies, might be helpful for evaluating the potential risk for acute rejection.
The utility of serum CA-125 in predicting extra-uterine disease in apparent early-stage endometrial cancer.

PubMed

Nicklin, James; Janda, Monika; Gebski, Val; Jobling, Thomas; Land, Russell; Manolitsas, Tom; McCartney, Anthony; Nascimento, Marcelo; Perrin, Lewis; Baker, Jannah F; Obermair, Andreas

2012-08-15

Surgical staging in early-stage uterine cancer is controversial. Preoperative serum CA-125 may be of clinical value in predicting the presence of extra-uterine disease in patients with apparent early-stage endometrial cancer. Between October 6, 2005, and June 17, 2010, 760 patients were enrolled in an international, multicentre, prospective randomized trial (LACE) comparing laparotomy with laparoscopy in the management of endometrial cancer apparently confined to the uterus. Of these, 657 patients with endometrial adenocarcinoma had a preoperative serum CA-125 value recorded. Multiple cross-validation analysis was undertaken to correlate preoperative serum CA-125 with stage of disease (Stage I vs. Stage II+) after surgery. Patients' median preoperative serum CA-125 was 14 U/ml. A cutoff point of 30 U/ml was associated with the smallest misclassification error, and using this cutoff, 98 patients (14.9%) had elevated CA-125 levels. Of those, 36 (36.7%) had evidence of extra-uterine disease. Of the 116 patients (17.7%) with evidence of extra-uterine disease, 31.0% had an elevated CA-125 level. On univariate and multivariable logistic regression analysis, only preoperative CA-125 level, but no other preoperative clinical characteristics were found to be associated with extra-uterine spread of disease. Utilizing a cutoff point of 30 U/ml achieved a sensitivity, specificity, positive predictive value and negative predictive value of 31.0, 88.5, 36.7 and 85.7%, respectively. Elevated CA-125 above 30 U/ml in patients with apparent early-stage disease is a risk factor for the presence of extra-uterine disease and may assist clinicians in the management of patients with clinical Stage I endometrial cancer. Copyright © 2011 UICC.
Towards Modeling False Memory With Computational Knowledge Bases.

PubMed

Li, Justin; Kohanyi, Emma

2017-01-01

One challenge to creating realistic cognitive models of memory is the inability to account for the vast common-sense knowledge of human participants. Large computational knowledge bases such as WordNet and DBpedia may offer a solution to this problem but may pose other challenges. This paper explores some of these difficulties through a semantic network spreading activation model of the Deese-Roediger-McDermott false memory task. In three experiments, we show that these knowledge bases only capture a subset of human associations, while irrelevant information introduces noise and makes efficient modeling difficult. We conclude that the contents of these knowledge bases must be augmented and, more important, that the algorithms must be refined and optimized, before large knowledge bases can be widely used for cognitive modeling. Copyright © 2016 Cognitive Science Society, Inc.
Workshop on nuclear technology: A joint effort between ANS and the University of Massachusetts-Lowell

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, G.J.; McDevitt, M.A.; Schmidt, D.

1992-01-01

The University of Massachusetts Lowell (UML) (formerly University of Lowell) sponsored, along with the American Nuclear Society (ANS), a 5-day workshop entitled 'Understanding and Teaching about Nuclear Technology and Its Place in Our Society.' More than 30 middle and high school teachers from the New England area (Connecticut, New Hampshire, and Massachusetts) attended the workshop, which was held June 24 through 28, 1991. Based on this experience, and with the expectation of replicating if not improving upon initial success, plans are now under way to offer a similar workshop at UML from June 29 through July 3, 1992.

Arranging ISO 13606 archetypes into a knowledge base.

PubMed

Kopanitsa, Georgy

2014-01-01

To enable the efficient reuse of standard based medical data we propose to develop a higher level information model that will complement the archetype model of ISO 13606. This model will make use of the relationships that are specified in UML to connect medical archetypes into a knowledge base within a repository. UML connectors were analyzed for their ability to be applied in the implementation of a higher level model that will establish relationships between archetypes. An information model was developed using XML Schema notation. The model allows linking different archetypes of one repository into a knowledge base. Presently it supports several relationships and will be advanced in future.
Concept locator: a client-server application for retrieval of UMLS metathesaurus concepts through complex boolean query.

PubMed

Nadkarni, P M

1997-08-01

Concept Locator (CL) is a client-server application that accesses a Sybase relational database server containing a subset of the UMLS Metathesaurus for the purpose of retrieval of concepts corresponding to one or more query expressions supplied to it. CL's query grammar permits complex Boolean expressions, wildcard patterns, and parenthesized (nested) subexpressions. CL translates the query expressions supplied to it into one or more SQL statements that actually perform the retrieval. The generated SQL is optimized by the client to take advantage of the strengths of the server's query optimizer, and sidesteps its weaknesses, so that execution is reasonably efficient.
Analysis and design of hospital management information system based on UML

NASA Astrophysics Data System (ADS)

Ma, Lin; Zhao, Huifang; You, Shi Jun; Ge, Wenyong

2018-05-01

With the rapid development of computer technology, computer information management system has been utilized in many industries. Hospital Information System (HIS) is in favor of providing data for directors, lightening the workload for the medical workers, and improving the workers efficiency. According to the HIS demand analysis and system design, this paper focus on utilizing unified modeling language (UML) models to establish the use case diagram, class diagram, sequence chart and collaboration diagram, and satisfying the demands of the daily patient visit, inpatient, drug management and other relevant operations. At last, the paper summarizes the problems of the system and puts forward an outlook of the HIS system.
Tannase enzyme production by entrapped cells of Aspergillus niger FETL FT3 in submerged culture system.

PubMed

Darah, I; Sumathi, G; Jain, K; Lim, S H

2011-09-01

The ability of immobilized cell cultures of Aspergillus niger FETL FT3 to produce extracellular tannase was investigated. The production of enzyme was increased by entrapping the fungus in scouring mesh cubes compared to free cells. Using optimized parameters of six scouring mesh cubes and inoculum size of 1 × 10(6) spores/mL, the tannase production of 3.98 U/mL was obtained from the immobilized cells compared to free cells (2.81 U/mL). It was about 41.64% increment. The immobilized cultures exhibited significant tannase production stability of two repeated runs.
Using UMLS to map from a library to a clinical classification: Improving the functionality of a digital library.

PubMed

Robinson, Judas; de Lusignan, Simon; Kostkova, Patty; Madge, Bruce

2006-01-01

The Metathesaurus of the Unified Medical Language System (UMLS) offers the possibility of mapping between various medical vocabularies. The Primary Care Electronic Library (PCEL) contains a database of over six thousand Medical Subject Headings (MeSH terms) describing the resources of the electronic library. We were interested to know if it was possible to map from MeSH to the Systemized Nomenclature of Medicine Clinical Terms (SNOMED CT). Such a mapping would aid healthcare professionals to retrieve relevant data from our digital library as it would enable links between clinical systems and indexed material.
Concepts and Synonymy in the UMLS Metathesaurus

PubMed Central

Merrill, Gary H.

2009-01-01

This paper advances a detailed exploration of the complex relationships among terms, concepts, and synonymy in the UMLS (Uniﬁed Medical Language System) Metathesaurus, and proposes the study and understanding of the Metathesaurus from a model-theoretic perspective. Initial sections provide the background and motivation for such an approach, and a careful informal treatment of these notions is offered as a context and basis for the formal analysis. What emerges from this is a set of puzzles and confusions in the Metathesaurus and its literature pertaining to synonymy and its relation to terms and concepts. A model theory for a segment of the Metathesaurus is then constructed, and its adequacy relative to the informal treatment is demonstrated. Finally, it is shown how this approach clariﬁes and addresses the puzzles educed from the informal discussion, and how the model-theoretic perspective may be employed to evaluate some fundamental criticisms of the Metathesaurus. For users of the UMLS, two signiﬁcant results of this analysis are a rigorous clariﬁcation of the different senses of synonymy that appear in treatments of the Metathesaurus and an illustration of the dangers in computing inferences involving ambiguous terms. PMID:19838995
Conceptual Model of Clinical Governance Information System for Statistical Indicators by Using UML in Two Sample Hospitals.

PubMed

Jeddi, Fatemeh Rangraz; Farzandipoor, Mehrdad; Arabfard, Masoud; Hosseini, Azam Haj Mohammad

2014-04-01

The purpose of this study was investigating situation and presenting a conceptual model for clinical governance information system by using UML in two sample hospitals. However, use of information is one of the fundamental components of clinical governance; but unfortunately, it does not pay much attention to information management. A cross sectional study was conducted in October 2012- May 2013. Data were gathered through questionnaires and interviews in two sample hospitals. Face and content validity of the questionnaire has been confirmed by experts. Data were collected from a pilot hospital and reforms were carried out and Final questionnaire was prepared. Data were analyzed by descriptive statistics and SPSS 16 software. With the scenario derived from questionnaires, UML diagrams are presented by using Rational Rose 7 software. The results showed that 32.14 percent Indicators of the hospitals were calculated. Database was not designed and 100 percent of the hospital's clinical governance was required to create a database. Clinical governance unit of hospitals to perform its mission, do not have access to all the needed indicators. Defining of Processes and drawing of models and creating of database are essential for designing of information systems.
Conceptual Model of Clinical Governance Information System for Statistical Indicators by Using UML in Two Sample Hospitals.

PubMed

Jeddi, Fatemeh Rangraz; Farzandipoor, Mehrdad; Arabfard, Masoud; Hosseini, Azam Haj Mohammad

2016-04-01

The purpose of this study was investigating situation and presenting a conceptual model for clinical governance information system by using UML in two sample hospitals. However, use of information is one of the fundamental components of clinical governance; but unfortunately, it does not pay much attention to information management. A cross sectional study was conducted in October 2012- May 2013. Data were gathered through questionnaires and interviews in two sample hospitals. Face and content validity of the questionnaire has been confirmed by experts. Data were collected from a pilot hospital and reforms were carried out and Final questionnaire was prepared. Data were analyzed by descriptive statistics and SPSS 16 software. With the scenario derived from questionnaires, UML diagrams are presented by using Rational Rose 7 software. The results showed that 32.14 percent Indicators of the hospitals were calculated. Database was not designed and 100 percent of the hospital's clinical governance was required to create a database. Clinical governance unit of hospitals to perform its mission, do not have access to all the needed indicators. Defining of Processes and drawing of models and creating of database are essential for designing of information systems.
State-Chart Autocoder

NASA Technical Reports Server (NTRS)

Clark, Kenneth; Watney, Garth; Murray, Alexander; Benowitz, Edward

2007-01-01

A computer program translates Unified Modeling Language (UML) representations of state charts into source code in the C, C++, and Python computing languages. ( State charts signifies graphical descriptions of states and state transitions of a spacecraft or other complex system.) The UML representations constituting the input to this program are generated by using a UML-compliant graphical design program to draw the state charts. The generated source code is consistent with the "quantum programming" approach, which is so named because it involves discrete states and state transitions that have features in common with states and state transitions in quantum mechanics. Quantum programming enables efficient implementation of state charts, suitable for real-time embedded flight software. In addition to source code, the autocoder program generates a graphical-user-interface (GUI) program that, in turn, generates a display of state transitions in response to events triggered by the user. The GUI program is wrapped around, and can be used to exercise the state-chart behavior of, the generated source code. Once the expected state-chart behavior is confirmed, the generated source code can be augmented with a software interface to the rest of the software with which the source code is required to interact.
Agricultural waste from the tequila industry as substrate for the production of commercially important enzymes.

PubMed

Huitron, C; Perez, R; Sanchez, A E; Lappe, P; Rocha Zavaleta, L

2008-01-01

Approximately 1 million tons of Agave tequilana plants are processed annually by the Mexican Tequila industry generating vast amounts of agricultural waste. The aim of this study was to investigate the potential use of Agave tequilana waste as substrate for the production of commercially important enzymes. Two strains of Aspergillus niger (CH-A-2010 and CH-A-2016), isolated from agave fields, were found to grow and propagate in submerged cultures using Agave tequilana waste as substrate. Isolates showed simultaneous extracellular inulinase, xylanase, pectinase, and cellulase activities. Aspergillus CH-A-2010 showed the highest production of inulinase activity (1.48 U/ml), whereas Aspergillus niger CH-A-2016 produced the highest xylanase (1.52 U/ml) and endo-pectinase (2.7U/ml) activities. In both cases production of enzyme activities was significantly higher on Agave tequilana waste than that observed on lemon peel and specific polymeric carbohydrates. Enzymatic hydrolysis of raw A. tequilana stems and leaves, by enzymes secreted by the isolates yielded maximum concentrations of reducing sugars of 28.2 g/l, and 9.9 g/l respectively. In conclusion, Agave tequilana waste can be utilized as substrate for the production of important biotechnological enzymes.
Unified Modeling Language (UML) for hospital-based cancer registration processes.

PubMed

Shiki, Naomi; Ohno, Yuko; Fujii, Ayumi; Murata, Taizo; Matsumura, Yasushi

2008-01-01

Hospital-based cancer registry involves complex processing steps that span across multiple departments. In addition, management techniques and registration procedures differ depending on each medical facility. Establishing processes for hospital-based cancer registry requires clarifying specific functions and labor needed. In recent years, the business modeling technique, in which management evaluation is done by clearly spelling out processes and functions, has been applied to business process analysis. However, there are few analytical reports describing the applications of these concepts to medical-related work. In this study, we initially sought to model hospital-based cancer registration processes using the Unified Modeling Language (UML), to clarify functions. The object of this study was the cancer registry of Osaka University Hospital. We organized the hospital-based cancer registration processes based on interview and observational surveys, and produced an As-Is model using activity, use-case, and class diagrams. After drafting every UML model, it was fed-back to practitioners to check its validity and improved. We were able to define the workflow for each department using activity diagrams. In addition, by using use-case diagrams we were able to classify each department within the hospital as a system, and thereby specify the core processes and staff that were responsible for each department. The class diagrams were effective in systematically organizing the information to be used for hospital-based cancer registries. Using UML modeling, hospital-based cancer registration processes were broadly classified into three separate processes, namely, registration tasks, quality control, and filing data. An additional 14 functions were also extracted. Many tasks take place within the hospital-based cancer registry office, but the process of providing information spans across multiple departments. Moreover, additional tasks were required in comparison to using a standardized system because the hospital-based cancer registration system was constructed with the pre-existing computer system in Osaka University Hospital. Difficulty of utilization of useful information for cancer registration processes was shown to increase the task workload. By using UML, we were able to clarify functions and extract the typical processes for a hospital-based cancer registry. Modeling can provide a basis of process analysis for establishment of efficient hospital-based cancer registration processes in each institute.
Lesion correlates of patholinguistic profiles in chronic aphasia: comparisons of syndrome-, modality- and symptom-level assessment.

PubMed

Henseler, Ilona; Regenbrecht, Frank; Obrig, Hellmuth

2014-03-01

One way to investigate the neuronal underpinnings of language competence is to correlate patholinguistic profiles of aphasic patients to corresponding lesion sites. Constituting the beginnings of aphasiology and neurolinguistics over a century ago, this approach has been revived and refined in the past decade by statistical approaches mapping continuous variables (providing metrics that are not simply categorical) on voxel-wise lesion information (voxel-based lesion-symptom mapping). Here we investigate whether and how voxel-based lesion-symptom mapping allows us to delineate specific lesion patterns for differentially fine-grained clinical classifications. The latter encompass 'classical' syndrome-based approaches (e.g. Broca's aphasia), more symptom-oriented descriptions (e.g. agrammatism) and further refinement to linguistic sub-functions (e.g. lexico-semantic deficits for inanimate versus animate items). From a large database of patients treated for aphasia of different aetiologies (n = 1167) a carefully selected group of 102 first ever ischaemic stroke patients with chronic aphasia (∅ 12 months) were included in a VLSM analysis. Specifically, we investigated how performance in the Aachen Aphasia Test-the standard clinical test battery for chronic aphasia in German-relates to distinct brain lesions. The Aachen Aphasia Test evaluates aphasia on different levels: a non-parametric discriminant procedure yields probabilities for the allocation to one of the four 'standard' syndromes (Broca, Wernicke, global and amnestic aphasia), whereas standardized subtests target linguistic modalities (e.g. repetition), or even more specific symptoms (e.g. phoneme repetition). Because some subtests of the Aachen Aphasia Test (e.g. for the linguistic level of lexico-semantics) rely on rather coarse and heterogeneous test items we complemented the analysis with a number of more detailed clinically used tests in selected mostly mildly affected subgroups of patients. Our results indicate that: (i) Aachen Aphasia Test-based syndrome allocation allows for an unexpectedly concise differentiation between 'Broca's' and 'Wernicke's' aphasia corresponding to non-overlapping anterior and posterior lesion sites; whereas (ii) analyses for modalities and specific symptoms yielded more circumscribed but partially overlapping lesion foci, often cutting across the above syndrome territories; and (iii) especially for lexico-semantic capacities more specialized clinical test-batteries are required to delineate precise lesion patterns at this linguistic level. In sum this is the first report on a successful lesion-delineation of syndrome-based aphasia classification highlighting the relevance of vascular distribution for the syndrome level while confirming and extending a number of more linguistically motivated differentiations, based on clinically used tests. We consider such a comprehensive view reaching from the syndrome to a fine-grained symptom-oriented assessment mandatory to converge neurolinguistic, patholinguistic and clinical-therapeutic knowledge on language-competence and impairment.
Expert2OWL: A Methodology for Pattern-Based Ontology Development.

PubMed

Tahar, Kais; Xu, Jie; Herre, Heinrich

2017-01-01

The formalization of expert knowledge enables a broad spectrum of applications employing ontologies as underlying technology. These include eLearning, Semantic Web and expert systems. However, the manual construction of such ontologies is time-consuming and thus expensive. Moreover, experts are often unfamiliar with the syntax and semantics of formal ontology languages such as OWL and usually have no experience in developing formal ontologies. To overcome these barriers, we developed a new method and tool, called Expert2OWL that provides efficient features to support the construction of OWL ontologies using GFO (General Formal Ontology) as a top-level ontology. This method allows a close and effective collaboration between ontologists and domain experts. Essentially, this tool integrates Excel spreadsheets as part of a pattern-based ontology development and refinement process. Expert2OWL enables us to expedite the development process and modularize the resulting ontologies. We applied this method in the field of Chinese Herbal Medicine (CHM) and used Expert2OWL to automatically generate an accurate Chinese Herbology ontology (CHO). The expressivity of CHO was tested and evaluated using ontology query languages SPARQL and DL. CHO shows promising results and can generate answers to important scientific questions such as which Chinese herbal formulas contain which substances, which substances treat which diseases, and which ones are the most frequently used in CHM.
The Semantic Distance Task: Quantifying Semantic Distance with Semantic Network Path Length

ERIC Educational Resources Information Center

Kenett, Yoed N.; Levi, Effi; Anaki, David; Faust, Miriam

2017-01-01

Semantic distance is a determining factor in cognitive processes, such as semantic priming, operating upon semantic memory. The main computational approach to compute semantic distance is through latent semantic analysis (LSA). However, objections have been raised against this approach, mainly in its failure at predicting semantic priming. We…
A UML model for the description of different brain-computer interface systems.

PubMed

Quitadamo, Lucia Rita; Abbafati, Manuel; Saggio, Giovanni; Marciani, Maria Grazia; Cardarilli, Gian Carlo; Bianchi, Luigi

2008-01-01

BCI research lacks a universal descriptive language among labs and a unique standard model for the description of BCI systems. This results in a serious problem in comparing performances of different BCI processes and in unifying tools and resources. In such a view we implemented a Unified Modeling Language (UML) model for the description virtually of any BCI protocol and we demonstrated that it can be successfully applied to the most common ones such as P300, mu-rhythms, SCP, SSVEP, fMRI. Finally we illustrated the advantages in utilizing a standard terminology for BCIs and how the same basic structure can be successfully adopted for the implementation of new systems.
Knowledge acquisition to qualify Unified Medical Language System interconceptual relationships.

PubMed Central

Le Duff, F.; Burgun, A.; Cleret, M.; Pouliquen, B.; Barac'h, V.; Le Beux, P.

2000-01-01

Adding automatically relations between concepts from a database to a knowledge base such as the Unified Medical Language System can be very useful to increase the consistency of the latter one. But the transfer of qualified relationships is more interesting. The most important interest of these new acquisitions is that the UMLS became more compliant and medically pertinent to be used in different medical applications. This paper describes the possibility to inherit automatically medical inter-conceptual relationships qualifiers from a disease description included into a database and to integrate them into the UMLS knowledge base. The paper focuses on the transmission of knowledge from a French medical database to an English one. PMID:11079930
Generating Models of Surgical Procedures using UMLS Concepts and Multiple Sequence Alignment

PubMed Central

Meng, Frank; D’Avolio, Leonard W.; Chen, Andrew A.; Taira, Ricky K.; Kangarloo, Hooshang

2005-01-01

Surgical procedures can be viewed as a process composed of a sequence of steps performed on, by, or with the patient’s anatomy. This sequence is typically the pattern followed by surgeons when generating surgical report narratives for documenting surgical procedures. This paper describes a methodology for semi-automatically deriving a model of conducted surgeries, utilizing a sequence of derived Unified Medical Language System (UMLS) concepts for representing surgical procedures. A multiple sequence alignment was computed from a collection of such sequences and was used for generating the model. These models have the potential of being useful in a variety of informatics applications such as information retrieval and automatic document generation. PMID:16779094
eSPEM - A SPEM Extension for Enactable Behavior Modeling

NASA Astrophysics Data System (ADS)

Ellner, Ralf; Al-Hilank, Samir; Drexler, Johannes; Jung, Martin; Kips, Detlef; Philippsen, Michael

OMG's SPEM - by means of its (semi-)formal notation - allows for a detailed description of development processes and methodologies, but can only be used for a rather coarse description of their behavior. Concepts for a more fine-grained behavior model are considered out of scope of the SPEM standard and have to be provided by other standards like BPDM/BPMN or UML. However, a coarse granularity of the behavior model often impedes a computer-aided enactment of a process model. Therefore, in this paper we present eSPEM, an extension of SPEM, that is based on the UML meta-model and focused on fine-grained behavior and life-cycle modeling and thereby supports automated enactment of development processes.
Semantic Memory in the Clinical Progression of Alzheimer Disease.

PubMed

Tchakoute, Christophe T; Sainani, Kristin L; Henderson, Victor W

2017-09-01

Semantic memory measures may be useful in tracking and predicting progression of Alzheimer disease. We investigated relationships among semantic memory tasks and their 1-year predictive value in women with Alzheimer disease. We conducted secondary analyses of a randomized clinical trial of raloxifene in 42 women with late-onset mild-to-moderate Alzheimer disease. We assessed semantic memory with tests of oral confrontation naming, category fluency, semantic recognition and semantic naming, and semantic density in written narrative discourse. We measured global cognition (Alzheimer Disease Assessment Scale, cognitive subscale), dementia severity (Clinical Dementia Rating sum of boxes), and daily function (Activities of Daily Living Inventory) at baseline and 1 year. At baseline and 1 year, most semantic memory scores correlated highly or moderately with each other and with global cognition, dementia severity, and daily function. Semantic memory task performance at 1 year had worsened one-third to one-half standard deviation. Factor analysis of baseline test scores distinguished processes in semantic and lexical retrieval (semantic recognition, semantic naming, confrontation naming) from processes in lexical search (semantic density, category fluency). The semantic-lexical retrieval factor predicted global cognition at 1 year. Considered separately, baseline confrontation naming and category fluency predicted dementia severity, while semantic recognition and a composite of semantic recognition and semantic naming predicted global cognition. No individual semantic memory test predicted daily function. Semantic-lexical retrieval and lexical search may represent distinct aspects of semantic memory. Semantic memory processes are sensitive to cognitive decline and dementia severity in Alzheimer disease.
Characterizing the reliability of a bioMEMS-based cantilever sensor

NASA Astrophysics Data System (ADS)

Bhalerao, Kaustubh D.

2004-12-01

The cantilever-based BioMEMS sensor represents one instance from many competing ideas of biosensor technology based on Micro Electro Mechanical Systems. The advancement of BioMEMS from laboratory-scale experiments to applications in the field will require standardization of their components and manufacturing procedures as well as frameworks to evaluate their performance. Reliability, the likelihood with which a system performs its intended task, is a compact mathematical description of its performance. The mathematical and statistical foundation of systems-reliability has been applied to the cantilever-based BioMEMS sensor. The sensor is designed to detect one aspect of human ovarian cancer, namely the over-expression of the folate receptor surface protein (FR-alpha). Even as the application chosen is clinically motivated, the objective of this study was to demonstrate the underlying systems-based methodology used to design, develop and evaluate the sensor. The framework development can be readily extended to other BioMEMS-based devices for disease detection and will have an impact in the rapidly growing $30 bn industry. The Unified Modeling Language (UML) is a systems-based framework for design and development of object-oriented information systems which has potential application for use in systems designed to interact with biological environments. The UML has been used to abstract and describe the application of the biosensor, to identify key components of the biosensor, and the technology needed to link them together in a coherent manner. The use of the framework is also demonstrated in computation of system reliability from first principles as a function of the structure and materials of the biosensor. The outcomes of applying the systems-based framework to the study are the following: (1) Characterizing the cantilever-based MEMS device for disease (cell) detection. (2) Development of a novel chemical interface between the analyte and the sensor that provides a degree of selectivity towards the disease. (3) Demonstrating the performance and measuring the reliability of the biosensor prototype, and (4) Identification of opportunities in technological development in order to further refine the proposed biosensor. Application of the methodology to design develop and evaluate the reliability of BioMEMS devices will be beneficial in the streamlining the growth of the BioMEMS industry, while providing a decision-support tool in comparing and adopting suitable technologies from available competing options.

Conceptual Model of Clinical Governance Information System for Statistical Indicators by Using UML in Two Sample Hospitals

PubMed Central

Jeddi, Fatemeh Rangraz; Farzandipoor, Mehrdad; Arabfard, Masoud; Hosseini, Azam Haj Mohammad

2016-01-01

Objective: The purpose of this study was investigating situation and presenting a conceptual model for clinical governance information system by using UML in two sample hospitals. Background: However, use of information is one of the fundamental components of clinical governance; but unfortunately, it does not pay much attention to information management. Material and Methods: A cross sectional study was conducted in October 2012- May 2013. Data were gathered through questionnaires and interviews in two sample hospitals. Face and content validity of the questionnaire has been confirmed by experts. Data were collected from a pilot hospital and reforms were carried out and Final questionnaire was prepared. Data were analyzed by descriptive statistics and SPSS 16 software. Results: With the scenario derived from questionnaires, UML diagrams are presented by using Rational Rose 7 software. The results showed that 32.14 percent Indicators of the hospitals were calculated. Database was not designed and 100 percent of the hospital’s clinical governance was required to create a database. Conclusion: Clinical governance unit of hospitals to perform its mission, do not have access to all the needed indicators. Defining of Processes and drawing of models and creating of database are essential for designing of information systems. PMID:27147804
Conceptual Model of Clinical Governance Information System for Statistical Indicators by Using UML in Two Sample Hospitals

PubMed Central

Jeddi, Fatemeh Rangraz; Farzandipoor, Mehrdad; Arabfard, Masoud; Hosseini, Azam Haj Mohammad

2014-01-01

Objective: The purpose of this study was investigating situation and presenting a conceptual model for clinical governance information system by using UML in two sample hospitals. Background: However, use of information is one of the fundamental components of clinical governance; but unfortunately, it does not pay much attention to information management. Material and Methods: A cross sectional study was conducted in October 2012- May 2013. Data were gathered through questionnaires and interviews in two sample hospitals. Face and content validity of the questionnaire has been confirmed by experts. Data were collected from a pilot hospital and reforms were carried out and Final questionnaire was prepared. Data were analyzed by descriptive statistics and SPSS 16 software. Results: With the scenario derived from questionnaires, UML diagrams are presented by using Rational Rose 7 software. The results showed that 32.14 percent Indicators of the hospitals were calculated. Database was not designed and 100 percent of the hospital’s clinical governance was required to create a database. Conclusion: Clinical governance unit of hospitals to perform its mission, do not have access to all the needed indicators. Defining of Processes and drawing of models and creating of database are essential for designing of information systems. PMID:24825933
Evaluation of various parameters of calcium-alginate immobilization method for enhanced alkaline protease production by Bacillus licheniformis NCIM-2042 using statistical methods.

PubMed

Potumarthi, Ravichandra; Subhakar, Ch; Pavani, A; Jetty, Annapurna

2008-04-01

Calcium-alginate immobilization method for the production of alkaline protease by Bacillus licheniformis NCIM-2042 was optimized statistically. Four variables, such as sodium-alginate concentration, calcium chloride concentration, inoculum size and agitation speed were optimized by 2(4) full factorial central composite design and subsequent analysis and model validation by a second-order regression equation. Eleven carbon, 11 organic nitrogen and seven inorganic nitrogen sources were screened by two-level Plackett-Burman design for maximum alkaline protease production by using optimized immobilized conditions. The levels of four variables, such as Na-alginate 2.78%; CaCl(2), 2.15%; inoculum size, 8.10% and agitation, 139 rpm were found to be optimum for maximal production of protease. Glucose, soybean meal and ammonium sulfate were resulted in maximum protease production at 644 U/ml, 720 U/ml, and 806 U/ml when screened for carbon, organic nitrogen and inorganic nitrogen sources, respectively, using optimized immobilization conditions. Repeated fed batch mode of operation, using optimized immobilized conditions, resulted in continuous operation for 12 cycles without disintegration of beads. Cross-sectional scanning electron microscope images have shown the growth pattern of B. licheniformis in Ca-alginate immobilized beads.
A review method for UML requirements analysis model employing system-side prototyping.

PubMed

Ogata, Shinpei; Matsuura, Saeko

2013-12-01

User interface prototyping is an effective method for users to validate the requirements defined by analysts at an early stage of a software development. However, a user interface prototype system offers weak support for the analysts to verify the consistency of the specifications about internal aspects of a system such as business logic. As the result, the inconsistency causes a lot of rework costs because the inconsistency often makes the developers impossible to actualize the system based on the specifications. For verifying such consistency, functional prototyping is an effective method for the analysts, but it needs a lot of costs and more detailed specifications. In this paper, we propose a review method so that analysts can verify the consistency among several different kinds of diagrams in UML efficiently by employing system-side prototyping without the detailed model. The system-side prototype system does not have any functions to achieve business logic, but visualizes the results of the integration among the diagrams in UML as Web pages. The usefulness of our proposal was evaluated by applying our proposal into a development of Library Management System (LMS) for a laboratory. This development was conducted by a group. As the result, our proposal was useful for discovering the serious inconsistency caused by the misunderstanding among the members of the group.
Expression of IFN-Inducible Genes with Antiviral Function OAS1 and MX1 in Health and under Conditions of Recurrent Herpes Simplex Infection.

PubMed

Karaulov, A V; Shulzhenko, A E; Karsonova, A V

2017-07-01

We studied the expression of IFN-inducible genes OAS1 and Mx1 in lysates of peripheral blood mononuclear cells from patients suffering from recurrent Herpes simplex infections in comparison with healthy people. To induce the expression of the studied genes, blood mononuclears were incubated with recombinant IFN-α2b in concentrations of 1, 10, and 100 U/ml for 3 h and then the content of the studied transcripts was evaluated. Relative expression of OAS1 and Mx1 in patients with recurrent forms of Herpes simplex both during the acute stage and clinical remission did not differ significantly from that in healthy people after stimulation with IFN-α2b in a concentration of 1 U/ml and in higher concentrations (10 and 100 U/ml). It was concluded that intracellular signal transduction in IFN-α-activated cells in vitro was not disturbed in patients with recurrent forms of Herpes simplex infection. Thus, the reported phenomenon of IFN-signalling distortion by Herpes simplex virus proteins observed in experiments on model cell lines infected with Herpes simplex virus was not confirmed in our experiments on peripheral blood mononuclear cells from patients with Herpes simplex infection.
VLTI auxiliary telescopes: a full object-oriented approach

NASA Astrophysics Data System (ADS)

Chiozzi, Gianluca; Duhoux, Philippe; Karban, Robert

2000-06-01

The Very Large Telescope (VLT) Telescope Control Software (TCS) is a portable system. It is now in use or will be used in a whole family of ESO telescopes VLT Unit Telescopes, VLTI Auxiliary Telescopes, NTT, La Silla 3.6, VLT Survey Telescope and Astronomical Site Monitors in Paranal and La Silla). Although it has been developed making extensive usage of Object Oriented (OO) methodologies, the overall development process chosen at the beginning of the project used traditional methods. In order to warranty a longer lifetime to the system (improving documentation and maintainability) and to prepare for future projects, we have introduced a full OO process. We have taken as a basis the United Software Development Process with the Unified Modeling Language (UML) and we have adapted the process to our specific needs. This paper describes how the process has been applied to the VLTI Auxiliary Telescopes Control Software (ATCS). The ATCS is based on the portable VLT TCS, but some subsystems are new or have specific characteristics. The complete process has been applied to the new subsystems, while reused code has been integrated in the UML models. We have used the ATCS on one side to tune the process and train the team members and on the other side to provide a UML and WWW based documentation for the portable VLT TCS.
Isolation and screening of endophytes from the rhizomes of some Zingiberaceae plants for L-asparaginase production.

PubMed

Krishnapura, Prajna Rao; Belur, Prasanna D

2016-01-01

Endophytes are described as microorganisms that colonize the internal tissues of healthy plants without causing any disease. Endophytes isolated from medicinal plants have been attracting considerable attention due to their high biodiversity and their predicted potential to produce a plethora of novel compounds. In this study, an attempt was made to isolate endophytes from rhizomes of five medicinal plants of Zingiberaceae family, and to screen the endophytes for L-asparaginase activity. In total, 50 endophytes (14 bacteria, 22 actinomycetes, and 14 fungi) were isolated from Alpinia galanga, Curcuma amada, Curcuma longa, Hedychium coronarium, and Zingiber officinale; of these, 31 endophytes evidenced positive for L-asparaginase production. All the L-asparaginase-positive isolates showed L-asparaginase activity in the range of 54.17-155.93 U/mL in unoptimized medium. An endophytic fungus isolated from Curcuma amada, identified as Talaromyces pinophilus, was used for further experiments involving studies on the effect of certain nutritional and nonnutritional factors on L-asparaginase production in submerged fermentation. Talaromyces pinophilus initially gave an enzyme activity of 108.95 U/mL, but gradually reduced to 80 U/mL due to strain degeneration. Perhaps this is the first report ever on the production of L-asparaginase from endophytes isolated from medicinal plants of Zingiberaceae family.
Characterization of a thermophilic cellulase from Geobacillus sp. HTA426, an efficient cellulase-producer on alkali pretreated of lignocellulosic biomass.

PubMed

Potprommanee, Laddawan; Wang, Xiao-Qin; Han, Ye-Ju; Nyobe, Didonc; Peng, Yen-Ping; Huang, Qing; Liu, Jing-Yong; Liao, Yu-Ling; Chang, Ken-Lin

2017-01-01

A themophilic cellulase-producing bacterium was isolated from a hot spring district and identified as Geobacillus sp. HTA426. The cellulase enzyme produced by the Geobacillus sp. HTA426 was purified through ammonium sulfate precipitation and ion exchange chromatography, with the recovery yield and fold purification of 10.14% and 5.12, respectively. The purified cellulase has a molecular weight of 40 kDa. The optimum temperature and pH for carboxymethyl cellulase (CMCase) activity of the purified cellulase were 60°C and pH 7.0, respectively. The enzyme was also stable over a wide temperature range of 50°C to 70°C after 5 h of incubation. Moreover, the strain HTA426 was able to grow and produce cellulase on alkali-treated sugarcane bagasse, rice straw and water hyacinth as carbon sources. Enzymatic hydrolysis of sugarcane bagasse, which was regarded as the most effective carbon source for cellulase production (CMCase activity = 103.67 U/mL), followed by rice straw (74.70 U/mL) and water hyacinth (51.10 U/mL). This strain producing an efficient thermostable cellulose is a potential candidate for developing a more efficient and cost-effective process for converting lignocellulosic biomass into biofuel and other industrial process.
Identification and characterization of an anaerobic ethanol-producing cellulolytic bacterial consortium from Great Basin hot springs with agricultural residues and energy crops.

PubMed

Zhao, Chao; Deng, Yunjin; Wang, Xingna; Li, Qiuzhe; Huang, Yifan; Liu, Bin

2014-09-01

In order to obtain the cellulolytic bacterial consortia, sediments from Great Basin hot springs (Nevada, USA) were sampled and enriched with cellulosic biomass as the sole carbon source. The bacterial composition of the resulting anaerobic ethanol-producing celluloytic bacterial consortium, named SV79, was analyzed. With methods of the full-length 16S rRNA librarybased analysis and denaturing gradient gel electrophoresis, 21 bacteria belonging to eight genera were detected from this consortium. Clones with closest relation to the genera Acetivibrio, Clostridium, Cellulosilyticum, Ruminococcus, and Sporomusa were predominant. The cellulase activities and ethanol productions of consortium SV79 using different agricultural residues (sugarcane bagasse and spent mushroom substrate) and energy crops (Spartina anglica, Miscanthus floridulus, and Pennisetum sinese Roxb) were studied. During cultivation, consortium SV79 produced the maximum filter paper activity (FPase, 9.41 U/ml), carboxymethylcellulase activity (CMCase, 6.35 U/ml), and xylanase activity (4.28 U/ml) with sugarcane bagasse, spent mushroom substrate, and S. anglica, respectively. The ethanol production using M. floridulus as substrate was up to 2.63 mM ethanol/g using gas chromatography analysis. It has high potential to be a new candidate for producing ethanol with cellulosic biomass under anoxic conditions in natural environments.
Characterization of a thermophilic cellulase from Geobacillus sp. HTA426, an efficient cellulase-producer on alkali pretreated of lignocellulosic biomass

PubMed Central

Potprommanee, Laddawan; Wang, Xiao-Qin; Han, Ye-Ju; Nyobe, Didonc; Peng, Yen-Ping; Huang, Qing; Liu, Jing-yong; Liao, Yu-Ling; Chang, Ken-Lin

2017-01-01

A themophilic cellulase-producing bacterium was isolated from a hot spring district and identified as Geobacillus sp. HTA426. The cellulase enzyme produced by the Geobacillus sp. HTA426 was purified through ammonium sulfate precipitation and ion exchange chromatography, with the recovery yield and fold purification of 10.14% and 5.12, respectively. The purified cellulase has a molecular weight of 40 kDa. The optimum temperature and pH for carboxymethyl cellulase (CMCase) activity of the purified cellulase were 60°C and pH 7.0, respectively. The enzyme was also stable over a wide temperature range of 50°C to 70°C after 5 h of incubation. Moreover, the strain HTA426 was able to grow and produce cellulase on alkali-treated sugarcane bagasse, rice straw and water hyacinth as carbon sources. Enzymatic hydrolysis of sugarcane bagasse, which was regarded as the most effective carbon source for cellulase production (CMCase activity = 103.67 U/mL), followed by rice straw (74.70 U/mL) and water hyacinth (51.10 U/mL). This strain producing an efficient thermostable cellulose is a potential candidate for developing a more efficient and cost-effective process for converting lignocellulosic biomass into biofuel and other industrial process. PMID:28406925
Membrane damage effect of therapeutic ultrasound on Ehrlich ascitic tumor cells.

PubMed

Hao, Qiao; Liu, Quanhong; Wang, Xiaobing; Wang, Pan; Li, Tao; Tong, Wan Yan

2009-02-01

The biologic effects and the underlying mechanisms of Ehrlich ascitic tumor (EAT) cells induced by ultrasound were investigated in this study. Cells were subjected to ultrasonic irradiation with a frequency of 2.17 MHz and an intensity of 3 W/cm(2) for variable periods of time. Trypan blue exclusion was used to detect the integrity of cellular membrane; the membrane permeability was investigated by the incorporation of fluorescein isothiocyanate dextran during ultrasound exposure; and the cell membrane ultrastructure changes were observed under a scanning electron microscope. The potential mechanism was estimated from the generation of hydroxyl radicals, the lipid peroxidation levels, and intracellular reactive oxygen radicals production. The cell membrane damage effects induced by ultrasound increased with a prolonged exposure time; the fluorescent rates of the cells irradiated with ultrasound for 30 and 60 seconds were 11.46% and 18.50%, respectively; the amount of hydroxyl radicals in 30 (26.10 U/mL) and 60 seconds (28.47 U/mL) were significantly enhanced, compared with the control group (24.44 U/mL); then, the level of lipid peroxidation was also changed from 0.27 to 0.54 (30 seconds) and 1.21 nmol/mL (60 seconds). Shear forces and free radicals produced by acoustic cavitation may play important roles in these actions.
Dietary Patterns After the Weaning and Lactation Period Are Associated With Celiac Disease Autoimmunity in Children.

PubMed

Barroso, Monica; Beth, Sytske A; Voortman, Trudy; Jaddoe, Vincent W V; van Zelm, Menno C; Moll, Henriette A; Kiefte-de Jong, Jessica C

2018-06-01

There have been many studies of associations between infant feeding practices and development of celiac disease during childhood, but few studies have focused on overall diets of young children after the weaning period. We aimed to examine the association between common dietary patterns in infants and the occurrence of celiac disease autoimmunity during childhood. We performed a prospective analysis of data from the Generation R Study that comprised 1997 children born from April 2002 through January 2006 in Rotterdam, the Netherlands. Food consumption around 1 year of age was assessed with a validated food-frequency questionnaire. Dietary data were examined using a priori (based on existing guidelines) and a posteriori (principal component analysis and reduced rank regression) dietary pattern analyses. Five dietary patterns were compared. Celiac disease autoimmunity, determined on the basis of serum concentration of transglutaminase-2 autoantibody (ie, TG2A) below or above 7 U/mL, was evaluated at 6 years. Associations between dietary pattern adherence scores and celiac disease autoimmunity were examined using multivariable logistic regression models. Higher adherence to the a posteriori-derived prudent dietary pattern (high intake of vegetables, vegetable oils, pasta, and grains and low consumption of refined cereals and sweet beverages) at 1 year was significantly associated with lower odds of celiac disease autoimmunity at 6 years (odds ratio, 0.67; 95% confidence interval, 0.53-0.84). No significant associations were found for the 4 remaining dietary patterns. In a prospective study of dietary patterns of young children in the Netherlands, we associated a dietary pattern characterized by high consumption of vegetables and grains and low consumption of refined cereals and sweet beverages, with lower odds of celiac disease autoimmunity. Early-life dietary patterns might therefore be involved in the development of celiac disease during childhood. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Overlaid caption extraction in news video based on SVM

NASA Astrophysics Data System (ADS)

Liu, Manman; Su, Yuting; Ji, Zhong

2007-11-01

Overlaid caption in news video often carries condensed semantic information which is key cues for content-based video indexing and retrieval. However, it is still a challenging work to extract caption from video because of its complex background and low resolution. In this paper, we propose an effective overlaid caption extraction approach for news video. We first scan the video key frames using a small window, and then classify the blocks into the text and non-text ones via support vector machine (SVM), with statistical features extracted from the gray level co-occurrence matrices, the LH and HL sub-bands wavelet coefficients and the orientated edge intensity ratios. Finally morphological filtering and projection profile analysis are employed to localize and refine the candidate caption regions. Experiments show its high performance on four 30-minute news video programs.
Preoperative CA125 and fibrinogen in patients with endometrial cancer: a risk model for predicting lymphovascular space invasion

PubMed Central

2017-01-01

Objective The aim of this study was to build a model to predict the risk of lymphovascular space invasion (LVSI) in women with endometrial cancer (EC). Methods From December 2010 to June 2013, 211 patients with EC undergoing surgery at Shanghai First Maternity and Infant Hospital were enrolled in this retrospective study. Those patients were divided into a positive LVSI group and a negative LVSI group. The clinical and pathological characteristics were compared between the two groups; logistic regression was used to explore risk factors associated with LVSI occurrence. The threshold values of significant factors were calculated to build a risk model and predict LVSI. Results There were 190 patients who were negative for LVSI and 21 patients were positive for LVSI out of 211 patients with EC. It was found that tumor grade, depth of myometrial invasion, number of pelvic lymph nodes, and International Federation of Gynecology and Obstetrics (FIGO) stage (p<0.05) were associated with LVSI occurrence. However, cervical involvement and age (p>0.05) were not associated with LVSI. Receiver operating characteristic (ROC) curves revealed that the threshold values of the following factors were correlated with positive LVSI: 28.1 U/mL of CA19-9, 21.2 U/mL of CA125, 2.58 mg/dL of fibrinogen (Fn), 1.84 U/mL of carcinoembryonic antigen (CEA) and (6.35×109)/L of white blood cell (WBC). Logistic regression analysis indicated that CA125 ≥21.2 (p=0.032) and Fn ≥2.58 mg/dL (p=0.014) were significantly associated with LVSI. Conclusion Positive LVSI could be predicted by CA125 ≥21.2 U/mL and Fn ≥2.58 mg/dL in women with EC. It could help gynecologists better adapt surgical staging and adjuvant therapies. PMID:27894164
Prognostic index for chronic- and smoldering-type adult T-cell leukemia-lymphoma.

PubMed

Katsuya, Hiroo; Shimokawa, Mototsugu; Ishitsuka, Kenji; Kawai, Kazuhiro; Amano, Masahiro; Utsunomiya, Atae; Hino, Ryosuke; Hanada, Shuichi; Jo, Tatsuro; Tsukasaki, Kunihiro; Moriuchi, Yukiyoshi; Sueoka, Eisaburo; Yoshida, Shinichiro; Suzushima, Hitoshi; Miyahara, Masaharu; Yamashita, Kiyoshi; Eto, Tetsuya; Suzumiya, Junji; Tamura, Kazuo

2017-07-06

Adult T-cell leukemia-lymphoma (ATL) has been divided into 4 clinical subtypes: acute, lymphoma, chronic, and smoldering. The aim of this study is to develop a novel prognostic index (PI) for chronic and smoldering ATL. We conducted a nationwide retrospective survey on ATL patients, and 248 fully eligible individuals were used in this analysis. In the univariate analysis, sex, performance status, log 10 (soluble interleukin-2 receptor [sIL-2R]), neutrophils count, and lymphadenopathy showed values of P < .05 in training samples. A multivariate analysis was performed on these factors, and only log 10 (sIL-2R) was identified as an independent prognostic factor in training samples. Using a regression coefficient of this variable, a prognostic model was formulated to identify different levels of risk: indolent ATL-PI (iATL-PI) = 1.51 × log 10 (sIL-2R [U/mL]). The values calculated by iATL-PI were divided into 3 groups using a quartile point. In the validation sample, median survival times (MSTs) were 1.6 years, 5.5 years, and not reached for patients in the high-, intermediate-, and low-risk groups, respectively ( P < .0001). To make the scoring system clinically practicable, we simplified iATL-PI according to trichotomizing sIL-2R at 1000 and 6000 U/mL, using a quartile point. Patients with more than 6000 U/mL sIL-2R were categorized into the high-risk group, less than and equal to 1000 U/mL into the low-risk group, and the others into the intermediate-risk group, and MSTs were 1.6 years, not reached, and 5.5 years, respectively ( P < .0001). iATL-PI has potential as a novel tool for a risk-adapted therapeutic approach. © 2017 by The American Society of Hematology.
Artificial Intelligence versus Statistical Modeling and Optimization of Cholesterol Oxidase Production by using Streptomyces Sp.

PubMed Central

Niwas, Ram; Osama, Khwaja; Khan, Saif; Haque, Shafiul; Tripathi, C. K. M.; Mishra, B. N.

2015-01-01

Cholesterol oxidase (COD) is a bi-functional FAD-containing oxidoreductase which catalyzes the oxidation of cholesterol into 4-cholesten-3-one. The wider biological functions and clinical applications of COD have urged the screening, isolation and characterization of newer microbes from diverse habitats as a source of COD and optimization and over-production of COD for various uses. The practicability of statistical/ artificial intelligence techniques, such as response surface methodology (RSM), artificial neural network (ANN) and genetic algorithm (GA) have been tested to optimize the medium composition for the production of COD from novel strain Streptomyces sp. NCIM 5500. All experiments were performed according to the five factor central composite design (CCD) and the generated data was analysed using RSM and ANN. GA was employed to optimize the models generated by RSM and ANN. Based upon the predicted COD concentration, the model developed with ANN was found to be superior to the model developed with RSM. The RSM-GA approach predicted maximum of 6.283 U/mL COD production, whereas the ANN-GA approach predicted a maximum of 9.93 U/mL COD concentration. The optimum concentrations of the medium variables predicted through ANN-GA approach were: 1.431 g/50 mL soybean, 1.389 g/50 mL maltose, 0.029 g/50 mL MgSO4, 0.45 g/50 mL NaCl and 2.235 ml/50 mL glycerol. The experimental COD concentration was concurrent with the GA predicted yield and led to 9.75 U/mL COD production, which was nearly two times higher than the yield (4.2 U/mL) obtained with the un-optimized medium. This is the very first time we are reporting the statistical versus artificial intelligence based modeling and optimization of COD production by Streptomyces sp. NCIM 5500. PMID:26368924
Artificial Intelligence versus Statistical Modeling and Optimization of Cholesterol Oxidase Production by using Streptomyces Sp.

PubMed

Pathak, Lakshmi; Singh, Vineeta; Niwas, Ram; Osama, Khwaja; Khan, Saif; Haque, Shafiul; Tripathi, C K M; Mishra, B N

2015-01-01

Cholesterol oxidase (COD) is a bi-functional FAD-containing oxidoreductase which catalyzes the oxidation of cholesterol into 4-cholesten-3-one. The wider biological functions and clinical applications of COD have urged the screening, isolation and characterization of newer microbes from diverse habitats as a source of COD and optimization and over-production of COD for various uses. The practicability of statistical/ artificial intelligence techniques, such as response surface methodology (RSM), artificial neural network (ANN) and genetic algorithm (GA) have been tested to optimize the medium composition for the production of COD from novel strain Streptomyces sp. NCIM 5500. All experiments were performed according to the five factor central composite design (CCD) and the generated data was analysed using RSM and ANN. GA was employed to optimize the models generated by RSM and ANN. Based upon the predicted COD concentration, the model developed with ANN was found to be superior to the model developed with RSM. The RSM-GA approach predicted maximum of 6.283 U/mL COD production, whereas the ANN-GA approach predicted a maximum of 9.93 U/mL COD concentration. The optimum concentrations of the medium variables predicted through ANN-GA approach were: 1.431 g/50 mL soybean, 1.389 g/50 mL maltose, 0.029 g/50 mL MgSO4, 0.45 g/50 mL NaCl and 2.235 ml/50 mL glycerol. The experimental COD concentration was concurrent with the GA predicted yield and led to 9.75 U/mL COD production, which was nearly two times higher than the yield (4.2 U/mL) obtained with the un-optimized medium. This is the very first time we are reporting the statistical versus artificial intelligence based modeling and optimization of COD production by Streptomyces sp. NCIM 5500.
The effect of concomitant DPPIVi use on glycaemic control and hypoglycaemia with insulin glargine 300 U/mL (Gla-300) versus insulin glargine 100 U/mL (Gla-100) in people with type 2 diabetes: A patient-level meta-analysis of EDITION 2 and 3.

PubMed

Yale, Jean-François; Pettus, Jeremy Hodson; Brito-Sanfiel, Miguel; Lavalle-Gonzalez, Fernando; Merino-Trigo, Ana; Stella, Peter; Chevalier, Soazig; Buzzetti, Raffaella

2018-01-01

To evaluate the effect of concomitant dipeptidyl peptidase IV inhibitor (DPPIVi) use on efficacy and safety of insulin glargine 300 U/mL (Gla-300) versus glargine 100 U/mL (Gla-100) in people with type 2 diabetes on oral antihyperglycaemic drugs. A post hoc patient-level meta-analysis was performed using data from EDITION 2 (basal insulin [N = 811]) and EDITION 3 (insulin-naïve [N = 878]), multicentre, randomised, open-label, parallel-group, phase 3a trials of similar design. Endpoints analysed included HbA1c, hypoglycaemia and adverse events, investigated in subgroups of participants with and without concomitant DPPIVi use. Of 1689 participants randomised, 107 (13%, Gla-300) and 133 (16%, Gla-100) received DPPIVi therapy. The least squares mean change in HbA1c (baseline to month 6) was comparable between treatment groups, irrespective of DPPIVi use (no evidence of heterogeneity of treatment effect across subgroups, p = 0.753), although group sizes were unbalanced. The cumulative mean number of confirmed (≤3.9 mmol/L [≤70 mg/dL]) or severe hypoglycaemic events, and the risk and annualised rate of such events, were consistently lower for Gla-300 than Gla-100 during the night (between 00:00 and 05:59 h) or at any time of day (24 h period), irrespective of DPPIVi use. Severe hypoglycaemia occurred in 8/838 and 10/844 participants in the Gla-300 and Gla-100 groups, respectively, and was not affected by DPPIVi use. The adverse event profile was similar between treatment groups and DPPIVi subgroups. Glycaemic control with Gla-300 was comparable to Gla-100, with less hypoglycaemia during the night and at any time of day (24 h), irrespective of concomitant DPPIVi use. ClinicalTrials.gov NCT01499095; NCT01676220.
Efficacy and Safety of Switching from Insulin Glargine 100 U/mL to the Same Dose of Glargine 300 U/mL in Japanese Type 1 and 2 Diabetes Patients: A Retrospective Analysis.

PubMed

Nakanishi, Shuhei; Iwamoto, Masahiro; Kamei, Shinji; Hirukawa, Hidenori; Shimoda, Masashi; Tatsumi, Fuminori; Kohara, Kenji; Obata, Atsushi; Kimura, Tomohiko; Kinoshita, Tomoe; Irie, Shintaro; Sanada, Junpei; Fushimi, Yoshiro; Nishioka, Momoyo; Mizoguchi, Akiko; Kameyama, Miyuki; Mune, Tomoatsu; Kaku, Kohei; Kaneto, Hideaki

2018-01-01

Objective Insulin glargine [300 U/mL (Gla-300)] achieved better glycemic control and reduced the risk of hypoglycemia in comparison to glargine [100 U/mL; (Gla-100)] in phase 3 trials. This is the first study to retrospectively evaluate the efficacy and safety of Gla-300 in Japanese type 1 and 2 diabetes patients in a routine clinical setting. Methods We analyzed 20 type 1 diabetes patients and 62 type 2 diabetes patients who switched from Gla-100 to the same dose of Gla-300. Sixty type 2 diabetes patients who continued the use of Gla-100 during the study were included as controls. Results At three months after switching, the HbA1c levels were decreased in the patients with type 1 diabetes, but not to a significant extent. In the type 2 diabetes patients, the HbA1c levels were significantly decreased after switching (p<0.01). In contrast, there was no change in the HbA1c levels of the type 2 diabetes patients who continued the use of Gla-100 over the same period. The BMI values of the type 1 diabetes patients tended to decrease (p=0.06) and there was a significant decrease in the BMI values of the type 2 diabetes patients (p<0.05). There was no change in the BMI values of the type 2 diabetes patients who continued the use of Gla-100. The rates of hypoglycemia and adverse events did not change during the follow-up period. Conclusion In the clinical setting, switching from Gla-100 to the same dose of Gla-300 had a favorable effect on glycemic control and body weight control in Japanese type 1 and type 2 diabetes patients, without any increase in adverse events; however, a prospective study should be performed to confirm these findings.
A preoperative serum signature of CEA+/CA125+/CA19-9 ≥ 1000 U/mL indicates poor outcome to pancreatectomy for pancreatic cancer.

PubMed

Liu, Liang; Xu, Huaxiang; Wang, Wenquan; Wu, Chuntao; Chen, Yong; Yang, Jingxuan; Cen, Putao; Xu, Jin; Liu, Chen; Long, Jiang; Guha, Sushovan; Fu, Deliang; Ni, Quanxing; Jatoi, Aminah; Chari, Suresh; McCleary-Wheeler, Angela L; Fernandez-Zapico, Martin E; Li, Min; Yu, Xianjun

2015-05-01

Pancreatectomy is associated with significant morbidity and unpredictable outcome, with few diagnostic tools to determine, which patients gain the most benefit from this treatment, especially before the operation. This study aimed to define a preoperative signature panel of serum markers to indicate response to pancreatectomy for pancreatic cancer. Over 1000 patients with pancreatic cancer treated at two independent high-volume institutions were included in this study and were divided into three groups, including resected, locally advanced and metastatic. Eight serum tumor markers most commonly used in gastrointestinal cancers were analyzed for patient outcome. Preoperative CA19-9 independently indicated surgical response in pancreatic cancer. Patients with CA19-9 ≥1000 U/mL generally had a poor surgical benefit. However, a subset of these patients still achieved a survival advantage when CA19-9 levels decreased postoperatively. CEA and CA125 in the presence of CA19-9 ≥1000 U/mL could independently predict the non-decrease of CA19-9 postoperatively. The combination of the three markers was useful for predicting a worse surgical outcome with a median survival of 5.1 months vs. 23.0 months (p < 0.001) for the training cohort and 7.0 months vs. 18.2 months (p < 0.001) for the validation cohort and also suggested a higher prevalence of early distant metastasis after surgery. Resected patients with this proposed signature showed no survival advantage over patients in the locally advanced group who did not receive pancreatectomy. Therefore, a preoperative serum signature of CEA(+)/CA125(+)/CA19-9 ≥1000 U/mL is associated with poor surgical outcome and can be used to select appropriate patients with pancreatic cancer for pancreatectomy. © 2014 UICC.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.