Sample records for site features text

  1. TargetM6A: Identifying N6-Methyladenosine Sites From RNA Sequences via Position-Specific Nucleotide Propensities and a Support Vector Machine.

    PubMed

    Li, Guang-Qing; Liu, Zi; Shen, Hong-Bin; Yu, Dong-Jun

    2016-10-01

    As one of the most ubiquitous post-transcriptional modifications of RNA, N 6 -methyladenosine ( [Formula: see text]) plays an essential role in many vital biological processes. The identification of [Formula: see text] sites in RNAs is significantly important for both basic biomedical research and practical drug development. In this study, we designed a computational-based method, called TargetM6A, to rapidly and accurately target [Formula: see text] sites solely from the primary RNA sequences. Two new features, i.e., position-specific nucleotide/dinucleotide propensities (PSNP/PSDP), are introduced and combined with the traditional nucleotide composition (NC) feature to formulate RNA sequences. The extracted features are further optimized to obtain a much more compact and discriminative feature subset by applying an incremental feature selection (IFS) procedure. Based on the optimized feature subset, we trained TargetM6A on the training dataset with a support vector machine (SVM) as the prediction engine. We compared the proposed TargetM6A method with existing methods for predicting [Formula: see text] sites by performing stringent jackknife tests and independent validation tests on benchmark datasets. The experimental results show that the proposed TargetM6A method outperformed the existing methods for predicting [Formula: see text] sites and remarkably improved the prediction performances, with MCC = 0.526 and AUC = 0.818. We also provided a user-friendly web server for TargetM6A, which is publicly accessible for academic use at http://csbio.njust.edu.cn/bioinf/TargetM6A.

  2. Discovering body site and severity modifiers in clinical texts

    PubMed Central

    Dligach, Dmitriy; Bethard, Steven; Becker, Lee; Miller, Timothy; Savova, Guergana K

    2014-01-01

    Objective To research computational methods for discovering body site and severity modifiers in clinical texts. Methods We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. Results The performance of our method for discovering body site modifiers achieves F1 of 0.740–0.908 and our method for discovering severity modifiers achieves F1 of 0.905–0.929. Discussion Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. Conclusions We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES). PMID:24091648

  3. Discovering body site and severity modifiers in clinical texts.

    PubMed

    Dligach, Dmitriy; Bethard, Steven; Becker, Lee; Miller, Timothy; Savova, Guergana K

    2014-01-01

    To research computational methods for discovering body site and severity modifiers in clinical texts. We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. The performance of our method for discovering body site modifiers achieves F1 of 0.740-0.908 and our method for discovering severity modifiers achieves F1 of 0.905-0.929. Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES).

  4. 75 FR 22170 - Self-Regulatory Organizations; Chicago Board Options Exchange, Incorporated; Notice of Filing and...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-04-27

    ... Effectiveness of Proposed Rule Change To Remove a Feature and Revise Outdated Text Regarding Certain Execution... Proposed Rule Change The Exchange is proposing to eliminate a feature and revise outdated text regarding certain of its execution rules. The text of the proposed rule change is available on CBOE's Web site at...

  5. A machine-learning approach for predicting palmitoylation sites from integrated sequence-based features.

    PubMed

    Li, Liqi; Luo, Qifa; Xiao, Weidong; Li, Jinhui; Zhou, Shiwen; Li, Yongsheng; Zheng, Xiaoqi; Yang, Hua

    2017-02-01

    Palmitoylation is the covalent attachment of lipids to amino acid residues in proteins. As an important form of protein posttranslational modification, it increases the hydrophobicity of proteins, which contributes to the protein transportation, organelle localization, and functions, therefore plays an important role in a variety of cell biological processes. Identification of palmitoylation sites is necessary for understanding protein-protein interaction, protein stability, and activity. Since conventional experimental techniques to determine palmitoylation sites in proteins are both labor intensive and costly, a fast and accurate computational approach to predict palmitoylation sites from protein sequences is in urgent need. In this study, a support vector machine (SVM)-based method was proposed through integrating PSI-BLAST profile, physicochemical properties, [Formula: see text]-mer amino acid compositions (AACs), and [Formula: see text]-mer pseudo AACs into the principal feature vector. A recursive feature selection scheme was subsequently implemented to single out the most discriminative features. Finally, an SVM method was implemented to predict palmitoylation sites in proteins based on the optimal features. The proposed method achieved an accuracy of 99.41% and Matthews Correlation Coefficient of 0.9773 for a benchmark dataset. The result indicates the efficiency and accuracy of our method in prediction of palmitoylation sites based on protein sequences.

  6. "Spacecraft Reveals Recent Geological Activity on the Moon": Exploring the Features of NASA Twitter Posts and Their Potential to Engage Adolescents

    ERIC Educational Resources Information Center

    Lesley, Mellinee

    2014-01-01

    Through a content analysis of 200 "tweets," this study was an exploration into the distinct features of text posted to NASA's "Twitter" site and the potential for these posts to serve as more engaging scientific text than traditional textbooks for adolescents. Results of the content analysis indicated the tweets and linked…

  7. Discriminative and informative features for biomolecular text mining with ensemble feature selection.

    PubMed

    Van Landeghem, Sofie; Abeel, Thomas; Saeys, Yvan; Van de Peer, Yves

    2010-09-15

    In the field of biomolecular text mining, black box behavior of machine learning systems currently limits understanding of the true nature of the predictions. However, feature selection (FS) is capable of identifying the most relevant features in any supervised learning setting, providing insight into the specific properties of the classification algorithm. This allows us to build more accurate classifiers while at the same time bridging the gap between the black box behavior and the end-user who has to interpret the results. We show that our FS methodology successfully discards a large fraction of machine-generated features, improving classification performance of state-of-the-art text mining algorithms. Furthermore, we illustrate how FS can be applied to gain understanding in the predictions of a framework for biomolecular event extraction from text. We include numerous examples of highly discriminative features that model either biological reality or common linguistic constructs. Finally, we discuss a number of insights from our FS analyses that will provide the opportunity to considerably improve upon current text mining tools. The FS algorithms and classifiers are available in Java-ML (http://java-ml.sf.net). The datasets are publicly available from the BioNLP'09 Shared Task web site (http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/).

  8. The LENR-CANR.ORG Website, its Past and Future

    NASA Astrophysics Data System (ADS)

    Rothwell, J.; Storms, E.

    2005-12-01

    The LENR-CANR.org web site has proven to be a popular source of information about cold fusion. This site has distributed more full text papers about LENR than any other source. In addition, it contains many features that allow easy search and insertion of the discovered references into a document.

  9. Ocean Instruments Web Site for Undergraduate, Secondary and Informal Education

    NASA Astrophysics Data System (ADS)

    Farrington, J. W.; Nevala, A.; Dolby, L. A.

    2004-12-01

    An Ocean Instruments web site has been developed that makes available information about ocean sampling and measurement instruments and platforms. The site features text, pictures, diagrams and background information written or edited by experts in ocean science and engineering and contains links to glossaries and multimedia technologies including video streaming, audio packages, and searchable databases. The site was developed after advisory meetings with selected professors teaching undergraduate classes who responded to the question, what could Woods Hole Oceanographic Institution supply to enhance undergraduate education in ocean sciences, life sciences, and geosciences? Prototypes were developed and tested with students, potential users, and potential contributors. The site is hosted by WHOI. The initial five instruments featured were provided by four WHOI scientists and engineers and by one Sea Education Association faculty member. The site is now open to contributions from scientists and engineers worldwide. The site will not advertise or promote the use of individual ocean instruments.

  10. e-Health and new moms: Contextual factors associated with sources of health information.

    PubMed

    Walker, Lorraine O; Mackert, Michael S; Ahn, Jisoo; Vaughan, Misha W; Sterling, Bobbie S; Guy, Sarah; Hendrickson, Sherry

    2017-11-01

    Guided by the Uses and Gratifications approach, to examine mothers' use and preference of e-Health media, and associated contextual factors. Cross-sectional survey of 165 mothers (White, African-American, and Hispanic) from a stratified random sample. Use of online media about mother-baby care; favorite websites about motherhood and best-liked features of Web sites; channel preferences (Web site, postal mail, text) for receiving three types of health information; and contextual factors, e.g., education. Media use ranged from 96% for health information searches about babies to 46% for YouTube viewing about mother-baby topics. Contextual factors, such as education, were associated with media use. Babycenter was the most frequently reported favorite Web site and rich, relevant information was the best-liked feature. Across three health topics (weight, stress/depression, parenting) mothers preferred receiving information by Web site, followed by postal mail and least by text messaging (χ 2 statistics, p < .001). Stress and race/ethnicity were among factors associated with preferences. Mothers widely used e-Health related media, but use was associated with contextual factors. In public health efforts to reach new mothers, partnering with mother-favored Web sites, focusing on audience-relevant media, and adopting attributes of successful sites are recommended strategies. © 2017 Wiley Periodicals, Inc.

  11. Computer Program for Point Location And Calculation of ERror (PLACER)

    USGS Publications Warehouse

    Granato, Gregory E.

    1999-01-01

    A program designed for point location and calculation of error (PLACER) was developed as part of the Quality Assurance Program of the Federal Highway Administration/U.S. Geological Survey (USGS) National Data and Methodology Synthesis (NDAMS) review process. The program provides a standard method to derive study-site locations from site maps in highwayrunoff, urban-runoff, and other research reports. This report provides a guide for using PLACER, documents methods used to estimate study-site locations, documents the NDAMS Study-Site Locator Form, and documents the FORTRAN code used to implement the method. PLACER is a simple program that calculates the latitude and longitude coordinates of one or more study sites plotted on a published map and estimates the uncertainty of these calculated coordinates. PLACER calculates the latitude and longitude of each study site by interpolating between the coordinates of known features and the locations of study sites using any consistent, linear, user-defined coordinate system. This program will read data entered from the computer keyboard and(or) from a formatted text file, and will write the results to the computer screen and to a text file. PLACER is readily transferable to different computers and operating systems with few (if any) modifications because it is written in standard FORTRAN. PLACER can be used to calculate study site locations in latitude and longitude, using known map coordinates or features that are identifiable in geographic information data bases such as USGS Geographic Names Information System, which is available on the World Wide Web.

  12. Measuring interactivity on tobacco control websites.

    PubMed

    Freeman, Becky; Chapman, Simon

    2012-08-01

    With the increased reach of Web 2.0, Internet users expect webpages to be interactive. No studies have been conducted to assess whether tobacco control-relevant sites have implemented these features. The authors conducted an analysis of an international sample of tobacco control-relevant websites to determine their level of interactivity. The sample included 68 unique websites selected from Google searches in 5 countries, on each country's Google site, using the term smoking. The 68 sites were analyzed for 10 categories of interactive tools. The most common type of interactive content found on 46 (68%) of sites was for multimedia featuring content that was not primarily text based, such as photo galleries, videos, or podcasts. Only 11 (16%) websites-outside of media sites-allowed people to interact and engage with the site owners and other users by allowing posting comments on content and/or hosting forums/discussions. Linkages to social networking sites were low: 17 pages (25%) linked to Twitter, 15 (22%) to Facebook, and 11 (16%) to YouTube. Interactivity and connectedness to online social media appears to still be in its infancy among tobacco control-relevant sites.

  13. The Use of Social Tags in Text and Image Searching on the Web

    ERIC Educational Resources Information Center

    Kim, Yong-Mi

    2011-01-01

    In recent years, tags have become a standard feature on a diverse range of sites on the Web, accompanying blog posts, photos, videos, and online news stories. Tags are descriptive terms attached to Internet resources. Despite the rapid adoption of tagging, how people use tags during the search process is not well understood. There is little…

  14. Journal searching in non-MEDLINE resources on Internet Web sites.

    PubMed

    Lingle, V A

    1997-01-01

    Internet access to the medical journal literature is absorbing the attention of all relevant parties, i.e., publishers, journal vendors, librarians, commercial providers, government agencies, and end users. Journal content on the Web sites spans the range from advertising and ordering information for the print version, to table of contents and abstracts, to downloadable full text and graphics of articles. The searching parameters for systems other than MEDLINE also differ extensively with a wide variety of features and resulting retrieval. This discussion reviews a selection of providers of medical information (particularly the journal literature) on the Internet, making a comparison of what is available on Web sites and how it can be searched.

  15. Guiding Students through Expository Text with Text Feature Walks

    ERIC Educational Resources Information Center

    Kelley, Michelle J.; Clausen-Grace, Nicki

    2010-01-01

    The Text Feature Walk is a structure created and employed by the authors that guides students in the reading of text features in order to access prior knowledge, make connections, and set a purpose for reading expository text. Results from a pilot study are described in order to illustrate the benefits of using the Text Feature Walk over…

  16. Creating and testing a deaf-friendly, stop-smoking web site intervention.

    PubMed

    Jones, Elaine G; Goldsmith, Melissa; Effken, Judith; Button, Kevin; Crago, Michael

    2010-01-01

    Deaf adults' access to smoking cessation programs is limited due to cultural, linguistic, and geographic barriers. Web-based stop-smoking interventions have demonstrated cessation rates comparable to other interventions. The Internet is widely used by Deaf adults, but difficulties with online English text remain. We found no published accounts of Internet interventions promoting smoking cessation among Deaf individuals. The purpose of our project was to create and pilot test a prototype interactive Web site that provides users with information in American Sign Language related to smoking cessation. We utilized web cams to create real-time "video chat rooms" for virtual support groups and had an "ask the experts" feature. Deaf community members participated in all phases of development and testing, and a Deaf former smoker served as the moderator for the site. Evaluations were positive, with emphasis on interactive and visual aspects of the site.

  17. Text feature extraction based on deep learning: a review.

    PubMed

    Liang, Hong; Sun, Xiao; Sun, Yunlei; Gao, Yuan

    2017-01-01

    Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquire new effective feature representation from training data. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data, instead of adopting handcrafted features, which mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data. Deep learning can automatically learn feature representation from big data, including millions of parameters. This thesis outlines the common methods used in text feature extraction first, and then expands frequently used deep learning methods in text feature extraction and its applications, and forecasts the application of deep learning in feature extraction.

  18. PredictProtein—an open resource for online prediction of protein structural and functional features

    PubMed Central

    Yachdav, Guy; Kloppmann, Edda; Kajan, Laszlo; Hecht, Maximilian; Goldberg, Tatyana; Hamp, Tobias; Hönigschmid, Peter; Schafferhans, Andrea; Roos, Manfred; Bernhofer, Michael; Richter, Lothar; Ashkenazy, Haim; Punta, Marco; Schlessinger, Avner; Bromberg, Yana; Schneider, Reinhard; Vriend, Gerrit; Sander, Chris; Ben-Tal, Nir; Rost, Burkhard

    2014-01-01

    PredictProtein is a meta-service for sequence analysis that has been predicting structural and functional features of proteins since 1992. Queried with a protein sequence it returns: multiple sequence alignments, predicted aspects of structure (secondary structure, solvent accessibility, transmembrane helices (TMSEG) and strands, coiled-coil regions, disulfide bonds and disordered regions) and function. The service incorporates analysis methods for the identification of functional regions (ConSurf), homology-based inference of Gene Ontology terms (metastudent), comprehensive subcellular localization prediction (LocTree3), protein–protein binding sites (ISIS2), protein–polynucleotide binding sites (SomeNA) and predictions of the effect of point mutations (non-synonymous SNPs) on protein function (SNAP2). Our goal has always been to develop a system optimized to meet the demands of experimentalists not highly experienced in bioinformatics. To this end, the PredictProtein results are presented as both text and a series of intuitive, interactive and visually appealing figures. The web server and sources are available at http://ppopen.rostlab.org. PMID:24799431

  19. Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection.

    PubMed

    Gao, Yu-Fei; Li, Bi-Qing; Cai, Yu-Dong; Feng, Kai-Yan; Li, Zhan-Dong; Jiang, Yang

    2013-01-27

    Identification of catalytic residues plays a key role in understanding how enzymes work. Although numerous computational methods have been developed to predict catalytic residues and active sites, the prediction accuracy remains relatively low with high false positives. In this work, we developed a novel predictor based on the Random Forest algorithm (RF) aided by the maximum relevance minimum redundancy (mRMR) method and incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility to predict active sites of enzymes and achieved an overall accuracy of 0.885687 and MCC of 0.689226 on an independent test dataset. Feature analysis showed that every category of the features except disorder contributed to the identification of active sites. It was also shown via the site-specific feature analysis that the features derived from the active site itself contributed most to the active site determination. Our prediction method may become a useful tool for identifying the active sites and the key features identified by the paper may provide valuable insights into the mechanism of catalysis.

  20. The Use of Technology in Participant Tracking and Study Retention: Lessons Learned From a Clinical Trials Network Study.

    PubMed

    Mitchell, Shannon Gwin; Schwartz, Robert P; Alvanzo, Anika A H; Weisman, Monique S; Kyle, Tiffany L; Turrigiano, Eva M; Gibson, Martha L; Perez, Livangelie; McClure, Erin A; Clingerman, Sara; Froias, Autumn; Shandera, Danielle R; Walker, Robrina; Babcock, Dean L; Bailey, Genie L; Miele, Gloria M; Kunkel, Lynn E; Norton, Michael; Stitzer, Maxine L

    2015-01-01

    The growing use of newer communication and Internet technologies, even among low-income and transient populations, require research staff to update their outreach strategies to ensure high follow-up and participant retention rates. This paper presents the views of research assistants on the use of cell phones and the Internet to track participants in a multisite randomized trial of substance use disorder treatment. Preinterview questionnaires exploring tracking and other study-related activities were collected from 21 research staff across the 10 participating US sites. Data were then used to construct a semistructured interview guide that, in turn, was used to interview 12 of the same staff members. The questionnaires and interview data were entered in Atlas.ti and analyzed for emergent themes related to the use of technology for participant-tracking purposes. Study staff reported that most participants had cell phones, despite having unstable physical addresses and landlines. The incoming call feature of most cell phones was useful for participants and research staff alike, and texting proved to have additional benefits. However, reliance on participants' cell phones also proved problematic. Even homeless participants were found to have access to the Internet through public libraries and could respond to study staff e-mails. Some study sites opened generic social media accounts, through which study staff sent private messages to participants. However, the institutional review board (IRB) approval process for tracking participants using social media at some sites was prohibitively lengthy. Internet searches through Google, national paid databases, obituaries, and judiciary Web sites were also helpful tools. Research staff perceive that cell phones, Internet searches, and social networking sites were effective tools to achieve high follow-up rates in drug abuse research. Studies should incorporate cell phone, texting, and social network Web site information on locator forms; obtain IRB approval for contacting participants using social networking Web sites; and include Web searches, texting, and the use of social media in staff training as standard operating procedures.

  1. Feature extraction for document text using Latent Dirichlet Allocation

    NASA Astrophysics Data System (ADS)

    Prihatini, P. M.; Suryawan, I. K.; Mandia, IN

    2018-01-01

    Feature extraction is one of stages in the information retrieval system that used to extract the unique feature values of a text document. The process of feature extraction can be done by several methods, one of which is Latent Dirichlet Allocation. However, researches related to text feature extraction using Latent Dirichlet Allocation method are rarely found for Indonesian text. Therefore, through this research, a text feature extraction will be implemented for Indonesian text. The research method consists of data acquisition, text pre-processing, initialization, topic sampling and evaluation. The evaluation is done by comparing Precision, Recall and F-Measure value between Latent Dirichlet Allocation and Term Frequency Inverse Document Frequency KMeans which commonly used for feature extraction. The evaluation results show that Precision, Recall and F-Measure value of Latent Dirichlet Allocation method is higher than Term Frequency Inverse Document Frequency KMeans method. This shows that Latent Dirichlet Allocation method is able to extract features and cluster Indonesian text better than Term Frequency Inverse Document Frequency KMeans method.

  2. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    ERIC Educational Resources Information Center

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  3. Sentiment analysis of feature ranking methods for classification accuracy

    NASA Astrophysics Data System (ADS)

    Joseph, Shashank; Mugauri, Calvin; Sumathy, S.

    2017-11-01

    Text pre-processing and feature selection are important and critical steps in text mining. Text pre-processing of large volumes of datasets is a difficult task as unstructured raw data is converted into structured format. Traditional methods of processing and weighing took much time and were less accurate. To overcome this challenge, feature ranking techniques have been devised. A feature set from text preprocessing is fed as input for feature selection. Feature selection helps improve text classification accuracy. Of the three feature selection categories available, the filter category will be the focus. Five feature ranking methods namely: document frequency, standard deviation information gain, CHI-SQUARE, and weighted-log likelihood -ratio is analyzed.

  4. Features of traffic and transit internet sites

    DOT National Transportation Integrated Search

    2000-02-01

    This paper summarizes the current state of internet sites with respect to these features, first : considering whether sites with the features are available in metro areas, then comparing sites : developed by public and private sectors. In order to de...

  5. Prediction of Protein Modification Sites of Pyrrolidone Carboxylic Acid Using mRMR Feature Selection and Analysis

    PubMed Central

    Zheng, Lu-Lu; Niu, Shen; Hao, Pei; Feng, KaiYan; Cai, Yu-Dong; Li, Yixue

    2011-01-01

    Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum of MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations. PMID:22174779

  6. Quantifying site-specific physical heterogeneity within an estuarine seascape

    USGS Publications Warehouse

    Kennedy, Cristina G.; Mather, Martha E.; Smith, Joseph M.

    2017-01-01

    Quantifying physical heterogeneity is essential for meaningful ecological research and effective resource management. Spatial patterns of multiple, co-occurring physical features are rarely quantified across a seascape because of methodological challenges. Here, we identified approaches that measured total site-specific heterogeneity, an often overlooked aspect of estuarine ecosystems. Specifically, we examined 23 metrics that quantified four types of common physical features: (1) river and creek confluences, (2) bathymetric variation including underwater drop-offs, (3) land features such as islands/sandbars, and (4) major underwater channel networks. Our research at 40 sites throughout Plum Island Estuary (PIE) provided solutions to two problems. The first problem was that individual metrics that measured heterogeneity of a single physical feature showed different regional patterns. We solved this first problem by combining multiple metrics for a single feature using a within-physical feature cluster analysis. With this approach, we identified sites with four different types of confluences and three different types of underwater drop-offs. The second problem was that when multiple physical features co-occurred, new patterns of total site-specific heterogeneity were created across the seascape. This pattern of total heterogeneity has potential ecological relevance to structure-oriented predators. To address this second problem, we identified sites with similar types of total physical heterogeneity using an across-physical feature cluster analysis. Then, we calculated an additive heterogeneity index, which integrated all physical features at a site. Finally, we tested if site-specific additive heterogeneity index values differed for across-physical feature clusters. In PIE, the sites with the highest additive heterogeneity index values were clustered together and corresponded to sites where a fish predator, adult striped bass (Morone saxatilis), aggregated in a related acoustic tracking study. In summary, we have shown general approaches to quantifying site-specific heterogeneity.

  7. Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS

    PubMed Central

    Li, Bi-Qing; Feng, Kai-Yan; Chen, Lei; Huang, Tao; Cai, Yu-Dong

    2012-01-01

    Prediction of protein-protein interaction (PPI) sites is one of the most challenging problems in computational biology. Although great progress has been made by employing various machine learning approaches with numerous characteristic features, the problem is still far from being solved. In this study, we developed a novel predictor based on Random Forest (RF) algorithm with the Minimum Redundancy Maximal Relevance (mRMR) method followed by incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility. We also included five 3D structural features to predict protein-protein interaction sites and achieved an overall accuracy of 0.672997 and MCC of 0.347977. Feature analysis showed that 3D structural features such as Depth Index (DPX) and surface curvature (SC) contributed most to the prediction of protein-protein interaction sites. It was also shown via site-specific feature analysis that the features of individual residues from PPI sites contribute most to the determination of protein-protein interaction sites. It is anticipated that our prediction method will become a useful tool for identifying PPI sites, and that the feature analysis described in this paper will provide useful insights into the mechanisms of interaction. PMID:22937126

  8. Aerosol and precipitation chemistry in the southwestern United States: spatiotemporal trends and interrelationships.

    PubMed

    Sorooshian, A; Shingler, T; Harpold, A; Feagles, C W; Meixner, T; Brooks, P D

    2013-08-01

    This study characterizes the spatial and temporal patterns of aerosol and precipitation composition at six sites across the United States Southwest between 1995 and 2010. Precipitation accumulation occurs mostly during the wintertime (December-February) and during the monsoon season (July-September). Rain and snow pH levels are usually between 5-6, with crustal-derived species playing a major role in acid neutralization. These species (Ca 2+ , Mg 2+ , K + , Na + ) exhibit their highest concentrations between March and June in both PM 2.5 and precipitation due mostly to dust. Crustal-derived species concentrations in precipitation exhibit positive relationships with [Formula: see text], [Formula: see text], and Cl - , suggesting that acidic gases likely react with and partition to either crustal particles or hydrometeors enriched with crustal constituents. Concentrations of particulate [Formula: see text] show a statistically significant correlation with rain [Formula: see text] unlike snow [Formula: see text], which may be related to some combination of the vertical distribution of [Formula: see text] (and precursors) and the varying degree to which [Formula: see text]-enriched particles act as cloud condensation nuclei versus ice nuclei in the region. The coarse : fine aerosol mass ratio was correlated with crustal species concentrations in snow unlike rain, suggestive of a preferential role of coarse particles (mainly dust) as ice nuclei in the region. Precipitation [Formula: see text] : [Formula: see text] ratios exhibit the following features with potential explanations discussed: (i) they are higher in precipitation as compared to PM 2.5 ; (ii) they exhibit the opposite annual cycle compared to particulate [Formula: see text] : [Formula: see text] ratios; and (iii) they are higher in snow relative to rain during the wintertime. Long-term trend analysis for the monsoon season shows that the [Formula: see text] : [Formula: see text] ratio in rain increased at the majority of sites due mostly to air pollution regulations of [Formula: see text] precursors.

  9. Extracting Product Features and Opinion Words Using Pattern Knowledge in Customer Reviews

    PubMed Central

    Lynn, Khin Thidar

    2013-01-01

    Due to the development of e-commerce and web technology, most of online Merchant sites are able to write comments about purchasing products for customer. Customer reviews expressed opinion about products or services which are collectively referred to as customer feedback data. Opinion extraction about products from customer reviews is becoming an interesting area of research and it is motivated to develop an automatic opinion mining application for users. Therefore, efficient method and techniques are needed to extract opinions from reviews. In this paper, we proposed a novel idea to find opinion words or phrases for each feature from customer reviews in an efficient way. Our focus in this paper is to get the patterns of opinion words/phrases about the feature of product from the review text through adjective, adverb, verb, and noun. The extracted features and opinions are useful for generating a meaningful summary that can provide significant informative resource to help the user as well as merchants to track the most suitable choice of product. PMID:24459430

  10. Extracting product features and opinion words using pattern knowledge in customer reviews.

    PubMed

    Htay, Su Su; Lynn, Khin Thidar

    2013-01-01

    Due to the development of e-commerce and web technology, most of online Merchant sites are able to write comments about purchasing products for customer. Customer reviews expressed opinion about products or services which are collectively referred to as customer feedback data. Opinion extraction about products from customer reviews is becoming an interesting area of research and it is motivated to develop an automatic opinion mining application for users. Therefore, efficient method and techniques are needed to extract opinions from reviews. In this paper, we proposed a novel idea to find opinion words or phrases for each feature from customer reviews in an efficient way. Our focus in this paper is to get the patterns of opinion words/phrases about the feature of product from the review text through adjective, adverb, verb, and noun. The extracted features and opinions are useful for generating a meaningful summary that can provide significant informative resource to help the user as well as merchants to track the most suitable choice of product.

  11. Drawing on Text Features for Reading Comprehension and Composing

    ERIC Educational Resources Information Center

    Risko, Victoria J.; Walker-Dalhouse, Doris

    2011-01-01

    Students read multiple-genre texts such as graphic novels, poetry, brochures, digitized texts with videos, and informational and narrative texts. Features such as overlapping illustrations and implied cause-and-effect relationships can affect students' comprehension. Teaching with these texts and drawing attention to organizational features hold…

  12. Prediction of lysine ubiquitylation with ensemble classifier and feature selection.

    PubMed

    Zhao, Xiaowei; Li, Xiangtao; Ma, Zhiqiang; Yin, Minghao

    2011-01-01

    Ubiquitylation is an important process of post-translational modification. Correct identification of protein lysine ubiquitylation sites is of fundamental importance to understand the molecular mechanism of lysine ubiquitylation in biological systems. This paper develops a novel computational method to effectively identify the lysine ubiquitylation sites based on the ensemble approach. In the proposed method, 468 ubiquitylation sites from 323 proteins retrieved from the Swiss-Prot database were encoded into feature vectors by using four kinds of protein sequences information. An effective feature selection method was then applied to extract informative feature subsets. After different feature subsets were obtained by setting different starting points in the search procedure, they were used to train multiple random forests classifiers and then aggregated into a consensus classifier by majority voting. Evaluated by jackknife tests and independent tests respectively, the accuracy of the proposed predictor reached 76.82% for the training dataset and 79.16% for the test dataset, indicating that this predictor is a useful tool to predict lysine ubiquitylation sites. Furthermore, site-specific feature analysis was performed and it was shown that ubiquitylation is intimately correlated with the features of its surrounding sites in addition to features derived from the lysine site itself. The feature selection method is available upon request.

  13. Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method.

    PubMed

    Gao, JianZhao; Tao, Xue-Wen; Zhao, Jia; Feng, Yuan-Ming; Cai, Yu-Dong; Zhang, Ning

    2017-01-01

    Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites. We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  14. Cultural Resources Reconnaissance Along the Cheyenne River Arm of Lake Oahe in Dewey, Haakon, Stanley, and Ziebach Counties, South Dakota. Volume 1. Main Report

    DTIC Science & Technology

    1988-03-01

    of site 39ST282................................227 39 Plan of site 39ST283................................230 40 Detailed plans of Features 1 and 2...268 53 Plan of site 39DW64 .............................. 272 54 Plan of Feature 1, site 39DW64 ................... 273 55 Plan of site 39DW65...facing E ........................... 228 46 Site 39ST283, facing NE .......................... 232 47 Detail of Feature 1, site 39ST283, facing NW

  15. Geodatabase of sites, basin boundaries, and topology rules used to store drainage basin boundaries for the U.S. Geological Survey, Colorado Water Science Center

    USGS Publications Warehouse

    Dupree, Jean A.; Crowfoot, Richard M.

    2012-01-01

    This geodatabase and its component datasets are part of U.S. Geological Survey Digital Data Series 650 and were generated to store basin boundaries for U.S. Geological Survey streamgages and other sites in Colorado. The geodatabase and its components were created by the U.S. Geological Survey, Colorado Water Science Center, and are used to derive the numeric drainage areas for Colorado that are input into the U.S. Geological Survey's National Water Information System (NWIS) database and also published in the Annual Water Data Report and on NWISWeb. The foundational dataset used to create the basin boundaries in this geodatabase was the National Watershed Boundary Dataset. This geodatabase accompanies a U.S. Geological Survey Techniques and Methods report (Book 11, Section C, Chapter 6) entitled "Digital Database Architecture and Delineation Methodology for Deriving Drainage Basins, and Comparison of Digitally and Non-Digitally Derived Numeric Drainage Areas." The Techniques and Methods report details the geodatabase architecture, describes the delineation methodology and workflows used to develop these basin boundaries, and compares digitally derived numeric drainage areas in this geodatabase to non-digitally derived areas. 1. COBasins.gdb: This geodatabase contains site locations and basin boundaries for Colorado. It includes a single feature dataset, called BasinsFD, which groups the component feature classes and topology rules. 2. BasinsFD: This feature dataset in the "COBasins.gdb" geodatabase is a digital container that holds the feature classes used to archive site locations and basin boundaries as well as the topology rules that govern spatial relations within and among component feature classes. This feature dataset includes three feature classes: the sites for which basins have been delineated (the "Sites" feature class), basin bounding lines (the "BasinLines" feature class), and polygonal basin areas (the "BasinPolys" feature class). The feature dataset also stores the topology rules (the "BasinsFD_Topology") that constrain the relations within and among component feature classes. The feature dataset also forces any feature classes inside it to have a consistent projection system, which is, in this case, an Albers-Equal-Area projection system. 3. BasinsFD_Topology: This topology contains four persistent topology rules that constrain the spatial relations within the "BasinLines" feature class and between the "BasinLines" feature class and the "BasinPolys" feature classes. 4. Sites: This point feature class contains the digital representations of the site locations for which Colorado Water Science Center basin boundaries have been delineated. This feature class includes point locations for Colorado Water Science Center active (as of September 30, 2009) gages and for other sites. 5. BasinLines: This line feature class contains the perimeters of basins delineated for features in the "Sites" feature class, and it also contains information regarding the sources of lines used for the basin boundaries. 6. BasinPolys: This polygon feature class contains the polygonal basin areas delineated for features in the "Sites" feature class, and it is used to derive the numeric drainage areas published by the Colorado Water Science Center.

  16. GIDEP Batching Tool

    NASA Technical Reports Server (NTRS)

    Fong, Danny; Odell,Dorice; Barry, Peter; Abrahamian, Tomik

    2008-01-01

    This software provides internal, automated search mechanics of GIDEP (Government- Industry Data Exchange Program) Alert data imported from the GIDEP government Web site. The batching tool allows the import of a single parts list in tab-delimited text format into the local JPL GIDEP database. Delimiters from every part number are removed. The original part numbers with delimiters are compared, as well as the newly generated list without the delimiters. The two lists run against the GIDEP imports, and output any matches. This feature only works with Netscape 2.0 or greater, or Internet Explorer 4.0 or greater. The user selects the browser button to choose a text file to import. When the submit button is pressed, this script will import alerts from the text file into the local JPL GIDEP database. This batch tool provides complete in-house control over exported material and data for automated batch match abilities. The batching tool has the ability to match capabilities of the parts list to tables, and yields results that aid further research and analysis. This provides more control over GIDEP information for metrics and reports information not provided by the government site. This software yields results quickly and gives more control over external data from the government site in order to generate other reports not available from the external source. There is enough space to store years of data. The program relates to risk identification and management with regard to projects and GIDEP alert information encompassing flight parts for space exploration.

  17. Multiple receptor conformation docking, dock pose clustering and 3D QSAR studies on human poly(ADP-ribose) polymerase-1 (PARP-1) inhibitors.

    PubMed

    Fatima, Sabiha; Jatavath, Mohan Babu; Bathini, Raju; Sivan, Sree Kanth; Manga, Vijjulatha

    2014-10-01

    Poly(ADP-ribose) polymerase-1 (PARP-1) functions as a DNA damage sensor and signaling molecule. It plays a vital role in the repair of DNA strand breaks induced by radiation and chemotherapeutic drugs; inhibitors of this enzyme have the potential to improve cancer chemotherapy or radiotherapy. Three-dimensional quantitative structure activity relationship (3D QSAR) models were developed using comparative molecular field analysis, comparative molecular similarity indices analysis and docking studies. A set of 88 molecules were docked into the active site of six X-ray crystal structures of poly(ADP-ribose)polymerase-1 (PARP-1), by a procedure called multiple receptor conformation docking (MRCD), in order to improve the 3D QSAR models through the analysis of binding conformations. The docked poses were clustered to obtain the best receptor binding conformation. These dock poses from clustering were used for 3D QSAR analysis. Based on MRCD and QSAR information, some key features have been identified that explain the observed variance in the activity. Two receptor-based QSAR models were generated; these models showed good internal and external statistical reliability that is evident from the [Formula: see text], [Formula: see text] and [Formula: see text]. The identified key features enabled us to design new PARP-1 inhibitors.

  18. Acne severity grading: determining essential clinical components and features using a Delphi consensus.

    PubMed

    Tan, Jerry; Wolfe, Barat; Weiss, Jonathan; Stein-Gold, Linda; Bikowski, Joseph; Del Rosso, James; Webster, Guy F; Lucky, Anne; Thiboutot, Diane; Wilkin, Jonathan; Leyden, James; Chren, Mary-Margaret

    2012-08-01

    There are multiple global scales for acne severity grading but no singular standard. Our objective was to determine the essential clinical components (content items) and features (property-related items) for an acne global grading scale for use in research and clinical practice using an iterative method, the Delphi process. Ten acne experts were invited to participate in a Web-based Delphi survey comprising 3 iterative rounds of questions. In round 1, the experts identified the following clinical components (primary acne lesions, number of lesions, extent, regional involvement, secondary lesions, and patient experiences) and features (clinimetric properties, ease of use, categorization of severity based on photographs or text, and acceptance by all stakeholders). In round 2, consensus for inclusion in the scale was established for primary lesions, number, sites, and extent; as well as clinimetric properties and ease of use. In round 3, consensus for inclusion was further established for categorization and acceptance. Patient experiences were excluded and no consensus was achieved for secondary lesions. The Delphi panel consisted solely of the United States (U.S.)-based acne experts. Using an established method for achieving consensus, experts in acne vulgaris concluded that an ideal acne global grading scale would comprise the essential clinical components of primary acne lesions, their quantity, extent, and facial and extrafacial sites of involvement; with features of clinimetric properties, categorization, efficiency, and acceptance. Copyright © 2011 American Academy of Dermatology, Inc. Published by Mosby, Inc. All rights reserved.

  19. Instant Gratification: Striking a Balance Between Rich Interactive Visualization and Ease of Use for Casual Web Surfers

    NASA Astrophysics Data System (ADS)

    Russell, R. M.; Johnson, R. M.; Gardiner, E. S.; Bergman, J. J.; Genyuk, J.; Henderson, S.

    2004-12-01

    Interactive visualizations can be powerful tools for helping students, teachers, and the general public comprehend significant features in rich datasets and complex systems. Successful use of such visualizations requires viewers to have, or to acquire, adequate expertise in use of the relevant visualization tools. In many cases, the learning curve associated with competent use of such tools is too steep for casual users, such as members of the lay public browsing science outreach web sites or K-12 students and teachers trying to integrate such tools into their learning about geosciences. "Windows to the Universe" (http://www.windows.ucar.edu) is a large (roughly 6,000 web pages), well-established (first posted online in 1995), and popular (over 5 million visitor sessions and 40 million pages viewed per year) science education web site that covers a very broad range of Earth science and space science topics. The primary audience of the site consists of K-12 students and teachers and the general public. We have developed several interactive visualizations for use on the site in conjunction with text and still image reference materials. One major emphasis in the design of these interactives has been to ensure that casual users can quickly learn how to use the interactive features without becoming frustrated and departing before they were able to appreciate the visualizations displayed. We will demonstrate several of these "user-friendly" interactive visualizations and comment on the design philosophy we have employed in developing them.

  20. Prediction of lysine glutarylation sites by maximum relevance minimum redundancy feature selection.

    PubMed

    Ju, Zhe; He, Jian-Jun

    2018-06-01

    Lysine glutarylation is new type of protein acylation modification in both prokaryotes and eukaryotes. To better understand the molecular mechanism of glutarylation, it is important to identify glutarylated substrates and their corresponding glutarylation sites accurately. In this study, a novel bioinformatics tool named GlutPred is developed to predict glutarylation sites by using multiple feature extraction and maximum relevance minimum redundancy feature selection. On the one hand, amino acid factors, binary encoding, and the composition of k-spaced amino acid pairs features are incorporated to encode glutarylation sites. And the maximum relevance minimum redundancy method and the incremental feature selection algorithm are adopted to remove the redundant features. On the other hand, a biased support vector machine algorithm is used to handle the imbalanced problem in glutarylation sites training dataset. As illustrated by 10-fold cross-validation, the performance of GlutPred achieves a satisfactory performance with a Sensitivity of 64.80%, a Specificity of 76.60%, an Accuracy of 74.90% and a Matthew's correlation coefficient of 0.3194. Feature analysis shows that some k-spaced amino acid pair features play the most important roles in the prediction of glutarylation sites. The conclusions derived from this study might provide some clues for understanding the molecular mechanisms of glutarylation. Copyright © 2018 Elsevier Inc. All rights reserved.

  1. The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts: A User Study

    PubMed Central

    Mukherjee, Partha; Leroy, Gondy; Kauchak, David; Navarrete, Brianda Armenta; Diaz, Damian Y.; Colina, Sonia

    2017-01-01

    Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compare with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, less negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied. PMID:29854201

  2. Site Features

    EPA Pesticide Factsheets

    This dataset consists of various site features from multiple Superfund sites in U.S. EPA Region 8. These data were acquired from multiple sources at different times and were combined into one region-wide layer.

  3. Relevance popularity: A term event model based feature selection scheme for text classification.

    PubMed

    Feng, Guozhong; An, Baiguo; Yang, Fengqin; Wang, Han; Zhang, Libiao

    2017-01-01

    Feature selection is a practical approach for improving the performance of text classification methods by optimizing the feature subsets input to classifiers. In traditional feature selection methods such as information gain and chi-square, the number of documents that contain a particular term (i.e. the document frequency) is often used. However, the frequency of a given term appearing in each document has not been fully investigated, even though it is a promising feature to produce accurate classifications. In this paper, we propose a new feature selection scheme based on a term event Multinomial naive Bayes probabilistic model. According to the model assumptions, the matching score function, which is based on the prediction probability ratio, can be factorized. Finally, we derive a feature selection measurement for each term after replacing inner parameters by their estimators. On a benchmark English text datasets (20 Newsgroups) and a Chinese text dataset (MPH-20), our numerical experiment results obtained from using two widely used text classifiers (naive Bayes and support vector machine) demonstrate that our method outperformed the representative feature selection methods.

  4. Improved coordinates of features in the vicinity of the Viking lander site on Mars

    NASA Technical Reports Server (NTRS)

    Davies, M. E.; Dole, S. H.

    1980-01-01

    The measurement of longitude of the Viking 1 landing site and the accuracy of the coordinates of features in the area around the landing site are discussed. The longitude must be measured photogrammatically from the small crater, Airy 0, which defines the 0 deg meridian on Mars. The computer program, GIANT, which was used to perform the analytical triangulations, and the photogrammetric computation of the longitude of the Viking 1 lander site are described. Improved coordinates of features in the vicinity of the Viking 1 lander site are presented.

  5. Preferred features of urban parks and forests

    Treesearch

    Herbert W. Schroeder

    1982-01-01

    To make the most efficient use of scarce recreation resources, urban forest managers need to know what features of recreation sites are the most important for creating high-quality recreation environments. In this study, observers viewed photographs of urban forest sites in the Chicago area and described the features of the sites that they liked and disliked. Natural...

  6. Linguistic Features of Middle School Environmental Education Texts.

    ERIC Educational Resources Information Center

    Chenhansa, Suporn; Schleppegrell, Mary

    1998-01-01

    The language used in environmental education texts has linguistic features that affect students' comprehension of concepts and their ability to envision solutions to environmental problems. Findings indicate that features of texts such as abstract nouns and lack of explicit agents impede students' full comprehension of complex issues and obscure…

  7. Text Mining Improves Prediction of Protein Functional Sites

    PubMed Central

    Cohn, Judith D.; Ravikumar, Komandur E.

    2012-01-01

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions. PMID:22393388

  8. Study on Hybrid Image Search Technology Based on Texts and Contents

    NASA Astrophysics Data System (ADS)

    Wang, H. T.; Ma, F. L.; Yan, C.; Pan, H.

    2018-05-01

    Image search was studied first here based on texts and contents, respectively. The text-based image feature extraction was put forward by integrating the statistical and topic features in view of the limitation of extraction of keywords only by means of statistical features of words. On the other hand, a search-by-image method was put forward based on multi-feature fusion in view of the imprecision of the content-based image search by means of a single feature. The layered-searching method depended on primarily the text-based image search method and additionally the content-based image search was then put forward in view of differences between the text-based and content-based methods and their difficult direct fusion. The feasibility and effectiveness of the hybrid search algorithm were experimentally verified.

  9. Empirical evaluation of cross-site reproducibility in radiomic features for characterizing prostate MRI

    NASA Astrophysics Data System (ADS)

    Chirra, Prathyush; Leo, Patrick; Yim, Michael; Bloch, B. Nicolas; Rastinehad, Ardeshir R.; Purysko, Andrei; Rosen, Mark; Madabhushi, Anant; Viswanath, Satish

    2018-02-01

    The recent advent of radiomics has enabled the development of prognostic and predictive tools which use routine imaging, but a key question that still remains is how reproducible these features may be across multiple sites and scanners. This is especially relevant in the context of MRI data, where signal intensity values lack tissue specific, quantitative meaning, as well as being dependent on acquisition parameters (magnetic field strength, image resolution, type of receiver coil). In this paper we present the first empirical study of the reproducibility of 5 different radiomic feature families in a multi-site setting; specifically, for characterizing prostate MRI appearance. Our cohort comprised 147 patient T2w MRI datasets from 4 different sites, all of which were first pre-processed to correct acquisition-related for artifacts such as bias field, differing voxel resolutions, as well as intensity drift (non-standardness). 406 3D voxel wise radiomic features were extracted and evaluated in a cross-site setting to determine how reproducible they were within a relatively homogeneous non-tumor tissue region; using 2 different measures of reproducibility: Multivariate Coefficient of Variation and Instability Score. Our results demonstrated that Haralick features were most reproducible between all 4 sites. By comparison, Laws features were among the least reproducible between sites, as well as performing highly variably across their entire parameter space. Similarly, the Gabor feature family demonstrated good cross-site reproducibility, but for certain parameter combinations alone. These trends indicate that despite extensive pre-processing, only a subset of radiomic features and associated parameters may be reproducible enough for use within radiomics-based machine learning classifier schemes.

  10. Diversity, Abundance, and Niche Differentiation of Ammonia-Oxidizing Prokaryotes in Mud Deposits of the Eastern China Marginal Seas.

    PubMed

    Yu, Shaolan; Yao, Peng; Liu, Jiwen; Zhao, Bin; Zhang, Guiling; Zhao, Meixun; Yu, Zhigang; Zhang, Xiao-Hua

    2016-01-01

    The eastern China marginal seas (ECMS) are prominent examples of river-dominated ocean margins, whose most characteristic feature is the existence of isolated mud patches on sandy sediments. Ammonia-oxidizing prokaryotes play a crucial role in the nitrogen cycles of many marine environments, including marginal seas. However, few studies have attempted to address the distribution patterns of ammonia-oxidizing prokaryotes in mud deposits of these seas. The horizontal and vertical community composition and abundance of ammonia-oxidizing archaea (AOA) and bacteria (AOB) were investigated in mud deposits of the South Yellow Sea (SYS) and the East China Sea (ECS) by using amoA clone libraries and quantitative PCR. The diversity of AOB was comparable or higher in the mud zone of SYS and lower in ECS when compared with AOA. Vertically, surface sediments had generally higher diversity of AOA and AOB than middle and bottom layers. Diversity of AOA and AOB showed significant correlation with latitude. Nitrosopumilus and Nitrosospira lineages dominated AOA and AOB communities, respectively. Both AOA and AOB assemblages exhibited greater variations across different sites than those among various depths at one site. The abundance of bacterial amoA was generally higher than that of archaeal amoA, and both of them decreased with depth. Niche differentiation, which was affected by dissolved oxygen, salinity, ammonia, and silicate (SiO[Formula: see text]), was observed between AOA and AOB and among different groups of them. The spatial distribution of AOA and AOB was significantly correlated with δ(15)NTN and SiO[Formula: see text], and nitrate and δ(13)C, respectively. Both archaeal and bacterial amoA abundance correlated strongly with SiO[Formula: see text]. This study improves our understanding of spatial distribution of AOA and AOB in ecosystems featuring oceanic mud deposits.

  11. Prediction of cause of death from forensic autopsy reports using text classification techniques: A comparative study.

    PubMed

    Mujtaba, Ghulam; Shuib, Liyana; Raj, Ram Gopal; Rajandram, Retnagowri; Shaikh, Khairunisa

    2018-07-01

    Automatic text classification techniques are useful for classifying plaintext medical documents. This study aims to automatically predict the cause of death from free text forensic autopsy reports by comparing various schemes for feature extraction, term weighing or feature value representation, text classification, and feature reduction. For experiments, the autopsy reports belonging to eight different causes of death were collected, preprocessed and converted into 43 master feature vectors using various schemes for feature extraction, representation, and reduction. The six different text classification techniques were applied on these 43 master feature vectors to construct a classification model that can predict the cause of death. Finally, classification model performance was evaluated using four performance measures i.e. overall accuracy, macro precision, macro-F-measure, and macro recall. From experiments, it was found that that unigram features obtained the highest performance compared to bigram, trigram, and hybrid-gram features. Furthermore, in feature representation schemes, term frequency, and term frequency with inverse document frequency obtained similar and better results when compared with binary frequency, and normalized term frequency with inverse document frequency. Furthermore, the chi-square feature reduction approach outperformed Pearson correlation, and information gain approaches. Finally, in text classification algorithms, support vector machine classifier outperforms random forest, Naive Bayes, k-nearest neighbor, decision tree, and ensemble-voted classifier. Our results and comparisons hold practical importance and serve as references for future works. Moreover, the comparison outputs will act as state-of-art techniques to compare future proposals with existing automated text classification techniques. Copyright © 2017 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.

  12. Full-Text Searching on Major Supermarket Systems: Dialog, Data-Star, and Nexis.

    ERIC Educational Resources Information Center

    Tenopir, Carol; Berglund, Sharon

    1993-01-01

    Examines the similarities, differences, and full-text features of the three most-used online systems for full-text searching in general libraries: DIALOG, Data-Star, and NEXIS. Overlapping databases, unique sources, search features, proximity operators, set building, language enhancement and word equivalencies, and display features are discussed.…

  13. Past Project Expo Sites

    EPA Pesticide Factsheets

    This page provides information for Project Expo sites that were featured at the LMOP Conferences in 2013 and 2014. Project Expo sites were featured as being interested in identifying project partners for the development of an LFG energy project.

  14. Synoptic reporting in tumor pathology: advantages of a web-based system.

    PubMed

    Qu, Zhenhong; Ninan, Shibu; Almosa, Ahmed; Chang, K G; Kuruvilla, Supriya; Nguyen, Nghia

    2007-06-01

    The American College of Surgeons Commission on Cancer (ACS-CoC) mandates that pathology reports at ACS-CoC-approved cancer programs include all scientifically validated data elements for each site and tumor specimen. The College of American Pathologists (CAP) has produced cancer checklists in static text formats to assist reporting. To be inclusive, the CAP checklists are pages long, requiring extensive text editing and multiple intermediate steps. We created a set of dynamic tumor-reporting templates, using Microsoft Active Server Page (ASP.NET), with drop-down list and data-compile features, and added a reminder function to indicate missing information. Users can access this system on the Internet, prepare the tumor report by selecting relevant data from drop-down lists with an embedded tumor staging scheme, and directly transfer the final report into a laboratory information system by using the copy-and-paste function. By minimizing extensive text editing and eliminating intermediate steps, this system can reduce reporting errors, improve work efficiency, and increase compliance.

  15. A cross-site comparison of methods used for hydrogeologic characterization of the Galena-Platteville aquifer in Illinois and Wisconsin, with examples from selected Superfund sites

    USGS Publications Warehouse

    Kay, Robert T.; Mills, Patrick C.; Dunning, Charles P.; Yeskis, Douglas J.; Ursic, James R.; Vendl, Mark

    2004-01-01

    The effectiveness of 28 methods used to characterize the fractured Galena-Platteville aquifer at eight sites in northern Illinois and Wisconsin is evaluated. Analysis of government databases, previous investigations, topographic maps, aerial photographs, and outcrops was essential to understanding the hydrogeology in the area to be investigated. The effectiveness of surface-geophysical methods depended on site geology. Lithologic logging provided essential information for site characterization. Cores were used for stratigraphy and geotechnical analysis. Natural-gamma logging helped identify the effect of lithology on the location of secondary- permeability features. Caliper logging identified large secondary-permeability features. Neutron logs identified trends in matrix porosity. Acoustic-televiewer logs identified numerous secondary-permeability features and their orientation. Borehole-camera logs also identified a number of secondary-permeability features. Borehole ground-penetrating radar identified lithologic and secondary-permeability features. However, the accuracy and completeness of this method is uncertain. Single-point-resistance, density, and normal resistivity logs were of limited use. Water-level and water-quality data identified flow directions and indicated the horizontal and vertical distribution of aquifer permeability and the depth of the permeable features. Temperature, spontaneous potential, and fluid-resistivity logging identified few secondary-permeability features at some sites and several features at others. Flowmeter logging was the most effective geophysical method for characterizing secondary-permeability features. Aquifer tests provided insight into the permeability distribution, identified hydraulically interconnected features, the presence of heterogeneity and anisotropy, and determined effective porosity. Aquifer heterogeneity prevented calculation of accurate hydraulic properties from some tests. Different methods, such as flowmeter logging and slug testing, occasionally produced different interpretations. Aquifer characterization improved with an increase in the number of data points, the period of data collection, and the number of methods used.

  16. PrAS: Prediction of amidation sites using multiple feature extraction.

    PubMed

    Wang, Tong; Zheng, Wei; Wuyun, Qiqige; Wu, Zhenfeng; Ruan, Jishou; Hu, Gang; Gao, Jianzhao

    2017-02-01

    Amidation plays an important role in a variety of pathological processes and serious diseases like neural dysfunction and hypertension. However, identification of protein amidation sites through traditional experimental methods is time consuming and expensive. In this paper, we proposed a novel predictor for Prediction of Amidation Sites (PrAS), which is the first software package for academic users. The method incorporated four representative feature types, which are position-based features, physicochemical and biochemical properties features, predicted structure-based features and evolutionary information features. A novel feature selection method, positive contribution feature selection was proposed to optimize features. PrAS achieved AUC of 0.96, accuracy of 92.1%, sensitivity of 81.2%, specificity of 94.9% and MCC of 0.76 on the independent test set. PrAS is freely available at https://sourceforge.net/p/praspkg. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Predicting protein amidation sites by orchestrating amino acid sequence features

    NASA Astrophysics Data System (ADS)

    Zhao, Shuqiu; Yu, Hua; Gong, Xiujun

    2017-08-01

    Amidation is the fourth major category of post-translational modifications, which plays an important role in physiological and pathological processes. Identifying amidation sites can help us understanding the amidation and recognizing the original reason of many kinds of diseases. But the traditional experimental methods for predicting amidation sites are often time-consuming and expensive. In this study, we propose a computational method for predicting amidation sites by orchestrating amino acid sequence features. Three kinds of feature extraction methods are used to build a feature vector enabling to capture not only the physicochemical properties but also position related information of the amino acids. An extremely randomized trees algorithm is applied to choose the optimal features to remove redundancy and dependence among components of the feature vector by a supervised fashion. Finally the support vector machine classifier is used to label the amidation sites. When tested on an independent data set, it shows that the proposed method performs better than all the previous ones with the prediction accuracy of 0.962 at the Matthew's correlation coefficient of 0.89 and area under curve of 0.964.

  18. A Machine Learning Approach to Measurement of Text Readability for EFL Learners Using Various Linguistic Features

    ERIC Educational Resources Information Center

    Kotani, Katsunori; Yoshimi, Takehiko; Isahara, Hitoshi

    2011-01-01

    The present paper introduces and evaluates a readability measurement method designed for learners of EFL (English as a foreign language). The proposed readability measurement method (a regression model) estimates the text readability based on linguistic features, such as lexical, syntactic and discourse features. Text readability refers to the…

  19. Innovations: clinical computing: an audio computer-assisted self-interviewing system for research and screening in public mental health settings.

    PubMed

    Bertollo, David N; Alexander, Mary Jane; Shinn, Marybeth; Aybar, Jalila B

    2007-06-01

    This column describes the nonproprietary software Talker, used to adapt screening instruments to audio computer-assisted self-interviewing (ACASI) systems for low-literacy populations and other populations. Talker supports ease of programming, multiple languages, on-site scoring, and the ability to update a central research database. Key features include highly readable text display, audio presentation of questions and audio prompting of answers, and optional touch screen input. The scripting language for adapting instruments is briefly described as well as two studies in which respondents provided positive feedback on its use.

  20. Dynamics of a quasiparticle in the α-T3 model: role of pseudospin polarization and transverse magnetic field on zitterbewegung.

    PubMed

    Biswas, Tutul; Kanti Ghosh, Tarun

    2018-01-22

    We consider the α-T 3 model which provides a smooth crossover between the honeycomb lattice with pseudospin 1/2 and the dice lattice with pseudospin 1 through the variation of a parameter α. We study the dynamics of a wave packet representing a quasiparticle in the α-T 3 model with zero and finite transverse magnetic field. For zero field, it is shown that the wave packet undergoes a transient zitterbewegung (ZB). Various features of ZB depending on the initial pseudospin polarization of the wave packet have been revealed. For an intermediate value of the parameter α i.e. for [Formula: see text] the resulting ZB consists of two distinct frequencies when the wave packet was located initially in rim site. However, the wave packet exhibits single frequency ZB for [Formula: see text] and [Formula: see text]. It is also unveiled that the frequency of ZB corresponding to [Formula: see text] gets exactly half of that corresponding to the [Formula: see text] case. On the other hand, when the initial wave packet was in hub site, the ZB consists of only one frequency for all values of α. Using stationary phase approximation, we find analytical expression of velocity average which can be used to extract the associated timescale over which the transient nature of ZB persists. On the contrary, the wave packet undergoes permanent ZB in presence of a transverse magnetic field. Due to the presence of a large number of Landau energy levels, the oscillations in ZB appear to be much more complicated. The oscillation pattern depends significantly on the initial pseudospin polarization of the wave packet. Furthermore, it is revealed that the number of the frequency components involved in ZB depends on the parameter α.

  1. The Texts of Literacy Instruction: Obstacles to or Opportunities for Educational Equity?

    ERIC Educational Resources Information Center

    Hiebert, Elfrieda H.

    2017-01-01

    Texts are a central part of reading. Yet our understandings of appropriate text features and distributions of text diets at different points in students' reading development are limited. The thesis of the essay is that, if the trajectory of struggling readers is to change, attention is needed to the features of texts and students' text diets,…

  2. Estimating Douglas-fir site quality from aerial photographs.

    Treesearch

    Grover A. Choate

    1961-01-01

    This study investigated the feasibility of developing a technique for estimating site index of Douglas-fir in the Pacific Northwest, using aerial photos and topographic maps. Physiographic features were used as indicators of site index. Analysis showed that although most of the features were highly significant as criteria for predicting site index, they explained less...

  3. Reduction potentials of protein disulfides and catalysis of glutathionylation and deglutathionylation by glutaredoxin enzymes.

    PubMed

    Ukuwela, Ashwinie A; Bush, Ashley I; Wedd, Anthony G; Xiao, Zhiguang

    2017-11-09

    Glutaredoxins (Grxs) are a class of GSH (glutathione)-dependent thiol-disulfide oxidoreductase enzymes. They use the cellular redox buffer GSSG (glutathione disulfide)/GSH directly to catalyze these exchange reactions. Grxs feature dithiol active sites and can shuttle rapidly between three oxidation states, namely dithiol Grx(SH) 2 , mixed disulfide Grx(SH)(SSG) and oxidized disulfide Grx(SS). Each is characterized by a distinct standard reduction potential [Formula: see text] The [Formula: see text] values for the redox couple Grx(SS)/Grx(SH) 2 are available, but a recent estimate differs by over 100 mV from the literature values. No estimates are available for [Formula: see text] for the mixed disulfide couple Grx(SH)(SSG)/(Grx(SH) 2  + GSH). This work determined both [Formula: see text] and [Formula: see text] for two representative Grx enzymes, Homo sapiens HsGrx1 and Escherichia coli EcGrx1. The empirical approaches were verified rigorously to overcome the sensitivity of these redox-labile enzymes to experimental conditions. The classic method of acid 'quenching' was demonstrated to shift the thiol-disulfide redox equilibria. Both enzymes exhibit an [Formula: see text] (vs. SHE) at a pH of 7.0. Their [Formula: see text] values (-213 and -230 mV for EcGrx1 and HsGrx1, respectively) are slightly less negative than that ([Formula: see text]) of the redox buffer GSSG/2GSH. Both [Formula: see text] and [Formula: see text] vary with log [GSH], but the former more sensitively by a factor of 2. This confers dual catalytic functions to a Grx enzyme as either an oxidase at low [GSH] or as a reductase at high [GSH]. Consequently, these enzymes can participate efficiently in either glutathionylation or deglutathionylation. The catalysis is demonstrated to proceed via a monothiol ping-pong mechanism relying on a single Cys residue only in the dithiol active site. © 2017 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.

  4. Technology use among adults who are deaf and hard of hearing: a national survey.

    PubMed

    Maiorana-Basas, Michella; Pagliaro, Claudia M

    2014-07-01

    As society becomes increasingly more dependent on technology, information regarding the use, preference, and accessibility of commonly used devices and services among individuals who are deaf and hard of hearing (DHH) is crucial. Developing technologies that are functional and appropriately accessible allows persons who are DHH to fully participate in society, education, and business while also providing opportunities for personal and professional advancement. Although a few international studies have addressed the technology use of individuals who are DHH, none exist that focus on the needs, preferences, and accessibility of current Internet- and mobile-based technologies. Consequently, a national survey was conducted in the United States to determine the preference, frequency of use, and accessibility of various technologies (hardware, software, Web sites) by adults who are DHH and living in the United States. Findings indicate frequent use of smartphones and personal computers, specifically for text-based communication and web surfing, and little use of Teletypewriter/Telecommunications Device for the Deaf. Web site feature preferences include pictures and text, and captions over signed translations. Some results varied by demographics. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Scaffolding Students' Independent Decoding of Unfamiliar Text with a Prototype of an eBook-Feature

    ERIC Educational Resources Information Center

    Gissel, Stig T.

    2015-01-01

    This study was undertaken to design, evaluate and refine an eBook-feature that supports students' decoding of unfamiliar text. The feature supports students' independent reading of eBooks with text-to-speech, graded support in the form of syllabification and rhyme analogy, and by dividing the word material into different categories based on the…

  6. Visual affective classification by combining visual and text features.

    PubMed

    Liu, Ningning; Wang, Kai; Jin, Xin; Gao, Boyang; Dellandréa, Emmanuel; Chen, Liming

    2017-01-01

    Affective analysis of images in social networks has drawn much attention, and the texts surrounding images are proven to provide valuable semantic meanings about image content, which can hardly be represented by low-level visual features. In this paper, we propose a novel approach for visual affective classification (VAC) task. This approach combines visual representations along with novel text features through a fusion scheme based on Dempster-Shafer (D-S) Evidence Theory. Specifically, we not only investigate different types of visual features and fusion methods for VAC, but also propose textual features to effectively capture emotional semantics from the short text associated to images based on word similarity. Experiments are conducted on three public available databases: the International Affective Picture System (IAPS), the Artistic Photos and the MirFlickr Affect set. The results demonstrate that the proposed approach combining visual and textual features provides promising results for VAC task.

  7. Visual affective classification by combining visual and text features

    PubMed Central

    Liu, Ningning; Wang, Kai; Jin, Xin; Gao, Boyang; Dellandréa, Emmanuel; Chen, Liming

    2017-01-01

    Affective analysis of images in social networks has drawn much attention, and the texts surrounding images are proven to provide valuable semantic meanings about image content, which can hardly be represented by low-level visual features. In this paper, we propose a novel approach for visual affective classification (VAC) task. This approach combines visual representations along with novel text features through a fusion scheme based on Dempster-Shafer (D-S) Evidence Theory. Specifically, we not only investigate different types of visual features and fusion methods for VAC, but also propose textual features to effectively capture emotional semantics from the short text associated to images based on word similarity. Experiments are conducted on three public available databases: the International Affective Picture System (IAPS), the Artistic Photos and the MirFlickr Affect set. The results demonstrate that the proposed approach combining visual and textual features provides promising results for VAC task. PMID:28850566

  8. Text-Based Conferencing: Features vs. Functionality

    ERIC Educational Resources Information Center

    Anderson, Lynn; McCarthy, Cathy

    2005-01-01

    This report examines three text-based conferencing products: "WowBB", "Invision Power Board", and "vBulletin". Their selection was prompted by a feature-by-feature comparison of the same products on the "WowBB" website. The comparison chart painted a misleading impression of "WowBB's" features in relation to the other two products; so the…

  9. Do Particular Design Features Assist People with Aphasia to Comprehend Text? An Exploratory Study

    ERIC Educational Resources Information Center

    Wilson, Lucy; Read, Jennifer

    2016-01-01

    Background: Much of the evidence underlying guidelines for producing accessible information for people with aphasia focuses on client preference for particular design features. There is limited evidence regarding the effects of these features on comprehension. Aims: To examine the effects of specific design features on text comprehension. It was…

  10. The Use of Technology in Participant Tracking and Study Retention: Lessons Learned from a Clinical Trials Network Study

    PubMed Central

    Mitchell, Shannon Gwin; Schwartz, Robert P.; Alvanzo, Anika A. H.; Weisman, Monique S.; Kyle, Tiffany L.; Turrigiano, Eva M.; Gibson, Martha L.; Perez, Livangelie; McClure, Erin A.; Clingerman, Sara; Froias, Autumn; Shandera, Danielle R.; Walker, Robrina; Babcock, Dean L.; Bailey, Genie L.; Miele, Gloria M.; Kunkel, Lynn E.; Norton, Michael; Stitzer, Maxine L.

    2015-01-01

    Background The growing use of newer communication and internet technologies, even among low income and transient populations, require research staff to update their outreach strategies to ensure high follow-up and participant retention rates. This paper presents the views of research assistants on the use of cell phones and the internet to track participants in a multi-site randomized trial of substance use disorder treatment. Methods Pre-interview questionnaires exploring tracking and other study-related activities were collected from 21 research staff across the 10 participating US sites. Data were then used to construct a semi-structured interview guide which, in turn, was used to interview 12 of the same staff members. The questionnaires and interview data were entered in Atlas.ti and analyzed for emergent themes related to the use of technology for participant tracking purposes. Results Study staff reported that most participants had cell phones, despite having unstable physical addresses and landlines. The incoming call feature of most cell phones was useful for participants and research staff alike, and texting proved to have additional benefits. However, reliance on participants’ cell phones also proved problematic. Even homeless participants were found to have access to the internet through public libraries and could respond to study staff e-mails. Some study sites opened generic social media accounts, through which study staff sent private messages to participants. However, the Institutional Review Board (IRB) approval process for tracking participants using social media at some sites was prohibitively lengthy. Internet searches through Google, national paid databases, obituaries, and judiciary websites were also helpful tools. Conclusions Research staff perceive that cell phones, internet searches, and social networking sites were effective tools to achieve high follow-up rates in drug abuse research. Studies should incorporate cell phone, texting, and social network website information on locator forms; obtain IRB approval for contacting participants using social networking websites; and include web searches, texting, and the use of social media in staff training as standard operating procedures. PMID:25671593

  11. Food and beverage brands that market to children and adolescents on the internet: a content analysis of branded web sites.

    PubMed

    Henry, Anna E; Story, Mary

    2009-01-01

    To identify food and beverage brand Web sites featuring designated children's areas, assess marketing techniques present on those industry Web sites, and determine nutritional quality of branded food items marketed to children. Systematic content analysis of food and beverage brand Web sites and nutrient analysis of food and beverages advertised on these Web sites. The World Wide Web. One-hundred thirty Internet Web sites of food and beverage brands with top media expenditures based on the America's Top 2000 Brands section of Brandweek magazine's annual "Superbrands" report. A standardized content analysis rating form to determine marketing techniques used on the food and beverage brand Web sites. Nutritional analysis of food brands was conducted. Of 130 Web sites analyzed, 48% featured designated children's areas. These Web sites featured a variety of Internet marketing techniques, including advergaming on 85% of the Web sites and interactive programs on 92% of the Web sites. Branded spokescharacters and tie-ins to other products were featured on the majority of the Web sites, as well. Few food brands (13%) with Web sites that market to children met the nutrition criteria set by the National Alliance for Nutrition and Activity. Nearly half of branded Web sites analyzed used designated children's areas to market food and beverages to children, 87% of which were of low nutritional quality. Nutrition professionals should advocate the use of advertising techniques to encourage healthful food choices for children.

  12. 36 CFR 67.2 - Definitions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... structure encompasses the historic building and its site, landscape features, and environment, generally... means a building and its site and landscape features. Registered Historic District means any district...

  13. 36 CFR 67.2 - Definitions.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... structure encompasses the historic building and its site, landscape features, and environment, generally... means a building and its site and landscape features. Registered Historic District means any district...

  14. 36 CFR 67.2 - Definitions.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... structure encompasses the historic building and its site, landscape features, and environment, generally... means a building and its site and landscape features. Registered Historic District means any district...

  15. 36 CFR 67.2 - Definitions.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... structure encompasses the historic building and its site, landscape features, and environment, generally... means a building and its site and landscape features. Registered Historic District means any district...

  16. 10 CFR 100.10 - Factors to be considered when evaluating sites.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... reactor incorporates unique or unusual features having a significant bearing on the probability or consequences of accidental release of radioactive materials; (4) The safety features that are to be engineered... radioactive fission products. In addition, the site location and the engineered features included as...

  17. 10 CFR 100.10 - Factors to be considered when evaluating sites.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... reactor incorporates unique or unusual features having a significant bearing on the probability or consequences of accidental release of radioactive materials; (4) The safety features that are to be engineered... radioactive fission products. In addition, the site location and the engineered features included as...

  18. 10 CFR 100.10 - Factors to be considered when evaluating sites.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... reactor incorporates unique or unusual features having a significant bearing on the probability or consequences of accidental release of radioactive materials; (4) The safety features that are to be engineered... radioactive fission products. In addition, the site location and the engineered features included as...

  19. 77 FR 48550 - Technicolor Creative Services, Post Production Feature Mastering Division Including On-Site...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-08-14

    ... Services, Post Production Feature Mastering Division Including On-Site Leased Workers From Ajilon... Services, Post Production Feature Mastering Division, Hollywood, California (subject firm). The worker... the workers meet the eligibility requirements of the Trade Act of 1974. Conclusion After careful...

  20. Detecting spam comments on Indonesia’s Instagram posts

    NASA Astrophysics Data System (ADS)

    Septiandri, Ali Akbar; Wibisono, Okiriza

    2017-01-01

    In this paper we experimented with several feature sets for detecting spam comments in social media contents authored by Indonesian public figures. We define spam comments as comments which have promotional purposes (e.g. referring other users to products and services) and thus not related to the content to which the comments are posted. Three sets of features are evaluated for detecting spams: (1) hand-engineered features such as comment length, number of capital letters, and number of emojis, (2) keyword features such as whether the comment contains advertising words or product-related words, and (3) text features, namely, bag-of-words, TF-IDF, and fastText embeddings, each combined with latent semantic analysis. With 24,000 manually-annotated comments scraped from Instagram posts authored by more than 100 Indonesian public figures, we compared the performance of these feature sets and their combinations using 3 popular classification algorithms: Na¨ıve Bayes, SVM, and XGBoost. We find that using all three feature sets (with fastText embedding for the text features) gave the best F 1-score of 0.9601 on a holdout dataset. More interestingly, fastText embedding combined with hand-engineered features (i.e. without keyword features) yield similar F 1-score of 0.9523, and McNemar’s test failed to reject the hypothesis that the two results are not significantly different. This result is important as keyword features are largely dependent on the dataset and may not be as generalisable as the other feature sets when applied to new data. For future work, we hope to collect bigger and more diverse dataset of Indonesian spam comments, improve our model’s performance and generalisability, and publish a programming package for others to reliably detect spam comments.

  1. Separate but Equal? A Comparison of Content on Library Web Pages and Their Text Versions

    ERIC Educational Resources Information Center

    Hazard, Brenda L.

    2008-01-01

    This study examines the Web sites of the Association of Research Libraries member libraries to determine the presence of a separate text version of the default graphical homepage. The content of the text version and the homepage is compared. Of 121 Web sites examined, twenty libraries currently offer a text version. Ten sites maintain wholly…

  2. Formal Features of Cyberspace: Relationships between Web Page Complexity and Site Traffic.

    ERIC Educational Resources Information Center

    Bucy, Erik P.; Lang, Annie; Potter, Robert F.; Grabe, Maria Elizabeth

    1999-01-01

    Examines differences between the formal features of commercial versus noncommercial Web sites, and the relationship between Web page complexity and amount of traffic a site receives. Findings indicate that, although most pages in this stage of the Web's development remain technologically simple and noninteractive, there are significant…

  3. Analysis and Prediction of Myristoylation Sites Using the mRMR Method, the IFS Method and an Extreme Learning Machine Algorithm.

    PubMed

    Wang, ShaoPeng; Zhang, Yu-Hang; Huang, GuoHua; Chen, Lei; Cai, Yu-Dong

    2017-01-01

    Myristoylation is an important hydrophobic post-translational modification that is covalently bound to the amino group of Gly residues on the N-terminus of proteins. The many diverse functions of myristoylation on proteins, such as membrane targeting, signal pathway regulation and apoptosis, are largely due to the lipid modification, whereas abnormal or irregular myristoylation on proteins can lead to several pathological changes in the cell. To better understand the function of myristoylated sites and to correctly identify them in protein sequences, this study conducted a novel computational investigation on identifying myristoylation sites in protein sequences. A training dataset with 196 positive and 84 negative peptide segments were obtained. Four types of features derived from the peptide segments following the myristoylation sites were used to specify myristoylatedand non-myristoylated sites. Then, feature selection methods including maximum relevance and minimum redundancy (mRMR), incremental feature selection (IFS), and a machine learning algorithm (extreme learning machine method) were adopted to extract optimal features for the algorithm to identify myristoylation sites in protein sequences, thereby building an optimal prediction model. As a result, 41 key features were extracted and used to build an optimal prediction model. The effectiveness of the optimal prediction model was further validated by its performance on a test dataset. Furthermore, detailed analyses were also performed on the extracted 41 features to gain insight into the mechanism of myristoylation modification. This study provided a new computational method for identifying myristoylation sites in protein sequences. We believe that it can be a useful tool to predict myristoylation sites from protein sequences. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  4. Variability in Text Features in Six Grade 1 Basal Reading Programs

    ERIC Educational Resources Information Center

    Foorman, Barbara R.; Francis, David J.; Davidson, Kevin C.; Harm, Michael W.; Griffin, Jennifer

    2004-01-01

    California and Texas mandate 75% to 80% decodable texts for first-grade reading programs, yet these percentages have no empirical base. This study examines the text selections in 6 first-grade programs from the perspective of lexical, semantic, and syntactic features. The composition of text differed across the 6 programs with respect to length,…

  5. TEPAPA: a novel in silico feature learning pipeline for mining prognostic and associative factors from text-based electronic medical records.

    PubMed

    Lin, Frank Po-Yen; Pokorny, Adrian; Teng, Christina; Epstein, Richard J

    2017-07-31

    Vast amounts of clinically relevant text-based variables lie undiscovered and unexploited in electronic medical records (EMR). To exploit this untapped resource, and thus facilitate the discovery of informative covariates from unstructured clinical narratives, we have built a novel computational pipeline termed Text-based Exploratory Pattern Analyser for Prognosticator and Associator discovery (TEPAPA). This pipeline combines semantic-free natural language processing (NLP), regular expression induction, and statistical association testing to identify conserved text patterns associated with outcome variables of clinical interest. When we applied TEPAPA to a cohort of head and neck squamous cell carcinoma patients, plausible concepts known to be correlated with human papilloma virus (HPV) status were identified from the EMR text, including site of primary disease, tumour stage, pathologic characteristics, and treatment modalities. Similarly, correlates of other variables (including gender, nodal status, recurrent disease, smoking and alcohol status) were also reliably recovered. Using highly-associated patterns as covariates, a patient's HPV status was classifiable using a bootstrap analysis with a mean area under the ROC curve of 0.861, suggesting its predictive utility in supporting EMR-based phenotyping tasks. These data support using this integrative approach to efficiently identify disease-associated factors from unstructured EMR narratives, and thus to efficiently generate testable hypotheses.

  6. Book reviews in medical journals.

    PubMed Central

    Kroenke, K

    1986-01-01

    In a study of book reviews published in four general medical journals over a six-month period, 480 reviews were analyzed. Twenty-five features that reviewers address when evaluating a text were identified, and the frequency of commentary for each feature was determined. The mean number of features addressed per review was 9.0. Reviews averaged 389 words, but review length did not correlate with the length or scope of the book, with the number of features addressed, nor with the reviewer's assessment of the text. Extraneous commentary by the reviewer occurred in 16% of the reviews. This editorializing appeared in lengthier reviews that addressed fewer features. Favorable reviews were far more common than unfavorable ones (88.5% vs. 11.5%). Consequently, for the fifty-five books reviewed in more than one journal, agreement regarding rating of the text was high (86%). Results of this study may provide useful guidelines for reviewers of medical texts. PMID:3947772

  7. World Wide Web Based Image Search Engine Using Text and Image Content Features

    NASA Astrophysics Data System (ADS)

    Luo, Bo; Wang, Xiaogang; Tang, Xiaoou

    2003-01-01

    Using both text and image content features, a hybrid image retrieval system for Word Wide Web is developed in this paper. We first use a text-based image meta-search engine to retrieve images from the Web based on the text information on the image host pages to provide an initial image set. Because of the high-speed and low cost nature of the text-based approach, we can easily retrieve a broad coverage of images with a high recall rate and a relatively low precision. An image content based ordering is then performed on the initial image set. All the images are clustered into different folders based on the image content features. In addition, the images can be re-ranked by the content features according to the user feedback. Such a design makes it truly practical to use both text and image content for image retrieval over the Internet. Experimental results confirm the efficiency of the system.

  8. CRIE: An automated analyzer for Chinese texts.

    PubMed

    Sung, Yao-Ting; Chang, Tao-Hsing; Lin, Wei-Chun; Hsieh, Kuan-Sheng; Chang, Kuo-En

    2016-12-01

    Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

  9. Sending and Receiving Text Messages with Sexual Content: Relations with Early Sexual Activity and Borderline Personality Features in Late Adolescence

    PubMed Central

    Brinkley, Dawn Y.; Ackerman, Robert A.; Ehrenreich, Samuel E.; Underwood, Marion K.

    2017-01-01

    This research examined adolescents’ written text messages with sexual content to investigate how sexting relates to sexual activity and borderline personality features. Participants (N = 181, 85 girls) completed a measure of borderline personality features prior to 10th grade and were subsequently given smartphones configured to capture the content of their text messages. Four days of text messaging were micro-coded for content related to sex. Following 12th grade, participants reported on their sexual activity and again completed a measure of borderline personality features. Results showed that engaging in sexting at age 16 was associated with reporting an early sexual debut, having sexual intercourse experience, having multiple sex partners, and engaging in drug use in combination with sexual activity two years later. Girls engaging in sex talk were more likely to have had sexual intercourse by age 18. Text messaging about hypothetical sex in grade 10 also predicted borderline personality features at age 18. These findings suggest that sending text messages with sexual content poses risks for adolescents. Programs to prevent risky sexual activity and to promote psychological health could be enhanced by teaching adolescents to use digital communication responsibly. PMID:28824224

  10. Sending and Receiving Text Messages with Sexual Content: Relations with Early Sexual Activity and Borderline Personality Features in Late Adolescence.

    PubMed

    Brinkley, Dawn Y; Ackerman, Robert A; Ehrenreich, Samuel E; Underwood, Marion K

    2017-05-01

    This research examined adolescents' written text messages with sexual content to investigate how sexting relates to sexual activity and borderline personality features. Participants (N = 181, 85 girls) completed a measure of borderline personality features prior to 10 th grade and were subsequently given smartphones configured to capture the content of their text messages. Four days of text messaging were micro-coded for content related to sex. Following 12 th grade, participants reported on their sexual activity and again completed a measure of borderline personality features. Results showed that engaging in sexting at age 16 was associated with reporting an early sexual debut, having sexual intercourse experience, having multiple sex partners, and engaging in drug use in combination with sexual activity two years later. Girls engaging in sex talk were more likely to have had sexual intercourse by age 18. Text messaging about hypothetical sex in grade 10 also predicted borderline personality features at age 18. These findings suggest that sending text messages with sexual content poses risks for adolescents. Programs to prevent risky sexual activity and to promote psychological health could be enhanced by teaching adolescents to use digital communication responsibly.

  11. Meteorological data for four sites at surface-disruption features in Yucca Flat, Nevada Test Site, Nye County, Nevada, 1985-86

    USGS Publications Warehouse

    Carman, Rita L.

    1994-01-01

    Surface-disruption features, or craters, resulting from underground nuclear testing at the Nevada Test Site may increase the potential for ground-water recharge in an area that would normally produce little, if any, recharge. This report presents selected meteorological data resulting from a study of two surface-disruption features during May 1985 through June 1986. The data were collected at four adjacent sites in Yucca Flat, about 56 kilometers north of Mercury, Nevada. Three sites (one in each of two craters and one at an undisturbed site at the original land surface) were instrumented to collect meteorological data for calculating bare-soil evaporation. These data include (1) long-wave radiation, (2) short-wave radiation, (3) net radiation, (4) air temperae, and (5) soil surface temperature. Meteorological data also were collected at a weather station at an undisturbed site near the study craters. Data collected at this site include (1) air temperature, (2) relative humidity, (3) wind velocity, and (4) wind direction.

  12. Feature engineering for MEDLINE citation categorization with MeSH.

    PubMed

    Jimeno Yepes, Antonio Jose; Plaza, Laura; Carrillo-de-Albornoz, Jorge; Mork, James G; Aronson, Alan R

    2015-04-08

    Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations. Traditional features like unigrams and bigrams exhibit strong performance compared to other feature sets. Little or no improvement is obtained when using meta-data or citation structure. Noun phrases are too sparse and thus have lower performance compared to more traditional features. Conceptual annotation of the texts by MetaMap shows similar performance compared to unigrams, but adding concepts from the UMLS taxonomy does not improve the performance of using only mapped concepts. The combination of all the features performs largely better than any individual feature set considered. In addition, this combination improves the performance of a state-of-the-art MeSH indexer. Concerning the machine learning algorithms, we find that those that are more resilient to class imbalance largely obtain better performance. We conclude that even though traditional features such as unigrams and bigrams have strong performance compared to other features, it is possible to combine them to effectively improve the performance of the bag-of-words representation. We have also found that the combination of the learning algorithm and feature sets has an influence in the overall performance of the system. Moreover, using learning algorithms resilient to class imbalance largely improves performance. However, when using a large set of features, consideration needs to be taken with algorithms due to the risk of over-fitting. Specific combinations of learning algorithms and features for individual MeSH headings could further increase the performance of an indexing system.

  13. Learning discriminative functional network features of schizophrenia

    NASA Astrophysics Data System (ADS)

    Gheiratmand, Mina; Rish, Irina; Cecchi, Guillermo; Brown, Matthew; Greiner, Russell; Bashivan, Pouya; Polosecki, Pablo; Dursun, Serdar

    2017-03-01

    Associating schizophrenia with disrupted functional connectivity is a central idea in schizophrenia research. However, identifying neuroimaging-based features that can serve as reliable "statistical biomarkers" of the disease remains a challenging open problem. We argue that generalization accuracy and stability of candidate features ("biomarkers") must be used as additional criteria on top of standard significance tests in order to discover more robust biomarkers. Generalization accuracy refers to the utility of biomarkers for making predictions about individuals, for example discriminating between patients and controls, in novel datasets. Feature stability refers to the reproducibility of the candidate features across different datasets. Here, we extracted functional connectivity network features from fMRI data at both high-resolution (voxel-level) and a spatially down-sampled lower-resolution ("supervoxel" level). At the supervoxel level, we used whole-brain network links, while at the voxel level, due to the intractably large number of features, we sampled a subset of them. We compared statistical significance, stability and discriminative utility of both feature types in a multi-site fMRI dataset, composed of schizophrenia patients and healthy controls. For both feature types, a considerable fraction of features showed significant differences between the two groups. Also, both feature types were similarly stable across multiple data subsets. However, the whole-brain supervoxel functional connectivity features showed a higher cross-validation classification accuracy of 78.7% vs. 72.4% for the voxel-level features. Cross-site variability and heterogeneity in the patient samples in the multi-site FBIRN dataset made the task more challenging compared to single-site studies. The use of the above methodology in combination with the fully data-driven approach using the whole brain information have the potential to shed light on "biomarker discovery" in schizophrenia.

  14. Arabic OCR: toward a complete system

    NASA Astrophysics Data System (ADS)

    El-Bialy, Ahmed M.; Kandil, Ahmed H.; Hashish, Mohamed; Yamany, Sameh M.

    1999-12-01

    Latin and Chinese OCR systems have been studied extensively in the literature. Yet little work was performed for Arabic character recognition. This is due to the technical challenges found in the Arabic text. Due to its cursive nature, a powerful and stable text segmentation is needed. Also; features capturing the characteristics of the rich Arabic character representation are needed to build the Arabic OCR. In this paper a novel segmentation technique which is font and size independent is introduced. This technique can segment the cursive written text line even if the line suffers from small skewness. The technique is not sensitive to the location of the centerline of the text line and can segment different font sizes and type (for different character sets) occurring on the same line. Features extraction is considered one of the most important phases of the text reading system. Ideally, the features extracted from a character image should capture the essential characteristics of this character that are independent of the font type and size. In such ideal case, the classifier stores a single prototype per character. However, it is practically challenging to find such ideal set of features. In this paper, a set of features that reflect the topological aspects of Arabia characters is proposed. These proposed features integrated with a topological matching technique introduce an Arabic text reading system that is semi Omni.

  15. Selection of nest-site habitat by interior least terns in relation to sandbar construction

    USGS Publications Warehouse

    Sherfy, M.H.; Stucker, J.H.; Buhl, D.A.

    2012-01-01

    Federally endangered interior least terns (Sternula antillarum) nest on bare or sparsely vegetated sandbars on midcontinent river systems. Loss of nesting habitat has been implicated as a cause of population declines, and managing these habitats is a major initiative in population recovery. One such initiative involves construction of mid-channel sandbars on the Missouri River, where natural sandbar habitat has declined in quantity and quality since the late 1990s. We evaluated nest-site habitat selection by least terns on constructed and natural sandbars by comparing vegetation, substrate, and debris variables at nest sites (na =a 798) and random points (na =a 1,113) in bare or sparsely vegetated habitats. Our logistic regression models revealed that a broader suite of habitat features was important in nest-site selection on constructed than on natural sandbars. Odds ratios for habitat variables indicated that avoidance of habitat features was the dominant nest-site selection process on both sandbar types, with nesting terns being attracted to nest-site habitat features (gravel and debris) and avoiding vegetation only on constructed sandbars, and avoiding silt and leaf litter on both sandbar types. Despite the seemingly uniform nature of these habitats, our results suggest that a complex suite of habitat features influences nest-site choice by least terns. However, nest-site selection in this social, colonially nesting species may be influenced by other factors, including spatial arrangement of bare sand habitat, proximity to other least terns, and prior habitat occupancy by piping plovers (Charadrius melodus). We found that nest-site selection was sensitive to subtle variation in habitat features, suggesting that rigor in maintaining habitat condition will be necessary in managing sandbars for the benefit of least terns. Further, management strategies that reduce habitat features that are avoided by least terns may be the most beneficial to nesting least terns. ?? 2011 The Wildlife Society.

  16. Selection of nest-site habitat by interior least terns in relation to sandbar construction

    USGS Publications Warehouse

    Sherfy, Mark H.; Stucker, Jennifer H.; Buhl, Deborah A.

    2012-01-01

    Federally endangered interior least terns (Sternula antillarum) nest on bare or sparsely vegetated sandbars on midcontinent river systems. Loss of nesting habitat has been implicated as a cause of population declines, and managing these habitats is a major initiative in population recovery. One such initiative involves construction of mid-channel sandbars on the Missouri River, where natural sandbar habitat has declined in quantity and quality since the late 1990s. We evaluated nest-site habitat selection by least terns on constructed and natural sandbars by comparing vegetation, substrate, and debris variables at nest sites (n = 798) and random points (n = 1,113) in bare or sparsely vegetated habitats. Our logistic regression models revealed that a broader suite of habitat features was important in nest-site selection on constructed than on natural sandbars. Odds ratios for habitat variables indicated that avoidance of habitat features was the dominant nest-site selection process on both sandbar types, with nesting terns being attracted to nest-site habitat features (gravel and debris) and avoiding vegetation only on constructed sandbars, and avoiding silt and leaf litter on both sandbar types. Despite the seemingly uniform nature of these habitats, our results suggest that a complex suite of habitat features influences nest-site choice by least terns. However, nest-site selection in this social, colonially nesting species may be influenced by other factors, including spatial arrangement of bare sand habitat, proximity to other least terns, and prior habitat occupancy by piping plovers (Charadrius melodus). We found that nest-site selection was sensitive to subtle variation in habitat features, suggesting that rigor in maintaining habitat condition will be necessary in managing sandbars for the benefit of least terns. Further, management strategies that reduce habitat features that are avoided by least terns may be the most beneficial to nesting least terns.

  17. Is the recall of verbal-spatial information from working memory affected by symptoms of ADHD?

    PubMed

    Caterino, Linda C; Verdi, Michael P

    2012-10-01

    OJECTIVE: The Kulhavy model for text learning using organized spatial displays proposes that learning will be increased when participants view visual images prior to related text. In contrast to previous studies, this study also included students who exhibited symptoms of ADHD. Participants were presented with either a map-text or text-map condition. The map-text condition led to a significantly higher performance than the text-map condition, overall. However, students who endorsed more symptoms of inattention and hyperactivity-impulsivity scored more poorly when asked to recall text facts, text features, and map features and were less able to correctly place map features on a reconstructed map than were students who endorsed fewer symptoms. The results of the study support the Kulhavy model for typical students; however, the benefit of viewing a display prior to text was not seen for students with ADHD symptoms, thus supporting previous studies that have demonstrated that ADHD appears to negatively affect operations that occur in working memory.

  18. TSAPA: identification of tissue-specific alternative polyadenylation sites in plants.

    PubMed

    Ji, Guoli; Chen, Moliang; Ye, Wenbin; Zhu, Sheng; Ye, Congting; Su, Yaru; Peng, Haonan; Wu, Xiaohui

    2018-06-15

    Alternative polyadenylation (APA) is now emerging as a widespread mechanism modulated tissue-specifically, which highlights the need to define tissue-specific poly(A) sites for profiling APA dynamics across tissues. We have developed an R package called TSAPA based on the machine learning model for identifying tissue-specific poly(A) sites in plants. A feature space including more than 200 features was assembled to specifically characterize poly(A) sites in plants. The classification model in TSAPA can be customized by selecting desirable features or classifiers. TSAPA is also capable of predicting tissue-specific poly(A) sites in unannotated intergenic regions. TSAPA will be a valuable addition to the community for studying dynamics of APA in plants. https://github.com/BMILAB/TSAPA. Supplementary data are available at Bioinformatics online.

  19. Shocked quartz in the cretaceous-tertiary boundary clays: Evidence for a global distribution

    USGS Publications Warehouse

    Bohor, B.F.; Modreski, P.J.; Foord, E.E.

    1987-01-01

    Shocked quartz grains displaying planar features were isolated from Cretaceous-Tertiary boundary days at five sites in Europe, a core from the north-central Pacific Ocean, and a site in New Zealand. At all of these sites, the planar features in the shocked quartz can be indexed to rational crystallographic planes of the quartz lattice. The grains display streaking indicative of shock in x-ray diffraction photographs and also show reduced refractive indices. These characteristic features of shocked quartz at several sites worldwide confirm that an impact event at the Cretaceous-Tertiary boundary distributed ejecta products in an earth-girdling dust cloud, as postulated by the Alvarez impact hypothesis.

  20. Predicting conformational ensembles and genome-wide transcription factor binding sites from DNA sequences.

    PubMed

    Andrabi, Munazah; Hutchins, Andrew Paul; Miranda-Saavedra, Diego; Kono, Hidetoshi; Nussinov, Ruth; Mizuguchi, Kenji; Ahmad, Shandar

    2017-06-22

    DNA shape is emerging as an important determinant of transcription factor binding beyond just the DNA sequence. The only tool for large scale DNA shape estimates, DNAshape was derived from Monte-Carlo simulations and predicts four broad and static DNA shape features, Propeller twist, Helical twist, Minor groove width and Roll. The contributions of other shape features e.g. Shift, Slide and Opening cannot be evaluated using DNAshape. Here, we report a novel method DynaSeq, which predicts molecular dynamics-derived ensembles of a more exhaustive set of DNA shape features. We compared the DNAshape and DynaSeq predictions for the common features and applied both to predict the genome-wide binding sites of 1312 TFs available from protein interaction quantification (PIQ) data. The results indicate a good agreement between the two methods for the common shape features and point to advantages in using DynaSeq. Predictive models employing ensembles from individual conformational parameters revealed that base-pair opening - known to be important in strand separation - was the best predictor of transcription factor-binding sites (TFBS) followed by features employed by DNAshape. Of note, TFBS could be predicted not only from the features at the target motif sites, but also from those as far as 200 nucleotides away from the motif.

  1. Word-level recognition of multifont Arabic text using a feature vector matching approach

    NASA Astrophysics Data System (ADS)

    Erlandson, Erik J.; Trenkle, John M.; Vogt, Robert C., III

    1996-03-01

    Many text recognition systems recognize text imagery at the character level and assemble words from the recognized characters. An alternative approach is to recognize text imagery at the word level, without analyzing individual characters. This approach avoids the problem of individual character segmentation, and can overcome local errors in character recognition. A word-level recognition system for machine-printed Arabic text has been implemented. Arabic is a script language, and is therefore difficult to segment at the character level. Character segmentation has been avoided by recognizing text imagery of complete words. The Arabic recognition system computes a vector of image-morphological features on a query word image. This vector is matched against a precomputed database of vectors from a lexicon of Arabic words. Vectors from the database with the highest match score are returned as hypotheses for the unknown image. Several feature vectors may be stored for each word in the database. Database feature vectors generated using multiple fonts and noise models allow the system to be tuned to its input stream. Used in conjunction with database pruning techniques, this Arabic recognition system has obtained promising word recognition rates on low-quality multifont text imagery.

  2. Comparisons and Selections of Features and Classifiers for Short Text Classification

    NASA Astrophysics Data System (ADS)

    Wang, Ye; Zhou, Zhi; Jin, Shan; Liu, Debin; Lu, Mi

    2017-10-01

    Short text is considerably different from traditional long text documents due to its shortness and conciseness, which somehow hinders the applications of conventional machine learning and data mining algorithms in short text classification. According to traditional artificial intelligence methods, we divide short text classification into three steps, namely preprocessing, feature selection and classifier comparison. In this paper, we have illustrated step-by-step how we approach our goals. Specifically, in feature selection, we compared the performance and robustness of the four methods of one-hot encoding, tf-idf weighting, word2vec and paragraph2vec, and in the classification part, we deliberately chose and compared Naive Bayes, Logistic Regression, Support Vector Machine, K-nearest Neighbor and Decision Tree as our classifiers. Then, we compared and analysed the classifiers horizontally with each other and vertically with feature selections. Regarding the datasets, we crawled more than 400,000 short text files from Shanghai and Shenzhen Stock Exchanges and manually labeled them into two classes, the big and the small. There are eight labels in the big class, and 59 labels in the small class.

  3. A systematic identification of species-specific protein succinylation sites using joint element features information.

    PubMed

    Hasan, Md Mehedi; Khatun, Mst Shamima; Mollah, Md Nurul Haque; Yong, Cao; Guo, Dianjing

    2017-01-01

    Lysine succinylation, an important type of protein posttranslational modification, plays significant roles in many cellular processes. Accurate identification of succinylation sites can facilitate our understanding about the molecular mechanism and potential roles of lysine succinylation. However, even in well-studied systems, a majority of the succinylation sites remain undetected because the traditional experimental approaches to succinylation site identification are often costly, time-consuming, and laborious. In silico approach, on the other hand, is potentially an alternative strategy to predict succinylation substrates. In this paper, a novel computational predictor SuccinSite2.0 was developed for predicting generic and species-specific protein succinylation sites. This predictor takes the composition of profile-based amino acid and orthogonal binary features, which were used to train a random forest classifier. We demonstrated that the proposed SuccinSite2.0 predictor outperformed other currently existing implementations on a complementarily independent dataset. Furthermore, the important features that make visible contributions to species-specific and cross-species-specific prediction of protein succinylation site were analyzed. The proposed predictor is anticipated to be a useful computational resource for lysine succinylation site prediction. The integrated species-specific online tool of SuccinSite2.0 is publicly accessible.

  4. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    PubMed

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  5. An Educational System to Help Students Assess Website Features and Identify High-Risk Websites

    ERIC Educational Resources Information Center

    Kajiyama, Tomoko; Echizen, Isao

    2015-01-01

    Purpose: The purpose of this paper is to propose an effective educational system to help students assess Web site risk by providing an environment in which students can better understand a Web site's features and determine the risks of accessing the Web site for themselves. Design/methodology/approach: The authors have enhanced a prototype…

  6. A Recollimation Shock in a Stationary Jet Feature with Limb-brightening in the Gamma-Ray-emitting Narrow-line Seyfert 1 Galaxy 1H 0323+342

    NASA Astrophysics Data System (ADS)

    Doi, Akihiro; Hada, Kazuhiro; Kino, Motoki; Wajima, Kiyoaki; Nakahara, Satomi

    2018-04-01

    We report the discovery of a local convergence of a jet cross section in the quasi-stationary jet feature in the γ-ray-emitting narrow-line Seyfert 1 galaxy (NLS1) 1H 0323+342. The convergence site is located at ∼7 mas (corresponding to the order of 100 pc in deprojection) from the central engine. We also found limb-brightened jet structures at both the upstream and downstream of the convergence site. We propose that the quasi-stationary feature showing the jet convergence and limb-brightening occurs as a consequence of recollimation shock in the relativistic jets. The quasi-stationary feature is one of the possible γ-ray-emitting sites in this NLS1, in analogy with the HST-1 complex in the M87 jet. Monitoring observations have revealed that superluminal components passed through the convergence site and the peak intensity of the quasi-stationary feature, which showed apparent coincidences with the timing of observed γ-ray activities.

  7. A combinatorial feature selection approach to describe the QSAR of dual site inhibitors of acetylcholinesterase.

    PubMed

    Asadabadi, Ebrahim Barzegari; Abdolmaleki, Parviz; Barkooie, Seyyed Mohsen Hosseini; Jahandideh, Samad; Rezaei, Mohammad Ali

    2009-12-01

    Regarding the great potential of dual binding site inhibitors of acetylcholinesterase as the future potent drugs of Alzheimer's disease, this study was devoted to extraction of the most effective structural features of these inhibitors from among a large number of quantitative descriptors. To do this, we adopted a unique approach in quantitative structure-activity relationships. An efficient feature selection method was emphasized in such an approach, using the confirmative results of different routine and novel feature selection methods. The proposed methods generated quite consistent results ensuring the effectiveness of the selected structural features.

  8. Distinctiveness and encoding effects in online sentence comprehension

    PubMed Central

    Hofmeister, Philip; Vasishth, Shravan

    2014-01-01

    In explicit memory recall and recognition tasks, elaboration and contextual isolation both facilitate memory performance. Here, we investigate these effects in the context of sentence processing: targets for retrieval during online sentence processing of English object relative clause constructions differ in the amount of elaboration associated with the target noun phrase, or the homogeneity of superficial features (text color). Experiment 1 shows that greater elaboration for targets during the encoding phase reduces reading times at retrieval sites, but elaboration of non-targets has considerably weaker effects. Experiment 2 illustrates that processing isolated superficial features of target noun phrases—here, a green word in a sentence with words colored white—does not lead to enhanced memory performance, despite triggering longer encoding times. These results are interpreted in the light of the memory models of Nairne, 1990, 2001, 2006, which state that encoding remnants contribute to the set of retrieval cues that provide the basis for similarity-based interference effects. PMID:25566105

  9. Towards semantically sensitive text clustering: a feature space modeling technology based on dimension extension.

    PubMed

    Liu, Yuanchao; Liu, Ming; Wang, Xin

    2015-01-01

    The objective of text clustering is to divide document collections into clusters based on the similarity between documents. In this paper, an extension-based feature modeling approach towards semantically sensitive text clustering is proposed along with the corresponding feature space construction and similarity computation method. By combining the similarity in traditional feature space and that in extension space, the adverse effects of the complexity and diversity of natural language can be addressed and clustering semantic sensitivity can be improved correspondingly. The generated clusters can be organized using different granularities. The experimental evaluations on well-known clustering algorithms and datasets have verified the effectiveness of our approach.

  10. Towards Semantically Sensitive Text Clustering: A Feature Space Modeling Technology Based on Dimension Extension

    PubMed Central

    Liu, Yuanchao; Liu, Ming; Wang, Xin

    2015-01-01

    The objective of text clustering is to divide document collections into clusters based on the similarity between documents. In this paper, an extension-based feature modeling approach towards semantically sensitive text clustering is proposed along with the corresponding feature space construction and similarity computation method. By combining the similarity in traditional feature space and that in extension space, the adverse effects of the complexity and diversity of natural language can be addressed and clustering semantic sensitivity can be improved correspondingly. The generated clusters can be organized using different granularities. The experimental evaluations on well-known clustering algorithms and datasets have verified the effectiveness of our approach. PMID:25794172

  11. Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text

    PubMed Central

    Xin, Yu; Hochberg, Ephraim; Joshi, Rohit; Uzuner, Ozlem; Szolovits, Peter

    2015-01-01

    Objective Extracting medical knowledge from electronic medical records requires automated approaches to combat scalability limitations and selection biases. However, existing machine learning approaches are often regarded by clinicians as black boxes. Moreover, training data for these automated approaches at often sparsely annotated at best. The authors target unsupervised learning for modeling clinical narrative text, aiming at improving both accuracy and interpretability. Methods The authors introduce a novel framework named subgraph augmented non-negative tensor factorization (SANTF). In addition to relying on atomic features (e.g., words in clinical narrative text), SANTF automatically mines higher-order features (e.g., relations of lymphoid cells expressing antigens) from clinical narrative text by converting sentences into a graph representation and identifying important subgraphs. The authors compose a tensor using patients, higher-order features, and atomic features as its respective modes. We then apply non-negative tensor factorization to cluster patients, and simultaneously identify latent groups of higher-order features that link to patient clusters, as in clinical guidelines where a panel of immunophenotypic features and laboratory results are used to specify diagnostic criteria. Results and Conclusion SANTF demonstrated over 10% improvement in averaged F-measure on patient clustering compared to widely used non-negative matrix factorization (NMF) and k-means clustering methods. Multiple baselines were established by modeling patient data using patient-by-features matrices with different feature configurations and then performing NMF or k-means to cluster patients. Feature analysis identified latent groups of higher-order features that lead to medical insights. We also found that the latent groups of atomic features help to better correlate the latent groups of higher-order features. PMID:25862765

  12. Vaccine adverse event text mining system for extracting features from vaccine safety reports.

    PubMed

    Botsis, Taxiarchis; Buttolph, Thomas; Nguyen, Michael D; Winiecki, Scott; Woo, Emily Jane; Ball, Robert

    2012-01-01

    To develop and evaluate a text mining system for extracting key clinical features from vaccine adverse event reporting system (VAERS) narratives to aid in the automated review of adverse event reports. Based upon clinical significance to VAERS reviewing physicians, we defined the primary (diagnosis and cause of death) and secondary features (eg, symptoms) for extraction. We built a novel vaccine adverse event text mining (VaeTM) system based on a semantic text mining strategy. The performance of VaeTM was evaluated using a total of 300 VAERS reports in three sequential evaluations of 100 reports each. Moreover, we evaluated the VaeTM contribution to case classification; an information retrieval-based approach was used for the identification of anaphylaxis cases in a set of reports and was compared with two other methods: a dedicated text classifier and an online tool. The performance metrics of VaeTM were text mining metrics: recall, precision and F-measure. We also conducted a qualitative difference analysis and calculated sensitivity and specificity for classification of anaphylaxis cases based on the above three approaches. VaeTM performed best in extracting diagnosis, second level diagnosis, drug, vaccine, and lot number features (lenient F-measure in the third evaluation: 0.897, 0.817, 0.858, 0.874, and 0.914, respectively). In terms of case classification, high sensitivity was achieved (83.1%); this was equal and better compared to the text classifier (83.1%) and the online tool (40.7%), respectively. Our VaeTM implementation of a semantic text mining strategy shows promise in providing accurate and efficient extraction of key features from VAERS narratives.

  13. Teleradiology and screening mammography: a telemammography system evaluation and comparison to clinical results

    NASA Astrophysics Data System (ADS)

    Leader, Joseph K.; Chough, Denise; Clearfield, Ronald J.; Ganott, Marie A.; Hakim, Christiane; Hardesty, Lara; Shindel, Betty; Sumkin, Jules H.; Drescher, John M.; Maitz, Glenn S.; Gur, David

    2005-04-01

    Radiologists' performance reviewing and rating breast cancer screening mammography exams using a telemammography system was evaluated and compared with the actual clinical interpretations of the same interpretations. Mammography technologists from three remote imaging sites transmitted 245 exams to a central site (radiologists), which they (the technologists) believed needed additional procedures (termed "recall"). Current exam image data and non-image data (i.e., technologist's text message, technologist's graphic marks, patient's prior report, and Computer Aided Detection (CAD) results) were transmitted to the central site and displayed on three high-resolution, portrait monitors. Seven radiologists interpreted ("recall" or "no recall") the exams using the telemammography workstation in three separate multi-mode studies. The mean telemammography recall rates ranged from 72.3% to 82.5% while the actual clinical recall rates ranged from 38.4% to 42.3% across the three studies. Mean Kappa of agreement ranged from 0.102 to 0.213 and mean percent agreement ranged from 48.7% to 57.4% across the three studies. Eighty-seven percent of the disagreement interpretations occurred when the telemammography interpretation resulted in a recommendation to recall and the clinical interpretation resulted in a recommendation not to recall. The poor agreement between the telemammography and clinical interpretations may indicate a critical dependence on images from prior screening exams rather than any text based information. The technologists were sensitive, if not specific, to the mammography features and changes that may lead to recall. Using the telemammography system the radiologists were able to reduce the recommended recalls by the technologist by approximately 25 percent.

  14. On Social e-Learning

    NASA Astrophysics Data System (ADS)

    Kim, Won; Jeong, Ok-Ran

    Social Web sites include social networking sites and social media sites. They make it possible for people to share user-created contents online and to interact and stay connected with their online people networks. The social features of social Web sites, appropriately adapted, can help turn e-learning into social e-learning and make e-learning significantly more effective. In this paper, we develop requirements for social e-learning systems. They include incorporating the many of the social features of social Web sites, accounting for all key stakeholders and learning subjects, and curbing various types of misuses by people. We also examine the capabilities of representative social e-learning Web sites that are available today.

  15. Text analysis devices, articles of manufacture, and text analysis methods

    DOEpatents

    Turner, Alan E; Hetzler, Elizabeth G; Nakamura, Grant C

    2013-05-28

    Text analysis devices, articles of manufacture, and text analysis methods are described according to some aspects. In one aspect, a text analysis device includes processing circuitry configured to analyze initial text to generate a measurement basis usable in analysis of subsequent text, wherein the measurement basis comprises a plurality of measurement features from the initial text, a plurality of dimension anchors from the initial text and a plurality of associations of the measurement features with the dimension anchors, and wherein the processing circuitry is configured to access a viewpoint indicative of a perspective of interest of a user with respect to the analysis of the subsequent text, and wherein the processing circuitry is configured to use the viewpoint to generate the measurement basis.

  16. Site occupancy, composition and magnetic structure dependencies of martensitic transformation in Mn2Ni1 + x Sn1-x.

    PubMed

    Kundu, Ashis; Ghosh, Subhradip

    2017-11-29

    A delicate balance between various factors such as site occupancy, composition and magnetic ordering seems to affect the stability of the martensitic phase in [Formula: see text] [Formula: see text] [Formula: see text]. Using first-principles DFT calculations, we explore the impacts of each one of these factors on the martensitic stability of this system. Our results on total energies, magnetic moments and electronic structures upon changes in the composition, the magnetic configurations and the site occupancies show that the occupancies at the 4d sites in the inverse Heusler crystal structure play the most crucial role. The presence of Mn at the 4d sites originally occupied by Sn and its interaction with the Mn atoms at other sites decide the stability of the martensitic phases. This explains the discrepancy between the experiments and earlier DFT calculations regarding phase stability in [Formula: see text]NiSn. Our results qualitatively explain the trends observed experimentally with regard to martensitic phase stability and the magnetisations in Ni-excess, Sn-deficient [Formula: see text]NiSn system.

  17. Evaluating stability of histomorphometric features across scanner and staining variations: predicting biochemical recurrence from prostate cancer whole slide images

    NASA Astrophysics Data System (ADS)

    Leo, Patrick; Lee, George; Madabhushi, Anant

    2016-03-01

    Quantitative histomorphometry (QH) is the process of computerized extraction of features from digitized tissue slide images. Typically these features are used in machine learning classifiers to predict disease presence, behavior and outcome. Successful robust classifiers require features that both discriminate between classes of interest and are stable across data from multiple sites. Feature stability may be compromised by variation in slide staining and scanning procedures. These laboratory specific variables include dye batch, slice thickness and the whole slide scanner used to digitize the slide. The key therefore is to be able to identify features that are not only discriminating between the classes of interest (e.g. cancer and non-cancer or biochemical recurrence and non- recurrence) but also features that will not wildly fluctuate on slides representing the same tissue class but from across multiple different labs and sites. While there has been some recent efforts at understanding feature stability in the context of radiomics applications (i.e. feature analysis of radiographic images), relatively few attempts have been made at studying the trade-off between feature stability and discriminability for histomorphometric and digital pathology applications. In this paper we present two new measures, preparation-induced instability score (PI) and latent instability score (LI), to quantify feature instability across and within datasets. Dividing PI by LI yields a ratio for how often a feature for a specific tissue class (e.g. low grade prostate cancer) is different between datasets from different sites versus what would be expected from random chance alone. Using this ratio we seek to quantify feature vulnerability to variations in slide preparation and digitization. Since our goal is to identify stable QH features we evaluate these features for their stability and thus inclusion in machine learning based classifiers in a use case involving prostate cancer. Specifically we examine QH features which may predict 5 year biochemical recurrence for prostate cancer patients who have undergone radical prostatectomy from digital slide images of surgically excised tissue specimens, 5 year biochemical recurrence being a strong predictor of disease recurrence. In this study we evaluated the ability of our feature robustness indices to identify the most stable and predictive features of 5 year biochemical recurrence using digitized slide images of surgically excised prostate cancer specimens from 80 different patients across 4 different sites. A total of 242 features from 5 different feature families were investigated to identify the most stable QH features from our set. Our feature robustness indices (PI and LI) suggested that five feature families (graph, shape, co-occurring gland tensors, gland sub-graphs, texture) were susceptible to variations in slide preparation and digitization across various sites. The family least affected was shape features in which 19.3% of features varied across laboratories while the most vulnerable family, at 55.6%, was the gland disorder features. However the disorder features were the most stable within datasets being different between random halves of a dataset in an average of just 4.1% of comparisons while texture features were the most unstable being different at a rate of 4.7%. We also compared feature stability across two datasets before and after color normalization. Color normalization decreased feature stability with 8% and 34% of features different between the two datasets in two outcome groups prior to normalization and 49% and 51% different afterwards. Our results appear to suggest that evaluation of QH features across multiple sites needs to be undertaken to assess robustness and class discriminability alone should not represent the benchmark for selection of QH features to build diagnostic and prognostic digital pathology classifiers.

  18. Sandhill crane roost selection, human disturbance, and forage resources

    USGS Publications Warehouse

    Pearse, Aaron T.; Krapu, Gary; Brandt, David

    2017-01-01

    Sites used for roosting represent a key habitat requirement for many species of birds because availability and quality of roost sites can influence individual fitness. Birds select roost sites based on numerous factors, requirements, and motivations, and selection of roosts can be dynamic in time and space because of various ecological and environmental influences. For sandhill cranes (Antigone canadensis) at their main spring-staging area along the Platte River in south-central Nebraska, USA, past investigations of roosting cranes focused on physical channel characteristics related to perceived security as motivating roost distribution. We used 6,310 roost sites selected by 313 sandhill cranes over 5 spring migration seasons (2003–2007) to quantify resource selection functions of roost sites on the central Platte River using a discrete choice analysis. Sandhill cranes generally showed stronger selection for wider channels with shorter bank vegetation situated farther from potential human disturbance features such as roads, bridges, and dwellings. Furthermore, selection for roost sites with preferable physical characteristics (wide channels with short bank vegetation) was more resilient to nearby disturbance features than more narrow channels with taller bank vegetation. The amount of cornfields surrounding sandhill crane roost sites positively influenced relative probability of use but only for more narrow channels < 100 m and those with shorter bank vegetation. We confirmed key resource features that sandhill cranes selected at river channels along the Platte River, and after incorporating spatial variation due to human disturbance, our understanding of roost site selection was more robust, providing insights on how disturbance may interact with physical habitat features. Managers can use information on roost-site selection when developing plans to increase probability of crane use at existing roost sites and to identify new areas for potential use if existing sites become limited.

  19. Global and Local Features Based Classification for Bleed-Through Removal

    NASA Astrophysics Data System (ADS)

    Hu, Xiangyu; Lin, Hui; Li, Shutao; Sun, Bin

    2016-12-01

    The text on one side of historical documents often seeps through and appears on the other side, so the bleed-through is a common problem in historical document images. It makes the document images hard to read and the text difficult to recognize. To improve the image quality and readability, the bleed-through has to be removed. This paper proposes a global and local features extraction based bleed-through removal method. The Gaussian mixture model is used to get the global features of the images. Local features are extracted by the patch around each pixel. Then, the extreme learning machine classifier is utilized to classify the scanned images into the foreground text and the bleed-through component. Experimental results on real document image datasets show that the proposed method outperforms the state-of-the-art bleed-through removal methods and preserves the text strokes well.

  20. Guiding Readers to New Understandings through Electronic Text.

    ERIC Educational Resources Information Center

    Patterson, Nancy, Ed.; Pipkin, Gloria, Ed.

    2001-01-01

    Argues that computer technology can help to engage struggling readers in meaningful transactions with text. Lists and describes seven web sites that will captivate reluctant readers. Notes three web sites that send students on "WebQuests" to transact with text in order to build knowledge. Discusses other ways to engage students in text via…

  1. Collaboration for Education with the Apple Learning Interchange

    NASA Astrophysics Data System (ADS)

    Young, Patrick A.; Zimmerman, T.; Knierman, K. A.

    2006-12-01

    We present a progressive effort to deliver online education and outreach resources in collaboration with the Apple Learning Interchange, a free community for educators. We have created a resource site with astronomy activities, video training for the activities, and the possibility of interactive training through video chat services. Also in development is an online textbook for graduate and advanced undergraduate courses in stellar evolution, featuring an updatable and annotated text with multimedia content, online lectures, podcasts, and a framework for interactive simulation activities. Both sites will be highly interactive, combining online discussions, the opportunity for live video interaction, and a growing library of student work samples. This effort promises to provide a compelling model for collaboration between science educators and corporations. As scientists, we provide content knowledge and a compelling reason to communicate, while Apple provides technical expertise, a deep knowledge of online education, and a way for us to reach a wide audience of higher education, community outreach, and K-12 educators.

  2. Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition.

    PubMed

    Zhao, Xiaowei; Ning, Qiao; Ai, Meiyue; Chai, Haiting; Yang, Guifu

    2016-06-07

    As a selective and reversible protein post-translational modification, S-glutathionylation generates mixed disulfides between glutathione (GSH) and cysteine residues, and plays an important role in regulating protein activity, stability, and redox regulation. To fully understand S-glutathionylation mechanisms, identification of substrates and specific S-Glutathionylated sites is crucial. Experimental identification of S-glutathionylated sites is labor-intensive and time consuming, so establishing an effective computational method is much desirable due to their convenient and fast speed. Therefore, in this study, a new bioinformatics tool named SSGlu (Species-Specific identification of Protein S-glutathionylation Sites) was developed to identify species-specific protein S-glutathionylated sites, utilizing support vector machines that combine multiple sequence-derived features with a two-step feature selection. By 5-fold cross validation, the performance of SSGlu was measured with an AUC of 0.8105 and 0.8041 for Homo sapiens and Mus musculus, respectively. Additionally, SSGlu was compared with the existing methods, and the higher MCC and AUC of SSGlu demonstrated that SSGlu was very promising to predict S-glutathionylated sites. Furthermore, a site-specific analysis showed that S-glutathionylation intimately correlated with the features derived from its surrounding sites. The conclusions derived from this study might help to understand more of the S-glutathionylation mechanism and guide the related experimental validation. For public access, SSGlu is freely accessible at http://59.73.198.144:8080/SSGlu/. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou's PseKNC.

    PubMed

    Sabooh, M Fazli; Iqbal, Nadeem; Khan, Mukhtaj; Khan, Muslim; Maqbool, H F

    2018-05-01

    This study examines accurate and efficient computational method for identification of 5-methylcytosine sites in RNA modification. The occurrence of 5-methylcytosine (m 5 C) plays a vital role in a number of biological processes. For better comprehension of the biological functions and mechanism it is necessary to recognize m 5 C sites in RNA precisely. The laboratory techniques and procedures are available to identify m 5 C sites in RNA, but these procedures require a lot of time and resources. This study develops a new computational method for extracting the features of RNA sequence. In this method, first the RNA sequence is encoded via composite feature vector, then, for the selection of discriminate features, the minimum-redundancy-maximum-relevance algorithm was used. Secondly, the classification method used has been based on a support vector machine by using jackknife cross validation test. The suggested method efficiently identifies m 5 C sites from non- m 5 C sites and the outcome of the suggested algorithm is 93.33% with sensitivity of 90.0 and specificity of 96.66 on bench mark datasets. The result exhibits that proposed algorithm shown significant identification performance compared to the existing computational techniques. This study extends the knowledge about the occurrence sites of RNA modification which paves the way for better comprehension of the biological uses and mechanism. Copyright © 2018 Elsevier Ltd. All rights reserved.

  4. Habitat selection in a rocky landscape: experimentally decoupling the influence of retreat site attributes from that of landscape features.

    PubMed

    Croak, Benjamin M; Pike, David A; Webb, Jonathan K; Shine, Richard

    2012-01-01

    Organisms selecting retreat sites may evaluate not only the quality of the specific shelter, but also the proximity of that site to resources in the surrounding area. Distinguishing between habitat selection at these two spatial scales is complicated by co-variation among microhabitat factors (i.e., the attributes of individual retreat sites often correlate with their proximity to landscape features). Disentangling this co-variation may facilitate the restoration or conservation of threatened systems. To experimentally examine the role of landscape attributes in determining retreat-site quality for saxicolous ectotherms, we deployed 198 identical artificial rocks in open (sun-exposed) sites on sandstone outcrops in southeastern Australia, and recorded faunal usage of those retreat sites over the next 29 months. Several landscape-scale attributes were associated with occupancy of experimental rocks, but different features were important for different species. For example, endangered broad-headed snakes (Hoplocephalus bungaroides) preferred retreat sites close to cliff edges, flat rock spiders (Hemicloea major) preferred small outcrops, and velvet geckos (Oedura lesueurii) preferred rocks close to the cliff edge with higher-than-average sun exposure. Standardized retreat sites can provide robust experimental data on the effects of landscape-scale attributes on retreat site selection, revealing interspecific divergences among sympatric taxa that use similar habitats.

  5. Accuracy of gap analysis habitat models in predicting physical features for wildlife-habitat associations in the southwest U.S.

    USGS Publications Warehouse

    Boykin, K.G.; Thompson, B.C.; Propeck-Gray, S.

    2010-01-01

    Despite widespread and long-standing efforts to model wildlife-habitat associations using remotely sensed and other spatially explicit data, there are relatively few evaluations of the performance of variables included in predictive models relative to actual features on the landscape. As part of the National Gap Analysis Program, we specifically examined physical site features at randomly selected sample locations in the Southwestern U.S. to assess degree of concordance with predicted features used in modeling vertebrate habitat distribution. Our analysis considered hypotheses about relative accuracy with respect to 30 vertebrate species selected to represent the spectrum of habitat generalist to specialist and categorization of site by relative degree of conservation emphasis accorded to the site. Overall comparison of 19 variables observed at 382 sample sites indicated ???60% concordance for 12 variables. Directly measured or observed variables (slope, soil composition, rock outcrop) generally displayed high concordance, while variables that required judgments regarding descriptive categories (aspect, ecological system, landform) were less concordant. There were no differences detected in concordance among taxa groups, degree of specialization or generalization of selected taxa, or land conservation categorization of sample sites with respect to all sites. We found no support for the hypothesis that accuracy of habitat models is inversely related to degree of taxa specialization when model features for a habitat specialist could be more difficult to represent spatially. Likewise, we did not find support for the hypothesis that physical features will be predicted with higher accuracy on lands with greater dedication to biodiversity conservation than on other lands because of relative differences regarding available information. Accuracy generally was similar (>60%) to that observed for land cover mapping at the ecological system level. These patterns demonstrate resilience of gap analysis deductive model processes to the type of remotely sensed or interpreted data used in habitat feature predictions. ?? 2010 Elsevier B.V.

  6. SU-F-R-33: Can CT and CBCT Be Used Simultaneously for Radiomics Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, R; Wang, J; Zhong, H

    2016-06-15

    Purpose: To investigate whether CBCT and CT can be used in radiomics analysis simultaneously. To establish a batch correction method for radiomics in two similar image modalities. Methods: Four sites including rectum, bladder, femoral head and lung were considered as region of interest (ROI) in this study. For each site, 10 treatment planning CT images were collected. And 10 CBCT images which came from same site of same patient were acquired at first radiotherapy fraction. 253 radiomics features, which were selected by our test-retest study at rectum cancer CT (ICC>0.8), were calculated for both CBCT and CT images in MATLAB.more » Simple scaling (z-score) and nonlinear correction methods were applied to the CBCT radiomics features. The Pearson Correlation Coefficient was calculated to analyze the correlation between radiomics features of CT and CBCT images before and after correction. Cluster analysis of mixed data (for each site, 5 CT and 5 CBCT data are randomly selected) was implemented to validate the feasibility to merge radiomics data from CBCT and CT. The consistency of clustering result and site grouping was verified by a chi-square test for different datasets respectively. Results: For simple scaling, 234 of the 253 features have correlation coefficient ρ>0.8 among which 154 features haveρ>0.9 . For radiomics data after nonlinear correction, 240 of the 253 features have ρ>0.8 among which 220 features have ρ>0.9. Cluster analysis of mixed data shows that data of four sites was almost precisely separated for simple scaling(p=1.29 * 10{sup −7}, χ{sup 2} test) and nonlinear correction (p=5.98 * 10{sup −7}, χ{sup 2} test), which is similar to the cluster result of CT data (p=4.52 * 10{sup −8}, χ{sup 2} test). Conclusion: Radiomics data from CBCT can be merged with those from CT by simple scaling or nonlinear correction for radiomics analysis.« less

  7. Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

    PubMed Central

    Doolittle-Hall, Janet M.; Cunningham Glasspoole, Danielle L.; Seaman, William T.; Webster-Cyriaque, Jennifer

    2015-01-01

    Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV), hepatitis B virus (HBV) or Merkel cell polyomavirus (MCPyV). These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats) and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures. PMID:26569308

  8. Nationwide forestry applications program: Ten-Ecosystem Study (TES) site 5 report, Kershaw County, South Carolina, report 4

    NASA Technical Reports Server (NTRS)

    Dillman, R. D. (Principal Investigator)

    1978-01-01

    The author has identified the following significant results. The Kershaw County site, South Carolina, was selected to be representative of both the oak-pine ecosystem and the southeastern pine ecosystem. The following processing results have concluded that: (1) early spring LANDSAT data provide the best contrast between forest features; (2) level 2 forest features (softwood, hardwood, grassland, and water) can be classified with an accuracy of 70% + or - 5.7% at the 90% confidence level; (3) level 3 species classification was inconclusive; (4) temporal data did not provide a significant increase in classification accuracy of level 2 features, over single date classification to warrant the additional processing; and (5) training fields from only 10% of the site can be used to classify the entire site.

  9. A Semi-Supervised Learning Approach to Enhance Health Care Community–Based Question Answering: A Case Study in Alcoholism

    PubMed Central

    Klabjan, Diego; Jonnalagadda, Siddhartha Reddy

    2016-01-01

    Background Community-based question answering (CQA) sites play an important role in addressing health information needs. However, a significant number of posted questions remain unanswered. Automatically answering the posted questions can provide a useful source of information for Web-based health communities. Objective In this study, we developed an algorithm to automatically answer health-related questions based on past questions and answers (QA). We also aimed to understand information embedded within Web-based health content that are good features in identifying valid answers. Methods Our proposed algorithm uses information retrieval techniques to identify candidate answers from resolved QA. To rank these candidates, we implemented a semi-supervised leaning algorithm that extracts the best answer to a question. We assessed this approach on a curated corpus from Yahoo! Answers and compared against a rule-based string similarity baseline. Results On our dataset, the semi-supervised learning algorithm has an accuracy of 86.2%. Unified medical language system–based (health related) features used in the model enhance the algorithm’s performance by proximately 8%. A reasonably high rate of accuracy is obtained given that the data are considerably noisy. Important features distinguishing a valid answer from an invalid answer include text length, number of stop words contained in a test question, a distance between the test question and other questions in the corpus, and a number of overlapping health-related terms between questions. Conclusions Overall, our automated QA system based on historical QA pairs is shown to be effective according to the dataset in this case study. It is developed for general use in the health care domain, which can also be applied to other CQA sites. PMID:27485666

  10. Qualitative analysis of programmatic initiatives to text patients with mobile devices in resource-limited health systems.

    PubMed

    Garg, Sachin K; Lyles, Courtney R; Ackerman, Sara; Handley, Margaret A; Schillinger, Dean; Gourley, Gato; Aulakh, Veenu; Sarkar, Urmimala

    2016-02-06

    Text messaging is an affordable, ubiquitous, and expanding mobile communication technology. However, safety net health systems in the United States that provide more care to uninsured and low-income patients may face additional financial and infrastructural challenges in utilizing this technology. Formative evaluations of texting implementation experiences are limited. We interviewed safety net health systems piloting texting initiatives to study facilitators and barriers to real-world implementation. We conducted telephone interviews with various stakeholders who volunteered from each of the eight California-based safety net systems that received external funding to pilot a texting-based program of their choosing to serve a primary care need. We developed a semi-structured interview guide based partly on the Consolidated Framework for Implementation Research (CFIR), which encompasses several domains: the intervention, individuals involved, contextual factors, and implementation process. We inductively and deductively (using CFIR) coded transcripts, and categorized themes into facilitators and barriers. We performed eight interviews (one interview per pilot site). Five sites had no prior texting experience. Sites applied texting for programs related to medication adherence and monitoring, appointment reminders, care coordination, and health education and promotion. No site texted patient-identifying health information, and most sites manually obtained informed consent from each participating patient. Facilitators of implementation included perceived enthusiasm from patients, staff and management belief that texting is patient-centered, and the early identification of potential barriers through peer collaboration among grantees. Navigating government regulations that protect patient privacy and guide the handling of protected health information emerged as a crucial barrier. A related technical challenge in five sites was the labor-intensive tracking and documenting of texting communications due to an inability to integrate texting platforms with electronic health records. Despite enthusiasm for the texting programs from the involved individuals and organizations, inadequate data management capabilities and unclear privacy and security regulations for mobile health technology slowed the initial implementation and limited the clinical use of texting in the safety net and scope of pilots. Future implementation work and research should investigate how different texting platform and intervention designs affect efficacy, as well as explore issues that may affect sustainability and the scalability.

  11. Set of Frequent Word Item sets as Feature Representation for Text with Indonesian Slang

    NASA Astrophysics Data System (ADS)

    Sa'adillah Maylawati, Dian; Putri Saptawati, G. A.

    2017-01-01

    Indonesian slang are commonly used in social media. Due to their unstructured syntax, it is difficult to extract their features based on Indonesian grammar for text mining. To do so, we propose Set of Frequent Word Item sets (SFWI) as text representation which is considered match for Indonesian slang. Besides, SFWI is able to keep the meaning of Indonesian slang with regard to the order of appearance sentence. We use FP-Growth algorithm with adding separation sentence function into the algorithm to extract the feature of SFWI. The experiments is done with text data from social media such as Facebook, Twitter, and personal website. The result of experiments shows that Indonesian slang were more correctly interpreted based on SFWI.

  12. Text2Quit: results from a pilot test of a personalized, interactive mobile health smoking cessation program.

    PubMed

    Abroms, Lorien C; Ahuja, Meenakshi; Kodl, Yvonne; Thaweethai, Lalida; Sims, Justin; Winickoff, Jonathan P; Windsor, Richard A

    2012-01-01

    Text messaging programs on mobile phones have shown some promise in helping people quit smoking. Text2Quit is an automated, personalized, and interactive mobile health program that sends text messages and e-mails timed around a participant's quit date over the course of 3 months. The text messages include pre- and post-quit educational messages, peer ex-smoker messages, medication reminders and relapse messages, and multiple opportunities for interaction. Study participants were university students (N = 23) enrolled in the Text2Quit program. Participants were surveyed at baseline and at 2 and 4 weeks after enrollment. The majority of participants agreed that they liked the program at 2 and 4 weeks after enrollment (90.5% and 82.3%, respectively). Support for text messages was found to be moderate and higher than that of the e-mail and web components. Of participants, 75% reported reading most or all of the texts. On average, users made 11.8 responses to the texts over a 4-week period, although responses declined after the quit date. The interactive feature for tracking cigarettes was the most used interactive feature, followed by the craving trivia game. This pilot test provides some support for the Text2Quit program. A future iteration of the program will include additional tracking features in both the pre-quit and post-quit protocols and an easier entry into the not-quit protocol. Future studies are recommended that identify the value of the interactive and personalized features that characterize this program.

  13. Text2Quit: Results from a Pilot Test of a Personalized, Interactive Mobile Health Smoking Cessation Program

    PubMed Central

    ABROMS, LORIEN C.; AHUJA, MEENAKSHI; KODL, YVONNE; THAWEETHAI, LALIDA; SIMS, JUSTIN; WINICKOFF, JONATHAN; WINDSOR, RICHARD A.

    2012-01-01

    Text messaging programs on mobile phones have shown some promise in helping people quit smoking. Text2Quit is an automated, personalized and interactive mobile health program that sends text messages and emails timed around a participant’s quit date over the course of 3 months. The text messages include pre- and post-quit educational messages, peer ex-smoker messages, medication reminders and relapse messages, as well as multiple opportunities for interaction. Study participants were university students (n=23) enrolled in the Text2Quit program. Participants were surveyed at baseline and at 2 and 4 weeks post-enrollment. The vast majority of participants agreed that they liked the program at 2 and 4 weeks post-enrollment (90.5% and 82.3%, respectively). Support for text messages was found to be moderate, and higher than that of the email and web components. Seventy-five percent of participants reported reading most or all of the texts. On average, users made 11.8 responses to the texts over a 4 week period, although responses declined following the quit date. The interactive feature for tracking cigarettes was the most used interactive feature, followed by the craving trivia game. This pilot test provides some support for the Text2Quit program. A future iteration of the program will include additional tracking features in both the pre-quit and post-quit protocol and an easier entry into the not-quit protocol. Future studies are recommended that identify the value of the interactive and personalized features that characterize this program. PMID:22548598

  14. Concentration dependence of Li+/Na+ diffusion in manganese hexacyanoferrates

    NASA Astrophysics Data System (ADS)

    Takachi, Masamitsu; Fukuzumi, Yuya; Moritomo, Yutaka

    2016-06-01

    Manganese hexacyanoferrates (Mn-HCFs) with a jungle-gym-type structure are promising cathode materials for Li+/Na+ secondary batteries (LIBs/SIBs). Here, we investigated the diffusion constants D Li/D Na of Li+/Na+ against the Li+/Na+ concentration x Na/x Li and temperature (T) of A 1.32Mn[Fe(CN)6]0.833.6H2O (A = Li and Na). We evaluated the activation energy E\\text{a}\\text{Li}/E\\text{a}\\text{Na} of D Li/D Na against x Na/x Li. We found that E\\text{a}\\text{Na} steeply increases with x Na from 0.41 eV at x Na = 0.69 to 0.7 eV at 1.1. The increase in E\\text{a}\\text{Na} is ascribed to the occupancy effect of the Na+ site. The increase in E\\text{a}\\text{Li} is suppressed, probably because the number of Li+ sites is three times that of Na+ sites.

  15. Lung nodule malignancy classification using only radiologist-quantified image features as inputs to statistical learning algorithms: probing the Lung Image Database Consortium dataset with two statistical learning methods.

    PubMed

    Hancock, Matthew C; Magnan, Jerry F

    2016-10-01

    In the assessment of nodules in CT scans of the lungs, a number of image-derived features are diagnostically relevant. Currently, many of these features are defined only qualitatively, so they are difficult to quantify from first principles. Nevertheless, these features (through their qualitative definitions and interpretations thereof) are often quantified via a variety of mathematical methods for the purpose of computer-aided diagnosis (CAD). To determine the potential usefulness of quantified diagnostic image features as inputs to a CAD system, we investigate the predictive capability of statistical learning methods for classifying nodule malignancy. We utilize the Lung Image Database Consortium dataset and only employ the radiologist-assigned diagnostic feature values for the lung nodules therein, as well as our derived estimates of the diameter and volume of the nodules from the radiologists' annotations. We calculate theoretical upper bounds on the classification accuracy that are achievable by an ideal classifier that only uses the radiologist-assigned feature values, and we obtain an accuracy of 85.74 [Formula: see text], which is, on average, 4.43% below the theoretical maximum of 90.17%. The corresponding area-under-the-curve (AUC) score is 0.932 ([Formula: see text]), which increases to 0.949 ([Formula: see text]) when diameter and volume features are included and has an accuracy of 88.08 [Formula: see text]. Our results are comparable to those in the literature that use algorithmically derived image-based features, which supports our hypothesis that lung nodules can be classified as malignant or benign using only quantified, diagnostic image features, and indicates the competitiveness of this approach. We also analyze how the classification accuracy depends on specific features and feature subsets, and we rank the features according to their predictive power, statistically demonstrating the top four to be spiculation, lobulation, subtlety, and calcification.

  16. Automatic topic identification of health-related messages in online health community using text classification.

    PubMed

    Lu, Yingjie

    2013-01-01

    To facilitate patient involvement in online health community and obtain informative support and emotional support they need, a topic identification approach was proposed in this paper for identifying automatically topics of the health-related messages in online health community, thus assisting patients in reaching the most relevant messages for their queries efficiently. Feature-based classification framework was presented for automatic topic identification in our study. We first collected the messages related to some predefined topics in a online health community. Then we combined three different types of features, n-gram-based features, domain-specific features and sentiment features to build four feature sets for health-related text representation. Finally, three different text classification techniques, C4.5, Naïve Bayes and SVM were adopted to evaluate our topic classification model. By comparing different feature sets and different classification techniques, we found that n-gram-based features, domain-specific features and sentiment features were all considered to be effective in distinguishing different types of health-related topics. In addition, feature reduction technique based on information gain was also effective to improve the topic classification performance. In terms of classification techniques, SVM outperformed C4.5 and Naïve Bayes significantly. The experimental results demonstrated that the proposed approach could identify the topics of online health-related messages efficiently.

  17. Perimeter intrusion detection and assessment system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eaton, M.J.; Jacobs, J.; McGovern, D.E.

    1977-11-01

    To obtain an effective perimeter intrusion detection system requires careful sensor selection, procurement, and installation. The selection process involves a thorough understanding of the unique site features and how these features affect the performance of each type of sensor. It is necessary to develop procurement specifications to establish acceptable sensor performance limits. Careful explanation and inspection of critical installation dimensions is required during on-site construction. The implementation of these activities at a particular site is discussed.

  18. JOVIAL/Ada Microprocessor Study.

    DTIC Science & Technology

    1982-04-01

    Study Final Technical Report interesting feature of the nodes is that they provide multiple virtual terminals, so it is possible to monitor several...Terminal Interface Tasking Except ion Handling A more elaborate system could allow such features as spooling, background jobs or multiple users. To a large...Another editor feature is the buffer. Buffers may hold small amounts of text or entire text objects. They allow multiple files to be edited simultaneously

  19. Geologic features of dam sites in the Nehalem, Rogue, and Willamette River basins, Oregon, 1935-37

    USGS Publications Warehouse

    Piper, A.M.

    1947-01-01

    The present report comprises brief descriptions of geologic features at 19 potential dam sites in the Nehalem, Rogue, and Willamette River basins in western Oregon. The topography of these site and of the corresponding reservoir site was mapped in 1934-36 under an allocation of funds, by the Public Works Administration for river-utilization surveys by the Conservation Branch of the United States Geological Survey. The field program in Oregon has been under the immediate charge of R. O. Helland. The 19 dam sites are distributed as follows: three on the Nehalem River, on the west or Pacific slope of the Oregon Coast range; four on Little Butte Creek and two on Evans Creek, tributaries of the Rogue River in the eastern part of the Klamath Mountains; four on the South and Middle Santiam Rivers, tributaries of the Willamette River from the west slope of the Cascade mountains; and six on tributaries of the Willamette River from the east slope of the Coast Range. Except in the Evans Creek basin, all the rocks in the districts that were studied are of comparatively late geological age. They include volcanic rocks, crystalline rocks of several types, marine and nonmarine sedimentary rocks, and recent stream deposits. The study of geologic features has sought to estimate the bearing power and water-tightness of the rocks at each dam site, also to place rather broad limits on the type of dam for which the respective sites seem best suited. It was not considered necessary to study the corresponding reservoir sites in detail for excessive leakage appears to be unlikely. Except at three of the four site in the Santiam River basin, no test pits have been dug nor exploratory holes drilled, so that geologic features have been interpreted wholly from natural outcrops and from highway and railroad cuts. Because these outcrops and cuts are few, many problems related to the construction and maintenance of dams can not be answered at the this time and all critical features of the sites should be thoroughly explored by test pits and drilled holes before any dam is designed. This applied especially to sites in the Nehalem and Willamette River basins where commonly the cover of timber and brush is dense and the rocks are rather deeply weathered. On the Middle Santiam and South Santiam Rivers, the Cascadia, Greenpeter, and Sweet Home sits have been studies intensively by the United States Engineer Department, whose work included exploration by diamond-drill holes and test pits. Their conclusions as to geologic features are given in a report by McKitrick and have been reviewed by the writer. Data from this source have been used freely in the discussion of the respective sites in this report. The probability of destructive earthquakes in the region appears to be small but is not negligible. Prudence suggests that any high dam should embody features to assure stability against moderately strong earth motions.

  20. Constructing and validating readability models: the method of integrating multilevel linguistic features with machine learning.

    PubMed

    Sung, Yao-Ting; Chen, Ju-Ling; Cha, Ji-Her; Tseng, Hou-Chiang; Chang, Tao-Hsing; Chang, Kuo-En

    2015-06-01

    Multilevel linguistic features have been proposed for discourse analysis, but there have been few applications of multilevel linguistic features to readability models and also few validations of such models. Most traditional readability formulae are based on generalized linear models (GLMs; e.g., discriminant analysis and multiple regression), but these models have to comply with certain statistical assumptions about data properties and include all of the data in formulae construction without pruning the outliers in advance. The use of such readability formulae tends to produce a low text classification accuracy, while using a support vector machine (SVM) in machine learning can enhance the classification outcome. The present study constructed readability models by integrating multilevel linguistic features with SVM, which is more appropriate for text classification. Taking the Chinese language as an example, this study developed 31 linguistic features as the predicting variables at the word, semantic, syntax, and cohesion levels, with grade levels of texts as the criterion variable. The study compared four types of readability models by integrating unilevel and multilevel linguistic features with GLMs and an SVM. The results indicate that adopting a multilevel approach in readability analysis provides a better representation of the complexities of both texts and the reading comprehension process.

  1. A Novel Feature Selection Technique for Text Classification Using Naïve Bayes.

    PubMed

    Dey Sarkar, Subhajit; Goswami, Saptarsi; Agarwal, Aman; Aktar, Javed

    2014-01-01

    With the proliferation of unstructured data, text classification or text categorization has found many applications in topic classification, sentiment analysis, authorship identification, spam detection, and so on. There are many classification algorithms available. Naïve Bayes remains one of the oldest and most popular classifiers. On one hand, implementation of naïve Bayes is simple and, on the other hand, this also requires fewer amounts of training data. From the literature review, it is found that naïve Bayes performs poorly compared to other classifiers in text classification. As a result, this makes the naïve Bayes classifier unusable in spite of the simplicity and intuitiveness of the model. In this paper, we propose a two-step feature selection method based on firstly a univariate feature selection and then feature clustering, where we use the univariate feature selection method to reduce the search space and then apply clustering to select relatively independent feature sets. We demonstrate the effectiveness of our method by a thorough evaluation and comparison over 13 datasets. The performance improvement thus achieved makes naïve Bayes comparable or superior to other classifiers. The proposed algorithm is shown to outperform other traditional methods like greedy search based wrapper or CFS.

  2. Activation amplitude and temporal synchrony among back extensor and abdominal muscles during a controlled transfer task: comparison of men and women.

    PubMed

    Hubley-Kozey, Cheryl L; Butler, Heather L; Kozey, John W

    2012-08-01

    Muscle synergies are important for spinal stability, but few studies examine temporal responses of spinal muscles to dynamic perturbations. This study examined activation amplitudes and temporal synergies among compartments of the back extensor and among abdominal wall muscles in response to dynamic bidirectional moments of force. We further examined whether responses were different between men and women. 19 women and 18 men performed a controlled transfer task. Surface electromyograms from bilateral sites over 6 back extensor compartments and 6 abdominal wall muscle sites were analyzed using principal component analysis. Key features were extracted from the measured electromyographic waveforms capturing amplitude and temporal variations among muscle sites. Three features explained 97% of the variance. Scores for each feature were computed for each measured waveform and analysis of variance found significant (p<.05) muscle main effects and a sex by muscle interaction. For the back extensors, post hoc analysis revealed that upper and more medial sites were recruited to higher amplitudes, medial sites responded to flexion moments, and the more lateral sites responded to lateral flexion moments. Women had more differences among muscle sites than men for the lateral flexion moment feature. For the abdominal wall muscles the oblique muscles responded with synergies related to fiber orientation, with women having higher amplitudes and more responsiveness to the lateral flexion moment than men. Synergies between the abdominal and back extensor sites as the moment demands change are discussed. These findings illustrate differential activation among erector spinae compartments and abdominal wall muscle sites supporting a highly organized pattern of response to bidirectional external moments with asynchronies more apparent in women. Copyright © 2012 Elsevier B.V. All rights reserved.

  3. Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification.

    PubMed

    Yi, Chucai; Tian, Yingli

    2012-09-01

    In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.

  4. Presence of pro-tobacco messages on the Web.

    PubMed

    Hong, Traci; Cody, Michael J

    2002-01-01

    Ignored in the finalized Master Settlement Agreement (National Association of Attorneys General, 1998), the unmonitored, unregulated World Wide Web (Web) can operate as a major vehicle for delivering pro-tobacco messages, images, and products to millions of young consumers. A content analysis of 318 randomly sampled pro-tobacco Web sites revealed that tobacco has a pervasive presence on the Web, especially on e-commerce sites and sites featuring hobbies, recreation, and "fetishes." Products can be ordered online on nearly 50% of the sites, but only 23% of the sites included underage verification. Further, only 11% of these sites contain health warnings. Instead, pro-tobacco sites frequently associate smoking with "glamorous" and "alternative" lifestyles, and with images of young males and young (thin, attractive) females. Finally, many of the Web sites offered interactive site features that are potentially appealing to young Web users. Recommendations for future research and counterstrategies are discussed.

  5. Invasion of American bullfrogs along the Yellowstone River

    USGS Publications Warehouse

    Sepulveda, Adam; Layhee, Megan J.; Stagliano, Dave; Chaffin, Jake; Begley, Allison; Maxell, Bryce A.

    2015-01-01

    The American bullfrog (Lithobates catesbeianus) is a globally distributed invasive species that was introduced to the Yellowstone River floodplain of Montana. Knowledge about floodplain habitat features that allow for bullfrog persistence and spread will help identify effective control strategies. We used field surveys in 2010, 2012 and 2013 to describe bullfrog spread in the Yellowstone River floodplain and the habitat features that are associated with bullfrog occupancy and colonization. Bullfrogs in our study area expanded from ~ 60 km in 2010 to 106 km in 2013, and are spreading to up- and downstream habitats. The number of breeding sites (i.e., presence of bullfrog eggs or larvae) increased from 12 sites in 2010 to 45 sites in 2013. We found that bullfrogs were associated with deeper waters, emergent vegetation and public-access sites, which are habitat features that characterize permanent waters and describe human-mediated introductions. Control strategies that reduce the hydroperiod of breeding sites may help to limit bullfrog persistence and spread, while an increase in public outreach and education may help prevent further bullfrog introductions at public-access sites.

  6. Struggling readers learning with graphic-rich digital science text: Effects of a Highlight & Animate Feature and Manipulable Graphics

    NASA Astrophysics Data System (ADS)

    Defrance, Nancy L.

    Technology offers promise of 'leveling the playing field' for struggling readers. That is, instructional support features within digital texts may enable all readers to learn. This quasi-experimental study examined the effects on learning of two support features, which offered unique opportunities to interact with text. The Highlight & Animate Feature highlighted an important idea in prose, while simultaneously animating its representation in an adjacent graphic. It invited readers to integrate ideas depicted in graphics and prose, using each one to interpret the other. The Manipulable Graphics had parts that the reader could operate to discover relationships among phenomena. It invited readers to test or refine the ideas that they brought to, or gleaned from, the text. Use of these support features was compulsory. Twenty fifth grade struggling readers read a graphic-rich digital science text in a clinical interview setting, under one of two conditions: using either the Highlight & Animate Feature or the Manipulable Graphics. Participants in both conditions made statistically significant gains on a multiple choice measure of knowledge of the topic of the text. While there were no significant differences by condition in the amount of knowledge gained; there were significant differences in the quality of knowledge expressed. Transcripts revealed that understandings about light and vision, expressed by those who used the Highlight & Animate Feature, were more often conceptually and linguistically 'complete.' That is, their understandings included both a description of phenomena as well as an explanation of underlying scientific principles, which participants articulated using the vocabulary of the text. This finding may be attributed to the multiple opportunities to integrate graphics (depicting the behavior of phenomena) and prose (providing the scientific explanation of that phenomena), which characterized the Highlight & Animate Condition. Those who used the Manipulable Graphics were more likely to express complete understandings when they were able to structure a systematic investigation of the graphic and when the graphic was designed to confront their own naive conceptions about light and vision. The Manipulable Graphics also provided a foothold for those who entered the study with very little prior knowledge of the topic.

  7. Robust Segmentation of Planar and Linear Features of Terrestrial Laser Scanner Point Clouds Acquired from Construction Sites.

    PubMed

    Maalek, Reza; Lichti, Derek D; Ruwanpura, Janaka Y

    2018-03-08

    Automated segmentation of planar and linear features of point clouds acquired from construction sites is essential for the automatic extraction of building construction elements such as columns, beams and slabs. However, many planar and linear segmentation methods use scene-dependent similarity thresholds that may not provide generalizable solutions for all environments. In addition, outliers exist in construction site point clouds due to data artefacts caused by moving objects, occlusions and dust. To address these concerns, a novel method for robust classification and segmentation of planar and linear features is proposed. First, coplanar and collinear points are classified through a robust principal components analysis procedure. The classified points are then grouped using a new robust clustering method, the robust complete linkage method. A robust method is also proposed to extract the points of flat-slab floors and/or ceilings independent of the aforementioned stages to improve computational efficiency. The applicability of the proposed method is evaluated in eight datasets acquired from a complex laboratory environment and two construction sites at the University of Calgary. The precision, recall, and accuracy of the segmentation at both construction sites were 96.8%, 97.7% and 95%, respectively. These results demonstrate the suitability of the proposed method for robust segmentation of planar and linear features of contaminated datasets, such as those collected from construction sites.

  8. Robust Segmentation of Planar and Linear Features of Terrestrial Laser Scanner Point Clouds Acquired from Construction Sites

    PubMed Central

    Maalek, Reza; Lichti, Derek D; Ruwanpura, Janaka Y

    2018-01-01

    Automated segmentation of planar and linear features of point clouds acquired from construction sites is essential for the automatic extraction of building construction elements such as columns, beams and slabs. However, many planar and linear segmentation methods use scene-dependent similarity thresholds that may not provide generalizable solutions for all environments. In addition, outliers exist in construction site point clouds due to data artefacts caused by moving objects, occlusions and dust. To address these concerns, a novel method for robust classification and segmentation of planar and linear features is proposed. First, coplanar and collinear points are classified through a robust principal components analysis procedure. The classified points are then grouped using a new robust clustering method, the robust complete linkage method. A robust method is also proposed to extract the points of flat-slab floors and/or ceilings independent of the aforementioned stages to improve computational efficiency. The applicability of the proposed method is evaluated in eight datasets acquired from a complex laboratory environment and two construction sites at the University of Calgary. The precision, recall, and accuracy of the segmentation at both construction sites were 96.8%, 97.7% and 95%, respectively. These results demonstrate the suitability of the proposed method for robust segmentation of planar and linear features of contaminated datasets, such as those collected from construction sites. PMID:29518062

  9. High Precision Prediction of Functional Sites in Protein Structures

    PubMed Central

    Buturovic, Ljubomir; Wong, Mike; Tang, Grace W.; Altman, Russ B.; Petkovic, Dragutin

    2014-01-01

    We address the problem of assigning biological function to solved protein structures. Computational tools play a critical role in identifying potential active sites and informing screening decisions for further lab analysis. A critical parameter in the practical application of computational methods is the precision, or positive predictive value. Precision measures the level of confidence the user should have in a particular computed functional assignment. Low precision annotations lead to futile laboratory investigations and waste scarce research resources. In this paper we describe an advanced version of the protein function annotation system FEATURE, which achieved 99% precision and average recall of 95% across 20 representative functional sites. The system uses a Support Vector Machine classifier operating on the microenvironment of physicochemical features around an amino acid. We also compared performance of our method with state-of-the-art sequence-level annotator Pfam in terms of precision, recall and localization. To our knowledge, no other functional site annotator has been rigorously evaluated against these key criteria. The software and predictive models are incorporated into the WebFEATURE service at http://feature.stanford.edu/wf4.0-beta. PMID:24632601

  10. Web sites selling cigarettes: how many are there in the USA and what are their sales practices?

    PubMed

    Ribisl, K M; Kim, A E; Williams, R S

    2001-12-01

    To estimate the number and geographic location of web sites selling cigarettes in the USA, and to examine their sales and marketing practices. Comprehensive searches were conducted using four keyword terms and five popular internet search engines, supplemented by sites identified in a news article. Over 1800 sites were examined to identify 88 internet cigarette vendors. Trained raters examined the content of each site using a standardised coding instrument to assess geographic location, presence of warnings, products sold, and promotional strategies. USA. Internet cigarette vendors were located in 23 states. Nearly half (n = 43) were located in New York state, and many were in tobacco producing states with low cigarette excise taxes. Indian reservations housed 49 of the 88 sites. Only 28.4% of sites featured the US Surgeon General's health warnings and 81.8% featured minimum age of sale warnings. Nearly all sites (96.6%) sold premium or value brand cigarettes, 21.6% sold duty-free Marlboros, and 8.0% sold bidis. Approximately one third featured special promotional programmes. Internet cigarette vendors present new regulatory and enforcement challenges for tobacco control advocates because of the difficulty in regulating internet content and because many vendors are on Indian reservations.

  11. Predict and Analyze Protein Glycation Sites with the mRMR and IFS Methods

    PubMed Central

    Gu, Wenxiang; Zhang, Wenyi; Wang, Jianan

    2015-01-01

    Glycation is a nonenzymatic process in which proteins react with reducing sugar molecules. The identification of glycation sites in protein may provide guidelines to understand the biological function of protein glycation. In this study, we developed a computational method to predict protein glycation sites by using the support vector machine classifier. The experimental results showed that the prediction accuracy was 85.51% and an overall MCC was 0.70. Feature analysis indicated that the composition of k-spaced amino acid pairs feature contributed the most for glycation sites prediction. PMID:25961025

  12. Education in health research methodology: use of a wiki for knowledge translation.

    PubMed

    Hamm, Michele P; Klassen, Terry P; Scott, Shannon D; Moher, David; Hartling, Lisa

    2013-01-01

    A research-practice gap exists between what is known about conducting methodologically rigorous randomized controlled trials (RCTs) and what is done. Evidence consistently shows that pediatric RCTs are susceptible to high risk of bias; therefore novel methods of influencing the design and conduct of trials are required. The objective of this study was to develop and pilot test a wiki designed to educate pediatric trialists and trainees in the principles involved in minimizing risk of bias in RCTs. The focus was on preliminary usability testing of the wiki. The wiki was developed through adaptation of existing knowledge translation strategies and through tailoring the site to the identified needs of the end-users. The wiki was evaluated for usability and user preferences regarding the content and formatting. Semi-structured interviews were conducted with 15 trialists and systematic reviewers, representing varying levels of experience with risk of bias or the conduct of trials. Data were analyzed using content analysis. Participants found the wiki to be well organized, easy to use, and straightforward to navigate. Suggestions for improvement tended to focus on clarification of the text or on esthetics, rather than on the content or format. Participants liked the additional features of the site that were supplementary to the text, such as the interactive examples, and the components that focused on practical applications, adding relevance to the theory presented. While the site could be used by both trialists and systematic reviewers, the lack of a clearly defined target audience caused some confusion among participants. Participants were supportive of using a wiki as a novel educational tool. The results of this pilot test will be used to refine the risk of bias wiki, which holds promise as a knowledge translation intervention for education in medical research methodology.

  13. Education in Health Research Methodology: Use of a Wiki for Knowledge Translation

    PubMed Central

    Hamm, Michele P.; Klassen, Terry P.; Scott, Shannon D.; Moher, David; Hartling, Lisa

    2013-01-01

    Introduction A research-practice gap exists between what is known about conducting methodologically rigorous randomized controlled trials (RCTs) and what is done. Evidence consistently shows that pediatric RCTs are susceptible to high risk of bias; therefore novel methods of influencing the design and conduct of trials are required. The objective of this study was to develop and pilot test a wiki designed to educate pediatric trialists and trainees in the principles involved in minimizing risk of bias in RCTs. The focus was on preliminary usability testing of the wiki. Methods The wiki was developed through adaptation of existing knowledge translation strategies and through tailoring the site to the identified needs of the end-users. The wiki was evaluated for usability and user preferences regarding the content and formatting. Semi-structured interviews were conducted with 15 trialists and systematic reviewers, representing varying levels of experience with risk of bias or the conduct of trials. Data were analyzed using content analysis. Results Participants found the wiki to be well organized, easy to use, and straightforward to navigate. Suggestions for improvement tended to focus on clarification of the text or on esthetics, rather than on the content or format. Participants liked the additional features of the site that were supplementary to the text, such as the interactive examples, and the components that focused on practical applications, adding relevance to the theory presented. While the site could be used by both trialists and systematic reviewers, the lack of a clearly defined target audience caused some confusion among participants. Conclusions Participants were supportive of using a wiki as a novel educational tool. The results of this pilot test will be used to refine the risk of bias wiki, which holds promise as a knowledge translation intervention for education in medical research methodology. PMID:23741424

  14. Short text sentiment classification based on feature extension and ensemble classifier

    NASA Astrophysics Data System (ADS)

    Liu, Yang; Zhu, Xie

    2018-05-01

    With the rapid development of Internet social media, excavating the emotional tendencies of the short text information from the Internet, the acquisition of useful information has attracted the attention of researchers. At present, the commonly used can be attributed to the rule-based classification and statistical machine learning classification methods. Although micro-blog sentiment analysis has made good progress, there still exist some shortcomings such as not highly accurate enough and strong dependence from sentiment classification effect. Aiming at the characteristics of Chinese short texts, such as less information, sparse features, and diverse expressions, this paper considers expanding the original text by mining related semantic information from the reviews, forwarding and other related information. First, this paper uses Word2vec to compute word similarity to extend the feature words. And then uses an ensemble classifier composed of SVM, KNN and HMM to analyze the emotion of the short text of micro-blog. The experimental results show that the proposed method can make good use of the comment forwarding information to extend the original features. Compared with the traditional method, the accuracy, recall and F1 value obtained by this method have been improved.

  15. Emotion Recognition of Weblog Sentences Based on an Ensemble Algorithm of Multi-label Classification and Word Emotions

    NASA Astrophysics Data System (ADS)

    Li, Ji; Ren, Fuji

    Weblogs have greatly changed the communication ways of mankind. Affective analysis of blog posts is found valuable for many applications such as text-to-speech synthesis or computer-assisted recommendation. Traditional emotion recognition in text based on single-label classification can not satisfy higher requirements of affective computing. In this paper, the automatic identification of sentence emotion in weblogs is modeled as a multi-label text categorization task. Experiments are carried out on 12273 blog sentences from the Chinese emotion corpus Ren_CECps with 8-dimension emotion annotation. An ensemble algorithm RAKEL is used to recognize dominant emotions from the writer's perspective. Our emotion feature using detailed intensity representation for word emotions outperforms the other main features such as the word frequency feature and the traditional lexicon-based feature. In order to deal with relatively complex sentences, we integrate grammatical characteristics of punctuations, disjunctive connectives, modification relations and negation into features. It achieves 13.51% and 12.49% increases for Micro-averaged F1 and Macro-averaged F1 respectively compared to the traditional lexicon-based feature. Result shows that multiple-dimension emotion representation with grammatical features can efficiently classify sentence emotion in a multi-label problem.

  16. An ensemble method for extracting adverse drug events from social media.

    PubMed

    Liu, Jing; Zhao, Songzheng; Zhang, Xiaodi

    2016-06-01

    Because adverse drug events (ADEs) are a serious health problem and a leading cause of death, it is of vital importance to identify them correctly and in a timely manner. With the development of Web 2.0, social media has become a large data source for information on ADEs. The objective of this study is to develop a relation extraction system that uses natural language processing techniques to effectively distinguish between ADEs and non-ADEs in informal text on social media. We develop a feature-based approach that utilizes various lexical, syntactic, and semantic features. Information-gain-based feature selection is performed to address high-dimensional features. Then, we evaluate the effectiveness of four well-known kernel-based approaches (i.e., subset tree kernel, tree kernel, shortest dependency path kernel, and all-paths graph kernel) and several ensembles that are generated by adopting different combination methods (i.e., majority voting, weighted averaging, and stacked generalization). All of the approaches are tested using three data sets: two health-related discussion forums and one general social networking site (i.e., Twitter). When investigating the contribution of each feature subset, the feature-based approach attains the best area under the receiver operating characteristics curve (AUC) values, which are 78.6%, 72.2%, and 79.2% on the three data sets. When individual methods are used, we attain the best AUC values of 82.1%, 73.2%, and 77.0% using the subset tree kernel, shortest dependency path kernel, and feature-based approach on the three data sets, respectively. When using classifier ensembles, we achieve the best AUC values of 84.5%, 77.3%, and 84.5% on the three data sets, outperforming the baselines. Our experimental results indicate that ADE extraction from social media can benefit from feature selection. With respect to the effectiveness of different feature subsets, lexical features and semantic features can enhance the ADE extraction capability. Kernel-based approaches, which can stay away from the feature sparsity issue, are qualified to address the ADE extraction problem. Combining different individual classifiers using suitable combination methods can further enhance the ADE extraction effectiveness. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Dynamic Digital Maps as Vehicles for Distributing Digital Geologic Maps and Embedded Analytical Data and Multimedia

    NASA Astrophysics Data System (ADS)

    Condit, C. D.; Mninch, M.

    2012-12-01

    The Dynamic Digital Map (DDM) is an ideal vehicle for the professional geologist to use to describe the geologic setting of key sites to the public in a format that integrates and presents maps and associated analytical data and multimedia without the need for an ArcGIS interface. Maps with field trip guide stops that include photographs, movies and figures and animations, showing, for example, how the features seen in the field formed, or how data might be best visualized in "time-frame" sequences are ideally included in DDMs. DDMs distribute geologic maps, images, movies, analytical data, and text such as field guides, in an integrated cross-platform, web enabled format that are intuitive to use, easily and quickly searchable, and require no additional proprietary software to operate. Maps, photos, movies and animations are stored outside the program, which acts as an organizational framework and index to present these data. Once created, the DDM can be downloaded from the web site hosting it in the flavor matching the user's operating system (e.g. Linux, Windows and Macintosh) as zip, dmg or tar files (and soon as iOS and Android tablet apps). When decompressed, the DDM can then access its associated data directly from that site with no browser needed. Alternatively, the entire package can be distributed and used from CD, DVD, or flash-memory storage. The intent of this presentation is to introduce the variety of geology that can be accessed from the over 25 DDMs created to date, concentrating on the DDM of the Springerville Volcanic Field. We will highlight selected features of some of them, introduce a simplified interface to the original DDM (that we renamed DDMC for Classic) and give a brief look at a the recently (2010-2011) completed geologic maps of the Springerville Volcanic field to see examples of each of the features discussed above, and a display of the integrated analytical data set. We will also highlight the differences between the classic or DDMCs and the new Dynamic Digital Map Extended (DDME) designed from the ground up to take advantage of the expanded connectedness this redesigned program will accommodate.

  18. Map Feature Content and Text Recall of Good and Poor Readers.

    ERIC Educational Resources Information Center

    Amlund, Jeanne T.; And Others

    1985-01-01

    Reports two experiments evaluating the effect of map feature content on text recall by subjects of varying reading skill levels. Finds that both experiments support the conjoint retention hypothesis, in which dual-coding of spatial and verbal information and their interaction in memory enhance recall. (MM)

  19. Web sites selling cigarettes: how many are there in the USA and what are their sales practices?

    PubMed Central

    Ribisl, K.; Kim, A.; Williams, R.

    2001-01-01

    OBJECTIVES—To estimate the number and geographic location of web sites selling cigarettes in the USA, and to examine their sales and marketing practices.
METHODS—Comprehensive searches were conducted using four keyword terms and five popular internet search engines, supplemented by sites identified in a news article. Over 1800 sites were examined to identify 88 internet cigarette vendors.
MEASURES—Trained raters examined the content of each site using a standardised coding instrument to assess geographic location, presence of warnings, products sold, and promotional strategies.
SETTING—USA.
RESULTS—Internet cigarette vendors were located in 23 states. Nearly half (n = 43) were located in New York state, and many were in tobacco producing states with low cigarette excise taxes. Indian reservations housed 49 of the 88 sites. Only 28.4% of sites featured the US Surgeon General's health warnings and 81.8% featured minimum age of sale warnings. Nearly all sites (96.6%) sold premium or value brand cigarettes, 21.6% sold duty-free Marlboros, and 8.0% sold bidis. Approximately one third featured special promotional programmes.
CONCLUSIONS—Internet cigarette vendors present new regulatory and enforcement challenges for tobacco control advocates because of the difficulty in regulating internet content and because many vendors are on Indian reservations.


Keywords: youth access; internet; web sites; policy PMID:11740027

  20. Region 9 NPL Sites (Superfund Sites 2013)

    EPA Pesticide Factsheets

    NPL site POINT locations for the US EPA Region 9. NPL (National Priorities List) sites are hazardous waste sites that are eligible for extensive long-term cleanup under the Superfund program. Eligibility is determined by a scoring method called Hazard Ranking System. Sites with high scores are listed on the NPL. The majority of the locations are derived from polygon centroids of digitized site boundaries. The remaining locations were generated from address geocoding and digitizing. Area covered by this data set include Arizona, California, Nevada, Hawaii, Guam, American Samoa, Northern Marianas and Trust Territories. Attributes include NPL status codes, NPL industry type codes and environmental indicators. Related table, NPL_Contaminants contains information about contaminated media types and chemicals. This is a one-to-many relate and can be related to the feature class using the relationship classes under the Feature Data Set ENVIRO_CONTAMINANT.

  1. Archaeological Investigations in the Gainesville Lake Area of the Tennessee-Tombigbee Waterway. Volume I. The Gainesville Lake Area Excavations.

    DTIC Science & Technology

    1981-01-01

    97 71. Site 1Pi61, Removing Trees ........ .................. . 97 72. Site lPi6l, Testing the Midden ....... ................ . 97 6...the use of plant and animal species changes through time. Volume IV also describes the human skeletal remains from all excavated sites and discusses the...Gainesville Lake area were cultural features. A few 5 features resulted from forces other than human behavior ( tree roots, ro- dent burrows, erosional gullies

  2. Identifying sports videos using replay, text, and camera motion features

    NASA Astrophysics Data System (ADS)

    Kobla, Vikrant; DeMenthon, Daniel; Doermann, David S.

    1999-12-01

    Automated classification of digital video is emerging as an important piece of the puzzle in the design of content management systems for digital libraries. The ability to classify videos into various classes such as sports, news, movies, or documentaries, increases the efficiency of indexing, browsing, and retrieval of video in large databases. In this paper, we discuss the extraction of features that enable identification of sports videos directly from the compressed domain of MPEG video. These features include detecting the presence of action replays, determining the amount of scene text in vide, and calculating various statistics on camera and/or object motion. The features are derived from the macroblock, motion,and bit-rate information that is readily accessible from MPEG video with very minimal decoding, leading to substantial gains in processing speeds. Full-decoding of selective frames is required only for text analysis. A decision tree classifier built using these features is able to identify sports clips with an accuracy of about 93 percent.

  3. Linguistic feature analysis for protein interaction extraction

    PubMed Central

    2009-01-01

    Background The rapid growth of the amount of publicly available reports on biomedical experimental results has recently caused a boost of text mining approaches for protein interaction extraction. Most approaches rely implicitly or explicitly on linguistic, i.e., lexical and syntactic, data extracted from text. However, only few attempts have been made to evaluate the contribution of the different feature types. In this work, we contribute to this evaluation by studying the relative importance of deep syntactic features, i.e., grammatical relations, shallow syntactic features (part-of-speech information) and lexical features. For this purpose, we use a recently proposed approach that uses support vector machines with structured kernels. Results Our results reveal that the contribution of the different feature types varies for the different data sets on which the experiments were conducted. The smaller the training corpus compared to the test data, the more important the role of grammatical relations becomes. Moreover, deep syntactic information based classifiers prove to be more robust on heterogeneous texts where no or only limited common vocabulary is shared. Conclusion Our findings suggest that grammatical relations play an important role in the interaction extraction task. Moreover, the net advantage of adding lexical and shallow syntactic features is small related to the number of added features. This implies that efficient classifiers can be built by using only a small fraction of the features that are typically being used in recent approaches. PMID:19909518

  4. Different approaches for identifying important concepts in probabilistic biomedical text summarization.

    PubMed

    Moradi, Milad; Ghadiri, Nasser

    2018-01-01

    Automatic text summarization tools help users in the biomedical domain to acquire their intended information from various textual resources more efficiently. Some of biomedical text summarization systems put the basis of their sentence selection approach on the frequency of concepts extracted from the input text. However, it seems that exploring other measures rather than the raw frequency for identifying valuable contents within an input document, or considering correlations existing between concepts, may be more useful for this type of summarization. In this paper, we describe a Bayesian summarization method for biomedical text documents. The Bayesian summarizer initially maps the input text to the Unified Medical Language System (UMLS) concepts; then it selects the important ones to be used as classification features. We introduce six different feature selection approaches to identify the most important concepts of the text and select the most informative contents according to the distribution of these concepts. We show that with the use of an appropriate feature selection approach, the Bayesian summarizer can improve the performance of biomedical summarization. Using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit, we perform extensive evaluations on a corpus of scientific papers in the biomedical domain. The results show that when the Bayesian summarizer utilizes the feature selection methods that do not use the raw frequency, it can outperform the biomedical summarizers that rely on the frequency of concepts, domain-independent and baseline methods. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Examining Teachers' Personal and Professional Use of Facebook: Recommendations for Teacher Education Programming

    ERIC Educational Resources Information Center

    Steinbrecher, Trisha; Hart, Juliet

    2012-01-01

    Members of the Net Generation are increasingly using social networking sites to interact with individuals both on and off campus. In this study, we employed a quantitative approach with an exploration of descriptive data to examine "Facebook" site features pre-service educators use and how those features are utilized in personal and…

  6. AutoSite: an automated approach for pseudo-ligands prediction—from ligand-binding sites identification to predicting key ligand atoms

    PubMed Central

    Ravindranath, Pradeep Anand; Sanner, Michel F.

    2016-01-01

    Motivation: The identification of ligand-binding sites from a protein structure facilitates computational drug design and optimization, and protein function assignment. We introduce AutoSite: an efficient software tool for identifying ligand-binding sites and predicting pseudo ligand corresponding to each binding site identified. Binding sites are reported as clusters of 3D points called fills in which every point is labelled as hydrophobic or as hydrogen bond donor or acceptor. From these fills AutoSite derives feature points: a set of putative positions of hydrophobic-, and hydrogen-bond forming ligand atoms. Results: We show that AutoSite identifies ligand-binding sites with higher accuracy than other leading methods, and produces fills that better matches the ligand shape and properties, than the fills obtained with a software program with similar capabilities, AutoLigand. In addition, we demonstrate that for the Astex Diverse Set, the feature points identify 79% of hydrophobic ligand atoms, and 81% and 62% of the hydrogen acceptor and donor hydrogen ligand atoms interacting with the receptor, and predict 81.2% of water molecules mediating interactions between ligand and receptor. Finally, we illustrate potential uses of the predicted feature points in the context of lead optimization in drug discovery projects. Availability and Implementation: http://adfr.scripps.edu/AutoDockFR/autosite.html Contact: sanner@scripps.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27354702

  7. A Fairy-Tale Landscape

    NASA Technical Reports Server (NTRS)

    2008-01-01

    [figure removed for brevity, see original site] Click on image for animation

    Fun, fairy-tale nicknames have been assigned to features in this animated view of the workspace reachable by the robotic arm of NASA's Phoenix Mars Lander. For example, 'Sleepy Hollow' denotes a trench and 'Headless' designates a rock.

    A 'National Park,' marked by purple text and a purple arrow, has been set aside for protection until scientists and engineers have tested the operation of the robotic scoop. First touches with the scoop will be to the left of the 'National Park' line.

    Scientists use such informal names for easy identification of features of interest during the mission.

    In this view, rocks are circled in yellow, other areas of interest in green. The images were taken by the lander's 7-foot mast camera, called the Surface Stereo Imager.

    The Phoenix Mission is led by the University of Arizona, Tucson, on behalf of NASA. Project management of the mission is by NASA's Jet Propulsion Laboratory, Pasadena, Calif. Spacecraft development is by Lockheed Martin Space Systems, Denver.

  8. Brownfields Green Avenue Sites: Technical Memorandum - Conceptual Design for Sustainable Redevelopment

    EPA Pesticide Factsheets

    This technical memorandum briefly describes the site and proposed conceptual site plan, indicates conceptual design considerations, specifies recommended green and sustainable features, and offers other recommendations

  9. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    ERIC Educational Resources Information Center

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  10. Comparison of the application of B-mode and strain elastography ultrasound in the estimation of lymph node metastasis of papillary thyroid carcinoma based on a radiomics approach.

    PubMed

    Liu, Tongtong; Ge, Xifeng; Yu, Jinhua; Guo, Yi; Wang, Yuanyuan; Wang, Wenping; Cui, Ligang

    2018-06-21

    B-mode ultrasound (B-US) and strain elastography ultrasound (SE-US) images have a potential to distinguish thyroid tumor with different lymph node (LN) status. The purpose of our study is to investigate whether the application of multi-modality images including B-US and SE-US can improve the discriminability of thyroid tumor with LN metastasis based on a radiomics approach. Ultrasound (US) images including B-US and SE-US images of 75 papillary thyroid carcinoma (PTC) cases were retrospectively collected. A radiomics approach was developed in this study to estimate LNs status of PTC patients. The approach included image segmentation, quantitative feature extraction, feature selection and classification. Three feature sets were extracted from B-US, SE-US, and multi-modality containing B-US and SE-US. They were used to evaluate the contribution of different modalities. A total of 684 radiomics features have been extracted in our study. We used sparse representation coefficient-based feature selection method with 10-bootstrap to reduce the dimension of feature sets. Support vector machine with leave-one-out cross-validation was used to build the model for estimating LN status. Using features extracted from both B-US and SE-US, the radiomics-based model produced an area under the receiver operating characteristic curve (AUC) [Formula: see text] 0.90, accuracy (ACC) [Formula: see text] 0.85, sensitivity (SENS) [Formula: see text] 0.77 and specificity (SPEC) [Formula: see text] 0.88, which was better than using features extracted from B-US or SE-US separately. Multi-modality images provided more information in radiomics study. Combining use of B-US and SE-US could improve the LN metastasis estimation accuracy for PTC patients.

  11. Literacy Practices in Computer-Mediated Communication in Hong Kong.

    ERIC Educational Resources Information Center

    Lee, Carmen

    2002-01-01

    Examines linguistic features of text-based computer-mediated communication (CMC) in Hong Kong. The study is based on a 70,000-word corpus of electronic mail and ICQ instant messaging texts, which were collected from students in Hong Kong. Identified language-specific features that may be seen as new literacy practices within the theoretical…

  12. Comparing the Lexical Features of EAP Students' Essays by Prompt and Rating

    ERIC Educational Resources Information Center

    Lavallée, Maxime; McDonough, Kim

    2015-01-01

    Previous research has shown that high frequency lexical items, such as AWL words and formulaic expressions, may differentiate between texts written by expert and novice writers (Chen & Baker, 2010; Hancioglu, 2009), and that lexical features related to breadth, depth, and accessibility differentiate among texts from L2 writers of different…

  13. A rare case of acute presentation of trocar site hernia from robot-assisted laparoscopic partial nephrectomy.

    PubMed

    Ng, Zi Qin; Pemberton, Richard; Tan, Patrick

    2018-02-15

    Trocar site hernia is not a common acute complication encountered after robot-assisted surgery, especially in the urological cohort of patients. A few case reports of small bowel obstruction secondary to incarceration by trocar site hernia have been described in gynaecological surgery and prostatectomies. As the clinical presentation is non-specific, late diagnosis has significant implication on morbidity and mortality. Here, we present a rare case of a patient with recent robot-assisted laparoscopic partial nephrectomy for a renal cell carcinoma presented with features of impending bowel obstruction secondary to incarcerated small bowel in the trocar site. We also reviewed the literature focusing on clinical features of trocar site hernia and preventive measures.

  14. Study on detecting leachate leakage of municipal solid waste landfill site.

    PubMed

    Liu, Jiangang; Cao, Xianxian; Ai, Yingbo; Zhou, Dongdong; Han, Qiting

    2015-06-01

    The article studies the detection of the leakage passage of leachate in a waste landfill dam. The leachate of waste landfill has its own features, like high conductivity, high chroma and an increasing temperature, also, the horizontal flow velocity of groundwater on the leakage site increases. This article proposes a comprehensive tracing method to identify the leakage site of an impermeable membrane by using these features. This method has been applied to determine two leakage sites of the Yahu municipal solid waste landfill site in Pingshan District, Shenzhen, China, which shows that there are two leachate leakage passages in the waste landfill dam A between NZK-2 and NZK-3, and between NZK-6 and NZK-7. © The Author(s) 2015.

  15. Scoping Review and Evaluation of SMS/text Messaging Platforms for mHealth Projects or Clinical Interventions

    PubMed Central

    Iribarren, Sarah; Brown, William; Giguere, Rebecca; Stone, Patricia; Schnall, Rebecca; Staggers, Nancy; Carballo-Diéguez, Alex

    2017-01-01

    Objectives Mobile technology supporting text messaging interventions (TMIs) continues to evolve, presenting challenges for researchers and healthcare professionals who need to choose software solutions to best meet their program needs. The objective of this review was to systematically identify and compare text messaging platforms and to summarize their advantages and disadvantages as described in peer-reviewed literature. Methods A scoping review was conducted using four steps: 1) identify currently available platforms through online searches and in mHealth repositories; 2) expand evaluation criteria of an mHealth mobile messaging toolkit and prior user experiences as researchers; 3) evaluate each platform’s functions and features based on the expanded criteria and a vendor survey; and 4) assess the documentation of platform use in the peer-review literature. Platforms meeting inclusion criteria were assessed independently by three reviewers and discussed until consensus was reached. The PRISMA guidelines were followed to report findings. Results Of the 1041 potentially relevant search results, 27 platforms met inclusion criteria. Most were excluded because they were not platforms (e.g., guides, toolkits, reports, or SMS gateways). Of the 27 platforms, only 12 were identified in existing mHealth repositories, 10 from Google searches, while five were found in both. The expanded evaluation criteria included 22 items. Results indicate no uniform presentation of platform features and functions, often making these difficult to discern. Fourteen of the platforms were reported as open source, 10 focused on health care and 16 were tailored to meet needs of low resource settings (not mutually exclusive). Fifteen platforms had do-it-yourself setup (programming not required) while the remainder required coding/programming skills or setups could be built to specification by the vendor. Frequently described features included data security and access to the platform via cloud-based systems. Pay structures and reported targeted end-users varied. Peer-reviewed publications listed only 6 of the 27 platforms across 21 publications. The majority of these articles reported the name of the platform used but did not describe advantages or disadvantages. Conclusions Searching for and comparing mHealth platforms for TMIs remains a challenge. The results of this review can serve as a resource for researchers and healthcare professionals wanting to integrate TMIs into health interventions. Steps to identify, compare and assess advantages and disadvantages are outlined for consideration. Expanded evaluation criteria can be used by future researchers. Continued and more comprehensive platform tools should be integrated into mHealth repositories. Detailed descriptions of platform advantages and disadvantages are needed when mHealth researchers publish findings to expand the body of research on texting-based tools for healthcare. Standardized descriptions and features are recommended for vendor sites. PMID:28347445

  16. A content analysis of Web sites promoting smoking culture and lifestyle.

    PubMed

    Ribisl, Kurt M; Lee, Rebecca E; Henriksen, Lisa; Haladjian, Harry H

    2003-02-01

    The present study examined smoking culture and lifestyle Web sites listed on Yahoo!, a popular Internet search catalog, to determine whether the sites were easily accessible to youth, featured age or health warnings, and mentioned specific tobacco brands. A content analysis of photographs on these sites assessed the demographics of individuals depicted and the amount of smoking and nudity in the photographs. The sample included 30 Web sites, all of which were accessible to youth and did not require age verification services to enter them. Cigarette brand names were mentioned in writing on 35% of the sites, and brand images were present on 24% of the sites. Nearly all of the photographs (95%) depicted smoking, 92% featured women, and 7% contained partial or full nudity. These results underscore the need for greater research and monitoring of smoking-related Internet content by health educators and tobacco control advocates.

  17. The Effectiveness of Course Web Sites in Higher Education: An Exploratory Study.

    ERIC Educational Resources Information Center

    Comunale, Christie L.; Sexton, Thomas R.; Voss, Diana J. Pedagano

    2002-01-01

    Describes an exploratory study of the educational effectiveness of course Web sites among undergraduate accounting students and graduate students in business statistics. Measured Web site visit frequency, usefulness of each site feature, and the impacts of Web sites on perceived learning and course performance. (Author/LRW)

  18. A New Scheme to Characterize and Identify Protein Ubiquitination Sites.

    PubMed

    Nguyen, Van-Nui; Huang, Kai-Yao; Huang, Chien-Hsun; Lai, K Robert; Lee, Tzong-Yi

    2017-01-01

    Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.

  19. Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility.

    PubMed

    Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna

    2016-04-07

    DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. Copyright © 2016 Vrljicak et al.

  20. Palliative Care Texts

    MedlinePlus

    ... page: https://medlineplus.gov/palliativecaretexts.html Palliative Care Texts To use the sharing features on this page, please enable JavaScript. Free text messages to support you and your family during ...

  1. Academic writing in a corpus of 4th grade science notebooks: An analysis of student language use and adult expectations of the genres of school science

    NASA Astrophysics Data System (ADS)

    Esquinca, Alberto

    This is a study of language use in the context of an inquiry-based science curriculum in which conceptual understanding ratings are used split texts into groups of "successful" and "unsuccessful" texts. "Successful" texts could include known features of science language. 420 texts generated by students in 14 classrooms from three school districts, culled from a prior study on the effectiveness of science notebooks to assess understanding, in addition to the aforementioned ratings are the data sources. In science notebooks, students write in the process of learning (here, a unit on electricity). The analytical framework is systemic functional linguistics (Halliday and Matthiessen, 2004; Eggins, 2004), specifically the concepts of genre, register and nominalization. Genre classification involves an analysis of the purpose and register features in the text (Schleppegrell, 2004). The use of features of the scientific academic register, namely the use relational processes and nominalization (Halliday and Martin, 1993), requires transitivity analysis and noun analysis. Transitivity analysis, consisting of the identification of the process type, is conducted on 4737 ranking clauses. A manual count of each noun used in the corpus allows for a typology of nouns. Four school science genres, procedures, procedural recounts reports and explanations, are found. Most texts (85.4%) are factual, and 14.1% are classified as explanations, the analytical genre. Logistic regression analysis indicates that there is no significant probability that the texts classified as explanation are placed in the group of "successful" texts. In addition, material process clauses predominate in the corpus, followed by relational process clauses. Results of a logistic regression analysis indicate that there is a significant probability (Chi square = 15.23, p < .0001) that texts with a high rate of relational processes are placed in the group of "successful" texts. In addition, 59.5% of 6511 nouns are references to physical materials, followed by references to abstract concepts (35.54%). Only two of the concept nouns were found to be nominalized referents in definition model sentences. In sum, the corpus has recognizable genres and features science language, and relational processes are more prevalent in "successful" texts. However, the pervasive feature of science language, nominalization, is scarce.

  2. Owgis 2.0: Open Source Java Application that Builds Web GIS Interfaces for Desktop Andmobile Devices

    NASA Astrophysics Data System (ADS)

    Zavala Romero, O.; Chassignet, E.; Zavala-Hidalgo, J.; Pandav, H.; Velissariou, P.; Meyer-Baese, A.

    2016-12-01

    OWGIS is an open source Java and JavaScript application that builds easily configurable Web GIS sites for desktop and mobile devices. The current version of OWGIS generates mobile interfaces based on HTML5 technology and can be used to create mobile applications. The style of the generated websites can be modified using COMPASS, a well known CSS Authoring Framework. In addition, OWGIS uses several Open Geospatial Consortium standards to request datafrom the most common map servers, such as GeoServer. It is also able to request data from ncWMS servers, allowing the websites to display 4D data from NetCDF files. This application is configured by XML files that define which layers, geographic datasets, are displayed on the Web GIS sites. Among other features, OWGIS allows for animations; streamlines from vector data; virtual globe display; vertical profiles and vertical transects; different color palettes; the ability to download data; and display text in multiple languages. OWGIS users are mainly scientists in the oceanography, meteorology and climate fields.

  3. Large-scale feature searches of collections of medical imagery

    NASA Astrophysics Data System (ADS)

    Hedgcock, Marcus W.; Karshat, Walter B.; Levitt, Tod S.; Vosky, D. N.

    1993-09-01

    Large scale feature searches of accumulated collections of medical imagery are required for multiple purposes, including clinical studies, administrative planning, epidemiology, teaching, quality improvement, and research. To perform a feature search of large collections of medical imagery, one can either search text descriptors of the imagery in the collection (usually the interpretation), or (if the imagery is in digital format) the imagery itself. At our institution, text interpretations of medical imagery are all available in our VA Hospital Information System. These are downloaded daily into an off-line computer. The text descriptors of most medical imagery are usually formatted as free text, and so require a user friendly database search tool to make searches quick and easy for any user to design and execute. We are tailoring such a database search tool (Liveview), developed by one of the authors (Karshat). To further facilitate search construction, we are constructing (from our accumulated interpretation data) a dictionary of medical and radiological terms and synonyms. If the imagery database is digital, the imagery which the search discovers is easily retrieved from the computer archive. We describe our database search user interface, with examples, and compare the efficacy of computer assisted imagery searches from a clinical text database with manual searches. Our initial work on direct feature searches of digital medical imagery is outlined.

  4. Elusive or Illuminating: Using the Web To Explore the Salem Witchcraft Trials.

    ERIC Educational Resources Information Center

    Hurter, Stephanie R.

    2003-01-01

    Presents Web sites useful for teaching about the Salem (Massachusetts) witchcraft trials. Includes Web sites that offer primary source material, collections of Web sites, teaching material, and sites that are interactive, including features, such as QuickTime movies. (CMK)

  5. Breeding site selection by coho salmon (Oncorhynchus kisutch) in relation to large wood additions and factors that influence reproductive success

    USGS Publications Warehouse

    Clark, Steven M.; Dunham, Jason B.; McEnroe, Jeffery R.; Lightcap, Scott W.

    2014-01-01

    The fitness of female Pacific salmon (Oncorhynchus spp.) with respect to breeding behavior can be partitioned into at least four fitness components: survival to reproduction, competition for breeding sites, success of egg incubation, and suitability of the local environment near breeding sites for early rearing of juveniles. We evaluated the relative influences of habitat features linked to these fitness components with respect to selection of breeding sites by coho salmon (Oncorhynchus kisutch). We also evaluated associations between breeding site selection and additions of large wood, as the latter were introduced into the study system as a means of restoring habitat conditions to benefit coho salmon. We used a model selection approach to organize specific habitat features into groupings reflecting fitness components and influences of large wood. Results of this work suggest that female coho salmon likely select breeding sites based on a wide range of habitat features linked to all four hypothesized fitness components. More specifically, model parameter estimates indicated that breeding site selection was most strongly influenced by proximity to pool-tail crests and deeper water (mean and maximum depths). Linkages between large wood and breeding site selection were less clear. Overall, our findings suggest that breeding site selection by coho salmon is influenced by a suite of fitness components in addition to the egg incubation environment, which has been the emphasis of much work in the past.

  6. Aeolian features and processes at the Mars Pathfinder landing site

    USGS Publications Warehouse

    Greeley, Ronald; Kraft, Michael; Sullivan, Robert; Wilson, Gregory; Bridges, Nathan; Herkenhoff, Ken; Kuzmin, Ruslan O.; Malin, Michael; Ward, Wes

    1999-01-01

    The Mars Pathfinder landing site contains abundant features attributed to aeolian, or wind, processes. These include wind tails, drift deposits, duneforms of various types, ripplelike features, and ventifacts (the first clearly seen on Mars). Many of these features are consistant with formation involving sand-size particles. Although some features, such as dunes, could develop from saltating sand-size aggregates of finer grains, the discovery of ventifact flutes cut in rocks strongly suggests that at least some of the grains are crystalline, rather than aggregates. Excluding the ventifacts, the orientations of the wind-related features correlate well with the orientations of bright wind steaks seen on Viking Orbiter images in the general area. They also correlate with wind direction predictions from the NASA-Ames General Circulation Model (GCM) which show that the strongest winds in the area occur in the northern hemisphere winter and are directed toward 209°. Copyright 1999 by the American Geophysical Union.

  7. HIV-1 protease cleavage site prediction based on two-stage feature selection method.

    PubMed

    Niu, Bing; Yuan, Xiao-Cheng; Roeper, Preston; Su, Qiang; Peng, Chun-Rong; Yin, Jing-Yuan; Ding, Juan; Li, HaiPeng; Lu, Wen-Cong

    2013-03-01

    Knowledge of the mechanism of HIV protease cleavage specificity is critical to the design of specific and effective HIV inhibitors. Searching for an accurate, robust, and rapid method to correctly predict the cleavage sites in proteins is crucial when searching for possible HIV inhibitors. In this article, HIV-1 protease specificity was studied using the correlation-based feature subset (CfsSubset) selection method combined with Genetic Algorithms method. Thirty important biochemical features were found based on a jackknife test from the original data set containing 4,248 features. By using the AdaBoost method with the thirty selected features the prediction model yields an accuracy of 96.7% for the jackknife test and 92.1% for an independent set test, with increased accuracy over the original dataset by 6.7% and 77.4%, respectively. Our feature selection scheme could be a useful technique for finding effective competitive inhibitors of HIV protease.

  8. Reservoir High's TE Site Wins Web Site of the Month

    ERIC Educational Resources Information Center

    Tech Directions, 2008

    2008-01-01

    This article features "Mr. Rhine's Technology Education Web Site," a winner of the Web Site of the Month. This Web site was designed by Luke Rhine, a teacher at the Reservoir High School in Fulton, Maryland. Rhine's Web site offers course descriptions and syllabuses, class calendars, lectures and presentations, design briefs and other course…

  9. PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.

    PubMed

    García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor

    2010-11-01

    PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder

  10. Infrared spectra and crystal chemistry of scapolites: implications for Martian mineralogy

    USGS Publications Warehouse

    Swayze, G.A.; Clark, R.N.

    1990-01-01

    Near-infrared and midinfrared spectra of a wide range of scapolite compositions were studied to determine the cause of the 2.36-??m features that have been correlated with similar features in the near-IR spectrum of Mars. We attribute the 2.36-??m features to vibrations caused by HCO-3 and HSO-4 in the anion sites of scapolite. The 2.36-??m absorption complex consists of four overlapping bands. The relative intensities of all four bands vary according to the HCO-3/HSO-4 ratio and disordered anion site occupancy. The positional disorder of HCO-3 and HSO4 in the low-symmetry anion site of scapolite gives the 2.36-??m band complex a unique spectral signature not likely to be duplicated in any other mineral. -from Authors

  11. Optimal classification for the diagnosis of duchenne muscular dystrophy images using support vector machines.

    PubMed

    Zhang, Ming-Huan; Ma, Jun-Shan; Shen, Ying; Chen, Ying

    2016-09-01

    This study aimed to investigate the optimal support vector machines (SVM)-based classifier of duchenne muscular dystrophy (DMD) magnetic resonance imaging (MRI) images. T1-weighted (T1W) and T2-weighted (T2W) images of the 15 boys with DMD and 15 normal controls were obtained. Textural features of the images were extracted and wavelet decomposed, and then, principal features were selected. Scale transform was then performed for MRI images. Afterward, SVM-based classifiers of MRI images were analyzed based on the radical basis function and decomposition levels. The cost (C) parameter and kernel parameter [Formula: see text] were used for classification. Then, the optimal SVM-based classifier, expressed as [Formula: see text]), was identified by performance evaluation (sensitivity, specificity and accuracy). Eight of 12 textural features were selected as principal features (eigenvalues [Formula: see text]). The 16 SVM-based classifiers were obtained using combination of (C, [Formula: see text]), and those with lower C and [Formula: see text] values showed higher performances, especially classifier of [Formula: see text]). The SVM-based classifiers of T1W images showed higher performance than T1W images at the same decomposition level. The T1W images in classifier of [Formula: see text]) at level 2 decomposition showed the highest performance of all, and its overall correct sensitivity, specificity, and accuracy reached 96.9, 97.3, and 97.1 %, respectively. The T1W images in SVM-based classifier [Formula: see text] at level 2 decomposition showed the highest performance of all, demonstrating that it was the optimal classification for the diagnosis of DMD.

  12. Continued benefits of a technical assistance web site to local tobacco control coalitions during a state budget shortfall.

    PubMed

    Buller, David B; Young, Walter F; Bettinghaus, Erwin P; Borland, Ron; Walther, Joseph B; Helme, Donald; Andersen, Peter A; Cutter, Gary R; Maloy, Julie A

    2011-01-01

    A state budget shortfall defunded 10 local tobacco coalitions during a randomized trial but defunded coalitions continued to have access to 2 technical assistance Web sites. To test the ability of Web-based technology to provide technical assistance to local tobacco control coalitions. Randomized 2-group trial with local tobacco control coalitions as the unit of randomization. Local communities (ie, counties) within the State of Colorado. Leaders and members in 34 local tobacco control coalitions funded by the state health department in Colorado. Two technical assistance Web sites: A Basic Web site with text-based information and a multimedia Enhanced Web site containing learning modules, resources, and communication features. Use of the Web sites in minutes, pages, and session and evaluations of coalition functioning on coalition development, conflict resolution, leadership satisfaction, decision-making satisfaction, shared mission, personal involvement, and organization involvement in survey of leaders and members. Coalitions that were defunded but had access to the multimedia Enhanced Web site during the Fully Funded period and after defunding continued to use it (treatment group × funding status × period, F(3,714) = 3.18, P = .0234). Coalitions with access to the Basic Web site had low Web site use throughout and use by defunded coalitions was nearly zero when funding ceased. Members in defunded Basic Web site coalitions reported that their coalitions functioned worse than defunded Enhanced Web site coalitions (coalition development: group × status, F(1,360) = 4.81, P = .029; conflict resolution: group × status, F(1,306) = 5.69, P = .018; leadership satisfaction: group × status, F(1,342) = 5.69, P = .023). The Enhanced Web site may have had a protective effect on defunded coalitions. Defunded coalitions may have increased their capacity by using the Enhanced Web site when fully funded or by continuing to use the available online resources after defunding. Web-based technical assistance with online training and resources may be a good investment when future funding is not ensured.

  13. Early Readers and Electronic Texts: CD-ROM Storybook Features That Influence Reading Behaviors

    ERIC Educational Resources Information Center

    Lefever-Davis, Shirley; Pearman, Cathy

    2005-01-01

    This research explores the impact of CD-ROM storybook features on the reading behaviors of 6- and 7-year-old students with limited exposure to CD-ROM storybooks. Six categories of behaviors were identified: tracking, electronic feature dependency, distractibility, spectator stance, electronic feature limitations, and electronic features as tools.…

  14. Assessing mental stress from the photoplethysmogram: a numerical study

    PubMed Central

    Charlton, Peter H; Celka, Patrick; Farukh, Bushra; Chowienczyk, Phil; Alastruey, Jordi

    2018-01-01

    Abstract Objective: Mental stress is detrimental to cardiovascular health, being a risk factor for coronary heart disease and a trigger for cardiac events. However, it is not currently routinely assessed. The aim of this study was to identify features of the photoplethysmogram (PPG) pulse wave which are indicative of mental stress. Approach: A numerical model of pulse wave propagation was used to simulate blood pressure signals, from which simulated PPG pulse waves were estimated using a transfer function. Pulse waves were simulated at six levels of stress by changing the model input parameters both simultaneously and individually, in accordance with haemodynamic changes associated with stress. Thirty-two feature measurements were extracted from pulse waves at three measurement sites: the brachial, radial and temporal arteries. Features which changed significantly with stress were identified using the Mann–Kendall monotonic trend test. Main results: Seventeen features exhibited significant trends with stress in measurements from at least one site. Three features showed significant trends at all three sites: the time from pulse onset to peak, the time from the dicrotic notch to pulse end, and the pulse rate. More features showed significant trends at the radial artery (15) than the brachial (8) or temporal (7) arteries. Most features were influenced by multiple input parameters. Significance: The features identified in this study could be used to monitor stress in healthcare and consumer devices. Measurements at the radial artery may provide superior performance than the brachial or temporal arteries. In vivo studies are required to confirm these observations. PMID:29658894

  15. Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models

    PubMed Central

    Dai, Jin; Liu, Xin

    2014-01-01

    The similarity between objects is the core research area of data mining. In order to reduce the interference of the uncertainty of nature language, a similarity measurement between normal cloud models is adopted to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently accomplish conversion between qualitative concept and quantitative data. Through the conversion from text set to text information table based on VSM model, the text qualitative concept, which is extraction from the same category, is jumping up as a whole category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. By the comparison among different text classifiers in different feature selection set, it fully proves that not only does CCJU-TC have a strong ability to adapt to the different text features, but also the classification performance is also better than the traditional classifiers. PMID:24711737

  16. Terrestrial Analogs to Wind-Related Features at the Viking and Pathfinder Landing Sites on Mars

    NASA Technical Reports Server (NTRS)

    Greeley, Ronald; Bridges, Nathan T.; Kuzmin, Ruslan O.; Laity, Julie E.

    2002-01-01

    Features in the Mojave Desert and Iceland provide insight into the characteristics and origin of Martian wind-related landforms seen by the Viking and Pathfinder landers. The terrestrial sites were chosen because they exhibit diverse wind features that are generally well understood. These features have morphologies comparable to those on Mars and include origins by deposition and erosion, with erosional processes modifying both soils and rocks. Duneforms and drifts are the most common depositional features seen at the Martian landing sites and indicate supplies of sand-sized particles blown by generally unidirectional winds. Erosional features include lag deposits, moat-like depressions around some rocks, and exhumed soil horizons. They indicate that wind can deflate at least some sediments and that this process is particularly effective where the wind interacts with rocks. The formation of ripples and wind tails involves a combination of depositional and erosional processes. Rock erosional features, or ventifacts, are recognized by their overall shapes, erosional flutes, and characteristic surface textures resulting from abrasion by windblown particles. The physics of saltation requires that particles in ripples and duneforms are predominantly sand-sized (60-2000 microns). The orientations of duneforms, wind tails, moats, and ventifacts are correlated with surface winds above particle threshold. Such winds are influenced by local topography and are correlated with winds at higher altitudes predicted by atmospheric models.

  17. Shared Features of L2 Writing: Intergroup Homogeneity and Text Classification

    ERIC Educational Resources Information Center

    Crossley, Scott A.; McNamara, Danielle S.

    2011-01-01

    This study investigates intergroup homogeneity within high intermediate and advanced L2 writers of English from Czech, Finnish, German, and Spanish first language backgrounds. A variety of linguistic features related to lexical sophistication, syntactic complexity, and cohesion were used to compare texts written by L1 speakers of English to L2…

  18. Developing an Approach for Comparing Students' Multimodal Text Creations: A Case Study

    ERIC Educational Resources Information Center

    Levy, Mike; Kimber, Kay

    2009-01-01

    Classroom teachers routinely make judgments on the quality of their students' work based on their recognition of how effectively the student has assembled key features of the genre or the medium. Yet how readily can teachers talk about the features of student-created multimodal texts in ways that can improve learning and performance? This article…

  19. An Analysis of English Business Letters from the Perspective of Interpersonal Function

    ERIC Educational Resources Information Center

    Xu, Bo

    2012-01-01

    The purpose of the present study is to find out the features of English business letters. Halliday's systemic functional linguistics is used as the theoretical framework, mainly, interpersonal fucntion. The English business letter (EBL) is an important written text used for international business communication and it has its own features of text.…

  20. Estimating Missing Features to Improve Multimedia Information Retrieval

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bagherjeiran, A; Love, N S; Kamath, C

    Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features.more » In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.« less

  1. e-Ana and e-Mia: A Content Analysis of Pro–Eating Disorder Web Sites

    PubMed Central

    Schenk, Summer; Wilson, Jenny L.; Peebles, Rebecka

    2010-01-01

    Objectives. The Internet offers Web sites that describe, endorse, and support eating disorders. We examined the features of pro–eating disorder Web sites and the messages to which users may be exposed. Methods. We conducted a systematic content analysis of 180 active Web sites, noting site logistics, site accessories, “thinspiration” material (images and prose intended to inspire weight loss), tips and tricks, recovery, themes, and perceived harm. Results. Practically all (91%) of the Web sites were open to the public, and most (79%) had interactive features. A large majority (84%) offered pro-anorexia content, and 64% provided pro-bulimia content. Few sites focused on eating disorders as a lifestyle choice. Thinspiration material appeared on 85% of the sites, and 83% provided overt suggestions on how to engage in eating-disordered behaviors. Thirty-eight percent of the sites included recovery-oriented information or links. Common themes were success, control, perfection, and solidarity. Conclusions. Pro–eating disorder Web sites present graphic material to encourage, support, and motivate site users to continue their efforts with anorexia and bulimia. Continued monitoring will offer a valuable foundation to build a better understanding of the effects of these sites on their users. PMID:20558807

  2. Portable automatic text classification for adverse drug reaction detection via multi-corpus training.

    PubMed

    Sarker, Abeed; Gonzalez, Graciela

    2015-02-01

    Automatic detection of adverse drug reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media-where enormous amounts of user posted data is available, which have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing (NLP) approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies. One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies. Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively. Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks. Our experiments illustrate the benefits of incorporating various semantic features such as topics, concepts, sentiments, and polarities. Finally, we show that integration of information from compatible corpora can significantly improve classification performance. This form of multi-corpus training may be particularly useful in cases where data sets are heavily imbalanced (e.g., social media data), and may reduce the time and costs associated with the annotation of data in the future. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Portable Automatic Text Classification for Adverse Drug Reaction Detection via Multi-corpus Training

    PubMed Central

    Gonzalez, Graciela

    2014-01-01

    Objective Automatic detection of Adverse Drug Reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media — where enormous amounts of user posted data is available, which have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies. Methods One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies. Results Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively. Conclusions Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks. Our experiments illustrate the benefits of incorporating various semantic features such as topics, concepts, sentiments, and polarities. Finally, we show that integration of information from compatible corpora can significantly improve classification performance. This form of multi-corpus training may be particularly useful in cases where data sets are heavily imbalanced (e.g., social media data), and may reduce the time and costs associated with the annotation of data in the future. PMID:25451103

  4. Characterizing spatial structure of sediment E. coli populations to inform sampling design.

    PubMed

    Piorkowski, Gregory S; Jamieson, Rob C; Hansen, Lisbeth Truelstrup; Bezanson, Greg S; Yost, Chris K

    2014-01-01

    Escherichia coli can persist in streambed sediments and influence water quality monitoring programs through their resuspension into overlying waters. This study examined the spatial patterns in E. coli concentration and population structure within streambed morphological features during baseflow and following stormflow to inform sampling strategies for representative characterization of E. coli populations within a stream reach. E. coli concentrations in bed sediments were significantly different (p = 0.002) among monitoring sites during baseflow, and significant interactive effects (p = 0.002) occurred among monitoring sites and morphological features following stormflow. Least absolute shrinkage and selection operator (LASSO) regression revealed that water velocity and effective particle size (D 10) explained E. coli concentration during baseflow, whereas sediment organic carbon, water velocity and median particle diameter (D 50) were important explanatory variables following stormflow. Principle Coordinate Analysis illustrated the site-scale differences in sediment E. coli populations between disconnected stream segments. Also, E. coli populations were similar among depositional features within a reach, but differed in relation to high velocity features (e.g., riffles). Canonical correspondence analysis resolved that E. coli population structure was primarily explained by spatial (26.9–31.7 %) over environmental variables (9.2–13.1 %). Spatial autocorrelation existed among monitoring sites and morphological features for both sampling events, and gradients in mean particle diameter and water velocity influenced E. coli population structure for the baseflow and stormflow sampling events, respectively. Representative characterization of streambed E. coli requires sampling of depositional and high velocity environments to accommodate strain selectivity among these features owing to sediment and water velocity heterogeneity.

  5. Quality of medication information available on retail pharmacy Web sites.

    PubMed

    Ghoshal, Malini; Walji, Muhammad F

    2006-12-01

    The Internet is becoming an important source for medication information. Although the quality of consumer medication information (CMI) in brick and mortar pharmacies has been reported to be suboptimal, little is known about the quality of CMI offered by pharmacy Web sites. To evaluate the quality, readability, and provision of Web functionality of 4 popular medications (atenolol, nitroglycerin, atorvastatin, and glyburide) available on the websites of 3 of the largest retail pharmacies: Walgreens, CVS Pharmacy, and Rite Aid. The quality of online medication information was evaluated by 2 reviewers using a preexisting evaluation instrument created by a national panel of experts. Readability level was assessed using the Gunning Fog Test. We also assessed the presence of 4 Web-specific functional criteria: (1) capability for font enlargement, (2) availability of a glossary of terms, (3) presence of an "Ask a pharmacist" feature, and (4) access to detailed medication information or full prescribing information. Overall, medication information was 77% adherent to the criteria evaluated. When broken down by drug, CMI was most adherent for atorvastatin (83%), followed by glyburide (77%), atenolol (76%), and nitroglycerin (75%). The average readability level was found to be 10th grade. No pharmacy Web site provided the ability for font enlargement, a glossary of terms, or access to detailed medication information; however, all pharmacy Web sites provided an "Ask a pharmacist" service. Although pharmacy Web sites were found to have an overall good content quality, the high readability level of text, areas of incomplete information, and limited use of desirable Web functionality suggest room for improvement.

  6. Perceptions of Business Students' Feature Requirements in Educational Web Sites

    ERIC Educational Resources Information Center

    Hazari, Sunil; Johnson, Barbara

    2007-01-01

    There is paucity of original research that explains phenomena related to content organization and site design of educational Web sites. Educational Web sites are often used to provide Web-based instruction, which itself is a relatively recent phenomenon for business schools, and additional research is needed in this area. Educational Web sites are…

  7. A Feature Selection Method Based on Fisher's Discriminant Ratio for Text Sentiment Classification

    NASA Astrophysics Data System (ADS)

    Wang, Suge; Li, Deyu; Wei, Yingjie; Li, Hongxia

    With the rapid growth of e-commerce, product reviews on the Web have become an important information source for customers' decision making when they intend to buy some product. As the reviews are often too many for customers to go through, how to automatically classify them into different sentiment orientation categories (i.e. positive/negative) has become a research problem. In this paper, based on Fisher's discriminant ratio, an effective feature selection method is proposed for product review text sentiment classification. In order to validate the validity of the proposed method, we compared it with other methods respectively based on information gain and mutual information while support vector machine is adopted as the classifier. In this paper, 6 subexperiments are conducted by combining different feature selection methods with 2 kinds of candidate feature sets. Under 1006 review documents of cars, the experimental results indicate that the Fisher's discriminant ratio based on word frequency estimation has the best performance with F value 83.3% while the candidate features are the words which appear in both positive and negative texts.

  8. Relating interesting quantitative time series patterns with text events and text features

    NASA Astrophysics Data System (ADS)

    Wanner, Franz; Schreck, Tobias; Jentner, Wolfgang; Sharalieva, Lyubka; Keim, Daniel A.

    2013-12-01

    In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis work flow. The workflow can also be generalized to other application domains such as data analysis of smart grids, cyber physical systems or the security of critical infrastructure, where the data consists of a combination of quantitative and textual time series data.

  9. Cascade detection for the extraction of localized sequence features; specificity results for HIV-1 protease and structure-function results for the Schellman loop.

    PubMed

    Newell, Nicholas E

    2011-12-15

    The extraction of the set of features most relevant to function from classified biological sequence sets is still a challenging problem. A central issue is the determination of expected counts for higher order features so that artifact features may be screened. Cascade detection (CD), a new algorithm for the extraction of localized features from sequence sets, is introduced. CD is a natural extension of the proportional modeling techniques used in contingency table analysis into the domain of feature detection. The algorithm is successfully tested on synthetic data and then applied to feature detection problems from two different domains to demonstrate its broad utility. An analysis of HIV-1 protease specificity reveals patterns of strong first-order features that group hydrophobic residues by side chain geometry and exhibit substantial symmetry about the cleavage site. Higher order results suggest that favorable cooperativity is weak by comparison and broadly distributed, but indicate possible synergies between negative charge and hydrophobicity in the substrate. Structure-function results for the Schellman loop, a helix-capping motif in proteins, contain strong first-order features and also show statistically significant cooperativities that provide new insights into the design of the motif. These include a new 'hydrophobic staple' and multiple amphipathic and electrostatic pair features. CD should prove useful not only for sequence analysis, but also for the detection of multifactor synergies in cross-classified data from clinical studies or other sources. Windows XP/7 application and data files available at: https://sites.google.com/site/cascadedetect/home. nacnewell@comcast.net Supplementary information is available at Bioinformatics online.

  10. Full-Text Databases in Medicine.

    ERIC Educational Resources Information Center

    Sievert, MaryEllen C.; And Others

    1995-01-01

    Describes types of full-text databases in medicine; discusses features for searching full-text journal databases available through online vendors; reviews research on full-text databases in medicine; and describes the MEDLINE/Full-Text Research Project at the University of Missouri (Columbia) which investigated precision, recall, and relevancy.…

  11. Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources.

    PubMed

    Yu, Sheng; Liao, Katherine P; Shaw, Stanley Y; Gainer, Vivian S; Churchill, Susanne E; Szolovits, Peter; Murphy, Shawn N; Kohane, Isaac S; Cai, Tianxi

    2015-09-01

    Analysis of narrative (text) data from electronic health records (EHRs) can improve population-scale phenotyping for clinical and genetic research. Currently, selection of text features for phenotyping algorithms is slow and laborious, requiring extensive and iterative involvement by domain experts. This paper introduces a method to develop phenotyping algorithms in an unbiased manner by automatically extracting and selecting informative features, which can be comparable to expert-curated ones in classification accuracy. Comprehensive medical concepts were collected from publicly available knowledge sources in an automated, unbiased fashion. Natural language processing (NLP) revealed the occurrence patterns of these concepts in EHR narrative notes, which enabled selection of informative features for phenotype classification. When combined with additional codified features, a penalized logistic regression model was trained to classify the target phenotype. The authors applied our method to develop algorithms to identify patients with rheumatoid arthritis and coronary artery disease cases among those with rheumatoid arthritis from a large multi-institutional EHR. The area under the receiver operating characteristic curves (AUC) for classifying RA and CAD using models trained with automated features were 0.951 and 0.929, respectively, compared to the AUCs of 0.938 and 0.929 by models trained with expert-curated features. Models trained with NLP text features selected through an unbiased, automated procedure achieved comparable or slightly higher accuracy than those trained with expert-curated features. The majority of the selected model features were interpretable. The proposed automated feature extraction method, generating highly accurate phenotyping algorithms with improved efficiency, is a significant step toward high-throughput phenotyping. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. Predicting lysine glycation sites using bi-profile bayes feature extraction.

    PubMed

    Ju, Zhe; Sun, Juhe; Li, Yanjie; Wang, Li

    2017-12-01

    Glycation is a nonenzymatic post-translational modification which has been found to be involved in various biological processes and closely associated with many metabolic diseases. The accurate identification of glycation sites is important to understand the underlying molecular mechanisms of glycation. As the traditional experimental methods are often labor-intensive and time-consuming, it is desired to develop computational methods to predict glycation sites. In this study, a novel predictor named BPB_GlySite is proposed to predict lysine glycation sites by using bi-profile bayes feature extraction and support vector machine algorithm. As illustrated by 10-fold cross-validation, BPB_GlySite achieves a satisfactory performance with a Sensitivity of 63.68%, a Specificity of 72.60%, an Accuracy of 69.63% and a Matthew's correlation coefficient of 0.3499. Experimental results also indicate that BPB_GlySite significantly outperforms three existing glycation sites predictors: NetGlycate, PreGly and Gly-PseAAC. Therefore, BPB_GlySite can be a useful bioinformatics tool for the prediction of glycation sites. A user-friendly web-server for BPB_GlySite is established at 123.206.31.171/BPB_GlySite/. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Segmental Rescoring in Text Recognition

    DTIC Science & Technology

    2014-02-04

    description relates to rescoring text hypotheses in text recognition based on segmental features. Offline printed text and handwriting recognition (OHR) can... Handwriting , College Park, Md., 2006, which is incorporated by reference here. For the set of training images 202, a character modeler 208 receives

  14. Efficient and sparse feature selection for biomedical text classification via the elastic net: Application to ICU risk stratification from nursing notes.

    PubMed

    Marafino, Ben J; Boscardin, W John; Dudley, R Adams

    2015-04-01

    Sparsity is often a desirable property of statistical models, and various feature selection methods exist so as to yield sparser and interpretable models. However, their application to biomedical text classification, particularly to mortality risk stratification among intensive care unit (ICU) patients, has not been thoroughly studied. To develop and characterize sparse classifiers based on the free text of nursing notes in order to predict ICU mortality risk and to discover text features most strongly associated with mortality. We selected nursing notes from the first 24h of ICU admission for 25,826 adult ICU patients from the MIMIC-II database. We then developed a pair of stochastic gradient descent-based classifiers with elastic-net regularization. We also studied the performance-sparsity tradeoffs of both classifiers as their regularization parameters were varied. The best-performing classifier achieved a 10-fold cross-validated AUC of 0.897 under the log loss function and full L2 regularization, while full L1 regularization used just 0.00025% of candidate input features and resulted in an AUC of 0.889. Using the log loss (range of AUCs 0.889-0.897) yielded better performance compared to the hinge loss (0.850-0.876), but the latter yielded even sparser models. Most features selected by both classifiers appear clinically relevant and correspond to predictors already present in existing ICU mortality models. The sparser classifiers were also able to discover a number of informative - albeit nonclinical - features. The elastic-net-regularized classifiers perform reasonably well and are capable of reducing the number of features required by over a thousandfold, with only a modest impact on performance. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection.

    PubMed

    Chen, Yifei; Sun, Yuxing; Han, Bing-Qing

    2015-01-01

    Protein interaction article classification is a text classification task in the biological domain to determine which articles describe protein-protein interactions. Since the feature space in text classification is high-dimensional, feature selection is widely used for reducing the dimensionality of features to speed up computation without sacrificing classification performance. Many existing feature selection methods are based on the statistical measure of document frequency and term frequency. One potential drawback of these methods is that they treat features separately. Hence, first we design a similarity measure between the context information to take word cooccurrences and phrase chunks around the features into account. Then we introduce the similarity of context information to the importance measure of the features to substitute the document and term frequency. Hence we propose new context similarity-based feature selection methods. Their performance is evaluated on two protein interaction article collections and compared against the frequency-based methods. The experimental results reveal that the context similarity-based methods perform better in terms of the F1 measure and the dimension reduction rate. Benefiting from the context information surrounding the features, the proposed methods can select distinctive features effectively for protein interaction article classification.

  16. Student-Accessible Science Texts: Elements of Design

    ERIC Educational Resources Information Center

    McTigue, Erin M.; Slough, Scott W.

    2010-01-01

    Within this article, we introduce our conception of text accessibility. First, we synthesize recent research on informational text quality and present key attributes proven to contribute to comprehension of science texts beyond the readability formula. These features include (a) the concreteness of text, (b) the voice of the author, (c) coherent…

  17. The Haw River Sites: Archaeological Investigations at Two Stratified Sites in the North Carolina Piedmont. Volume III.

    DTIC Science & Technology

    1982-04-01

    Excavation Unit 5 - Square 8 - Level 4 - Feature 12 - n - 12 (ID numbers 368-379 in Appendix 3, Table 3) Vessel II Block C - Excavation Unit 6 - Square 8...Level 4 - Feature 7 - See Figure 7.13 - n 38 (ID numbers 380-417 In Appendix 3, Table 3) OO Vessel Ill Block C - Excavation Unit 7 - Square 1 - levels...4-8 - Feature 5 - See Figure 7.12 - n - 90 (ID numbers 418-507 In Appendix 3, Table 3) Vessel IV Block C - Excavation Unit 7 - Square 1 - Levels 4-8

  18. Reading without Words: Using the Arrival to Teach Visual Literacy with English Language Learners

    ERIC Educational Resources Information Center

    Mathews, Sarah A.

    2014-01-01

    This article highlights the use of Shaun Tan's "The Arrival" to teach literacy to English Language Learners in social studies classrooms. The featured text is a book that displays the complexity of migration within a text that does not feature a single written word. The author describes a variety of mini-lessons geared towards…

  19. Aluminium X-ray absorption Near Edge Structure in model compounds and Earth's surface minerals

    NASA Astrophysics Data System (ADS)

    Ildefonse, P.; Cabaret, D.; Sainctavit, P.; Calas, G.; Flank, A.-M.; Lagarde, P.

    Aluminium K-edge X-ray absorption near edge spectra (XANES) of a suite of silicate and oxides minerals consist of electronic excitations occurring in the edge region, and multiple scattering resonances at higher energies. The main XANES feature for four-fold Al is at around 2 eV lower energy than the main XANES feature for six-fold Al. This provides a useful probe for coordination numbers in clay minerals, gels, glasses or material with unknown Al-coordination number. Six-fold aluminium yields a large variety of XANES features which can be correlated with octahedral point symmetry, number of aluminium sites and distribution of Al-O distances. These three parameters may act together, and the quantitative interpretation of XANES spectra is difficult. For a low point symmetry (1), variations are mainly related to the number of Al sites and distribution of Al-O distances: pyrophyllite, one Al site, is clearly distinguished from kaolinite and gibbsite presenting two Al sites. For a given number of Al-site (1), variations are controlled by changes in point symmetry, the number of XANES features being increased as point symmetry decreases. For a given point symmetry (1) and a given number of Al site (1), variations are related to second nearest neighbours (gibbsite versus kaolinite). The amplitude of the XANES feature at about 1566 eV is a useful probe for the assessment of AlIV/Altotal ratios in 2/1 phyllosilicates. Al-K XANES has been performed on synthetic Al-bearing goethites which cannot be studied by 27Al NMR. At low Al content, Al-K XANES is very different from that of α-AlOOH but at the highest level, XANES spectrum tends to that of diaspore. Al-K XAS is thus a promising tool for the structural study of poorly ordered materials such as clay minerals and natural alumino-silicate gels together with Al-subsituted Fe-oxyhydroxides.

  20. Habitat features and predictive habitat modeling for the Colorado chipmunk in southern New Mexico

    USGS Publications Warehouse

    Rivieccio, M.; Thompson, B.C.; Gould, W.R.; Boykin, K.G.

    2003-01-01

    Two subspecies of Colorado chipmunk (state threatened and federal species of concern) occur in southern New Mexico: Tamias quadrivittatus australis in the Organ Mountains and T. q. oscuraensis in the Oscura Mountains. We developed a GIS model of potentially suitable habitat based on vegetation and elevation features, evaluated site classifications of the GIS model, and determined vegetation and terrain features associated with chipmunk occurrence. We compared GIS model classifications with actual vegetation and elevation features measured at 37 sites. At 60 sites we measured 18 habitat variables regarding slope, aspect, tree species, shrub species, and ground cover. We used logistic regression to analyze habitat variables associated with chipmunk presence/absence. All (100%) 37 sample sites (28 predicted suitable, 9 predicted unsuitable) were classified correctly by the GIS model regarding elevation and vegetation. For 28 sites predicted suitable by the GIS model, 18 sites (64%) appeared visually suitable based on habitat variables selected from logistic regression analyses, of which 10 sites (36%) were specifically predicted as suitable habitat via logistic regression. We detected chipmunks at 70% of sites deemed suitable via the logistic regression models. Shrub cover, tree density, plant proximity, presence of logs, and presence of rock outcrop were retained in the logistic model for the Oscura Mountains; litter, shrub cover, and grass cover were retained in the logistic model for the Organ Mountains. Evaluation of predictive models illustrates the need for multi-stage analyses to best judge performance. Microhabitat analyses indicate prospective needs for different management strategies between the subspecies. Sensitivities of each population of the Colorado chipmunk to natural and prescribed fire suggest that partial burnings of areas inhabited by Colorado chipmunks in southern New Mexico may be beneficial. These partial burnings may later help avoid a fire that could substantially reduce habitat of chipmunks over a mountain range.

  1. Representing nested semantic information in a linear string of text using XML.

    PubMed

    Krauthammer, Michael; Johnson, Stephen B; Hripcsak, George; Campbell, David A; Friedman, Carol

    2002-01-01

    XML has been widely adopted as an important data interchange language. The structure of XML enables sharing of data elements with variable degrees of nesting as long as the elements are grouped in a strict tree-like fashion. This requirement potentially restricts the usefulness of XML for marking up written text, which often includes features that do not properly nest within other features. We encountered this problem while marking up medical text with structured semantic information from a Natural Language Processor. Traditional approaches to this problem separate the structured information from the actual text mark up. This paper introduces an alternative solution, which tightly integrates the semantic structure with the text. The resulting XML markup preserves the linearity of the medical texts and can therefore be easily expanded with additional types of information.

  2. Representing nested semantic information in a linear string of text using XML.

    PubMed Central

    Krauthammer, Michael; Johnson, Stephen B.; Hripcsak, George; Campbell, David A.; Friedman, Carol

    2002-01-01

    XML has been widely adopted as an important data interchange language. The structure of XML enables sharing of data elements with variable degrees of nesting as long as the elements are grouped in a strict tree-like fashion. This requirement potentially restricts the usefulness of XML for marking up written text, which often includes features that do not properly nest within other features. We encountered this problem while marking up medical text with structured semantic information from a Natural Language Processor. Traditional approaches to this problem separate the structured information from the actual text mark up. This paper introduces an alternative solution, which tightly integrates the semantic structure with the text. The resulting XML markup preserves the linearity of the medical texts and can therefore be easily expanded with additional types of information. PMID:12463856

  3. Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.

    PubMed

    Zerrouki, Taha; Balla, Amar

    2017-04-01

    Arabic diacritics are often missed in Arabic scripts. This feature is a handicap for new learner to read َArabic, text to speech conversion systems, reading and semantic analysis of Arabic texts. The automatic diacritization systems are the best solution to handle this issue. But such automation needs resources as diactritized texts to train and evaluate such systems. In this paper, we describe our corpus of Arabic diacritized texts. This corpus is called Tashkeela. It can be used as a linguistic resource tool for natural language processing such as automatic diacritics systems, dis-ambiguity mechanism, features and data extraction. The corpus is freely available, it contains 75 million of fully vocalized words mainly 97 books from classical and modern Arabic language. The corpus is collected from manually vocalized texts using web crawling process.

  4. A Geospatial Database that Supports Derivation of Climatological Features of Severe Weather

    NASA Astrophysics Data System (ADS)

    Phillips, M.; Ansari, S.; Del Greco, S.

    2007-12-01

    The Severe Weather Data Inventory (SWDI) at NOAA's National Climatic Data Center (NCDC) provides user access to archives of several datasets critical to the detection and evaluation of severe weather. These datasets include archives of: · NEXRAD Level-III point features describing general storm structure, hail, mesocyclone and tornado signatures · National Weather Service Storm Events Database · National Weather Service Local Storm Reports collected from storm spotters · National Weather Service Warnings · Lightning strikes from Vaisala's National Lightning Detection Network (NLDN) SWDI archives all of these datasets in a spatial database that allows for convenient searching and subsetting. These data are accessible via the NCDC web site, Web Feature Services (WFS) or automated web services. The results of interactive web page queries may be saved in a variety of formats, including plain text, XML, Google Earth's KMZ, standards-based NetCDF and Shapefile. NCDC's Storm Risk Assessment Project (SRAP) uses data from the SWDI database to derive gridded climatology products that show the spatial distributions of the frequency of various events. SRAP also can relate SWDI events to other spatial data such as roads, population, watersheds, and other geographic, sociological, or economic data to derive products that are useful in municipal planning, emergency management, the insurance industry, and other areas where there is a need to quantify and qualify how severe weather patterns affect people and property.

  5. Digisonde at Sondrestrom to monitor the ionospheric polar cap and cusp region. Technical report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Crowley, G.; Reinisch, B.W.; Kitrosser, D.F.

    1990-01-01

    In July 1989, the Air Force meridional chain of Digisondes was completed by the installation of a system in Sondrestromfjord, Greenland (66.98 deg N, 50.94 deg W). In this report we describe the Sondrestrom site and instrument, and the relationship between Sondrestrom and the other AF sites. We also established the importance of this site by describing its geophysically unique features. Finally, some of the first measurements from Sondrestrom are presented, and interpreted in terms of high latitude features. Keywords: Geomagnetism; Plasmas physics drift; Magnetosphere; Ionosphere; Polar cap; Ionosondes.

  6. Site-specific electronic structure analysis by channeling EELS and first-principles calculations.

    PubMed

    Tatsumi, Kazuyoshi; Muto, Shunsuke; Yamamoto, Yu; Ikeno, Hirokazu; Yoshioka, Satoru; Tanaka, Isao

    2006-01-01

    Site-specific electronic structures were investigated by electron energy loss spectroscopy (EELS) under electron channeling conditions. The Al-K and Mn-L(2,3) electron energy loss near-edge structure (ELNES) of, respectively, NiAl2O4 and Mn3O4 were measured. Deconvolution of the raw spectra with the instrumental resolution function restored the blunt and hidden fine features, which allowed us to interpret the experimental spectral features by comparing with theoretical spectra obtained by first-principles calculations. The present method successfully revealed the electronic structures specific to the differently coordinated cationic sites.

  7. Exploration of the West Florida Shelf Blue Holes Investigation of Physical and Biological Characteristics and Archaeological Implications of Unique Karst Features

    NASA Astrophysics Data System (ADS)

    Culter, J. K.

    2006-12-01

    The west Florida continental shelf is nearly as large as peninsular Florida and embraces a vast mosaic of marine habitats. The dominant shelf habitats have been described and studied to some degree. However, the offshore submerged sinkhole and spring features (blue holes) have not been scientifically described or studied, with the exception of one site called the Mudhole, a saltwater spring off Ft. Myers Beach. These features are relatively small habitats by standards of aerial coverage, but are probably more common than previously thought. These habitats are very unique shelf features, a reef in reverse, representing island habitats on the Florida shelf. This study was initiated in summer 2005 to describe the biota associated with the offshore blue hole features of this region and search for new sites. Eleven sites off the west central Florida coast have been verified and data has been collected at eight locations, all greater than 30 miles offshore. Most blue holes exhibit similar structural features, which divide the biota into zones. Pelagic species, such as amberjack, occupy the water column above the holes and reef species populate the rim. All of the sites investigated harbor one or more goliath grouper (Epinephelus itajara) and most of the features have resident nurse sharks (Ginglymostoma cirratum). Pelagic sharks periodically visit the sites and sea turtles are frequently observed at or near the holes. Whale sharks (Rhincodon typus) also seem to have an affinity for these features. The reef fauna that occupy the rim rapidly decline in abundance and diversity deeper into the holes with the deepest fauna being calcareous tube dwelling polychaetes that grow down to the edges of the hydrogen sulfide layer. There is pronounced temperature stratification within all holes. All of the sites investigated to date are relatively deep, by standards of recreational scuba diving, and divers utilized open circuit trimix to conduct the investigations. The key components of the documentation of the features included; vertical video transects, still photography, collections of biota, placement of recording thermographs and conductivity meters, collection of bottom sediment and rocks. During the course of the investigation a previously unknown cave feature over 45 meters in height and 76 meters in bottom diameter was discovered and dubbed Megadome. The cave is connected to the Gulf by a circular shaft approximately 0.75 meters in diameter and 11 meters in length. Instrumentation placed at Megadome revealed lunar tidal period water exchange, possible freshwater outflow and water movement in response to hurricanes. The upper portion of the cave contains a fouling community of sponges worms, tunicates and other organisms. A sink hole off the mouth of Tampa Bay was shown to contain a saltwater spring, venting at 72 m (235 fsw). Research is continuing. This research has been supported by a grant from NOAA, Ocean Exploration Program and Mote Marine Laboratory.

  8. Clinical Analysis of Midfacial Fractures

    PubMed Central

    Yamamoto, Kazuhiko; Matsusue, Yumiko; Horita, Satoshi; Murakami, Kazuhiro; Sugiura, Tsutomu; Kirita, Tadaaki

    2014-01-01

    Purpose: To analyze the features of midfacial fractures. Methods: Data of 320 patients treated for midfacial fractures during the past 10 years were retrospectively analyzed. Results: Patients were 192 male and 128 female. Their age ranged from 1 to 96 years old with the average of 42.1. Injury most frequently occurred by traffic accidents in 168 patients, followed by falls in 78, assaults in 31 and sports in 25. Pattern of the fractures was classified into zygoma in 159 patients, alveolus in 60, multiple sites in 54, maxilla in 45 and nasal bone in 2. Facial injury severity scale ranged from 1 to 12 with the average of 1.52. Injuries to other sites of the body were found in 90 patients. Fractures of multiple sites showed higher facial injury severity scale and were associated with injuries to other sites of the body at a higher rate. Observation was most frequently chosen in 153 patients, followed by open reduction and internal fixation in 72, intramaxillary fixation in 43 and transcutaneous reduction in 26. Conclusions: Midfacial fractures showed a variety of features in terms of the site and severity and associated injuries. Understanding these features is important to manage these patients properly. PMID:24757396

  9. Learning Semantic Tags from Big Data for Clinical Text Representation.

    PubMed

    Li, Yanpeng; Liu, Hongfang

    2015-01-01

    In clinical text mining, it is one of the biggest challenges to represent medical terminologies and n-gram terms in sparse medical reports using either supervised or unsupervised methods. Addressing this issue, we propose a novel method for word and n-gram representation at semantic level. We first represent each word by its distance with a set of reference features calculated by reference distance estimator (RDE) learned from labeled and unlabeled data, and then generate new features using simple techniques of discretization, random sampling and merging. The new features are a set of binary rules that can be interpreted as semantic tags derived from word and n-grams. We show that the new features significantly outperform classical bag-of-words and n-grams in the task of heart disease risk factor extraction in i2b2 2014 challenge. It is promising to see that semantics tags can be used to replace the original text entirely with even better prediction performance as well as derive new rules beyond lexical level.

  10. Mud Volcanoes - A New Class of Sites for Geological and Astrobiological Exploration of Mars

    NASA Technical Reports Server (NTRS)

    Allen, C.C.; Oehler, D.Z.; Baker, D.M.

    2009-01-01

    Mud volcanoes provide a unique low-temperature window into the Earth s subsurface - including the deep biosphere - and may prove to be significant sources of atmospheric methane. The identification of analogous features on Mars would provide an important new class of sites for geological and astrobiological exploration. We report new work suggesting that features in Acidalia Planitia are most consistent with their being mud volcanoes.

  11. Prediction of human disease-associated phosphorylation sites with combined feature selection approach and support vector machine.

    PubMed

    Xu, Xiaoyi; Li, Ao; Wang, Minghui

    2015-08-01

    Phosphorylation is a crucial post-translational modification, which regulates almost all cellular processes in life. It has long been recognised that protein phosphorylation has close relationship with diseases, and therefore many researches are undertaken to predict phosphorylation sites for disease treatment and drug design. However, despite the success achieved by these approaches, no method focuses on disease-associated phosphorylation sites prediction. Herein, for the first time the authors propose a novel approach that is specially designed to identify associations between phosphorylation sites and human diseases. To take full advantage of local sequence information, a combined feature selection method-based support vector machine (CFS-SVM) that incorporates minimum-redundancy-maximum-relevance filtering process and forward feature selection process is developed. Performance evaluation shows that CFS-SVM is significantly better than the widely used classifiers including Bayesian decision theory, k nearest neighbour and random forest. With the extremely high specificity of 99%, CFS-SVM can still achieve a high sensitivity. Besides, tests on extra data confirm the effectiveness and general applicability of CFS-SVM approach on a variety of diseases. Finally, the analysis of selected features and corresponding kinases also help the understanding of the potential mechanism of disease-phosphorylation relationships and guide further experimental validations.

  12. Discovering the Ancient Maya from Space

    NASA Technical Reports Server (NTRS)

    Sever, T. L.

    2008-01-01

    The Pet6n region of northern Guatemala contains some of the most significant Mayan archeological sites in Latin America. It was in this region that the Maya civilization began, flourished, and abruptly disappeared. Remote sensing technology is helping to locate and map ancient Maya sites that are threatened today by accelerating deforestation and looting. Thematic Mapper, IKONOS, and QuickBird satellite, and airborne STAR-3i and AIRSAR radar data, combined with Global Positioning System (GPS) technology, are successfully detecting ancient Maya features such as sites, roadways, canals, and water reservoirs. Satellite imagery is also being used to map the bajos, which are seasonally flooded swamps that cover over 40% of the land surface. Through the use of various airborne and satellite sensor systems we have been able to detect and map ancient causeways, temples, reservoirs, and land forms, and locate these features on the ground through GPS technology. Recently, we have discovered that there is a strong relationship between a tropical forest vegetation signature in satellite imagery and the location of archeological sites. We believe that the use of limestone and lime plasters in ancient Maya construction affects the moisture, nutrition, and plant species of the surface vegetation. We have mapped these vegetation signatures in the imagery and verified through field survey that they are indicative of archeological sites. Through the use of remote sensing and GIS technology it is possible to identify unrecorded archeological features in a dense tropical forest environment and monitor these cultural features for their protection.

  13. Discovering the Ancient Maya From Space

    NASA Technical Reports Server (NTRS)

    Sever, T. L.

    2007-01-01

    The Peten region of northern Guatemala contains some of the most significant Mayan archeological sites in Latin America. It was in this region that the Maya civilization began, flourished, and abruptly disappeared. Remote sensing technology is helping to locate and map ancient Maya sites that are threatened today by accelerating deforestation and looting. Thematic Mapper, IKONOS, and QuickBird satellite, and airborne STAR-3i and AIRSAR radar data, combined with Global Positioning System (GPS) technology, are successfully detecting ancient Maya features such as sites, roadways, canals, and water reservoirs. Satellite imagery is also being used to map the bajos, which are seasonally flooded swamps that cover over 40% of the land surface. Through the use of various airborne and satellite sensor systems we have been able to detect and map ancient causeways, temples, reservoirs, and land forms, and locate these features on the ground through GPS technology. Recently, we have discovered that there is a strong relationship between a tropical forest vegetation signature in satellite imagery and the location of archeological sites. We believe that the use o f limestone and lime plasters in ancient Maya construction affects the moisture, nutrition, and plant species of the surface vegetation. We have mapped these vegetation signatures in the imagery and verified through field survey that they are indicative of archeological sites. Through the use of remote sensing and GIS technology it is possible to identify unrecorded archeological features in a dense tropical forest environment and monitor these cultural features for their protection.

  14. Parkes full polarization spectra of OH masers - II. Galactic longitudes 240° to 350°

    NASA Astrophysics Data System (ADS)

    Caswell, J. L.; Green, J. A.; Phillips, C. J.

    2014-04-01

    Full polarization measurements of 1665 and 1667 MHz OH masers at 261 sites of massive star formation have been made with the Parkes radio telescope. Here, we present the resulting spectra for 157 southern sources, complementing our previously published 104 northerly sources. For most sites, these are the first measurements of linear polarization, with good spectral resolution and complete velocity coverage. Our spectra exhibit the well-known predominance of highly circularly polarized features, interpreted as σ components of Zeeman patterns. Focusing on the generally weaker and rarer linear polarization, we found three examples of likely full Zeeman triplets (a linearly polarized π component, straddled in velocity by σ components), adding to the solitary example previously reported. We also identify 40 examples of likely isolated π components, contradicting past beliefs that π components might be extremely rare. These were recognized at 20 sites where a feature with high linear polarization on one transition is accompanied on the other transition by a matching feature, at the same velocity and also with significant linear polarization. Large velocity ranges are rare, but we find eight exceeding 25 km s-1, some of them indicating high-velocity blue-shifted outflows. Variability was investigated on time-scales of one year and over several decades. More than 20 sites (of 200) show high variability (intensity changes by factors of 4 or more) in some prominent features. Highly stable sites are extremely rare.

  15. Accurate in silico prediction of species-specific methylation sites based on information gain feature optimization.

    PubMed

    Wen, Ping-Ping; Shi, Shao-Ping; Xu, Hao-Dong; Wang, Li-Na; Qiu, Jian-Ding

    2016-10-15

    As one of the most important reversible types of post-translational modification, protein methylation catalyzed by methyltransferases carries many pivotal biological functions as well as many essential biological processes. Identification of methylation sites is prerequisite for decoding methylation regulatory networks in living cells and understanding their physiological roles. Experimental methods are limitations of labor-intensive and time-consuming. While in silicon approaches are cost-effective and high-throughput manner to predict potential methylation sites, but those previous predictors only have a mixed model and their prediction performances are not fully satisfactory now. Recently, with increasing availability of quantitative methylation datasets in diverse species (especially in eukaryotes), there is a growing need to develop a species-specific predictor. Here, we designed a tool named PSSMe based on information gain (IG) feature optimization method for species-specific methylation site prediction. The IG method was adopted to analyze the importance and contribution of each feature, then select the valuable dimension feature vectors to reconstitute a new orderly feature, which was applied to build the finally prediction model. Finally, our method improves prediction performance of accuracy about 15% comparing with single features. Furthermore, our species-specific model significantly improves the predictive performance compare with other general methylation prediction tools. Hence, our prediction results serve as useful resources to elucidate the mechanism of arginine or lysine methylation and facilitate hypothesis-driven experimental design and validation. The tool online service is implemented by C# language and freely available at http://bioinfo.ncu.edu.cn/PSSMe.aspx CONTACT: jdqiu@ncu.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  16. Semantic Role Labeling of Clinical Text: Comparing Syntactic Parsers and Features

    PubMed Central

    Zhang, Yaoyun; Jiang, Min; Wang, Jingqi; Xu, Hua

    2016-01-01

    Semantic role labeling (SRL), which extracts shallow semantic relation representation from different surface textual forms of free text sentences, is important for understanding clinical narratives. Since semantic roles are formed by syntactic constituents in the sentence, an effective parser, as well as an effective syntactic feature set are essential to build a practical SRL system. Our study initiates a formal evaluation and comparison of SRL performance on a clinical text corpus MiPACQ, using three state-of-the-art parsers, the Stanford parser, the Berkeley parser, and the Charniak parser. First, the original parsers trained on the open domain syntactic corpus Penn Treebank were employed. Next, those parsers were retrained on the clinical Treebank of MiPACQ for further comparison. Additionally, state-of-the-art syntactic features from open domain SRL were also examined for clinical text. Experimental results showed that retraining the parsers on clinical Treebank improved the performance significantly, with an optimal F1 measure of 71.41% achieved by the Berkeley parser. PMID:28269926

  17. Analyzing the Cohesion of English Text and Discourse with Automated Computer Tools

    ERIC Educational Resources Information Center

    Jeon, Moongee

    2014-01-01

    This article investigates the lexical and discourse features of English text and discourse with automated computer technologies. Specifically, this article examines the cohesion of English text and discourse with automated computer tools, Coh-Metrix and TEES. Coh-Metrix is a text analysis computer tool that can analyze English text and discourse…

  18. Text Simplification and Comprehensible Input: A Case for an Intuitive Approach

    ERIC Educational Resources Information Center

    Crossley, Scott A.; Allen, David; McNamara, Danielle S.

    2012-01-01

    Texts are routinely simplified to make them more comprehensible for second language learners. However, the effects of simplification upon the linguistic features of texts remain largely unexplored. Here we examine the effects of one type of text simplification: intuitive text simplification. We use the computational tool, Coh-Metrix, to examine…

  19. The Distribution of the Informative Intensity of the Text in Terms of its Structure (On Materials of the English Texts in the Mining Sphere)

    NASA Astrophysics Data System (ADS)

    Znikina, Ludmila; Rozhneva, Elena

    2017-11-01

    The article deals with the distribution of informative intensity of the English-language scientific text based on its structural features contributing to the process of formalization of the scientific text and the preservation of the adequacy of the text with derived semantic information in relation to the primary. Discourse analysis is built on specific compositional and meaningful examples of scientific texts taken from the mining field. It also analyzes the adequacy of the translation of foreign texts into another language, the relationships between elements of linguistic systems, the degree of a formal conformance, translation with the specific objectives and information needs of the recipient. Some key words and ideas are emphasized in the paragraphs of the English-language mining scientific texts. The article gives the characteristic features of the structure of paragraphs of technical text and examples of constructions in English scientific texts based on a mining theme with the aim to explain the possible ways of their adequate translation.

  20. Learning from Science Text: Role of an Elaborate Analogy. Reading Research Report No. 71.

    ERIC Educational Resources Information Center

    Glynn, Shawn M.

    A study examined the role that an elaborate analogy can play when high school students learn a concept from a leading science textbook. The elaborate analogy had graphic and text components that integrated and mapped key features from the analogy (a factory) to the target concept (an animal cell). The target features were parts of the cell and, by…

  1. Product Recommendation System Based on Personal Preference Model Using CAM

    NASA Astrophysics Data System (ADS)

    Murakami, Tomoko; Yoshioka, Nobukazu; Orihara, Ryohei; Furukawa, Koichi

    Product recommendation system is realized by applying business rules acquired by data maining techniques. Business rules such as demographical patterns of purchase, are able to cover the groups of users that have a tendency to purchase products, but it is difficult to recommend products adaptive to various personal preferences only by utilizing them. In addition to that, it is very costly to gather the large volume of high quality survey data, which is necessary for good recommendation based on personal preference model. A method collecting kansei information automatically without questionnaire survey is required. The constructing personal preference model from less favor data is also necessary, since it is costly for the user to input favor data. In this paper, we propose product recommendation system based on kansei information extracted by text mining and user's preference model constructed by Category-guided Adaptive Modeling, CAM for short. CAM is a feature construction method that can generate new features constructing the space where same labeled examples are close and different labeled examples are far away from some labeled examples. It is possible to construct personal preference model by CAM despite less information of likes and dislikes categories. In the system, retrieval agent gathers the products' specification and user agent manages preference model, user's likes and dislikes. Kansei information of the products is gained by applying text mining technique to the reputation documents about the products on the web site. We carry out some experimental studies to make sure that prefrence model obtained by our method performs effectively.

  2. Individual variation in nest size and nest site features of the Bornean orangutans (Pongo pygmaeus).

    PubMed

    Rayadin, Yaya; Saitoh, Takashi

    2009-05-01

    Nest construction is a daily habit of independent orangutans for sleeping or resting. Data on their nests have been used in various ecological studies (e.g., density estimation, ranging behavior, evolution of material culture) because they are the most observable field signs. We investigated nest size and nest site features of Bornean orangutans in the wild during 10 months' fieldwork at three sites in East Kalimantan, Indonesia: Kutai National Park, Birawa, and Meratus. To examine individual variation, we followed 31 individual orangutans and recorded the 92 nests they made for nest size (diameter) and nest site features (height of nest above ground, tree species used for the nest site, the diameter and height of the tree, whether the nest was new or reused, and nest location within the tree). Analyses taking age-sex classes of the focal individuals into consideration showed significant age-sex differences in nest size and location, but not in nest height or nest tree features (diameter, height of tree, and height of lowest branch). Mature orangutans (adult females, unflanged and flanged males) made larger nests than immatures (juveniles and adolescents). Flanged male orangutans with larger nests used stable locations for nesting sites and reused old nests more frequently than immatures. The overall proportion of nests in open (exposed) locations was higher than in closed (sheltered) locations. Flanged males and immatures frequently made open nests, whereas adult females with an infant preferred closed locations. The good correspondence between nest size and age-sex classes indicates that nest size variation may reflect body size and therefore age-sex variation in the population. (c) 2009 Wiley-Liss, Inc.

  3. A deep learning framework for modeling structural features of RNA-binding protein targets

    PubMed Central

    Zhang, Sai; Zhou, Jingtian; Hu, Hailin; Gong, Haipeng; Chen, Ligong; Cheng, Chao; Zeng, Jianyang

    2016-01-01

    RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp. PMID:26467480

  4. Site quality relationships for shortleaf pine

    Treesearch

    David L. Graney

    1986-01-01

    Existing information about site quality relationships for shortleaf pine (Pinus echinata Mill.) in the southeastern United States is reviewed in this paper. Estimates of site quality, whether from direct tree measurements or indirect estimates based on soil and site features, are only local observations for many points on the landscape. To be of value to the land...

  5. Social Networking Sites and Language Learning

    ERIC Educational Resources Information Center

    Brick, Billy

    2011-01-01

    This article examines a study of seven learners who logged their experiences on the language leaning social networking site Livemocha over a period of three months. The features of the site are described and the likelihood of their future success is considered. The learners were introduced to the Social Networking Site (SNS) and asked to learn a…

  6. 40 CFR 228.6 - Specific criteria for site selection.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 25 2011-07-01 2011-07-01 false Specific criteria for site selection... selection. (a) In the selection of disposal sites, in addition to other necessary or appropriate factors...) Existence at or in close proximity to the site of any significant natural or cultural features of historical...

  7. Factors in life science textbooks that may deter girls' interest in science

    NASA Astrophysics Data System (ADS)

    Potter, Ellen F.; Rosser, Sue V.

    In order to examine factors that may deter girls' interest in science, five seventh-grade life science textbooks were analyzed for sexism in language, images, and curricular content, and for features of activities that have been found to be useful for motivating girls. Although overt sexism was not apparent, subtle forms of sexism in the selection of language, images, and curricular content were found. Activities had some features useful to girls, but other features were seldom included. Teachers may wish to use differences that were found among texts as one basis for text selection.

  8. Two Different Approaches to Automated Mark Up of Emotions in Text

    NASA Astrophysics Data System (ADS)

    Francisco, Virginia; Hervás, Raqucl; Gervás, Pablo

    This paper presents two different approaches to automated marking up of texts with emotional labels. For the first approach a corpus of example texts previously annotated by human evaluators is mined for an initial assignment of emotional features to words. This results in a List of Emotional Words (LEW) which becomes a useful resource for later automated mark up. The mark up algorithm in this first approach mirrors closely the steps taken during feature extraction, employing for the actual assignment of emotional features a combination of the LEW resource and WordNet for knowledge-based expansion of words not occurring in LEW. The algorithm for automated mark up is tested against new text samples to test its coverage. The second approach mark up texts during their generation. We have a knowledge base which contains the necessary information for marking up the text. This information is related to actions and characters. The algorithm in this case employ the information of the knowledge database and decides the correct emotion for every sentence. The algorithm for automated mark up is tested against four different texts. The results of the two approaches are compared and discussed with respect to three main issues: relative adequacy of each one of the representations used, correctness and coverage of the proposed algorithms, and additional techniques and solutions that may be employed to improve the results.

  9. An RNAi-Enhanced Logic Circuit for Cancer Specific Detection and Destruction

    DTIC Science & Technology

    2013-02-01

    monomeric protein secreted by Corynebacterium diphtheriae, and pro-apoptotic members of Bcl-2 family: mBax (Mus musculus), hBax ( Homo sapiens ), and its...Gata3 mStaple. Intron- feature sequences – donor site, branch point, poly- pyrimidine tract, and acceptor site – were selected based on previously...sequences found in literature our intron features were chosen according SplicePort [4], an online analyzer that detects the likelihood of splicing to

  10. Geoheritage Values at Greenmantle Farm

    NASA Astrophysics Data System (ADS)

    Etches, J. D.

    2009-05-01

    The Greenmantle Farm occurrence near Wilberforce, Ontario is a marble feature within the Grenville Province of the Precambrian Shield that hosts a diverse suite of amphibole minerals. The marble is of undetermined petrogenesis, and is possibly either a primary carbonatite intrusion or a derived melt of metasedimentary origin. The site is the type locality for the rare mineral fluorrichterite. Other minerals of note are orthoclase and apatite. Crystal size is relatively large, and all minerals, with the exception of calcite, exhibit generally good to excellent euhedral form. Of note is that the mineral occurrences at this site have not been subjected to any human disturbance including mechanical or hand tool disruption. The site also provides excellent examples of a number of geological features and ecosystem dynamics. In particular, faulting, moisture regime landscape interrelationships, order of crystallization in zoned dykes, and calciphile plant associations are demonstrated. This site represents an exceptional viewing opportunity of an unspoiled mineral occurrence while providing illustrative examples of the interrelationship of abiotic and biotic features. In terms of research, the site will prove to be a valuable subject in regard to amphibole composition, amphibole differentiation in calcareous melts, and will ultimately provide insight into the formation of the occurrence. Determination of what circumstances these marble bodies formed under would add a significant piece of information to the complex history of the Grenville province. This research will be assisted by the completely uncompromised nature of the site. The potential educational value of the site for researchers and grade school students alike is exceptional.

  11. The role of topographic structure and soil macrofauna presence at spoil heaps during spontaneous succession.

    NASA Astrophysics Data System (ADS)

    Walmsley, Alena; Vachová, Pavla; Vach, Marek

    2016-04-01

    This research was investigating whether topographic features, which determine soil nutrient and moisture distribution, in combination with soil fauna (wireworm and earthworm) presence, affect plant community composition at a spontaneously revegetated post mining area with an undulating surface. Two sites of different age with 3 types of topographic features were selected, soil moisture and nutrient content were measured, plant community composition and soil macrofauna community was sampled at each position. Wireworms were present at all positions and were most abundant at bottoms of waves at the younger site; their presence was correlated with several plant species, but the direction of the interaction isn't clear. Earthworms were only present at the older site and had highest abundance at flat sections. Earthworm presence affected the amount of nitrogen in soil - the most nitrogen content was at the site with highest earthworm density and was followed by higher diversity of plant community. The plant community composition was generally correlated with plant available nutrient content - especially P and N. We infer that topographic features affect nutrient and soil fauna distribution, which consequently influences plant community composition.

  12. Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues

    NASA Astrophysics Data System (ADS)

    Adams, W. H.; Iyengar, Giridharan; Lin, Ching-Yung; Naphade, Milind Ramesh; Neti, Chalapathy; Nock, Harriet J.; Smith, John R.

    2003-12-01

    We present a learning-based approach to the semantic indexing of multimedia content using cues derived from audio, visual, and text features. We approach the problem by developing a set of statistical models for a predefined lexicon. Novel concepts are then mapped in terms of the concepts in the lexicon. To achieve robust detection of concepts, we exploit features from multiple modalities, namely, audio, video, and text. Concept representations are modeled using Gaussian mixture models (GMM), hidden Markov models (HMM), and support vector machines (SVM). Models such as Bayesian networks and SVMs are used in a late-fusion approach to model concepts that are not explicitly modeled in terms of features. Our experiments indicate promise in the proposed classification and fusion methodologies: our proposed fusion scheme achieves more than 10% relative improvement over the best unimodal concept detector.

  13. Marker Registration Technique for Handwritten Text Marker in Augmented Reality Applications

    NASA Astrophysics Data System (ADS)

    Thanaborvornwiwat, N.; Patanukhom, K.

    2018-04-01

    Marker registration is a fundamental process to estimate camera poses in marker-based Augmented Reality (AR) systems. We developed AR system that creates correspondence virtual objects on handwritten text markers. This paper presents a new method for registration that is robust for low-content text markers, variation of camera poses, and variation of handwritten styles. The proposed method uses Maximally Stable Extremal Regions (MSER) and polygon simplification for a feature point extraction. The experiment shows that we need to extract only five feature points per image which can provide the best registration results. An exhaustive search is used to find the best matching pattern of the feature points in two images. We also compared performance of the proposed method to some existing registration methods and found that the proposed method can provide better accuracy and time efficiency.

  14. The effects of pre-processing strategies in sentiment analysis of online movie reviews

    NASA Astrophysics Data System (ADS)

    Zin, Harnani Mat; Mustapha, Norwati; Murad, Masrah Azrifah Azmi; Sharef, Nurfadhlina Mohd

    2017-10-01

    With the ever increasing of internet applications and social networking sites, people nowadays can easily express their feelings towards any products and services. These online reviews act as an important source for further analysis and improved decision making. These reviews are mostly unstructured by nature and thus, need processing like sentiment analysis and classification to provide a meaningful information for future uses. In text analysis tasks, the appropriate selection of words/features will have a huge impact on the effectiveness of the classifier. Thus, this paper explores the effect of the pre-processing strategies in the sentiment analysis of online movie reviews. In this paper, supervised machine learning method was used to classify the reviews. The support vector machine (SVM) with linear and non-linear kernel has been considered as classifier for the classification of the reviews. The performance of the classifier is critically examined based on the results of precision, recall, f-measure, and accuracy. Two different features representations were used which are term frequency and term frequency-inverse document frequency. Results show that the pre-processing strategies give a significant impact on the classification process.

  15. Prominent feature extraction for review analysis: an empirical study

    NASA Astrophysics Data System (ADS)

    Agarwal, Basant; Mittal, Namita

    2016-05-01

    Sentiment analysis (SA) research has increased tremendously in recent times. SA aims to determine the sentiment orientation of a given text into positive or negative polarity. Motivation for SA research is the need for the industry to know the opinion of the users about their product from online portals, blogs, discussion boards and reviews and so on. Efficient features need to be extracted for machine-learning algorithm for better sentiment classification. In this paper, initially various features are extracted such as unigrams, bi-grams and dependency features from the text. In addition, new bi-tagged features are also extracted that conform to predefined part-of-speech patterns. Furthermore, various composite features are created using these features. Information gain (IG) and minimum redundancy maximum relevancy (mRMR) feature selection methods are used to eliminate the noisy and irrelevant features from the feature vector. Finally, machine-learning algorithms are used for classifying the review document into positive or negative class. Effects of different categories of features are investigated on four standard data-sets, namely, movie review and product (book, DVD and electronics) review data-sets. Experimental results show that composite features created from prominent features of unigram and bi-tagged features perform better than other features for sentiment classification. mRMR is a better feature selection method as compared with IG for sentiment classification. Boolean Multinomial Naïve Bayes) algorithm performs better than support vector machine classifier for SA in terms of accuracy and execution time.

  16. Building a sense of virtual community: the role of the features of social networking sites.

    PubMed

    Chen, Chi-Wen; Lin, Chiun-Sin

    2014-07-01

    In recent years, social networking sites have received increased attention because of the potential of this medium to transform business by building virtual communities. However, theoretical and empirical studies investigating how specific features of social networking sites contribute to building a sense of virtual community (SOVC)-an important dimension of a successful virtual community-are rare. Furthermore, SOVC scales have been developed, and research on this issue has been called for, but few studies have heeded this call. On the basis of prior literature, this study proposes that perceptions of the three most salient features of social networking sites-system quality (SQ), information quality (IQ), and social information exchange (SIE)-play a key role in fostering SOVC. In particular, SQ is proposed to increase IQ and SIE, and SIE is proposed to enhance IQ, both of which thereafter build SOVC. The research model was examined in the context of Facebook, one of the most popular social networking sites in the world. We adopted Blanchard's scales to measure SOVC. Data gathered using a Web-based questionnaire, and analyzed with partial least squares, were utilized to test the model. The results demonstrate that SIE, SQ, and IQ are the factors that form SOVC. The findings also suggest that SQ plays a fundamental role in supporting SIE and IQ in social networking sites. Implications for theory, practice, and future research directions are discussed.

  17. Analyzing and Integrating Models of Multiple Text Comprehension

    ERIC Educational Resources Information Center

    List, Alexandra; Alexander, Patricia A.

    2017-01-01

    We introduce a special issue featuring four theoretical models of multiple text comprehension. We present a central framework for conceptualizing the four models in this special issue. Specifically, we chart the models according to how they consider learner, texts, task, and context factors in explaining multiple text comprehension. In addition,…

  18. Text against Text: Counterbalancing the Hegemony of Assessment.

    ERIC Educational Resources Information Center

    Cosgrove, Cornelius

    A study examined whether composition specialists can counterbalance the potential privileging of the assessment perspective, or of self-appointed interpreters of that perspective, through the study of assessment discourse as text. Fourteen assessment texts were examined, most of them journal articles and most of them featuring the common…

  19. Syntactic Complexity as an Aspect of Text Complexity

    ERIC Educational Resources Information Center

    Frantz, Roger S.; Starr, Laura E.; Bailey, Alison L.

    2015-01-01

    Students' ability to read complex texts is emphasized in the Common Core State Standards (CCSS) for English Language Arts and Literacy. The standards propose a three-part model for measuring text complexity. Although the model presents a robust means for determining text complexity based on a variety of features inherent to a text as well as…

  20. Using the Text Structures of Information Books to Teach Writing in the Primary Grades

    ERIC Educational Resources Information Center

    Clark, Sarah K.; Jones, Cindy D.; Reutzel, D. Ray

    2013-01-01

    Teaching children in the primary grades the text structures and features used by authors of information text has been shown to improve comprehension of information texts and provide the scaffolding and support these children need in order to write their own information texts. As teachers implement the "English Language Arts Common Core State…

  1. Visual Design Guidelines for Improving Learning from Dynamic and Interactive Digital Text

    ERIC Educational Resources Information Center

    Jin, Sung-Hee

    2013-01-01

    Despite the dynamic and interactive features of digital text, the visual design guidelines for digital text are similar to those for printed text. The purpose of this study was to develop visual design guidelines for improving learning from dynamic and interactive digital text and to validate them by controlled testing. Two structure design…

  2. DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest.

    PubMed

    Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

    2018-01-05

    DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html.

  3. DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest

    PubMed Central

    Manavalan, Balachandran; Shin, Tae Hwan; Lee, Gwang

    2018-01-01

    DNase I hypersensitive sites (DHSs) are genomic regions that provide important information regarding the presence of transcriptional regulatory elements and the state of chromatin. Therefore, identifying DHSs in uncharacterized DNA sequences is crucial for understanding their biological functions and mechanisms. Although many experimental methods have been proposed to identify DHSs, they have proven to be expensive for genome-wide application. Therefore, it is necessary to develop computational methods for DHS prediction. In this study, we proposed a support vector machine (SVM)-based method for predicting DHSs, called DHSpred (DNase I Hypersensitive Site predictor in human DNA sequences), which was trained with 174 optimal features. The optimal combination of features was identified from a large set that included nucleotide composition and di- and trinucleotide physicochemical properties, using a random forest algorithm. DHSpred achieved a Matthews correlation coefficient and accuracy of 0.660 and 0.871, respectively, which were 3% higher than those of control SVM predictors trained with non-optimized features, indicating the efficiency of the feature selection method. Furthermore, the performance of DHSpred was superior to that of state-of-the-art predictors. An online prediction server has been developed to assist the scientific community, and is freely available at: http://www.thegleelab.org/DHSpred.html PMID:29416743

  4. Stopover habitats of spring migrating surf scoters in southeast Alaska

    USGS Publications Warehouse

    Lok, E.K.; Esler, Daniel N.; Takekawa, John Y.; De La Cruz, S.W.; Sean, Boyd W.; Nysewander, D.R.; Evenson, J.R.; Ward, D.H.

    2011-01-01

    Habitat conditions and nutrient reserve levels during spring migration have been suggested as important factors affecting population declines in waterfowl, emphasizing the need to identify key sites used during spring and understand habitat features and resource availability at stopover sites. We used satellite telemetry to identify stopover sites used by surf scoters migrating through southeast Alaska during spring. We then contrasted habitat features of these sites to those of random sites to determine habitat attributes corresponding to use by migrating scoters. We identified 14 stopover sites based on use by satellite tagged surf scoters from several wintering sites. We identified Lynn Canal as a particularly important stopover site for surf scoters originating throughout the Pacific winter range; approximately half of tagged coastally migrating surf scoters used this site, many for extended periods. Stopover sites were farther from the mainland coast and closer to herring spawn sites than random sites, whereas physical shoreline habitat attributes were generally poor predictors of site use. The geography and resource availability within southeast Alaska provides unique and potentially critical stopover habitat for spring migrating surf scoters. Our work identifies specific sites and habitat resources that deserve conservation and management consideration. Aggregations of birds are vulnerable to human activity impacts such as contaminant spills and resource management decisions. This information is of value to agencies and organizations responsible for emergency response planning, herring fisheries management, and bird and ecosystem conservation.

  5. Segmenting texts from outdoor images taken by mobile phones using color features

    NASA Astrophysics Data System (ADS)

    Liu, Zongyi; Zhou, Hanning

    2011-01-01

    Recognizing texts from images taken by mobile phones with low resolution has wide applications. It has been shown that a good image binarization can substantially improve the performances of OCR engines. In this paper, we present a framework to segment texts from outdoor images taken by mobile phones using color features. The framework consists of three steps: (i) the initial process including image enhancement, binarization and noise filtering, where we binarize the input images in each RGB channel, and apply component level noise filtering; (ii) grouping components into blocks using color features, where we compute the component similarities by dynamically adjusting the weights of RGB channels, and merge groups hierachically, and (iii) blocks selection, where we use the run-length features and choose the Support Vector Machine (SVM) as the classifier. We tested the algorithm using 13 outdoor images taken by an old-style LG-64693 mobile phone with 640x480 resolution. We compared the segmentation results with Tsar's algorithm, a state-of-the-art camera text detection algorithm, and show that our algorithm is more robust, particularly in terms of the false alarm rates. In addition, we also evaluated the impacts of our algorithm on the Abbyy's FineReader, one of the most popular commercial OCR engines in the market.

  6. Recent Developments on the Turbulence Modeling Resource Website (Invited)

    NASA Technical Reports Server (NTRS)

    Rumssey, Christopher L.

    2015-01-01

    The NASA Langley Turbulence Model Resource (TMR) website has been active for over five years. Its main goal of providing a one-stop, easily accessible internet site for up-to-date information on Reynolds-averaged Navier-Stokes turbulence models remains unchanged. In particular, the site strives to provide an easy way for users to verify their own implementations of widely-used turbulence models, and to compare the results from different models for a variety of simple unit problems covering a range of flow physics. Some new features have been recently added to the website. This paper documents the site's features, including recent developments, future plans, and open questions.

  7. Nevada National Security Site Environmental Report 2011 Attachment A: Site Description

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cathy Wills, ed.

    2012-09-12

    This attachment expands on the general description of the Nevada National Security Site (NNSS) presented in the Introduction to the Nevada National Security Site Environmental Report 2011. Included are subsections that summarize the site's geological, hydrological, climatological, and ecological setting and the cultural resources of the NNSS. The subsections are meant to aid the reader in understanding the complex physical and biological environment of the NNSS. An adequate knowledge of the site's environment is necessary to assess the environmental impacts of new projects, design and implement environmental monitoring activities for current site operations, and assess the impacts of site operationsmore » on the public residing in the vicinity of the NNSS. The NNSS environment contributes to several key features of the site that afford protection to the inhabitants of adjacent areas from potential exposure to radioactivity or other contaminants resulting from NNSS operations. These key features include the general remote location of the NNSS, restricted access, extended wind transport times, the great depths to slow-moving groundwater, little or no surface water, and low population density. This attachment complements the annual summary of monitoring program activities and dose assessments presented in the main body of this report.« less

  8. Impact geologists, beware!

    NASA Astrophysics Data System (ADS)

    Melosh, H. J.

    2017-09-01

    Impact geologists have long assumed that shock metamorphic features, such as planar fractures and Planar Deformation Features (PDFs) in quartz are reliable indicators of an extraterrestrial impact. A new paper by Chen et al. (2017) now shows that such features might arise in terrestrial lightning strikes, thus raising the bar for identification of impact sites.

  9. Insights into multimodal imaging classification of ADHD

    PubMed Central

    Colby, John B.; Rudie, Jeffrey D.; Brown, Jesse A.; Douglas, Pamela K.; Cohen, Mark S.; Shehzad, Zarrar

    2012-01-01

    Attention deficit hyperactivity disorder (ADHD) currently is diagnosed in children by clinicians via subjective ADHD-specific behavioral instruments and by reports from the parents and teachers. Considering its high prevalence and large economic and societal costs, a quantitative tool that aids in diagnosis by characterizing underlying neurobiology would be extremely valuable. This provided motivation for the ADHD-200 machine learning (ML) competition, a multisite collaborative effort to investigate imaging classifiers for ADHD. Here we present our ML approach, which used structural and functional magnetic resonance imaging data, combined with demographic information, to predict diagnostic status of individuals with ADHD from typically developing (TD) children across eight different research sites. Structural features included quantitative metrics from 113 cortical and non-cortical regions. Functional features included Pearson correlation functional connectivity matrices, nodal and global graph theoretical measures, nodal power spectra, voxelwise global connectivity, and voxelwise regional homogeneity. We performed feature ranking for each site and modality using the multiple support vector machine recursive feature elimination (SVM-RFE) algorithm, and feature subset selection by optimizing the expected generalization performance of a radial basis function kernel SVM (RBF-SVM) trained across a range of the top features. Site-specific RBF-SVMs using these optimal feature sets from each imaging modality were used to predict the class labels of an independent hold-out test set. A voting approach was used to combine these multiple predictions and assign final class labels. With this methodology we were able to predict diagnosis of ADHD with 55% accuracy (versus a 39% chance level in this sample), 33% sensitivity, and 80% specificity. This approach also allowed us to evaluate predictive structural and functional features giving insight into abnormal brain circuitry in ADHD. PMID:22912605

  10. Seafloor massive sulfide deposits support unique megafaunal assemblages: Implications for seabed mining and conservation.

    PubMed

    Boschen, Rachel E; Rowden, Ashley A; Clark, Malcolm R; Pallentin, Arne; Gardner, Jonathan P A

    2016-04-01

    Mining of seafloor massive sulfides (SMS) is imminent, but the ecology of assemblages at SMS deposits is poorly known. Proposed conservation strategies include protected areas to preserve biodiversity at risk from mining impacts. Determining site suitability requires biological characterisation of the mine site and protected area(s). Video survey of a proposed mine site and protected area off New Zealand revealed unique megafaunal assemblages at the mine site. Significant relationships were identified between assemblage structure and environmental conditions, including hydrothermal features. Unique assemblages occurred at both active and inactive chimneys and are particularly at risk from mining-related impacts. The occurrence of unique assemblages at the mine site suggests that the proposed protected area is insufficient alone and should instead form part of a network. These results provide support for including hydrothermally active and inactive features within networks of protected areas and emphasise the need for quantitative survey data of proposed sites. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  11. Mission to Malapert

    NASA Astrophysics Data System (ADS)

    Otten, N. D.; Amoroso, E.; Jones, H. L.; Kitchell, F.; Wettergreen, D. S.; Whittaker, W. L.

    2016-11-01

    This work presents methodology for evaluating lunar landing site amenability and identifies promising sites for landing on Malapert Mountain, which features shallow slopes, uninterrupted Earth visibility, and ten-plus days of uninterrupted sunlight.

  12. Does an Interactive WebCT Site Help Students Learn?

    ERIC Educational Resources Information Center

    Elicker, Joelle D.; O'Malley, Alison L.; Williams, Christine M.

    2008-01-01

    We examined whether students with access to a supplemental course Web site enhanced with e-mail, discussion boards, and chat room capability reacted to it more positively than students who used a Web site with the same content but no communication features. Students used the Web sites on a voluntary basis. At the end of the semester, students…

  13. Preliminary analysis of Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) for mineralogic mapping at sites in Nevada and Colorado

    NASA Technical Reports Server (NTRS)

    Kruse, Fred A.; Taranik, Dan L.; Kierein-Young, Kathryn S.

    1988-01-01

    Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) data for sites in Nevada and Colorado were evaluated to determine their utility for mineralogical mapping in support of geologic investigations. Equal energy normalization is commonly used with imaging spectrometer data to reduce albedo effects. Spectra, profiles, and stacked, color-coded spectra were extracted from the AVIRIS data using an interactive analysis program (QLook) and these derivative data were compared to Airborne Imaging Spectrometer (AIS) results, field and laboratory spectra, and geologic maps. A feature extraction algorithm was used to extract and characterize absorption features from AVIRIS and laboratory spectra, allowing direct comparison of the position and shape of absorption features. Both muscovite and carbonate spectra were identified in the Nevada AVIRIS data by comparison with laboratory and AIS spectra, and an image was made that showed the distribution of these minerals for the entire site. Additional, distinctive spectra were located for an unknown mineral. For the two Colorado sites, the signal-to-noise problem was significantly worse and attempts to extract meaningful spectra were unsuccessful. Problems with the Colorado AVIRIS data were accentuated by the IAR reflectance technique because of moderate vegetation cover. Improved signal-to-noise and alternative calibration procedures will be required to produce satisfactory reflectance spectra from these data. Although the AVIRIS data were useful for mapping strong mineral absorption features and producing mineral maps at the Nevada site, it is clear that significant improvements to the instrument performance are required before AVIRIS will be an operational instrument.

  14. Textpresso site-specific recombinases: A text-mining server for the recombinase literature including Cre mice and conditional alleles.

    PubMed

    Urbanski, William M; Condie, Brian G

    2009-12-01

    Textpresso Site Specific Recombinases (http://ssrc.genetics.uga.edu/) is a text-mining web server for searching a database of more than 9,000 full-text publications. The papers and abstracts in this database represent a wide range of topics related to site-specific recombinase (SSR) research tools. Included in the database are most of the papers that report the characterization or use of mouse strains that express Cre recombinase as well as papers that describe or analyze mouse lines that carry conditional (floxed) alleles or SSR-activated transgenes/knockins. The database also includes reports describing SSR-based cloning methods such as the Gateway or the Creator systems, papers reporting the development or use of SSR-based tools in systems such as Drosophila, bacteria, parasites, stem cells, yeast, plants, zebrafish, and Xenopus as well as publications that describe the biochemistry, genetics, or molecular structure of the SSRs themselves. Textpresso Site Specific Recombinases is the only comprehensive text-mining resource available for the literature describing the biology and technical applications of SSRs. (c) 2009 Wiley-Liss, Inc.

  15. A Lithium-ion Battery Using Partially Lithiated Graphite Anode and Amphi-redox LiMn2O4 Cathode.

    PubMed

    Jeon, Yuju; Noh, Hyun Kuk; Song, Hyun-Kon

    2017-11-01

    Delithiation followed by lithiation of Li + -occupied (n-type) tetrahedral sites of cubic LiMn 2 O 4 spinel (LMO) at ~4 [Formula: see text] (delivering ~100 mAh g LMO -1 ) has been used for energy storage by lithium ion batteries (LIBs). In this work, we utilized unoccupied (p-type) octahedral sites of LMO available for lithiation at ~3 [Formula: see text] (delivering additional ~100 mAh g LMO -1 ) that have never been used for LIBs in full-cell configuration. The whole capacity of amphi-redox LMO, including both oxidizable n-type and reducible p-type redox sites, at ~200 mAh g LMO -1 was realized by using the reactions both at 4 [Formula: see text] and 3 [Formula: see text]. Durable reversibility of the 3 V reaction was achieved by graphene-wrapping LMO nanoparticles (LMO@Gn). Prelithiated graphite (Li n C 6 , n < 1) was used as anodes to lithiate the unoccupied octahedral sites of LMO for the 3 V reaction.

  16. Creating Virtual Fieldwork Experiences of Geoheritage Sites as Educator Professional Development (Invited)

    NASA Astrophysics Data System (ADS)

    Duggan-Haas, D.

    2013-12-01

    Geoheritage sites are identified as such because they include excellent examples of geologic features or processes, or they have played an important role in the development of geologic understandings. These characteristics also make them excellent sites for teaching in the field, for teaching educators about the nature of fieldwork, and for making Virtual Fieldwork Experiences (VFEs, multimedia representations of field sites). Through the NSF-funded Regional and Local Earth (ReaL) Earth Inquiry Project, we have engaged educators in these practices. The nature of geoheritage sites is anomalous -- if this were not the case, the sites would not gain recognition. Anomalous features or processes can be powerful learning tools when placed into comparison with the more mundane, and the Earth system science of sites local to schools is likely to be mundane. By comparing the mundane and the extraordinary, it is hoped we can learn more about both. The professional development (PD) in ReaL Earth Inquiry begins with a face-to-face workshop within the teachers' region at a site that is interesting from an Earth system science perspective. Though we recognize and emphasize that all sites are interesting from an ESS perspective if you know how to look, the sites typically have features worthy of geoheritage designation. PD does not end with the end of the workshop but continues with online study groups where teachers work together to complete the workshop site VFE, and transition to work on VFEs of sites local to their schools. Throughout the program, participants engage in: - mentored fieldwork that pays attention to the skills and knowledge needed to lead fieldwork; - instruction in and use of a wide range of technologies for making VFEs; - study of a coherent conceptual framework connected to the project's driving question: Why does this place look the way it does? - and, use of resources for supporting all of the above The resources include templates for making VFEs and a framework summarized in the attached graphic organizer that features a series of questions that can be productively asked of any field site. By working with educators, we not only produce curriculum resources in the form of VFEs, we also engage in educator PD that produces evidence of its effectiveness, at least in terms of indications that educators are engaged in field study both at the workshop site and after they return home. Production of local VFEs sometimes involves students. The VFE Graphic Organizer, showing a series of questions that may be asked about any site, all under the project's driving question: Why does this place look the way it does?

  17. Change In Length of Stay and Readmissions among Hospitalized Medical Patients after Inpatient Medicine Service Adoption of Mobile Secure Text Messaging.

    PubMed

    Patel, Mitesh S; Patel, Neha; Small, Dylan S; Rosin, Roy; Rohrbach, Jeffrey I; Stromberg, Nathaniel; Hanson, C William; Asch, David A

    2016-08-01

    Changes in the medium of communication from paging to mobile secure text messaging may change clinical care, but the effects of these changes on patient outcomes have not been well examined. To evaluate the association between inpatient medicine service adoption of mobile secure text messaging and patient length of stay and readmissions. Observational study. Patients admitted to medicine services at the Hospital of the University of Pennsylvania (intervention site; n = 8995 admissions of 6484 patients) and Penn Presbyterian Medical Center (control site; n = 6799 admissions of 4977 patients) between May 1, 2012, and April 30, 2014. Mobile secure text messaging. Change in length of stay and 30-day readmissions, comparing patients at the intervention site to the control site before (May 1, 2012 to April 30, 2013) and after (May 1, 2013 to April 30, 2014) the intervention, adjusting for time trends and patient demographics, comorbidities, insurance, and disposition. During the pre-intervention period, the mean length of stay ranged from 4.0 to 5.0 days at the control site and from 5.2 to 6.7 days at the intervention site, but trends were similar. In the first month after the intervention, the mean length of stay was unchanged at the control site (4.7 to 4.7 days) but declined at the intervention site (6.0 to 5.4 days). Trends were mostly similar during the rest of the post-intervention period, ranging from 4.4 to 5.6 days at the control site and from 5.4 to 6.5 days at the intervention site. Readmission rates varied significantly within sites before and after the intervention, but overall trends were similar. In adjusted analyses, there was a significant decrease in length of stay for the intervention site relative to the control site during the post-intervention period compared to the pre-intervention period (-0.77 days ; 95 % CI, -1.14, -0.40; P < 0.001). There was no significant difference in the odds of readmission (OR, 0.97; 95 % CI: 0.81, 1.17; P = 0.77). These findings were supported by multiple sensitivity analyses. Compared to a control group over time, hospitalized medical patients on inpatient services whose care providers and staff were offered mobile secure text messaging showed a relative decrease in length of stay and no change in readmissions.

  18. Biological and functional relevance of CASP predictions

    PubMed Central

    Liu, Tianyun; Ish‐Shalom, Shirbi; Torng, Wen; Lafita, Aleix; Bock, Christian; Mort, Matthew; Cooper, David N; Bliven, Spencer; Capitani, Guido; Mooney, Sean D.

    2017-01-01

    Abstract Our goal is to answer the question: compared with experimental structures, how useful are predicted models for functional annotation? We assessed the functional utility of predicted models by comparing the performances of a suite of methods for functional characterization on the predictions and the experimental structures. We identified 28 sites in 25 protein targets to perform functional assessment. These 28 sites included nine sites with known ligand binding (holo‐sites), nine sites that are expected or suggested by experimental authors for small molecule binding (apo‐sites), and Ten sites containing important motifs, loops, or key residues with important disease‐associated mutations. We evaluated the utility of the predictions by comparing their microenvironments to the experimental structures. Overall structural quality correlates with functional utility. However, the best‐ranked predictions (global) may not have the best functional quality (local). Our assessment provides an ability to discriminate between predictions with high structural quality. When assessing ligand‐binding sites, most prediction methods have higher performance on apo‐sites than holo‐sites. Some servers show consistently high performance for certain types of functional sites. Finally, many functional sites are associated with protein‐protein interaction. We also analyzed biologically relevant features from the protein assemblies of two targets where the active site spanned the protein‐protein interface. For the assembly targets, we find that the features in the models are mainly determined by the choice of template. PMID:28975675

  19. A framework for semisupervised feature generation and its applications in biomedical literature mining.

    PubMed

    Li, Yanpeng; Hu, Xiaohua; Lin, Hongfei; Yang, Zhihao

    2011-01-01

    Feature representation is essential to machine learning and text mining. In this paper, we present a feature coupling generalization (FCG) framework for generating new features from unlabeled data. It selects two special types of features, i.e., example-distinguishing features (EDFs) and class-distinguishing features (CDFs) from original feature set, and then generalizes EDFs into higher-level features based on their coupling degrees with CDFs in unlabeled data. The advantage is: EDFs with extreme sparsity in labeled data can be enriched by their co-occurrences with CDFs in unlabeled data so that the performance of these low-frequency features can be greatly boosted and new information from unlabeled can be incorporated. We apply this approach to three tasks in biomedical literature mining: gene named entity recognition (NER), protein-protein interaction extraction (PPIE), and text classification (TC) for gene ontology (GO) annotation. New features are generated from over 20 GB unlabeled PubMed abstracts. The experimental results on BioCreative 2, AIMED corpus, and TREC 2005 Genomics Track show that 1) FCG can utilize well the sparse features ignored by supervised learning. 2) It improves the performance of supervised baselines by 7.8 percent, 5.0 percent, and 5.8 percent, respectively, in the tree tasks. 3) Our methods achieve 89.1, 64.5 F-score, and 60.1 normalized utility on the three benchmark data sets.

  20. Clinical features, proximate causes, and consequences of active convulsive epilepsy in Africa

    PubMed Central

    Kariuki, Symon M; Matuja, William; Akpalu, Albert; Kakooza-Mwesige, Angelina; Chabi, Martin; Wagner, Ryan G; Connor, Myles; Chengo, Eddie; Ngugi, Anthony K; Odhiambo, Rachael; Bottomley, Christian; White, Steven; Sander, Josemir W; Neville, Brian G R; Newton, Charles R J C

    2014-01-01

    Purpose Epilepsy is common in sub-Saharan Africa (SSA), but the clinical features and consequences are poorly characterized. Most studies are hospital-based, and few studies have compared different ecological sites in SSA. We described active convulsive epilepsy (ACE) identified in cross-sectional community-based surveys in SSA, to understand the proximate causes, features, and consequences. Methods We performed a detailed clinical and neurophysiologic description of ACE cases identified from a community survey of 584,586 people using medical history, neurologic examination, and electroencephalography (EEG) data from five sites in Africa: South Africa; Tanzania; Uganda; Kenya; and Ghana. The cases were examined by clinicians to discover risk factors, clinical features, and consequences of epilepsy. We used logistic regression to determine the epilepsy factors associated with medical comorbidities. Key Findings Half (51%) of the 2,170 people with ACE were children and 69% of seizures began in childhood. Focal features (EEG, seizure types, and neurologic deficits) were present in 58% of ACE cases, and these varied significantly with site. Status epilepticus occurred in 25% of people with ACE. Only 36% received antiepileptic drugs (phenobarbital was the most common drug [95%]), and the proportion varied significantly with the site. Proximate causes of ACE were adverse perinatal events (11%) for onset of seizures before 18 years; and acute encephalopathy (10%) and head injury prior to seizure onset (3%). Important comorbidities were malnutrition (15%), cognitive impairment (23%), and neurologic deficits (15%). The consequences of ACE were burns (16%), head injuries (postseizure) (1%), lack of education (43%), and being unmarried (67%) or unemployed (57%) in adults, all significantly more common than in those without epilepsy. Significance There were significant differences in the comorbidities across sites. Focal features are common in ACE, suggesting identifiable and preventable causes. Malnutrition and cognitive and neurologic deficits are common in people with ACE and should be integrated into the management of epilepsy in this region. Consequences of epilepsy such as burns, lack of education, poor marriage prospects, and unemployment need to be addressed. PMID:24116877

  1. Clinical features, proximate causes, and consequences of active convulsive epilepsy in Africa.

    PubMed

    Kariuki, Symon M; Matuja, William; Akpalu, Albert; Kakooza-Mwesige, Angelina; Chabi, Martin; Wagner, Ryan G; Connor, Myles; Chengo, Eddie; Ngugi, Anthony K; Odhiambo, Rachael; Bottomley, Christian; White, Steven; Sander, Josemir W; Neville, Brian G R; Newton, Charles R J C; Twine, Rhian; Gómez Olivé, F Xavier; Collinson, Mark; Kahn, Kathleen; Tollman, Stephen; Masanja, Honratio; Mathew, Alexander; Pariyo, George; Peterson, Stefan; Ndyomughenyi, Donald; Bauni, Evasius; Kamuyu, Gathoni; Odera, Victor Mung'ala; Mageto, James O; Ae-Ngibise, Ken; Akpalu, Bright; Agbokey, Francis; Adjei, Patrick; Owusu-Agyei, Seth; Kleinschmidt, Immo; Doku, Victor C K; Odermatt, Peter; Nutman, Thomas; Wilkins, Patricia; Noh, John

    2014-01-01

    Epilepsy is common in sub-Saharan Africa (SSA), but the clinical features and consequences are poorly characterized. Most studies are hospital-based, and few studies have compared different ecological sites in SSA. We described active convulsive epilepsy (ACE) identified in cross-sectional community-based surveys in SSA, to understand the proximate causes, features, and consequences. We performed a detailed clinical and neurophysiologic description of ACE cases identified from a community survey of 584,586 people using medical history, neurologic examination, and electroencephalography (EEG) data from five sites in Africa: South Africa; Tanzania; Uganda; Kenya; and Ghana. The cases were examined by clinicians to discover risk factors, clinical features, and consequences of epilepsy. We used logistic regression to determine the epilepsy factors associated with medical comorbidities. Half (51%) of the 2,170 people with ACE were children and 69% of seizures began in childhood. Focal features (EEG, seizure types, and neurologic deficits) were present in 58% of ACE cases, and these varied significantly with site. Status epilepticus occurred in 25% of people with ACE. Only 36% received antiepileptic drugs (phenobarbital was the most common drug [95%]), and the proportion varied significantly with the site. Proximate causes of ACE were adverse perinatal events (11%) for onset of seizures before 18 years; and acute encephalopathy (10%) and head injury prior to seizure onset (3%). Important comorbidities were malnutrition (15%), cognitive impairment (23%), and neurologic deficits (15%). The consequences of ACE were burns (16%), head injuries (postseizure) (1%), lack of education (43%), and being unmarried (67%) or unemployed (57%) in adults, all significantly more common than in those without epilepsy. There were significant differences in the comorbidities across sites. Focal features are common in ACE, suggesting identifiable and preventable causes. Malnutrition and cognitive and neurologic deficits are common in people with ACE and should be integrated into the management of epilepsy in this region. Consequences of epilepsy such as burns, lack of education, poor marriage prospects, and unemployment need to be addressed. Wiley Periodicals, Inc. © 2013 The Authors. Epilepsia published by Wiley Periodicals, Inc. on behalf of the International League Against Epilepsy.

  2. [Formula: see text]Official Position of the American Academy of Clinical Neuropsychology (AACN): Guidelines for Practicum Training in Clinical Neuropsychology.

    PubMed

    Nelson, Aaron P; Roper, Brad L; Slomine, Beth S; Morrison, Chris; Greher, Michael R; Janusz, Jennifer; Larson, Jennifer C; Meadows, Mary-Ellen; Ready, Rebecca E; Rivera Mindt, Monica; Whiteside, Doug M; Willment, Kim; Wodushek, Thomas R

    2015-01-01

    Practical experience is central to the education and training of neuropsychologists, beginning in graduate school and extending through postdoctoral fellowship. However, historically, little attention has been given to the structure and requirements of practicum training in clinical neuropsychology. A working group of senior-level neuropsychologists, as well as a current postdoctoral fellow, all from a diverse range of settings (The AACN Practicum Guidelines Workgroup), was formed to propose guidelines for practicum training in clinical neuropsychology. The Workgroup reviewed relevant literature and sought input from professional organizations involved in education and training in neuropsychology. The proposed guidelines provide a definition of practicum training in clinical neuropsychology, detail entry and exit criteria across competencies relevant to practicum training in clinical neuropsychology, and discuss the relationship between doctoral training programs and practicum training sites. The proposed guidelines also provide a methodology for competency-based evaluation of clinical neuropsychology practicum trainees and outline characteristics and features that are integral to an effective training environment. Although the guidelines discussed below may not be implemented in their entirety across all clinical neuropsychology practicum training sites, they are consistent with the latest developments in competency-based education.

  3. Using Publishers' Web Sites for Reference Collection Development.

    ERIC Educational Resources Information Center

    Holmberg, Melissa

    2000-01-01

    Analyzes the ways publishers' Web sites can be used by librarians to locate additional science and technology reference materials which fall within budget constraints while meeting the needs of the patrons. Reviews specific publishers' Web sites to compare features and show how they differ. (Author/LRW)

  4. Displaying employee testimonials on recruitment web sites: effects of communication media, employee race, and job seeker race on organizational attraction and information credibility.

    PubMed

    Walker, H Jack; Feild, Hubert S; Giles, William F; Armenakis, Achilles A; Bernerth, Jeremy B

    2009-09-01

    This study investigated participants' reactions to employee testimonials presented on recruitment Web sites. The authors manipulated the presence of employee testimonials, richness of media communicating testimonials (video with audio vs. picture with text), and representation of racial minorities in employee testimonials. Participants were more attracted to organizations and perceived information as more credible when testimonials were included on recruitment Web sites. Testimonials delivered via video with audio had higher attractiveness and information credibility ratings than those given via picture with text. Results also showed that Blacks responded more favorably, whereas Whites responded more negatively, to the recruiting organization as the proportion of minorities shown giving testimonials on the recruitment Web site increased. However, post hoc analyses revealed that use of a richer medium (video with audio vs. picture with text) to communicate employee testimonials tended to attenuate these racial effects.

  5. Rotation-invariant features for multi-oriented text detection in natural images.

    PubMed

    Yao, Cong; Zhang, Xin; Bai, Xiang; Liu, Wenyu; Ma, Yi; Tu, Zhuowen

    2013-01-01

    Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes.

  6. Optimising web site designs for people with learning disabilities

    PubMed Central

    Williams, Peter; Hennig, Christian

    2015-01-01

    Much relevant internet-mediated information is inaccessible to people with learning disabilities because of difficulties in navigating the web. This paper reports on the methods undertaken to determine how information can be optimally presented for this cohort. Qualitative work is outlined where attributes relating to site layout affecting usability were elicited. A study comparing web sites of different design layouts exhibiting these attributes is discussed, with the emphasis on methodology. Eight interfaces were compared using various combinations of menu position (vertical or horizontal), text size and the absence or presence of images to determine which attributes of a site have the greatest performance impact. Study participants were also asked for their preferences, via a ‘smiley-face’ rating scale and simple interviews. ‘Acquiescence bias’ was minimised by avoiding polar (‘yes/no’) interrogatives, achieved by asking participants to compare layouts (such as horizontal versus vertical menu), with reasons coaxed from those able to articulate them. Preferred designs were for large text and images. This was the reverse of those facilitating fastest retrieval times, a discrepancy due to preferences being judged on aesthetic considerations. Design recommendations that reconcile preference and performance findings are offered. These include using a horizontal menu, juxtaposing images and text, and reducing text from sentences to phrases, thus facilitating preferred large text without increasing task times. PMID:26097431

  7. Landscape features influence postrelease predation on endangered black-footed ferrets

    USGS Publications Warehouse

    Poessel, S.A.; Breck, S.W.; Biggins, D.E.; Livieri, T.M.; Crooks, K.R.; Angeloni, L.

    2011-01-01

    Predation can be a critical factor influencing recovery of endangered species. In most recovery efforts lethal and nonlethal influences of predators are not sufficiently understood to allow prediction of predation risk, despite its importance. We investigated whether landscape features could be used to model predation risk from coyotes (Canis latrans) and great horned owls (Bubo virginianus) on the endangered black-footed ferret (Mustela nigripes). We used location data of reintroduced ferrets from 3 sites in South Dakota to determine whether exposure to landscape features typically associated with predators affected survival of ferrets, and whether ferrets considered predation risk when choosing habitat near perches potentially used by owls or near linear features predicted to be used by coyotes. Exposure to areas near likely owl perches reduced ferret survival, but landscape features potentially associated with coyote movements had no appreciable effect on survival. Ferrets were located within 90 m of perches more than expected in 2 study sites that also had higher ferret mortality due to owl predation. Densities of potential coyote travel routes near ferret locations were no different than expected in all 3 sites. Repatriated ferrets might have selected resources based on factors other than predator avoidance. Considering an easily quantified landscape feature (i.e., owl perches) can enhance success of reintroduction efforts for ferrets. Nonetheless, development of predictive models of predation risk and management strategies to mitigate that risk is not necessarily straightforward for more generalist predators such as coyotes. ?? 2011 American Society of Mammalogists.

  8. What Experiences Do Expository Books on Recommended Book Lists Offer to K-2 Students?

    ERIC Educational Resources Information Center

    Kletzien, Sharon B.; Dreher, Mariam Jean

    2017-01-01

    Teachers can use expository texts to teach academic vocabulary, content knowledge, text structure, and text features. National associations' recommended book lists are often used to identify books for classrooms. Previously we identified expository texts on these lists from 2001-2002 and 2011-2012. The current study explored instructional…

  9. Connectives and Layout as Processing Signals: How Textual Features Affect Students' Processing and Text Representation

    ERIC Educational Resources Information Center

    van Silfhout, Gerdineke; Evers-Vermeul, Jacqueline; Mak, Willem M.; Sanders, Ted J. M.

    2014-01-01

    When students read their school text, they may make a coherent mental representation of it that contains coherence relations between the text segments. The construction of such a representation is a prerequisite for learning from texts. This article focuses on the influence of connectives ("therefore," "furthermore") and layout…

  10. Text Mapping Plus: Improving Comprehension through Supported Retellings

    ERIC Educational Resources Information Center

    Lapp, Diane; Fisher, Douglas; Johnson, Kelly

    2010-01-01

    Modeled in this column is the teaching of a text mapping routine that supports students reading and remembering the salient features of the text. The authors renamed the story mapping technique "text mapping plus" because they found that as students added relational words and graphics to their maps their retells of both fiction and nonnarrative…

  11. Navigation and Comprehension of Digital Expository Texts: Hypertext Structure, Previous Domain Knowledge, and Working Memory Capacity

    ERIC Educational Resources Information Center

    Burin, Debora I.; Barreyro, Juan P.; Saux, Gastón; Irrazábal, Natalia C.

    2015-01-01

    Introduction: In contemporary information societies, reading digital text has become pervasive. One of the most distinctive features of digital texts is their internal connections via hyperlinks, resulting in non-linear hypertexts. Hypertext structure and previous knowledge affect navigation and comprehension of digital expository texts. From the…

  12. PubMed enhancements: fulfilling the promise of a great product.

    PubMed

    Schott, Michael J

    2004-01-01

    There have been many recent changes to PubMed to enhance its usefulness. Those changes include: LinkOut Libraries (local holding field), PubMed Central (full-text articles archived by the National Library of Medicine), and LinkOut (access to full-text articles right from the PubMed citation). Medical librarians should be aware of how these features work to best assist their clients. These new features offer the possibility of true desktop access for library patrons. Not only will patrons appreciate these new features, but their use in libraries will literally change what we do, who does it, and how it is done.

  13. Extracting BI-RADS Features from Portuguese Clinical Texts.

    PubMed

    Nassif, Houssam; Cunha, Filipe; Moreira, Inês C; Cruz-Correia, Ricardo; Sousa, Eliana; Page, David; Burnside, Elizabeth; Dutra, Inês

    2012-01-01

    In this work we build the first BI-RADS parser for Portuguese free texts, modeled after existing approaches to extract BI-RADS features from English medical records. Our concept finder uses a semantic grammar based on the BIRADS lexicon and on iterative transferred expert knowledge. We compare the performance of our algorithm to manual annotation by a specialist in mammography. Our results show that our parser's performance is comparable to the manual method.

  14. Changes in blast zone albedo patterns around new martian impact craters

    NASA Astrophysics Data System (ADS)

    Daubar, I. J.; Dundas, C. M.; Byrne, S.; Geissler, P.; Bart, G. D.; McEwen, A. S.; Russell, P. S.; Chojnacki, M.; Golombek, M. P.

    2016-03-01

    "Blast zones" (BZs) around new martian craters comprise various albedo features caused by the initial impact, including diffuse halos, extended linear and arcuate rays, secondary craters, ejecta patterns, and dust avalanches. We examined these features for changes in repeat images separated by up to four Mars years. Here we present the first comprehensive survey of the qualitative and quantitative changes observed in impact blast zones over time. Such changes are most likely due to airfall of high-albedo dust restoring darkened areas to their original albedo, the albedo of adjacent non-impacted surfaces. Although some sites show drastic changes over short timescales, nearly half of the sites show no obvious changes over several Mars years. Albedo changes are more likely to occur at higher-latitude sites, lower-elevation sites, and at sites with smaller central craters. No correlation was seen between amount of change and Dust Cover Index, relative halo size, or historical regional albedo changes. Quantitative albedo measurements of the diffuse dark halos relative to their surroundings yielded estimates of fading lifetimes for these features. The average lifetime among sites with measurable fading is ∼15 Mars years; the median is ∼8 Mars years for a linear brightening. However, at approximately half of sites with three or more repeat images, a nonlinear function with rapid initial fading followed by a slow increase in albedo provides a better fit to the fading behavior; this would predict even longer lifetimes. The predicted lifetimes of BZs are comparable to those of slope streaks, and considered representative of fading by global atmospheric dust deposition; they last significantly longer than dust devil or rover tracks, albedo features that are erased by different processes. These relatively long lifetimes indicate that the measurement of the current impact rate by Daubar et al. (Daubar, I.J. et al. [2013]. Icarus 225, 506-516. http://dx.doi.org/10.1016/j.icarus.2013.04.009) does not suffer significantly from overall under-sampling due to blast zones fading before new impact sites can be initially discovered. However, the prevalence of changes seen around smaller craters may explain in part their shallower size frequency distribution.

  15. Changes in blast zone albedo patterns around new martian impact craters

    USGS Publications Warehouse

    Daubar, Ingrid J.; Dundas, Colin; Byrne, Shane; Geissler, Paul; Bart, Gwen; McEwen, Alfred S.; Russell, Patrick; Chojnacki, Matthew; Golombek, M.P.

    2016-01-01

    “Blast zones” (BZs) around new martian craters comprise various albedo features caused by the initial impact, including diffuse halos, extended linear and arcuate rays, secondary craters, ejecta patterns, and dust avalanches. We examined these features for changes in repeat images separated by up to four Mars years. Here we present the first comprehensive survey of the qualitative and quantitative changes observed in impact blast zones over time. Such changes are most likely due to airfall of high-albedo dust restoring darkened areas to their original albedo, the albedo of adjacent non-impacted surfaces. Although some sites show drastic changes over short timescales, nearly half of the sites show no obvious changes over several Mars years. Albedo changes are more likely to occur at higher-latitude sites, lower-elevation sites, and at sites with smaller central craters. No correlation was seen between amount of change and Dust Cover Index, relative halo size, or historical regional albedo changes. Quantitative albedo measurements of the diffuse dark halos relative to their surroundings yielded estimates of fading lifetimes for these features. The average lifetime among sites with measurable fading is ∼15 Mars years; the median is ∼8 Mars years for a linear brightening. However, at approximately half of sites with three or more repeat images, a nonlinear function with rapid initial fading followed by a slow increase in albedo provides a better fit to the fading behavior; this would predict even longer lifetimes. The predicted lifetimes of BZs are comparable to those of slope streaks, and considered representative of fading by global atmospheric dust deposition; they last significantly longer than dust devil or rover tracks, albedo features that are erased by different processes. These relatively long lifetimes indicate that the measurement of the current impact rate by Daubar et al. (Daubar, I.J. et al. [2013]. Icarus 225, 506–516. http://dx.doi.org/10.1016/j.icarus.2013.04.009) does not suffer significantly from overall under-sampling due to blast zones fading before new impact sites can be initially discovered. However, the prevalence of changes seen around smaller craters may explain in part their shallower size frequency distribution.

  16. General geology and geomorphology of the Mars Pathfinder landing site

    USGS Publications Warehouse

    Ward, A.W.; Gaddis, L.R.; Kirk, R.L.; Soderblom, L.A.; Tanaka, K.L.; Golombek, M.P.; Parker, T.J.; Greeley, Ronald; Kuzmin, R.O.

    1999-01-01

    The Mars Pathfinder (MPF) spacecraft landed on relatively young (late Hesperian-early Amazonian; 3.1-0.7 Ga) plains in Chryse Planitia near the mouth of Ares Vallis. Images returned from the spacecraft reveal a complex landscape of ridges and troughs, large hills and crater rims, rocks and boulders of various sizes and shapes, and surficial deposits, indicating a complex, multistage geologic history of the landing site. After the deposition of one or more bedrock units, depositional and erosional fluvial processes shaped much of the present landscape. Multiple erosional events are inferred on the basis of observations of numerous channels, different orientations of many streamlined tails from their associated knobs and hills, and superposition of lineations and streamlines. Medium- and small-scale features, interpreted to be related to late-stage drainage of floodwaters, are recognized in several areas at the landing site. Streamlined knobs and hills seen in Viking orbiter images support this inference, as they seem to be complex forms, partly erosional and partly depositional, and may also indicate a series of scouring and depositional events that, in some cases, further eroded or partially buried these landforms. Although features such as these are cited as evidence for catastrophic flooding at Ares Vallis, some of these features may also be ascribed to alternative primary or secondary depositional processes, such as glacial or mass-wasting processes. Close inspection of the landing site reveals rocks that are interpreted to be volcanic in origin and others that may be conglomeratic. If such sedimentary rocks are confirmed, fluvial processes have had a greater significance on Mars than previously thought. For the last several hundred million to few billion years, eolian processes have been dominant. Dunes and dune-like features, ventifacts, and deflation and exhumation features around several rocks probably are the most recent landforms. The relatively pristine nature of the overall landscape at the MPF site suggests weathering and erosion processes on Mars are exceptionally slow.

  17. Automatic detection and recognition of signs from natural scenes.

    PubMed

    Chen, Xilin; Yang, Jie; Zhang, Jing; Waibel, Alex

    2004-01-01

    In this paper, we present an approach to automatic detection and recognition of signs from natural scenes, and its application to a sign translation task. The proposed approach embeds multiresolution and multiscale edge detection, adaptive searching, color analysis, and affine rectification in a hierarchical framework for sign detection, with different emphases at each phase to handle the text in different sizes, orientations, color distributions and backgrounds. We use affine rectification to recover deformation of the text regions caused by an inappropriate camera view angle. The procedure can significantly improve text detection rate and optical character recognition (OCR) accuracy. Instead of using binary information for OCR, we extract features from an intensity image directly. We propose a local intensity normalization method to effectively handle lighting variations, followed by a Gabor transform to obtain local features, and finally a linear discriminant analysis (LDA) method for feature selection. We have applied the approach in developing a Chinese sign translation system, which can automatically detect and recognize Chinese signs as input from a camera, and translate the recognized text into English.

  18. Using complex networks for text classification: Discriminating informative and imaginative documents

    NASA Astrophysics Data System (ADS)

    de Arruda, Henrique F.; Costa, Luciano da F.; Amancio, Diego R.

    2016-01-01

    Statistical methods have been widely employed in recent years to grasp many language properties. The application of such techniques have allowed an improvement of several linguistic applications, such as machine translation and document classification. In the latter, many approaches have emphasised the semantical content of texts, as is the case of bag-of-word language models. These approaches have certainly yielded reasonable performance. However, some potential features such as the structural organization of texts have been used only in a few studies. In this context, we probe how features derived from textual structure analysis can be effectively employed in a classification task. More specifically, we performed a supervised classification aiming at discriminating informative from imaginative documents. Using a networked model that describes the local topological/dynamical properties of function words, we achieved an accuracy rate of up to 95%, which is much higher than similar networked approaches. A systematic analysis of feature relevance revealed that symmetry and accessibility measurements are among the most prominent network measurements. Our results suggest that these measurements could be used in related language applications, as they play a complementary role in characterising texts.

  19. Emphasizing Social Features in Information Portals: Effects on New Member Engagement

    PubMed Central

    Sharma, Nikhil; Butler, Brian S.; Irwin, Jeannie; Spallek, Heiko

    2013-01-01

    Many information portals are adding social features with hopes of enhancing the overall user experience. Invitations to join and welcome pages that highlight these social features are expected to encourage use and participation. While this approach is widespread and seems plausible, the effect of providing and highlighting social features remains to be tested. We studied the effects of emphasizing social features on users' response to invitations, their decisions to join, their willingness to provide profile information, and their engagement with the portal's social features. The results of a quasi-experiment found no significant effect of social emphasis in invitations on receivers' responsiveness. However, users receiving invitations highlighting social benefits were less likely to join the portal and provide profile information. Social emphasis in the initial welcome page for the site also was found to have a significant effect on whether individuals joined the portal, how much profile information they provided and shared, and how much they engaged with social features on the site. Unexpectedly, users who were welcomed in a social manner were less likely to join and provided less profile information; they also were less likely to engage with social features of the portal. This suggests that even in online contexts where social activity is an increasingly common feature, highlighting the presence of social features may not always be the optimal presentation strategy. PMID:23626487

  20. A Computer-Aided Abstracting Tool Kit.

    ERIC Educational Resources Information Center

    Craven, Timothy C.

    1993-01-01

    Reports on the development of a prototype computerized abstractor's assistant called TEXNET, a text network management system. Features of the system discussed include semantic dependency links; displays of text structure; basic text editing; extracting; weighting methods; and listings of frequent words. (Contains 25 references.) (LRW)

  1. Evaluation of the Relevance of a Web-Based "Ask an Expert" Feature: StratSoy and Soy and Human Health Queries.

    ERIC Educational Resources Information Center

    Wool, D. L.; Kanfer, A. G.; Michaels, J.; Thompson, S.; Morris, S. A.; Hasler, C. M.

    2000-01-01

    A study of the "Ask an Expert" feature of StratSoy, a Web-based information system, surveyed 50 users and 48 using it for the first time. Topic areas of interest and web site features desired by respondents were identified. (JOW)

  2. Which Academic Papers Do Researchers Tend to Feature on ResearchGate?

    ERIC Educational Resources Information Center

    Liu, Xuan Zhen; Fang, Hui

    2018-01-01

    Introduction: The academic social network site ResearchGate (www.researchgate.net) enables researchers to feature up to five of their research products (including papers, datasets and chapters) in a 'Featured research' section on their ResearchGate home page. This provides an opportunity to discover how researchers view their own publications.…

  3. Food and Beverage Brands that Market to Children and Adolescents on the Internet: A Content Analysis of Branded Web Sites

    ERIC Educational Resources Information Center

    Henry, Anna E.; Story, Mary

    2009-01-01

    Objective: To identify food and beverage brand Web sites featuring designated children's areas, assess marketing techniques present on those industry Web sites, and determine nutritional quality of branded food items marketed to children. Design: Systematic content analysis of food and beverage brand Web sites and nutrient analysis of food and…

  4. Environmental Assessment, Change in C-17 Flight Training Operations at Grant County International Airport, Washington by Joint Base Lewis-McChord

    DTIC Science & Technology

    2011-10-01

    ground (subsurface) deposits. Examples of prehistoric archaeological resources include village sites, campsites, lithic scatters, burials, hearths ...or hearth features), processing sites, caves and rock shelters, and petroglyph and pictograph sites. Examples of historic archaeological resources

  5. Explore Arctic Health.

    PubMed

    Lebow, Mahria

    2014-04-01

    The Arctic Health web site is a portal to Arctic-specific, health related content. The site provides expertly organized and annotated resources pertinent to northern peoples and places, including health information, research publications and environmental information. This site also features the Arctic Health Publications Database, which indexes an array of Arctic-related resources.

  6. Type 2 Diabetes Screening Test by Means of a Pulse Oximeter.

    PubMed

    Moreno, Enrique Monte; Lujan, Maria Jose Anyo; Rusinol, Montse Torrres; Fernandez, Paqui Juarez; Manrique, Pilar Nunez; Trivino, Cristina Aragon; Miquel, Magda Pedrosa; Rodriguez, Marife Alvarez; Burguillos, M Jose Gonzalez

    2017-02-01

    In this paper, we propose a method for screening for the presence of type 2 diabetes by means of the signal obtained from a pulse oximeter. The screening system consists of two parts: the first analyzes the signal obtained from the pulse oximeter, and the second consists of a machine-learning module. The system consists of a front end that extracts a set of features form the pulse oximeter signal. These features are based on physiological considerations. The set of features were the input of a machine-learning algorithm that determined the class of the input sample, i.e., whether the subject had diabetes or not. The machine-learning algorithms were random forests, gradient boosting, and linear discriminant analysis as benchmark. The system was tested on a database of [Formula: see text] subjects (two samples per subject) collected from five community health centers. The mean receiver operating characteristic area found was [Formula: see text]% (median value [Formula: see text]% and range [Formula: see text]%), with a specificity =  [Formula: see text]% for a threshold that gave a sensitivity = [Formula: see text]%. We present a screening method for detecting diabetes that has a performance comparable to the glycated haemoglobin (haemoglobin A1c HbA1c) test, does not require blood extraction, and yields results in less than 5 min.

  7. Two-Dimensional (2-D) Acoustic Fish Tracking at River Mile 85, Sacramento River, California

    DTIC Science & Technology

    2013-06-01

    on fish become known (USACE 2004). Levee repair and constructed habitat features included (1) protection of the toe and upper slopes of the bank...be recovered rather than being lost due to sediment dunes , large woody material floating downstream, and vandalism. The RM 85 site was a relatively...into the river channel. The addition of this material narrowed the channel and created a scour feature along the toe of the repair site. VPS array

  8. Remote Sensing in Archaeology: Visible Temporal Change of Archaeological Features of the Peten, Guatemala

    NASA Technical Reports Server (NTRS)

    Lowry, James D., Jr.

    1999-01-01

    The purpose of this archaeological research was two-fold; the location of Mayan sites and features in order to learn more of this cultural group, and the (cultural) preservation of these sites and features for the future using Landsat Thematic Mapper (TM) images. Because the rainy season, traditionally at least, lasts about six months (about June to December), the time of year the image is acquired plays an important role in spectral reflectance. Images from 1986, 1995, and 1997 were selected because it was felt they would provide the best opportunity for success in layering different bands from different years together to attempt to see features not completely visible in any one year. False-color composites were created including bands 3, 4, and 5 using a mixture of years and bands. One particular combination that yielded tremendously interesting results included band 5 from 1997, band 4 from 1995, and band 3 from 1986. A number of straight linear features (probably Mayan causeways) run through the bajos that Dr. Sever believes are features previously undiscovered. At this point, early indications are that this will be a successful method for locating "new" Mayan archaeological features in the Peten.

  9. Computer aided diagnosis system for Alzheimer disease using brain diffusion tensor imaging features selected by Pearson's correlation.

    PubMed

    Graña, M; Termenon, M; Savio, A; Gonzalez-Pinto, A; Echeveste, J; Pérez, J M; Besga, A

    2011-09-20

    The aim of this paper is to obtain discriminant features from two scalar measures of Diffusion Tensor Imaging (DTI) data, Fractional Anisotropy (FA) and Mean Diffusivity (MD), and to train and test classifiers able to discriminate Alzheimer's Disease (AD) patients from controls on the basis of features extracted from the FA or MD volumes. In this study, support vector machine (SVM) classifier was trained and tested on FA and MD data. Feature selection is done computing the Pearson's correlation between FA or MD values at voxel site across subjects and the indicative variable specifying the subject class. Voxel sites with high absolute correlation are selected for feature extraction. Results are obtained over an on-going study in Hospital de Santiago Apostol collecting anatomical T1-weighted MRI volumes and DTI data from healthy control subjects and AD patients. FA features and a linear SVM classifier achieve perfect accuracy, sensitivity and specificity in several cross-validation studies, supporting the usefulness of DTI-derived features as an image-marker for AD and to the feasibility of building Computer Aided Diagnosis systems for AD based on them. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  10. Image based book cover recognition and retrieval

    NASA Astrophysics Data System (ADS)

    Sukhadan, Kalyani; Vijayarajan, V.; Krishnamoorthi, A.; Bessie Amali, D. Geraldine

    2017-11-01

    In this we are developing a graphical user interface using MATLAB for the users to check the information related to books in real time. We are taking the photos of the book cover using GUI, then by using MSER algorithm it will automatically detect all the features from the input image, after this it will filter bifurcate non-text features which will be based on morphological difference between text and non-text regions. We implemented a text character alignment algorithm which will improve the accuracy of the original text detection. We will also have a look upon the built in MATLAB OCR recognition algorithm and an open source OCR which is commonly used to perform better detection results, post detection algorithm is implemented and natural language processing to perform word correction and false detection inhibition. Finally, the detection result will be linked to internet to perform online matching. More than 86% accuracy can be obtained by this algorithm.

  11. Discovering Astronomy: An Astro 101 e-book

    NASA Astrophysics Data System (ADS)

    Shawl, Stephen J.; Byrd, Gene; Deustua, Susana E.; LoPresto, Michael C.

    2016-01-01

    Discovering Astronomy, now available in its 6th edition as an eText, has many advantages and features for your students. We have partnered with etextink.com and WebAssign.net to produce an affordable set of cost-saving options for your students. Also available is the Discovering Astronomy Activity Manual, which provides students with an active-learning experience.Our etext is device independent and thus accessible through any web browser. Americans with Disabilities Act compatibility provides access for all students. Hotlinks to outside sites provide further information for interested students. Lecture demonstration videos of important concepts, made specifically for this new edition, are embedded within the text as appropriate. Students can highlight text, take notes, and bookmark locations within the text. Important terms are linked to the glossary. Search capabilities allow students to easily find what they want.Instructors can interact with their students directly through the etext once the class roster has been provided. For example, instructors can embed assignments into their students' etext and add their own notes and updates, which are immediately visible to their students.Updates can be quickly made by us as new findings become available. For example, updates from New Horizons were added at the time of the closest approach to Pluto, and an update on the recent announcement of current water on Mars was added the day of the announcement.We will present results of our own experience with college and high school students' use of Discovering Astronomy in online courses.Details of the book, a sample chapter, and other information are available at discoveringastronomy.weebly.com.

  12. Geospatial Analytics in Retail Site Selection and Sales Prediction.

    PubMed

    Ting, Choo-Yee; Ho, Chiung Ching; Yee, Hui Jia; Matsah, Wan Razali

    2018-03-01

    Studies have shown that certain features from geography, demography, trade area, and environment can play a vital role in retail site selection, largely due to the impact they asserted on retail performance. Although the relevant features could be elicited by domain experts, determining the optimal feature set can be intractable and labor-intensive exercise. The challenges center around (1) how to determine features that are important to a particular retail business and (2) how to estimate retail sales performance given a new location? The challenges become apparent when the features vary across time. In this light, this study proposed a nonintervening approach by employing feature selection algorithms and subsequently sales prediction through similarity-based methods. The results of prediction were validated by domain experts. In this study, data sets from different sources were transformed and aggregated before an analytics data set that is ready for analysis purpose could be obtained. The data sets included data about feature location, population count, property type, education status, and monthly sales from 96 branches of a telecommunication company in Malaysia. The finding suggested that (1) optimal retail performance can only be achieved through fulfillment of specific location features together with the surrounding trade area characteristics and (2) similarity-based method can provide solution to retail sales prediction.

  13. Intertextual Content Analysis: An Approach for Analysing Text-Related Discussions with Regard to Movability in Reading and How Text Content Is Handled

    ERIC Educational Resources Information Center

    Hallesson, Yvonne; Visén, Pia

    2018-01-01

    Reading and discussing texts as a means for learning subject content are regular features within educational contexts. This paper presents an approach for intertextual content analysis (ICA) of such text-related discussions revealing what the participants make of the text. Thus, in contrast to many other approaches for analysing conversation that…

  14. FAST: FAST Analysis of Sequences Toolbox

    PubMed Central

    Lawrence, Travis J.; Kauffman, Kyle T.; Amrine, Katherine C. H.; Carper, Dana L.; Lee, Raymond S.; Becich, Peter J.; Canales, Claudia J.; Ardell, David H.

    2015-01-01

    FAST (FAST Analysis of Sequences Toolbox) provides simple, powerful open source command-line tools to filter, transform, annotate and analyze biological sequence data. Modeled after the GNU (GNU's Not Unix) Textutils such as grep, cut, and tr, FAST tools such as fasgrep, fascut, and fastr make it easy to rapidly prototype expressive bioinformatic workflows in a compact and generic command vocabulary. Compact combinatorial encoding of data workflows with FAST commands can simplify the documentation and reproducibility of bioinformatic protocols, supporting better transparency in biological data science. Interface self-consistency and conformity with conventions of GNU, Matlab, Perl, BioPerl, R, and GenBank help make FAST easy and rewarding to learn. FAST automates numerical, taxonomic, and text-based sorting, selection and transformation of sequence records and alignment sites based on content, index ranges, descriptive tags, annotated features, and in-line calculated analytics, including composition and codon usage. Automated content- and feature-based extraction of sites and support for molecular population genetic statistics make FAST useful for molecular evolutionary analysis. FAST is portable, easy to install and secure thanks to the relative maturity of its Perl and BioPerl foundations, with stable releases posted to CPAN. Development as well as a publicly accessible Cookbook and Wiki are available on the FAST GitHub repository at https://github.com/tlawrence3/FAST. The default data exchange format in FAST is Multi-FastA (specifically, a restriction of BioPerl FastA format). Sanger and Illumina 1.8+ FastQ formatted files are also supported. FAST makes it easier for non-programmer biologists to interactively investigate and control biological data at the speed of thought. PMID:26042145

  15. The Use of Systemic-Functional Linguistics in Automated Text Mining

    DTIC Science & Technology

    2009-03-01

    what degree two or more documents are similar in terms of their meaning. Simply put, such a cognitive model aims to link the physical manifestation...These features, both in terms of frequency and their chaining across a text, were taken as salient stylistic features that had a direct relationship to...because SFL attempts to model these cognitive processes, this has the potential to improve NLP tasks by making them more ’human-like’. Secondly

  16. Extracting BI-RADS Features from Portuguese Clinical Texts

    PubMed Central

    Nassif, Houssam; Cunha, Filipe; Moreira, Inês C.; Cruz-Correia, Ricardo; Sousa, Eliana; Page, David; Burnside, Elizabeth; Dutra, Inês

    2013-01-01

    In this work we build the first BI-RADS parser for Portuguese free texts, modeled after existing approaches to extract BI-RADS features from English medical records. Our concept finder uses a semantic grammar based on the BIRADS lexicon and on iterative transferred expert knowledge. We compare the performance of our algorithm to manual annotation by a specialist in mammography. Our results show that our parser’s performance is comparable to the manual method. PMID:23797461

  17. Geologic Surface Effects of Underground Nuclear Testing, Buckboard Mesa, Climax Stock, Dome Mountain, Frenchman Flat, Rainier/Aqueduct Mesa, and Shoshone Mountain, Nevada Test Site, Nevada

    USGS Publications Warehouse

    Grasso, Dennis N.

    2003-01-01

    Surface effects maps were produced for 72 of 89 underground detonations conducted at the Frenchman Flat, Rainier Mesa and Aqueduct Mesa, Climax Stock, Shoshone Mountain, Buckboard Mesa, and Dome Mountain testing areas of the Nevada Test Site between August 10, 1957 (Saturn detonation, Area 12) and September 18, 1992 (Hunters Trophy detonation, Area 12). The ?Other Areas? Surface Effects Map Database, which was used to construct the maps shown in this report, contains digital reproductions of these original maps. The database is provided in both ArcGIS (v. 8.2) geodatabase format and ArcView (v. 3.2) shapefile format. This database contains sinks, cracks, faults, and other surface effects having a combined (cumulative) length of 136.38 km (84.74 mi). In GIS digital format, the user can view all surface effects maps simultaneously, select and view the surface effects of one or more sites of interest, or view specific surface effects by area or site. Three map layers comprise the database. They are: (1) the surface effects maps layer (oase_n27f), (2) the bar symbols layer (oase_bar_n27f), and (3) the ball symbols layer (oase_ball_n27f). Additionally, an annotation layer, named 'Ball_and_Bar_Labels,' and a polygon features layer, named 'Area12_features_poly_n27f,' are contained in the geodatabase version of the database. The annotation layer automatically labels all 295 ball-and-bar symbols shown on these maps. The polygon features layer displays areas of ground disturbances, such as rock spall and disturbed ground caused by the detonations. Shapefile versions of the polygon features layer in Nevada State Plane and Universal Transverse Mercator projections, named 'area12_features_poly_n27f.shp' and 'area12_features_poly_u83m.shp,' are also provided in the archive.

  18. Assessing Text Readability Using Cognitively Based Indices

    ERIC Educational Resources Information Center

    Crossley, Scott A.; Greenfield, Jerry; McNamara, Danielle S.

    2008-01-01

    Many programs designed to compute the readability of texts are narrowly based on surface-level linguistic features and take too little account of the processes which a reader brings to the text. This study is an exploratory examination of the use of Coh-Metrix, a computational tool that measures cohesion and text difficulty at various levels of…

  19. Using Text Sets to Facilitate Critical Thinking in Sixth Graders

    ERIC Educational Resources Information Center

    Scales, Roya Q.; Tracy, Kelly N.

    2017-01-01

    This case study examines features and processes of a sixth grade teacher (Jane) utilizing text sets as a tool for facilitating critical thinking. Jane's strong vision and student-centered beliefs informed her use of various texts to teach language arts as she worked to address demands of the Common Core State Standards. Text sets promoted multiple…

  20. Monitoring tobacco brand websites to understand marketing strategies aimed at tobacco product users and potential users.

    PubMed

    Escobedo, Patricia; Cruz, Tess Boley; Tsai, Kai-Ya; Allem, Jon-Patrick; Soto, Daniel W; Kirkpatrick, Matthew G; Pattarroyo, Monica; Unger, Jennifer B

    2017-09-11

    Limited information exists about strategies and methods used on brand marketing websites to transmit pro-tobacco messages to tobacco users and potential users. This study compared age verification methods, themes, interactive activities and links to social media across tobacco brand websites. This study examined 12 tobacco brand websites representing four tobacco product categories: cigarettes, cigar/cigarillos, smokeless tobacco, and e-cigarettes. Website content was analyzed by tobacco product category and data from all website visits (n = 699) were analyzed. Adult smokers (n=32) coded websites during a one-year period, indicating whether or not they observed any of 53 marketing themes, seven interactive activities, or five external links to social media sites. Most (58%) websites required online registration before entering, however e-cigarette websites used click-through age verification. Compared to cigarette sites, cigar/cigarillo sites were more likely to feature themes related to "party" lifestyle, and e-cigarette websites were much more likely to feature themes related to harm reduction. Cigarette sites featured greater levels of interactive content compared to other tobacco products. Compared to cigarette sites, cigar/cigarillo sites were more likely to feature activities related to events and music. Compared to cigarette sites, both cigar and e-cigarette sites were more likely to direct visitors to external social media sites. Marketing methods and strategies normalize tobacco use by providing website visitors with positive themes combined with interactive content, and is an area of future research. Moreover, all tobacco products under federal regulatory authority should be required to use more stringent age verification gates. Findings indicate the Food and Drug Administration (FDA) should require brand websites of all tobacco products under its regulatory authority use more stringent age verification gates by requiring all visitors be at least 18 years of age and register online prior to entry. This is important given that marketing strategies may encourage experimentation with tobacco or deter quit attempts among website visitors. Future research should examine the use of interactive activities and social media on a wide variety of tobacco brand websites as interactive content is associated with more active information processing. © The Author 2017. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  1. Munitions related feature extraction from LIDAR data.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Roberts, Barry L.

    2010-06-01

    The characterization of former military munitions ranges is critical in the identification of areas likely to contain residual unexploded ordnance (UXO). Although these ranges are large, often covering tens-of-thousands of acres, the actual target areas represent only a small fraction of the sites. The challenge is that many of these sites do not have records indicating locations of former target areas. The identification of target areas is critical in the characterization and remediation of these sites. The Strategic Environmental Research and Development Program (SERDP) and Environmental Security Technology Certification Program (ESTCP) of the DoD have been developing and implementing techniquesmore » for the efficient characterization of large munitions ranges. As part of this process, high-resolution LIDAR terrain data sets have been collected over several former ranges. These data sets have been shown to contain information relating to former munitions usage at these ranges, specifically terrain cratering due to high-explosives detonations. The location and relative intensity of crater features can provide information critical in reconstructing the usage history of a range, and indicate areas most likely to contain UXO. We have developed an automated procedure using an adaptation of the Circular Hough Transform for the identification of crater features in LIDAR terrain data. The Circular Hough Transform is highly adept at finding circular features (craters) in noisy terrain data sets. This technique has the ability to find features of a specific radius providing a means of filtering features based on expected scale and providing additional spatial characterization of the identified feature. This method of automated crater identification has been applied to several former munitions ranges with positive results.« less

  2. Network analysis of named entity co-occurrences in written texts

    NASA Astrophysics Data System (ADS)

    Amancio, Diego Raphael

    2016-06-01

    The use of methods borrowed from statistics and physics to analyze written texts has allowed the discovery of unprecedent patterns of human behavior and cognition by establishing links between models features and language structure. While current models have been useful to unveil patterns via analysis of syntactical and semantical networks, only a few works have probed the relevance of investigating the structure arising from the relationship between relevant entities such as characters, locations and organizations. In this study, we represent entities appearing in the same context as a co-occurrence network, where links are established according to a null model based on random, shuffled texts. Computational simulations performed in novels revealed that the proposed model displays interesting topological features, such as the small world feature, characterized by high values of clustering coefficient. The effectiveness of our model was verified in a practical pattern recognition task in real networks. When compared with traditional word adjacency networks, our model displayed optimized results in identifying unknown references in texts. Because the proposed representation plays a complementary role in characterizing unstructured documents via topological analysis of named entities, we believe that it could be useful to improve the characterization of written texts (and related systems), specially if combined with traditional approaches based on statistical and deeper paradigms.

  3. Development and Application of a Three-Dimensional Finite Element Vapor Intrusion Model

    PubMed Central

    Pennell, Kelly G.; Bozkurt, Ozgur; Suuberg, Eric M.

    2010-01-01

    Details of a three-dimensional finite element model of soil vapor intrusion, including the overall modeling process and the stepwise approach, are provided. The model is a quantitative modeling tool that can help guide vapor intrusion characterization efforts. It solves the soil gas continuity equation coupled with the chemical transport equation, allowing for both advective and diffusive transport. Three-dimensional pressure, velocity, and chemical concentration fields are produced from the model. Results from simulations involving common site features, such as impervious surfaces, porous foundation sub-base material, and adjacent structures are summarized herein. The results suggest that site-specific features are important to consider when characterizing vapor intrusion risks. More importantly, the results suggest that soil gas or subslab gas samples taken without proper regard for particular site features may not be suitable for evaluating vapor intrusion risks; rather, careful attention needs to be given to the many factors that affect chemical transport into and around buildings. PMID:19418819

  4. A Predictive Model of Intein Insertion Site for Use in the Engineering of Molecular Switches

    PubMed Central

    Apgar, James; Ross, Mary; Zuo, Xiao; Dohle, Sarah; Sturtevant, Derek; Shen, Binzhang; de la Vega, Humberto; Lessard, Philip; Lazar, Gabor; Raab, R. Michael

    2012-01-01

    Inteins are intervening protein domains with self-splicing ability that can be used as molecular switches to control activity of their host protein. Successfully engineering an intein into a host protein requires identifying an insertion site that permits intein insertion and splicing while allowing for proper folding of the mature protein post-splicing. By analyzing sequence and structure based properties of native intein insertion sites we have identified four features that showed significant correlation with the location of the intein insertion sites, and therefore may be useful in predicting insertion sites in other proteins that provide native-like intein function. Three of these properties, the distance to the active site and dimer interface site, the SVM score of the splice site cassette, and the sequence conservation of the site showed statistically significant correlation and strong predictive power, with area under the curve (AUC) values of 0.79, 0.76, and 0.73 respectively, while the distance to secondary structure/loop junction showed significance but with less predictive power (AUC of 0.54). In a case study of 20 insertion sites in the XynB xylanase, two features of native insertion sites showed correlation with the splice sites and demonstrated predictive value in selecting non-native splice sites. Structural modeling of intein insertions at two sites highlighted the role that the insertion site location could play on the ability of the intein to modulate activity of the host protein. These findings can be used to enrich the selection of insertion sites capable of supporting intein splicing and hosting an intein switch. PMID:22649521

  5. Biological and functional relevance of CASP predictions.

    PubMed

    Liu, Tianyun; Ish-Shalom, Shirbi; Torng, Wen; Lafita, Aleix; Bock, Christian; Mort, Matthew; Cooper, David N; Bliven, Spencer; Capitani, Guido; Mooney, Sean D; Altman, Russ B

    2018-03-01

    Our goal is to answer the question: compared with experimental structures, how useful are predicted models for functional annotation? We assessed the functional utility of predicted models by comparing the performances of a suite of methods for functional characterization on the predictions and the experimental structures. We identified 28 sites in 25 protein targets to perform functional assessment. These 28 sites included nine sites with known ligand binding (holo-sites), nine sites that are expected or suggested by experimental authors for small molecule binding (apo-sites), and Ten sites containing important motifs, loops, or key residues with important disease-associated mutations. We evaluated the utility of the predictions by comparing their microenvironments to the experimental structures. Overall structural quality correlates with functional utility. However, the best-ranked predictions (global) may not have the best functional quality (local). Our assessment provides an ability to discriminate between predictions with high structural quality. When assessing ligand-binding sites, most prediction methods have higher performance on apo-sites than holo-sites. Some servers show consistently high performance for certain types of functional sites. Finally, many functional sites are associated with protein-protein interaction. We also analyzed biologically relevant features from the protein assemblies of two targets where the active site spanned the protein-protein interface. For the assembly targets, we find that the features in the models are mainly determined by the choice of template. © 2017 The Authors Proteins: Structure, Function and Bioinformatics Published by Wiley Periodicals, Inc.

  6. View of EPA Farm cattle shelter (featuring horse trailer), facing ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    View of EPA Farm cattle shelter (featuring horse trailer), facing northwest - Nevada Test Site, Environmental Protection Agency Farm, Shelter Unit Type, Area 15, Yucca Flat, 10-2 Road near Circle Road, Mercury, Nye County, NV

  7. Why SRS Matters - F Area

    ScienceCinema

    Howell, Steve; Tadlock, Bill; Beeler, Dewitt; Gardner, Curt

    2018-06-22

    A video series presenting an overview of the Savannah River Site's (SRS) mission and operations. Each episode features a specific area/operation and how it contributes to help make the world safer. This episode features F Area's mission and operations.

  8. Why SRS Matters - E Area

    ScienceCinema

    Howell, Steve; Mooneyhan, Verne; Tempel, Kevin; Bullington, Michele

    2018-06-22

    A video series presenting an overview of the Savannah River Site's (SRS) mission and operations. Each episode features a specific area/operation and how it contributes to help make the world safer. This episode features E Area's mission and operations.

  9. Scoping review and evaluation of SMS/text messaging platforms for mHealth projects or clinical interventions.

    PubMed

    Iribarren, Sarah J; Brown, William; Giguere, Rebecca; Stone, Patricia; Schnall, Rebecca; Staggers, Nancy; Carballo-Diéguez, Alex

    2017-05-01

    Mobile technology supporting text messaging interventions (TMIs) continues to evolve, presenting challenges for researchers and healthcare professionals who need to choose software solutions to best meet their program needs. The objective of this review was to systematically identify and compare text messaging platforms and to summarize their advantages and disadvantages as described in peer-reviewed literature. A scoping review was conducted using four steps: 1) identify currently available platforms through online searches and in mHealth repositories; 2) expand evaluation criteria of an mHealth mobile messaging toolkit and integrate prior user experiences as researchers; 3) evaluate each platform's functions and features based on the expanded criteria and a vendor survey; and 4) assess the documentation of platform use in the peer-review literature. Platforms meeting inclusion criteria were assessed independently by three reviewers and discussed until consensus was reached. The PRISMA guidelines were followed to report findings. Of the 1041 potentially relevant search results, 27 platforms met inclusion criteria. Most were excluded because they were not platforms (e.g., guides, toolkits, reports, or SMS gateways). Of the 27 platforms, only 12 were identified in existing mHealth repositories, 10 from Google searches, while five were found in both. The expanded evaluation criteria included 22 items. Results indicate no uniform presentation of platform features and functions, often making these difficult to discern. Fourteen of the platforms were reported as open source, 10 focused on health care and 16 were tailored to meet needs of low resource settings (not mutually exclusive). Fifteen platforms had do-it-yourself setup (programming not required) while the remainder required coding/programming skills or setups could be built to specification by the vendor. Frequently described features included data security and access to the platform via cloud-based systems. Pay structures and reported targeted end-users varied. Peer-reviewed publications listed only 6 of the 27 platforms across 21 publications. The majority of these articles reported the name of the platform used but did not describe advantages or disadvantages. Searching for and comparing mHealth platforms for TMIs remains a challenge. The results of this review can serve as a resource for researchers and healthcare professionals wanting to integrate TMIs into health interventions. Steps to identify, compare and assess advantages and disadvantages are outlined for consideration. Expanded evaluation criteria can be used by future researchers. Continued and more comprehensive platform tools should be integrated into mHealth repositories. Detailed descriptions of platform advantages and disadvantages are needed when mHealth researchers publish findings to expand the body of research on TMI tools for healthcare. Standardized descriptions and features are recommended for vendor sites. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Hospital's redesigned Web site patient-friendly, comprehensive. Site one-of-a-kind in Twin Cities market area.

    PubMed

    Rees, T

    2001-01-01

    North Memorial Medical Center, Robbinsdale, Minn., has opened a brightly redesigned Web site. It is patient-friendly and features a different approach to provide healthcare information called "care areas," which are organized by condition, such as heart care, cancer care and childbirth. This approach led to the the site being named North Memorial Online Care Center.

  11. Genome-Wide Locations of Potential Epimutations Associated with Environmentally Induced Epigenetic Transgenerational Inheritance of Disease Using a Sequential Machine Learning Prediction Approach.

    PubMed

    Haque, M Muksitul; Holder, Lawrence B; Skinner, Michael K

    2015-01-01

    Environmentally induced epigenetic transgenerational inheritance of disease and phenotypic variation involves germline transmitted epimutations. The primary epimutations identified involve altered differential DNA methylation regions (DMRs). Different environmental toxicants have been shown to promote exposure (i.e., toxicant) specific signatures of germline epimutations. Analysis of genomic features associated with these epimutations identified low-density CpG regions (<3 CpG / 100bp) termed CpG deserts and a number of unique DNA sequence motifs. The rat genome was annotated for these and additional relevant features. The objective of the current study was to use a machine learning computational approach to predict all potential epimutations in the genome. A number of previously identified sperm epimutations were used as training sets. A novel machine learning approach using a sequential combination of Active Learning and Imbalance Class Learner analysis was developed. The transgenerational sperm epimutation analysis identified approximately 50K individual sites with a 1 kb mean size and 3,233 regions that had a minimum of three adjacent sites with a mean size of 3.5 kb. A select number of the most relevant genomic features were identified with the low density CpG deserts being a critical genomic feature of the features selected. A similar independent analysis with transgenerational somatic cell epimutation training sets identified a smaller number of 1,503 regions of genome-wide predicted sites and differences in genomic feature contributions. The predicted genome-wide germline (sperm) epimutations were found to be distinct from the predicted somatic cell epimutations. Validation of the genome-wide germline predicted sites used two recently identified transgenerational sperm epimutation signature sets from the pesticides dichlorodiphenyltrichloroethane (DDT) and methoxychlor (MXC) exposure lineage F3 generation. Analysis of this positive validation data set showed a 100% prediction accuracy for all the DDT-MXC sperm epimutations. Observations further elucidate the genomic features associated with transgenerational germline epimutations and identify a genome-wide set of potential epimutations that can be used to facilitate identification of epigenetic diagnostics for ancestral environmental exposures and disease susceptibility.

  12. GATA: A graphic alignment tool for comparative sequenceanalysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nix, David A.; Eisen, Michael B.

    2005-01-01

    Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dotplot analysis is often used to estimate non-coding sequence relatedness. Yet dotmore » plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.« less

  13. Raising Awareness of Individual Creative Potential in Bioscientists Using a Web-Site Based Approach

    ERIC Educational Resources Information Center

    Adams, David J.; Hugh-Jones, Siobhan; Sutherland, Ed

    2010-01-01

    We report the preliminary results of work with a unique, web-site-based approach designed to help individual bioscientists identify and develop their individual creative capacity. The site includes a number of features that encourage individuals to interact with creativity techniques, communicate with colleagues remotely using an electronic notice…

  14. Metropolitan open-space protection with uncertain site availability

    Treesearch

    Robert G. Haight; Stephanie A. Snyder; Charles S. Revelle

    2005-01-01

    Urban planners acquire open space to protect natural areas and provide public access to recreation opportunities. Because of limited budgets and dynamic land markets, acquisitions take place sequentially depending on available funds and sites. To address these planning features, we formulated a two-period site selection model with two objectives: maximize the...

  15. Blue Ribbon Web Sites Contest Winners.

    ERIC Educational Resources Information Center

    Southworth, Samuel A.

    2001-01-01

    Presents a collection of prize-winning Web sites created by K-8 teachers nationwide. Some of the unique features of the Web sites include an online student-written newspaper; a sing-along section; a chronicle of the past 3 years of classes to see how the classes have evolved; and student art and writing projects. (SM)

  16. 36 CFR 14.22 - Reimbursement of costs.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... paragraph (a)(3)(i) of this section (e.g., for communication sites, reservoir sites, plant sites, and other non-linear facilities)—$250 for each 40 acres or fraction thereof. (iii) If a project has the features... applicant an estimate, based on the best available cost information, of the costs which would be incurred by...

  17. 36 CFR 14.22 - Reimbursement of costs.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... paragraph (a)(3)(i) of this section (e.g., for communication sites, reservoir sites, plant sites, and other non-linear facilities)—$250 for each 40 acres or fraction thereof. (iii) If a project has the features... applicant an estimate, based on the best available cost information, of the costs which would be incurred by...

  18. 36 CFR 14.22 - Reimbursement of costs.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... paragraph (a)(3)(i) of this section (e.g., for communication sites, reservoir sites, plant sites, and other non-linear facilities)—$250 for each 40 acres or fraction thereof. (iii) If a project has the features... applicant an estimate, based on the best available cost information, of the costs which would be incurred by...

  19. 36 CFR 14.22 - Reimbursement of costs.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... paragraph (a)(3)(i) of this section (e.g., for communication sites, reservoir sites, plant sites, and other non-linear facilities)—$250 for each 40 acres or fraction thereof. (iii) If a project has the features... applicant an estimate, based on the best available cost information, of the costs which would be incurred by...

  20. Catching (and Keeping!) E-Patrons.

    ERIC Educational Resources Information Center

    Puacz, Jeanne Holba

    2002-01-01

    Based on experiences of the Vigo County Public Library in Terre Haute, Indiana, this article outlines ways libraries can attract patrons to their Web sites and features that can keep them returning. Discusses marketing and publicity; basic content and special sources and services; attractive and easy-to-use site design; good Web site maintenance;…

  1. PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context

    PubMed Central

    Zhou, Jiyun; Xu, Ruifeng; He, Yulan; Lu, Qin; Wang, Hongpeng; Kong, Bing

    2016-01-01

    Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community. PMID:27282833

  2. Knowledge-transfer learning for prediction of matrix metalloprotease substrate-cleavage sites.

    PubMed

    Wang, Yanan; Song, Jiangning; Marquez-Lago, Tatiana T; Leier, André; Li, Chen; Lithgow, Trevor; Webb, Geoffrey I; Shen, Hong-Bin

    2017-07-18

    Matrix Metalloproteases (MMPs) are an important family of proteases that play crucial roles in key cellular and disease processes. Therefore, MMPs constitute important targets for drug design, development and delivery. Advanced proteomic technologies have identified type-specific target substrates; however, the complete repertoire of MMP substrates remains uncharacterized. Indeed, computational prediction of substrate-cleavage sites associated with MMPs is a challenging problem. This holds especially true when considering MMPs with few experimentally verified cleavage sites, such as for MMP-2, -3, -7, and -8. To fill this gap, we propose a new knowledge-transfer computational framework which effectively utilizes the hidden shared knowledge from some MMP types to enhance predictions of other, distinct target substrate-cleavage sites. Our computational framework uses support vector machines combined with transfer machine learning and feature selection. To demonstrate the value of the model, we extracted a variety of substrate sequence-derived features and compared the performance of our method using both 5-fold cross-validation and independent tests. The results show that our transfer-learning-based method provides a robust performance, which is at least comparable to traditional feature-selection methods for prediction of MMP-2, -3, -7, -8, -9 and -12 substrate-cleavage sites on independent tests. The results also demonstrate that our proposed computational framework provides a useful alternative for the characterization of sequence-level determinants of MMP-substrate specificity.

  3. Lava Tubes as Martian Analog sites on Hawaii Island

    NASA Astrophysics Data System (ADS)

    Andersen, Christian; Hamilton, J. C.; Adams, M.

    2013-10-01

    The existence of geologic features similar to skylights seen in Mars Reconnaissance Orbiter HIRISE imagery suggest Martian lava tube networks. Along with pit craters, these features are evidence of a past era of vulcanism. If these were contemporary with the wet Mars eras, then it is suggestive that any Martian life may have retreated into these subsurface oases. Hawaii island has numerous lava tubes of differing ages, humidity, lengths and sizes that make ideal analog test environments for future Mars exploration. PISCES has surveyed multiple candidate sites during the past summer with a team of University of Hawaii at Hilo student interns. It should be noted that Lunar features have also been similarly discovered via Lunar Reconnaissance Orbiter LROC imagery.

  4. Arabic writer identification based on diacritic's features

    NASA Astrophysics Data System (ADS)

    Maliki, Makki; Al-Jawad, Naseer; Jassim, Sabah A.

    2012-06-01

    Natural languages like Arabic, Kurdish, Farsi (Persian), Urdu, and any other similar languages have many features, which make them different from other languages like Latin's script. One of these important features is diacritics. These diacritics are classified as: compulsory like dots which are used to identify/differentiate letters, and optional like short vowels which are used to emphasis consonants. Most indigenous and well trained writers often do not use all or some of these second class of diacritics, and expert readers can infer their presence within the context of the writer text. In this paper, we investigate the use of diacritics shapes and other characteristic as parameters of feature vectors for Arabic writer identification/verification. Segmentation techniques are used to extract the diacritics-based feature vectors from examples of Arabic handwritten text. The results of evaluation test will be presented, which has been carried out on an in-house database of 50 writers. Also the viability of using diacritics for writer recognition will be demonstrated.

  5. Sentence alignment using feed forward neural network.

    PubMed

    Fattah, Mohamed Abdel; Ren, Fuji; Kuroiwa, Shingo

    2006-12-01

    Parallel corpora have become an essential resource for work in multi lingual natural language processing. However, sentence aligned parallel corpora are more efficient than non-aligned parallel corpora for cross language information retrieval and machine translation applications. In this paper, we present a new approach to align sentences in bilingual parallel corpora based on feed forward neural network classifier. A feature parameter vector is extracted from the text pair under consideration. This vector contains text features such as length, punctuate score, and cognate score values. A set of manually prepared training data has been assigned to train the feed forward neural network. Another set of data was used for testing. Using this new approach, we could achieve an error reduction of 60% over length based approach when applied on English-Arabic parallel documents. Moreover this new approach is valid for any language pair and it is quite flexible approach since the feature parameter vector may contain more/less or different features than that we used in our system such as lexical match feature.

  6. Clinical, Biologic, and Prognostic Differences on the Basis of Primary Tumor Site in Neuroblastoma: A Report From the International Neuroblastoma Risk Group Project

    PubMed Central

    Vo, Kieuhoa T.; Matthay, Katherine K.; Neuhaus, John; London, Wendy B.; Hero, Barbara; Ambros, Peter F.; Nakagawara, Akira; Miniati, Doug; Wheeler, Kate; Pearson, Andrew D.J.; Cohn, Susan L.; DuBois, Steven G.

    2014-01-01

    Purpose Neuroblastoma (NB) is a heterogeneous tumor arising from sympathetic tissues. The impact of primary tumor site in influencing the heterogeneity of NB remains unclear. Patients and Methods Children younger than age 21 years diagnosed with NB or ganglioneuroblastoma between 1990 and 2002 and with known primary site were identified from the International Neuroblastoma Risk Group database. Data were compared between sites with respect to clinical and biologic features, as well as event-free survival (EFS) and overall survival (OS). Results Among 8,369 children, 47% had adrenal tumors. All evaluated clinical and biologic variables differed statistically between primary sites. The features that were > 10% discrepant between sites were stage 4 disease, MYCN amplification, elevated ferritin, elevated lactate dehydrogenase, and segmental chromosomal aberrations, all of which were more frequent in adrenal versus nonadrenal tumors (P < .001). Adrenal tumors were more likely than nonadrenal tumors (adjusted odds ratio, 2.09; 95% CI, 1.67 to 2.63; P < .001) and thoracic tumors were less likely than nonthoracic tumors (adjusted odds ratio, 0.20; 95% CI, 0.11 to 0.39; P < .001) to have MYCN amplification after controlling for age, stage, and histologic grade. EFS and OS differed significantly according to the primary site (P < .001 for both comparisons). After controlling for age, MYCN status, and stage, patients with adrenal tumors had higher risk for events (hazard ratio, 1.13 compared with nonadrenal tumors; 95% CI, 1.03 to 1.23; P = .008), and patients with thoracic tumors had lower risk for events (HR, 0.79 compared with nonthoracic; 95% CI, 0.67 to 0.92; P = .003). Conclusion Clinical and biologic features show important differences by NB primary site, with adrenal and thoracic sites associated with inferior and superior survival, respectively. Future studies will need to investigate the biologic origin of these differences. PMID:25154816

  7. Using environmental heterogeneity to plan for sea-level rise.

    PubMed

    Hunter, Elizabeth A; Nibbelink, Nathan P

    2017-12-01

    Environmental heterogeneity is increasingly being used to select conservation areas that will provide for future biodiversity under a variety of climate scenarios. This approach, termed conserving nature's stage (CNS), assumes environmental features respond to climate change more slowly than biological communities, but will CNS be effective if the stage were to change as rapidly as the climate? We tested the effectiveness of using CNS to select sites in salt marshes for conservation in coastal Georgia (U.S.A.), where environmental features will change rapidly as sea level rises. We calculated species diversity based on distributions of 7 bird species with a variety of niches in Georgia salt marshes. Environmental heterogeneity was assessed across six landscape gradients (e.g., elevation, salinity, and patch area). We used 2 approaches to select sites with high environmental heterogeneity: site complementarity (environmental diversity [ED]) and local environmental heterogeneity (environmental richness [ER]). Sites selected based on ER predicted present-day species diversity better than randomly selected sites (up to an 8.1% improvement), were resilient to areal loss from SLR (1.0% average areal loss by 2050 compared with 0.9% loss of randomly selected sites), and provided habitat to a threatened species (0.63 average occupancy compared with 0.6 average occupancy of randomly selected sites). Sites selected based on ED predicted species diversity no better or worse than random and were not resilient to SLR (2.9% average areal loss by 2050). Despite the discrepancy between the 2 approaches, CNS is a viable strategy for conservation site selection in salt marshes because the ER approach was successful. It has potential for application in other coastal areas where SLR will affect environmental features, but its performance may depend on the magnitude of geological changes caused by SLR. Our results indicate that conservation planners that had heretofore excluded low-lying coasts from CNS planning could include coastal ecosystems in regional conservation strategies. © 2017 Society for Conservation Biology.

  8. Text String Detection from Natural Scenes by Structure-based Partition and Grouping

    PubMed Central

    Yi, Chucai; Tian, YingLi

    2012-01-01

    Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) Image partition to find text character candidates based on local gradient features and color uniformity of character components. 2) Character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method, and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in non-horizontal orientations. PMID:21411405

  9. Text string detection from natural scenes by structure-based partition and grouping.

    PubMed

    Yi, Chucai; Tian, YingLi

    2011-09-01

    Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from a complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) image partition to find text character candidates based on local gradient features and color uniformity of character components and 2) character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in nonhorizontal orientations.

  10. Improve Reading with Complex Texts

    ERIC Educational Resources Information Center

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    The Common Core State Standards have cast a renewed light on reading instruction, presenting teachers with the new requirements to teach close reading of complex texts. Teachers and administrators should consider a number of essential features of close reading: They are short, complex texts; rich discussions based on worthy questions; revisiting…

  11. Opening Mathematics Texts: Resisting the Seduction

    ERIC Educational Resources Information Center

    Wagner, David

    2012-01-01

    This analysis of the writing in a grade 7 mathematics textbook distinguishes between closed texts and open texts, which acknowledge multiple possibilities. I use tools that have recently been applied in mathematics contexts, focussing on grammatical features that include personal pronouns, modality, and types of imperatives, as well as on…

  12. DOE Research and Development Accomplishments Help

    Science.gov Websites

    be used to search, locate, access, and electronically download full-text research and development (R Browse Downloading, Viewing, and/or Searching Full-text Documents/Pages Searching the Database Search Features Search allows you to search the OCRed full-text document and bibliographic information, the

  13. Death as Insight into Life: Adolescents' Gothic Text Encounters

    ERIC Educational Resources Information Center

    Del Nero, Jennifer

    2017-01-01

    This qualitative case study explores adolescents' responses to texts containing death and destruction, a seminal trope of the Gothic literary genre. Participants read both classic and popular culture texts featuring characters grappling with death in their seventh grade reading classroom. Observations, interviews, and documents were collected and…

  14. Teaching Scientific Metaphors through Informational Text Read-Alouds

    ERIC Educational Resources Information Center

    Barnes, Erica M.; Oliveira, Alandeom W.

    2018-01-01

    Elementary students are expected to use various features of informational texts to build knowledge in the content areas. In science informational texts, scientific metaphors are commonly used to make sense of complex and invisible processes. Although elementary students may be familiar with literary metaphors as used in narratives, they may be…

  15. Cohesive Features of Deep Text Comprehension Processes

    ERIC Educational Resources Information Center

    Allen, Laura K.; Jacovina, Matthew E.; McNamara, Danielle S.

    2016-01-01

    This study investigates how cohesion manifests in readers' thought processes while reading texts when they are instructed to engage in self-explanation, a strategy associated with deeper, more successful comprehension. In Study 1, college students (n = 21) were instructed to either paraphrase or self-explain science texts. Paraphrasing was…

  16. Parenchymal texture analysis in digital mammography: robust texture feature identification and equivalence across devices.

    PubMed

    Keller, Brad M; Oustimov, Andrew; Wang, Yan; Chen, Jinbo; Acciavatti, Raymond J; Zheng, Yuanjie; Ray, Shonket; Gee, James C; Maidment, Andrew D A; Kontos, Despina

    2015-04-01

    An analytical framework is presented for evaluating the equivalence of parenchymal texture features across different full-field digital mammography (FFDM) systems using a physical breast phantom. Phantom images (FOR PROCESSING) are acquired from three FFDM systems using their automated exposure control setting. A panel of texture features, including gray-level histogram, co-occurrence, run length, and structural descriptors, are extracted. To identify features that are robust across imaging systems, a series of equivalence tests are performed on the feature distributions, in which the extent of their intersystem variation is compared to their intrasystem variation via the Hodges-Lehmann test statistic. Overall, histogram and structural features tend to be most robust across all systems, and certain features, such as edge enhancement, tend to be more robust to intergenerational differences between detectors of a single vendor than to intervendor differences. Texture features extracted from larger regions of interest (i.e., [Formula: see text]) and with a larger offset length (i.e., [Formula: see text]), when applicable, also appear to be more robust across imaging systems. This framework and observations from our experiments may benefit applications utilizing mammographic texture analysis on images acquired in multivendor settings, such as in multicenter studies of computer-aided detection and breast cancer risk assessment.

  17. Detecting and Analyzing Cybercrime in Text-Based Communication of Cybercriminal Networks through Computational Linguistic and Psycholinguistic Feature Modeling

    ERIC Educational Resources Information Center

    Mbaziira, Alex Vincent

    2017-01-01

    Cybercriminals are increasingly using Internet-based text messaging applications to exploit their victims. Incidents of deceptive cybercrime in text-based communication are increasing and include fraud, scams, as well as favorable and unfavorable fake reviews. In this work, we use a text-based deception detection approach to train models for…

  18. Modern erosion rates and loss of coastal features and sites, Beaufort Sea coastline, Alaska

    USGS Publications Warehouse

    Jones, Benjamin M.; Hinkel, Kenneth M.; Arp, C.D.; Eisner, Wendy R.

    2008-01-01

    This study presents modern erosion rate measurements based upon vertical aerial photography captured in 1955, 1979, and 2002 for a 100 km segment of the Beaufort Sea coastline. Annual erosion rates from 1955 to 2002 averaged 5.6 m a-1. However, mean erosion rates increased from 5.0 m a-1 in 1955-79 to 6.2 m a-1 in 1979-2002. Furthermore, from the first period to the second, erosion rates increased at 60% (598) of the 992 sites analyzed, decreased at 31% (307), and changed less than ?? 30 cm at 9% (87). Historical observations and quantitative studies over the past 175 years allowed us to place our erosion rate measurements into a longer-term context. Several of the coastal features along this stretch of coastline received Western place names during the Dease and Simpson expedition in 1837, and the majority of those features had been lost by the early 1900s as a result of coastline erosion, suggesting that erosion has been active over at least the historical record. Incorporation of historical and modern observations also allowed us to detect the loss of both cultural and historical sites and modern infrastructure. U.S. Geological Survey topographic maps reveal a number of known cultural and historical sites, as well as sites with modern infrastructure constructed as recently as the 1950s, that had disappeared by the early 2000s as a result of coastal erosion. We were also able to identify sites that are currently being threatened by an encroaching coastline. Our modern erosion rate measurements can potentially be used to predict when a historical site or modern infrastructure will be affected if such erosion rates persist. ?? The Arctic Institute of North America.

  19. Blasting for abandoned-mine land reclamation (closure of individual subsidence features and erratic, undocumented underground coal-mine workings). Final report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Workman, J.L.; Thompson, J.

    1991-01-01

    The study has examined the feasibility of blasting for mitigating various abandoned mine land features on AML sites. The investigation included extensive field trial blasts at sites in North Dakota and Montana. A blasting technique was used that was based on spherical cratering concepts. At the Beulah, North Dakota site thirteen individual vertical openings (sinkholes) were blasted with the intent to fill the voids. The blasts were designed to displace material laterally into the void. Good success was had in filling the sinkholes. At the White site in Montana erratic underground rooms with no available documentation were collapsed. An aditmore » leading into the mine was also blasted. Both individual room blasting and area pattern blasting were studied. A total of eight blasts were fired on the one acre area. Exploration requirements and costs were found to be extensive.« less

  20. Ground-penetrating radar and electromagnetic surveys at the Monroe Crossroads battlefield site, Fort Bragg, North Carolina

    USGS Publications Warehouse

    Kessler, Richard; Strain, R.E.; Marlowe, J. I.; Currin, K.B.

    1996-01-01

    A ground-penetrating radar survey was conducted at the Monroe Crossroads Battlefield site at Fort Bragg, North Carolina, to determine possible locations of subsurface archaeological features. An electromagnetic survey also was conducted at the site to verify and augment the ground-penetrating radar data. The surveys were conducted over a 67,200-square-foot grid with a grid point spacing of 20 feet. During the ground-penetrating radar survey, 87 subsurface anomalies were detected based on visual inspection of the field records. These anomalies were flagged in the field as they appeared on the ground-penetrating radar records and were located by a land survey. The electromagnetic survey produced two significant readings at ground-penetrating radar anomaly locations. The National Park Service excavated 44 of the 87 anomaly locations at the Civil War battlefield site. Four of these excavations produced significant archaeological features, including one at an abandoned well.

  1. Morphological, biochemical, and histopathological indices and contaminant burdens of cotton rats (Sigmodon hispidus) at three hazardous waste sites near Houston, Texas, USA

    USGS Publications Warehouse

    Rattner, B.A.; Flickinger, Edward L.; Hoffman, D.J.

    1993-01-01

    Male cotton rats (Sigmodon hispidus) were studied at three industrial waste sites near Houston, Texas, to determine whether various morphological, biochemical, and histopathological indices provided evidence of contaminant exposure and toxic insult. Only modest changes were detected in cotton rats residing at waste sites compared with reference sites. No single parameter was consistently altered, except hepatic cytochrome P-450 concentration which was lower ( [Formula: see text] ) at two waste sites, and tended to be lower ( [Formula: see text] ) at a third waste site. Elevated petroleum hydrocarbon concentrations were detected in rats at one waste site, but contaminant burdens of rats from the other sites were unremarkable. Unlike rats captured in summer, those trapped in winter exhibited hepatocellular hypertrophy and up to a 65% increase in liver: body weight ratio, cytochrome P-450 concentration, and activities of aniline hydroxylase, aryl hydrocarbon hydroxylase, and glutathione S-transferase. Although genotoxicity has been previously documented in cotton rats residing at two of the waste sites, biomarkers in the present study provided little evidence of exposure and damage

  2. A Design Analysis Model for Developing World Wide Web Sites.

    ERIC Educational Resources Information Center

    Ma, Yan

    2002-01-01

    Examines the relationship between and among designers, text, and users of the Galter Health Sciences Library Web site at Northwestern University by applying reader-response criticism. Highlights include Web site design; comparison of designers' intentions with the actual organization of knowledge on the Web site; and compares designer's intentions…

  3. Similarity-Based Recommendation of New Concepts to a Terminology

    PubMed Central

    Chandar, Praveen; Yaman, Anil; Hoxha, Julia; He, Zhe; Weng, Chunhua

    2015-01-01

    Terminologies can suffer from poor concept coverage due to delays in addition of new concepts. This study tests a similarity-based approach to recommending concepts from a text corpus to a terminology. Our approach involves extraction of candidate concepts from a given text corpus, which are represented using a set of features. The model learns the important features to characterize a concept and recommends new concepts to a terminology. Further, we propose a cost-effective evaluation methodology to estimate the effectiveness of terminology enrichment methods. To test our methodology, we use the clinical trial eligibility criteria free-text as an example text corpus to recommend concepts for SNOMED CT. We computed precision at various rank intervals to measure the performance of the methods. Results indicate that our automated algorithm is an effective method for concept recommendation. PMID:26958170

  4. Incidence and Clinical Features of Respiratory Syncytial Virus Infections in a Population-Based Surveillance Site in the Nile Delta Region

    DTIC Science & Technology

    2013-01-01

    S U P P L E M E N T A R T I C L E Incidence and Clinical Features of Respiratory Syncytial Virus Infections in a Population-Based Surveillance Site...19a. NAME OF RESPONSIBLE PERSON a. REPORT unclassified b. ABSTRACT unclassified c . THIS PAGE unclassified Standard Form 298 (Rev. 8-98...Patients of any age were eligible if they presented with ≥1 sign of acute infection, (docu- mented fever ≥38° C or history of subjective fever with

  5. Texts of Our Institutional Lives. "Don't You Mean "Slaves," Not "Servants"?": Literary and Institutional Texts for an Interdisciplinary Classroom

    ERIC Educational Resources Information Center

    Ashton, Susanna

    2006-01-01

    The author describes an undergraduate course she taught on "Representations of Slavery." In particular, she explains how the course involved studying an historic site on her university's campus: the former slave plantation of leading antebellum racist John C. Calhoun. She also analyzes how her school represents the site on its Web pages. (Contains…

  6. Robust Classification and Segmentation of Planar and Linear Features for Construction Site Progress Monitoring and Structural Dimension Compliance Control

    NASA Astrophysics Data System (ADS)

    Maalek, R.; Lichti, D. D.; Ruwanpura, J.

    2015-08-01

    The application of terrestrial laser scanners (TLSs) on construction sites for automating construction progress monitoring and controlling structural dimension compliance is growing markedly. However, current research in construction management relies on the planned building information model (BIM) to assign the accumulated point clouds to their corresponding structural elements, which may not be reliable in cases where the dimensions of the as-built structure differ from those of the planned model and/or the planned model is not available with sufficient detail. In addition outliers exist in construction site datasets due to data artefacts caused by moving objects, occlusions and dust. In order to overcome the aforementioned limitations, a novel method for robust classification and segmentation of planar and linear features is proposed to reduce the effects of outliers present in the LiDAR data collected from construction sites. First, coplanar and collinear points are classified through a robust principal components analysis procedure. The classified points are then grouped using a robust clustering method. A method is also proposed to robustly extract the points belonging to the flat-slab floors and/or ceilings without performing the aforementioned stages in order to preserve computational efficiency. The applicability of the proposed method is investigated in two scenarios, namely, a laboratory with 30 million points and an actual construction site with over 150 million points. The results obtained by the two experiments validate the suitability of the proposed method for robust segmentation of planar and linear features in contaminated datasets, such as those collected from construction sites.

  7. The operation and maintenance of a crest-stage gaging station

    USGS Publications Warehouse

    Friday, John

    1965-01-01

    Rigid datum controls must be maintained at the gage site throughout the period of record. Physical changes of the site resulting from flood flows or manmade alterations must be evaluated. If a drainage structure such as a culvert is part of the site features, free-flow conditions must be maintained or obstructions carefully documented.

  8. The Target of the Question: A Taxonomy of Textual Features for Cambridge University "O" Levels English

    ERIC Educational Resources Information Center

    Benjamin, Shanti Isabelle

    2015-01-01

    This study investigates the typical textual features that are most frequently targeted in short-answer reading comprehension questions of the Cambridge University "O" Level English Paper 2. Test writers' awareness of how textual features impact on understanding of meanings in text decisions will determine to great extent their decisions…

  9. --No Title--

    Science.gov Websites

    {background-color:#0079C2;text-align:center}.feature-border{border-top:3px solid #0079C2}.news{border-top:11px }.banner-continuum .issue-info,.feature .number,.multimedia-container{background-color:#5E6A71}.analysis -issue .banner-continuum{background-color:#a5acaf}.analysis-issue .feature-border{border-top:3px solid

  10. Toward better public health reporting using existing off the shelf approaches: A comparison of alternative cancer detection approaches using plaintext medical data and non-dictionary based feature selection.

    PubMed

    Kasthurirathne, Suranga N; Dixon, Brian E; Gichoya, Judy; Xu, Huiping; Xia, Yuni; Mamlin, Burke; Grannis, Shaun J

    2016-04-01

    Increased adoption of electronic health records has resulted in increased availability of free text clinical data for secondary use. A variety of approaches to obtain actionable information from unstructured free text data exist. These approaches are resource intensive, inherently complex and rely on structured clinical data and dictionary-based approaches. We sought to evaluate the potential to obtain actionable information from free text pathology reports using routinely available tools and approaches that do not depend on dictionary-based approaches. We obtained pathology reports from a large health information exchange and evaluated the capacity to detect cancer cases from these reports using 3 non-dictionary feature selection approaches, 4 feature subset sizes, and 5 clinical decision models: simple logistic regression, naïve bayes, k-nearest neighbor, random forest, and J48 decision tree. The performance of each decision model was evaluated using sensitivity, specificity, accuracy, positive predictive value, and area under the receiver operating characteristics (ROC) curve. Decision models parameterized using automated, informed, and manual feature selection approaches yielded similar results. Furthermore, non-dictionary classification approaches identified cancer cases present in free text reports with evaluation measures approaching and exceeding 80-90% for most metrics. Our methods are feasible and practical approaches for extracting substantial information value from free text medical data, and the results suggest that these methods can perform on par, if not better, than existing dictionary-based approaches. Given that public health agencies are often under-resourced and lack the technical capacity for more complex methodologies, these results represent potentially significant value to the public health field. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Harvesting geographic features from heterogeneous raster maps

    NASA Astrophysics Data System (ADS)

    Chiang, Yao-Yi

    2010-11-01

    Raster maps offer a great deal of geospatial information and are easily accessible compared to other geospatial data. However, harvesting geographic features locked in heterogeneous raster maps to obtain the geospatial information is challenging. This is because of the varying image quality of raster maps (e.g., scanned maps with poor image quality and computer-generated maps with good image quality), the overlapping geographic features in maps, and the typical lack of metadata (e.g., map geocoordinates, map source, and original vector data). Previous work on map processing is typically limited to a specific type of map and often relies on intensive manual work. In contrast, this thesis investigates a general approach that does not rely on any prior knowledge and requires minimal user effort to process heterogeneous raster maps. This approach includes automatic and supervised techniques to process raster maps for separating individual layers of geographic features from the maps and recognizing geographic features in the separated layers (i.e., detecting road intersections, generating and vectorizing road geometry, and recognizing text labels). The automatic technique eliminates user intervention by exploiting common map properties of how road lines and text labels are drawn in raster maps. For example, the road lines are elongated linear objects and the characters are small connected-objects. The supervised technique utilizes labels of road and text areas to handle complex raster maps, or maps with poor image quality, and can process a variety of raster maps with minimal user input. The results show that the general approach can handle raster maps with varying map complexity, color usage, and image quality. By matching extracted road intersections to another geospatial dataset, we can identify the geocoordinates of a raster map and further align the raster map, separated feature layers from the map, and recognized features from the layers with the geospatial dataset. The road vectorization and text recognition results outperform state-of-art commercial products, and with considerably less user input. The approach in this thesis allows us to make use of the geospatial information of heterogeneous maps locked in raster format.

  12. U.S. Army Environmental Restoration Programs Guidance Manual

    DTIC Science & Technology

    1998-04-01

    without delay. In addition to sampling, the SI usually includes a reconnaissance of the site’s layout, surrounding topographical features , and the...chemical monitoring of some, but not necessarily all, of the following: 2.1.1 Surface Features (topographic mapping, etc.) (natural and manmade features ...include some, but not necessarily all, of the following: 3.1.1 Surface Features 3.1.2 Meteorology 3.1.3 Surface-Water Hydrology 3.1.4 Geology 3.1.5

  13. Archaeological Salvage Excavations at the L.A. Strickland I Site (22Ts765), Tishomingo County, Mississippi.

    DTIC Science & Technology

    1978-12-01

    were identified to genus and when possible to species. These samples give an indication of the wood types present and their proportionate representation...in the features. Nutshells were sorted by genus and weighed. Seeds which retained their diagnostic characteristics were identified and counted. The...Vitis sp.), l persimmon ( Diospyros viiniana) and 1 round seed misiffng all diagnostic features. Feature 2. The sample from Feature 2 contained 55 grams

  14. Brownfields Waterfront Sustainability Pilot, Allentown PA: Technical Memorandum on Conceptual Design Using Low Impact Development

    EPA Pesticide Factsheets

    This technical memorandum briefly describes the site and the master plan, indicates design constraints considered, specifies recommended LID stormwater techniques and features for sustainable redevelopment of the site, and offers other recommendations.

  15. The GLIMS Glacier Database

    NASA Astrophysics Data System (ADS)

    Raup, B. H.; Khalsa, S. S.; Armstrong, R.

    2007-12-01

    The Global Land Ice Measurements from Space (GLIMS) project has built a geospatial and temporal database of glacier data, composed of glacier outlines and various scalar attributes. These data are being derived primarily from satellite imagery, such as from ASTER and Landsat. Each "snapshot" of a glacier is from a specific time, and the database is designed to store multiple snapshots representative of different times. We have implemented two web-based interfaces to the database; one enables exploration of the data via interactive maps (web map server), while the other allows searches based on text-field constraints. The web map server is an Open Geospatial Consortium (OGC) compliant Web Map Server (WMS) and Web Feature Server (WFS). This means that other web sites can display glacier layers from our site over the Internet, or retrieve glacier features in vector format. All components of the system are implemented using Open Source software: Linux, PostgreSQL, PostGIS (geospatial extensions to the database), MapServer (WMS and WFS), and several supporting components such as Proj.4 (a geographic projection library) and PHP. These tools are robust and provide a flexible and powerful framework for web mapping applications. As a service to the GLIMS community, the database contains metadata on all ASTER imagery acquired over glacierized terrain. Reduced-resolution of the images (browse imagery) can be viewed either as a layer in the MapServer application, or overlaid on the virtual globe within Google Earth. The interactive map application allows the user to constrain by time what data appear on the map. For example, ASTER or glacier outlines from 2002 only, or from Autumn in any year, can be displayed. The system allows users to download their selected glacier data in a choice of formats. The results of a query based on spatial selection (using a mouse) or text-field constraints can be downloaded in any of these formats: ESRI shapefiles, KML (Google Earth), MapInfo, GML (Geography Markup Language) and GMT (Generic Mapping Tools). This "clip-and-ship" function allows users to download only the data they are interested in. Our flexible web interfaces to the database, which includes various support layers (e.g. a layer to help collaborators identify satellite imagery over their region of expertise) will facilitate enhanced analysis to be undertaken on glacier systems, their distribution, and their impacts on other Earth systems.

  16. Bathymetric and velocimetric surveys at highway bridges crossing the Missouri River near Kansas City, Missouri, June 2–4, 2015

    USGS Publications Warehouse

    Huizinga, Richard J.

    2016-06-22

    A local spatial minimum average channel-bed elevation at structure A7650 (site 10) compared to adjacent sites may indicate this site is at or near a local feature that controls sediment deposition and scour. The average channel-bed elevation values and the distribution of channel-bed elevations imply that sediment unable to deposit near structure A7650 is flushed downstream and deposits at the next downstream site, structure A5817 (site 11).

  17. Space Place Prime

    NASA Technical Reports Server (NTRS)

    Fitzpatrick, Austin J.; Novati, Alexander; Fisher, Diane K.; Leon, Nancy J.; Netting, Ruth

    2013-01-01

    Space Place Prime is public engagement and education software for use on iPad. It targets a multi-generational audience with news, images, videos, and educational articles from the Space Place Web site and other NASA sources. New content is downloaded daily (or whenever the user accesses the app) via the wireless connection. In addition to the Space Place Web site, several NASA RSS feeds are tapped to provide new content. Content is retained for the previous several days, or some number of editions of each feed. All content is controlled on the server side, so features about the latest news, or changes to any content, can be made without updating the app in the Apple Store. It gathers many popular NASA features into one app. The interface is a boundless, slidable- in-any-direction grid of images, unique for each feature, and iconized as image, video, or article. A tap opens the feature. An alternate list mode presents menus of images, videos, and articles separately. Favorites can be tagged for permanent archive. Face - book, Twitter, and e-mail connections make any feature shareable.

  18. Using statistical text classification to identify health information technology incidents

    PubMed Central

    Chai, Kevin E K; Anthony, Stephen; Coiera, Enrico; Magrabi, Farah

    2013-01-01

    Objective To examine the feasibility of using statistical text classification to automatically identify health information technology (HIT) incidents in the USA Food and Drug Administration (FDA) Manufacturer and User Facility Device Experience (MAUDE) database. Design We used a subset of 570 272 incidents including 1534 HIT incidents reported to MAUDE between 1 January 2008 and 1 July 2010. Text classifiers using regularized logistic regression were evaluated with both ‘balanced’ (50% HIT) and ‘stratified’ (0.297% HIT) datasets for training, validation, and testing. Dataset preparation, feature extraction, feature selection, cross-validation, classification, performance evaluation, and error analysis were performed iteratively to further improve the classifiers. Feature-selection techniques such as removing short words and stop words, stemming, lemmatization, and principal component analysis were examined. Measurements κ statistic, F1 score, precision and recall. Results Classification performance was similar on both the stratified (0.954 F1 score) and balanced (0.995 F1 score) datasets. Stemming was the most effective technique, reducing the feature set size to 79% while maintaining comparable performance. Training with balanced datasets improved recall (0.989) but reduced precision (0.165). Conclusions Statistical text classification appears to be a feasible method for identifying HIT reports within large databases of incidents. Automated identification should enable more HIT problems to be detected, analyzed, and addressed in a timely manner. Semi-supervised learning may be necessary when applying machine learning to big data analysis of patient safety incidents and requires further investigation. PMID:23666777

  19. Encounters and Content Sharing in an Urban Village: Reading Texts Through an Archaeological Lens

    NASA Astrophysics Data System (ADS)

    Garcia, Nicole; Foth, Marcus; Hearn, Greg

    Archaeology provides a framework of analysis and interpretation that is useful for disentangling the textual layers of a contemporary lived-in urban space. The producers and readers of texts may include those who planned and developed the site and those who now live, visit, and work there. Some of the social encounters and content sharing between these people may be artificially produced or manufactured in the hope that certain social situations will occur. Others may be serendipitous. With archaeology's original focus on places that are no longer inhabited, it is often only the remaining artifacts and features of the built environment that form the basis for interpreting the social relationships of past people. Our analysis, however, is framed within a contemporary notion of archaeological artifacts in an urban setting. Unlike an excavation, where the past is revealed through digging into the landscape, the application of landscape archaeology within a present day urban context is necessarily more experiential, visual, and based on recording and analyzing the physical traces of social encounters and relationships between residents and visitors. These physical traces are present within the creative content, and the built and natural elements of the environment. This chapter explores notions of social encounters and content sharing in an urban village by analyzing three different types of texts: the design of the built environment; content produced by residents through a geospatial web application; and, print and online media produced in digital storytelling workshops.

  20. β-secretase inhibitors for Alzheimer's disease: identification using pharmacoinformatics.

    PubMed

    Islam, Md Ataul; Pillay, Tahir S

    2018-02-01

    In this study we searched for potential β-site amyloid precursor protein cleaving enzyme1 (BACE1) inhibitors using pharmacoinformatics. A large dataset containing 7155 known BACE1 inhibitors was evaluated for pharmacophore model generation. The final model (R = 0.950, RMSD = 1.094, Q 2  = 0.901, se = 0.332, [Formula: see text] = 0.901, [Formula: see text] = 0.756, sp = 0.468, [Formula: see text] = 0.667) was revealed with the importance of spatial arrangement of hydrogen bond acceptor and donor, hydrophobicity and aromatic ring features. The validated model was then used to search NCI and InterBioscreen databases for promising BACE1 inhibitors. The initial hits from both databases were sorted using a number of criteria and finally three molecules from each database were considered for further validation using molecular docking and molecular dynamics studies. Different protonation states of Asp32 and Asp228 dyad were analysed and best protonated form used for molecular docking study. Observation of the number of binding interactions in the molecular docking study supported the potential of these molecules being promising inhibitors. Values of RMSD, RMSF, Rg in molecular dynamics study and binding energies unquestionably explained that final screened molecules formed stable complexes inside the receptor cavity of BACE1. Hence, it can be concluded that the final screened six compounds may be potential therapeutic agents for Alzheimer's disease.

  1. Intuitiveness of Symbol Features for Air Traffic Management

    NASA Technical Reports Server (NTRS)

    Ngo, Mary Kim; Vu, Kim-Phuong L.; Thorpe, Elaine; Battiste, Vernol; Strybel, Thomas Z.

    2012-01-01

    We present the results of two online surveys asking participants to indicate what type of air traffic information might be conveyed by a number of symbols and symbol features (color, fill, text, and shape). The results of this initial study suggest that the well-developed concepts of ownership, altitude, and trajectory are readily associated with certain symbol features, while the relatively novel concept of equipage was not clearly associated with any specific symbol feature.

  2. Recognition of pornographic web pages by classifying texts and images.

    PubMed

    Hu, Weiming; Wu, Ou; Chen, Zhouyao; Fu, Zhouyu; Maybank, Steve

    2007-06-01

    With the rapid development of the World Wide Web, people benefit more and more from the sharing of information. However, Web pages with obscene, harmful, or illegal content can be easily accessed. It is important to recognize such unsuitable, offensive, or pornographic Web pages. In this paper, a novel framework for recognizing pornographic Web pages is described. A C4.5 decision tree is used to divide Web pages, according to content representations, into continuous text pages, discrete text pages, and image pages. These three categories of Web pages are handled, respectively, by a continuous text classifier, a discrete text classifier, and an algorithm that fuses the results from the image classifier and the discrete text classifier. In the continuous text classifier, statistical and semantic features are used to recognize pornographic texts. In the discrete text classifier, the naive Bayes rule is used to calculate the probability that a discrete text is pornographic. In the image classifier, the object's contour-based features are extracted to recognize pornographic images. In the text and image fusion algorithm, the Bayes theory is used to combine the recognition results from images and texts. Experimental results demonstrate that the continuous text classifier outperforms the traditional keyword-statistics-based classifier, the contour-based image classifier outperforms the traditional skin-region-based image classifier, the results obtained by our fusion algorithm outperform those by either of the individual classifiers, and our framework can be adapted to different categories of Web pages.

  3. Long-range correlations and burstiness in written texts: Universal and language-specific aspects

    NASA Astrophysics Data System (ADS)

    Constantoudis, Vassilios; Kalimeri, Maria; Diakonos, Fotis; Karamanos, Konstantinos; Papadimitriou, Constantinos; Chatzigeorgiou, Manolis; Papageorgiou, Harris

    2016-08-01

    Recently, methods from the statistical physics of complex systems have been applied successfully to identify universal features in the long-range correlations (LRCs) of written texts. However, in real texts, these universal features are being intermingled with language-specific influences. This paper aims at the characterization and further understanding of the interplay between universal and language-specific effects on the LRCs in texts. To this end, we apply the language-sensitive mapping of written texts to word-length series (wls) and analyse large parallel (of same content) corpora from 10 languages classified to four families (Romanic, Germanic, Greek and Uralic). The autocorrelation functions of the wls reveal tiny but persistent LRCs decaying at large scales following a power-law with a language-independent exponent ˜0.60-0.65. The impact of language is displayed in the amplitude of correlations where a relative standard deviation >40% among the analyzed languages is observed. The classification to language families seems to play a significant role since, the Finnish and Germanic languages exhibit more correlations than the Greek and Roman families. To reveal the origins of the LRCs, we focus on the long words and perform burst and correlation analysis in their positions along the corpora. We find that the universal features are linked more to the correlations of the inter-long word distances while the language-specific aspects are related more to their distributions.

  4. Epileptic Seizure Prediction Using Diffusion Distance and Bayesian Linear Discriminate Analysis on Intracranial EEG.

    PubMed

    Yuan, Shasha; Zhou, Weidong; Chen, Liyan

    2018-02-01

    Epilepsy is a chronic neurological disorder characterized by sudden and apparently unpredictable seizures. A system capable of forecasting the occurrence of seizures is crucial and could open new therapeutic possibilities for human health. This paper addresses an algorithm for seizure prediction using a novel feature - diffusion distance (DD) in intracranial Electroencephalograph (iEEG) recordings. Wavelet decomposition is conducted on segmented electroencephalograph (EEG) epochs and subband signals at scales 3, 4 and 5 are utilized to extract the diffusion distance. The features of all channels composing a feature vector are then fed into a Bayesian Linear Discriminant Analysis (BLDA) classifier. Finally, postprocessing procedure is applied to reduce false prediction alarms. The prediction method is evaluated on the public intracranial EEG dataset, which consists of 577.67[Formula: see text]h of intracranial EEG recordings from 21 patients with 87 seizures. We achieved a sensitivity of 85.11% for a seizure occurrence period of 30[Formula: see text]min and a sensitivity of 93.62% for a seizure occurrence period of 50[Formula: see text]min, both with the seizure prediction horizon of 10[Formula: see text]s. Our false prediction rate was 0.08/h. The proposed method yields a high sensitivity as well as a low false prediction rate, which demonstrates its potential for real-time prediction of seizures.

  5. An audit of alcohol brand websites.

    PubMed

    Gordon, Ross

    2011-11-01

    The study investigated the nature and content of alcohol brand websites in the UK. The research involved an audit of the websites of the 10 leading alcohol brands by sales in the UK across four categories: lager, spirits, Flavoured Alcoholic Beverages and cider/perry. Each site was visited twice over a 1-month period with site features and content recorded using a pro-forma. The content of websites was then reviewed against the regulatory codes governing broadcast advertising of alcohol. It was found that 27 of 40 leading alcohol brands had a dedicated website. Sites featured sophisticated content, including sports and music sections, games, downloads and competitions. Case studies of two brand websites demonstrate the range of content features on such sites. A review of the application of regulatory codes covering traditional advertising found some content may breach the codes. Study findings illustrate the sophisticated range of content accessible on alcohol brand websites. When applying regulatory codes covering traditional alcohol marketing channels it is apparent that some content on alcohol brand websites would breach the codes. This suggests the regulation of alcohol brand websites may be an issue requiring attention from policymakers. Further research in this area would help inform this process. © 2010 Australasian Professional Society on Alcohol and other Drugs.

  6. Archaeological investigations at a toolstone source area and temporary camp: Sample Unit 19-25, Nevada Test Site, Nye County, Nevada. Technical report No. 77

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jones, R.C.; DuBarton, A.; Edwards, S.

    1993-12-31

    Archaeological investigations were initiated at Sample Unit 19--25 to retrieve information concerning settlement and subsistence data on the aboriginal hunter and gatherers in the area. Studies included collection and mapping of 35.4 acres at site 26NY1408 and excavation and mapping of 0.02 acres at site 26NY7847. Cultural resources include two rock and brush structures and associated caches and a large lithic toolstone source area and lithic artifact scatter. Temporally diagnostic artifacts indicate periodic use throughout the last 12,000 years; however dates associated with projectile points indicate most use was in the Middle and Late Archaic. Radiocarbon dates from the rockmore » and brush structures at site 26NY7847 indicate a construction date of A.D. 1640 and repair between A.D. 1800 and 1950 for feature 1 and between A.D. 1330 and 1390 and repair at A.D. 1410 for feature 2. The dates associated with feature 2 place its construction significantly earlier than similar structures found elsewhere on Pahute Mesa. Activity areas appear to reflect temporary use of the area for procurement of available lithic and faunal resources and the manufacture of tools.« less

  7. Common structural features of cholesterol binding sites in crystallized soluble proteins

    PubMed Central

    Bukiya, Anna N.; Dopico, Alejandro M.

    2017-01-01

    Cholesterol-protein interactions are essential for the architectural organization of cell membranes and for lipid metabolism. While cholesterol-sensing motifs in transmembrane proteins have been identified, little is known about cholesterol recognition by soluble proteins. We reviewed the structural characteristics of binding sites for cholesterol and cholesterol sulfate from crystallographic structures available in the Protein Data Bank. This analysis unveiled key features of cholesterol-binding sites that are present in either all or the majority of sites: i) the cholesterol molecule is generally positioned between protein domains that have an organized secondary structure; ii) the cholesterol hydroxyl/sulfo group is often partnered by Asn, Gln, and/or Tyr, while the hydrophobic part of cholesterol interacts with Leu, Ile, Val, and/or Phe; iii) cholesterol hydrogen-bonding partners are often found on α-helices, while amino acids that interact with cholesterol’s hydrophobic core have a slight preference for β-strands and secondary structure-lacking protein areas; iv) the steroid’s C21 and C26 constitute the “hot spots” most often seen for steroid-protein hydrophobic interactions; v) common “cold spots” are C8–C10, C13, and C17, at which contacts with the proteins were not detected. Several common features we identified for soluble protein-steroid interaction appear evolutionarily conserved. PMID:28420706

  8. An evaluation of applicability of seismic refraction method in identifying shallow archaeological features A case study at archaeological site

    NASA Astrophysics Data System (ADS)

    Jahangardi, Morteza; Hafezi Moghaddas, Naser; Keivan Hosseini, Sayyed; Garazhian, Omran

    2015-04-01

    We applied the seismic refraction method at archaeological site, Tepe Damghani located in Sabzevar, NE of Iran, in order to determine the structures of archaeological interests. This pre-historical site has special conditions with respect to geographical location and geomorphological setting, so it is an urban archaeological site, and in recent years it has been used as an agricultural field. In spring and summer of 2012, the third season of archaeological excavation was carried out. Test trenches of excavations in this site revealed that cultural layers were often disturbed adversely due to human activities such as farming and road construction in recent years. Conditions of archaeological cultural layers in southern and eastern parts of Tepe are slightly better, for instance, in test trench 3×3 m²1S03, third test trench excavated in the southern part of Tepe, an adobe in situ architectural structure was discovered that likely belongs to cultural features of a complex with 5 graves. After conclusion of the third season of archaeological excavation, all of the test trenches were filled with the same soil of excavated test trenches. Seismic refraction method was applied with12 channels of P geophones in three lines with a geophone interval of 0.5 meter and a 1.5 meter distance between profiles on test trench 1S03. The goal of this operation was evaluation of applicability of seismic method in identification of archaeological features, especially adobe wall structures. Processing of seismic data was done with the seismic software, SiesImager. Results were presented in the form of seismic section for every profile, so that identification of adobe wall structures was achieved hardly. This could be due to that adobe wall had been built with the same materials of the natural surrounding earth. Thus, there is a low contrast and it has an inappropriate effect on seismic processing and identifying of archaeological features. Hence the result could be that application of the seismic method in order to determine the archaeological features, having the same conditions, is not affordable and efficient in comparison to GPR or magnetic methods which yield more desirable results.

  9. Complicating Canons: A Critical Literacy Challenge to Common Core Assessment

    ERIC Educational Resources Information Center

    Peel, Anne

    2017-01-01

    The widespread adoption of the Common Core State Standards in the US has prioritized rigorous reading of complex texts. The emphasis on text complexity has led to instructional and assessment materials that constrain critical literacy practices by emphasizing quantitative features of text, such as sentence length, and a static list of text…

  10. Leveling L2 Texts through Readability: Combining Multilevel Linguistic Features with the CEFR

    ERIC Educational Resources Information Center

    Sung, Yao-Ting; Lin, Wei-Chun; Dyson, Scott Benjamin; Chang, Kuo-En; Chen, Yu-Chia

    2015-01-01

    Selecting appropriate texts for L2 (second/foreign language) learners is an important approach to enhancing motivation and, by extension, learning. There is currently no tool for classifying foreign language texts according to a language proficiency framework, which makes it difficult for students and educators to determine the precise…

  11. Examining Kindergarten Students' Use of and Interest in Informational Text

    ERIC Educational Resources Information Center

    Hall, Anna H.; Matthew Boyer, D.; Beschorner, Elizabeth A.

    2017-01-01

    This article describes a dual-case study that was conducted to examine the effects of The Tools Approach on kindergarten students' use of and interest in informational text. Children in one teacher's kindergarten classroom during two subsequent years participated in a writing intervention which included learning about text features, conducting…

  12. Visual Literacy in Science

    ERIC Educational Resources Information Center

    McTigue, Erin; Croix, Amanda

    2010-01-01

    While diagrams make the text more visually appealing and provide an image of the text, they also do much more. Subsequently, the authors designed a series of lessons for students to discover the many purposes of graphics in science. A particular utility of these interdisciplinary lessons is that they are used with any science text featuring visual…

  13. Metacatalog of Planetary Surface Features for Multicriteria Evaluation of Surface Evolution: the Integrated Planetary Feature Database

    NASA Astrophysics Data System (ADS)

    Hargitai, Henrik

    2016-10-01

    We have created a metacatalog, or catalog or catalogs, of surface features of Mars that also includes the actual data in the catalogs listed. The goal is to make mesoscale surface feature databases available in one place, in a GIS-ready format. The databases can be directly imported to ArcGIS or other GIS platforms, like Google Mars. Some of the catalogs in our database are also ingested into the JMARS platform.All catalogs have been previously published in a peer-reviewed journal, but they may contain updates of the published catalogs. Many of the catalogs are "integrated", i.e. they merge databases or information from various papers on the same topic, including references to each individual features listed.Where available, we have included shapefiles with polygon or linear features, however, most of the catalogs only contain point data of their center points and morphological data.One of the unexpected results of the planetary feature metacatalog is that some features have been described by several papers, using different, i.e., conflicting designations. This shows the need for the development of an identification system suitable for mesoscale (100s m to km sized) features that tracks papers and thus prevents multiple naming of the same feature.The feature database can be used for multicriteria analysis of a terrain, thus enables easy distribution pattern analysis and the correlation of the distribution of different landforms and features on Mars. Such catalog makes a scientific evaluation of potential landing sites easier and more effective during the selection process and also supports automated landing site selections.The catalog is accessible at https://planetarydatabase.wordpress.com/.

  14. Archaeological Investigations in the Upper Tombigbee Valley, Mississippi: Phase I. Volume 2.

    DTIC Science & Technology

    1983-01-01

    Macrobotanical Remains in Feature ...... 7.113 7.13 Site 221T576: Inhumation Analysis .................... 7.2H4 7.14 Site 221T576: Percentage Distribution...several aspects of the site stratigraphy provided a viable framework for analysis . Perhaps the most im- portant of the insights gained from the...component. In order to facilitate manipulation of artifact samples from the site, three analytical units were recognized during analysis : Zone 1, which

  15. Informatics in radiology (infoRAD): HTML and Web site design for the radiologist: a primer.

    PubMed

    Ryan, Anthony G; Louis, Luck J; Yee, William C

    2005-01-01

    A Web site has enormous potential as a medium for the radiologist to store, present, and share information in the form of text, images, and video clips. With a modest amount of tutoring and effort, designing a site can be as painless as preparing a Microsoft PowerPoint presentation. The site can then be used as a hub for the development of further offshoots (eg, Web-based tutorials, storage for a teaching library, publication of information about one's practice, and information gathering from a wide variety of sources). By learning the basics of hypertext markup language (HTML), the reader will be able to produce a simple and effective Web page that permits display of text, images, and multimedia files. The process of constructing a Web page can be divided into five steps: (a) creating a basic template with formatted text, (b) adding color, (c) importing images and multimedia files, (d) creating hyperlinks, and (e) uploading one's page to the Internet. This Web page may be used as the basis for a Web-based tutorial comprising text documents and image files already in one's possession. Finally, there are many commercially available packages for Web page design that require no knowledge of HTML.

  16. StereoGene: rapid estimation of genome-wide correlation of continuous or interval feature data.

    PubMed

    Stavrovskaya, Elena D; Niranjan, Tejasvi; Fertig, Elana J; Wheelan, Sarah J; Favorov, Alexander V; Mironov, Andrey A

    2017-10-15

    Genomics features with similar genome-wide distributions are generally hypothesized to be functionally related, for example, colocalization of histones and transcription start sites indicate chromatin regulation of transcription factor activity. Therefore, statistical algorithms to perform spatial, genome-wide correlation among genomic features are required. Here, we propose a method, StereoGene, that rapidly estimates genome-wide correlation among pairs of genomic features. These features may represent high-throughput data mapped to reference genome or sets of genomic annotations in that reference genome. StereoGene enables correlation of continuous data directly, avoiding the data binarization and subsequent data loss. Correlations are computed among neighboring genomic positions using kernel correlation. Representing the correlation as a function of the genome position, StereoGene outputs the local correlation track as part of the analysis. StereoGene also accounts for confounders such as input DNA by partial correlation. We apply our method to numerous comparisons of ChIP-Seq datasets from the Human Epigenome Atlas and FANTOM CAGE to demonstrate its wide applicability. We observe the changes in the correlation between epigenomic features across developmental trajectories of several tissue types consistent with known biology and find a novel spatial correlation of CAGE clusters with donor splice sites and with poly(A) sites. These analyses provide examples for the broad applicability of StereoGene for regulatory genomics. The StereoGene C ++ source code, program documentation, Galaxy integration scripts and examples are available from the project homepage http://stereogene.bioinf.fbb.msu.ru/. favorov@sensi.org. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  17. A Deep Similarity Metric Learning Model for Matching Text Chunks to Spatial Entities

    NASA Astrophysics Data System (ADS)

    Ma, K.; Wu, L.; Tao, L.; Li, W.; Xie, Z.

    2017-12-01

    The matching of spatial entities with related text is a long-standing research topic that has received considerable attention over the years. This task aims at enrich the contents of spatial entity, and attach the spatial location information to the text chunk. In the data fusion field, matching spatial entities with the corresponding describing text chunks has a big range of significance. However, the most traditional matching methods often rely fully on manually designed, task-specific linguistic features. This work proposes a Deep Similarity Metric Learning Model (DSMLM) based on Siamese Neural Network to learn similarity metric directly from the textural attributes of spatial entity and text chunk. The low-dimensional feature representation of the space entity and the text chunk can be learned separately. By employing the Cosine distance to measure the matching degree between the vectors, the model can make the matching pair vectors as close as possible. Mearnwhile, it makes the mismatching as far apart as possible through supervised learning. In addition, extensive experiments and analysis on geological survey data sets show that our DSMLM model can effectively capture the matching characteristics between the text chunk and the spatial entity, and achieve state-of-the-art performance.

  18. Wind Drifts at Viking 1 Landing Site

    NASA Technical Reports Server (NTRS)

    1997-01-01

    This image is of so-called wind drifts seen at the Viking 1 landing site. These are somewhat different from the features seen at the Pathfinder site in two important ways. 1) These landforms have no apparent slip-or avalanche-face as do both terrestrial dunes and the Pathfinder features, and may represent deposits of sediment falling from the air, as opposed to dune sand, which 'hops' or saltates along the ground; 2) these features may indicate erosion on one side, because of the layering and apparent scouring on their right sides. They may, therefore have been deposited by a wind moving left to right, partly or weakly cemented or solidified by surface processes at some later time, then eroded by a second wind (right to left), exposing their internal structure.

    Mars Pathfinder is the second in NASA's Discovery program of low-cost spacecraft with highly focused science goals. The Jet Propulsion Laboratory, Pasadena, CA, developed and manages the Mars Pathfinder mission for NASA's Office of Space Science, Washington, D.C. JPL is a division of the California Institute of Technology (Caltech).

  19. Integrating query of relational and textual data in clinical databases: a case study.

    PubMed

    Fisk, John M; Mutalik, Pradeep; Levin, Forrest W; Erdos, Joseph; Taylor, Caroline; Nadkarni, Prakash

    2003-01-01

    The authors designed and implemented a clinical data mart composed of an integrated information retrieval (IR) and relational database management system (RDBMS). Using commodity software, which supports interactive, attribute-centric text and relational searches, the mart houses 2.8 million documents that span a five-year period and supports basic IR features such as Boolean searches, stemming, and proximity and fuzzy searching. Results are relevance-ranked using either "total documents per patient" or "report type weighting." Non-curated medical text has a significant degree of malformation with respect to spelling and punctuation, which creates difficulties for text indexing and searching. Presently, the IR facilities of RDBMS packages lack the features necessary to handle such malformed text adequately. A robust IR+RDBMS system can be developed, but it requires integrating RDBMSs with third-party IR software. RDBMS vendors need to make their IR offerings more accessible to non-programmers.

  20. TarPmiR: a new approach for microRNA target site prediction.

    PubMed

    Ding, Jun; Li, Xiaoman; Hu, Haiyan

    2016-09-15

    The identification of microRNA (miRNA) target sites is fundamentally important for studying gene regulation. There are dozens of computational methods available for miRNA target site prediction. Despite their existence, we still cannot reliably identify miRNA target sites, partially due to our limited understanding of the characteristics of miRNA target sites. The recently published CLASH (crosslinking ligation and sequencing of hybrids) data provide an unprecedented opportunity to study the characteristics of miRNA target sites and improve miRNA target site prediction methods. Applying four different machine learning approaches to the CLASH data, we identified seven new features of miRNA target sites. Combining these new features with those commonly used by existing miRNA target prediction algorithms, we developed an approach called TarPmiR for miRNA target site prediction. Testing on two human and one mouse non-CLASH datasets, we showed that TarPmiR predicted more than 74.2% of true miRNA target sites in each dataset. Compared with three existing approaches, we demonstrated that TarPmiR is superior to these existing approaches in terms of better recall and better precision. The TarPmiR software is freely available at http://hulab.ucf.edu/research/projects/miRNA/TarPmiR/ CONTACTS: haihu@cs.ucf.edu or xiaoman@mail.ucf.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  1. Understanding the Growth of ESL Paragraph Writing Skills and Its Relationships with Linguistic Features

    ERIC Educational Resources Information Center

    Aryadoust, Vahid

    2016-01-01

    This study sought to examine the development of paragraph writing skills of 116 English as a second language university students over the course of 12 weeks and the relationship between the linguistic features of students' written texts as measured by Coh-Metrix--a computational system for estimating textual features such as cohesion and…

  2. Exploring the Feasibility of Using Writing Process Features to Assess Text Production Skills. Research Report. ETS RR-15-26

    ERIC Educational Resources Information Center

    Deane, Paul; Zhang, Mo

    2015-01-01

    In this report, we examine the feasibility of characterizing writing performance using process features derived from a keystroke log. Using data derived from a set of "CBAL"™ writing assessments, we examine the following research questions: (a) How stable are the keystroke timing and process features across testing occasions?; (b) How…

  3. Feature generation and representations for protein-protein interaction classification.

    PubMed

    Lan, Man; Tan, Chew Lim; Su, Jian

    2009-10-01

    Automatic detecting protein-protein interaction (PPI) relevant articles is a crucial step for large-scale biological database curation. The previous work adopted POS tagging, shallow parsing and sentence splitting techniques, but they achieved worse performance than the simple bag-of-words representation. In this paper, we generated and investigated multiple types of feature representations in order to further improve the performance of PPI text classification task. Besides the traditional domain-independent bag-of-words approach and the term weighting methods, we also explored other domain-dependent features, i.e. protein-protein interaction trigger keywords, protein named entities and the advanced ways of incorporating Natural Language Processing (NLP) output. The integration of these multiple features has been evaluated on the BioCreAtIvE II corpus. The experimental results showed that both the advanced way of using NLP output and the integration of bag-of-words and NLP output improved the performance of text classification. Specifically, in comparison with the best performance achieved in the BioCreAtIvE II IAS, the feature-level and classifier-level integration of multiple features improved the performance of classification 2.71% and 3.95%, respectively.

  4. Geothermal-energy files in computer storage: sites, cities, and industries

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    O'Dea, P.L.

    1981-12-01

    The site, city, and industrial files are described. The data presented are from the hydrothermal site file containing about three thousand records which describe some of the principal physical features of hydrothermal resources in the United States. Data elements include: latitude, longitude, township, range, section, surface temperature, subsurface temperature, the field potential, and well depth for commercialization. (MHR)

  5. Initial Field Trial of a Coach-Supported Web-Based Depression Treatment.

    PubMed

    Schueller, Stephen M; Mohr, David C

    2015-08-01

    Early web-based depression treatments were often self-guided and included few interactive elements, instead focusing mostly on delivering informational content online. Newer programs include many more types of features. As such, trials should analyze the ways in which people use these sites in order to inform the design of subsequent sites and models of support. The current study describes of a field trial consisting of 9 patients with major depressive disorder who completed a 12-week program including weekly coach calls. Patients usage varied widely, however, patients who formed regular patterns tended to persist with the program for the longest. Future sites might be able to facilitate user engagement by designing features to support regular use and to use coaches to help establish patterns to increase long-term use and benefit.

  6. Analysis of Smartphone Interruptions on Academic General Internal Medicine Wards

    PubMed Central

    C.Wu, Robert

    2017-01-01

    Summary Introduction Hospital-based medical services are increasingly utilizing team-based pagers and smartphones to streamline communications. However, an unintended consequence may be higher volumes of interruptions potentially leading to medical error. There is likely a level at which interruptions are excessive and cause a ‘crisis mode’ climate. Methods We retrospectively collected phone, text messaging, and email interruptions directed to hospital-assigned smartphones on eight General Internal Medicine (GIM) teams at two tertiary care centres in Toronto, Ontario from April 2013 to September 2014. We also calculated the number of times these interruptions exceeded a pre-specified threshold per hour, termed ‘crisis mode’, defined as at least five interruptions in 30 minutes. We analyzed the correlation between interruptions and date, site, and patient volumes. Results A total of 187,049 interruptions were collected over an 18-month period. Daily weekday interruptions rose sharply in the morning, peaking between 11 AM to 12 PM and measuring 4.8 and 3.7 mean interruptions/hour at each site, respectively. Mean daily interruptions per team totaled 46.2 ± 3.6 at Site 1 and 39.2 ± 4.2 at Site 2. The ‘crisis mode’ threshold was exceeded, on average, 2.3 times/day per GIM team during weekdays. In a multivariable linear regression analysis, site (β6.43 CI95% 5.44 – 7.42, p<0.001), day of the week (with Friday having the most interruptions) (β0.481 CI95% 0.236 – 0.730, p<0.05) and patient census (β1.55 CI95% 1.42 – 1.67, p<0.05) were all predictive of daily interruption volume although there was a significant interaction effect between site and patient census (β-0.941 CI95% -1.18 – -0.703, p<0.05). Conclusion Interruptions were related to site-specific features, including volume, suggesting that future interventions should target the culture of individual hospitals. Excessive interruptions may have implications for patient safety especially when exceeding a maximal threshold over short periods of time. PMID:28066851

  7. Analysis of Smartphone Interruptions on Academic General Internal Medicine Wards. Frequent Interruptions may cause a 'Crisis Mode' Work Climate.

    PubMed

    Vaisman, Alon; Wu, Robert C

    2017-01-04

    Hospital-based medical services are increasingly utilizing team-based pagers and smartphones to streamline communications. However, an unintended consequence may be higher volumes of interruptions potentially leading to medical error. There is likely a level at which interruptions are excessive and cause a 'crisis mode' climate. We retrospectively collected phone, text messaging, and email interruptions directed to hospital-assigned smartphones on eight General Internal Medicine (GIM) teams at two tertiary care centres in Toronto, Ontario from April 2013 to September 2014. We also calculated the number of times these interruptions exceeded a pre-specified threshold per hour, termed 'crisis mode', defined as at least five interruptions in 30 minutes. We analyzed the correlation between interruptions and date, site, and patient volumes. A total of 187,049 interruptions were collected over an 18-month period. Daily weekday interruptions rose sharply in the morning, peaking between 11 AM to 12 PM and measuring 4.8 and 3.7 mean interruptions/hour at each site, respectively. Mean daily interruptions per team totaled 46.2 ± 3.6 at Site 1 and 39.2 ± 4.2 at Site 2. The 'crisis mode' threshold was exceeded, on average, 2.3 times/day per GIM team during weekdays. In a multivariable linear regression analysis, site (β6.43 CI95% 5.44 - 7.42, p<0.001), day of the week (with Friday having the most interruptions) (β0.481 CI95% 0.236 - 0.730, p<0.05) and patient census (β1.55 CI95% 1.42 - 1.67, p<0.05) were all predictive of daily interruption volume although there was a significant interaction effect between site and patient census (β-0.941 CI95% -1.18 - -0.703, p<0.05). Interruptions were related to site-specific features, including volume, suggesting that future interventions should target the culture of individual hospitals. Excessive interruptions may have implications for patient safety especially when exceeding a maximal threshold over short periods of time.

  8. Nevada National Security Site Environmental Report 2012 Attachment A: Site Description

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wills, Cathy A

    This attachment expands on the general description of the Nevada National Security Site (NNSS) presented in the Introduction to the Nevada National Security Site Environmental Report 2012 (National Security Technologies, LLC [NSTec], 2013). Included are subsections that summarize the site’s geological, hydrological, climatological, and ecological setting and the cultural resources of the NNSS. The subsections are meant to aid the reader in understanding the complex physical and biological environment of the NNSS. An adequate knowledge of the site’s environment is necessary to assess the environmental impacts of new projects, design and implement environmental monitoring activities for current site operations, andmore » assess the impacts of site operations on the public residing in the vicinity of the NNSS. The NNSS environment contributes to several key features of the site that afford protection to the inhabitants of adjacent areas from potential exposure to radioactivity or other contaminants resulting from NNSS operations. These key features include the general remote location of the NNSS, restricted access, extended wind transport times, the great depths to slow-moving groundwater, little or no surface water, and low population density. This attachment complements the annual summary of monitoring program activities and dose assessments presented in the main body of this report.« less

  9. Nevada National Security Site Environmental Report 2013 Attachment A: Site Description

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wills, C.

    This attachment expands on the general description of the Nevada National Security Site (NNSS) presented in the Introduction to the Nevada National Security Site Environmental Report 2012 (National Security Technologies, LLC [NSTec], 2013). Included are subsections that summarize the site’s geological, hydrological, climatological, and ecological setting and the cultural resources of the NNSS. The subsections are meant to aid the reader in understanding the complex physical and biological environment of the NNSS. An adequate knowledge of the site’s environment is necessary to assess the environmental impacts of new projects, design and implement environmental monitoring activities for current site operations, andmore » assess the impacts of site operations on the public residing in the vicinity of the NNSS. The NNSS environment contributes to several key features of the site that afford protection to the inhabitants of adjacent areas from potential exposure to radioactivity or other contaminants resulting from NNSS operations. These key features include the general remote location of the NNSS, restricted access, extended wind transport times, the great depths to slow-moving groundwater, little or no surface water, and low population density. This attachment complements the annual summary of monitoring program activities and dose assessments presented in the main body of this report.« less

  10. Soil gas screening for chlorinated solvents at three contaminated karst sites in Tennessee

    USGS Publications Warehouse

    Wolfe, W.J.; Williams, S.D.

    2002-01-01

    Soil gas was sampled using active sampling techniques and passive collectors at three sites in Tennessee to evaluate the effectiveness of these techniques for locating chlorinated solvent sources and flowpaths in karst aquifers. Actively collected soil gas samples were analyzed in the field with a portable gas chromatograph, and the passive soil gas collectors were analyzed in the lab with gas chromatography/mass spectrometry. Results of the sampling indicate that the effectiveness of both techniques is highly dependent on the distribution of the contaminants in the subsurface, the geomorphic and hydrogeologic characteristics of the site, and, in one case, on seasonal conditions. Both active and passive techniques identified areas of elevated subsurface chlorinated solvent concentrations at a landfill site where contamination remains concentrated in the regolith. Neither technique detected chlorinated solvents known to be moving in the bedrock at a manufacturing site characterized by thick regolith and an absence of surficial karst features. Passive soil gas sampling had varied success detecting flowpaths for chloroform in the bedrock at a train derailment site characterized by shallow regolith and abundant surficial karst features. At the train derailment site, delineation of the contaminant flowpath through passive soil gas sampling was stronger and more detailed under Winter conditions than summer.

  11. Mojo Hand, a TALEN design tool for genome editing applications.

    PubMed

    Neff, Kevin L; Argue, David P; Ma, Alvin C; Lee, Han B; Clark, Karl J; Ekker, Stephen C

    2013-01-16

    Recent studies of transcription activator-like (TAL) effector domains fused to nucleases (TALENs) demonstrate enormous potential for genome editing. Effective design of TALENs requires a combination of selecting appropriate genetic features, finding pairs of binding sites based on a consensus sequence, and, in some cases, identifying endogenous restriction sites for downstream molecular genetic applications. We present the web-based program Mojo Hand for designing TAL and TALEN constructs for genome editing applications (http://www.talendesign.org). We describe the algorithm and its implementation. The features of Mojo Hand include (1) automatic download of genomic data from the National Center for Biotechnology Information, (2) analysis of any DNA sequence to reveal pairs of binding sites based on a user-defined template, (3) selection of restriction-enzyme recognition sites in the spacer between the TAL monomer binding sites including options for the selection of restriction enzyme suppliers, and (4) output files designed for subsequent TALEN construction using the Golden Gate assembly method. Mojo Hand enables the rapid identification of TAL binding sites for use in TALEN design. The assembly of TALEN constructs, is also simplified by using the TAL-site prediction program in conjunction with a spreadsheet management aid of reagent concentrations and TALEN formulation. Mojo Hand enables scientists to more rapidly deploy TALENs for genome editing applications.

  12. Double-u double-u double-u dot APIC dot org: a review of the APIC World Wide Web site.

    PubMed

    Harr, J

    1996-12-01

    The widespread use of the Internet and the development of the World Wide Web have led to a revolution in electronic communication and information access. The Association for Professional in Infection Control and Epidemiology (APIC) has developed a site on the World Wide Web to provide mechanisms for international on-line information access and exchange on issues related to the practice of infection control and the application of epidemiology. From the home page of the APIC Web site, users can access information on professional resources, publications, educational offering, governmental affairs, the APIC organization, and the infection control profession. Among the chief features of the site is a discussion forum for posing questions and sharing information about infection control and epidemiology. The site also contains a searchable database of practice-related abstracts and descriptions and order forms for APIC publications. Users will find continuing education course descriptions and registration forms, legislative and regulatory action alerts and a congressional mailer, chapter and committee information, and infection control information of interest to the general public. APIC is considering several potential future enhancements to their Web site and will continue to review the site's content and features to provide current and useful information to infection control professionals.

  13. Malignant melanoma of sun-protected sites: a review of clinical, histological, and molecular features.

    PubMed

    Merkel, Emily A; Gerami, Pedram

    2017-06-01

    In most cases of cutaneous melanoma, ultraviolet (UV) radiation is recognized as a prominent risk factor. Less is known regarding the mechanisms of mutagenesis for melanoma arising in sun-protected sites, such as acral and mucosal melanoma. Acral and mucosal melanoma share many common features, including a late age of onset, a broad radial growth phase with prominent lentiginous growth, the presence of field cancerization cells, and, in most cases, lack of a precursor nevus. In addition to early chromosomal instability, many of the same genes are also involved in these two distinct melanoma subtypes. To better understand non-UV-mediated pathogenesis in melanoma, we conducted a joint literature review of clinical, histological, and molecular features in acral and mucosal melanoma. We also reviewed the current literature regarding aberrations in KIT, PDGFRA, TERT, and other commonly involved genes. By comparing common features of these two subtypes, we suggest potential mechanisms underlying acral and/or mucosal melanoma and offer direction for future investigations.

  14. The structural and content aspects of abstracts versus bodies of full text journal articles are different

    PubMed Central

    2010-01-01

    Background An increase in work on the full text of journal articles and the growth of PubMedCentral have the opportunity to create a major paradigm shift in how biomedical text mining is done. However, until now there has been no comprehensive characterization of how the bodies of full text journal articles differ from the abstracts that until now have been the subject of most biomedical text mining research. Results We examined the structural and linguistic aspects of abstracts and bodies of full text articles, the performance of text mining tools on both, and the distribution of a variety of semantic classes of named entities between them. We found marked structural differences, with longer sentences in the article bodies and much heavier use of parenthesized material in the bodies than in the abstracts. We found content differences with respect to linguistic features. Three out of four of the linguistic features that we examined were statistically significantly differently distributed between the two genres. We also found content differences with respect to the distribution of semantic features. There were significantly different densities per thousand words for three out of four semantic classes, and clear differences in the extent to which they appeared in the two genres. With respect to the performance of text mining tools, we found that a mutation finder performed equally well in both genres, but that a wide variety of gene mention systems performed much worse on article bodies than they did on abstracts. POS tagging was also more accurate in abstracts than in article bodies. Conclusions Aspects of structure and content differ markedly between article abstracts and article bodies. A number of these differences may pose problems as the text mining field moves more into the area of processing full-text articles. However, these differences also present a number of opportunities for the extraction of data types, particularly that found in parenthesized text, that is present in article bodies but not in article abstracts. PMID:20920264

  15. Computation of reliable textural indices from multimodal brain MRI: suggestions based on a study of patients with diffuse intrinsic pontine glioma.

    PubMed

    Goya-Outi, Jessica; Orlhac, Fanny; Calmon, Raphael; Alentorn, Agusti; Nioche, Christophe; Philippe, Cathy; Puget, Stéphanie; Boddaert, Nathalie; Buvat, Irène; Grill, Jacques; Frouin, Vincent; Frouin, Frederique

    2018-05-10

    Few methodological studies regarding widely used textural indices robustness in MRI have been reported. In this context, this study aims to propose some rules to compute reliable textural indices from multimodal 3D brain MRI. Diagnosis and post-biopsy MR scans including T1, post-contrast T1, T2 and FLAIR images from thirty children with diffuse intrinsic pontine glioma (DIPG) were considered. The hybrid white stripe method was adapted to standardize MR intensities. Sixty textural indices were then computed for each modality in different regions of interest (ROI), including tumor and white matter (WM). Three types of intensity binning were compared [Formula: see text]: constant bin width and relative bounds; [Formula: see text] constant number of bins and relative bounds; [Formula: see text] constant number of bins and absolute bounds. The impact of the volume of the region was also tested within the WM. First, the mean Hellinger distance between patient-based intensity distributions decreased by a factor greater than 10 in WM and greater than 2.5 in gray matter after standardization. Regarding the binning strategy, the ranking of patients was highly correlated for 188/240 features when comparing [Formula: see text] with [Formula: see text], but for only 20 when comparing [Formula: see text] with [Formula: see text], and nine when comparing [Formula: see text] with [Formula: see text]. Furthermore, when using [Formula: see text] or [Formula: see text] texture indices reflected tumor heterogeneity as assessed visually by experts. Last, 41 features presented statistically significant differences between contralateral WM regions when ROI size slightly varies across patients, and none when using ROI of the same size. For regions with similar size, 224 features were significantly different between WM and tumor. Valuable information from texture indices can be biased by methodological choices. Recommendations are to standardize intensities in MR brain volumes, to use intensity binning with constant bin width, and to define regions with the same volumes to get reliable textural indices.

  16. Examining the Effects of Text Genre and Structure on Fourth-and Fifth-Grade Students' High-Level Comprehension as Evidenced in Small-Group Discussions

    ERIC Educational Resources Information Center

    Li, Mengyi; Murphy, P. Karen; Firetto, Carla M.

    2014-01-01

    Although there is a rich literature on the role of text genre and structure on students' literal comprehension, more research is needed regarding the role of these text features on students' high-level comprehension as evidenced in their small-group discussions. As such, the present study examined the effects of text genre (i.e., narrative and…

  17. JCE Feature Columns

    NASA Astrophysics Data System (ADS)

    Holmes, Jon L.

    1999-05-01

    The Features area of JCE Online is now readily accessible through a single click from our home page. In the Features area each column is linked to its own home page. These column home pages also have links to them from the online Journal Table of Contents pages or from any article published as part of that feature column. Using these links you can easily find abstracts of additional articles that are related by topic. Of course, JCE Online+ subscribers are then just one click away from the entire article. Finding related articles is easy because each feature column "site" contains links to the online abstracts of all the articles that have appeared in the column. In addition, you can find the mission statement for the column and the email link to the column editor that I mentioned above. At the discretion of its editor, a feature column site may contain additional resources. As an example, the Chemical Information Instructor column edited by Arleen Somerville will have a periodically updated bibliography of resources for teaching and using chemical information. Due to the increase in the number of these resources available on the WWW, it only makes sense to publish this information online so that you can get to these resources with a simple click of the mouse. We expect that there will soon be additional information and resources at several other feature column sites. Following in the footsteps of the Chemical Information Instructor, up-to-date bibliographies and links to related online resources can be made available. We hope to extend the online component of our feature columns with moderated online discussion forums. If you have a suggestion for an online resource you would like to see included, let the feature editor or JCE Online (jceonline@chem.wisc.edu) know about it. JCE Internet Features JCE Internet also has several feature columns: Chemical Education Resource Shelf, Conceptual Questions and Challenge Problems, Equipment Buyers Guide, Hal's Picks, Mathcad in the Chemistry Curriculum, and WWW Site Review. These columns differ from the print feature columns in that they use the Internet as the publication medium. Doing so allows these features to include continually updated information, digital components, and links to other online resources. The Conceptual Questions and Challenge Problems feature of JCE Internet serves as a good example for the kinds of resources that you can expect to find in an online feature column. Like other columns it contains a mission statement that defines the role of the column. It includes a digital library of continually updated examples of conceptual questions and challenge problems. (As I write this we have just added several new questions to the library.) It also includes a list of links to related online resources, information for authors about how to write questions and problems, and information for teachers about how to use conceptual questions and challenge problems. Teaching with Technology home page at JCE Online. One-Stop Feature Shop The updated Feature area of JCE Online offers information about all JCE feature columns in one place. It gives you a quick and convenient way to access a group of articles in a particular subject area. It provides authors and readers with a good definition of the column and its mission. It complements the print feature columns with online resources. It provides up-to-date bibliographies for selected areas of interest. And last, but not least, it provides that email address you can use to send that message of appreciation to the feature editor for his or her contribution to JCE and the chemical education community.

  18. Caught on the Web

    ERIC Educational Resources Information Center

    Isakson, Carol

    2005-01-01

    In this article, the author presents several Web sites supporting electronic presentation skills. The sites featured here will help fine-tune one's skills in modeling effective presentations and provide suggestions for managing student presentations meeting National Educational Technology Standards (NETS). Most use PowerPoint, the current industry…

  19. 10 CFR 60.122 - Siting criteria.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... Period. (12) Earthquakes which have occurred historically that if they were to be repeated could affect the site significantly. (13) Indications, based on correlations of earthquakes with tectonic processes and features, that either the frequency of occurrence or magnitude of earthquakes may increase. (14...

  20. 10 CFR 60.122 - Siting criteria.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... Period. (12) Earthquakes which have occurred historically that if they were to be repeated could affect the site significantly. (13) Indications, based on correlations of earthquakes with tectonic processes and features, that either the frequency of occurrence or magnitude of earthquakes may increase. (14...

  1. 10 CFR 60.122 - Siting criteria.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... Period. (12) Earthquakes which have occurred historically that if they were to be repeated could affect the site significantly. (13) Indications, based on correlations of earthquakes with tectonic processes and features, that either the frequency of occurrence or magnitude of earthquakes may increase. (14...

  2. 10 CFR 60.122 - Siting criteria.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... Period. (12) Earthquakes which have occurred historically that if they were to be repeated could affect the site significantly. (13) Indications, based on correlations of earthquakes with tectonic processes and features, that either the frequency of occurrence or magnitude of earthquakes may increase. (14...

  3. Designing a Text Messaging Intervention to Improve Physical Activity Behavior Among Low-Income Latino Patients With Diabetes: A Discrete-Choice Experiment, Los Angeles, 2014-2015.

    PubMed

    Ramirez, Magaly; Wu, Shinyi; Beale, Elizabeth

    2016-12-22

    Automated text messaging can deliver self-management education to activate self-care behaviors among people with diabetes. We demonstrated how a discrete-choice experiment was used to determine the features of a text-messaging intervention that are important to urban, low-income Latino patients with diabetes and that could support improvement in their physical activity behavior. In a discrete-choice experiment from December 2014 through August 2015 we conducted a survey to elicit information on patient preferences for 5 features of a text-messaging intervention. We described 2 hypothetical interventions and in 7 pairwise comparisons asked respondents to indicate which they preferred. Respondents (n = 125) were recruited in person from a diabetes management program of a safety-net ambulatory care clinic in Los Angeles; clinicians referred patients to the research assistant after routine clinic visits. Data were analyzed by using conditional logistic regression. We found 2 intervention features that were considered by the survey respondents to be important: 1) the frequency of text messaging and 2) physical activity behavior-change education (the former being more important than the latter). Physical activity goal setting, feedback on physical activity performance, and social support were not significantly important. A discrete-choice experiment is a feasible way to elicit information on patient preferences for a text-messaging intervention designed to support behavior change. However, discrepancies may exist between patients' stated preferences and their actual behavior. Future research should validate and expand our findings.

  4. A neural joint model for entity and relation extraction from biomedical text.

    PubMed

    Li, Fei; Zhang, Meishan; Fu, Guohong; Ji, Donghong

    2017-03-31

    Extracting biomedical entities and their relations from text has important applications on biomedical research. Previous work primarily utilized feature-based pipeline models to process this task. Many efforts need to be made on feature engineering when feature-based models are employed. Moreover, pipeline models may suffer error propagation and are not able to utilize the interactions between subtasks. Therefore, we propose a neural joint model to extract biomedical entities as well as their relations simultaneously, and it can alleviate the problems above. Our model was evaluated on two tasks, i.e., the task of extracting adverse drug events between drug and disease entities, and the task of extracting resident relations between bacteria and location entities. Compared with the state-of-the-art systems in these tasks, our model improved the F1 scores of the first task by 5.1% in entity recognition and 8.0% in relation extraction, and that of the second task by 9.2% in relation extraction. The proposed model achieves competitive performances with less work on feature engineering. We demonstrate that the model based on neural networks is effective for biomedical entity and relation extraction. In addition, parameter sharing is an alternative method for neural models to jointly process this task. Our work can facilitate the research on biomedical text mining.

  5. An online survey to study the relationship between patients’ health literacy and coping style and their preferences for self-management-related information

    PubMed Central

    Vosbergen, Sandra; Peek, Niels; Mulder-Wiggers, Johanna MR; Kemps, Hareld MC; Kraaijenhagen, Roderik A; Jaspers, Monique WM; Lacroix, Joyca PW

    2014-01-01

    Objective To evaluate patients’ preferences for message features and assess their relationships with health literacy, monitor–blunter coping style, and other patient-dependent characteristics. Methods Patients with coronary heart disease completed an internet-based survey, which assessed health literacy and monitor–blunter coping style, as well as various other patient characteristics such as sociodemographics, disease history, and explicit information preferences. To assess preferences for message features, nine text sets differing in one of nine message features were composed, and participants were asked to state their preferences. Results The survey was completed by 213 patients. For three of the nine text sets, a relationship was found between patient preference and health literacy or monitor–blunter coping style. Patients with low health literacy preferred the text based on patient experience. Patients with a monitoring coping style preferred information on short-term effects of their treatment and mentioning of explicit risks. Various other patient characteristics such as marital status, social support, disease history, and age also showed a strong association. Conclusion Individual differences exist in patients’ preferences for message features, and these preferences relate to patient characteristics such as health literacy and monitor–blunter coping style. PMID:24851044

  6. Boca de Potrerillos

    NASA Astrophysics Data System (ADS)

    Murray, William Breen

    Boca de Potrerillos is an archaeological site located in the municipio of Mina, Nuevo León, about 60 km. northwest of Monterrey, Mexicós third largest city. Its principal feature is one of the largest concentrations of petroglyphs in the country. Archaeoastronomical features include petroglyphic markers of the cardinal directions, dot configurations which count lunar synodic periods, and one of the earliest horizon calendars in North America. They indicate that the site was probably used for sky observation from the Middle Archaic time period onward and may represent evidence of the initial stages in the development of Mesoamerican numeration and astronomy.

  7. A Determination of Eligibility to the National Register of Historic Places for Select Historic Properties Along the Souris River in North Dakota

    DTIC Science & Technology

    1989-08-01

    Company Historic District," 1985. "Historic Resources of Hardin, Montana," :984. "Silver Bow Brewery Malt House," 1982. "Silver Bow County Poor Farm...34i QQL-- Q ..i FEATURE TYPE CULTURAL MATERIAL ’iii Site Type -0,- Cm Scatter , , Bone 0 Chimney %.Z Ceramics tA Context .Q, Depression 1 Charcoal i...Sec , QQQ i, QQ . Q, ,LTL, L- Twp R ,.. , Sec ,., QQO 1- QQ’ L- Q’ ’ FEATURE TYPE CULTURAL MATERIAL &. Site Type m, Cm Scatter ,.Z, Bone Chimney

  8. Project Math-Co. Career Based Math Text Book Produced Entirely by Eighth Grade Students at Wiscasset Middle School.

    ERIC Educational Resources Information Center

    Wiscasset Middle School, ME.

    This career-based mathematics text book was written for eighth grade students by eighth grade students at Wiscasset Middle School, Wiscasset, Maine. The text has an innovative format and features interviews with various townspeople of Wiscasset concerning their occupations; from the interviews, information is presented about training needed,…

  9. Graphics in Children's Informational Texts: A Content Analysis

    ERIC Educational Resources Information Center

    Fingeret, Lauren

    2012-01-01

    This dissertation is comprised of two manuscripts resulting from a single study, which examines a) the types of graphics that appear, and in what frequencies, in children's informational texts, and b) the defining features of different graphics. Graphics are ubiquitous in children's informational texts and a lot is known about the impact…

  10. The Shifting Sands in the Effects of Source Text Summarizability on Summary Writing

    ERIC Educational Resources Information Center

    Yu, Guoxing

    2009-01-01

    This paper reports the effects of the properties of source texts on summarization. One hundred and fifty-seven undergraduates were asked to write summaries of one of three extended English texts of similar length and readability, but differing in other discoursal features such as lexical diversity and macro-organization. The effects of…

  11. Lidar observations of wind- and wave-driven morphological evolution of coastal foredunes

    NASA Astrophysics Data System (ADS)

    Spore, N.; Brodie, K. L.; Kershner, C. M.

    2016-02-01

    Coastal foredunes are continually evolving geomorphic features that are slowly built up by wind-blown sand and rapidly eroded during storms by large waves and swash. Landward aeolian transport removes sediment from the active beach and surf-zone, trapping it in the dune, where as coastal erosion both removes sediment from the dune and can decrease the overall fetch and sediment supply available to the dune. Understanding how wave and wind-driven process interact with each other and the dune-beach system itself is a critical component of improving predictions of coastal evolution. To investigate these processes, two 50 m alongshore by 25 m cross-shore patches of dune along an open coast beach fronting the Atlantic Ocean in Duck, NC were scanned with a high resolution terrestrial lidar scanner ( 5000 points per m^2) every three weeks over the last year to observe detailed morphological evolution of the dune and upper beach. Sequential scans were co-registered to each other using fixed objects in the field of view, significantly increasing precision and accuracy of the observations. The north study site featured a 7.5 m tall scarped foredune system, where as the southern study site featured a 6 m tall, hummocky, prograding foredune. Initial analyses show large accretion events on the southern prograding site. For example, during one three week period in February, portions of the site accreted over 40 cm. In contrast, during the same three week period at the northern site (less than 1 km away), response was alongshore variable with erosion and accretion of roughly 10 cm on the foredune face. Further analysis will focus on separating wind vs. wave driven evolution of these sites. Funded by the USACE Coastal Inlets Research Program.

  12. Intein-mediated site-specific synthesis of tumor-targeting protein delivery system: Turning PEG dilemma into prodrug-like feature

    PubMed Central

    Chen, Yingzhi; Zhang, Meng; Jin, Hongyue; Tang, Yisi; Wang, Huiyuan; Xu, Qin; Li, Yaping; Li, Feng; Huang, Yongzhuo

    2017-01-01

    Poor tumor-targeted and cytoplasmic delivery is a bottleneck for protein toxin-based cancer therapy. Ideally, a protein toxin drug should remain stealthy in circulation for prolonged half-life and reduced side toxicity, but turn activated at tumor. PEGylation is a solution to achieve the first goal, but creates a hurdle for the second because PEG rejects interaction between the drugs and tumor cells therein. Such PEG dilemma is an unsolved problem in protein delivery. Herein proposed is a concept of turning PEG dilemma into prodrug-like feature. A site-selectively PEGylated, gelatinase-triggered cell-penetrating trichosanthin protein delivery system is developed with three specific aims. The first is to develop an intein-based ligation method for achieving site-specific modification of protein toxins. The second is to develop a prodrug feature that renders protein toxins remaining stealthy in blood for reduced side toxicity and improved EPR effect. The third is to develop a gelatinase activatable cell-penetration strategy for enhanced tumor targeting and cytoplasmic delivery. Of note, site-specific modification is a big challenge in protein drug research, especially for such a complicated, multifunctional protein delivery system. We successfully develop a protocol for constructing a macromolecular prodrug system with intein-mediated ligation synthesis. With an on-column process of purification and intein-mediated cleavage, the site-specific PEGylation then can be readily achieved by conjugation with the activated C-terminus, thus constructing a PEG-capped, cell-penetrating trichosanthin system with a gelatinase-cleavable linker that enables tumor-specific activation of cytoplasmic delivery. It provides a promising method to address the PEG dilemma for enhanced protein drug delivery, and importantly, a facile protocol for site-specific modification of such a class of protein drugs for improving their druggability and industrial translation. PMID:27914267

  13. Engineering geologic and geotechnical analysis of paleoseismic shaking using liquefaction effects: Field examples

    USGS Publications Warehouse

    Green, R.A.; Obermeier, S.F.; Olson, S.M.

    2005-01-01

    The greatest impediments to the widespread acceptance of back-calculated ground motion characteristics from paleoliquefaction studies typically stem from three uncertainties: (1) the significance of changes in the geotechnical properties of post-liquefied sediments (e.g., "aging" and density changes), (2) the selection of appropriate geotechnical soil indices from individual paleoliquefaction sites, and (3) the methodology for integration of back-calculated results of strength of shaking from individual paleoliquefaction sites into a regional assessment of paleoseismic strength of shaking. Presented herein are two case studies that illustrate the methods outlined by Olson et al. [Engineering Geology, this issue] for addressing these uncertainties. The first case study is for a site near Memphis, Tennessee, wherein cone penetration test data from side-by-side locations, one of liquefaction and the other of no liquefaction, are used to readily discern that the influence of post-liquefaction "aging" and density changes on the measured in situ soil indices is minimal. In the second case study, 12 sites that are at scattered locations in the Wabash Valley and that exhibit paleoliquefaction features are analyzed. The features are first provisionally attributed to the Vincennes Earthquake, which occurred around 6100 years BP, and are used to illustrate our proposed approach for selecting representative soil indices of the liquefied sediments. These indices are used in back-calculating the strength of shaking at the individual sites, the results from which are then incorporated into a regional assessment of the moment magnitude, M, of the Vincennes Earthquake. The regional assessment validated the provisional assumption that the paleoliquefaction features at the scattered sites were induced by the Vincennes Earthquake, in the main, which was determined to have M ??? 7.5. The uncertainties and assumptions used in the assessment are discussed in detail. ?? 2004 Elsevier B.V. All rights reserved.

  14. Magnetic resonance imaging features of Great Danes with and without clinical signs of cervical spondylomyelopathy

    PubMed Central

    Martin-Vaquero, Paula; da Costa, Ronaldo C.

    2014-01-01

    Objective To characterize and compare the MRI morphological features of the cervical vertebral column of Great Danes with and without clinical signs of cervical spondylomyelopathy (CSM). Design Prospective cohort study. Animals 30 Great Danes (15 clinically normal and 15 CSM-affected). Procedures All dogs underwent MRI of the cervical vertebral column (C2–3 through T1–2). Features evaluated included sites of subarachnoid space compression, spinal cord compression, or both; degree, cause, and direction of compression; MRI signal changes of the spinal cord; articular process (facet) joint characteristics; internal vertebral venous plexus visibility; and presence of extradural synovial cysts as well as presence and degree of intervertebral disk degeneration and foraminal stenosis. Results Clinically normal and CSM-affected dogs had 11 and 61 compressive sites, respectively, detected with MRI. All CSM-affected dogs had ≥ 1 site of spinal cord compression. No signal changes were observed in spinal cords of normal dogs, whereas 14 sites of hyperintensity were found in 9 CSM-affected dogs. Foraminal stenosis was present in 11 clinically normal and all CSM-affected dogs. The number of stenotic foraminal sites was significantly greater in the CSM-affected group, and severe stenosis appeared to be more common in this group than in the clinically normal group. Significant differences were identified between clinically normal and CSM-affected dogs with regard to amount of synovial fluid evident, regularity of articular surfaces, degree of articular process joint proliferation, and internal vertebral venous plexus visibility. Conclusions and Clinical Relevance Abnormalities were detected with MRI in several clinically normal Great Danes. Severe spinal cord compression, number of stenotic foraminal sites, and signal changes within the spinal cord distinguished CSM-affected from clinically normal Great Danes. PMID:25075822

  15. Magnetic resonance imaging features of Great Danes with and without clinical signs of cervical spondylomyelopathy.

    PubMed

    Martin-Vaquero, Paula; da Costa, Ronaldo C

    2014-08-15

    To characterize and compare the MRI morphological features of the cervical vertebral column of Great Danes with and without clinical signs of cervical spondylomyelopathy (CSM). Prospective cohort study. 30 Great Danes (15 clinically normal and 15 CSM-affected). All dogs underwent MRI of the cervical vertebral column (C2-3 through T1-2). Features evaluated included sites of subarachnoid space compression, spinal cord compression, or both; degree, cause, and direction of compression; MRI signal changes of the spinal cord; articular process (facet) joint characteristics; internal vertebral venous plexus visibility; and presence of extradural synovial cysts as well as presence and degree of intervertebral disk degeneration and foraminal stenosis. Clinically normal and CSM-affected dogs had 11 and 61 compressive sites, respectively, detected with MRI. All CSM-affected dogs had ≥ 1 site of spinal cord compression. No signal changes were observed in spinal cords of normal dogs, whereas 14 sites of hyperintensity were found in 9 CSM-affected dogs. Foraminal stenosis was present in 11 clinically normal and all CSM-affected dogs. The number of stenotic foraminal sites was significantly greater in the CSM-affected group, and severe stenosis appeared to be more common in this group than in the clinically normal group. Significant differences were identified between clinically normal and CSM-affected dogs with regard to amount of synovial fluid evident, regularity of articular surfaces, degree of articular process joint proliferation, and internal vertebral venous plexus visibility. Abnormalities were detected with MRI in several clinically normal Great Danes. Severe spinal cord compression, number of stenotic foraminal sites, and signal changes within the spinal cord distinguished CSM-affected from clinically normal Great Danes.

  16. Design and Multicentric Implementation of a Generic Software Architecture for Patient Recruitment Systems Re-Using Existing HIS Tools and Routine Patient Data

    PubMed Central

    Trinczek, B.; Köpcke, F.; Leusch, T.; Majeed, R.W.; Schreiweis, B.; Wenk, J.; Bergh, B.; Ohmann, C.; Röhrig, R.; Prokosch, H.U.; Dugas, M.

    2014-01-01

    Summary Objective (1) To define features and data items of a Patient Recruitment System (PRS); (2) to design a generic software architecture of such a system covering the requirements; (3) to identify implementation options available within different Hospital Information System (HIS) environments; (4) to implement five PRS following the architecture and utilizing the implementation options as proof of concept. Methods Existing PRS were reviewed and interviews with users and developers conducted. All reported PRS features were collected and prioritized according to their published success and user’s request. Common feature sets were combined into software modules of a generic software architecture. Data items to process and transfer were identified for each of the modules. Each site collected implementation options available within their respective HIS environment for each module, provided a prototypical implementation based on available implementation possibilities and supported the patient recruitment of a clinical trial as a proof of concept. Results 24 commonly reported and requested features of a PRS were identified, 13 of them prioritized as being mandatory. A UML version 2 based software architecture containing 5 software modules covering these features was developed. 13 data item groups processed by the modules, thus required to be available electronically, have been identified. Several implementation options could be identified for each module, most of them being available at multiple sites. Utilizing available tools, a PRS could be implemented in each of the five participating German university hospitals. Conclusion A set of required features and data items of a PRS has been described for the first time. The software architecture covers all features in a clear, well-defined way. The variety of implementation options and the prototypes show that it is possible to implement the given architecture in different HIS environments, thus enabling more sites to successfully support patient recruitment in clinical trials. PMID:24734138

  17. Design and multicentric implementation of a generic software architecture for patient recruitment systems re-using existing HIS tools and routine patient data.

    PubMed

    Trinczek, B; Köpcke, F; Leusch, T; Majeed, R W; Schreiweis, B; Wenk, J; Bergh, B; Ohmann, C; Röhrig, R; Prokosch, H U; Dugas, M

    2014-01-01

    (1) To define features and data items of a Patient Recruitment System (PRS); (2) to design a generic software architecture of such a system covering the requirements; (3) to identify implementation options available within different Hospital Information System (HIS) environments; (4) to implement five PRS following the architecture and utilizing the implementation options as proof of concept. Existing PRS were reviewed and interviews with users and developers conducted. All reported PRS features were collected and prioritized according to their published success and user's request. Common feature sets were combined into software modules of a generic software architecture. Data items to process and transfer were identified for each of the modules. Each site collected implementation options available within their respective HIS environment for each module, provided a prototypical implementation based on available implementation possibilities and supported the patient recruitment of a clinical trial as a proof of concept. 24 commonly reported and requested features of a PRS were identified, 13 of them prioritized as being mandatory. A UML version 2 based software architecture containing 5 software modules covering these features was developed. 13 data item groups processed by the modules, thus required to be available electronically, have been identified. Several implementation options could be identified for each module, most of them being available at multiple sites. Utilizing available tools, a PRS could be implemented in each of the five participating German university hospitals. A set of required features and data items of a PRS has been described for the first time. The software architecture covers all features in a clear, well-defined way. The variety of implementation options and the prototypes show that it is possible to implement the given architecture in different HIS environments, thus enabling more sites to successfully support patient recruitment in clinical trials.

  18. Evaluation of Terrestrial LIDAR for Monitoring Geomorphic Change at Archeological Sites in Grand Canyon National Park, Arizona

    USGS Publications Warehouse

    Collins, Brian D.; Brown, Kristin M.; Fairley, Helen C.

    2008-01-01

    This report presents the results of an evaluation of terrestrial light detection and ranging (LIDAR) for monitoring geomorphic change at archeological sites located within Grand Canyon National Park, Ariz. Traditionally, topographic change-detection studies have used total station methods for the collection of data related to key measurable features of site erosion such as the location of thalwegs and knickpoints of gullies that traverse archeological sites (for example, Pederson and others, 2003). Total station methods require survey teams to walk within and on the features of interest within the archeological sites to take accurate measurements. As a result, site impacts may develop such as trailing, damage to cryptogamic crusts, and surface compaction that can exacerbate future erosion of the sites. National Park Service (NPS) resource managers have become increasingly concerned that repeated surveys for research and monitoring purposes may have a detrimental impact on the resources that researchers are trying to study and protect. Beginning in 2006, the Sociocultural Program of the U.S. Geological Survey's (USGS) Grand Canyon Monitoring and Research Center (GCMRC) initiated an evaluation of terrestrial LIDAR as a new monitoring tool that might enhance data quality and reduce site impacts. This evaluation was conducted as one part of an ongoing study to develop objective, replicable, quantifiable monitoring protocols for tracking the status and trend of variables affecting archeological site condition along the Colorado River corridor. The overall study consists of two elements: (1) an evaluation of the methodology through direct comparison to geomorphologic metrics already being collected by total station methods (this report) and (2) an evaluation of terrestrial LIDAR's ability to detect topographic change through the collection of temporally different datasets (a report on this portion of the study is anticipated early in 2009). The main goals of the first element of study were to 1. test the methodology and survey protocols of terrestrial LIDAR surveying under actual archeological site field conditions, 2. examine the ability to collect topographic data of entire archeological sites given such constraints as vegetation and rough topography, and 3. evaluate the ability of terrestrial LIDAR to accurately map the locations of key geomorphic features already being collected by total station methods such as gully thalweg and knickpoint locations. This report focuses on the ability of terrestrial LIDAR to duplicate total station methods, including typical erosion-related change features such as the plan view gully thalweg location and the gully thalweg long profile. The report also presents information concerning the use of terrestrial LIDAR for archeological site monitoring in a general sense. In addition, a detailed comparison of the site impacts caused by both total station and terrestrial LIDAR survey methods is presented using a suite of indicators, including total field survey time, field footstep count, and data-processing time. A thorough discussion of the relative benefits and limitations of using terrestrial LIDAR for monitoring erosion-induced changes at archeological sites in Grand Canyon National Park concludes this report.

  19. Multi-Site Diagnostic Classification of Schizophrenia Using Discriminant Deep Learning with Functional Connectivity MRI.

    PubMed

    Zeng, Ling-Li; Wang, Huaning; Hu, Panpan; Yang, Bo; Pu, Weidan; Shen, Hui; Chen, Xingui; Liu, Zhening; Yin, Hong; Tan, Qingrong; Wang, Kai; Hu, Dewen

    2018-04-01

    A lack of a sufficiently large sample at single sites causes poor generalizability in automatic diagnosis classification of heterogeneous psychiatric disorders such as schizophrenia based on brain imaging scans. Advanced deep learning methods may be capable of learning subtle hidden patterns from high dimensional imaging data, overcome potential site-related variation, and achieve reproducible cross-site classification. However, deep learning-based cross-site transfer classification, despite less imaging site-specificity and more generalizability of diagnostic models, has not been investigated in schizophrenia. A large multi-site functional MRI sample (n = 734, including 357 schizophrenic patients from seven imaging resources) was collected, and a deep discriminant autoencoder network, aimed at learning imaging site-shared functional connectivity features, was developed to discriminate schizophrenic individuals from healthy controls. Accuracies of approximately 85·0% and 81·0% were obtained in multi-site pooling classification and leave-site-out transfer classification, respectively. The learned functional connectivity features revealed dysregulation of the cortical-striatal-cerebellar circuit in schizophrenia, and the most discriminating functional connections were primarily located within and across the default, salience, and control networks. The findings imply that dysfunctional integration of the cortical-striatal-cerebellar circuit across the default, salience, and control networks may play an important role in the "disconnectivity" model underlying the pathophysiology of schizophrenia. The proposed discriminant deep learning method may be capable of learning reliable connectome patterns and help in understanding the pathophysiology and achieving accurate prediction of schizophrenia across multiple independent imaging sites. Copyright © 2018 German Center for Neurodegenerative Diseases (DZNE). Published by Elsevier B.V. All rights reserved.

  20. The effect of top-level domains and advertisements on health web-site credibility.

    PubMed

    Walther, Joseph B; Wang, Zuoming; Loh, Tracy

    2004-09-03

    Concerns over health information on the Internet have generated efforts to enhance credibility markers; yet how users actually assess the credibility of online health information is largely unknown. This study set out to (1) establish a parsimonious and valid questionnaire instrument to measure credibility of Internet health information by drawing on various previous measures of source, news, and other credibility scales; and (2) to identify the effects of Web-site domains and advertising on credibility perceptions. Respondents (N = 156) examined one of 12 Web-site mock-ups and completed credibility scales in a 3 x 2 x 2 between-subjects experimental design. Factor analysis and validity checks were used for item reduction, and analysis of variance was employed for hypothesis testing of Web-site features' effects. In an attempt to construct a credibility instrument, three dimensions of credibility (safety, trustworthiness, and dynamism) were retained, reflecting traditional credibility sub-themes, but composed of items from disparate sources. When testing the effect of the presence or absence of advertising on a Web site on credibility, we found that this depends on the site's domain, with a trend for advertisements having deleterious effects on the credibility of sites with .org domain, but positive effects on sites with .com or .edu domains. Health-information Web-site providers should select domains purposefully when they can, especially if they must accept on-site advertising. Credibility perceptions may not be invariant or stable, but rather are sensitive to topic and context. Future research may employ these findings in order to compare other forms of health-information delivery to optimal Web-site features.

  1. Quantum-enhanced feature selection with forward selection and backward elimination

    NASA Astrophysics Data System (ADS)

    He, Zhimin; Li, Lvzhou; Huang, Zhiming; Situ, Haozhen

    2018-07-01

    Feature selection is a well-known preprocessing technique in machine learning, which can remove irrelevant features to improve the generalization capability of a classifier and reduce training and inference time. However, feature selection is time-consuming, particularly for the applications those have thousands of features, such as image retrieval, text mining and microarray data analysis. It is crucial to accelerate the feature selection process. We propose a quantum version of wrapper-based feature selection, which converts a classical feature selection to its quantum counterpart. It is valuable for machine learning on quantum computer. In this paper, we focus on two popular kinds of feature selection methods, i.e., wrapper-based forward selection and backward elimination. The proposed feature selection algorithm can quadratically accelerate the classical one.

  2. Kinesiology taping and the world wide web: a quality and content analysis of internet-based information.

    PubMed

    Beutel, Bryan G; Cardone, Dennis A

    2014-10-01

    Due to limited regulation of websites, the quality and content of online health-related information has been questioned as prior studies have shown that websites often misrepresent orthopaedic conditions and treatments. Kinesio tape has gained popularity among athletes and the general public despite limited evidence supporting its efficacy. The primary objective of this study was to assess the quality and content of Internet-based information on Kinesio taping. An Internet search using the terms "Kinesio tape" and "kinesiology tape" was performed using the Google search engine. Websites returned within the first two pages of results, as well as hyperlinks embedded within these sites, were included in the study. These sites were subsequently classified by type. The quality of the website was determined by the Health On the Net (HON) score, an objective metric based upon recommendations from the United Nations for the ethical representation of health information. A content analysis was performed by noting specific misleading versus balanced features in each website. A total of 31 unique websites were identified. The majority of the websites (71%) were commercial. Out of a total possible 16 points, the mean HON score among the websites was 8.9 points (SD 2.2 points). The number of misleading features was significantly higher than the balanced features (p < 0.001). Fifty-eight percent of sites used anecdotal testimonials to promote the product. Only small percentages of websites discussed complications, alternatives, or provided accurate medical outcomes. Overall, commercial sites had a greater number of misleading features compared to non-commercial sites (p = 0.01). Websites discussing Kinesio tape are predominantly of poor quality and present misleading, imbalanced information. It is of ever-increasing importance that healthcare providers work to ensure that reliable, balanced, and accurate information be available to Internet users. IV.

  3. Science@NASA: Direct to People!

    NASA Technical Reports Server (NTRS)

    Koczor, Ronald J.; Adams, Mitzi; Gallagher, Dennis; Whitaker, Ann (Technical Monitor)

    2002-01-01

    Science@NASA is a science communication effort sponsored by NASA's Marshall Space Flight Center. It is the result of a four year research project between Marshall, the University of Florida College of Journalism and Communications and the internet communications company, Bishop Web Works. The goals of Science@NASA are to inform, inspire, and involve people in the excitement of NASA science by bringing that science directly to them. We stress not only the reporting of the facts of a particular topic, but also the context and importance of the research. Science@NASA involves several levels of activity from academic communications research to production of content for 6 websites, in an integrated process involving all phases of production. A Science Communications Roundtable Process is in place that includes scientists, managers, writers, editors, and Web technical experts. The close connection between the scientists and the writers/editors assures a high level of scientific accuracy in the finished products. The websites each have unique characters and are aimed at different audience segments: 1. http://science.nasa.gov. (SNG) Carries stories featuring various aspects of NASA science activity. The site carries 2 or 3 new stories each week in written and audio formats for science-attentive adults. 2. http://liftoff.msfc.nasa.gov. Features stories from SNG that are recast for a high school level audience. J-Track and J-Pass applets for tracking satellites are our most popular product. 3. http://kids. msfc.nasa.gov. This is the Nursemaids site and is aimed at a middle school audience. The NASAKids Club is a new feature at the site. 4. http://www.thursdaysclassroom.com . This site features lesson plans and classroom activities for educators centered around one of the science stories carried on SNG. 5. http://www.spaceweather.com. This site gives the status of solar activity and its interactions with the Earth's ionosphere and magnetosphere.

  4. 78 FR 59866 - New Car Assessment Program (NCAP)

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-09-30

    ... because ESC is now required for all light vehicles. For many years, NCAP has provided comparative... site, www.safercar.gov . NCAP provides comparative information on the safety performance and features... Features on www.safercar.gov are designed to assist drivers in avoiding backover crashes. After considering...

  5. Visualization of protein sequence features using JavaScript and SVG with pViz.js.

    PubMed

    Mukhyala, Kiran; Masselot, Alexandre

    2014-12-01

    pViz.js is a visualization library for displaying protein sequence features in a Web browser. By simply providing a sequence and the locations of its features, this lightweight, yet versatile, JavaScript library renders an interactive view of the protein features. Interactive exploration of protein sequence features over the Web is a common need in Bioinformatics. Although many Web sites have developed viewers to display these features, their implementations are usually focused on data from a specific source or use case. Some of these viewers can be adapted to fit other use cases but are not designed to be reusable. pViz makes it easy to display features as boxes aligned to a protein sequence with zooming functionality but also includes predefined renderings for secondary structure and post-translational modifications. The library is designed to further customize this view. We demonstrate such applications of pViz using two examples: a proteomic data visualization tool with an embedded viewer for displaying features on protein structure, and a tool to visualize the results of the variant_effect_predictor tool from Ensembl. pViz.js is a JavaScript library, available on github at https://github.com/Genentech/pviz. This site includes examples and functional applications, installation instructions and usage documentation. A Readme file, which explains how to use pViz with examples, is available as Supplementary Material A. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. Multimodal Event Detection in Twitter Hashtag Networks

    DOE PAGES

    Yilmaz, Yasin; Hero, Alfred O.

    2016-07-01

    In this study, event detection in a multimodal Twitter dataset is considered. We treat the hashtags in the dataset as instances with two modes: text and geolocation features. The text feature consists of a bag-of-words representation. The geolocation feature consists of geotags (i.e., geographical coordinates) of the tweets. Fusing the multimodal data we aim to detect, in terms of topic and geolocation, the interesting events and the associated hashtags. To this end, a generative latent variable model is assumed, and a generalized expectation-maximization (EM) algorithm is derived to learn the model parameters. The proposed method is computationally efficient, and lendsmore » itself to big datasets. Lastly, experimental results on a Twitter dataset from August 2014 show the efficacy of the proposed method.« less

  7. A Feature-Based Approach to Modeling Protein–DNA Interactions

    PubMed Central

    Segal, Eran

    2008-01-01

    Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position specific scoring matrix (PSSM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. Here, we present feature motif models (FMMs), a novel probabilistic method for modeling TF–DNA interactions, based on log-linear models. Our approach uses sequence features to represent TF binding specificities, where each feature may span multiple positions. We develop the mathematical formulation of our model and devise an algorithm for learning its structural features from binding site data. We also developed a discriminative motif finder, which discovers de novo FMMs that are enriched in target sets of sequences compared to background sets. We evaluate our approach on synthetic data and on the widely used TF chromatin immunoprecipitation (ChIP) dataset of Harbison et al. We then apply our algorithm to high-throughput TF ChIP data from mouse and human, reveal sequence features that are present in the binding specificities of mouse and human TFs, and show that FMMs explain TF binding significantly better than PSSMs. Our FMM learning and motif finder software are available at http://genie.weizmann.ac.il/. PMID:18725950

  8. Classifying brain metastases by their primary site of origin using a radiomics approach based on texture analysis: a feasibility study.

    PubMed

    Ortiz-Ramón, Rafael; Larroza, Andrés; Ruiz-España, Silvia; Arana, Estanislao; Moratal, David

    2018-05-14

    To examine the capability of MRI texture analysis to differentiate the primary site of origin of brain metastases following a radiomics approach. Sixty-seven untreated brain metastases (BM) were found in 3D T1-weighted MRI of 38 patients with cancer: 27 from lung cancer, 23 from melanoma and 17 from breast cancer. These lesions were segmented in 2D and 3D to compare the discriminative power of 2D and 3D texture features. The images were quantized using different number of gray-levels to test the influence of quantization. Forty-three rotation-invariant texture features were examined. Feature selection and random forest classification were implemented within a nested cross-validation structure. Classification was evaluated with the area under receiver operating characteristic curve (AUC) considering two strategies: multiclass and one-versus-one. In the multiclass approach, 3D texture features were more discriminative than 2D features. The best results were achieved for images quantized with 32 gray-levels (AUC = 0.873 ± 0.064) using the top four features provided by the feature selection method based on the p-value. In the one-versus-one approach, high accuracy was obtained when differentiating lung cancer BM from breast cancer BM (four features, AUC = 0.963 ± 0.054) and melanoma BM (eight features, AUC = 0.936 ± 0.070) using the optimal dataset (3D features, 32 gray-levels). Classification of breast cancer and melanoma BM was unsatisfactory (AUC = 0.607 ± 0.180). Volumetric MRI texture features can be useful to differentiate brain metastases from different primary cancers after quantizing the images with the proper number of gray-levels. • Texture analysis is a promising source of biomarkers for classifying brain neoplasms. • MRI texture features of brain metastases could help identifying the primary cancer. • Volumetric texture features are more discriminative than traditional 2D texture features.

  9. Full-text automated detection of surgical site infections secondary to neurosurgery in Rennes, France.

    PubMed

    Campillo-Gimenez, Boris; Garcelon, Nicolas; Jarno, Pascal; Chapplain, Jean Marc; Cuggia, Marc

    2013-01-01

    The surveillance of Surgical Site Infections (SSI) contributes to the management of risk in French hospitals. Manual identification of infections is costly, time-consuming and limits the promotion of preventive procedures by the dedicated teams. The introduction of alternative methods using automated detection strategies is promising to improve this surveillance. The present study describes an automated detection strategy for SSI in neurosurgery, based on textual analysis of medical reports stored in a clinical data warehouse. The method consists firstly, of enrichment and concept extraction from full-text reports using NOMINDEX, and secondly, text similarity measurement using a vector space model. The text detection was compared to the conventional strategy based on self-declaration and to the automated detection using the diagnosis-related group database. The text-mining approach showed the best detection accuracy, with recall and precision equal to 92% and 40% respectively, and confirmed the interest of reusing full-text medical reports to perform automated detection of SSI.

  10. Effects of Individual Differences and Situational Features on Age Differences in Mindless Reading.

    PubMed

    Shake, Matthew C; Shulley, Leah J; Soto-Freita, Angelica M

    2016-09-01

    Mindless reading occurs when an individual shifts their attention away from the text and toward other off-task thoughts. This study examined whether previously reported age-related declines in mindless reading episodes are due primarily to (a) situational features related to the text itself (e.g., text genre or interest in the text) and/or (b) individual differences in cognitive ability. Participants read 2 texts written in different genres but about the same topic. During reading, they were randomly probed to indicate whether they were on-task or mind-wandering. They also indicated their perceptions regarding the interest and difficulty of the text, and completed a battery of cognitive ability measures. The results showed that (a) text genre may engender some age differences in mindless reading and (b) greater age and perceived interest in the text were each uniquely predictive of reduced mindless reading for both text genres. Individual differences in cognitive abilities (e.g., working memory, vocabulary) did not account for additional significant variance in mindless reading after interest and age were taken into account. Our findings are discussed in terms of implications for age differences in lapses of attention during reading and predictors of mind-wandering generally. © The Author 2015. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  11. SITE-SPECIFIC PROTOCOL FOR MEASURING SOIL RADON POTENTIALS FOR FLORIDA HOUSES

    EPA Science Inventory

    The report describes a protocol for site-specific measurement of radon potentials for Florida houses that is consistent with existing residential radon protection maps. The protocol gives further guidance on the possible need for radon-protective house construction features. In a...

  12. Environmental cleanup: The challenge at the Hanford Site, Washington, USA

    NASA Astrophysics Data System (ADS)

    Gray, Robert H.; Becker, C. Dale

    1993-07-01

    Numerous challenges face those involved with developing a coordinated and consistent approach to cleaning up the US Department of Energy’s (DOE) Hanford Site in southeastern Washington. These challenges are much greater than those encountered when the site was selected and the world’s first nuclear complex was developed almost 50 years ago. This article reviews Hanford’s history, operations, waste storage/disposal activities, environmental monitoring, and today’s approach to characterize and clean up Hanford under a Federal Facility Agreement and Consent Order, signed by DOE, the Environmental Protection Agency, and the Washington Sate Department of Ecology. Although cleanup of defense-related waste at Hanford holds many positive benefits, negative features include high costs to the US taxpayer, numerous uncertainties concerning the technologies to be employed and the risks involved, and the high probability that special interest groups and activists at large will never be completely satisfied. Issues concerning future use of the site, whether to protect and preserve its natural features or open it to public exploitation, remain to be resolved.

  13. Machine learning approaches to diagnosis and laterality effects in semantic dementia discourse.

    PubMed

    Garrard, Peter; Rentoumi, Vassiliki; Gesierich, Benno; Miller, Bruce; Gorno-Tempini, Maria Luisa

    2014-06-01

    Advances in automatic text classification have been necessitated by the rapid increase in the availability of digital documents. Machine learning (ML) algorithms can 'learn' from data: for instance a ML system can be trained on a set of features derived from written texts belonging to known categories, and learn to distinguish between them. Such a trained system can then be used to classify unseen texts. In this paper, we explore the potential of the technique to classify transcribed speech samples along clinical dimensions, using vocabulary data alone. We report the accuracy with which two related ML algorithms [naive Bayes Gaussian (NBG) and naive Bayes multinomial (NBM)] categorized picture descriptions produced by: 32 semantic dementia (SD) patients versus 10 healthy, age-matched controls; and SD patients with left- (n = 21) versus right-predominant (n = 11) patterns of temporal lobe atrophy. We used information gain (IG) to identify the vocabulary features that were most informative to each of these two distinctions. In the SD versus control classification task, both algorithms achieved accuracies of greater than 90%. In the right- versus left-temporal lobe predominant classification, NBM achieved a high level of accuracy (88%), but this was achieved by both NBM and NBG when the features used in the training set were restricted to those with high values of IG. The most informative features for the patient versus control task were low frequency content words, generic terms and components of metanarrative statements. For the right versus left task the number of informative lexical features was too small to support any specific inferences. An enriched feature set, including values derived from Quantitative Production Analysis (QPA) may shed further light on this little understood distinction. Copyright © 2013 Elsevier Ltd. All rights reserved.

  14. Spatial variation in breeding habitat selection by Cerulean Warblers (Setophaga cerulea) throughout the Appalachian Mountains

    USGS Publications Warehouse

    Boves, Than J.; Buehler, David A.; Sheehan, James; Wood, Petra Bohall; Rodewald, Amanda D.; Larkin, Jeffrey L.; Keyser, Patrick D.; Newell, Felicity L.; Evans, Andrea; George, Gregory A.; Wigley, T.B.

    2013-01-01

    Studies of habitat selection are often of limited utility because they focus on small geographic areas, fail to examine behavior at multiple scales, or lack an assessment of the fitness consequences of habitat decisions. These limitations can hamper the identification of successful site-specific management strategies, which are urgently needed for severely declining species like Cerulean Warblers (Setophaga cerulea). We assessed how breeding habitat decisions made by Cerulean Warblers at multiple scales, and the subsequent effects of these decisions on nest survival, varied across the Appalachian Mountains. Selection for structural habitat features varied substantially among areas, particularly at the territory scale. Males within the least-forested landscapes selected microhabitat features that reflected more closed-canopy forest conditions, whereas males in highly forested landscapes favored features associated with canopy disturbance. Selection of nest-patch and nest-site attributes by females was more consistent across areas, with females selecting for increased tree size and understory cover and decreased basal area and midstory cover. Floristic preferences were similar across study areas: White Oak (Quercus alba), Cucumber-tree (Magnolia acuminata), and Sugar Maple (Acer saccharum) were preferred as nest trees, whereas red oak species (subgenus Erythrobalanus) and Red Maple (A. rubrum) were avoided. The habitat features that were related to nest survival also varied among study areas, and preferred features were negatively associated with nest survival at one area. Thus, our results indicate that large-scale spatial heterogeneity may influence local habitat-selection behavior and that it may be necessary to articulate site-specific management strategies for Cerulean Warblers.

  15. Internet marketing directed at children on food and restaurant websites in two policy environments.

    PubMed

    Kent, M Potvin; Dubois, L; Kent, E A; Wanless, A J

    2013-04-01

    Food and beverage marketing has been associated with childhood obesity yet little research has examined the influence of advertising policy on children's exposure to food/beverage marketing on the Internet. The purpose of this study was to assess the influence of Quebec's Consumer Protection Act and the self-regulatory Canadian Children's Food and Beverage Advertising Initiative (CAI) on food manufacturer and restaurant websites in Canada. A content analysis of 147 French and English language food and restaurant websites was undertaken. The presence of child-directed content was assessed and an analysis of marketing features, games and activities, child protection features, and the promotion of healthy lifestyle messages was then examined on those sites with child-directed content. There were statistically no fewer French language websites (n = 22) with child-directed content compared to English language websites (n = 27). There were no statistically significant differences in the number of the various marketing features, or in the average number of marketing features between the English and French websites. There were no fewer CAI websites (n = 14) with child-directed content compared to non-CAI websites (n = 13). The CAI sites had more healthy lifestyle messages and child protection features compared to the non-CAI sites. Systematic surveillance of the Consumer Protection Act in Quebec is recommended. In the rest of Canada, the CAI needs to be significantly expanded or replaced by regulatory measures to adequately protect children from the marketing of foods/beverages high in fat, sugar, and sodium on the Internet. Copyright © 2012 The Obesity Society.

  16. Mobile Code: The Future of the Internet

    DTIC Science & Technology

    1999-01-01

    code ( mobile agents) to multiple proxies or servers " Customization " (e.g., re-formatting, filtering, metasearch) Information overload Diversified... Mobile code is necessary, rather than client-side code, since many customization features (such as information monitoring) do not work if the...economic foundation for Web sites, many Web sites earn money solely from advertisements . If these sites allow mobile agents to easily access the content

  17. Site Plan: A First Step.

    ERIC Educational Resources Information Center

    Gould, Bryant; Finci, David

    1986-01-01

    A campus-wide site plan looks at the campus as a whole and defines immediate and long-range problems and potential. Text and photographs illustrate site development at McClennan Community College in Waco, Texas, and at the University of Toledo and the University of Dayton, both in Ohio. (MLF)

  18. Temporal Evolution of SL-9 Impact Sites on Jupiter and Global Maps of Jupiter from Multi-Observatory Visible and Infrared Images

    NASA Technical Reports Server (NTRS)

    Limaye, Sanjay S.

    1996-01-01

    The objective of this research was to investigate the temporal behavior of the impact features on Jupiter created by the fragments of the Shoemaker Levy-9 comet that collided with the planet in July 1994. The primary observations used in the study were ground based images of Jupiter acquired from the Swedish Solar Vacuum Tube on the island of La Palma in the Canary Islands. The measurement of position of the impact features in images acquired immediately after the impact over a period of a few days revealed that the apparent drift rates were too high and that a repetitive pattern could be seen in the longitude position on successive rotations. This could be explained only by the fact that the measured longitudes of the impact sites were being affected by parallax due to a significant elevation of the impact debris above the nominal cloud top altitude value used for image navigation. Once the apparent positions are analyzed as a function of the meridian angle, the parallax equation can be used to infer the height of the impact features above the cloud deck, once the true impact position (longitude) for the feature is known. Due to their inherent high spatial resolution, the HST measurements of the impact site locations have been accepted widely. However, these suffer from the parallax themselves since few of them were obtained at central meridian. Ground based imaging have the potential to improve this knowledge as they do observe most of the impact sites on either side of the central meridian, except for the degraded resolution. Measurements over a large number of images enables us to minimize the position error through regression and thus estimate both the actual impact site location devoid of parallax bias, and also of the altitude level of the impact debris above the cloud deck. With rapid imaging there is the potential to examine the time evolution of the altitude level. Several hundred ground based images were processed, navigated and subjected to the impact site location measurements. HST images were also acquired and used to calibrate the results and to improve the sample. The resources available enabled an in-depth study only of impact site A, however, many more images have since become available through the global network observations through Lowell Observatory.

  19. Thermal Imaging of the Waccasassa Bay Preserve: Image Acquisition and Processing

    USGS Publications Warehouse

    Raabe, Ellen A.; Bialkowska-Jelinska, Elzbieta

    2010-01-01

    Thermal infrared (TIR) imagery was acquired along coastal Levy County, Florida, in March 2009 with the goal of identifying groundwater-discharge locations in Waccasassa Bay Preserve State Park (WBPSP). Groundwater discharge is thermally distinct in winter when Floridan aquifer temperature, 71-72 degrees F, contrasts with the surrounding cold surface waters. Calibrated imagery was analyzed to assess temperature anomalies and related thermal traces. The influence of warm Gulf water and image artifacts on small features was successfully constrained by image evaluation in three separate zones: Creeks, Bay, and Gulf. Four levels of significant water-temperature anomalies were identified, and 488 sites of interest were mapped. Among the sites identified, at least 80 were determined to be associated with image artifacts and human activity, such as excavation pits and the Florida Barge Canal. Sites of interest were evaluated for geographic concentration and isolation. High site densities, indicating interconnectivity and prevailing flow, were located at Corrigan Reef, No. 4 Channel, Winzy Creek, Cow Creek, Withlacoochee River, and at excavation sites. In other areas, low to moderate site density indicates the presence of independent vents and unique flow paths. A directional distribution assessment of natural seep features produced a northwest trend closely matching the strike direction of regional faults. Naturally occurring seeps were located in karst ponds and tidal creeks, and several submerged sites were detected in Waccasassa River and Bay, representing the first documentation of submarine vents in the Waccasassa region. Drought conditions throughout the region placed constraints on positive feature identification. Low discharge or displacement by landward movement of saltwater may have reduced or reversed flow during this season. Approximately two-thirds of seep locations in the overlap between 2009 and 2005 TIR night imagery were positively re-identified in 2009. These results indicate a 33 percent chance of feature omission in the 2009 imagery. This assessment of seep location and distribution contributes to an understanding of the underlying geology, the role of fault and fracture patterns, and the presence of both interconnected and constrained flow paths in the region. The maps and evaluations will enhance Park management efforts, interpretation of Park resources, and increase understanding of the combined effects of land and water use on the coastal lowlands, estuarine habitats, and natural resources of WBPSP.

  20. The Evaluation of High School Geography 9 and High School Geography 11 Text Books with Some Formulas of Readability

    ERIC Educational Resources Information Center

    Gecit, Yilmaz

    2010-01-01

    The purpose of this study is to evaluate readability of 9th and 11th grade geography text-books currently used in schools. As known, one of the most fundamental features in a text-book is the readability of the text by students. In addition, it is also very important that the fluency and suitability of books match age level. In this study, the…

  1. Chromatin Landscapes of Retroviral and Transposon Integration Profiles

    PubMed Central

    Badhai, Jitendra; Rust, Alistair G.; Rad, Roland; Hilkens, John; Berns, Anton; van Lohuizen, Maarten; Wessels, Lodewyk F. A.; de Ridder, Jeroen

    2014-01-01

    The ability of retroviruses and transposons to insert their genetic material into host DNA makes them widely used tools in molecular biology, cancer research and gene therapy. However, these systems have biases that may strongly affect research outcomes. To address this issue, we generated very large datasets consisting of to unselected integrations in the mouse genome for the Sleeping Beauty (SB) and piggyBac (PB) transposons, and the Mouse Mammary Tumor Virus (MMTV). We analyzed (epi)genomic features to generate bias maps at both local and genome-wide scales. MMTV showed a remarkably uniform distribution of integrations across the genome. More distinct preferences were observed for the two transposons, with PB showing remarkable resemblance to bias profiles of the Murine Leukemia Virus. Furthermore, we present a model where target site selection is directed at multiple scales. At a large scale, target site selection is similar across systems, and defined by domain-oriented features, namely expression of proximal genes, proximity to CpG islands and to genic features, chromatin compaction and replication timing. Notable differences between the systems are mainly observed at smaller scales, and are directed by a diverse range of features. To study the effect of these biases on integration sites occupied under selective pressure, we turned to insertional mutagenesis (IM) screens. In IM screens, putative cancer genes are identified by finding frequently targeted genomic regions, or Common Integration Sites (CISs). Within three recently completed IM screens, we identified 7%–33% putative false positive CISs, which are likely not the result of the oncogenic selection process. Moreover, results indicate that PB, compared to SB, is more suited to tag oncogenes. PMID:24721906

  2. Coastline shifts and probable ship landing site submerged off ancient Locri-Epizefiri, southern Italy

    USGS Publications Warehouse

    Tennent, J.M.; Stanley, J.-D.; Hart, P.E.; Bernasconi, M.P.

    2009-01-01

    A geophysical survey provides new information on marine features located seaward of Locri-Epizefiri (Locri), an ancient Greek settlement on the Ionian coastal margin in southern Italy. The study supplements previous work by archaeologists who long searched for the site's harbor and recently identified what was once a marine basin that is now on land next to the city walls of Locri. Profiles obtained offshore, between the present coast and outer shelf, made with a high-resolution, seismic subbottom-profiling system, record spatial and temporal variations of buried Holocene deposits. Two of these submerged features are part of a probable now-submerged ship landing facility. The offshore features can be linked to coastline displacements that occurred off Locri: a sea-to-land shift before Greek settlement, followed by a shoreline reversal from the archaeological site back to sea, and more recently, a return landward. The seaward directed coastal shift that occurred after Locri's occupation by Greeks was likely caused by land uplift near the coastal margin and tectonic seaward shift of the coast, as documented along this geologically active sector of the Calabrian Arc. The seismic survey records an angular, hook-shaped, low rise that extends from the present shore and is now buried on the inner shelf. The rise, enclosing a core lens of poorly stratified to transparent acoustic layers, bounds a broad, low-elevation zone positioned immediately seaward of the shoreline. Close proximity of the raised feature to the low-elevation area suggests it may have been a fabricated structure that functioned as a wave-break for a ship-landing site. The study indicates that the basin extended offshore as a function of the coastline's seaward migration during and/or after Greek occupation of Locri.

  3. Mines, prospects, and occurrences of nonmetallic mineral commodities in the Greenville 1 degree by 2 degrees Quadrangle, South Carolina, Georgia, and North Carolina

    USGS Publications Warehouse

    D'Agostino, John P.; O'Connor, Bruce J.; Zupan, Alan J.W.; Maybin, Arthur H.

    1994-01-01

    Mines, prospects, and occurrences of nonmetal mineral commodities in the Greenville 1° x 2° quadrangle are tabulated in this report. There are 488 symbols representing 579 mines, prospects, and occurrences located in the quadrangle. There are 379 symbols used for 466 features in Georgia, 106 symbols for 110 features in South Carolina, and 3 symbols for 3 features in North Carolina. The table lists, in consecutive orders for each county (fig. 1), the map number of each feature, which correlates and locates the item on the accompanying Greenville 1° x 2° quadrangle map. Also listed are the known name of the feature; the 7.5 topographic map on which the commodity site is located; the Transverse Mercator (UTM) northing and easting grid coordinates from the appropriate 7.5’ topographic map; the commodity; remarks; and references. Some locations are known, but many sites are not verified and their locations are only approximate. Reference are listed in References Cited and referred to by number to save space. The generalized tectonic framework for the quadrangle is shown in figure 2.

  4. Aggressive television ad campaign for Cooper University Hospital features hometown celebrity.

    PubMed

    2006-01-01

    Cooper University Hospital in Camden, NJ, features an extensive ambulatory care network that includes practice sites across eight counties of Southern New Jersey. Recently, the hospital worked with Willing Strategic Advertising to produce an award-winning television advertising campaign endorsed by New Jersey-born TV personality, Kelly Ripa.

  5. What If? Conditionals in Educational Registers

    ERIC Educational Resources Information Center

    Louwerse, Max M.; Crossley, Scott A.; Jeuniaux, Patrick

    2008-01-01

    Many corpus linguistic studies have investigated classification of texts into genres and registers, but relatively few of these studies have looked at linguistic features in educational registers. From a pedagogical perspective it is important to determine whether certain linguistic features behave differently across registers within particular…

  6. Workbook on Identification of Aedes Aegypti Larvae.

    ERIC Educational Resources Information Center

    Pratt, Harry D.; And Others

    This self-instructional booklet is designed to enable yellow fever control workers to identify the larvae of "Aedes aegypti." The morphological features of mosquito larvae are illustrated in this partially programed text, and the distinguishing features of "A. aegypti" indicated. A glossary is included. (AL)

  7. CONTRASTIVE CULTURAL FEATURES IN FL TEACHING.

    ERIC Educational Resources Information Center

    FISCHER, MILLA

    CONTRASTIVE CULTURAL FEATURES SHOULD BE INCLUDED WITHIN THE FRAMEWORK OF THE GRAMMATICAL LESSON AS A MEANS OF COUNTERBALANCING THE GENERALLY UNSATISFACTORY MATERIAL USED FOR RUSSIAN TEXTS. LESSONS FOR AMERICAN STUDENTS LEARNING RUSSIAN SHOULD INCLUDE PHONOLOGICAL DRILLS ON VOWEL LENGTHS, DISTRIBUTION OF VOICED OBSTRUENTS, AND OBSTRUENT CLUSTERS,…

  8. Automatic Text Summarization for Indonesian Language Using TextTeaser

    NASA Astrophysics Data System (ADS)

    Gunawan, D.; Pasaribu, A.; Rahmat, R. F.; Budiarto, R.

    2017-04-01

    Text summarization is one of the solution for information overload. Reducing text without losing the meaning not only can save time to read, but also maintain the reader’s understanding. One of many algorithms to summarize text is TextTeaser. Originally, this algorithm is intended to be used for text in English. However, due to TextTeaser algorithm does not consider the meaning of the text, we implement this algorithm for text in Indonesian language. This algorithm calculates four elements, such as title feature, sentence length, sentence position and keyword frequency. We utilize TextRank, an unsupervised and language independent text summarization algorithm, to evaluate the summarized text yielded by TextTeaser. The result shows that the TextTeaser algorithm needs more improvement to obtain better accuracy.

  9. Evaluation of Mapping Methodologies at a Legacy Test Site

    NASA Astrophysics Data System (ADS)

    Sussman, A. J.; Schultz-Fellenz, E. S.; Roback, R. C.; Kelley, R. E.; Drellack, S.; Reed, D.; Miller, E.; Cooper, D. I.; Sandoval, M.; Wang, R.

    2013-12-01

    On June 12th, 1985, a nuclear test with an announced yield between 20-150kt was detonated in rhyolitic lava in a vertical emplacement borehole at a depth of 608m below the surface. This test did not collapse to the surface and form a crater, but rather resulted in a subsurface collapse with more subtle surface expressions of deformation, providing an opportunity to evaluate the site using a number of surface mapping methodologies. The site was investigated over a two-year time span by several mapping teams. In order to determine the most time efficient and accurate approach for mapping post-shot surface features at a legacy test site, a number of different techniques were employed. The site was initially divided into four quarters, with teams applying various methodologies, techniques, and instrumentations to each quarter. Early methods included transect lines and site gridding with a Brunton pocket transit, flagging tape, measuring tape, and stakes; surveying using a hand-held personal GPS to locate observed features with an accuracy of × 5-10m; and extensive photo-documentation. More recent methods have incorporated the use of near survey grade GPS devices to allow careful location and mapping of surface features. Initially, gridding was employed along with the high resolution GPS surveys, but this was found to be time consuming and of little observational value. Raw visual observation (VOB) data included GPS coordinates for artifacts or features of interest, field notes, and photographs. A categorization system was used to organize the myriad of items, in order to aid in database searches and for visual presentation of findings. The collected data set was imported into a geographic information system (GIS) as points, lines, or polygons and overlain onto a digital color orthophoto map of the test site. Once these data were mapped, spectral data were collected using a high resolution field spectrometer. In addition to geo-locating the field observations with 10cm resolution GPS, LiDAR and hyperspectral imagery were also acquired. The LiDAR and hyperspectral data are being processed and will be added to the existing geo-referenced database as separate information layers for remote sensing analysis of surface features associated with the legacy test. By consolidating the various components of a VOB data point (coordinates, photo and item description) into a standalone database, searching or querying for other components or collects such as subsurface geophysical and/or airborne imagery is made much easier. Work by Los Alamos National Laboratory was sponsored by the National Nuclear Security Administration Award No. DE-AC52-06NA25946/NST10-NCNS-PD00. Work by National Security Technologies, LLC, was performed under Contract No. DE AC52 06NA25946 with the U.S. Department of Energy.

  10. Photogrammetric analysis of horizon panoramas: The Pathfinder landing site in Viking orbiter images

    USGS Publications Warehouse

    Oberst, J.; Jaumann, R.; Zeitler, W.; Hauber, E.; Kuschel, M.; Parker, T.; Golombek, M.; Malin, M.; Soderblom, L.

    1999-01-01

    Tiepoint measurements, block adjustment techniques, and sunrise/sunset pictures were used to obtain precise pointing data with respect to north for a set of 33 IMP horizon images. Azimuth angles for five prominent topographic features seen at the horizon were measured and correlated with locations of these features in Viking orbiter images. Based on this analysis, the Pathfinder line/sample coordinates in two raw Viking images were determined with approximate errors of 1 pixel, or 40 m. Identification of the Pathfinder location in orbit imagery yields geological context for surface studies of the landing site. Furthermore, the precise determination of coordinates in images together with the known planet-fixed coordinates of the lander make the Pathfinder landing site the most important anchor point in current control point networks of Mars. Copyright 1999 by the American Geophysical Union.

  11. Sir William Herschel's notebooks - Abstracts of solar observations

    NASA Technical Reports Server (NTRS)

    Hoyt, Douglas V.; Schatten, Kenneth H.

    1992-01-01

    An introduction to the background of Sir William Herschel's notebooks and the historical context within which his observations were made are provided. The observations have relevance in reconstructing solar behavior, as discussed in a separate analysis paper by Hoyt and Schatten (1992), and in understanding active features on the sun such as faculae. The text of Herschel's notebooks with modern terms used throughout forms the body of this paper. The complete text has not previously been published and is not easily accessible to scholars. Herschel used different words for solar features than are used today, and thus, for clarity, his terminology is changed on two occasions. A glossary explains the terminology changed. In the text of the notebooks, several contemporaries are mentioned; a brief description of Herschel's colleagues is provided.

  12. Typha latifolia (broadleaf cattail) as bioindicator of different types of pollution in aquatic ecosystems-application of self-organizing feature map (neural network).

    PubMed

    Klink, Agnieszka; Polechońska, Ludmiła; Cegłowska, Aurelia; Stankiewicz, Andrzej

    2016-07-01

    The contents of Cd, Cu, Fe, Mn, Ni, Pb, and Zn in leaves of Typha latifolia (broadleaf cattail), water and bottom sediment from 72 study sites designated in different regions of Poland were determined using atomic absorption spectrometry. The aim of the study was to evaluate potential use of T. latifolia in biomonitoring of trace metal pollution. The self-organizing feature map (SOFM) identifying groups of sampling sites with similar concentrations of metals in cattail leaves was able to classify study sites according to similar use and potential sources of pollution. Maps prepared for water and bottom sediment showed corresponding groups of sampling sites which suggested similarity of samples features. High concentrations of Fe, Cd, Cu, and Ni were characteristic for industrial areas. Elevated Pb concentrations were noted in regions with intensive vehicle traffic, while high Mn and Zn contents were reported in leaves from the agricultural area. Manganese content in leaves of T. latifolia was high irrespectively of the concentrations in bottom sediments and water so cattail can be considered the leaf accumulator of Mn. Once trained, SOFMs can be applied in ecological investigations and could form a future basis for recognizing the type of pollution in aquatic environments by analyzing the concentrations of elements in T. latifolia.

  13. Social Networking Adapted for Distributed Scientific Collaboration

    NASA Technical Reports Server (NTRS)

    Karimabadi, Homa

    2012-01-01

    Share is a social networking site with novel, specially designed feature sets to enable simultaneous remote collaboration and sharing of large data sets among scientists. The site will include not only the standard features found on popular consumer-oriented social networking sites such as Facebook and Myspace, but also a number of powerful tools to extend its functionality to a science collaboration site. A Virtual Observatory is a promising technology for making data accessible from various missions and instruments through a Web browser. Sci-Share augments services provided by Virtual Observatories by enabling distributed collaboration and sharing of downloaded and/or processed data among scientists. This will, in turn, increase science returns from NASA missions. Sci-Share also enables better utilization of NASA s high-performance computing resources by providing an easy and central mechanism to access and share large files on users space or those saved on mass storage. The most common means of remote scientific collaboration today remains the trio of e-mail for electronic communication, FTP for file sharing, and personalized Web sites for dissemination of papers and research results. Each of these tools has well-known limitations. Sci-Share transforms the social networking paradigm into a scientific collaboration environment by offering powerful tools for cooperative discourse and digital content sharing. Sci-Share differentiates itself by serving as an online repository for users digital content with the following unique features: a) Sharing of any file type, any size, from anywhere; b) Creation of projects and groups for controlled sharing; c) Module for sharing files on HPC (High Performance Computing) sites; d) Universal accessibility of staged files as embedded links on other sites (e.g. Facebook) and tools (e.g. e-mail); e) Drag-and-drop transfer of large files, replacing awkward e-mail attachments (and file size limitations); f) Enterprise-level data and messaging encryption; and g) Easy-to-use intuitive workflow.

  14. Searching for the IRA "disappeared": ground-penetrating radar investigation of a churchyard burial site, Northern Ireland.

    PubMed

    Ruffell, Alastair

    2005-11-01

    A search for the body of a victim of terrorist abduction and murder was made in a graveyard on the periphery of a major conurbation in Northern Ireland. The area is politically sensitive and the case of high profile. This required non-invasive, completely non-destructive and rapid assessment of the scene. A MALA RAMAC ground-penetrating radar system was used to achieve these objectives. Unprocessed and processed 400 MHz data show the presence of a collapse feature above and around a known 1970s burial with no similar collapse above the suspect location. In the saturated, clay-rich sediments of the site, 200 MHz data offered no advantage over 400 MHz data. Unprocessed 100 MHz data shows a series of multiples in the known burial with no similar features in the suspect location. Processed 100 MHz lines defined the shape of the collapse around the known burial to 2 m depth, together with the geometry of the platform (1 m depth) the gravedigger used in the 1970s to construct the site. In addition, processed 100 MHz data showed both the dielectric contrast in and internal reflection geometry of the soil imported above the known grave. Thus the sequence, geometry, difference in infill and infill direction of the grave was reconstructed 30 years after burial. The suspect site showed no evidence of shallow or deep inhumation. Subsequently, the missing person's body was found some distance from this site, vindicating the results and interpretation from ground-penetrating radar. The acquisition, processing, collapse feature and sequence stratigraphic interpretation of the known burial and empty (suspect) burial site may be useful proxies for other, similar investigations. GPR was used to evaluate this site within 3 h of the survey commencing, using unprocessed data. An additional day of processing established that the suspect body did not reside here, which was counter to police and community intelligence.

  15. Cell Phone Decision Making: Adolescents' Perceptions of How and Why They Make the Choice to Text or Call

    ERIC Educational Resources Information Center

    Blair, Bethany L.; Fletcher, Anne C.; Gaskin, Erin R.

    2015-01-01

    The primary aim of this study was to examine how and why adolescents make decisions regarding whether to conduct their communication via texting versus calling features of cellular telephones. Individual semistructured qualitative interviews were conducted with 41 adolescents aged 14 to 18 focusing on their use of calling and texting when…

  16. Reading Guided by Automated Graphical Representations: How Model-Based Text Visualizations Facilitate Learning in Reading Comprehension Tasks

    ERIC Educational Resources Information Center

    Pirnay-Dummer, Pablo; Ifenthaler, Dirk

    2011-01-01

    Our study integrates automated natural language-oriented assessment and analysis methodologies into feasible reading comprehension tasks. With the newly developed T-MITOCAR toolset, prose text can be automatically converted into an association net which has similarities to a concept map. The "text to graph" feature of the software is based on…

  17. The Role of Reading Time Complexity and Reading Speed in Text Comprehension

    ERIC Educational Resources Information Center

    Wallot, Sebastian; O'Brien, Beth A.; Haussmann, Anna; Kloos, Heidi; Lyby, Marlene S.

    2014-01-01

    Reading speed is commonly used as an index of reading fluency. However, reading speed is not a consistent predictor of text comprehension, when speed and comprehension are measured on the same text within the same reader. This might be due to the somewhat ambiguous nature of reading speed, which is sometimes regarded as a feature of the reading…

  18. Solutions for Coding Societal Events

    DTIC Science & Technology

    2016-12-01

    develop a prototype system for civil unrest event extraction, and (3) engineer BBN ACCENT (ACCurate Events from Natural Text ) to support broad use by...56 iv List of Tables Table 1: Features in similarity metric. Abbreviations are as follows. TG: text graph...extraction of a stream of events (e.g. protests, attacks, etc.) from unstructured text (e.g. news, social media). This technical report presents results

  19. Addressing Students' Alternative Conceptions on the Propagation of Periodic Waves Using a Refutational Text

    ERIC Educational Resources Information Center

    Caleon, Imelda; Subramaniam, R.

    2013-01-01

    The effectiveness of a refutational text in addressing the alternative conceptions held by secondary school students on the topic of wave propagation in an elastic medium was explored in this study. The refutational text, which was 816 words long and featured the particle-spring model, was found to be more effective in promoting conceptual change…

  20. English Textbooks for Russian Students: Problems and Specific Features

    ERIC Educational Resources Information Center

    Solnyshkina, Marina I.; Vishnyakova, Ol'ga D.; Gafiyatova, Elzara V.; Gabitov, Azat I.

    2017-01-01

    The research identifies the complexity level of eight texts from Spotlight 11 used in Russian TEFL to prepare students for National Unified Exam in English and assess their reading skills. The results of the analyses conducted with the help of T.E.R.A., an automated text processor, prove that all texts fell within the range of 6-9 Flesch-Kincaid…

  1. Sentiment analysis: a comparison of deep learning neural network algorithm with SVM and naϊve Bayes for Indonesian text

    NASA Astrophysics Data System (ADS)

    Calvin Frans Mariel, Wahyu; Mariyah, Siti; Pramana, Setia

    2018-03-01

    Deep learning is a new era of machine learning techniques that essentially imitate the structure and function of the human brain. It is a development of deeper Artificial Neural Network (ANN) that uses more than one hidden layer. Deep Learning Neural Network has a great ability on recognizing patterns from various data types such as picture, audio, text, and many more. In this paper, the authors tries to measure that algorithm’s ability by applying it into the text classification. The classification task herein is done by considering the content of sentiment in a text which is also called as sentiment analysis. By using several combinations of text preprocessing and feature extraction techniques, we aim to compare the precise modelling results of Deep Learning Neural Network with the other two commonly used algorithms, the Naϊve Bayes and Support Vector Machine (SVM). This algorithm comparison uses Indonesian text data with balanced and unbalanced sentiment composition. Based on the experimental simulation, Deep Learning Neural Network clearly outperforms the Naϊve Bayes and SVM and offers a better F-1 Score while for the best feature extraction technique which improves that modelling result is Bigram.

  2. The Effects of Student and Text Characteristics on the Oral Reading Fluency of Middle-Grade Students

    PubMed Central

    Barth, Amy E.; Tolar, Tammy D.; Fletcher, Jack M.; Francis, David

    2014-01-01

    We evaluated the effects of student characteristics (sight word reading efficiency, phonological decoding, verbal knowledge, level of reading ability, grade, gender) and text features (passage difficulty, length, genre, and language and discourse attributes) on the oral reading fluency of a sample of middle-school students in Grades 6–8 (N = 1,794). Students who were struggling (n = 704) and typically developing readers (n = 1,028) were randomly assigned to read five 1-min passages from each of 5 Lexile bands (within student range of 550 Lexiles). A series of multilevel analyses showed that student and text characteristics contributed uniquely to oral reading fluency rates. Student characteristics involving sight word reading efficiency and level of decoding ability accounted for more variability than reader type and verbal knowledge, with small, but statistically significant effects of grade and gender. The most significant text feature was passage difficulty level. Interactions involving student text characteristics, especially attributes involving overall ability level and difficulty of the text, were also apparent. These results support views of the development of oral reading fluency that involve interactions of student and text characteristics and highlight the importance of scaling for passage difficulty level in assessing individual differences in oral reading fluency. PMID:24567659

  3. Adding a Feature: Can a Pop-Up Chat Box Enhance Virtual Reference Services?

    PubMed

    Fan, Suhua Caroline; Fought, Rick L; Gahn, Paul C

    2017-01-01

    Online users seek help from virtual reference services via email, phone, texting, and live chat. Technologies have enabled new features in library websites to help make this service more accessible and effective. This article is an evaluation of an experimental pop-up live chat box on the website of a health sciences library to see whether the feature would enhance virtual reference services.

  4. Single-Word Recognition Need Not Depend on Single-Word Features: Narrative Coherence Counteracts Effects of Single-Word Features That Lexical Decision Emphasizes

    ERIC Educational Resources Information Center

    Teng, Dan W.; Wallot, Sebastian; Kelty-Stephen, Damian G.

    2016-01-01

    Research on reading comprehension of connected text emphasizes reliance on single-word features that organize a stable, mental lexicon of words and that speed or slow the recognition of each new word. However, the time needed to recognize a word might not actually be as fixed as previous research indicates, and the stability of the mental lexicon…

  5. Critical Linguistics: A Starting Point for Oppositional Reading.

    ERIC Educational Resources Information Center

    Janks, Hilary

    This document focuses on specific linguistic features that serve ideological functions in texts written in South Africa from 1985 to 1988. The features examined include: naming; metaphors; old words with new meanings; words becoming tainted; renaming or lexicalization; overlexicalization; strategies for resisting classification; tense and aspect;…

  6. Are History Textbooks More "Considerate" after 20 Years?

    ERIC Educational Resources Information Center

    Berkeley, Sheri; King-Sears, Margaret E.; Hott, Brittany L.; Bradley-Black, Katherine

    2014-01-01

    Features of eighth-grade history textbooks were examined through replication of a 20-year-old study that investigated "considerateness" of textbooks. Considerate texts provide clear, coherent information and include features that promote students' comprehension, such as explicit use of organizational structures, a range of question types…

  7. Nevada National Security Site Environmental Report 2016, Attachment A: Site Description

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wills, Cathy

    This attachment expands on the general description of the Nevada National Security Site (NNSS) presented in the Introduction to the Nevada National Security Site Environmental Report 2016 (prepared by National Security Technologies, LLC [NSTec], 2017). Included are subsections that summarize the site’s geological, hydrological, climatological, and ecological settings and the cultural resources of the NNSS. The subsections are meant to aid the reader in understanding the complex physical and biological environment of the NNSS. An adequate knowledge of the site’s environment is necessary to assess the environmental impacts of new projects, design and implement environmental monitoring activities for current sitemore » operations, and assess the impacts of site operations on the public residing in the vicinity of the NNSS. The NNSS environment contributes to several key features of the site that afford protection to the inhabitants of adjacent areas from potential exposure to radioactivity or other contaminants resulting from NNSS operations. These key features include the general remote location of the NNSS, restricted access, extended wind transport times, the great depths to slow-moving groundwater, little or no surface water, and low population density. This attachment complements the annual summary of monitoring program activities and dose assessments presented in the main body of this report.« less

  8. Systematic Analysis and Prediction of In Situ Cross Talk of O-GlcNAcylation and Phosphorylation

    PubMed Central

    Li, Ao; Wang, Minghui

    2015-01-01

    Reversible posttranslational modification (PTM) plays a very important role in biological process by changing properties of proteins. As many proteins are multiply modified by PTMs, cross talk of PTMs is becoming an intriguing topic and draws much attention. Currently, lots of evidences suggest that the PTMs work together to accomplish a specific biological function. However, both the general principles and underlying mechanism of PTM crosstalk are elusive. In this study, by using large-scale datasets we performed evolutionary conservation analysis, gene ontology enrichment, motif extraction of proteins with cross talk of O-GlcNAcylation and phosphorylation cooccurring on the same residue. We found that proteins with in situ O-GlcNAc/Phos cross talk were significantly enriched in some specific gene ontology terms and no obvious evolutionary pressure was observed. Moreover, 3 functional motifs associated with O-GlcNAc/Phos sites were extracted. We further used sequence features and GO features to predict O-GlcNAc/Phos cross talk sites based on phosphorylated sites and O-GlcNAcylated sites separately by the use of SVM model. The AUC of classifier based on phosphorylated sites is 0.896 and the other classifier based on GlcNAcylated sites is 0.843. Both classifiers achieved a relatively better performance compared with other existing methods. PMID:26601103

  9. Systematic Analysis and Prediction of In Situ Cross Talk of O-GlcNAcylation and Phosphorylation.

    PubMed

    Yao, Heming; Li, Ao; Wang, Minghui

    2015-01-01

    Reversible posttranslational modification (PTM) plays a very important role in biological process by changing properties of proteins. As many proteins are multiply modified by PTMs, cross talk of PTMs is becoming an intriguing topic and draws much attention. Currently, lots of evidences suggest that the PTMs work together to accomplish a specific biological function. However, both the general principles and underlying mechanism of PTM crosstalk are elusive. In this study, by using large-scale datasets we performed evolutionary conservation analysis, gene ontology enrichment, motif extraction of proteins with cross talk of O-GlcNAcylation and phosphorylation cooccurring on the same residue. We found that proteins with in situ O-GlcNAc/Phos cross talk were significantly enriched in some specific gene ontology terms and no obvious evolutionary pressure was observed. Moreover, 3 functional motifs associated with O-GlcNAc/Phos sites were extracted. We further used sequence features and GO features to predict O-GlcNAc/Phos cross talk sites based on phosphorylated sites and O-GlcNAcylated sites separately by the use of SVM model. The AUC of classifier based on phosphorylated sites is 0.896 and the other classifier based on GlcNAcylated sites is 0.843. Both classifiers achieved a relatively better performance compared with other existing methods.

  10. OH-PRED: prediction of protein hydroxylation sites by incorporating adapted normal distribution bi-profile Bayes feature extraction and physicochemical properties of amino acids.

    PubMed

    Jia, Cang-Zhi; He, Wen-Ying; Yao, Yu-Hua

    2017-03-01

    Hydroxylation of proline or lysine residues in proteins is a common post-translational modification event, and such modifications are found in many physiological and pathological processes. Nonetheless, the exact molecular mechanism of hydroxylation remains under investigation. Because experimental identification of hydroxylation is time-consuming and expensive, bioinformatics tools with high accuracy represent desirable alternatives for large-scale rapid identification of protein hydroxylation sites. In view of this, we developed a supporter vector machine-based tool, OH-PRED, for the prediction of protein hydroxylation sites using the adapted normal distribution bi-profile Bayes feature extraction in combination with the physicochemical property indexes of the amino acids. In a jackknife cross validation, OH-PRED yields an accuracy of 91.88% and a Matthew's correlation coefficient (MCC) of 0.838 for the prediction of hydroxyproline sites, and yields an accuracy of 97.42% and a MCC of 0.949 for the prediction of hydroxylysine sites. These results demonstrate that OH-PRED increased significantly the prediction accuracy of hydroxyproline and hydroxylysine sites by 7.37 and 14.09%, respectively, when compared with the latest predictor PredHydroxy. In independent tests, OH-PRED also outperforms previously published methods.

  11. Atmospheric fossil fuel CO2 traced by 14CO2 and air quality index pollutant observations in Beijing and Xiamen, China.

    PubMed

    Niu, Zhenchuan; Zhou, Weijian; Feng, Xue; Feng, Tian; Wu, Shugang; Cheng, Peng; Lu, Xuefeng; Du, Hua; Xiong, Xiaohu; Fu, Yunchong

    2018-06-01

    Radiocarbon ( 14 C) is the most accurate tracer available for quantifying atmospheric CO 2 derived from fossil fuel (CO 2ff ), but it is expensive and time-consuming to measure. Here, we used common hourly Air Quality Index (AQI) pollutants (AQI, PM 2.5 , PM 10 , and CO) to indirectly trace diurnal CO 2ff variations during certain days at the urban sites in Beijing and Xiamen, China, based on linear relationships between AQI pollutants and CO 2ff traced by 14 C ([Formula: see text]) for semimonthly samples obtained in 2014. We validated these indirectly traced CO 2ff (CO 2ff-in ) concentrations against [Formula: see text] concentrations traced by simultaneous diurnal 14 CO 2 observations. Significant (p < 0.05) strong correlations were observed between each of the separate AQI pollutants and [Formula: see text] for the semimonthly samples. Diurnal variations in CO 2ff traced by each of the AQI pollutants generally showed similar trends to those of [Formula: see text], with high agreement at the sampling site in Beijing and relatively poor agreement at the sampling site in Xiamen. AQI pollutant tracers showed high normalized root-mean-square (NRMS) errors for the summer diurnal samples due to low [Formula: see text] concentrations. After the removal of these summer samples, the NRMS errors for AQI pollutant tracers were in the range of 31.6-64.2%. CO generally showed a high agreement and low NRMS errors among these indirect tracers. Based on these linear relationships, monthly CO 2ff averages at the sampling sites in Beijing and Xiamen were traced using CO concentration as a tracer. The monthly CO 2ff averages at the Beijing site showed a shallow U-type variation. These results indicate that CO can be used to trace CO 2ff variations in Chinese cities with CO 2ff concentrations above 5 ppm.

  12. 40 CFR 63.681 - Definitions.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... features permanently integrated into the design of the unit. Emission point means an individual tank... system is not a drain and collection system that is designed and operated for the sole purpose of..., or transfer system used to manage off-site material. Off-site material service means any time when a...

  13. 40 CFR 63.681 - Definitions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... features permanently integrated into the design of the unit. Emission point means an individual tank... system is not a drain and collection system that is designed and operated for the sole purpose of..., or transfer system used to manage off-site material. Off-site material service means any time when a...

  14. Colleges Get Free Web Pages but with a Catch: Advertising.

    ERIC Educational Resources Information Center

    Blumenstyk, Goldie

    1999-01-01

    Striving to streamline student services and improve electronic communication, colleges and universities are signing on with companies that offer sophisticated World Wide Web sites through which students can accomplish basic administrative functions and receive information. The sites are often free of charge but also feature advertising messages…

  15. 21st Century Recruiting: Automated, Digital, Electronic.

    ERIC Educational Resources Information Center

    Patterson, Valerie

    1997-01-01

    Examines ways in which technology is changing staffing office practices. Discusses features of the worldwide web, some of the potential problems in establishing a web site, and the importance of carefully planning a web site. Looks at digital resume warehouses and the increased power such warehouses offers recruiters. (RJM)

  16. Canyon Country Ecosystems Research Site

    Science.gov Websites

    Soil Crust Home Crust 101 Advanced Gallery References CCERS Site Links updated: April 24, 2006 cyanobacteria, algae, lichens and mosses, that stabilize soil against wind and water erosion, enhance water 3400 m, and features a range of climates on both the same and different soil and bedrock substrates

  17. A Handbook of the Job-Site English Project 1985-86.

    ERIC Educational Resources Information Center

    Acevedo, Sheila; Dovel, Frankie

    The Orange County Public Schools' Job Site English Project was initiated to provide employees of businesses and industries with work-related English for speakers of other languages. The program features individualized curricula that are developed after the curriculum writer visits the business/industry in need of services, analyzes the…

  18. YouTube as a Participatory Culture

    ERIC Educational Resources Information Center

    Chau, Clement

    2010-01-01

    There is an explosion of youth subscriptions to original content-media-sharing Web sites such as YouTube. These Web sites combine media production and distribution with social networking features, making them an ideal place to create, connect, collaborate, and circulate. By encouraging youth to become media creators and social networkers, new…

  19. The Power in the Portal

    ERIC Educational Resources Information Center

    Chamberlain, Cathy

    2005-01-01

    Educational portals put together links to sites and resources educators would be interested in viewing. They eliminate the hours of searching that might be invested if typical search engines were used. Educational portals feature lessons, units, printable resources, creative ideas, and more. Many of these sites are free, while others are…

  20. 76 FR 78645 - Auction of FM Broadcast Construction Permits Scheduled for March 27, 2012; Notice and Filing...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-12-19

    ... touch upon impermissible subject matters because they may convey pricing information and bidding... FCC auction does not constitute an endorsement by the FCC of any particular service, technology, or... habitats, historical or archaeological sites, Indian religious sites, floodplains, and surface features. In...

Top