web usage mining: Topics by Science.gov

Sample records for web usage mining

A Framework for Web Usage Mining in Electronic Government

NASA Astrophysics Data System (ADS)

Zhou, Ping; Le, Zhongjian

Web usage mining has been a major component of management strategy to enhance organizational analysis and decision. The literature on Web usage mining that deals with strategies and technologies for effectively employing Web usage mining is quite vast. In recent years, E-government has received much attention from researchers and practitioners. Huge amounts of user access data are produced in Electronic government Web site everyday. The role of these data in the success of government management cannot be overstated because they affect government analysis, prediction, strategies, tactical, operational planning and control. Web usage miming in E-government has an important role to play in setting government objectives, discovering citizen behavior, and determining future courses of actions. Web usage mining in E-government has not received adequate attention from researchers or practitioners. We developed a framework to promote a better understanding of the importance of Web usage mining in E-government. Using the current literature, we developed the framework presented herein, in hopes that it would stimulate more interest in this important area.
Web usage data mining agent

NASA Astrophysics Data System (ADS)

Madiraju, Praveen; Zhang, Yanqing

2002-03-01

When a user logs in to a website, behind the scenes the user leaves his/her impressions, usage patterns and also access patterns in the web servers log file. A web usage mining agent can analyze these web logs to help web developers to improve the organization and presentation of their websites. They can help system administrators in improving the system performance. Web logs provide invaluable help in creating adaptive web sites and also in analyzing the network traffic analysis. This paper presents the design and implementation of a Web usage mining agent for digging in to the web log files.
Study on online community user motif using web usage mining

NASA Astrophysics Data System (ADS)

Alphy, Meera; Sharma, Ajay

2016-04-01

The Web usage mining is the application of data mining, which is used to extract useful information from the online community. The World Wide Web contains at least 4.73 billion pages according to Indexed Web and it contains at least 228.52 million pages according Dutch Indexed web on 6th august 2015, Thursday. It’s difficult to get needed data from these billions of web pages in World Wide Web. Here is the importance of web usage mining. Personalizing the search engine helps the web user to identify the most used data in an easy way. It reduces the time consumption; automatic site search and automatic restore the useful sites. This study represents the old techniques to latest techniques used in pattern discovery and analysis in web usage mining from 1996 to 2015. Analyzing user motif helps in the improvement of business, e-commerce, personalisation and improvement of websites.
Applying Web Usage Mining for Personalizing Hyperlinks in Web-Based Adaptive Educational Systems

ERIC Educational Resources Information Center

Romero, Cristobal; Ventura, Sebastian; Zafra, Amelia; de Bra, Paul

2009-01-01

Nowadays, the application of Web mining techniques in e-learning and Web-based adaptive educational systems is increasing exponentially. In this paper, we propose an advanced architecture for a personalization system to facilitate Web mining. A specific Web mining tool is developed and a recommender engine is integrated into the AHA! system in…
Web usage mining at an academic health sciences library: an exploratory study.

PubMed

Bracke, Paul J

2004-10-01

This paper explores the potential of multinomial logistic regression analysis to perform Web usage mining for an academic health sciences library Website. Usage of database-driven resource gateway pages was logged for a six-month period, including information about users' network addresses, referring uniform resource locators (URLs), and types of resource accessed. It was found that referring URL did vary significantly by two factors: whether a user was on-campus and what type of resource was accessed. Although the data available for analysis are limited by the nature of the Web and concerns for privacy, this method demonstrates the potential for gaining insight into Web usage that supplements Web log analysis. It can be used to improve the design of static and dynamic Websites today and could be used in the design of more advanced Web systems in the future.
A Dynamic Recommender System for Improved Web Usage Mining and CRM Using Swarm Intelligence.

PubMed

Alphy, Anna; Prabakaran, S

2015-01-01

In modern days, to enrich e-business, the websites are personalized for each user by understanding their interests and behavior. The main challenges of online usage data are information overload and their dynamic nature. In this paper, to address these issues, a WebBluegillRecom-annealing dynamic recommender system that uses web usage mining techniques in tandem with software agents developed for providing dynamic recommendations to users that can be used for customizing a website is proposed. The proposed WebBluegillRecom-annealing dynamic recommender uses swarm intelligence from the foraging behavior of a bluegill fish. It overcomes the information overload by handling dynamic behaviors of users. Our dynamic recommender system was compared against traditional collaborative filtering systems. The results show that the proposed system has higher precision, coverage, F1 measure, and scalability than the traditional collaborative filtering systems. Moreover, the recommendations given by our system overcome the overspecialization problem by including variety in recommendations.
A Dynamic Recommender System for Improved Web Usage Mining and CRM Using Swarm Intelligence

PubMed Central

Alphy, Anna; Prabakaran, S.

2015-01-01

In modern days, to enrich e-business, the websites are personalized for each user by understanding their interests and behavior. The main challenges of online usage data are information overload and their dynamic nature. In this paper, to address these issues, a WebBluegillRecom-annealing dynamic recommender system that uses web usage mining techniques in tandem with software agents developed for providing dynamic recommendations to users that can be used for customizing a website is proposed. The proposed WebBluegillRecom-annealing dynamic recommender uses swarm intelligence from the foraging behavior of a bluegill fish. It overcomes the information overload by handling dynamic behaviors of users. Our dynamic recommender system was compared against traditional collaborative filtering systems. The results show that the proposed system has higher precision, coverage, F1 measure, and scalability than the traditional collaborative filtering systems. Moreover, the recommendations given by our system overcome the overspecialization problem by including variety in recommendations. PMID:26229978
Using ant-behavior-based simulation model AntWeb to improve website organization

NASA Astrophysics Data System (ADS)

Li, Weigang; Pinheiro Dib, Marcos V.; Teles, Wesley M.; Morais de Andrade, Vlaudemir; Alves de Melo, Alba C. M.; Cariolano, Judas T.

2002-03-01

Some web usage mining algorithms showed the potential application to find the difference among the organizations expected by visitors to the website. However, there are still no efficient method and criterion for a web administrator to measure the performance of the modification. In this paper, we developed an AntWeb, a model inspired by ants' behavior to simulate the sequence of visiting the website, in order to measure the efficient of the web structure. We implemented a web usage mining algorithm using backtrack to the intranet website of the Politec Informatic Ltd., Brazil. We defined throughput (the number of visitors to reach their target pages per time unit relates to the total number of visitors) as an index to measure the website's performance. We also used the link in a web page to represent the effect of visitors' pheromone trails. For every modification in the website organization, for example, putting a link from the expected location to the target object, the simulation reported the value of throughput as a quick answer about this modification. The experiment showed the stability of our simulation model, and a positive modification to the intranet website of the Politec.
A Generic Framework for Extraction of Knowledge from Social Web Sources (Social Networking Websites) for an Online Recommendation System

ERIC Educational Resources Information Center

Sathick, Javubar; Venkat, Jaya

2015-01-01

Mining social web data is a challenging task and finding user interest for personalized and non-personalized recommendation systems is another important task. Knowledge sharing among web users has become crucial in determining usage of web data and personalizing content in various social websites as per the user's wish. This paper aims to design a…
Web Usage Mining: Application to an Online Educational Digital Library Service

ERIC Educational Resources Information Center

Palmer, Bart C.

2012-01-01

This dissertation was situated in the crossroads of educational data mining (EDM), educational digital libraries (such as the National Science Digital Library; http://nsdl.org), and examination of teacher behaviors while creating online learning resources in an end-user authoring system, the Instructional Architect (IA; http://ia.usu.edu). The…
Data Mining for Web Site Evaluation: An Exploration of Site Usage by Graduate Social Work Students

ERIC Educational Resources Information Center

Santhiveeran, Janaki

2006-01-01

This paper evaluates the actual use of a course Website by graduate social work students. The study utilized data mining techniques to discover meaningful trends by using the data from server logs. The course Website was accessed 24,730 times by all 49 graduate students during a semester. The students utilized the course Website 23 hours a day, 7…
Automatic Recommendations for E-Learning Personalization Based on Web Usage Mining Techniques and Information Retrieval

ERIC Educational Resources Information Center

Khribi, Mohamed Koutheair; Jemni, Mohamed; Nasraoui, Olfa

2009-01-01

In this paper, we describe an automatic personalization approach aiming to provide online automatic recommendations for active learners without requiring their explicit feedback. Recommended learning resources are computed based on the current learner's recent navigation history, as well as exploiting similarities and dissimilarities among…
Web Usage Mining Analysis of Federated Search Tools for Egyptian Scholars

ERIC Educational Resources Information Center

Mohamed, Khaled A.; Hassan, Ahmed

2008-01-01

Purpose: This paper aims to examine the behaviour of the Egyptian scholars while accessing electronic resources through two federated search tools. The main purpose of this article is to provide guidance for federated search tool technicians and support teams about user issues, including the need for training. Design/methodology/approach: Log…
Enhanced DIII-D Data Management Through a Relational Database

NASA Astrophysics Data System (ADS)

Burruss, J. R.; Peng, Q.; Schachter, J.; Schissel, D. P.; Terpstra, T. B.

2000-10-01

A relational database is being used to serve data about DIII-D experiments. The database is optimized for queries across multiple shots, allowing for rapid data mining by SQL-literate researchers. The relational database relates different experiments and datasets, thus providing a big picture of DIII-D operations. Users are encouraged to add their own tables to the database. Summary physics quantities about DIII-D discharges are collected and stored in the database automatically. Meta-data about code runs, MDSplus usage, and visualization tool usage are collected, stored in the database, and later analyzed to improve computing. Documentation on the database may be accessed through programming languages such as C, Java, and IDL, or through ODBC compliant applications such as Excel and Access. A database-driven web page also provides a convenient means for viewing database quantities through the World Wide Web. Demonstrations will be given at the poster.
Keynote Talk: Mining the Web 2.0 for Improved Image Search

NASA Astrophysics Data System (ADS)

Baeza-Yates, Ricardo

There are several semantic sources that can be found in the Web that are either explicit, e.g. Wikipedia, or implicit, e.g. derived from Web usage data. Most of them are related to user generated content (UGC) or what is called today the Web 2.0. In this talk we show how to use these sources of evidence in Flickr, such as tags, visual annotations or clicks, which represent the the wisdom of crowds behind UGC, to improve image search. These results are the work of the multimedia retrieval team at Yahoo! Research Barcelona and they are already being used in Yahoo! image search. This work is part of a larger effort to produce a virtuous data feedback circuit based on the right combination many different technologies to leverage the Web itself.
UkrVO astronomical WEB services

NASA Astrophysics Data System (ADS)

Mazhaev, A.

2017-02-01

Ukraine Virtual Observatory (UkrVO) has been a member of the International Virtual Observatory Alliance (IVOA) since 2011. The virtual observatory (VO) is not a magic solution to all problems of data storing and processing, but it provides certain standards for building infrastructure of astronomical data center. The astronomical databases help data mining and offer to users an easy access to observation metadata, images within celestial sphere and results of image processing. The astronomical web services (AWS) of UkrVO give to users handy tools for data selection from large astronomical catalogues for a relatively small region of interest in the sky. Examples of the AWS usage are showed.
A Node Linkage Approach for Sequential Pattern Mining

PubMed Central

Navarro, Osvaldo; Cumplido, René; Villaseñor-Pineda, Luis; Feregrino-Uribe, Claudia; Carrasco-Ochoa, Jesús Ariel

2014-01-01

Sequential Pattern Mining is a widely addressed problem in data mining, with applications such as analyzing Web usage, examining purchase behavior, and text mining, among others. Nevertheless, with the dramatic increase in data volume, the current approaches prove inefficient when dealing with large input datasets, a large number of different symbols and low minimum supports. In this paper, we propose a new sequential pattern mining algorithm, which follows a pattern-growth scheme to discover sequential patterns. Unlike most pattern growth algorithms, our approach does not build a data structure to represent the input dataset, but instead accesses the required sequences through pseudo-projection databases, achieving better runtime and reducing memory requirements. Our algorithm traverses the search space in a depth-first fashion and only preserves in memory a pattern node linkage and the pseudo-projections required for the branch being explored at the time. Experimental results show that our new approach, the Node Linkage Depth-First Traversal algorithm (NLDFT), has better performance and scalability in comparison with state of the art algorithms. PMID:24933123
Using Open Web APIs in Teaching Web Mining

ERIC Educational Resources Information Center

Chen, Hsinchun; Li, Xin; Chau, M.; Ho, Yi-Jen; Tseng, Chunju

2009-01-01

With the advent of the World Wide Web, many business applications that utilize data mining and text mining techniques to extract useful business information on the Web have evolved from Web searching to Web mining. It is important for students to acquire knowledge and hands-on experience in Web mining during their education in information systems…
Web mining in soft computing framework: relevance, state of the art and future directions.

PubMed

Pal, S K; Talwar, V; Mitra, P

2002-01-01

The paper summarizes the different characteristics of Web data, the basic components of Web mining and its different types, and the current state of the art. The reason for considering Web mining, a separate field from data mining, is explained. The limitations of some of the existing Web mining methods and tools are enunciated, and the significance of soft computing (comprising fuzzy logic (FL), artificial neural networks (ANNs), genetic algorithms (GAs), and rough sets (RSs) are highlighted. A survey of the existing literature on "soft Web mining" is provided along with the commercially available systems. The prospective areas of Web mining where the application of soft computing needs immediate attention are outlined with justification. Scope for future research in developing "soft Web mining" systems is explained. An extensive bibliography is also provided.
A Two-Tiered Model for Analyzing Library Web Site Usage Statistics, Part 1: Web Server Logs.

ERIC Educational Resources Information Center

Cohen, Laura B.

2003-01-01

Proposes a two-tiered model for analyzing web site usage statistics for academic libraries: one tier for library administrators that analyzes measures indicating library use, and a second tier for web site managers that analyzes measures aiding in server maintenance and site design. Discusses the technology of web site usage statistics, and…

Abnormal Web Usage Control by Proxy Strategies.

ERIC Educational Resources Information Center

Yu, Hsiang-Fu; Tseng, Li-Ming

2002-01-01

Approaches to designing a proxy server with Web usage control and to making the proxy server effective on local area networks are proposed to prevent abnormal Web access and to prioritize Web usage. A system is implemented to demonstrate the approaches. The implementation reveals that the proposed approaches are effective, such that the abnormal…
Web Mining for Web Image Retrieval.

ERIC Educational Resources Information Center

Chen, Zheng; Wenyin, Liu; Zhang, Feng; Li, Mingjing; Zhang, Hongjiang

2001-01-01

Presents a prototype system for image retrieval from the Internet using Web mining. Discusses the architecture of the Web image retrieval prototype; document space modeling; user log mining; and image retrieval experiments to evaluate the proposed system. (AEF)
Use of Internet audience measurement data to gauge market share for online health information services.

PubMed

Wood, Fred B; Benson, Dennis; LaCroix, Eve-Marie; Siegel, Elliot R; Fariss, Susan

2005-07-01

The transition to a largely Internet and Web-based environment for dissemination of health information has changed the health information landscape and the framework for evaluation of such activities. A multidimensional evaluative approach is needed. This paper discusses one important dimension of Web evaluation-usage data. In particular, we discuss the collection and analysis of external data on website usage in order to develop a better understanding of the health information (and related US government information) market space, and to estimate the market share or relative levels of usage for National Library of Medicine (NLM) and National Institutes of Health (NIH) websites compared to other health information providers. The primary method presented is Internet audience measurement based on Web usage by external panels of users and assembled by private vendors-in this case, comScore. A secondary method discussed is Web usage based on Web log software data. The principle metrics for both methods are unique visitors and total pages downloaded per month. NLM websites (primarily MedlinePlus and PubMed) account for 55% to 80% of total NIH website usage depending on the metric used. In turn, NIH.gov top-level domain usage (inclusive of NLM) ranks second only behind WebMD in the US domestic home health information market and ranks first on a global basis. NIH.gov consistently ranks among the top three or four US government top-level domains based on global Web usage. On a site-specific basis, the top health information websites in terms of global usage appear to be WebMD, MSN Health, PubMed, Yahoo! Health, AOL Health, and MedlinePlus. Based on MedlinePlus Web log data and external Internet audience measurement data, the three most heavily used cancer-centric websites appear to be www.cancer.gov (National Cancer Institute), www.cancer.org (American Cancer Society), and www.breastcancer.org (non-profit organization). Internet audience measurement has proven useful to NLM, with significant advantages compared to sole reliance on usage data from Web log software. Internet audience data has helped NLM better understand the relative usage of NLM and NIH websites in the intersection of the health information and US government information market sectors, which is the primary market intersector for NLM and NIH. However important, Web usage is only one dimension of a complete Web evaluation framework, and other primary research methods, such as online user surveys, usability tests, and focus groups, are also important for comprehensive evaluation that includes qualitative elements, such as user satisfaction and user friendliness, as well as quantitative indicators of website usage.
Use of Internet Audience Measurement Data to Gauge Market Share for Online Health Information Services

PubMed Central

Benson, Dennis; LaCroix, Eve-Marie; Siegel, Elliot R; Fariss, Susan

2005-01-01

Background The transition to a largely Internet and Web-based environment for dissemination of health information has changed the health information landscape and the framework for evaluation of such activities. A multidimensional evaluative approach is needed. Objective This paper discusses one important dimension of Web evaluation—usage data. In particular, we discuss the collection and analysis of external data on website usage in order to develop a better understanding of the health information (and related US government information) market space, and to estimate the market share or relative levels of usage for National Library of Medicine (NLM) and National Institutes of Health (NIH) websites compared to other health information providers. Methods The primary method presented is Internet audience measurement based on Web usage by external panels of users and assembled by private vendors—in this case, comScore. A secondary method discussed is Web usage based on Web log software data. The principle metrics for both methods are unique visitors and total pages downloaded per month. Results NLM websites (primarily MedlinePlus and PubMed) account for 55% to 80% of total NIH website usage depending on the metric used. In turn, NIH.gov top-level domain usage (inclusive of NLM) ranks second only behind WebMD in the US domestic home health information market and ranks first on a global basis. NIH.gov consistently ranks among the top three or four US government top-level domains based on global Web usage. On a site-specific basis, the top health information websites in terms of global usage appear to be WebMD, MSN Health, PubMed, Yahoo! Health, AOL Health, and MedlinePlus. Based on MedlinePlus Web log data and external Internet audience measurement data, the three most heavily used cancer-centric websites appear to be www.cancer.gov (National Cancer Institute), www.cancer.org (American Cancer Society), and www.breastcancer.org (non-profit organization). Conclusions Internet audience measurement has proven useful to NLM, with significant advantages compared to sole reliance on usage data from Web log software. Internet audience data has helped NLM better understand the relative usage of NLM and NIH websites in the intersection of the health information and US government information market sectors, which is the primary market intersector for NLM and NIH. However important, Web usage is only one dimension of a complete Web evaluation framework, and other primary research methods, such as online user surveys, usability tests, and focus groups, are also important for comprehensive evaluation that includes qualitative elements, such as user satisfaction and user friendliness, as well as quantitative indicators of website usage. PMID:15998622
The design and implementation of web mining in web sites security

NASA Astrophysics Data System (ADS)

Li, Jian; Zhang, Guo-Yin; Gu, Guo-Chang; Li, Jian-Li

2003-06-01

The backdoor or information leak of Web servers can be detected by using Web Mining techniques on some abnormal Web log and Web application log data. The security of Web servers can be enhanced and the damage of illegal access can be avoided. Firstly, the system for discovering the patterns of information leakages in CGI scripts from Web log data was proposed. Secondly, those patterns for system administrators to modify their codes and enhance their Web site security were provided. The following aspects were described: one is to combine web application log with web log to extract more information, so web data mining could be used to mine web log for discovering the information that firewall and Information Detection System cannot find. Another approach is to propose an operation module of web site to enhance Web site security. In cluster server session, Density-Based Clustering technique is used to reduce resource cost and obtain better efficiency.
Introduction to the JASIST Special Topic Issue on Web Retrieval and Mining: A Machine Learning Perspective.

ERIC Educational Resources Information Center

Chen, Hsinchun

2003-01-01

Discusses information retrieval techniques used on the World Wide Web. Topics include machine learning in information extraction; relevance feedback; information filtering and recommendation; text classification and text clustering; Web mining, based on data mining techniques; hyperlink structure; and Web size. (LRW)
Visual Based Retrieval Systems and Web Mining--Introduction.

ERIC Educational Resources Information Center

Iyengar, S. S.

2001-01-01

Briefly discusses Web mining and image retrieval techniques, and then presents a summary of articles in this special issue. Articles focus on Web content mining, artificial neural networks as tools for image retrieval, content-based image retrieval systems, and personalizing the Web browsing experience using media agents. (AEF)
Technologies for Decreasing Mining Losses

NASA Astrophysics Data System (ADS)

Valgma, Ingo; Väizene, Vivika; Kolats, Margit; Saarnak, Martin

2013-12-01

In case of stratified deposits like oil shale deposit in Estonia, mining losses depend on mining technologies. Current research focuses on extraction and separation possibilities of mineral resources. Selective mining, selective crushing and separation tests have been performed, showing possibilities of decreasing mining losses. Rock crushing and screening process simulations were used for optimizing rock fractions. In addition mine backfilling, fine separation, and optimized drilling and blasting have been analyzed. All tested methods show potential and depend on mineral usage. Usage in addition depends on the utilization technology. The questions like stability of the material flow and influences of the quality fluctuations to the final yield are raised.
Aspect level sentiment analysis using machine learning

NASA Astrophysics Data System (ADS)

Shubham, D.; Mithil, P.; Shobharani, Meesala; Sumathy, S.

2017-11-01

In modern world the development of web and smartphones increases the usage of online shopping. The overall feedback about product is generated with the help of sentiment analysis using text processing.Opinion mining or sentiment analysis is used to collect and categorized the reviews of product. The proposed system uses aspect leveldetection in which features are extracted from the datasets. The system performs pre-processing operation such as tokenization, part of speech and limitization on the data tofinds meaningful information which is used to detect the polarity level and assigns rating to product. The proposed model focuses on aspects to produces accurate result by avoiding the spam reviews.
Patterns of usage for a Web-based clinical information system.

PubMed

Chen, Elizabeth S; Cimino, James J

2004-01-01

Understanding how clinicians are using clinical information systems to assist with their everyday tasks is valuable to the system design and development process. Developers of such systems are interested in monitoring usage in order to make enhancements. System log files are rich resources for gaining knowledge about how the system is being used. We have analyzed the log files of our Web-based clinical information system (WebCIS) to obtain various usage statistics including which WebCIS features are frequently being used. We have also identified usage patterns, which convey how the user is traversing the system. We present our method and these results as well as describe how the results can be used to customize menus, shortcut lists, and patient reports in WebCIS and similar systems.
Antecedents of Continued Usage Intentions of Web-Based Learning Management System in Tanzania

ERIC Educational Resources Information Center

Lwoga, Edda Tandi; Komba, Mercy

2015-01-01

Purpose: The purpose of this paper is to examine factors that predict students' continued usage intention of web-based learning management systems (LMS) in Tanzania, with a specific focus on the School of Business of Mzumbe University. Specifically, the study investigated major predictors of actual usage and continued usage intentions of…
Effect of Temporal Relationships in Associative Rule Mining for Web Log Data

PubMed Central

Mohd Khairudin, Nazli; Mustapha, Aida

2014-01-01

The advent of web-based applications and services has created such diverse and voluminous web log data stored in web servers, proxy servers, client machines, or organizational databases. This paper attempts to investigate the effect of temporal attribute in relational rule mining for web log data. We incorporated the characteristics of time in the rule mining process and analysed the effect of various temporal parameters. The rules generated from temporal relational rule mining are then compared against the rules generated from the classical rule mining approach such as the Apriori and FP-Growth algorithms. The results showed that by incorporating the temporal attribute via time, the number of rules generated is subsequently smaller but is comparable in terms of quality. PMID:24587757
Working with Data: Discovering Knowledge through Mining and Analysis; Systematic Knowledge Management and Knowledge Discovery; Text Mining; Methodological Approach in Discovering User Search Patterns through Web Log Analysis; Knowledge Discovery in Databases Using Formal Concept Analysis; Knowledge Discovery with a Little Perspective.

ERIC Educational Resources Information Center

Qin, Jian; Jurisica, Igor; Liddy, Elizabeth D.; Jansen, Bernard J; Spink, Amanda; Priss, Uta; Norton, Melanie J.

2000-01-01

These six articles discuss knowledge discovery in databases (KDD). Topics include data mining; knowledge management systems; applications of knowledge discovery; text and Web mining; text mining and information retrieval; user search patterns through Web log analysis; concept analysis; data collection; and data structure inconsistency. (LRW)
Impact of tailored blogs and content on usage of Web CIPHER - an online platform to help policymakers better engage with evidence from research.

PubMed

Makkar, Steve R; Howe, Megan; Williamson, Anna; Gilham, Frances

2016-12-01

There is a need to develop innovations that can help bridge the gap between research and policy. Web CIPHER is an online tool designed to help policymakers better engage with research in order to increase its use in health policymaking. The aim of the present study was to test interventions in order to increase policymakers' usage of Web CIPHER. Namely, the impact of posting articles and blogs on topics relevant to the missions and scope of selected policy agencies in the Web CIPHER community. Five policy agencies were targeted for the intervention. Web CIPHER usage data was gathered over a 30-month period using Google Analytics. Time series analysis was used to evaluate whether publication of tailored articles and blogs led to significant changes in usage for all Web CIPHER members from policy agencies, including those from the five target agencies. We further evaluated whether these users showed greater increases in usage following publication of articles and blogs directly targeted at their agency, and if these effects were moderated by the blog author. Web CIPHER usage gradually increased over time and was significantly predicted by the number of articles but not blogs that were posted throughout the study period. Publication of articles on sexual and reproductive health was followed by sustained increases in usage among all users, including users from the policy agency that targets this area. This effect of topic relevance did not occur for the four remaining target agencies. Finally, page views were higher for articles targeted at one's agency compared to other agencies. This effect also occurred for blogs, particularly when the author was internal to one's agency. The findings suggest that Web CIPHER usage in general was motivated by general interest, engagement and appeal, as opposed to the agency specificity of content and work relevance. Blogs in and of themselves may not be effective at promoting usage. Thus, in order to increase policymakers' engagement with research through similar online platforms, a potentially effective approach would be to post abundant, frequently updated, engaging, interesting and widely appealing content irrespective of form.
The Role of Virtual Reference in Library Web Site Design: A Qualitative Source for Usage Data

ERIC Educational Resources Information Center

Powers, Amanda Clay; Shedd, Julie; Hill, Clay

2011-01-01

Gathering qualitative information about usage behavior of library Web sites is a time-consuming process requiring the active participation of patron communities. Libraries that collect virtual reference transcripts, however, hold valuable data regarding how the library Web site is used that could benefit Web designers. An analysis of virtual…
Evaluation of the Kloswall longwall mining system

NASA Astrophysics Data System (ADS)

Guay, P. J.

1982-04-01

A new longwal mining system specifically designed to extract a very deep web (48 inches or deeper) from a longwall panel was studied. Productivity and cost analysis comparing the new mining system with a conventional longwall operation taking a 30 inch wide web is presented. It is shown that the new system will increase annual production and return on investment in most cases. Conceptual drawings and specifications for a high capacity three drum shearer and a unique shield type of roof support specifically designed for very wide web operation are reported. The advantages and problems associated with wide web mining in general and as they relate specifically to the equipment selected for the new mining system are discussed.
Web Mining: Machine Learning for Web Applications.

ERIC Educational Resources Information Center

Chen, Hsinchun; Chau, Michael

2004-01-01

Presents an overview of machine learning research and reviews methods used for evaluating machine learning systems. Ways that machine-learning algorithms were used in traditional information retrieval systems in the "pre-Web" era are described, and the field of Web mining and how machine learning has been used in different Web mining…
Psychosocial service needs of pediatric transport accident survivors: Using clinical data-mining to establish demographic and service usage characteristics.

PubMed

Manguy, Alys-Marie; Joubert, Lynette; Bansemer, Leah

2016-09-01

The objectives in this article are the exploration of demographic and service usage data gained through clinical data mining audit and suggesting recommendations for social work service delivery model and future research. The method is clinical data-mining audit of 100 sequentially sampled cases gathering quantitative demographic and service usage data. Descriptive analysis of file audit data raised interesting trends with potential to inform service delivery and usage; the key areas of the results included patient demographics, family involvement and impact, and child safety and risk issues. Transport accidents involving children often include other family members. Care planning must take into account psychosocial issues including patient and family emotional responses, availability of primary carers, and other practical needs that may impact on recovery and discharge planning. This study provides evidence to plan for further research and development of more integrated models of care.
Beyond Description: Converting Web Site Usage Statistics into Concrete Site Improvement Ideas

ERIC Educational Resources Information Center

Arendt, Julie; Wagner, Cassie

2010-01-01

Web site usage statistics are a widely used tool for Web site development, but libraries are still learning how to use them successfully. This case study summarizes how Morris Library at Southern Illinois University Carbondale implemented Google Analytics on its Web site and used the reports to inform a site redesign. As the main campus library at…
Vascular Access Tracking System: a Web-Based Clinical Tracking Tool for Identifying Catheter Related Blood Stream Infections in Interventional Radiology Placed Central Venous Catheters.

PubMed

Morrison, James; Kaufman, John

2016-12-01

Vascular access is invaluable in the treatment of hospitalized patients. Central venous catheters provide a durable and long-term solution while saving patients from repeated needle sticks for peripheral IVs and blood draws. The initial catheter placement procedure and long-term catheter usage place patients at risk for infection. The goal of this project was to develop a system to track and evaluate central line-associated blood stream infections related to interventional radiology placement of central venous catheters. A customized web-based clinical database was developed via open-source tools to provide a dashboard for data mining and analysis of the catheter placement and infection information. Preliminary results were gathered over a 4-month period confirming the utility of the system. The tools and methodology employed to develop the vascular access tracking system could be easily tailored to other clinical scenarios to assist in quality control and improvement programs.

Enabling a Community of Practice: Results of the LSCHE Web Portal Survey

ERIC Educational Resources Information Center

Hoff, Meagan A.; Hodges, Russ; Lin, Yuting; McConnell, Michael C.

2017-01-01

The study explored usage patterns of the Learning Support Centers in Higher Education (LSCHE) web portal, an open educational resource (OER) that serves learning support center professionals. Results of an online survey taken by LSCHE users (N = 41) tracked their self-reported usage and perceived value of resources on the web portal, which…
Web-video-mining-supported workflow modeling for laparoscopic surgeries.

PubMed

Liu, Rui; Zhang, Xiaoli; Zhang, Hao

2016-11-01

As quality assurance is of strong concern in advanced surgeries, intelligent surgical systems are expected to have knowledge such as the knowledge of the surgical workflow model (SWM) to support their intuitive cooperation with surgeons. For generating a robust and reliable SWM, a large amount of training data is required. However, training data collected by physically recording surgery operations is often limited and data collection is time-consuming and labor-intensive, severely influencing knowledge scalability of the surgical systems. The objective of this research is to solve the knowledge scalability problem in surgical workflow modeling with a low cost and labor efficient way. A novel web-video-mining-supported surgical workflow modeling (webSWM) method is developed. A novel video quality analysis method based on topic analysis and sentiment analysis techniques is developed to select high-quality videos from abundant and noisy web videos. A statistical learning method is then used to build the workflow model based on the selected videos. To test the effectiveness of the webSWM method, 250 web videos were mined to generate a surgical workflow for the robotic cholecystectomy surgery. The generated workflow was evaluated by 4 web-retrieved videos and 4 operation-room-recorded videos, respectively. The evaluation results (video selection consistency n-index ≥0.60; surgical workflow matching degree ≥0.84) proved the effectiveness of the webSWM method in generating robust and reliable SWM knowledge by mining web videos. With the webSWM method, abundant web videos were selected and a reliable SWM was modeled in a short time with low labor cost. Satisfied performances in mining web videos and learning surgery-related knowledge show that the webSWM method is promising in scaling knowledge for intelligent surgical systems. Copyright © 2016 Elsevier B.V. All rights reserved.
OntoGene web services for biomedical text mining.

PubMed

Rinaldi, Fabio; Clematide, Simon; Marques, Hernani; Ellendorff, Tilia; Romacker, Martin; Rodriguez-Esteban, Raul

2014-01-01

Text mining services are rapidly becoming a crucial component of various knowledge management pipelines, for example in the process of database curation, or for exploration and enrichment of biomedical data within the pharmaceutical industry. Traditional architectures, based on monolithic applications, do not offer sufficient flexibility for a wide range of use case scenarios, and therefore open architectures, as provided by web services, are attracting increased interest. We present an approach towards providing advanced text mining capabilities through web services, using a recently proposed standard for textual data interchange (BioC). The web services leverage a state-of-the-art platform for text mining (OntoGene) which has been tested in several community-organized evaluation challenges,with top ranked results in several of them.
OntoGene web services for biomedical text mining

PubMed Central

2014-01-01

Text mining services are rapidly becoming a crucial component of various knowledge management pipelines, for example in the process of database curation, or for exploration and enrichment of biomedical data within the pharmaceutical industry. Traditional architectures, based on monolithic applications, do not offer sufficient flexibility for a wide range of use case scenarios, and therefore open architectures, as provided by web services, are attracting increased interest. We present an approach towards providing advanced text mining capabilities through web services, using a recently proposed standard for textual data interchange (BioC). The web services leverage a state-of-the-art platform for text mining (OntoGene) which has been tested in several community-organized evaluation challenges, with top ranked results in several of them. PMID:25472638
What Is Different about E-Books? A MINES for Libraries® Analysis of Academic and Health Sciences Research Libraries' E-Book Usage

ERIC Educational Resources Information Center

Plum, Terry; Franklin, Brinley

2015-01-01

Building on the theoretical proposals of Kevin Guthrie and others concerning the transition from print books to e-books in academic and health sciences libraries, this paper presents data collected using the MINES for Libraries® e-resource survey methodology. Approximately 6,000 e-book uses were analyzed from a sample of e-resource usage at…
Automatic generation of Web mining environments

NASA Astrophysics Data System (ADS)

Cibelli, Maurizio; Costagliola, Gennaro

1999-02-01

The main problem related to the retrieval of information from the world wide web is the enormous number of unstructured documents and resources, i.e., the difficulty of locating and tracking appropriate sources. This paper presents a web mining environment (WME), which is capable of finding, extracting and structuring information related to a particular domain from web documents, using general purpose indices. The WME architecture includes a web engine filter (WEF), to sort and reduce the answer set returned by a web engine, a data source pre-processor (DSP), which processes html layout cues in order to collect and qualify page segments, and a heuristic-based information extraction system (HIES), to finally retrieve the required data. Furthermore, we present a web mining environment generator, WMEG, that allows naive users to generate a WME specific to a given domain by providing a set of specifications.
Dynamically Allocated Virtual Clustering Management System Users Guide

DTIC Science & Technology

2016-11-01

provides usage instructions for the DAVC version 2.0 web application. 15. SUBJECT TERMS DAVC, Dynamically Allocated Virtual Clustering...This report provides usage instructions for the DAVC version 2.0 web application. This report is separated into the following sections, which detail
Mining a Web Citation Database for Author Co-Citation Analysis.

ERIC Educational Resources Information Center

He, Yulan; Hui, Siu Cheung

2002-01-01

Proposes a mining process to automate author co-citation analysis based on the Web Citation Database, a data warehouse for storing citation indices of Web publications. Describes the use of agglomerative hierarchical clustering for author clustering and multidimensional scaling for displaying author cluster maps, and explains PubSearch, a…
Analysis of pathology department Web sites and practical recommendations.

PubMed

Nero, Christopher; Dighe, Anand S

2008-09-01

There are numerous customers for pathology departmental Web sites, including pathology department staff, clinical staff, residency applicants, job seekers, and other individuals outside the department seeking department information. Despite the increasing importance of departmental Web sites as a means of distributing information, no analysis has been done to date of the content and usage of pathology department Web sites. In this study, we analyzed pathology department Web sites to examine the elements present on each site and to evaluate the use of search technology on these sites. Further, we examined the usage patterns of our own departmental Internet and internet Web sites to better understand the users of pathology Web sites. We reviewed selected departmental pathology Web sites and analyzed their content and functionality. Our institution's departmental pathology Web sites were modified to enable detailed information to be stored regarding users and usage patterns, and that information was analyzed. We demonstrate considerable heterogeneity in departmental Web sites with many sites lacking basic content and search features. In addition, we demonstrate that increasing the traffic of a department's informational Web sites may result in reduced phone inquiries to the laboratory. We propose recommendations for pathology department Web sites to maximize promotion of a department's mission. A departmental pathology Web site is an essential communication tool for all pathology departments, and attention to the users and content of the site can have operational impact.
Data Mining for Web-Based Support Systems: A Case Study in e-Custom Systems

NASA Astrophysics Data System (ADS)

Razmerita, Liana; Kirchner, Kathrin

This chapter provides an example of a Web-based support system (WSS) used to streamline trade procedures, prevent potential security threats, and reduce tax-related fraud in cross-border trade. The architecture is based on a service-oriented architecture that includes smart seals and Web services. We discuss the implications and suggest further enhancements to demonstrate how such systems can move toward a Web-based decision support system with the support of data mining methods. We provide a concrete example of how data mining can help to analyze the vast amount of data collected while monitoring the container movements along its supply chain.
Semantic web for integrated network analysis in biomedicine.

PubMed

Chen, Huajun; Ding, Li; Wu, Zhaohui; Yu, Tong; Dhanapalan, Lavanya; Chen, Jake Y

2009-03-01

The Semantic Web technology enables integration of heterogeneous data on the World Wide Web by making the semantics of data explicit through formal ontologies. In this article, we survey the feasibility and state of the art of utilizing the Semantic Web technology to represent, integrate and analyze the knowledge in various biomedical networks. We introduce a new conceptual framework, semantic graph mining, to enable researchers to integrate graph mining with ontology reasoning in network data analysis. Through four case studies, we demonstrate how semantic graph mining can be applied to the analysis of disease-causal genes, Gene Ontology category cross-talks, drug efficacy analysis and herb-drug interactions analysis.
An Evaluative Methodology for Virtual Communities Using Web Analytics

ERIC Educational Resources Information Center

Phippen, A. D.

2004-01-01

The evaluation of virtual community usage and user behaviour has its roots in social science approaches such as interview, document analysis and survey. Little evaluation is carried out using traffic or protocol analysis. Business approaches to evaluating customer/business web site usage are more advanced, in particular using advanced web…
Web Usage, Advertising, and Shopping: Relationship Patterns.

ERIC Educational Resources Information Center

Korgaonkar, Pradeep; Wolin, Lori D.

2002-01-01

Discusses Web sales and explores the differences between heavy, medium, and light Web users in terms of their beliefs about Web advertising, attitudes toward Web advertising, purchasing patterns, and demographics. Suggests marketers need to target Web advertising to particular Web users. (Author/LRW)
Web data mining

NASA Astrophysics Data System (ADS)

Wibonele, Kasanda J.; Zhang, Yanqing

2002-03-01

A web data mining system using granular computing and ASP programming is proposed. This is a web based application, which allows web users to submit survey data for many different companies. This survey is a collection of questions that will help these companies develop and improve their business and customer service with their clients by analyzing survey data. This web application allows users to submit data anywhere. All the survey data is collected into a database for further analysis. An administrator of this web application can login to the system and view all the data submitted. This web application resides on a web server, and the database resides on the MS SQL server.
Usage of a generic web-based self-management intervention for breast cancer survivors: substudy analysis of the BREATH trial.

PubMed

van den Berg, Sanne W; Peters, Esmee J; Kraaijeveld, J Frank; Gielissen, Marieke F M; Prins, Judith B

2013-08-19

Generic fully automated Web-based self-management interventions are upcoming, for example, for the growing number of breast cancer survivors. It is hypothesized that the use of these interventions is more individualized and that users apply a large amount of self-tailoring. However, technical usage evaluations of these types of interventions are scarce and practical guidelines are lacking. To gain insight into meaningful usage parameters to evaluate the use of generic fully automated Web-based interventions by assessing how breast cancer survivors use a generic self-management website. Final aim is to propose practical recommendations for researchers and information and communication technology (ICT) professionals who aim to design and evaluate the use of similar Web-based interventions. The BREAst cancer ehealTH (BREATH) intervention is a generic unguided fully automated website with stepwise weekly access and a fixed 4-month structure containing 104 intervention ingredients (ie, texts, tasks, tests, videos). By monitoring https-server requests, technical usage statistics were recorded for the intervention group of the randomized controlled trial. Observed usage was analyzed by measures of frequency, duration, and activity. Intervention adherence was defined as continuous usage, or the proportion of participants who started using the intervention and continued to log in during all four phases. By comparing observed to minimal intended usage (frequency and activity), different user groups were defined. Usage statistics for 4 months were collected from 70 breast cancer survivors (mean age 50.9 years). Frequency of logins/person ranged from 0 to 45, total duration/person from 0 to 2324 minutes (38.7 hours), and activity from opening none to all intervention ingredients. 31 participants continued logging in to all four phases resulting in an intervention adherence rate of 44.3% (95% CI 33.2-55.9). Nine nonusers (13%), 30 low users (43%), and 31 high users (44%) were defined. Low and high users differed significantly on frequency (P<.001), total duration (P<.001), session duration (P=.009), and activity (P<.001). High users logged in an average of 21 times, had a mean session duration of 33 minutes, and opened on average 91% of all ingredients. Signing the self-help contract (P<.001), reporting usefulness of ingredients (P=.003), overall satisfaction (P=.028), and user friendliness evaluation (P=.003) were higher in high users. User groups did not differ on age, education, and baseline distress. By reporting the usage of a self-management website for breast cancer survivors, the present study gained first insight into the design of usage evaluations of generic fully automated Web-based interventions. It is recommended to (1) incorporate usage statistics that reflect the amount of self-tailoring applied by users, (2) combine technical usage statistics with self-reported usefulness, and (3) use qualitative measures. Also, (4) a pilot usage evaluation should be a fixed step in the development process of novel Web-based interventions, and (5) it is essential for researchers to gain insight into the rationale of recorded and nonrecorded usage statistics. Netherlands Trial Register (NTR): 2935; http://www.trialregister.nl/trialreg/admin/rctview.asp?TC=2935 (Archived by WebCite at http://www.webcitation.org/6IkX1ADEV).
Automated Data Tagging in the HLA

NASA Astrophysics Data System (ADS)

Gaffney, N. I.; Miller, W. W.

2008-08-01

One of the more powerful and popular forms of data organization implemented in most popular information sharing web applications is data tagging. With a rich user base from which to gather and digest tags, many interesting and often unanticipated yet very useful associations are revealed. With regard to an existing information, the astronomical community has a rich pool of existing digitally stored and searchable data than any of the currently popular web community, such as You Tube or My Space, had when they started. In initial experiments with the search engine for the Hubble Legacy Archive, we have created a simple yet powerful scheme by which the information from a footprint service, the NED and SIMBAD catalog services, and the ADS abstracts and keywords can be used to initially tag data with standard keywords. By then ingesting this into a public ally available information search engine, such as Apache Lucene, one can create a simple and powerful data tag search engine and association system. By then augmenting this with user provided keys and usage pattern analysis, one can produce a powerful modern data mining system for any astronomical data warehouse.
The Effectiveness of Web-Based Learning Environment: A Case Study of Public Universities in Kenya

ERIC Educational Resources Information Center

Kirui, Paul A.; Mutai, Sheila J.

2010-01-01

Web mining is emerging in many aspects of e-learning, aiming at improving online learning and teaching processes and making them more transparent and effective. Researchers using Web mining tools and techniques are challenged to learn more about the online students' reshaping online courses and educational websites, and create tools for…
Earth Science Mining Web Services

NASA Astrophysics Data System (ADS)

Pham, L. B.; Lynnes, C. S.; Hegde, M.; Graves, S.; Ramachandran, R.; Maskey, M.; Keiser, K.

2008-12-01

To allow scientists further capabilities in the area of data mining and web services, the Goddard Earth Sciences Data and Information Services Center (GES DISC) and researchers at the University of Alabama in Huntsville (UAH) have developed a system to mine data at the source without the need of network transfers. The system has been constructed by linking together several pre-existing technologies: the Simple Scalable Script-based Science Processor for Measurements (S4PM), a processing engine at the GES DISC; the Algorithm Development and Mining (ADaM) system, a data mining toolkit from UAH that can be configured in a variety of ways to create customized mining processes; ActiveBPEL, a workflow execution engine based on BPEL (Business Process Execution Language); XBaya, a graphical workflow composer; and the EOS Clearinghouse (ECHO). XBaya is used to construct an analysis workflow at UAH using ADaM components, which are also installed remotely at the GES DISC, wrapped as Web Services. The S4PM processing engine searches ECHO for data using space-time criteria, staging them to cache, allowing the ActiveBPEL engine to remotely orchestrates the processing workflow within S4PM. As mining is completed, the output is placed in an FTP holding area for the end user. The goals are to give users control over the data they want to process, while mining data at the data source using the server's resources rather than transferring the full volume over the internet. These diverse technologies have been infused into a functioning, distributed system with only minor changes to the underlying technologies. The key to this infusion is the loosely coupled, Web- Services based architecture: All of the participating components are accessible (one way or another) through (Simple Object Access Protocol) SOAP-based Web Services.
Earth Science Mining Web Services

NASA Technical Reports Server (NTRS)

Pham, Long; Lynnes, Christopher; Hegde, Mahabaleshwa; Graves, Sara; Ramachandran, Rahul; Maskey, Manil; Keiser, Ken

2008-01-01

To allow scientists further capabilities in the area of data mining and web services, the Goddard Earth Sciences Data and Information Services Center (GES DISC) and researchers at the University of Alabama in Huntsville (UAH) have developed a system to mine data at the source without the need of network transfers. The system has been constructed by linking together several pre-existing technologies: the Simple Scalable Script-based Science Processor for Measurements (S4PM), a processing engine at he GES DISC; the Algorithm Development and Mining (ADaM) system, a data mining toolkit from UAH that can be configured in a variety of ways to create customized mining processes; ActiveBPEL, a workflow execution engine based on BPEL (Business Process Execution Language); XBaya, a graphical workflow composer; and the EOS Clearinghouse (ECHO). XBaya is used to construct an analysis workflow at UAH using ADam components, which are also installed remotely at the GES DISC, wrapped as Web Services. The S4PM processing engine searches ECHO for data using space-time criteria, staging them to cache, allowing the ActiveBPEL engine to remotely orchestras the processing workflow within S4PM. As mining is completed, the output is placed in an FTP holding area for the end user. The goals are to give users control over the data they want to process, while mining data at the data source using the server's resources rather than transferring the full volume over the internet. These diverse technologies have been infused into a functioning, distributed system with only minor changes to the underlying technologies. The key to the infusion is the loosely coupled, Web-Services based architecture: All of the participating components are accessible (one way or another) through (Simple Object Access Protocol) SOAP-based Web Services.
Lightweight monitoring and control system for coal mine safety using REST style.

PubMed

Cheng, Bo; Cheng, Xin; Chen, Junliang

2015-01-01

The complex environment of a coal mine requires the underground environment, devices and miners to be constantly monitored to ensure safe coal production. However, existing coal mines do not meet these coverage requirements because blind spots occur when using a wired network. In this paper, we develop a Web-based, lightweight remote monitoring and control platform using a wireless sensor network (WSN) with the REST style to collect temperature, humidity and methane concentration data in a coal mine using sensor nodes. This platform also collects information on personnel positions inside the mine. We implement a RESTful application programming interface (API) that provides access to underground sensors and instruments through the Web such that underground coal mine physical devices can be easily interfaced to remote monitoring and control applications. We also implement three different scenarios for Web-based, lightweight remote monitoring and control of coal mine safety and measure and analyze the system performance. Finally, we present the conclusions from this study and discuss future work. Copyright © 2014 ISA. Published by Elsevier Ltd. All rights reserved.

Examining Web 2.0 Tools Usage of Science Teacher Candidates

ERIC Educational Resources Information Center

Balkan Kiyici, Fatime

2012-01-01

Using technology in a science teaching is so important. Only the person, who can use these tools in expert level, can use these tools in their teaching activities. In this research it is aimed firstly identifying science teacher candidates web 2.0 tools usage experience level and factors affecting experience level. In this research survey method…
Lecture Attendance and Web Based Lecture Technologies: A Comparison of Student Perceptions and Usage Patterns

ERIC Educational Resources Information Center

von Konsky, Brian R.; Ivins, Jim; Gribble, Susan J.

2009-01-01

This paper investigates the impact of web based lecture recordings on learning and attendance at lectures. Student opinions regarding the perceived value of the recordings were evaluated in the context of usage patterns and final marks, and compared with attendance data and student perceptions regarding the usefulness of lectures. The availability…
HEP Outreach, Inreach, and Web 2.0

NASA Astrophysics Data System (ADS)

Goldfarb, Steven

2011-12-01

I report on current usage of multimedia and social networking "Web 2.0" tools for Education and Outreach in high-energy physics, and discuss their potential for internal communication within large worldwide collaborations, such as those of the LHC. Following a brief description of the history of Web 2.0 development, I present a survey of the most popular sites and describe their usage in HEP to disseminate information to students and the general public. I then discuss the potential of certain specific tools, such as document and multimedia sharing sites, for boosting the speed and effectiveness of information exchange within the collaborations. I conclude with a brief discussion of the successes and failures of these tools, and make suggestions for improved usage in the future.
Service-based analysis of biological pathways

PubMed Central

Zheng, George; Bouguettaya, Athman

2009-01-01

Background Computer-based pathway discovery is concerned with two important objectives: pathway identification and analysis. Conventional mining and modeling approaches aimed at pathway discovery are often effective at achieving either objective, but not both. Such limitations can be effectively tackled leveraging a Web service-based modeling and mining approach. Results Inspired by molecular recognitions and drug discovery processes, we developed a Web service mining tool, named PathExplorer, to discover potentially interesting biological pathways linking service models of biological processes. The tool uses an innovative approach to identify useful pathways based on graph-based hints and service-based simulation verifying user's hypotheses. Conclusion Web service modeling of biological processes allows the easy access and invocation of these processes on the Web. Web service mining techniques described in this paper enable the discovery of biological pathways linking these process service models. Algorithms presented in this paper for automatically highlighting interesting subgraph within an identified pathway network enable the user to formulate hypothesis, which can be tested out using our simulation algorithm that are also described in this paper. PMID:19796403
Exploiting Recurring Structure in a Semantic Network

NASA Technical Reports Server (NTRS)

Wolfe, Shawn R.; Keller, Richard M.

2004-01-01

With the growing popularity of the Semantic Web, an increasing amount of information is becoming available in machine interpretable, semantically structured networks. Within these semantic networks are recurring structures that could be mined by existing or novel knowledge discovery methods. The mining of these semantic structures represents an interesting area that focuses on mining both for and from the Semantic Web, with surprising applicability to problems confronting the developers of Semantic Web applications. In this paper, we present representative examples of recurring structures and show how these structures could be used to increase the utility of a semantic repository deployed at NASA.
Data Mining Web Services for Science Data Repositories

NASA Astrophysics Data System (ADS)

Graves, S.; Ramachandran, R.; Keiser, K.; Maskey, M.; Lynnes, C.; Pham, L.

2006-12-01

The maturation of web services standards and technologies sets the stage for a distributed "Service-Oriented Architecture" (SOA) for NASA's next generation science data processing. This architecture will allow members of the scientific community to create and combine persistent distributed data processing services and make them available to other users over the Internet. NASA has initiated a project to create a suite of specialized data mining web services designed specifically for science data. The project leverages the Algorithm Development and Mining (ADaM) toolkit as its basis. The ADaM toolkit is a robust, mature and freely available science data mining toolkit that is being used by several research organizations and educational institutions worldwide. These mining services will give the scientific community a powerful and versatile data mining capability that can be used to create higher order products such as thematic maps from current and future NASA satellite data records with methods that are not currently available. The package of mining and related services are being developed using Web Services standards so that community-based measurement processing systems can access and interoperate with them. These standards-based services allow users different options for utilizing them, from direct remote invocation by a client application to deployment of a Business Process Execution Language (BPEL) solutions package where a complex data mining workflow is exposed to others as a single service. The ability to deploy and operate these services at a data archive allows the data mining algorithms to be run where the data are stored, a more efficient scenario than moving large amounts of data over the network. This will be demonstrated in a scenario in which a user uses a remote Web-Service-enabled clustering algorithm to create cloud masks from satellite imagery at the Goddard Earth Sciences Data and Information Services Center (GES DISC).
A construction scheme of web page comment information extraction system based on frequent subtree mining

NASA Astrophysics Data System (ADS)

Zhang, Xiaowen; Chen, Bingfeng

2017-08-01

Based on the frequent sub-tree mining algorithm, this paper proposes a construction scheme of web page comment information extraction system based on frequent subtree mining, referred to as FSM system. The entire system architecture and the various modules to do a brief introduction, and then the core of the system to do a detailed description, and finally give the system prototype.
Kernel Methods for Mining Instance Data in Ontologies

NASA Astrophysics Data System (ADS)

Bloehdorn, Stephan; Sure, York

The amount of ontologies and meta data available on the Web is constantly growing. The successful application of machine learning techniques for learning of ontologies from textual data, i.e. mining for the Semantic Web, contributes to this trend. However, no principal approaches exist so far for mining from the Semantic Web. We investigate how machine learning algorithms can be made amenable for directly taking advantage of the rich knowledge expressed in ontologies and associated instance data. Kernel methods have been successfully employed in various learning tasks and provide a clean framework for interfacing between non-vectorial data and machine learning algorithms. In this spirit, we express the problem of mining instances in ontologies as the problem of defining valid corresponding kernels. We present a principled framework for designing such kernels by means of decomposing the kernel computation into specialized kernels for selected characteristics of an ontology which can be flexibly assembled and tuned. Initial experiments on real world Semantic Web data enjoy promising results and show the usefulness of our approach.
Abandoned Uranium Mines (AUM) Site Screening Map Service, 2016, US EPA Region 9

EPA Pesticide Factsheets

As described in detail in the Five-Year Report, US EPA completed on-the-ground screening of 521 abandoned uranium mine areas. US EPA and the Navajo EPA are using the Comprehensive Database and Atlas to determine which mines should be cleaned up first. US EPA continues to research and identify Potentially Responsible Parties (PRPs) under Superfund to contribute to the costs of cleanup efforts.This US EPA Region 9 web service contains the following map layers:Abandoned Uranium Mines, Priority Mines, Tronox Mines, Navajo Environmental Response Trust Mines, Mines with Enforcement Actions, Superfund AUM Regions, Navajo Nation Administrative Boundaries and Chapter Houses.Mine points have a maximum scale of 1:220,000, while Mine polygons have a minimum scale of 1:220,000. Chapter houses have a minimum scale of 1:200,000. BLM Land Status has a minimum scale of 1:150,000.Full FGDC metadata records for each layer can be found by clicking the layer name at the web service endpoint and viewing the layer description. Data used to create this web service are available for download at https://edg.epa.gov/metadata/catalog/data/data.page.Security Classification: Public. Access Constraints: None. Use Constraints: None. Please check sources, scale, accuracy, currentness and other available information. Please confirm that you are using the most recent copy of both data and metadata. Acknowledgement of the EPA would be appreciated.
Provenance-Based Approaches to Semantic Web Service Discovery and Usage

ERIC Educational Resources Information Center

Narock, Thomas William

2012-01-01

The World Wide Web Consortium defines a Web Service as "a software system designed to support interoperable machine-to-machine interaction over a network." Web Services have become increasingly important both within and across organizational boundaries. With the recent advent of the Semantic Web, web services have evolved into semantic…
Using an improved association rules mining optimization algorithm in web-based mobile-learning system

NASA Astrophysics Data System (ADS)

Huang, Yin; Chen, Jianhua; Xiong, Shaojun

2009-07-01

Mobile-Learning (M-learning) makes many learners get the advantages of both traditional learning and E-learning. Currently, Web-based Mobile-Learning Systems have created many new ways and defined new relationships between educators and learners. Association rule mining is one of the most important fields in data mining and knowledge discovery in databases. Rules explosion is a serious problem which causes great concerns, as conventional mining algorithms often produce too many rules for decision makers to digest. Since Web-based Mobile-Learning System collects vast amounts of student profile data, data mining and knowledge discovery techniques can be applied to find interesting relationships between attributes of learners, assessments, the solution strategies adopted by learners and so on. Therefore ,this paper focus on a new data-mining algorithm, combined with the advantages of genetic algorithm and simulated annealing algorithm , called ARGSA(Association rules based on an improved Genetic Simulated Annealing Algorithm), to mine the association rules. This paper first takes advantage of the Parallel Genetic Algorithm and Simulated Algorithm designed specifically for discovering association rules. Moreover, the analysis and experiment are also made to show the proposed method is superior to the Apriori algorithm in this Mobile-Learning system.
A Study on Information Search and Commitment Strategies on Web Environment and Internet Usage Self-Efficacy Beliefs of University Students'

ERIC Educational Resources Information Center

Geçer, Aynur Kolburan

2014-01-01

This study addresses university students' information search and commitment strategies on web environment and internet usage self-efficacy beliefs in terms of such variables as gender, department, grade level and frequency of internet use; and whether there is a significant relation between these beliefs. Descriptive method was used in the study.…
Usage Analysis of Web 2.0 and Library 2.0 Tools by Librarians in Kwara State Academic Libraries

ERIC Educational Resources Information Center

Tella, Adeyinka; Soluoku, Taofeeqat

2016-01-01

This study analysed the usage of Web 2.0 and Library 2.0 tools by librarians in Kwara State academic libraries. A sample of 40 librarians was surveyed through total enumeration sampling technique from four different tertiary education institutions libraries in Kwara State, Nigeria. Questionnaire was used for the collection of data. The collected…
Graph Mining Meets the Semantic Web

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, Sangkeun; Sukumar, Sreenivas R; Lim, Seung-Hwan

The Resource Description Framework (RDF) and SPARQL Protocol and RDF Query Language (SPARQL) were introduced about a decade ago to enable flexible schema-free data interchange on the Semantic Web. Today, data scientists use the framework as a scalable graph representation for integrating, querying, exploring and analyzing data sets hosted at different sources. With increasing adoption, the need for graph mining capabilities for the Semantic Web has emerged. We address that need through implementation of three popular iterative Graph Mining algorithms (Triangle count, Connected component analysis, and PageRank). We implement these algorithms as SPARQL queries, wrapped within Python scripts. We evaluatemore » the performance of our implementation on 6 real world data sets and show graph mining algorithms (that have a linear-algebra formulation) can indeed be unleashed on data represented as RDF graphs using the SPARQL query interface.« less
Mining Longitudinal Web Queries: Trends and Patterns.

ERIC Educational Resources Information Center

Wang, Peiling; Berry, Michael W.; Yang, Yiheng

2003-01-01

Analyzed user queries submitted to an academic Web site during a four-year period, using a relational database, to examine users' query behavior, to identify problems they encounter, and to develop techniques for optimizing query analysis and mining. Linguistic analyses focus on query structures, lexicon, and word associations using statistical…
An Expertise Recommender using Web Mining

NASA Technical Reports Server (NTRS)

Joshi, Anupam; Chandrasekaran, Purnima; ShuYang, Michelle; Ramakrishnan, Ramya

2001-01-01

This report explored techniques to mine web pages of scientists to extract information regarding their expertise, build expertise chains and referral webs, and semi automatically combine this information with directory information services to create a recommender system that permits query by expertise. The approach included experimenting with existing techniques that have been reported in research literature in recent past , and adapted them as needed. In addition, software tools were developed to capture and use this information.
Visualization of usability and functionality of a professional website through web-mining.

PubMed

Jones, Josette F; Mahoui, Malika; Gopa, Venkata Devi Pragna

2007-10-11

Functional interface design requires understanding of the information system structure and the user. Web logs record user interactions with the interface, and thus provide some insight into user search behavior and efficiency of the search process. The present study uses a data-mining approach with techniques such as association rules, clustering and classification, to visualize the usability and functionality of a digital library through in depth analyses of web logs.
Application of data mining in science and technology management information system based on WebGIS

NASA Astrophysics Data System (ADS)

Wu, Xiaofang; Xu, Zhiyong; Bao, Shitai; Chen, Feixiang

2009-10-01

With the rapid development of science and technology and the quick increase of information, a great deal of data is accumulated in the management department of science and technology. Usually, many knowledge and rules are contained and concealed in the data. Therefore, how to excavate and use the knowledge fully is very important in the management of science and technology. It will help to examine and approve the project of science and technology more scientifically and make the achievement transformed as the realistic productive forces easier. Therefore, the data mine technology will be researched and applied to the science and technology management information system to find and excavate the knowledge in the paper. According to analyzing the disadvantages of traditional science and technology management information system, the database technology, data mining and web geographic information systems (WebGIS) technology will be introduced to develop and construct the science and technology management information system based on WebGIS. The key problems are researched in detail such as data mining and statistical analysis. What's more, the prototype system is developed and validated based on the project data of National Natural Science Foundation Committee. The spatial data mining is done from the axis of time, space and other factors. Then the variety of knowledge and rules will be excavated by using data mining technology, which helps to provide an effective support for decisionmaking.
A Hybrid Data Mining Approach for Credit Card Usage Behavior Analysis

NASA Astrophysics Data System (ADS)

Tsai, Chieh-Yuan

Credit card is one of the most popular e-payment approaches in current online e-commerce. To consolidate valuable customers, card issuers invest a lot of money to maintain good relationship with their customers. Although several efforts have been done in studying card usage motivation, few researches emphasize on credit card usage behavior analysis when time periods change from t to t+1. To address this issue, an integrated data mining approach is proposed in this paper. First, the customer profile and their transaction data at time period t are retrieved from databases. Second, a LabelSOM neural network groups customers into segments and identify critical characteristics for each group. Third, a fuzzy decision tree algorithm is used to construct usage behavior rules of interesting customer groups. Finally, these rules are used to analysis the behavior changes between time periods t and t+1. An implementation case using a practical credit card database provided by a commercial bank in Taiwan is illustrated to show the benefits of the proposed framework.
Evaluating the Utility of Web-Based Consumer Support Tools Using Rough Sets

NASA Astrophysics Data System (ADS)

Maciag, Timothy; Hepting, Daryl H.; Slezak, Dominik; Hilderman, Robert J.

On the Web, many popular e-commerce sites provide consumers with decision support tools to assist them in their commerce-related decision-making. Many consumers will rank the utility of these tools quite highly. Data obtained from web usage mining analyses, which may provide knowledge about a user's online experiences, could help indicate the utility of these tools. This type of analysis could provide insight into whether provided tools are adequately assisting consumers in conducting their online shopping activities or if new or additional enhancements need consideration. Although some research in this regard has been described in previous literature, there is still much that can be done. The authors of this paper hypothesize that a measurement of consumer decision accuracy, i.e. a measurement preferences, could help indicate the utility of these tools. This paper describes a procedure developed towards this goal using elements of rough set theory. The authors evaluated the procedure using two support tools, one based on a tool developed by the US-EPA and the other developed by one of the authors called cogito. Results from the evaluation did provide interesting insights on the utility of both support tools. Although it was shown that the cogito tool obtained slightly higher decision accuracy, both tools could be improved from additional enhancements. Details of the procedure developed and results obtained from the evaluation will be provided. Opportunities for future work are also discussed.

The Effects of Web 2.0 Technologies Usage in Programming Languages Lesson on the Academic Success, Interrogative Learning Skills and Attitudes of Students towards Programming Languages

ERIC Educational Resources Information Center

Gençtürk, Abdullah Tarik; Korucu, Agah Tugrul

2017-01-01

It is observed that teacher candidates receiving education in the department of Computer and Instructional Technologies Education are not able to gain enough experience and knowledge in "Programming Languages" lesson. The goal of this study is to analyse the effects of web 2.0 technologies usage in programming languages lesson on the…
Binary Coded Web Access Pattern Tree in Education Domain

ERIC Educational Resources Information Center

Gomathi, C.; Moorthi, M.; Duraiswamy, K.

2008-01-01

Web Access Pattern (WAP), which is the sequence of accesses pursued by users frequently, is a kind of interesting and useful knowledge in practice. Sequential Pattern mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of…
Use of communication technologies by people with type 1 diabetes in the social networking era. A chance for improvement.

PubMed

Giménez-Pérez, Gabriel; Recasens, Assumpta; Simó, Olga; Aguas, Teresa; Suárez, Ana; Vila, Maria; Castells, Ignasi

2016-04-01

To evaluate the health-related use of Web 2.0 tools by patients with type 1 diabetes. Cross-sectional survey assessing views and usage of the Internet, Apps and Web 2.0. Number of participants: 289 (age 42.8±13.5 years; diabetes duration 18.4±12.2 years; 58.7% males; 39% with an upper secondary or higher education level). Web 2.0 usage for health purposes was low with 19.6% and 14% of Web 2.0 members (147; 50.9%) having health-related contacts and posting health comments. Health-related Apps were used by 35.4% of Smartphone owners (161; 55.7%). 75.3% patients would share information online with professionals, preferably through e-mail (78.7%) rather than Facebook (47.7%). 141 (66.5%) of those willing to share information would participate in a professional-moderated Facebook group. Web 2.0 and Apps usage for health purposes is low. The difference between the use of Web 2.0 networks and the willingness to participate in professional-moderated Web 2.0 groups points to the need of a higher implication of health professionals in promoting Web 2.0 technologies if these are to be adopted in a clinical setting. Currently, e-mail is the tool to be considered when aiming to increase online communication with patients with type 1 diabetes. Copyright © 2015 Primary Care Diabetes Europe. Published by Elsevier Ltd. All rights reserved.
Text and Structural Data Mining of Influenza Mentions in Web and Social Media

DOE Office of Scientific and Technical Information (OSTI.GOV)

Corley, Courtney D.; Cook, Diane; Mikler, Armin R.

Text and structural data mining of Web and social media (WSM) provides a novel disease surveillance resource and can identify online communities for targeted public health communications (PHC) to assure wide dissemination of pertinent information. WSM that mention influenza are harvested over a 24-week period, 5-October-2008 to 21-March-2009. Link analysis reveals communities for targeted PHC. Text mining is shown to identify trends in flu posts that correlate to real-world influenza-like-illness patient report data. We also bring to bear a graph-based data mining technique to detect anomalies among flu blogs connected by publisher type, links, and user-tags.
Health care public reporting utilization - user clusters, web trails, and usage barriers on Germany's public reporting portal Weisse-Liste.de.

PubMed

Pross, Christoph; Averdunk, Lars-Henrik; Stjepanovic, Josip; Busse, Reinhard; Geissler, Alexander

2017-04-21

Quality of care public reporting provides structural, process and outcome information to facilitate hospital choice and strengthen quality competition. Yet, evidence indicates that patients rarely use this information in their decision-making, due to limited awareness of the data and complex and conflicting information. While there is enthusiasm among policy makers for public reporting, clinicians and researchers doubt its overall impact. Almost no study has analyzed how users behave on public reporting portals, which information they seek out and when they abort their search. This study employs web-usage mining techniques on server log data of 17 million user actions from Germany's premier provider transparency portal Weisse-Liste.de (WL.de) between 2012 and 2015. Postal code and ICD search requests facilitate identification of geographical and treatment area usage patterns. User clustering helps to identify user types based on parameters like session length, referrer and page topic visited. First-level markov chains illustrate common click paths and premature exits. In 2015, the WL.de Hospital Search portal had 2,750 daily users, with 25% mobile traffic, a bounce rate of 38% and 48% of users examining hospital quality information. From 2013 to 2015, user traffic grew at 38% annually. On average users spent 7 min on the portal, with 7.4 clicks and 54 s between clicks. Users request information for many oncologic and orthopedic conditions, for which no process or outcome quality indicators are available. Ten distinct user types, with particular usage patterns and interests, are identified. In particular, the different types of professional and non-professional users need to be addressed differently to avoid high premature exit rates at several key steps in the information search and view process. Of all users, 37% enter hospital information correctly upon entry, while 47% require support in their hospital search. Several onsite and offsite improvement options are identified. Public reporting needs to be directed at the interests of its users, with more outcome quality information for oncology and orthopedics. Customized reporting can cater to the different needs and skill levels of professional and non-professional users. Search engine optimization and hospital quality advocacy can increase website traffic.
Optimizing the Information Presentation on Mining Potential by using Web Services Technology with Restful Protocol

NASA Astrophysics Data System (ADS)

Abdillah, T.; Dai, R.; Setiawan, E.

2018-02-01

This study aims to develop the application of Web Services technology with RestFul Protocol to optimize the information presentation on mining potential. This study used User Interface Design approach for the information accuracy and relevance as well as the Web Service for the reliability in presenting the information. The results show that: the information accuracy and relevance regarding mining potential can be seen from the achievement of User Interface implementation in the application that is based on the following rules: The consideration of the appropriate colours and objects, the easiness of using the navigation, and users’ interaction with the applications that employs symbols and languages understood by the users; the information accuracy and relevance related to mining potential can be observed by the information presented by using charts and Tool Tip Text to help the users understand the provided chart/figure; the reliability of the information presentation is evident by the results of Web Services testing in Figure 4.5.6. This study finds out that User Interface Design and Web Services approaches (for the access of different Platform apps) are able to optimize the presentation. The results of this study can be used as a reference for software developers and Provincial Government of Gorontalo.
Teachers' Technology Acceptance and Usage Situations and the Evaluation of Web Pedagogic Content Knowledge in Terms of Different Variations and the Determination of the Relationship between These

ERIC Educational Resources Information Center

Korucu, Agah Tugrul

2017-01-01

The goal of this study is to analyze the situations of teachers' technology acceptance and usage (TAU) and web pedagogy content knowledge (WPACK) in terms of different variations and to determine of the relationship between these two. The study group of this research consists of 96 teachers in total having different variations such as different…
Data mining application in customer relationship management for hospital inpatients.

PubMed

Lee, Eun Whan

2012-09-01

This study aims to discover patients loyal to a hospital and model their medical service usage patterns. Consequently, this study proposes a data mining application in customer relationship management (CRM) for hospital inpatients. A recency, frequency, monetary (RFM) model has been applied toward 14,072 patients discharged from a university hospital. Cluster analysis was conducted to segment customers, and it modeled the patterns of the loyal customers' medical services usage via a decision tree. Patients were divided into two groups according to the variables of the RFM model and the group which had significantly high frequency of medical use and expenses was defined as loyal customers, a target market. As a result of the decision tree, the predictable factors of the loyal clients were; length of stay, certainty of selectable treatment, surgery, number of accompanying treatments, kind of patient room, and department from which they were discharged. Particularly, this research showed that when a patient within the internal medicine department who did not have surgery stayed for more than 13.5 days, their probability of being a classified as a loyal customer was 70.0%. To discover a hospital's loyal patients and model their medical usage patterns, the application of data-mining has been suggested. This paper suggests practical use of combining segmentation, targeting, positioning (STP) strategy and the RFM model with data-mining in CRM.
Data Mining Application in Customer Relationship Management for Hospital Inpatients

PubMed Central

2012-01-01

Objectives This study aims to discover patients loyal to a hospital and model their medical service usage patterns. Consequently, this study proposes a data mining application in customer relationship management (CRM) for hospital inpatients. Methods A recency, frequency, monetary (RFM) model has been applied toward 14,072 patients discharged from a university hospital. Cluster analysis was conducted to segment customers, and it modeled the patterns of the loyal customers' medical services usage via a decision tree. Results Patients were divided into two groups according to the variables of the RFM model and the group which had significantly high frequency of medical use and expenses was defined as loyal customers, a target market. As a result of the decision tree, the predictable factors of the loyal clients were; length of stay, certainty of selectable treatment, surgery, number of accompanying treatments, kind of patient room, and department from which they were discharged. Particularly, this research showed that when a patient within the internal medicine department who did not have surgery stayed for more than 13.5 days, their probability of being a classified as a loyal customer was 70.0%. Conclusions To discover a hospital's loyal patients and model their medical usage patterns, the application of data-mining has been suggested. This paper suggests practical use of combining segmentation, targeting, positioning (STP) strategy and the RFM model with data-mining in CRM. PMID:23115740
Recommendations for Benchmarking Web Site Usage among Academic Libraries.

ERIC Educational Resources Information Center

Hightower, Christy; Sih, Julie; Tilghman, Adam

1998-01-01

To help library directors and Web developers create a benchmarking program to compare statistics of academic Web sites, the authors analyzed the Web server log files of 14 university science and engineering libraries. Recommends a centralized voluntary reporting structure coordinated by the Association of Research Libraries (ARL) and a method for…
Delving into Data

ERIC Educational Resources Information Center

Cullen, Kevin

2005-01-01

Corporations employ data mining to analyze operations, find trends in recorded information, and look for new opportunities. Libraries are no different. Librarians manage large stores of data--about collections and usage, for example--and they also want to analyze this data to serve their users better. Analysts use data mining to query a data…
Usage and User Acceptance of Applied Physics Letters Online

NASA Astrophysics Data System (ADS)

Ingoldsby, Timothy C.

1996-03-01

Applied Physics Letters Online became the first established physics print journal to appear online in full-text, hyperlinked form effective with January 1996 issues. In partnership with the Online Computer Library Center (OCLC), APL Online at the same time became the first established scientific or engineering journal to appear on the World Wide Web, in addition to being available through OCLC's proprietary Guidon user interface. AIP has now accumulated usage data for more than one year of operation, and has recently completed a survey of its full subscriber base. Usage has steadily increased throughout the year, with subscribers showing a clear preference for the Web version, even though it provides an interface in many ways inferior to OCLC's Guidon. Usage data and subscriber survey results will be presented, and directions for future research in online information delivery will be presented.
Motivation Mining: Prospecting the Web.

ERIC Educational Resources Information Center

Small, Ruth V.; Arnone, Marilyn P.

1999-01-01

Describes WebMAC instruments, which differ from other Web-evaluation instruments because they have a theoretical base, are user-centered, are designed for students in grades 7 through 12, and assess the motivational quality of Web sites. Examples are given of uses of WebMAC Middle and WebMAC Senior in activities to promote evaluation and…
Analysis of mesenchymal stem cell differentiation in vitro using classification association rule mining.

PubMed

Wang, Weiqi; Wang, Yanbo Justin; Bañares-Alcántara, René; Coenen, Frans; Cui, Zhanfeng

2009-12-01

In this paper, data mining is used to analyze the data on the differentiation of mammalian Mesenchymal Stem Cells (MSCs), aiming at discovering known and hidden rules governing MSC differentiation, following the establishment of a web-based public database containing experimental data on the MSC proliferation and differentiation. To this effect, a web-based public interactive database comprising the key parameters which influence the fate and destiny of mammalian MSCs has been constructed and analyzed using Classification Association Rule Mining (CARM) as a data-mining technique. The results show that the proposed approach is technically feasible and performs well with respect to the accuracy of (classification) prediction. Key rules mined from the constructed MSC database are consistent with experimental observations, indicating the validity of the method developed and the first step in the application of data mining to the study of MSCs.
Surfing for thinness: a pilot study of pro-eating disorder Web site usage in adolescents with eating disorders.

PubMed

Wilson, Jenny L; Peebles, Rebecka; Hardy, Kristina K; Litt, Iris F

2006-12-01

Pro-eating disorder Web sites are communities of individuals who engage in disordered eating and use the Internet to discuss their activities. Pro-recovery sites, which are less numerous, express a recovery-oriented perspective. This pilot study investigated the awareness and usage of pro-eating disorder Web sites among adolescents with eating disorders and their parents and explored associations with health and quality of life. This was a cross-sectional study of 698 families of patients (aged 10-22 years) diagnosed with an eating disorder at Stanford between 1997 and 2004. Anonymous surveys were mailed and offered in clinic. Survey content included questions about disease severity, health outcomes, Web site usage, and parental knowledge of eating disorder Web site usage. Surveys were returned by 182 individuals: 76 patients and 106 parents. Parents frequently (52.8%) were aware of pro-eating disorder sites, but an equal number did not know whether their child visited these sites, and only 27.6% had discussed them with their child. Most (62.5%) parents, however, did not know about pro-recovery sites. Forty-one percent of patients visited pro-recovery sites, 35.5% visited pro-eating disorder sites, 25.0% visited both, and 48.7% visited neither. While visiting pro-eating disorder sites, 96.0% reported learning new weight loss or purging techniques. However, 46.4% of pro-recovery site visitors also learned new techniques. Pro-eating disorder site users did not differ from nonusers in health outcomes but reported spending less time on school or schoolwork and had a longer duration of illness. Users of both pro-eating disorder and pro-recovery sites were hospitalized more than users of neither site. Pro-eating disorder site usage was prevalent among adolescents with eating disorders, yet parents had little knowledge of this. Although use of these sites was not associated with other health outcomes, usage may have a negative impact on quality of life and result in adolescents' learning about and adopting disordered eating behaviors.
Mining Student Data Captured from a Web-Based Tutoring Tool: Initial Exploration and Results

ERIC Educational Resources Information Center

Merceron, Agathe; Yacef, Kalina

2004-01-01

In this article we describe the initial investigations that we have conducted on student data collected from a web-based tutoring tool. We have used some data mining techniques such as association rule and symbolic data analysis, as well as traditional SQL queries to gain further insight on the students' learning and deduce information to improve…
Beyond accuracy: creating interoperable and scalable text-mining web services.

PubMed

Wei, Chih-Hsuan; Leaman, Robert; Lu, Zhiyong

2016-06-15

The biomedical literature is a knowledge-rich resource and an important foundation for future research. With over 24 million articles in PubMed and an increasing growth rate, research in automated text processing is becoming increasingly important. We report here our recently developed web-based text mining services for biomedical concept recognition and normalization. Unlike most text-mining software tools, our web services integrate several state-of-the-art entity tagging systems (DNorm, GNormPlus, SR4GN, tmChem and tmVar) and offer a batch-processing mode able to process arbitrary text input (e.g. scholarly publications, patents and medical records) in multiple formats (e.g. BioC). We support multiple standards to make our service interoperable and allow simpler integration with other text-processing pipelines. To maximize scalability, we have preprocessed all PubMed articles, and use a computer cluster for processing large requests of arbitrary text. Our text-mining web service is freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/tmTools/#curl : Zhiyong.Lu@nih.gov. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.
Stratification-Based Outlier Detection over the Deep Web.

PubMed

Xian, Xuefeng; Zhao, Pengpeng; Sheng, Victor S; Fang, Ligang; Gu, Caidong; Yang, Yuanfeng; Cui, Zhiming

2016-01-01

For many applications, finding rare instances or outliers can be more interesting than finding common patterns. Existing work in outlier detection never considers the context of deep web. In this paper, we argue that, for many scenarios, it is more meaningful to detect outliers over deep web. In the context of deep web, users must submit queries through a query interface to retrieve corresponding data. Therefore, traditional data mining methods cannot be directly applied. The primary contribution of this paper is to develop a new data mining method for outlier detection over deep web. In our approach, the query space of a deep web data source is stratified based on a pilot sample. Neighborhood sampling and uncertainty sampling are developed in this paper with the goal of improving recall and precision based on stratification. Finally, a careful performance evaluation of our algorithm confirms that our approach can effectively detect outliers in deep web.
Stratification-Based Outlier Detection over the Deep Web

PubMed Central

Xian, Xuefeng; Zhao, Pengpeng; Sheng, Victor S.; Fang, Ligang; Gu, Caidong; Yang, Yuanfeng; Cui, Zhiming

2016-01-01

For many applications, finding rare instances or outliers can be more interesting than finding common patterns. Existing work in outlier detection never considers the context of deep web. In this paper, we argue that, for many scenarios, it is more meaningful to detect outliers over deep web. In the context of deep web, users must submit queries through a query interface to retrieve corresponding data. Therefore, traditional data mining methods cannot be directly applied. The primary contribution of this paper is to develop a new data mining method for outlier detection over deep web. In our approach, the query space of a deep web data source is stratified based on a pilot sample. Neighborhood sampling and uncertainty sampling are developed in this paper with the goal of improving recall and precision based on stratification. Finally, a careful performance evaluation of our algorithm confirms that our approach can effectively detect outliers in deep web. PMID:27313603
Monitoring and Evaluating Use of the World Wide Web in an Academic Library: An Exploratory Study.

ERIC Educational Resources Information Center

Abramson, Alicia D.

1998-01-01

Examines use of the World Wide Web on public-access computers at the American University Library (Washington, D.C.) to identify the most frequently accessed Web sites, the frequency with which library-owned Web resources were accessed, and Web-usage patterns in the library in relation to the time of day and day of the week. (Author/AEF)

Deploying and sharing U-Compare workflows as web services.

PubMed

Kontonatsios, Georgios; Korkontzelos, Ioannis; Kolluru, Balakrishna; Thompson, Paul; Ananiadou, Sophia

2013-02-18

U-Compare is a text mining platform that allows the construction, evaluation and comparison of text mining workflows. U-Compare contains a large library of components that are tuned to the biomedical domain. Users can rapidly develop biomedical text mining workflows by mixing and matching U-Compare's components. Workflows developed using U-Compare can be exported and sent to other users who, in turn, can import and re-use them. However, the resulting workflows are standalone applications, i.e., software tools that run and are accessible only via a local machine, and that can only be run with the U-Compare platform. We address the above issues by extending U-Compare to convert standalone workflows into web services automatically, via a two-click process. The resulting web services can be registered on a central server and made publicly available. Alternatively, users can make web services available on their own servers, after installing the web application framework, which is part of the extension to U-Compare. We have performed a user-oriented evaluation of the proposed extension, by asking users who have tested the enhanced functionality of U-Compare to complete questionnaires that assess its functionality, reliability, usability, efficiency and maintainability. The results obtained reveal that the new functionality is well received by users. The web services produced by U-Compare are built on top of open standards, i.e., REST and SOAP protocols, and therefore, they are decoupled from the underlying platform. Exported workflows can be integrated with any application that supports these open standards. We demonstrate how the newly extended U-Compare enhances the cross-platform interoperability of workflows, by seamlessly importing a number of text mining workflow web services exported from U-Compare into Taverna, i.e., a generic scientific workflow construction platform.
Deploying and sharing U-Compare workflows as web services

PubMed Central

2013-01-01

Background U-Compare is a text mining platform that allows the construction, evaluation and comparison of text mining workflows. U-Compare contains a large library of components that are tuned to the biomedical domain. Users can rapidly develop biomedical text mining workflows by mixing and matching U-Compare’s components. Workflows developed using U-Compare can be exported and sent to other users who, in turn, can import and re-use them. However, the resulting workflows are standalone applications, i.e., software tools that run and are accessible only via a local machine, and that can only be run with the U-Compare platform. Results We address the above issues by extending U-Compare to convert standalone workflows into web services automatically, via a two-click process. The resulting web services can be registered on a central server and made publicly available. Alternatively, users can make web services available on their own servers, after installing the web application framework, which is part of the extension to U-Compare. We have performed a user-oriented evaluation of the proposed extension, by asking users who have tested the enhanced functionality of U-Compare to complete questionnaires that assess its functionality, reliability, usability, efficiency and maintainability. The results obtained reveal that the new functionality is well received by users. Conclusions The web services produced by U-Compare are built on top of open standards, i.e., REST and SOAP protocols, and therefore, they are decoupled from the underlying platform. Exported workflows can be integrated with any application that supports these open standards. We demonstrate how the newly extended U-Compare enhances the cross-platform interoperability of workflows, by seamlessly importing a number of text mining workflow web services exported from U-Compare into Taverna, i.e., a generic scientific workflow construction platform. PMID:23419017
MyWEST: my Web Extraction Software Tool for effective mining of annotations from web-based databanks.

PubMed

Masseroli, Marco; Stella, Andrea; Meani, Natalia; Alcalay, Myriam; Pinciroli, Francesco

2004-12-12

High-throughput technologies create the necessity to mine large amounts of gene annotations from diverse databanks, and to integrate the resulting data. Most databanks can be interrogated only via Web, for a single gene at a time, and query results are generally available only in the HTML format. Although some databanks provide batch retrieval of data via FTP, this requires expertise and resources for locally reimplementing the databank. We developed MyWEST, a tool aimed at researchers without extensive informatics skills or resources, which exploits user-defined templates to easily mine selected annotations from different Web-interfaced databanks, and aggregates and structures results in an automatically updated database. Using microarray results from a model system of retinoic acid-induced differentiation, MyWEST effectively gathered relevant annotations from various biomolecular databanks, highlighted significant biological characteristics and supported a global approach to the understanding of complex cellular mechanisms. MyWEST is freely available for non-profit use at http://www.medinfopoli.polimi.it/MyWEST/
CANFAR+Skytree: A Cloud Computing and Data Mining System for Astronomy

NASA Astrophysics Data System (ADS)

Ball, N. M.

2013-10-01

This is a companion Focus Demonstration article to the CANFAR+Skytree poster (Ball 2013, this volume), demonstrating the usage of the Skytree machine learning software on the Canadian Advanced Network for Astronomical Research (CANFAR) cloud computing system. CANFAR+Skytree is the world's first cloud computing system for data mining in astronomy.
Who Goes There? Measuring Library Web Site Usage.

ERIC Educational Resources Information Center

Bauer, Kathleen

2000-01-01

Discusses how libraries can gather data on the use of their Web sites. Highlights include Web server log files, including the common log file, referrer log file, and agent log file; log file limitations; privacy concerns; and choosing log analysis software, both free and commercial. (LRW)
76 FR 60474 - Intent To Prepare a Draft Environmental Impact Statement (DEIS) for the Haile Gold Mine in...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-09-29

...--County on January 28, 2011. The public notice is available on Charleston District's public Web site at... eight open mining pits over a twelve-year period, with pit depths ranging from 110 to 840 feet deep. The... of January 28, 2011, and are available on Charleston District's public Web site at http://www.sac...
The spread of scientific information: insights from the web usage statistics in PLoS article-level metrics.

PubMed

Yan, Koon-Kiu; Gerstein, Mark

2011-01-01

The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML). These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact.
The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics

PubMed Central

Yan, Koon-Kiu; Gerstein, Mark

2011-01-01

The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML). These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact. PMID:21603617
Collecting conditions usage metadata to optimize current and future ATLAS software and processing

NASA Astrophysics Data System (ADS)

Rinaldi, L.; Barberis, D.; Formica, A.; Gallas, E. J.; Oda, S.; Rybkin, G.; Verducci, M.; ATLAS Collaboration

2017-10-01

Conditions data (for example: alignment, calibration, data quality) are used extensively in the processing of real and simulated data in ATLAS. The volume and variety of the conditions data needed by different types of processing are quite diverse, so optimizing its access requires a careful understanding of conditions usage patterns. These patterns can be quantified by mining representative log files from each type of processing and gathering detailed information about conditions usage for that type of processing into a central repository.
Discovering Student Web Usage Profiles Using Markov Chains

ERIC Educational Resources Information Center

Marques, Alice; Belo, Orlando

2011-01-01

Nowadays, Web based platforms are quite common in any university, supporting a very diversified set of applications and services. Ranging from personal management to student evaluation processes, Web based platforms are doing a great job providing a very flexible way of working, promote student enrolment, and making access to academic information…
Analyzing Web Server Logs to Improve a Site's Usage. The Systems Librarian

ERIC Educational Resources Information Center

Breeding, Marshall

2005-01-01

This column describes ways to streamline and optimize how a Web site works in order to improve both its usability and its visibility. The author explains how to analyze logs and other system data to measure the effectiveness of the Web site design and search engine.
Students Using a Novel Web-Based Laboratory Class Support System: A Case Study in Food Chemistry Education

ERIC Educational Resources Information Center

van der Kolk, Koos; Beldman, Gerrit; Hartog, Rob; Gruppen, Harry

2012-01-01

The design, usage, and evaluation of a Web-based laboratory manual (WebLM) are described. The main aim of the WebLM is to support students while working in the laboratory by providing them with just-in-time information. Design guidelines for this electronic manual were derived from literature on cognitive load and user interface design. The WebLM…
PaaS for web applications with OpenShift Origin

NASA Astrophysics Data System (ADS)

Lossent, A.; Rodriguez Peon, A.; Wagner, A.

2017-10-01

The CERN Web Frameworks team has deployed OpenShift Origin to facilitate deployment of web applications and to improving efficiency in terms of computing resource usage. OpenShift leverages Docker containers and Kubernetes orchestration to provide a Platform-as-a-service solution oriented for web applications. We will review use cases and how OpenShift was integrated with other services such as source control, web site management and authentication services.
The world wide web: exploring a new advertising environment.

PubMed

Johnson, C R; Neath, I

1999-01-01

The World Wide Web currently boasts millions of users in the United States alone and is likely to continue to expand both as a marketplace and as an advertising environment. Three experiments explored advertising in the Web environment, in particular memory for ads as they appear in everyday use across the Web. Experiments 1 and 2 examined the effect of advertising repetition on the retention of familiar and less familiar brand names, respectively. Experiment 1 demonstrated that repetition of a banner ad within multiple web pages can improve recall of familiar brand names, and Experiment 2 demonstrated that repetition can improve recognition of less familiar brand names. Experiment 3 directly compared the retention of familiar and less familiar brand names that were promoted by static and dynamic ads and demonstrated that the use of dynamic advertising can increase brand name recall, though only for familiar brand names. This study also demonstrated that, in the Web environment, much as in other advertising environments, familiar brand names possess a mnemonic advantage not possessed by less familiar brand names. Finally, data regarding Web usage gathered from all experiments confirm reports that Web usage among males tends to exceed that among females.
Creating Usage Context-Based Object Similarities to Boost Recommender Systems in Technology Enhanced Learning

ERIC Educational Resources Information Center

Niemann, Katja; Wolpers, Martin

2015-01-01

In this paper, we introduce a new way of detecting semantic similarities between learning objects by analysing their usage in web portals. Our approach relies on the usage-based relations between the objects themselves rather then on the content of the learning objects or on the relations between users and learning objects. We then take this new…
Effective Filtering of Query Results on Updated User Behavioral Profiles in Web Mining

PubMed Central

Sadesh, S.; Suganthe, R. C.

2015-01-01

Web with tremendous volume of information retrieves result for user related queries. With the rapid growth of web page recommendation, results retrieved based on data mining techniques did not offer higher performance filtering rate because relationships between user profile and queries were not analyzed in an extensive manner. At the same time, existing user profile based prediction in web data mining is not exhaustive in producing personalized result rate. To improve the query result rate on dynamics of user behavior over time, Hamilton Filtered Regime Switching User Query Probability (HFRS-UQP) framework is proposed. HFRS-UQP framework is split into two processes, where filtering and switching are carried out. The data mining based filtering in our research work uses the Hamilton Filtering framework to filter user result based on personalized information on automatic updated profiles through search engine. Maximized result is fetched, that is, filtered out with respect to user behavior profiles. The switching performs accurate filtering updated profiles using regime switching. The updating in profile change (i.e., switches) regime in HFRS-UQP framework identifies the second- and higher-order association of query result on the updated profiles. Experiment is conducted on factors such as personalized information search retrieval rate, filtering efficiency, and precision ratio. PMID:26221626
The effectiveness of a web 2.0 physical activity intervention in older adults - a randomised controlled trial.

PubMed

Alley, Stephanie J; Kolt, Gregory S; Duncan, Mitch J; Caperchione, Cristina M; Savage, Trevor N; Maeder, Anthony J; Rosenkranz, Richard R; Tague, Rhys; Van Itallie, Anetta K; Kerry Mummery, W; Vandelanotte, Corneel

2018-01-12

Interactive web-based physical activity interventions using Web 2.0 features (e.g., social networking) have the potential to improve engagement and effectiveness compared to static Web 1.0 interventions. However, older adults may engage with Web 2.0 interventions differently than younger adults. The aims of this study were to determine whether an interaction between intervention (Web 2.0 and Web 1.0) and age group (<55y and ≥55y) exists for website usage and to determine whether an interaction between intervention (Web 2.0, Web 1.0 and logbook) and age group (<55y and ≥55y) exists for intervention effectiveness (changes in physical activity). As part of the WALK 2.0 trial, 504 Australian adults were randomly assigned to receive either a paper logbook (n = 171), a Web 1.0 (n = 165) or a Web 2.0 (n = 168) physical activity intervention. Moderate to vigorous physical activity was measured using ActiGraph monitors at baseline 3, 12 and 18 months. Website usage statistics including time on site, number of log-ins and number of step entries were also recorded. Generalised linear and intention-to-treat linear mixed models were used to test interactions between intervention and age groups (<55y and ≥55y) for website usage and moderate to vigorous physical activity changes. Time on site was higher for the Web 2.0 compared to the Web 1.0 intervention from baseline to 3 months, and this difference was significantly greater in the older group (OR = 1.47, 95%CI = 1.01-2.14, p = .047). Participants in the Web 2.0 group increased their activity more than the logbook group at 3 months, and this difference was significantly greater in the older group (moderate to vigorous physical activity adjusted mean difference = 13.74, 95%CI = 1.08-26.40 min per day, p = .03). No intervention by age interactions were observed for Web 1.0 and logbook groups. Results partially support the use of Web 2.0 features to improve adults over 55 s' engagement in and behaviour changes from web-based physical activity interventions. ACTRN ACTRN12611000157976 , Registered 7 March 2011.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Raymond, David W.; Gaither, Katherine N.; Polsky, Yarom

Sandia National Laboratories (Sandia) has a long history in developing compact, mobile, very high-speed drilling systems and this technology could be applied to increasing the rate at which boreholes are drilled during a mine accident response. The present study reviews current technical approaches, primarily based on technology developed under other programs, analyzes mine rescue specific requirements to develop a conceptual mine rescue drilling approach, and finally, proposes development of a phased mine rescue drilling system (MRDS) that accomplishes (1) development of rapid drilling MRDS equipment; (2) structuring improved web communication through the Mine Safety & Health Administration (MSHA) web site;more » (3) development of an improved protocol for employment of existing drilling technology in emergencies; (4) deployment of advanced technologies to complement mine rescue drilling operations during emergency events; and (5) preliminary discussion of potential future technology development of specialized MRDS equipment. This phased approach allows for rapid fielding of a basic system for improved rescue drilling, with the ability to improve the system over time at a reasonable cost.« less
QuakeSim: a Web Service Environment for Productive Investigations with Earth Surface Sensor Data

NASA Astrophysics Data System (ADS)

Parker, J. W.; Donnellan, A.; Granat, R. A.; Lyzenga, G. A.; Glasscoe, M. T.; McLeod, D.; Al-Ghanmi, R.; Pierce, M.; Fox, G.; Grant Ludwig, L.; Rundle, J. B.

2011-12-01

The QuakeSim science gateway environment includes a visually rich portal interface, web service access to data and data processing operations, and the QuakeTables ontology-based database of fault models and sensor data. The integrated tools and services are designed to assist investigators by covering the entire earthquake cycle of strain accumulation and release. The Web interface now includes Drupal-based access to diverse and changing content, with new ability to access data and data processing directly from the public page, as well as the traditional project management areas that require password access. The system is designed to make initial browsing of fault models and deformation data particularly engaging for new users. Popular data and data processing include GPS time series with data mining techniques to find anomalies in time and space, experimental forecasting methods based on catalogue seismicity, faulted deformation models (both half-space and finite element), and model-based inversion of sensor data. The fault models include the CGS and UCERF 2.0 faults of California and are easily augmented with self-consistent fault models from other regions. The QuakeTables deformation data include the comprehensive set of UAVSAR interferograms as well as a growing collection of satellite InSAR data.. Fault interaction simulations are also being incorporated in the web environment based on Virtual California. A sample usage scenario is presented which follows an investigation of UAVSAR data from viewing as an overlay in Google Maps, to selection of an area of interest via a polygon tool, to fast extraction of the relevant correlation and phase information from large data files, to a model inversion of fault slip followed by calculation and display of a synthetic model interferogram.
Anthropogenic and natural sources of acidity and metals and their influence on the structure of stream food webs.

PubMed

Hogsden, Kristy L; Harding, Jon S

2012-03-01

We compared food web structure in 20 streams with either anthropogenic or natural sources of acidity and metals or circumneutral water chemistry in New Zealand. Community and diet analysis indicated that mining streams receiving anthropogenic inputs of acidic and metal-rich drainage had much simpler food webs (fewer species, shorter food chains, less links) than those in naturally acidic, naturally high metal, and circumneutral streams. Food webs of naturally high metal streams were structurally similar to those in mining streams, lacking fish predators and having few species. Whereas, webs in naturally acidic streams differed very little from those in circumneutral streams due to strong similarities in community composition and diets of secondary and top consumers. The combined negative effects of acidity and metals on stream food webs are clear. However, elevated metal concentrations, regardless of source, appear to play a more important role than acidity in driving food web structure. Copyright © 2011 Elsevier Ltd. All rights reserved.

Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance.

PubMed

Kamel Boulos, Maged N; Sanfilippo, Antonio P; Corley, Courtney D; Wheeler, Steve

2010-10-01

This paper explores Technosocial Predictive Analytics (TPA) and related methods for Web "data mining" where users' posts and queries are garnered from Social Web ("Web 2.0") tools such as blogs, micro-blogging and social networking sites to form coherent representations of real-time health events. The paper includes a brief introduction to commonly used Social Web tools such as mashups and aggregators, and maps their exponential growth as an open architecture of participation for the masses and an emerging way to gain insight about people's collective health status of whole populations. Several health related tool examples are described and demonstrated as practical means through which health professionals might create clear location specific pictures of epidemiological data such as flu outbreaks. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.
Big data mining: In-database Oracle data mining over hadoop

NASA Astrophysics Data System (ADS)

Kovacheva, Zlatinka; Naydenova, Ina; Kaloyanova, Kalinka; Markov, Krasimir

2017-07-01

Big data challenges different aspects of storing, processing and managing data, as well as analyzing and using data for business purposes. Applying Data Mining methods over Big Data is another challenge because of huge data volumes, variety of information, and the dynamic of the sources. Different applications are made in this area, but their successful usage depends on understanding many specific parameters. In this paper we present several opportunities for using Data Mining techniques provided by the analytical engine of RDBMS Oracle over data stored in Hadoop Distributed File System (HDFS). Some experimental results are given and they are discussed.
Clustering Educational Digital Library Usage Data: A Comparison of Latent Class Analysis and K-Means Algorithms

ERIC Educational Resources Information Center

Xu, Beijie; Recker, Mimi; Qi, Xiaojun; Flann, Nicholas; Ye, Lei

2013-01-01

This article examines clustering as an educational data mining method. In particular, two clustering algorithms, the widely used K-means and the model-based Latent Class Analysis, are compared, using usage data from an educational digital library service, the Instructional Architect (IA.usu.edu). Using a multi-faceted approach and multiple data…
Using Web-Based Technologies and Tools in Future Choreographers' Training: British Experience

ERIC Educational Resources Information Center

Bidyuk, Dmytro

2016-01-01

In the paper the problem of using effective web-based technologies and tools in teaching choreography in British higher education institutions has been discussed. Researches on the usage of web-based technologies and tools for practical dance courses in choreographers' professional training at British higher education institutions by such British…
A Web 2.0-Based Collaborative Model for Multicultural Education

ERIC Educational Resources Information Center

Hossain, Md. Mokter; Aydin, Hasan

2011-01-01

Purpose: Web 2.0 is a collaborative web development platform that has had tremendous usage in building effective, interactive, and collaborative virtual societies at home and abroad. Multicultural study is another trend that has tremendous possibilities to help people in the fight against racism and enables them to become active members of a…
Environment: General; Grammar & Usage; Money Management; Music History; Web Page Creation & Design.

ERIC Educational Resources Information Center

Web Feet, 2001

2001-01-01

Describes Web site resources for elementary and secondary education in the topics of: environment, grammar, money management, music history, and Web page creation and design. Each entry includes an illustration of a sample page on the site and an indication of the grade levels for which it is appropriate. (AEF)
University Internet Services: Problems and Opportunities.

ERIC Educational Resources Information Center

Phan, Dien D.; Chen, Jim Q.

This paper presents the findings of a study on the use of World Wide Web among students at St. Cloud State University, Minnesota, USA. The paper explores problems and challenges on campus Web computing and the relationships among the extent of Web usage, class level, and overall student academic performance. Specifically, the purposes of this…
Rule-based statistical data mining agents for an e-commerce application

NASA Astrophysics Data System (ADS)

Qin, Yi; Zhang, Yan-Qing; King, K. N.; Sunderraman, Rajshekhar

2003-03-01

Intelligent data mining techniques have useful e-Business applications. Because an e-Commerce application is related to multiple domains such as statistical analysis, market competition, price comparison, profit improvement and personal preferences, this paper presents a hybrid knowledge-based e-Commerce system fusing intelligent techniques, statistical data mining, and personal information to enhance QoS (Quality of Service) of e-Commerce. A Web-based e-Commerce application software system, eDVD Web Shopping Center, is successfully implemented uisng Java servlets and an Oracle81 database server. Simulation results have shown that the hybrid intelligent e-Commerce system is able to make smart decisions for different customers.
Research on the optimization strategy of web search engine based on data mining

NASA Astrophysics Data System (ADS)

Chen, Ronghua

2018-04-01

With the wide application of search engines, web site information has become an important way for people to obtain information. People have found that they are growing in an increasingly explosive manner. Web site information is verydifficult to find the information they need, and now the search engine can not meet the need, so there is an urgent need for the network to provide website personalized information service, data mining technology for this new challenge is to find a breakthrough. In order to improve people's accuracy of finding information from websites, a website search engine optimization strategy based on data mining is proposed, and verified by website search engine optimization experiment. The results show that the proposed strategy improves the accuracy of the people to find information, and reduces the time for people to find information. It has an important practical value.
Utilization of two web-based continuing education courses evaluated by Markov chain model.

PubMed

Tian, Hao; Lin, Jin-Mann S; Reeves, William C

2012-01-01

To evaluate the web structure of two web-based continuing education courses, identify problems and assess the effects of web site modifications. Markov chain models were built from 2008 web usage data to evaluate the courses' web structure and navigation patterns. The web site was then modified to resolve identified design issues and the improvement in user activity over the subsequent 12 months was quantitatively evaluated. Web navigation paths were collected between 2008 and 2010. The probability of navigating from one web page to another was analyzed. The continuing education courses' sequential structure design was clearly reflected in the resulting actual web usage models, and none of the skip transitions provided was heavily used. The web navigation patterns of the two different continuing education courses were similar. Two possible design flaws were identified and fixed in only one of the two courses. Over the following 12 months, the drop-out rate in the modified course significantly decreased from 41% to 35%, but remained unchanged in the unmodified course. The web improvement effects were further verified via a second-order Markov chain model. The results imply that differences in web content have less impact than web structure design on how learners navigate through continuing education courses. Evaluation of user navigation can help identify web design flaws and guide modifications. This study showed that Markov chain models provide a valuable tool to evaluate web-based education courses. Both the results and techniques in this study would be very useful for public health education and research specialists.
Utilization of two web-based continuing education courses evaluated by Markov chain model

PubMed Central

Lin, Jin-Mann S; Reeves, William C

2011-01-01

Objectives To evaluate the web structure of two web-based continuing education courses, identify problems and assess the effects of web site modifications. Design Markov chain models were built from 2008 web usage data to evaluate the courses' web structure and navigation patterns. The web site was then modified to resolve identified design issues and the improvement in user activity over the subsequent 12 months was quantitatively evaluated. Measurements Web navigation paths were collected between 2008 and 2010. The probability of navigating from one web page to another was analyzed. Results The continuing education courses' sequential structure design was clearly reflected in the resulting actual web usage models, and none of the skip transitions provided was heavily used. The web navigation patterns of the two different continuing education courses were similar. Two possible design flaws were identified and fixed in only one of the two courses. Over the following 12 months, the drop-out rate in the modified course significantly decreased from 41% to 35%, but remained unchanged in the unmodified course. The web improvement effects were further verified via a second-order Markov chain model. Conclusions The results imply that differences in web content have less impact than web structure design on how learners navigate through continuing education courses. Evaluation of user navigation can help identify web design flaws and guide modifications. This study showed that Markov chain models provide a valuable tool to evaluate web-based education courses. Both the results and techniques in this study would be very useful for public health education and research specialists. PMID:21976027
Informal Learning through Expertise Mining in the Social Web

ERIC Educational Resources Information Center

Valencia-Garcia, Rafael; Garcia-Sanchez, Francisco; Casado-Lumbreras, Cristina; Castellanos-Nieves, Dagoberto; Fernandez-Breis, Jesualdo Tomas

2012-01-01

The advent of Web 2.0, also called the Social Web, has changed the way people interact with the Web. Assisted by the technologies associated with this new trend, users now play a much more active role as content providers. This Web paradigm shift has also changed how companies operate and interact with their employees, partners and customers. The…
Web-based pathology practice examination usage.

PubMed

Klatt, Edward C

2014-01-01

General and subject specific practice examinations for students in health sciences studying pathology were placed onto a free public internet web site entitled web path and were accessed four clicks from the home web site menu. Multiple choice questions were coded into. html files with JavaScript functions for web browser viewing in a timed format. A Perl programming language script with common gateway interface for web page forms scored examinations and placed results into a log file on an internet computer server. The four general review examinations of 30 questions each could be completed in up to 30 min. The 17 subject specific examinations of 10 questions each with accompanying images could be completed in up to 15 min each. The results of scores and user educational field of study from log files were compiled from June 2006 to January 2014. The four general review examinations had 31,639 accesses with completion of all questions, for a completion rate of 54% and average score of 75%. A score of 100% was achieved by 7% of users, ≥90% by 21%, and ≥50% score by 95% of users. In top to bottom web page menu order, review examination usage was 44%, 24%, 17%, and 15% of all accessions. The 17 subject specific examinations had 103,028 completions, with completion rate 73% and average score 74%. Scoring at 100% was 20% overall, ≥90% by 37%, and ≥50% score by 90% of users. The first three menu items on the web page accounted for 12.6%, 10.0%, and 8.2% of all completions, and the bottom three accounted for no more than 2.2% each. Completion rates were higher for shorter 10 questions subject examinations. Users identifying themselves as MD/DO scored higher than other users, averaging 75%. Usage was higher for examinations at the top of the web page menu. Scores achieved suggest that a cohort of serious users fully completing the examinations had sufficient preparation to use them to support their pathology education.
Using Web 2.0 applications to promote health-related physical activity: findings from the WALK 2.0 randomised controlled trial.

PubMed

Kolt, Gregory S; Rosenkranz, Richard R; Vandelanotte, Corneel; Caperchione, Cristina M; Maeder, Anthony J; Tague, Rhys; Savage, Trevor N; Van, Itallie Anetta; Mummery, W Kerry; Oldmeadow, Christopher; Duncan, Mitch J

2017-10-01

Web 2.0 internet technology has great potential in promoting physical activity. This trial investigated the effectiveness of a Web 2.0-based intervention on physical activity behaviour, and the impact on website usage and engagement. 504 (328 women, 126 men) insufficiently active adult participants were randomly allocated to one of two web-based interventions or a paper-based Logbook group. The Web 1.0 group participated in the existing 10 000 Steps programme, while the Web 2.0 group participated in a Web 2.0-enabled physical activity intervention including user-to-user interaction through social networking capabilities. ActiGraph GT3X activity monitors were used to assess physical activity at four points across the intervention (0, 3, 12 and 18 months), and usage and engagement were assessed continuously through website usage statistics. Treatment groups differed significantly in trajectories of minutes/day of physical activity (p=0.0198), through a greater change at 3 months for Web 2.0 than Web 1.0 (7.3 min/day, 95% CI 2.4 to 12.3). In the Web 2.0 group, physical activity increased at 3 (mean change 6.8 min/day, 95% CI 3.9 to 9.6) and 12 months (3.8 min/day, 95% CI 0.5 to 7.0), but not 18 months. The Logbook group also increased physical activity at 3 (4.8 min/day, 95% CI 1.8 to 7.7) and 12 months (4.9 min/day, 95% CI 0.7 to 9.1), but not 18 months. The Web 1.0 group increased physical activity at 12 months only (4.9 min/day, 95% CI 0.5 to 9.3). The Web 2.0 group demonstrated higher levels of website engagement (p=0.3964). In comparison to a Web 1.0 intervention, a more interactive Web 2.0 intervention, as well as the paper-based Logbook intervention, improved physical activity in the short term, but that effect reduced over time, despite higher levels of engagement of the Web 2.0 group. ACTRN12611000157976. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Using Web 2.0 applications to promote health-related physical activity: findings from the WALK 2.0 randomised controlled trial

PubMed Central

Kolt, Gregory S; Rosenkranz, Richard R; Vandelanotte, Corneel; Caperchione, Cristina M; Maeder, Anthony J; Tague, Rhys; Savage, Trevor N; Van, Itallie Anetta; Mummery, W Kerry; Oldmeadow, Christopher; Duncan, Mitch J

2017-01-01

Background/Aim Web 2.0 internet technology has great potential in promoting physical activity. This trial investigated the effectiveness of a Web 2.0-based intervention on physical activity behaviour, and the impact on website usage and engagement. Methods 504 (328 women, 126 men) insufficiently active adult participants were randomly allocated to one of two web-based interventions or a paper-based Logbook group. The Web 1.0 group participated in the existing 10 000 Steps programme, while the Web 2.0 group participated in a Web 2.0-enabled physical activity intervention including user-to-user interaction through social networking capabilities. ActiGraph GT3X activity monitors were used to assess physical activity at four points across the intervention (0, 3, 12 and 18 months), and usage and engagement were assessed continuously through website usage statistics. Results Treatment groups differed significantly in trajectories of minutes/day of physical activity (p=0.0198), through a greater change at 3 months for Web 2.0 than Web 1.0 (7.3 min/day, 95% CI 2.4 to 12.3). In the Web 2.0 group, physical activity increased at 3 (mean change 6.8 min/day, 95% CI 3.9 to 9.6) and 12 months (3.8 min/day, 95% CI 0.5 to 7.0), but not 18 months. The Logbook group also increased physical activity at 3 (4.8 min/day, 95% CI 1.8 to 7.7) and 12 months (4.9 min/day, 95% CI 0.7 to 9.1), but not 18 months. The Web 1.0 group increased physical activity at 12 months only (4.9 min/day, 95% CI 0.5 to 9.3). The Web 2.0 group demonstrated higher levels of website engagement (p=0.3964). Conclusions In comparison to a Web 1.0 intervention, a more interactive Web 2.0 intervention, as well as the paper-based Logbook intervention, improved physical activity in the short term, but that effect reduced over time, despite higher levels of engagement of the Web 2.0 group. Trial registration number ACTRN12611000157976. PMID:28049624
Provenance Usage in the OceanLink Project

NASA Astrophysics Data System (ADS)

Narock, T.; Arko, R. A.; Carbotte, S. M.; Chandler, C. L.; Cheatham, M.; Fils, D.; Finin, T.; Hitzler, P.; Janowicz, K.; Jones, M.; Krisnadhi, A.; Lehnert, K. A.; Mickle, A.; Raymond, L. M.; Schildhauer, M.; Shepherd, A.; Wiebe, P. H.

2014-12-01

A wide spectrum of maturing methods and tools, collectively characterized as the Semantic Web, is helping to vastly improve thedissemination of scientific research. The OceanLink project, an NSF EarthCube Building Block, is utilizing semantic technologies tointegrate geoscience data repositories, library holdings, conference abstracts, and funded research awards. Provenance is a vital componentin meeting both the scientific and engineering requirements of OceanLink. Provenance plays a key role in justification and understanding when presenting users with results aggregated from multiple sources. In the engineering sense, provenance enables the identification of new data and the ability to determine which data sources to query. Additionally, OceanLink will leverage human and machine computation for crowdsourcing, text mining, and co-reference resolution. The results of these computations, and their associated provenance, will be folded back into the constituent systems to continually enhance precision and utility. We will touch on the various roles provenance is playing in OceanLink as well as present our use of the PROV Ontology and associated Ontology Design Patterns.
Upper Animas Mining District

EPA Pesticide Factsheets

Web page provides narrative of What's New?, Site Description, Site Risk, Cleanup Progress, Community Involvement, Next Steps, Site Documents, FAQ, Contacts and LInks for the Upper Animas Mining District site in San Juan County, Colorado.
Data Mining of Extremely Large Ad-Hoc Data Sets to Produce Reverse Web-Link Graphs

DTIC Science & Technology

2017-03-01

in most of the MR cases. From these studies , we also learned that computing -optimized instances should be chosen for serialized/compressed input data...maximum 200 words) Data mining can be a valuable tool, particularly in the acquisition of military intelligence. As the second study within a larger Naval...open web crawler data set Common Crawl. Similar to previous studies , this research employs MapReduce (MR) for sorting and categorizing output value
Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize

PubMed Central

2010-01-01

Background Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. Results In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. Conclusions CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database across different types of experiments. The database is publically available at http://agbase.msstate.edu. PMID:20946609
Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

PubMed

Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

2010-10-07

Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database across different types of experiments. The database is publically available at http://agbase.msstate.edu.

Mining Social Media and Web Searches For Disease Detection

PubMed Central

Yang, Y. Tony; Horneffer, Michael; DiLisio, Nicole

2013-01-01

Web-based social media is increasingly being used across different settings in the health care industry. The increased frequency in the use of the Internet via computer or mobile devices provides an opportunity for social media to be the medium through which people can be provided with valuable health information quickly and directly. While traditional methods of detection relied predominately on hierarchical or bureaucratic lines of communication, these often failed to yield timely and accurate epidemiological intelligence. New web-based platforms promise increased opportunities for a more timely and accurate spreading of information and analysis. This article aims to provide an overview and discussion of the availability of timely and accurate information. It is especially useful for the rapid identification of an outbreak of an infectious disease that is necessary to promptly and effectively develop public health responses. These web-based platforms include search queries, data mining of web and social media, process and analysis of blogs containing epidemic key words, text mining, and geographical information system data analyses. These new sources of analysis and information are intended to complement traditional sources of epidemic intelligence. Despite the attractiveness of these new approaches, further study is needed to determine the accuracy of blogger statements, as increases in public participation may not necessarily mean the information provided is more accurate. PMID:25170475
Mining social media and web searches for disease detection.

PubMed

Yang, Y Tony; Horneffer, Michael; DiLisio, Nicole

2013-04-28

Web-based social media is increasingly being used across different settings in the health care industry. The increased frequency in the use of the Internet via computer or mobile devices provides an opportunity for social media to be the medium through which people can be provided with valuable health information quickly and directly. While traditional methods of detection relied predominately on hierarchical or bureaucratic lines of communication, these often failed to yield timely and accurate epidemiological intelligence. New web-based platforms promise increased opportunities for a more timely and accurate spreading of information and analysis. This article aims to provide an overview and discussion of the availability of timely and accurate information. It is especially useful for the rapid identification of an outbreak of an infectious disease that is necessary to promptly and effectively develop public health responses. These web-based platforms include search queries, data mining of web and social media, process and analysis of blogs containing epidemic key words, text mining, and geographical information system data analyses. These new sources of analysis and information are intended to complement traditional sources of epidemic intelligence. Despite the attractiveness of these new approaches, further study is needed to determine the accuracy of blogger statements, as increases in public participation may not necessarily mean the information provided is more accurate.
GoWeb: a semantic search engine for the life science web.

PubMed

Dietze, Heiko; Schroeder, Michael

2009-10-01

Current search engines are keyword-based. Semantic technologies promise a next generation of semantic search engines, which will be able to answer questions. Current approaches either apply natural language processing to unstructured text or they assume the existence of structured statements over which they can reason. Here, we introduce a third approach, GoWeb, which combines classical keyword-based Web search with text-mining and ontologies to navigate large results sets and facilitate question answering. We evaluate GoWeb on three benchmarks of questions on genes and functions, on symptoms and diseases, and on proteins and diseases. The first benchmark is based on the BioCreAtivE 1 Task 2 and links 457 gene names with 1352 functions. GoWeb finds 58% of the functional GeneOntology annotations. The second benchmark is based on 26 case reports and links symptoms with diseases. GoWeb achieves 77% success rate improving an existing approach by nearly 20%. The third benchmark is based on 28 questions in the TREC genomics challenge and links proteins to diseases. GoWeb achieves a success rate of 79%. GoWeb's combination of classical Web search with text-mining and ontologies is a first step towards answering questions in the biomedical domain. GoWeb is online at: http://www.gopubmed.org/goweb.
Beyond Google: The Invisible Web in the Academic Library

ERIC Educational Resources Information Center

Devine, Jane; Egger-Sider, Francine

2004-01-01

This article analyzes the concept of the Invisible Web and its implication for academic librarianship. It offers a guide to tools that can be used to mine the Invisible Web and discusses the benefits of using the Invisible Web to promote interest in library services. In addition, the article includes an expanded definition, a literature review,…
Intelligent Information Retrieval and Web Mining Architecture Using SOA

ERIC Educational Resources Information Center

El-Bathy, Naser Ibrahim

2010-01-01

The study of this dissertation provides a solution to a very specific problem instance in the area of data mining, data warehousing, and service-oriented architecture in publishing and newspaper industries. The research question focuses on the integration of data mining and data warehousing. The research problem focuses on the development of…
Introducing Text Analytics as a Graduate Business School Course

ERIC Educational Resources Information Center

Edgington, Theresa M.

2011-01-01

Text analytics refers to the process of analyzing unstructured data from documented sources, including open-ended surveys, blogs, and other types of web dialog. Text analytics has enveloped the concept of text mining, an analysis approach influenced heavily from data mining. While text mining has been covered extensively in various computer…
[Distribution characteristics of soil nematodes in reclaimed land of copper-mine-tailings in different plant associations].

PubMed

Zhu, Yong-heng; Li, Ke-zhong; Zhang, Heng; Han, Fei; Zhou, Ju-hua; Gao, Ting-ting

2015-02-01

A survey was carried out to investigate soil nematode communities in the plant associations of gramineae (Arthraxon lanceolatus, AL; Imperata cylindrica, IC) and leguminosae (Glycine soja, GS) in reclaimed land of copper-mine-tailings and in the plant associations of gramineae (Digitaria chrysoblephara, DC-CK) of peripheral control in Fenghuang Mountain, Tongling City. A total of 1277 nematodes were extracted and sorted into 51 genera. The average individual density of the nematodes was 590 individuals · 100 g(-1) dry soil. In order to analyze the distribution character- istics of soil nematode communities in reclaimed land of copper-mine-tailings, Shannon community diversity index and soil food web structure indices were applied in the research. The results showed that the total number of nematode genus and the Shannon community diversity index of soil nematode in the three plant associations of AL, IC and GS were less than that in the plant associations of DC-CK. Compared with the ecological indices of soil nematode communities among the different plant associations in reclaimed land of copper-mine-tailings and peripheral natural habitat, we found that the structure of soil food web in the plant associations of GS was more mature, with bacterial decomposition being dominant in the soil organic matter decomposition, and that the soil ecosystem in the plant associations of GS was not stable with low interference. This indicated that the soil food web in the plant associations of leguminosae had a greater development potential to improve the ecological stability of the reclaimed land of copper-mine-tailings. On the other hand, the structure of soil food web in the plant associations of AL and IC were relatively stable in a structured state with fungal decomposition being dominant in the decomposition of soil organic matter. This indicated that the soil food web in the plant associations of gramineae was at a poor development level.
Mining Tasks from the Web Anchor Text Graph: MSR Notebook Paper for the TREC 2015 Tasks Track

DTIC Science & Technology

2015-11-20

Mining Tasks from the Web Anchor Text Graph: MSR Notebook Paper for the TREC 2015 Tasks Track Paul N. Bennett Microsoft Research Redmond, USA pauben...anchor text graph has proven useful in the general realm of query reformulation [2], we sought to quantify the value of extracting key phrases from...anchor text in the broader setting of the task understanding track. Given a query, our approach considers a simple method for identifying a relevant
Understanding Web Activity Patterns among Teachers, Students and Teacher Candidates

ERIC Educational Resources Information Center

Kimmons, Royce; Clark, B.; Lim, M.

2017-01-01

This study sought to understand generational and role differences in web usage of teachers, teacher candidates and K-12 students in a state in the USA (n = 2261). The researchers employed unique methods, which included using a custom-built persistent web browser to track user behaviours free of self-report, self-selection and perception bias.…
A Survey of Bioinformatics Database and Software Usage through Mining the Literature.

PubMed

Duck, Geraint; Nenadic, Goran; Filannino, Michele; Brass, Andy; Robertson, David L; Stevens, Robert

2016-01-01

Computer-based resources are central to much, if not most, biological and medical research. However, while there is an ever expanding choice of bioinformatics resources to use, described within the biomedical literature, little work to date has provided an evaluation of the full range of availability or levels of usage of database and software resources. Here we use text mining to process the PubMed Central full-text corpus, identifying mentions of databases or software within the scientific literature. We provide an audit of the resources contained within the biomedical literature, and a comparison of their relative usage, both over time and between the sub-disciplines of bioinformatics, biology and medicine. We find that trends in resource usage differs between these domains. The bioinformatics literature emphasises novel resource development, while database and software usage within biology and medicine is more stable and conservative. Many resources are only mentioned in the bioinformatics literature, with a relatively small number making it out into general biology, and fewer still into the medical literature. In addition, many resources are seeing a steady decline in their usage (e.g., BLAST, SWISS-PROT), though some are instead seeing rapid growth (e.g., the GO, R). We find a striking imbalance in resource usage with the top 5% of resource names (133 names) accounting for 47% of total usage, and over 70% of resources extracted being only mentioned once each. While these results highlight the dynamic and creative nature of bioinformatics research they raise questions about software reuse, choice and the sharing of bioinformatics practice. Is it acceptable that so many resources are apparently never reused? Finally, our work is a step towards automated extraction of scientific method from text. We make the dataset generated by our study available under the CC0 license here: http://dx.doi.org/10.6084/m9.figshare.1281371.
BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins.

PubMed

van Heel, Auke J; de Jong, Anne; Song, Chunxu; Viel, Jakob H; Kok, Jan; Kuipers, Oscar P

2018-05-21

Interest in secondary metabolites such as RiPPs (ribosomally synthesized and posttranslationally modified peptides) is increasing worldwide. To facilitate the research in this field we have updated our mining web server. BAGEL4 is faster than its predecessor and is now fully independent from ORF-calling. Gene clusters of interest are discovered using the core-peptide database and/or through HMM motifs that are present in associated context genes. The databases used for mining have been updated and extended with literature references and links to UniProt and NCBI. Additionally, we have included automated promoter and terminator prediction and the option to upload RNA expression data, which can be displayed along with the identified clusters. Further improvements include the annotation of the context genes, which is now based on a fast blast against the prokaryote part of the UniRef90 database, and the improved web-BLAST feature that dynamically loads structural data such as internal cross-linking from UniProt. Overall BAGEL4 provides the user with more information through a user-friendly web-interface which simplifies data evaluation. BAGEL4 is freely accessible at http://bagel4.molgenrug.nl.
Asymmetric threat data mining and knowledge discovery

NASA Astrophysics Data System (ADS)

Gilmore, John F.; Pagels, Michael A.; Palk, Justin

2001-03-01

Asymmetric threats differ from the conventional force-on- force military encounters that the Defense Department has historically been trained to engage. Terrorism by its nature is now an operational activity that is neither easily detected or countered as its very existence depends on small covert attacks exploiting the element of surprise. But terrorism does have defined forms, motivations, tactics and organizational structure. Exploiting a terrorism taxonomy provides the opportunity to discover and assess knowledge of terrorist operations. This paper describes the Asymmetric Threat Terrorist Assessment, Countering, and Knowledge (ATTACK) system. ATTACK has been developed to (a) data mine open source intelligence (OSINT) information from web-based newspaper sources, video news web casts, and actual terrorist web sites, (b) evaluate this information against a terrorism taxonomy, (c) exploit country/region specific social, economic, political, and religious knowledge, and (d) discover and predict potential terrorist activities and association links. Details of the asymmetric threat structure and the ATTACK system architecture are presented with results of an actual terrorist data mining and knowledge discovery test case shown.
Informing child welfare policy and practice: using knowledge discovery and data mining technology via a dynamic Web site.

PubMed

Duncan, Dean F; Kum, Hye-Chung; Weigensberg, Elizabeth Caplick; Flair, Kimberly A; Stewart, C Joy

2008-11-01

Proper management and implementation of an effective child welfare agency requires the constant use of information about the experiences and outcomes of children involved in the system, emphasizing the need for comprehensive, timely, and accurate data. In the past 20 years, there have been many advances in technology that can maximize the potential of administrative data to promote better evaluation and management in the field of child welfare. Specifically, this article discusses the use of knowledge discovery and data mining (KDD), which makes it possible to create longitudinal data files from administrative data sources, extract valuable knowledge, and make the information available via a user-friendly public Web site. This article demonstrates a successful project in North Carolina where knowledge discovery and data mining technology was used to develop a comprehensive set of child welfare outcomes available through a public Web site to facilitate information sharing of child welfare data to improve policy and practice.
Chemotext: A Publicly Available Web Server for Mining Drug-Target-Disease Relationships in PubMed.

PubMed

Capuzzi, Stephen J; Thornton, Thomas E; Liu, Kammy; Baker, Nancy; Lam, Wai In; O'Banion, Colin P; Muratov, Eugene N; Pozefsky, Diane; Tropsha, Alexander

2018-02-26

Elucidation of the mechanistic relationships between drugs, their targets, and diseases is at the core of modern drug discovery research. Thousands of studies relevant to the drug-target-disease (DTD) triangle have been published and annotated in the Medline/PubMed database. Mining this database affords rapid identification of all published studies that confirm connections between vertices of this triangle or enable new inferences of such connections. To this end, we describe the development of Chemotext, a publicly available Web server that mines the entire compendium of published literature in PubMed annotated by Medline Subject Heading (MeSH) terms. The goal of Chemotext is to identify all known DTD relationships and infer missing links between vertices of the DTD triangle. As a proof-of-concept, we show that Chemotext could be instrumental in generating new drug repurposing hypotheses or annotating clinical outcomes pathways for known drugs. The Chemotext Web server is freely available at http://chemotext.mml.unc.edu .
The ATLAS Public Web Pages: Online Management of HEP External Communication Content

NASA Astrophysics Data System (ADS)

Goldfarb, S.; Marcelloni, C.; Eli Phoboo, A.; Shaw, K.

2015-12-01

The ATLAS Education and Outreach Group is in the process of migrating its public online content to a professionally designed set of web pages built on the Drupal [1] content management system. Development of the front-end design passed through several key stages, including audience surveys, stakeholder interviews, usage analytics, and a series of fast design iterations, called sprints. Implementation of the web site involves application of the html design using Drupal templates, refined development iterations, and the overall population of the site with content. We present the design and development processes and share the lessons learned along the way, including the results of the data-driven discovery studies. We also demonstrate the advantages of selecting a back-end supported by content management, with a focus on workflow. Finally, we discuss usage of the new public web pages to implement outreach strategy through implementation of clearly presented themes, consistent audience targeting and messaging, and the enforcement of a well-defined visual identity.
Data mining for personal navigation

NASA Astrophysics Data System (ADS)

Hariharan, Gurushyam; Franti, Pasi; Mehta, Sandeep

2002-03-01

Relevance is the key in defining what data is to be extracted from the Internet. Traditionally, relevance has been defined mainly by keywords and user profiles. In this paper we discuss a fairly untouched dimension to relevance: location. Any navigational information sought by a user at large on earth is evidently governed by his location. We believe that task oriented data mining of the web amalgamated with location information is the key to providing relevant information for personal navigation. We explore the existential hurdles and propose novel approaches to tackle them. We also present naive, task-oriented data mining based approaches and their implementations in Java, to extract location based information. Ad-hoc pairing of data with coordinates (x, y) is very rare on the web. But if the same co-ordinates are converted to a logical address (state/city/street), a wide spectrum of location-based information base opens up. Hence, given the coordinates (x, y) on the earth, the scheme points to the logical address of the user. Location based information could either be picked up from fixed and known service providers (e.g. Yellow Pages) or from any arbitrary website on the Web. Once the web servers providing information relevant to the logical address are located, task oriented data mining is performed over these sites keeping in mind what information is interesting to the contemporary user. After all this, a simple data stream is provided to the user with information scaled to his convenience. The scheme has been implemented for cities of Finland.
Large-Scale Overlays and Trends: Visually Mining, Panning and Zooming the Observable Universe.

PubMed

Luciani, Timothy Basil; Cherinka, Brian; Oliphant, Daniel; Myers, Sean; Wood-Vasey, W Michael; Labrinidis, Alexandros; Marai, G Elisabeta

2014-07-01

We introduce a web-based computing infrastructure to assist the visual integration, mining and interactive navigation of large-scale astronomy observations. Following an analysis of the application domain, we design a client-server architecture to fetch distributed image data and to partition local data into a spatial index structure that allows prefix-matching of spatial objects. In conjunction with hardware-accelerated pixel-based overlays and an online cross-registration pipeline, this approach allows the fetching, displaying, panning and zooming of gigabit panoramas of the sky in real time. To further facilitate the integration and mining of spatial and non-spatial data, we introduce interactive trend images-compact visual representations for identifying outlier objects and for studying trends within large collections of spatial objects of a given class. In a demonstration, images from three sky surveys (SDSS, FIRST and simulated LSST results) are cross-registered and integrated as overlays, allowing cross-spectrum analysis of astronomy observations. Trend images are interactively generated from catalog data and used to visually mine astronomy observations of similar type. The front-end of the infrastructure uses the web technologies WebGL and HTML5 to enable cross-platform, web-based functionality. Our approach attains interactive rendering framerates; its power and flexibility enables it to serve the needs of the astronomy community. Evaluation on three case studies, as well as feedback from domain experts emphasize the benefits of this visual approach to the observational astronomy field; and its potential benefits to large scale geospatial visualization in general.
Personality variables as predictors of Facebook usage.

PubMed

Caci, Barbara; Cardaci, Maurizio; Tabacchi, Marco E; Scrima, Fabrizio

2014-04-01

This study investigates the role of personality factors as predictors of Facebook usage. Data concerning Facebook usage and personality factors from 654 Facebook users were gathered using a web survey. Using path analysis, the results showed Openness was a predictor of Facebook early adoption, Conscientiousness with sparing use, Extraversion with long sessions and abundant friendships, and Neuroticism with high frequency of sessions. The possible role of Agreeableness in predicting low session frequency and friendships needs further validation.
Les Chansons de la Francophonie Web Site and Its Two Web-Usage-Tracking Systems in an Advanced Listening Comprehension Course

ERIC Educational Resources Information Center

Weinberg, Alysse

2005-01-01

The "Les Chansons de la francophonie" web site is based on French songs and was developed using HTML and JavaScript for the advanced French Comprehension Course at the Second Language Institute of the University of Ottawa. These interactive listening activities include true-false and multiple-choice questions, fill in the blanks,…
Evaluation of a new website design for iwantthekit for chlamydia, gonorrhea, and trichomonas screening.

PubMed

Kuder, Margaret; Goheen, Mary Jett; Dize, Laura; Barnes, Mathilda; Gaydos, Charlotte A

2015-05-01

The www.iwantthekit.org provides Internet-based, at-home sexually transmitted infection screening. The Web site implemented an automated test result access system. To evaluate potential deleterious effects of the new system, we analyzed demographics, Web site usage, and treatment. The post-Web site design captured more participant information and no decrease in requests, kit return, or treatment adherence.

Assessing the Integrity of Web Sites Providing Data and Information on Corporate Behavior

ERIC Educational Resources Information Center

McLaughlin, Josetta; Pavelka, Deborah; McLaughlin, Gerald

2005-01-01

A significant trend in higher education evolving from the wide accessibility to the Internet is the availability of an ever-increasing supply of data on Web sites for use by professors, students, and researchers. As this usage by a wider variety of users grows, the ability to judge the integrity of the data, the related findings, and the Web site…
The SADI Personal Health Lens: A Web Browser-Based System for Identifying Personally Relevant Drug Interactions.

PubMed

Vandervalk, Ben; McCarthy, E Luke; Cruz-Toledo, José; Klein, Artjom; Baker, Christopher J O; Dumontier, Michel; Wilkinson, Mark D

2013-04-05

The Web provides widespread access to vast quantities of health-related information that can improve quality-of-life through better understanding of personal symptoms, medical conditions, and available treatments. Unfortunately, identifying a credible and personally relevant subset of information can be a time-consuming and challenging task for users without a medical background. The objective of the Personal Health Lens system is to aid users when reading health-related webpages by providing warnings about personally relevant drug interactions. More broadly, we wish to present a prototype for a novel, generalizable approach to facilitating interactions between a patient, their practitioner(s), and the Web. We utilized a distributed, Semantic Web-based architecture for recognizing personally dangerous drugs consisting of: (1) a private, local triple store of personal health information, (2) Semantic Web services, following the Semantic Automated Discovery and Integration (SADI) design pattern, for text mining and identifying substance interactions, (3) a bookmarklet to trigger analysis of a webpage and annotate it with personalized warnings, and (4) a semantic query that acts as an abstract template of the analytical workflow to be enacted by the system. A prototype implementation of the system is provided in the form of a Java standalone executable JAR file. The JAR file bundles all components of the system: the personal health database, locally-running versions of the SADI services, and a javascript bookmarklet that triggers analysis of a webpage. In addition, the demonstration includes a hypothetical personal health profile, allowing the system to be used immediately without configuration. Usage instructions are provided. The main strength of the Personal Health Lens system is its ability to organize medical information and to present it to the user in a personalized and contextually relevant manner. While this prototype was limited to a single knowledge domain (drug/drug interactions), the proposed architecture is generalizable, and could act as the foundation for much richer personalized-health-Web clients, while importantly providing a novel and personalizable mechanism for clinical experts to inject their expertise into the browsing experience of their patients in the form of customized semantic queries and ontologies.
The SADI Personal Health Lens: A Web Browser-Based System for Identifying Personally Relevant Drug Interactions

PubMed Central

Vandervalk, Ben; McCarthy, E Luke; Cruz-Toledo, José; Klein, Artjom; Baker, Christopher J O; Dumontier, Michel

2013-01-01

Background The Web provides widespread access to vast quantities of health-related information that can improve quality-of-life through better understanding of personal symptoms, medical conditions, and available treatments. Unfortunately, identifying a credible and personally relevant subset of information can be a time-consuming and challenging task for users without a medical background. Objective The objective of the Personal Health Lens system is to aid users when reading health-related webpages by providing warnings about personally relevant drug interactions. More broadly, we wish to present a prototype for a novel, generalizable approach to facilitating interactions between a patient, their practitioner(s), and the Web. Methods We utilized a distributed, Semantic Web-based architecture for recognizing personally dangerous drugs consisting of: (1) a private, local triple store of personal health information, (2) Semantic Web services, following the Semantic Automated Discovery and Integration (SADI) design pattern, for text mining and identifying substance interactions, (3) a bookmarklet to trigger analysis of a webpage and annotate it with personalized warnings, and (4) a semantic query that acts as an abstract template of the analytical workflow to be enacted by the system. Results A prototype implementation of the system is provided in the form of a Java standalone executable JAR file. The JAR file bundles all components of the system: the personal health database, locally-running versions of the SADI services, and a javascript bookmarklet that triggers analysis of a webpage. In addition, the demonstration includes a hypothetical personal health profile, allowing the system to be used immediately without configuration. Usage instructions are provided. Conclusions The main strength of the Personal Health Lens system is its ability to organize medical information and to present it to the user in a personalized and contextually relevant manner. While this prototype was limited to a single knowledge domain (drug/drug interactions), the proposed architecture is generalizable, and could act as the foundation for much richer personalized-health-Web clients, while importantly providing a novel and personalizable mechanism for clinical experts to inject their expertise into the browsing experience of their patients in the form of customized semantic queries and ontologies. PMID:23612187
Tweeting and Blogging: Moving towards Education 2.0

ERIC Educational Resources Information Center

Luo, Tian; Franklin, Teresa

2015-01-01

This paper reports on an exploratory study that employed Twitter and blogs as instructional Web 2.0 tools to support student learning in an undergraduate-level class. Case study methodology entailing a usage survey, an exit survey, and 12 in-depth semi-structured interviews was sought to examine patterns and characteristics of students' usage of…
Google Scholar Usage: An Academic Library's Experience

ERIC Educational Resources Information Center

Wang, Ya; Howard, Pamela

2012-01-01

Google Scholar is a free service that provides a simple way to broadly search for scholarly works and to connect patrons with the resources libraries provide. The researchers in this study analyzed Google Scholar usage data from 2006 for three library tools at San Francisco State University: SFX link resolver, Web Access Management proxy server,…
37 CFR 380.23 - Terms for making payment of royalty fees and statements of account.

Code of Federal Regulations, 2011 CFR

2011-07-01

... waiver, including development of proxy usage data. The Proxy Fee shall be paid by the date specified in... Educational Webcasters based on proxy usage data in accordance with a methodology adopted by the Collective's... third-party Web hosting or service provider maintains equipment or software for a Noncommercial...
Web-based health interventions for family caregivers of elderly individuals: A Scoping Review.

PubMed

Wasilewski, Marina B; Stinson, Jennifer N; Cameron, Jill I

2017-07-01

For the growing proportion of elders globally, aging-related illnesses are primary causes of morbidity causing reliance on family members for support in the community. Family caregivers experience poorer physical and mental health than their non-caregiving counterparts. Web-based interventions can provide accessible support to family caregivers to offset declines in their health and well-being. Existing reviews focused on web-based interventions for caregivers have been limited to single illness populations and have mostly focused on the efficacy of the interventions. We therefore have limited insight into how web-based interventions for family caregiver have been developed, implemented and evaluated across aging-related illness. To describe: a) theoretical underpinnings of the literature; b) development, content and delivery of web-based interventions; c) caregiver usage of web-based interventions; d) caregiver experience with web-based interventions and e) impact of web-based interventions on caregivers' health outcomes. We followed Arksey and O'Malley's methodological framework for conducting scoping reviews which entails setting research questions, selecting relevant studies, charting the data and synthesizing the results in a report. Fifty-three publications representing 32 unique web-based interventions were included. Over half of the interventions were targeted at dementia caregivers, with the rest targeting caregivers to the stroke, cancer, diabetes and general frailty populations. Studies used theory across the intervention trajectory. Interventions aimed to improve a range of health outcomes for caregivers through static and interactive delivery methods Caregivers were satisfied with the usability and accessibility of the websites but usage was generally low and declined over time. Depression and caregiver burden were the most common outcomes evaluated. The interventions ranged in their impact on health and social outcomes but reductions in perception of caregiver burden were consistently observed. Caregivers value interactive interventions that are tailored to their unique needs and the illness context. However, usage of the interventions was sporadic and declined over time, indicating that future interventions should address stage-specific needs across the caregiving trajectory. A systematic review has the potential to be conducted given the consistency in caregiver burden and depression as outcomes. Copyright © 2017 Elsevier B.V. All rights reserved.
"BreastfeedingBasics": web-based education that meets current knowledge competencies.

PubMed

Lewin, Linda Orkin; O'Connor, Mary E

2012-08-01

The United States has not met the majority of the Centers for Disease Control and Prevention goals for breastfeeding duration. Studies have shown a lack of knowledge about breastfeeding by health care professionals and students (HCP/S). Web-based education can be a cost-effective manner of education for HCP/S. "BreastfeedingBasics" is an online free educational program available for use. This study compares information in "BreastfeedingBasics" to the breastfeeding knowledge competencies recommended by the US Breastfeeding Committee (USBC). It also evaluates usage of "BreastfeedingBasics" by users and health care professional faculty. Using anonymous information from Web site users, the authors compared mean pre-test and post-test scores of the modules as a measure of the knowledge gained by HCP/S users. They evaluated usage by demographic information and used a Web-based survey to assess benefits of usage of "BreastfeedingBasics" to faculty. Overall, 15 020 HCP/S used the Web site between April 1999 and December 2009. "BreastfeedingBasics" meets 8 of the 11 USBC knowledge competencies. Mean post-test scores increased (P < .001) for all modules. Faculty reported its benefits to be free, broad scope, and the ability to be completed on the students' own time; 84% of the faculty combined the use of "BreastfeedingBasics" with clinical work. Use of "BreastfeedingBasics" can help HCP/S meet the USBC core breastfeeding knowledge competencies and gain knowledge. Faculty are satisfied with its use. Wider use of "BreastfeedingBasics" to help improve the knowledge of HCP/S may help in improving breastfeeding outcomes.
Mining Hidden Gems Beneath the Surface: A Look At the Invisible Web.

ERIC Educational Resources Information Center

Carlson, Randal D.; Repman, Judi

2002-01-01

Describes resources for researchers called the Invisible Web that are hidden from the usual search engines and other tools and contrasts them with those resources available on the surface Web. Identifies specialized search tools, databases, and strategies that can be used to locate credible in-depth information. (Author/LRW)
Opinion Integration and Summarization

ERIC Educational Resources Information Center

Lu, Yue

2011-01-01

As Web 2.0 applications become increasingly popular, more and more people express their opinions on the Web in various ways in real time. Such wide coverage of topics and abundance of users make the Web an extremely valuable source for mining people's opinions about all kinds of topics. However, since the opinions are usually expressed as…
Mining Formative Evaluation Rules Using Web-Based Learning Portfolios for Web-Based Learning Systems

ERIC Educational Resources Information Center

Chen, Chih-Ming; Hong, Chin-Ming; Chen, Shyuan-Yi; Liu, Chao-Yu

2006-01-01

Learning performance assessment aims to evaluate what knowledge learners have acquired from teaching activities. Objective technical measures of learning performance are difficult to develop, but are extremely important for both teachers and learners. Learning performance assessment using learning portfolios or web server log data is becoming an…
40 CFR 52.254 - Organic solvent usage.

Code of Federal Regulations, 2012 CFR

2012-07-01

... Air Quality Control Regions (the “Regions”), as described in 40 CFR part 81, dated July 1, 1979... contrivances designed for processing continuous web, strip, or wire that emit organic materials in the course... articles, machines, equipment, or other contrivances designed for processing a continuous web, strip, or...
40 CFR 52.254 - Organic solvent usage.

Code of Federal Regulations, 2010 CFR

2010-07-01

... Air Quality Control Regions (the “Regions”), as described in 40 CFR part 81, dated July 1, 1979... contrivances designed for processing continuous web, strip, or wire that emit organic materials in the course... articles, machines, equipment, or other contrivances designed for processing a continuous web, strip, or...
40 CFR 52.254 - Organic solvent usage.

Code of Federal Regulations, 2014 CFR

2014-07-01

... Air Quality Control Regions (the “Regions”), as described in 40 CFR part 81, dated July 1, 1979... contrivances designed for processing continuous web, strip, or wire that emit organic materials in the course... articles, machines, equipment, or other contrivances designed for processing a continuous web, strip, or...
40 CFR 52.254 - Organic solvent usage.

Code of Federal Regulations, 2013 CFR

2013-07-01

... Air Quality Control Regions (the “Regions”), as described in 40 CFR part 81, dated July 1, 1979... contrivances designed for processing continuous web, strip, or wire that emit organic materials in the course... articles, machines, equipment, or other contrivances designed for processing a continuous web, strip, or...
Mine or Theirs, Where Do Users Go? A Comparison of E-Journal Usage at the OhioLINK Electronic Journal Center Platform versus the Elsevier ScienceDirect Platform

ERIC Educational Resources Information Center

Swanson, Juleah

2015-01-01

This research provides librarians with a model for assessing and predicting which platforms patrons will use to access the same content, specifically comparing usage at the Ohio Library and Information Network (OhioLINK) Electronic Journal Center (EJC) and at Elsevier's ScienceDirect from 2007 to 2013. Findings show that in the earlier years, the…
78 FR 77706 - Notice of Intent To Prepare an Environmental Impact Statement for the Proposed Gemfield Mine...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-12-24

... gold mine and associated processing and ancillary facilities. The project would be located on public... media, newspapers and the BLM Web site at: http://www.blm.gov/nv/st/en/fo/battle_mountain_field.html... to construct, operate, reclaim, and close an open pit, heap leach, gold mining operation known as the...
QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

PubMed Central

Dhapola, Parashar; Chowdhury, Shantanu

2016-01-01

DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890
Mining of the social network extraction

NASA Astrophysics Data System (ADS)

Nasution, M. K. M.; Hardi, M.; Syah, R.

2017-01-01

The use of Web as social media is steadily gaining ground in the study of social actor behaviour. However, information in Web can be interpreted in accordance with the ability of the method such as superficial methods for extracting social networks. Each method however has features and drawbacks: it cannot reveal the behaviour of social actors, but it has the hidden information about them. Therefore, this paper aims to reveal such information in the social networks mining. Social behaviour could be expressed through a set of words extracted from the list of snippets.
Analysing Customer Opinions with Text Mining Algorithms

NASA Astrophysics Data System (ADS)

Consoli, Domenico

2009-08-01

Knowing what the customer thinks of a particular product/service helps top management to introduce improvements in processes and products, thus differentiating the company from their competitors and gain competitive advantages. The customers, with their preferences, determine the success or failure of a company. In order to know opinions of the customers we can use technologies available from the web 2.0 (blog, wiki, forums, chat, social networking, social commerce). From these web sites, useful information must be extracted, for strategic purposes, using techniques of sentiment analysis or opinion mining.

Significant applications of ERTS-1 data to resource management activities at the state level in Ohio. [strip mining and land use mapping

NASA Technical Reports Server (NTRS)

Sweet, D. C.; Pincura, P. G.; Meier, C. J.; Garrett, G. B.; Herd, L.; Wukelic, G. E.; Stephan, J. G.; Smail, H. E.

1974-01-01

Described are techniques utilized and the progress made in applying ERTS-1 data to (1) detecting, inventorying, and monitoring surface mining activities, particularly in relation to recently passed strip mine legislation in Ohio; (2) updating current land use maps at various scales for multiagency usage, and (3) solving other real-time problems existing throughout the various Ohio governmental agencies. General conclusions regarding current user views as to the opportunities and limitations of operationally using ERTS-1 data at the state level are also noted.
An analysis of technology usage for streaming digital video in support of a preclinical curriculum.

PubMed

Dev, P; Rindfleisch, T C; Kush, S J; Stringer, J R

2000-01-01

Usage of streaming digital video of lectures in preclinical courses was measured by analysis of the data in the log file maintained on the web server. We observed that students use the video when it is available. They do not use it to replace classroom attendance but rather for review before examinations or when a class has been missed. Usage of video has not increased significantly for any course within the 18 month duration of this project.
Do College Faculty Embrace Web 2.0 Technology?

ERIC Educational Resources Information Center

Siha, Samia M.; Bell, Reginald Lamar; Roebuck, Deborah

2016-01-01

The authors sought to determine if Rogers's Innovation Decision Process model could analyze Web 2.0 usage within the collegiate environment. The key independent variables studied in relationship to this model were gender, faculty rank, course content delivery method, and age. Chi-square nonparametric tests on the independent variables across…
Web Access to Japanese Science and Technology Information.

ERIC Educational Resources Information Center

Takase, Emi

1997-01-01

Describes a project conducted by the Massachusetts Institute of Technology (MIT) Libraries in collaboration with the MIT Japan Program; its objectives are to increase information exchange and enhance cooperation between Japan and the United States through a World Wide Web page and an interactive listserv. Examines usage statistics and issues in…
Efficacy of a Pilot Internet-Based Weight Management Program (H.E.A.L.T.H.) and Longitudinal Physical Fitness Data in Army Reserve Soldiers

PubMed Central

Newton, Robert L; Han, Hongmei; Stewart, Tiffany M; Ryan, Donna H; Williamson, Donald A

2011-01-01

Background The primary aims of this article are to describe the utilization of an Internet-based weight management Web site [Healthy Eating, Activity, and Lifestyle Training Headquarters (H.E.A.L.T.H.)] over a 12–27 month period and to describe concurrent weight and fitness changes in Army Reserve soldiers. Methods The H.E.A.L.T.H. Web site was marketed to Army Reserve soldiers via a Web site promotion program for 27 months (phase I) and its continued usage was observed over a subsequent 12-month period (phase II). Web site usage was obtained from the H.E.A.L.T.H. Web site. Weight and fitness data were extracted from the Regional Level Application Software (RLAS). Results A total of 1499 Army Reserve soldiers registered on the H.E.A.L.T.H. Web site. There were 118 soldiers who returned to the H.E.A.L.T.H. Web site more than once. Registration rate reduced significantly following the removal of the Web site promotion program. During phase I, 778 Army Reserve soldiers had longitudinal weight and fitness data in RLAS. Men exceeding the screening table weight gained less weight compared with men below it (p < .007). Percentage change in body weight was inversely associated with change in fitness scores. Conclusions The Web site promotion program resulted in 52% of available Army Reserve soldiers registering onto the H.E.A.L.T.H. Web site, and 7.9% used the Web site more than once. The H.E.A.L.T.H. Web site may be a viable population-based weight and fitness management tool for soldier use. PMID:22027327
What Are the Usage Conditions of Web 2.0 Tools Faculty of Education Students?

ERIC Educational Resources Information Center

Agir, Ahmet

2014-01-01

As a result of advances in technology and then the emergence of using Internet in every step of life, web that provides access to the documents such as picture, audio, animation and text in Internet started to be used. At first, web consists of only visual and text pages that couldn't enable to make user's interaction. However, it is seen that not…
Utilizing Web Information Systems for Organizational Knowledge Work: An Investigation of the Information Ecology and Information Behaviors of Users in a Telecommunications Company.

ERIC Educational Resources Information Center

Detlor, Brian

This paper outlines a detailed research investigation of Web information systems (WIS), such as intranets, extranets, and the World Wide Web, and their capacity to facilitate organizational knowledge work. The objective was to conduct a case study evaluation of WIS usage that examines the information needs and uses of major sets of users and the…
Quantitative Analysis of the Usage of the COSMOS Science Education Portal

ERIC Educational Resources Information Center

Sotiriou, Sofoklis; Bogner, Franz X.; Neofotistos, George

2011-01-01

A quantitative method of mapping the web usage of an innovative educational portal is applied to analyze the behaviour of users of the COSMOS Science Education Portal. The COSMOS Portal contains user-generated resources (that are uploaded by its users). It has been designed to support a science teacher's search, retrieval and access to both,…
Comparison of Turkish and US Pre-Service Teachers' Web 2.0 Tools Usage Characteristics

ERIC Educational Resources Information Center

Kiyici, Mubin; Akyeampong, Albert; Balkan Kiyici, Fatime

2013-01-01

As the Internet and computer develop, the world is changing dramatically and fantastically. Usage of technological tools is increased day by day in daily life besides ICT. All the technological tools shape individual behavior, life style and learning style as well as individual lives. Today's child use different tools and different way to…
First Report about an E-learning Application Supporting PBL: Students' Usages, Satisfactions, and Achievements

ERIC Educational Resources Information Center

Gurpinar, Erol; Zayim, Nese; Ozenci, Ciler Celik; Alimoglu, Mustafa Kemal

2009-01-01

The purpose of the study was to determine applicability of e-learning in problem based learning (PBL) by investigating its usage and acceptability among students and its effect on academic achievement. The study was carried out among first year medical students of Akdeniz University, Turkey. A web-based learning environment (WBLE) including…
WWW Motivation Mining: Finding Treasures for Teaching Evaluation Skills, Grades 1-6. Professional Growth Series.

ERIC Educational Resources Information Center

Arnone, Marilyn P.; Small, Ruth V.

Designed for elementary or middle school teachers and library media specialists, this book provides educators with practical, easy-to-use ways of applying motivation assessment techniques when selecting World Wide Web sites for inclusion in their lessons and offers concrete examples of how to use Web evaluation with young learners. WebMAC…
Dental practice websites: creating a Web presence.

PubMed

Miller, Syrene A; Forrest, Jane L

2002-07-01

Web technology provides an opportunity for dentists to showcase their practice philosophy, quality of care, office setting, and staff in a creative manner. Having a Website provides a practice with innovative and cost-effective communications and marketing tools for current and potential patients who use the Internet. The main benefits of using a Website to promote one's practice are: Making office time more productive, tasks more timely, follow-up less necessary Engaging patients in an interactive and visual learning process Providing online forms and procedure examples for patients Projecting a competent and current image Tracking the usage of Web pages. Several options are available when considering the development of a Website. These options range in cost based on customization of the site and ongoing support services, such as site updates, technical assistance, and Web usage statistics. In most cases, Websites are less expensive than advertising in the phone book. Options in creating a Website include building one's own, employing a company that offers Website templates, and employing a company that offers customized sites. These development options and benefits will continue to grow as individuals access the Web and more information and sites become available.
Astrophysical data mining with GPU. A case study: Genetic classification of globular clusters

NASA Astrophysics Data System (ADS)

Cavuoti, S.; Garofalo, M.; Brescia, M.; Paolillo, M.; Pescape', A.; Longo, G.; Ventre, G.

2014-01-01

We present a multi-purpose genetic algorithm, designed and implemented with GPGPU/CUDA parallel computing technology. The model was derived from our CPU serial implementation, named GAME (Genetic Algorithm Model Experiment). It was successfully tested and validated on the detection of candidate Globular Clusters in deep, wide-field, single band HST images. The GPU version of GAME will be made available to the community by integrating it into the web application DAMEWARE (DAta Mining Web Application REsource, http://dame.dsf.unina.it/beta_info.html), a public data mining service specialized on massive astrophysical data. Since genetic algorithms are inherently parallel, the GPGPU computing paradigm leads to a speedup of a factor of 200× in the training phase with respect to the CPU based version.
Web services-based text-mining demonstrates broad impacts for interoperability and process simplification.

PubMed

Wiegers, Thomas C; Davis, Allan Peter; Mattingly, Carolyn J

2014-01-01

The Critical Assessment of Information Extraction systems in Biology (BioCreAtIvE) challenge evaluation tasks collectively represent a community-wide effort to evaluate a variety of text-mining and information extraction systems applied to the biological domain. The BioCreative IV Workshop included five independent subject areas, including Track 3, which focused on named-entity recognition (NER) for the Comparative Toxicogenomics Database (CTD; http://ctdbase.org). Previously, CTD had organized document ranking and NER-related tasks for the BioCreative Workshop 2012; a key finding of that effort was that interoperability and integration complexity were major impediments to the direct application of the systems to CTD's text-mining pipeline. This underscored a prevailing problem with software integration efforts. Major interoperability-related issues included lack of process modularity, operating system incompatibility, tool configuration complexity and lack of standardization of high-level inter-process communications. One approach to potentially mitigate interoperability and general integration issues is the use of Web services to abstract implementation details; rather than integrating NER tools directly, HTTP-based calls from CTD's asynchronous, batch-oriented text-mining pipeline could be made to remote NER Web services for recognition of specific biological terms using BioC (an emerging family of XML formats) for inter-process communications. To test this concept, participating groups developed Representational State Transfer /BioC-compliant Web services tailored to CTD's NER requirements. Participants were provided with a comprehensive set of training materials. CTD evaluated results obtained from the remote Web service-based URLs against a test data set of 510 manually curated scientific articles. Twelve groups participated in the challenge. Recall, precision, balanced F-scores and response times were calculated. Top balanced F-scores for gene, chemical and disease NER were 61, 74 and 51%, respectively. Response times ranged from fractions-of-a-second to over a minute per article. We present a description of the challenge and summary of results, demonstrating how curation groups can effectively use interoperable NER technologies to simplify text-mining pipeline implementation. Database URL: http://ctdbase.org/ © The Author(s) 2014. Published by Oxford University Press.
Web services-based text-mining demonstrates broad impacts for interoperability and process simplification

PubMed Central

Wiegers, Thomas C.; Davis, Allan Peter; Mattingly, Carolyn J.

2014-01-01

The Critical Assessment of Information Extraction systems in Biology (BioCreAtIvE) challenge evaluation tasks collectively represent a community-wide effort to evaluate a variety of text-mining and information extraction systems applied to the biological domain. The BioCreative IV Workshop included five independent subject areas, including Track 3, which focused on named-entity recognition (NER) for the Comparative Toxicogenomics Database (CTD; http://ctdbase.org). Previously, CTD had organized document ranking and NER-related tasks for the BioCreative Workshop 2012; a key finding of that effort was that interoperability and integration complexity were major impediments to the direct application of the systems to CTD's text-mining pipeline. This underscored a prevailing problem with software integration efforts. Major interoperability-related issues included lack of process modularity, operating system incompatibility, tool configuration complexity and lack of standardization of high-level inter-process communications. One approach to potentially mitigate interoperability and general integration issues is the use of Web services to abstract implementation details; rather than integrating NER tools directly, HTTP-based calls from CTD's asynchronous, batch-oriented text-mining pipeline could be made to remote NER Web services for recognition of specific biological terms using BioC (an emerging family of XML formats) for inter-process communications. To test this concept, participating groups developed Representational State Transfer /BioC-compliant Web services tailored to CTD's NER requirements. Participants were provided with a comprehensive set of training materials. CTD evaluated results obtained from the remote Web service-based URLs against a test data set of 510 manually curated scientific articles. Twelve groups participated in the challenge. Recall, precision, balanced F-scores and response times were calculated. Top balanced F-scores for gene, chemical and disease NER were 61, 74 and 51%, respectively. Response times ranged from fractions-of-a-second to over a minute per article. We present a description of the challenge and summary of results, demonstrating how curation groups can effectively use interoperable NER technologies to simplify text-mining pipeline implementation. Database URL: http://ctdbase.org/ PMID:24919658
Operating System Support for Shared Hardware Data Structures

DTIC Science & Technology

2013-01-31

Carbon [73] uses hardware queues to improve fine-grained multitasking for Recognition, Mining , and Synthesis. Compared to software ap- proaches...web transaction processing, data mining , and multimedia. Early work in database processors [114, 96, 79, 111] reduce the costs of relational database...assignment can be solved statically or dynamically. Static assignment deter- mines offline which data structures are assigned to use HWDS resources and at
77 FR 4360 - Notice of Availability of the Draft Environmental Impact Statement for the Hycroft Mine Expansion...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-01-27

... comments related to the Hycroft Mine Expansion Draft EIS by any of the following methods: Web site: www.blm..., Nevada 89445, Attn. Kathleen Rehberg. Copies of the Hycroft Mine Expansion Draft EIS are available in the... hours. The FIRS is available 24 hours a day, 7 days a week, to leave a message or question with the...
Mining Software Usage with the Automatic Library Tracking Database (ALTD)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hadri, Bilel; Fahey, Mark R

2013-01-01

Tracking software usage is important for HPC centers, computer vendors, code developers and funding agencies to provide more efficient and targeted software support, and to forecast needs and guide HPC software effort towards the Exascale era. However, accurately tracking software usage on HPC systems has been a challenging task. In this paper, we present a tool called Automatic Library Tracking Database (ALTD) that has been developed and put in production on several Cray systems. The ALTD infrastructure prototype automatically and transparently stores information about libraries linked into an application at compilation time and also the executables launched in a batchmore » job. We will illustrate the usage of libraries, compilers and third party software applications on a system managed by the National Institute for Computational Sciences.« less
Geochemical characteristics of rare earth elements in different types of soil: A chemometric approach.

PubMed

Khan, Aysha Masood; Behkami, Shima; Yusoff, Ismail; Md Zain, Sharifuddin Bin; Bakar, Nor Kartini Abu; Bakar, Ahmad Farid Abu; Alias, Yatimah

2017-10-01

Rare earth elements (REEs) are becoming significant due to their huge applications in many industries, large-scale mining and refining activities. Increasing usage of such metals pose negative environmental impacts. In this research ICP-MS has been used to analyze soil samples collected from former ex-mining areas in the depths of 0-20 cm, 21-40 cm, and 41-60 cm of residential, mining, natural, and industrial areas of Perak. Principal component analysis (PCA) revealed that soil samples taken from different mining, industrial, residential, and natural areas are separated into four clusters. It was observed that REEs were abundant in most of the samples from mining areas. Concentration of the rare elements decrease in general as we move from surface soil to deeper soils. Copyright © 2017 Elsevier Ltd. All rights reserved.
Alkemio: association of chemicals with biomedical topics by text and data mining

PubMed Central

Gijón-Correas, José A.; Andrade-Navarro, Miguel A.; Fontaine, Jean F.

2014-01-01

The PubMed® database of biomedical citations allows the retrieval of scientific articles studying the function of chemicals in biology and medicine. Mining millions of available citations to search reported associations between chemicals and topics of interest would require substantial human time. We have implemented the Alkemio text mining web tool and SOAP web service to help in this task. The tool uses biomedical articles discussing chemicals (including drugs), predicts their relatedness to the query topic with a naïve Bayesian classifier and ranks all chemicals by P-values computed from random simulations. Benchmarks on seven human pathways showed good retrieval performance (areas under the receiver operating characteristic curves ranged from 73.6 to 94.5%). Comparison with existing tools to retrieve chemicals associated to eight diseases showed the higher precision and recall of Alkemio when considering the top 10 candidate chemicals. Alkemio is a high performing web tool ranking chemicals for any biomedical topics and it is free to non-commercial users. Availability: http://cbdm.mdc-berlin.de/∼medlineranker/cms/alkemio. PMID:24838570

Geovisualization of Local and Regional Migration Using Web-mined Demographics

NASA Astrophysics Data System (ADS)

Schuermann, R. T.; Chow, T. E.

2014-11-01

The intent of this research was to augment and facilitate analyses, which gauges the feasibility of web-mined demographics to study spatio-temporal dynamics of migration. As a case study, we explored the spatio-temporal dynamics of Vietnamese Americans (VA) in Texas through geovisualization of mined demographic microdata from the World Wide Web. Based on string matching across all demographic attributes, including full name, address, date of birth, age and phone number, multiple records of the same entity (i.e. person) over time were resolved and reconciled into a database. Migration trajectories were geovisualized through animated sprites by connecting the different addresses associated with the same person and segmenting the trajectory into small fragments. Intra-metropolitan migration patterns appeared at the local scale within many metropolitan areas. At the scale of metropolitan area, varying degrees of immigration and emigration manifest different types of migration clusters. This paper presents a methodology incorporating GIS methods and cartographic design to produce geovisualization animation, enabling the cognitive identification of migration patterns at multiple scales. Identification of spatio-temporal patterns often stimulates further research to better understand the phenomenon and enhance subsequent modeling.
Integration of Geographical Information Systems and Geophysical Applications with Distributed Computing Technologies.

NASA Astrophysics Data System (ADS)

Pierce, M. E.; Aktas, M. S.; Aydin, G.; Fox, G. C.; Gadgil, H.; Sayar, A.

2005-12-01

We examine the application of Web Service Architectures and Grid-based distributed computing technologies to geophysics and geo-informatics. We are particularly interested in the integration of Geographical Information System (GIS) services with distributed data mining applications. GIS services provide the general purpose framework for building archival data services, real time streaming data services, and map-based visualization services that may be integrated with data mining and other applications through the use of distributed messaging systems and Web Service orchestration tools. Building upon on our previous work in these areas, we present our current research efforts. These include fundamental investigations into increasing XML-based Web service performance, supporting real time data streams, and integrating GIS mapping tools with audio/video collaboration systems for shared display and annotation.
Usage, Barriers, and Training of Web 2.0 Technology Applications

ERIC Educational Resources Information Center

Pritchett, Christopher G.; Pritchett, Christal C.; Wohleb, Elisha C.

2013-01-01

This research study was designed to determine the degree of use of Web 2.0 technology applications by certified education professionals and examine differences among various groups as well as reasons for these differences. A quantitative survey instrument was developed to gather demographic information and data. Participants reported they would be…
Social Work Information Center 2.0: A Case Study

ERIC Educational Resources Information Center

Xu, F. Grace

2009-01-01

The social work library at USC provides a case study of an academic library's transition to an information center service model. Analysis of the collection, user community, Web 2.0 applications, and Web usage data demonstrates how the changes facilitated library services and information literacy instruction. (Contains 6 tables and 3 figures.)
From Sensor to Observation Web with environmental enablers in the Future Internet.

PubMed

Havlik, Denis; Schade, Sven; Sabeur, Zoheir A; Mazzetti, Paolo; Watson, Kym; Berre, Arne J; Mon, Jose Lorenzo

2011-01-01

This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities' environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term "envirofied" Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere), the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research topic is handling data fusion and model based computation, and the related propagation of information uncertainty. Approaches to security, standardization and harmonization, all essential for sustainable solutions, are summarized from the perspective of the Environmental Usage Area. The paper concludes with an overview of emerging, high impact applications in the environmental areas concerning land ecosystems (biodiversity), air quality (atmospheric conditions) and water ecosystems (marine asset management).
From Sensor to Observation Web with Environmental Enablers in the Future Internet

PubMed Central

Havlik, Denis; Schade, Sven; Sabeur, Zoheir A.; Mazzetti, Paolo; Watson, Kym; Berre, Arne J.; Mon, Jose Lorenzo

2011-01-01

This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities’ environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term “envirofied” Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere), the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research topic is handling data fusion and model based computation, and the related propagation of information uncertainty. Approaches to security, standardization and harmonization, all essential for sustainable solutions, are summarized from the perspective of the Environmental Usage Area. The paper concludes with an overview of emerging, high impact applications in the environmental areas concerning land ecosystems (biodiversity), air quality (atmospheric conditions) and water ecosystems (marine asset management). PMID:22163827
Mining Available Data from the United States Environmental ...

EPA Pesticide Factsheets

Demands for quick and accurate life cycle assessments create a need for methods to rapidly generate reliable life cycle inventories (LCI). Data mining is a suitable tool for this purpose, especially given the large amount of available governmental data. These data are typically applied to LCIs on a case-by-case basis. As linked open data becomes more prevalent, it may be possible to automate LCI using data mining by establishing a reproducible approach for identifying, extracting, and processing the data. This work proposes a method for standardizing and eventually automating the discovery and use of publicly available data at the United States Environmental Protection Agency for chemical-manufacturing LCI. The method is developed using a case study of acetic acid. The data quality and gap analyses for the generated inventory found that the selected data sources can provide information with equal or better reliability and representativeness on air, water, hazardous waste, on-site energy usage, and production volumes but with key data gaps including material inputs, water usage, purchased electricity, and transportation requirements. A comparison of the generated LCI with existing data revealed that the data mining inventory is in reasonable agreement with existing data and may provide a more-comprehensive inventory of air emissions and water discharges. The case study highlighted challenges for current data management practices that must be overcome to successfu
Electrical Resistivity Imaging

EPA Science Inventory

Electrical resistivity imaging (ERI) is a geophysical method originally developed within the mining industry where it has been used for decades to explore for and characterize subsurface mineral deposits. It is one of the oldest geophysical methods with the first documented usag...
30 CFR 75.1107-6 - Capacity of fire suppression devices; location and direction of nozzles.

Code of Federal Regulations, 2013 CFR

2013-07-01

... withstand rough usage and vibration when installed on mining equipment. (b) The extinguishant-discharge..., or combination type. Where fire control is achieved by internal injection, or combination of internal...
30 CFR 75.1107-6 - Capacity of fire suppression devices; location and direction of nozzles.

Code of Federal Regulations, 2012 CFR

2012-07-01

... withstand rough usage and vibration when installed on mining equipment. (b) The extinguishant-discharge..., or combination type. Where fire control is achieved by internal injection, or combination of internal...
30 CFR 75.1107-6 - Capacity of fire suppression devices; location and direction of nozzles.

Code of Federal Regulations, 2011 CFR

2011-07-01

... withstand rough usage and vibration when installed on mining equipment. (b) The extinguishant-discharge..., or combination type. Where fire control is achieved by internal injection, or combination of internal...
30 CFR 75.1107-6 - Capacity of fire suppression devices; location and direction of nozzles.

Code of Federal Regulations, 2014 CFR

2014-07-01

... withstand rough usage and vibration when installed on mining equipment. (b) The extinguishant-discharge..., or combination type. Where fire control is achieved by internal injection, or combination of internal...
Mining and Utilizing Dataset Relevancy from Oceanographic Dataset (MUDROD) Metadata, Usage Metrics, and User Feedback to Improve Data Discovery and Access

NASA Astrophysics Data System (ADS)

Jiang, Y.

2015-12-01

Oceanographic resource discovery is a critical step for developing ocean science applications. With the increasing number of resources available online, many Spatial Data Infrastructure (SDI) components (e.g. catalogues and portals) have been developed to help manage and discover oceanographic resources. However, efficient and accurate resource discovery is still a big challenge because of the lack of data relevancy information. In this article, we propose a search engine framework for mining and utilizing dataset relevancy from oceanographic dataset metadata, usage metrics, and user feedback. The objective is to improve discovery accuracy of oceanographic data and reduce time for scientist to discover, download and reformat data for their projects. Experiments and a search example show that the propose engine helps both scientists and general users search for more accurate results with enhanced performance and user experience through a user-friendly interface.
Usage and Design Evaluation by Family Caregivers of aStroke Intervention Website

PubMed Central

Pierce, Linda L.; Steiner, Victoria

2013-01-01

Background Four out of 5 families are affected by stroke. Many caregivers access the Internet and gather healthcare information from web-based sources. Design The purpose of this descriptive evaluation was to assess the usage and design of the Caring~Web© site, which provides education/support for family caregivers of persons with stroke residing in home settings. Sample and Setting Thirty-six caregivers from two Midwest states accessed this intervention in a 1-year study. The average participant was fifty-four years of age, white, female, and the spouse of the care recipient. Methods In a telephone interview, four website questions were asked twice-/bi-monthly and a 33-item Survey at the conclusion of the study evaluated the website usage and design of its components. Descriptive analysis methods were used and statistics were collected on the number of visits to the website. Results On average, participants logged on to the website one to two hours per week, although usage declined after several months for some participants. Participants positively rated the website’s appearance and usability that included finding the training to be adequate. Conclusion Website designers can replicate this intervention for other health conditions. PMID:24025464
Literature Mining Methods for Toxicology and Construction of ...

EPA Pesticide Factsheets

Webinar Presentation on text-mining methodologies in use at NCCT and how they can be used to assist with the OECD Retinoid project. Presentation to 1st Workshop/Scientific Expert Group meeting on the OECD Retinoid Project - April 26, 2016 –Brussels, Presented remotely via web.
Monitoring food safety violation reports from internet forums.

PubMed

Kate, Kiran; Negi, Sumit; Kalagnanam, Jayant

2014-01-01

Food-borne illness is a growing public health concern in the world. Government bodies, which regulate and monitor the state of food safety, solicit citizen feedback about food hygiene practices followed by food establishments. They use traditional channels like call center, e-mail for such feedback collection. With the growing popularity of Web 2.0 and social media, citizens often post such feedback on internet forums, message boards etc. The system proposed in this paper applies text mining techniques to identify and mine such food safety complaints posted by citizens on web data sources thereby enabling the government agencies to gather more information about the state of food safety. In this paper, we discuss the architecture of our system and the text mining methods used. We also present results which demonstrate the effectiveness of this system in a real-world deployment.
TCGA4U: A Web-Based Genomic Analysis Platform To Explore And Mine TCGA Genomic Data For Translational Research.

PubMed

Huang, Zhenzhen; Duan, Huilong; Li, Haomin

2015-01-01

Large-scale human cancer genomics projects, such as TCGA, generated large genomics data for further study. Exploring and mining these data to obtain meaningful analysis results can help researchers find potential genomics alterations that intervene the development and metastasis of tumors. We developed a web-based gene analysis platform, named TCGA4U, which used statistics methods and models to help translational investigators explore, mine and visualize human cancer genomic characteristic information from the TCGA datasets. Furthermore, through Gene Ontology (GO) annotation and clinical data integration, the genomic data were transformed into biological process, molecular function, cellular component and survival curves to help researchers identify potential driver genes. Clinical researchers without expertise in data analysis will benefit from such a user-friendly genomic analysis platform.
The use of online information resources by nurses.

PubMed

Wozar, Jody A; Worona, Paul C

2003-04-01

Based on the results of an informal needs assessment, the Usage of Online Information Resources by Nurses Project was designed to provide clinical nurses with accurate medical information at the point of care by introducing them to existing online library resources through instructional classes. Actual usage of the resources was then monitored for a set period of time. A two-hour hands-on class was developed for interested nurses. Participants were instructed in the content and use of several different online resources. A special Web page was designed for this project serving as an access point to the resources. Using a password system and WebTrends trade mark software, individual participant's usage of the resources was monitored for a thirty-day period following the class. At the end of the thirty days, usage results were tabulated, and participants were sent general evaluation forms. Eight participants accessed the project page thirty-nine times in a thirty-day period. The most accessed resource was Primary Care Online (PCO), accessed thirty-three times. PCO was followed by MD Consult (17), Ovid (8), NLM resources (5), and electronic journals (1). The individual with the highest usage accessed the project page thirteen times. Practicing clinical nurses will use online medical information resources if they are first introduced to them and taught how to access and use them. Health sciences librarians can play an important role in providing instruction to this often overlooked population.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kargupta, H.; Stafford, B.; Hamzaoglu, I.

This paper describes an experimental parallel/distributed data mining system PADMA (PArallel Data Mining Agents) that uses software agents for local data accessing and analysis and a web based interface for interactive data visualization. It also presents the results of applying PADMA for detecting patterns in unstructured texts of postmortem reports and laboratory test data for Hepatitis C patients.
Data warehousing as a basis for web-based documentation of data mining and analysis.

PubMed

Karlsson, J; Eklund, P; Hallgren, C G; Sjödin, J G

1999-01-01

In this paper we present a case study for data warehousing intended to support data mining and analysis. We also describe a prototype for data retrieval. Further we discuss some technical issues related to a particular choice of a patient record environment.

On-Board Mining in the Sensor Web

NASA Astrophysics Data System (ADS)

Tanner, S.; Conover, H.; Graves, S.; Ramachandran, R.; Rushing, J.

2004-12-01

On-board data mining can contribute to many research and engineering applications, including natural hazard detection and prediction, intelligent sensor control, and the generation of customized data products for direct distribution to users. The ability to mine sensor data in real time can also be a critical component of autonomous operations, supporting deep space missions, unmanned aerial and ground-based vehicles (UAVs, UGVs), and a wide range of sensor meshes, webs and grids. On-board processing is expected to play a significant role in the next generation of NASA, Homeland Security, Department of Defense and civilian programs, providing for greater flexibility and versatility in measurements of physical systems. In addition, the use of UAV and UGV systems is increasing in military, emergency response and industrial applications. As research into the autonomy of these vehicles progresses, especially in fleet or web configurations, the applicability of on-board data mining is expected to increase significantly. Data mining in real time on board sensor platforms presents unique challenges. Most notably, the data to be mined is a continuous stream, rather than a fixed store such as a database. This means that the data mining algorithms must be modified to make only a single pass through the data. In addition, the on-board environment requires real time processing with limited computing resources, thus the algorithms must use fixed and relatively small amounts of processing time and memory. The University of Alabama in Huntsville is developing an innovative processing framework for the on-board data and information environment. The Environment for On-Board Processing (EVE) and the Adaptive On-board Data Processing (AODP) projects serve as proofs-of-concept of advanced information systems for remote sensing platforms. The EVE real-time processing infrastructure will upload, schedule and control the execution of processing plans on board remote sensors. These plans provide capabilities for autonomous data mining, classification and feature extraction using both streaming and buffered data sources. A ground-based testbed provides a heterogeneous, embedded hardware and software environment representing both space-based and ground-based sensor platforms, including wireless sensor mesh architectures. The AODP project explores the EVE concepts in the world of sensor-networks, including ad-hoc networks of small sensor platforms.
Designing and Managing Your Digital Library.

ERIC Educational Resources Information Center

Guenther, Kim

2000-01-01

Discusses digital libraries and Web site design issues. Highlights include accessibility issues, including standards, markup languages like HTML and XML, and metadata; building virtual communities; the use of Web portals for customized delivery of information; quality assurance tools, including data mining; and determining user needs, including…
BioServices: a common Python package to access biological Web Services programmatically.

PubMed

Cokelaer, Thomas; Pultz, Dennis; Harder, Lea M; Serra-Musach, Jordi; Saez-Rodriguez, Julio

2013-12-15

Web interfaces provide access to numerous biological databases. Many can be accessed to in a programmatic way thanks to Web Services. Building applications that combine several of them would benefit from a single framework. BioServices is a comprehensive Python framework that provides programmatic access to major bioinformatics Web Services (e.g. KEGG, UniProt, BioModels, ChEMBLdb). Wrapping additional Web Services based either on Representational State Transfer or Simple Object Access Protocol/Web Services Description Language technologies is eased by the usage of object-oriented programming. BioServices releases and documentation are available at http://pypi.python.org/pypi/bioservices under a GPL-v3 license.
MetalS(3), a database-mining tool for the identification of structurally similar metal sites.

PubMed

Valasatava, Yana; Rosato, Antonio; Cavallaro, Gabriele; Andreini, Claudia

2014-08-01

We have developed a database search tool to identify metal sites having structural similarity to a query metal site structure within the MetalPDB database of minimal functional sites (MFSs) contained in metal-binding biological macromolecules. MFSs describe the local environment around the metal(s) independently of the larger context of the macromolecular structure. Such a local environment has a determinant role in tuning the chemical reactivity of the metal, ultimately contributing to the functional properties of the whole system. The database search tool, which we called MetalS(3) (Metal Sites Similarity Search), can be accessed through a Web interface at http://metalweb.cerm.unifi.it/tools/metals3/ . MetalS(3) uses a suitably adapted version of an algorithm that we previously developed to systematically compare the structure of the query metal site with each MFS in MetalPDB. For each MFS, the best superposition is kept. All these superpositions are then ranked according to the MetalS(3) scoring function and are presented to the user in tabular form. The user can interact with the output Web page to visualize the structural alignment or the sequence alignment derived from it. Options to filter the results are available. Test calculations show that the MetalS(3) output correlates well with expectations from protein homology considerations. Furthermore, we describe some usage scenarios that highlight the usefulness of MetalS(3) to obtain mechanistic and functional hints regardless of homology.
Competence and Usage of Web 2.0 Technologies by Higher Education Faculty

ERIC Educational Resources Information Center

Soomro, Kamal Ahmed; Zai, Sajid Yousuf; Jafri, Iftikhar Hussain

2015-01-01

Literature on Web 2.0 experiences of higher education faculty in developing countries such as Pakistan is very limited. An insight on awareness and practices of higher education faculty with these tools can be helpful to map strategies and plan of action for adopting latest technologies to support teaching-learning processes in higher education of…
The Role of Peer Influence and Perceived Quality of Teaching in Faculty Acceptance of Web-Based Learning Management Systems

ERIC Educational Resources Information Center

Salajan, Florin D.; Welch, Anita G.; Ray, Chris M.; Peterson, Claudette

2015-01-01

This study's primary investigation is the impact of "peer influence" and "perceived quality of teaching" on faculty members' usage of web-based learning management systems within the Technology Acceptance Model (TAM) framework. These factors are entered into an extended TAM as external variables impacting on the core constructs…
Critical Success Factors for Adoption of Web-Based Learning Management Systems in Tanzania

ERIC Educational Resources Information Center

Lwoga, Edda Tandi

2014-01-01

This paper examines factors that predict students' continual usage intention of web-based learning content management systems in Tanzania, with a specific focus at Muhimbili University of Health and Allied Science (MUHAS). This study sent a questionnaire surveys to 408 first year undergraduate students, with a rate of return of 66.7. This study…
The Islamic State Battle Plan: Press Release Natural Language Processing

DTIC Science & Technology

2016-06-01

Processing, text mining , corpus, generalized linear model, cascade, R Shiny, leaflet, data visualization 15. NUMBER OF PAGES 83 16. PRICE CODE...Terrorism and Responses to Terrorism TDM Term Document Matrix TF Term Frequency TF-IDF Term Frequency-Inverse Document Frequency tm text mining (R...package=leaflet. Feinerer I, Hornik K (2015) Text Mining Package “tm,” Version 0.6-2. (Jul 3) https://cran.r-project.org/web/packages/tm/tm.pdf
Numerical linear algebra in data mining

NASA Astrophysics Data System (ADS)

Eldén, Lars

Ideas and algorithms from numerical linear algebra are important in several areas of data mining. We give an overview of linear algebra methods in text mining (information retrieval), pattern recognition (classification of handwritten digits), and PageRank computations for web search engines. The emphasis is on rank reduction as a method of extracting information from a data matrix, low-rank approximation of matrices using the singular value decomposition and clustering, and on eigenvalue methods for network analysis.
About Student's Media Use for Learning in Tertiary Education Influence Factors and Structures of Usage Behavior

ERIC Educational Resources Information Center

Grosch, Michael

2014-01-01

The rise of the web 2.0 led to dramatic changes in media usage behavior of students in tertiary education. Services such as Google and Facebook are most accepted amongst students not only in pastime but also for learning. A representative survey was made at Karlsruhe Institute of Technology (KIT). About 1,400 students were asked 150 questions to…
Students with LD in Higher Education: Use and Contribution of Assistive Technology and Website Courses and Their Correlation to Students' Hope and Well-Being

ERIC Educational Resources Information Center

Heiman, Tali; Shemesh, Dorit Olenik

2012-01-01

This study examined the extent and patterns of usage of web courses, and their contribution to the academic and social perceptions of 964 undergraduate students with and without learning disabilities studying in higher education. Students were asked to complete four questionnaires examining the usage patterns of various adaptive technologies and…
Spinning Gland Transcriptomics from Two Main Clades of Spiders (Order: Araneae) - Insights on Their Molecular, Anatomical and Behavioral Evolution

PubMed Central

Prosdocimi, Francisco; Bittencourt, Daniela; da Silva, Felipe Rodrigues; Kirst, Matias; Motta, Paulo C.; Rech, Elibio L.

2011-01-01

Characterized by distinctive evolutionary adaptations, spiders provide a comprehensive system for evolutionary and developmental studies of anatomical organs, including silk and venom production. Here we performed cDNA sequencing using massively parallel sequencers (454 GS-FLX Titanium) to generate ∼80,000 reads from the spinning gland of Actinopus spp. (infraorder: Mygalomorphae) and Gasteracantha cancriformis (infraorder: Araneomorphae, Orbiculariae clade). Actinopus spp. retains primitive characteristics on web usage and presents a single undifferentiated spinning gland while the orbiculariae spiders have seven differentiated spinning glands and complex patterns of web usage. MIRA, Celera Assembler and CAP3 software were used to cluster NGS reads for each spider. CAP3 unigenes passed through a pipeline for automatic annotation, classification by biological function, and comparative transcriptomics. Genes related to spider silks were manually curated and analyzed. Although a single spidroin gene family was found in Actinopus spp., a vast repertoire of specialized spider silk proteins was encountered in orbiculariae. Astacin-like metalloproteases (meprin subfamily) were shown to be some of the most sampled unigenes and duplicated gene families in G. cancriformis since its evolutionary split from mygalomorphs. Our results confirm that the evolution of the molecular repertoire of silk proteins was accompanied by the (i) anatomical differentiation of spinning glands and (ii) behavioral complexification in the web usage. Finally, a phylogenetic tree was constructed to cluster most of the known spidroins in gene clades. This is the first large-scale, multi-organism transcriptome for spider spinning glands and a first step into a broad understanding of spider web systems biology and evolution. PMID:21738742
Public reaction to Chikungunya outbreaks in Italy-Insights from an extensive novel data streams-based structural equation modeling analysis.

PubMed

Mahroum, Naim; Adawi, Mohammad; Sharif, Kassem; Waknin, Roy; Mahagna, Hussein; Bisharat, Bishara; Mahamid, Mahmud; Abu-Much, Arsalan; Amital, Howard; Luigi Bragazzi, Nicola; Watad, Abdulla

2018-01-01

The recent outbreak of Chikungunya virus in Italy represents a serious public health concern, which is attracting media coverage and generating public interest in terms of Internet searches and social media interactions. Here, we sought to assess the Chikungunya-related digital behavior and the interplay between epidemiological figures and novel data streams traffic. Reaction to the recent outbreak was analyzed in terms of Google Trends, Google News and Twitter traffic, Wikipedia visits and edits, and PubMed articles, exploiting structural modelling equations. A total of 233,678 page-views and 150 edits on the Italian Wikipedia page, 3,702 tweets, 149 scholarly articles, and 3,073 news articles were retrieved. The relationship between overall Chikungunya cases, as well as autochthonous cases, and tweets production was found to be fully mediated by Chikungunya-related web searches. However, in the allochthonous/imported cases model, tweet production was not found to be significantly mediated by epidemiological figures, with web searches still significantly mediating tweet production. Inconsistent relationships were detected in mediation models involving Wikipedia usage as a mediator variable. Similarly, the effect between news consumption and tweets production was suppressed by the Wikipedia usage. A further inconsistent mediation was found in the case of the effect between Wikipedia usage and tweets production, with web searches as a mediator variable. When adjusting for the Internet penetration index, similar findings could be obtained, with the important exception that in the adjusted model the relationship between GN and Twitter was found to be partially mediated by Wikipedia usage. Furthermore, the link between Wikipedia usage and PubMed/MEDLINE was fully mediated by GN, differently from what was found in the unadjusted model. In conclusion-a significant public reaction to the current Chikungunya outbreak was documented. Health authorities should be aware of this, recognizing the role of new technologies for collecting public concerns and replying to them, disseminating awareness and avoid misleading information.
Public reaction to Chikungunya outbreaks in Italy—Insights from an extensive novel data streams-based structural equation modeling analysis

PubMed Central

Sharif, Kassem; Waknin, Roy; Mahagna, Hussein; Bisharat, Bishara; Mahamid, Mahmud; Abu-Much, Arsalan; Amital, Howard; Luigi Bragazzi, Nicola

2018-01-01

The recent outbreak of Chikungunya virus in Italy represents a serious public health concern, which is attracting media coverage and generating public interest in terms of Internet searches and social media interactions. Here, we sought to assess the Chikungunya-related digital behavior and the interplay between epidemiological figures and novel data streams traffic. Reaction to the recent outbreak was analyzed in terms of Google Trends, Google News and Twitter traffic, Wikipedia visits and edits, and PubMed articles, exploiting structural modelling equations. A total of 233,678 page-views and 150 edits on the Italian Wikipedia page, 3,702 tweets, 149 scholarly articles, and 3,073 news articles were retrieved. The relationship between overall Chikungunya cases, as well as autochthonous cases, and tweets production was found to be fully mediated by Chikungunya-related web searches. However, in the allochthonous/imported cases model, tweet production was not found to be significantly mediated by epidemiological figures, with web searches still significantly mediating tweet production. Inconsistent relationships were detected in mediation models involving Wikipedia usage as a mediator variable. Similarly, the effect between news consumption and tweets production was suppressed by the Wikipedia usage. A further inconsistent mediation was found in the case of the effect between Wikipedia usage and tweets production, with web searches as a mediator variable. When adjusting for the Internet penetration index, similar findings could be obtained, with the important exception that in the adjusted model the relationship between GN and Twitter was found to be partially mediated by Wikipedia usage. Furthermore, the link between Wikipedia usage and PubMed/MEDLINE was fully mediated by GN, differently from what was found in the unadjusted model. In conclusion—a significant public reaction to the current Chikungunya outbreak was documented. Health authorities should be aware of this, recognizing the role of new technologies for collecting public concerns and replying to them, disseminating awareness and avoid misleading information. PMID:29795578
A New Look at Data Usage by Using Metadata Attributes as Indicators of Data Quality

NASA Astrophysics Data System (ADS)

Won, Y. I.; Wanchoo, L.; Behnke, J.

2016-12-01

NASA's Earth Observing System Data and Information System (EOSDIS) stores and distributes data from EOS satellites, as well as ancillary, airborne, in-situ, and socio-economic data. Twelve EOSDIS data centers support different scientific disciplines by providing products and services tailored to specific science communities. Although discipline oriented, these data centers provide common data management functions of ingest, archive and distribution, as well as documentation of their data and services on their web-sites. The Earth Science Data and Information System (ESDIS) Project collects these metrics from the EOSDIS data centers on a daily basis through a tool called the ESDIS Metrics System (EMS). These metrics are used in this study. The implementation of the Earthdata Login - formerly known as the User Registration System (URS) - across the various NASA data centers provides the EMS additional information about users obtaining data products from EOSDIS data centers. These additional user attributes collected by the Earthdata login, such as the user's primary area of study can augment the understanding of data usage, which in turn can help the EOSDIS program better understand the users' needs. This study will review the key metrics (users, distributed volume, and files) in multiple ways to gain an understanding of the significance of the metadata. Characterizing the usability of data by key metadata elements such as discipline and study area, will assist in understanding how the users have evolved over time. The data usage pattern based on version numbers may also provide some insight into the level of data quality. In addition, the data metrics by various services such as the Open-source Project for a Network Data Access Protocol (OPeNDAP), Web Map Service (WMS), Web Coverage Service (WCS), and subsets, will address how these services have extended the usage of data. Over-all, this study will present the usage of data and metadata by metrics analyses and will assist data centers in better supporting the needs of the users.
HC StratoMineR: A Web-Based Tool for the Rapid Analysis of High-Content Datasets.

PubMed

Omta, Wienand A; van Heesbeen, Roy G; Pagliero, Romina J; van der Velden, Lieke M; Lelieveld, Daphne; Nellen, Mehdi; Kramer, Maik; Yeong, Marley; Saeidi, Amir M; Medema, Rene H; Spruit, Marco; Brinkkemper, Sjaak; Klumperman, Judith; Egan, David A

2016-10-01

High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that these datasets are frequently underutilized. Here, we present HC StratoMineR, a web-based tool for high-content data analysis. It is a decision-supportive platform that guides even non-expert users through a high-content data analysis workflow. HC StratoMineR is built by using My Structured Query Language for storage and querying, PHP: Hypertext Preprocessor as the main programming language, and jQuery for additional user interface functionality. R is used for statistical calculations, logic and data visualizations. Furthermore, C++ and graphical processor unit power is diffusely embedded in R by using the rcpp and rpud libraries for operations that are computationally highly intensive. We show that we can use HC StratoMineR for the analysis of multivariate data from a high-content siRNA knock-down screen and a small-molecule screen. It can be used to rapidly filter out undesirable data; to select relevant data; and to perform quality control, data reduction, data exploration, morphological hit picking, and data clustering. Our results demonstrate that HC StratoMineR can be used to functionally categorize HCS hits and, thus, provide valuable information for hit prioritization.
Codon usage bias reveals genomic adaptations to environmental conditions in an acidophilic consortium.

PubMed

Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet

2018-01-01

The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
Soil food web changes during spontaneous succession at post mining sites: a possible ecosystem engineering effect on food web organization?

PubMed

Frouz, Jan; Thébault, Elisa; Pižl, Václav; Adl, Sina; Cajthaml, Tomáš; Baldrián, Petr; Háněl, Ladislav; Starý, Josef; Tajovský, Karel; Materna, Jan; Nováková, Alena; de Ruiter, Peter C

2013-01-01

Parameters characterizing the structure of the decomposer food web, biomass of the soil microflora (bacteria and fungi) and soil micro-, meso- and macrofauna were studied at 14 non-reclaimed 1- 41-year-old post-mining sites near the town of Sokolov (Czech Republic). These observations on the decomposer food webs were compared with knowledge of vegetation and soil microstructure development from previous studies. The amount of carbon entering the food web increased with succession age in a similar way as the total amount of C in food web biomass and the number of functional groups in the food web. Connectance did not show any significant changes with succession age, however. In early stages of the succession, the bacterial channel dominated the food web. Later on, in shrub-dominated stands, the fungal channel took over. Even later, in the forest stage, the bacterial channel prevailed again. The best predictor of fungal bacterial ratio is thickness of fermentation layer. We argue that these changes correspond with changes in topsoil microstructure driven by a combination of plant organic matter input and engineering effects of earthworms. In early stages, soil is alkaline, and a discontinuous litter layer on the soil surface promotes bacterial biomass growth, so the bacterial food web channel can dominate. Litter accumulation on the soil surface supports the development of the fungal channel. In older stages, earthworms arrive, mix litter into the mineral soil and form an organo-mineral topsoil, which is beneficial for bacteria and enhances the bacterial food web channel.
Soil Food Web Changes during Spontaneous Succession at Post Mining Sites: A Possible Ecosystem Engineering Effect on Food Web Organization?

PubMed Central

Frouz, Jan; Thébault, Elisa; Pižl, Václav; Adl, Sina; Cajthaml, Tomáš; Baldrián, Petr; Háněl, Ladislav; Starý, Josef; Tajovský, Karel; Materna, Jan; Nováková, Alena; de Ruiter, Peter C.

2013-01-01

Parameters characterizing the structure of the decomposer food web, biomass of the soil microflora (bacteria and fungi) and soil micro-, meso- and macrofauna were studied at 14 non-reclaimed 1– 41-year-old post-mining sites near the town of Sokolov (Czech Republic). These observations on the decomposer food webs were compared with knowledge of vegetation and soil microstructure development from previous studies. The amount of carbon entering the food web increased with succession age in a similar way as the total amount of C in food web biomass and the number of functional groups in the food web. Connectance did not show any significant changes with succession age, however. In early stages of the succession, the bacterial channel dominated the food web. Later on, in shrub-dominated stands, the fungal channel took over. Even later, in the forest stage, the bacterial channel prevailed again. The best predictor of fungal bacterial ratio is thickness of fermentation layer. We argue that these changes correspond with changes in topsoil microstructure driven by a combination of plant organic matter input and engineering effects of earthworms. In early stages, soil is alkaline, and a discontinuous litter layer on the soil surface promotes bacterial biomass growth, so the bacterial food web channel can dominate. Litter accumulation on the soil surface supports the development of the fungal channel. In older stages, earthworms arrive, mix litter into the mineral soil and form an organo-mineral topsoil, which is beneficial for bacteria and enhances the bacterial food web channel. PMID:24260281
IF YOU BUILD IT WILL THEY COME? TEACHERS’ ONLINE USE OF STUDENT PERFORMANCE DATA

PubMed Central

Tyler, John H.

2014-01-01

Testing of students and computer systems to store, manage, analyze, and report the resulting test data have grown hand-in-hand. Extant research on teacher use of electronically stored data are largely qualitative and focused on the conditions necessary (but not sufficient) for effective teacher data use. Absent from the research is objective information on how much and in what ways teachers use computer-based student test data, even when supposed precursors of usage are in place. This paper addresses this knowledge gap by analyzing the online activities of teachers in one mid-size urban district. Utilizing Web logs collected between 2008 and 2010, I find low teacher interaction with Web-based pages that contain student test information that could potentially inform practice. I also find no evidence that teacher usage of Web-based student data are related to student achievement gains, but there is reason to believe these estimates are downwardly biased. PMID:25593564

Game-Theoretic Models for Usage-based Maintenance Contract

NASA Astrophysics Data System (ADS)

Husniah, H.; Wangsaputra, R.; Cakravastia, A.; Iskandar, B. P.

2018-03-01

A usage-based maintenance contracts with coordination and non coordination between two parties is studied in this paper. The contract is applied to a dump truck operated in a mining industry. The situation under study is that an agent offers service contract to the owner of the truck after warranty ends. This contract has only a time limit but no usage limit. If the total usage per period exceeds the maximum usage allowed in the contract, then the owner will be charged an additional cost. In general, the agent (Original Equipment Manufacturer/OEM) provides a full coverage of maintenance, which includes PM and CM under the lease contract. The decision problem for the owner is to select the best option offered that fits to its requirement, and the decision problem for the agent is to find the optimal maintenance efforts for a given price of the service option offered. We first find the optimal decisions using coordination scheme and then with non coordination scheme for both parties.
Alkemio: association of chemicals with biomedical topics by text and data mining.

PubMed

Gijón-Correas, José A; Andrade-Navarro, Miguel A; Fontaine, Jean F

2014-07-01

The PubMed® database of biomedical citations allows the retrieval of scientific articles studying the function of chemicals in biology and medicine. Mining millions of available citations to search reported associations between chemicals and topics of interest would require substantial human time. We have implemented the Alkemio text mining web tool and SOAP web service to help in this task. The tool uses biomedical articles discussing chemicals (including drugs), predicts their relatedness to the query topic with a naïve Bayesian classifier and ranks all chemicals by P-values computed from random simulations. Benchmarks on seven human pathways showed good retrieval performance (areas under the receiver operating characteristic curves ranged from 73.6 to 94.5%). Comparison with existing tools to retrieve chemicals associated to eight diseases showed the higher precision and recall of Alkemio when considering the top 10 candidate chemicals. Alkemio is a high performing web tool ranking chemicals for any biomedical topics and it is free to non-commercial users. http://cbdm.mdc-berlin.de/∼medlineranker/cms/alkemio. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
EAGLE: 'EAGLE'Is an' Algorithmic Graph Library for Exploration

DOE Office of Scientific and Technical Information (OSTI.GOV)

2015-01-16

The Resource Description Framework (RDF) and SPARQL Protocol and RDF Query Language (SPARQL) were introduced about a decade ago to enable flexible schema-free data interchange on the Semantic Web. Today data scientists use the framework as a scalable graph representation for integrating, querying, exploring and analyzing data sets hosted at different sources. With increasing adoption, the need for graph mining capabilities for the Semantic Web has emerged. Today there is no tools to conduct "graph mining" on RDF standard data sets. We address that need through implementation of popular iterative Graph Mining algorithms (Triangle count, Connected component analysis, degree distribution,more » diversity degree, PageRank, etc.). We implement these algorithms as SPARQL queries, wrapped within Python scripts and call our software tool as EAGLE. In RDF style, EAGLE stands for "EAGLE 'Is an' algorithmic graph library for exploration. EAGLE is like 'MATLAB' for 'Linked Data.'« less
Understanding the usage of content in a mental health intervention for depression: an analysis of log data.

PubMed

Van Gemert-Pijnen, Julia Ewc; Kelders, Saskia M; Bohlmeijer, Ernst T

2014-01-31

Web-based interventions for the early treatment of depressive symptoms can be considered effective in reducing mental complaints. However, there is a limited understanding of which elements in an intervention contribute to effectiveness. For efficiency and effectiveness of interventions, insight is needed into the use of content and persuasive features. The aims of this study were (1) to illustrate how log data can be used to understand the uptake of the content of a Web-based intervention that is based on the acceptance and commitment therapy (ACT) and (2) to discover how log data can be of value for improving the incorporation of content in Web-based interventions. Data from 206 participants (out of the 239) who started the first nine lessons of the Web-based intervention, Living to the Full, were used for a secondary analysis of a subset of the log data of the parent study about adherence to the intervention. The log files used in this study were per lesson: login, start mindfulness, download mindfulness, view success story, view feedback message, start multimedia, turn on text-message coach, turn off text-message coach, and view text message. Differences in usage between lessons were explored with repeated measures ANOVAs (analysis of variance). Differences between groups were explored with one-way ANOVAs. To explore the possible predictive value of the login per lesson quartiles on the outcome measures, four linear regressions were used with login quartiles as predictor and with the outcome measures (Center for Epidemiologic Studies-Depression [CES-D] and the Hospital Anxiety and Depression Scale-Anxiety [HADS-A] on post-intervention and follow-up) as dependent variables. A significant decrease in logins and in the use of content and persuasive features over time was observed. The usage of features varied significantly during the treatment process. The usage of persuasive features increased during the third part of the ACT (commitment to value-based living), which might indicate that at that stage motivational support was relevant. Higher logins over time (9 weeks) corresponded with a higher usage of features (in most cases significant); when predicting depressive symptoms at post-intervention, the linear regression yielded a significant model with login quartile as a significant predictor (explained variance is 2.7%). A better integration of content and persuasive features in the design of the intervention and a better intra-usability of features within the system are needed to identify which combination of features works best for whom. Pattern recognition can be used to tailor the intervention based on usage patterns from the earlier lessons and to support the uptake of content essential for therapy. An adaptable interface for a modular composition of therapy features supposes a dynamic approach for Web-based treatment; not a predefined path for all, but a flexible way to go through all features that have to be used.
Understanding the Usage of Content in a Mental Health Intervention for Depression: An Analysis of Log Data

PubMed Central

2014-01-01

Background Web-based interventions for the early treatment of depressive symptoms can be considered effective in reducing mental complaints. However, there is a limited understanding of which elements in an intervention contribute to effectiveness. For efficiency and effectiveness of interventions, insight is needed into the use of content and persuasive features. Objective The aims of this study were (1) to illustrate how log data can be used to understand the uptake of the content of a Web-based intervention that is based on the acceptance and commitment therapy (ACT) and (2) to discover how log data can be of value for improving the incorporation of content in Web-based interventions. Methods Data from 206 participants (out of the 239) who started the first nine lessons of the Web-based intervention, Living to the Full, were used for a secondary analysis of a subset of the log data of the parent study about adherence to the intervention. The log files used in this study were per lesson: login, start mindfulness, download mindfulness, view success story, view feedback message, start multimedia, turn on text-message coach, turn off text-message coach, and view text message. Differences in usage between lessons were explored with repeated measures ANOVAs (analysis of variance). Differences between groups were explored with one-way ANOVAs. To explore the possible predictive value of the login per lesson quartiles on the outcome measures, four linear regressions were used with login quartiles as predictor and with the outcome measures (Center for Epidemiologic Studies—Depression [CES-D] and the Hospital Anxiety and Depression Scale—Anxiety [HADS-A] on post-intervention and follow-up) as dependent variables. Results A significant decrease in logins and in the use of content and persuasive features over time was observed. The usage of features varied significantly during the treatment process. The usage of persuasive features increased during the third part of the ACT (commitment to value-based living), which might indicate that at that stage motivational support was relevant. Higher logins over time (9 weeks) corresponded with a higher usage of features (in most cases significant); when predicting depressive symptoms at post-intervention, the linear regression yielded a significant model with login quartile as a significant predictor (explained variance is 2.7%). Conclusions A better integration of content and persuasive features in the design of the intervention and a better intra-usability of features within the system are needed to identify which combination of features works best for whom. Pattern recognition can be used to tailor the intervention based on usage patterns from the earlier lessons and to support the uptake of content essential for therapy. An adaptable interface for a modular composition of therapy features supposes a dynamic approach for Web-based treatment; not a predefined path for all, but a flexible way to go through all features that have to be used. PMID:24486914
Results from Two Years of Web-Based Astronomy Teaching

NASA Astrophysics Data System (ADS)

Wallin, J.

1996-12-01

During the last two years, course notes, supplemental material, bulletin boards, and an interactive quiz system have been developed for the introductory astronomy course at George Mason University. In this talk, I will present results about the level of Web literacy, Web usage, and educational effectiveness of this system based on in-class surveys and test results. The results presented are based on a 300 person survey course composed primarily of non-science majors. Although this course currently includes a lecture section, we plan to offer this as a web-based distance learning course within six months.
Using Syntactic Patterns to Enhance Text Analytics

ERIC Educational Resources Information Center

Meyer, Bradley B.

2017-01-01

Large scale product and service reviews proliferate and are commonly found across the web. The ability to harvest, digest and analyze a large corpus of reviews from online websites is still however a difficult problem. This problem is referred to as "opinion mining." Opinion mining is an important area of research as advances in the…
30 CFR 74.16 - Material required for record.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Material required for record. 74.16 Section 74.16 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR COAL MINE SAFETY AND... deliver a complete sampling device free of charge to NIOSH at the address specified on the NIOSH Web page...
Types of Online Hierarchical Repository Structures

ERIC Educational Resources Information Center

Hershkovitz, Arnon; Azran, Ronit; Hardof-Jaffe, Sharon; Nachmias, Rafi

2011-01-01

This study presents an empirical investigation of online hierarchical repositories of items presented to university students in Web-supported course websites, using Web mining methods. To this end, data from 1747 courses were collected, and the use of online repositories of content items in these courses was examined. At a later stage, courses…
Analyzing Information Seeking and Drug-Safety Alert Response by Health Care Professionals as New Methods for Surveillance

PubMed Central

Pernek, Igor; Stiglic, Gregor; Leskovec, Jure; Strasberg, Howard R; Shah, Nigam Haresh

2015-01-01

Background Patterns in general consumer online search logs have been used to monitor health conditions and to predict health-related activities, but the multiple contexts within which consumers perform online searches make significant associations difficult to interpret. Physician information-seeking behavior has typically been analyzed through survey-based approaches and literature reviews. Activity logs from health care professionals using online medical information resources are thus a valuable yet relatively untapped resource for large-scale medical surveillance. Objective To analyze health care professionals’ information-seeking behavior and assess the feasibility of measuring drug-safety alert response from the usage logs of an online medical information resource. Methods Using two years (2011-2012) of usage logs from UpToDate, we measured the volume of searches related to medical conditions with significant burden in the United States, as well as the seasonal distribution of those searches. We quantified the relationship between searches and resulting page views. Using a large collection of online mainstream media articles and Web log posts we also characterized the uptake of a Food and Drug Administration (FDA) alert via changes in UpToDate search activity compared with general online media activity related to the subject of the alert. Results Diseases and symptoms dominate UpToDate searches. Some searches result in page views of only short duration, while others consistently result in longer-than-average page views. The response to an FDA alert for Celexa, characterized by a change in UpToDate search activity, differed considerably from general online media activity. Changes in search activity appeared later and persisted longer in UpToDate logs. The volume of searches and page view durations related to Celexa before the alert also differed from those after the alert. Conclusions Understanding the information-seeking behavior associated with online evidence sources can offer insight into the information needs of health professionals and enable large-scale medical surveillance. Our Web log mining approach has the potential to monitor responses to FDA alerts at a national level. Our findings can also inform the design and content of evidence-based medical information resources such as UpToDate. PMID:26293444
Analyzing Information Seeking and Drug-Safety Alert Response by Health Care Professionals as New Methods for Surveillance.

PubMed

Callahan, Alison; Pernek, Igor; Stiglic, Gregor; Leskovec, Jure; Strasberg, Howard R; Shah, Nigam Haresh

2015-08-20

Patterns in general consumer online search logs have been used to monitor health conditions and to predict health-related activities, but the multiple contexts within which consumers perform online searches make significant associations difficult to interpret. Physician information-seeking behavior has typically been analyzed through survey-based approaches and literature reviews. Activity logs from health care professionals using online medical information resources are thus a valuable yet relatively untapped resource for large-scale medical surveillance. To analyze health care professionals' information-seeking behavior and assess the feasibility of measuring drug-safety alert response from the usage logs of an online medical information resource. Using two years (2011-2012) of usage logs from UpToDate, we measured the volume of searches related to medical conditions with significant burden in the United States, as well as the seasonal distribution of those searches. We quantified the relationship between searches and resulting page views. Using a large collection of online mainstream media articles and Web log posts we also characterized the uptake of a Food and Drug Administration (FDA) alert via changes in UpToDate search activity compared with general online media activity related to the subject of the alert. Diseases and symptoms dominate UpToDate searches. Some searches result in page views of only short duration, while others consistently result in longer-than-average page views. The response to an FDA alert for Celexa, characterized by a change in UpToDate search activity, differed considerably from general online media activity. Changes in search activity appeared later and persisted longer in UpToDate logs. The volume of searches and page view durations related to Celexa before the alert also differed from those after the alert. Understanding the information-seeking behavior associated with online evidence sources can offer insight into the information needs of health professionals and enable large-scale medical surveillance. Our Web log mining approach has the potential to monitor responses to FDA alerts at a national level. Our findings can also inform the design and content of evidence-based medical information resources such as UpToDate.
The Online Expectations of College-Bound Juniors and Seniors. E-Expectations Report, 2012

ERIC Educational Resources Information Center

Noel-Levitz, Inc, 2012

2012-01-01

Noel-Levitz, OmniUpdate, CollegeWeekLive, and NRCCUA[R] (National Research Center for College & University Admissions) conducted a survey of 2,000 college-bound juniors and seniors about their expectations for college Web sites, mobile usage, e-mail, and social media. Among the findings: (1) More than 50 percent of students said the Web played a…
The Development of a Web-Based College Awareness Program

ERIC Educational Resources Information Center

Roberson, Keith W.

2010-01-01

The purpose of this study was to develop and evaluate a web-based college awareness program that would aid college-bound students in their search for a college or university that fit their interests. Since there is an increase in computer usage among high school aged students, and there are a very few college awareness programs included in the…
Location-based Web Search

NASA Astrophysics Data System (ADS)

Ahlers, Dirk; Boll, Susanne

In recent years, the relation of Web information to a physical location has gained much attention. However, Web content today often carries only an implicit relation to a location. In this chapter, we present a novel location-based search engine that automatically derives spatial context from unstructured Web resources and allows for location-based search: our focused crawler applies heuristics to crawl and analyze Web pages that have a high probability of carrying a spatial relation to a certain region or place; the location extractor identifies the actual location information from the pages; our indexer assigns a geo-context to the pages and makes them available for a later spatial Web search. We illustrate the usage of our spatial Web search for location-based applications that provide information not only right-in-time but also right-on-the-spot.
Specification Patent Management for Web Application Platform Ecosystem

NASA Astrophysics Data System (ADS)

Fukami, Yoshiaki; Isshiki, Masao; Takeda, Hideaki; Ohmukai, Ikki; Kokuryo, Jiro

Diversified usage of web applications has encouraged disintegration of web platform into management of identification and applications. Users make use of various kinds of data linked to their identity with multiple applications on certain social web platforms such as Facebook or MySpace. There has emerged competition among web application platforms. Platformers can design relationship with developers by controlling patent of their own specification and adopt open technologies developed external organizations. Platformers choose a way to open according to feature of the specification and their position. Patent management of specification come to be a key success factor to build competitive web application platforms. Each way to attract external developers such as standardization, open source has not discussed and analyzed all together.
Web service module for access to g-Lite

NASA Astrophysics Data System (ADS)

Goranova, R.; Goranov, G.

2012-10-01

G-Lite is a lightweight grid middleware for grid computing installed on all clusters of the European Grid Infrastructure (EGI). The middleware is partially service-oriented and does not provide well-defined Web services for job management. The existing Web services in the environment cannot be directly used by grid users for building service compositions in the EGI. In this article we present a module of well-defined Web services for job management in the EGI. We describe the architecture of the module and the design of the developed Web services. The presented Web services are composable and can participate in service compositions (workflows). An example of usage of the module with tools for service compositions in g-Lite is shown.
Simple, Scalable, Script-based, Science Processor for Measurements - Data Mining Edition (S4PM-DME)

NASA Astrophysics Data System (ADS)

Pham, L. B.; Eng, E. K.; Lynnes, C. S.; Berrick, S. W.; Vollmer, B. E.

2005-12-01

The S4PM-DME is the Goddard Earth Sciences Distributed Active Archive Center's (GES DAAC) web-based data mining environment. The S4PM-DME replaces the Near-line Archive Data Mining (NADM) system with a better web environment and a richer set of production rules. S4PM-DME enables registered users to submit and execute custom data mining algorithms. The S4PM-DME system uses the GES DAAC developed Simple Scalable Script-based Science Processor for Measurements (S4PM) to automate tasks and perform the actual data processing. A web interface allows the user to access the S4PM-DME system. The user first develops personalized data mining algorithm on his/her home platform and then uploads them to the S4PM-DME system. Algorithms in C and FORTRAN languages are currently supported. The user developed algorithm is automatically audited for any potential security problems before it is installed within the S4PM-DME system and made available to the user. Once the algorithm has been installed the user can promote the algorithm to the "operational" environment. From here the user can search and order the data available in the GES DAAC archive for his/her science algorithm. The user can also set up a processing subscription. The subscription will automatically process new data as it becomes available in the GES DAAC archive. The generated mined data products are then made available for FTP pickup. The benefits of using S4PM-DME are 1) to decrease the downloading time it typically takes a user to transfer the GES DAAC data to his/her system thus off-load the heavy network traffic, 2) to free-up the load on their system, and last 3) to utilize the rich and abundance ocean, atmosphere data from the MODIS and AIRS instruments available from the GES DAAC.
Usage and Usability of a Web-based Program for Family Caregivers of Older People in Three European Countries: A Mixed-Methods Evaluation.

PubMed

Barbabella, Francesco; Poli, Arianna; Hanson, Elizabeth; Andréasson, Frida; Salzmann, Benjamin; Döhner, Hanneli; Papa, Roberta; Efthymiou, Areti; Valenza, Silvia; Pelliccioni, Giuseppe; Lamura, Giovanni

2018-05-01

InformCare is a European Web platform that supports informal caregivers of older people by providing access to online information and professional and peer support. The aim of this study was to assess the usage and usability of a psychosocial Web-based program carried out in three European countries (Italy, Sweden, and Germany). A mixed-methods sequential explanatory design was adopted, comprising baseline and postintervention assessments, as well as combined thematic content analysis of results and focus group findings. A convenience sample of 118 caregivers was enrolled, of whom 94 used the services offered by the program at least once. The subsamples in the three countries used the platform in different ways, with a predominance of passive strategies (eg, seeking information and reading other people's comments) for Italian caregivers, and more active usage by Swedish and German caregivers. The usability assessment showed that the platform was perceived well by Italian and German caregivers, whereas technical problems affected the Swedish sample's experiences. Focus group data highlighted user satisfaction with the online support and reliability of the environment. Recommendations for practitioners are to ensure digital training for caregivers who have lower confidence in use of the Internet, to involve different healthcare professionals in the provision of professional support, and to adequately manage online community building.
[Remote access to a web-based image distribution system].

PubMed

Bergh, B; Schlaefke, A; Frankenbach, R; Vogl, T J

2004-06-01

To assess different network and security technologies for remote access to a web-based image distribution system of a hospital intranet. Following preparatory testing, the time-to-display (TTD) was measured for three image types (CR, CT, MR). The evaluation included two remote access technologies consisting of direct ISDN-Dial-Up or VPN connection (Virtual Private Network), with three different connection speeds of 64, 128 (ISDN) and 768 Kbit/s (ADSL-Asymmetric Digital Subscriber Line), as well as with lossless and lossy compression. Depending on the image type, the TTD with lossless compression for 64 Kbit/s varied from 1 : 00 to 2 : 40 minutes, for 128 Kbit/s from 0 : 35 to 1 : 15 minutes and for ADSL from 0 : 15 to 0 : 45 minutes. The ISDN-Dial-Up connection was superior to VPN technology at 64 Kbit/s but did not allow higher connection speeds. Lossy compression reduced the TTD by half for all measurements. VPN technology is preferable to direct Dial-Up connections since it offers higher connection speeds and advantages in usage and security. For occasional usage, 128 Kbit/s (ISDN) can be considered sufficient, especially in conjunction with lossy compression. ADSL should be chosen when a more frequent usage is anticipated, whereby lossy compression may be omitted. Due to higher bandwidths and improved usability, the web-based approach appears superior to conventional teleradiology systems.
ESTminer: a Web interface for mining EST contig and cluster databases.

PubMed

Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R

2005-03-01

ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.

Socio-contextual Network Mining for User Assistance in Web-based Knowledge Gathering Tasks

NASA Astrophysics Data System (ADS)

Rajendran, Balaji; Kombiah, Iyakutti

Web-based Knowledge Gathering (WKG) is a specialized and complex information seeking task carried out by many users on the web, for their various learning, and decision-making requirements. We construct a contextual semantic structure by observing the actions of the users involved in WKG task, in order to gain an understanding of their task and requirement. We also build a knowledge warehouse in the form of a master Semantic Link Network (SLX) that accommodates and assimilates all the contextual semantic structures. This master SLX, which is a socio-contextual network, is then mined to provide contextual inputs to the current users through their agents. We validated our approach through experiments and analyzed the benefits to the users in terms of resource explorations and the time saved. The results are positive enough to motivate us to implement in a larger scale.
A Study of the Demographics of Web-Based Health-Related Social Media Users.

PubMed

Sadah, Shouq A; Shahbazi, Moloud; Wiley, Matthew T; Hristidis, Vagelis

2015-08-06

The rapid spread of Web-based social media in recent years has impacted how patients share health-related information. However, little work has studied the demographics of these users. Our aim was to study the demographics of users who participate in health-related Web-based social outlets to identify possible links to health care disparities. We analyze and compare three different types of health-related social outlets: (1) general Web-based social networks, Twitter and Google+, (2) drug review websites, and (3) health Web forums. We focus on the following demographic attributes: age, gender, ethnicity, location, and writing level. We build and evaluate domain-specific classifiers to infer missing data where possible. The estimated demographic statistics are compared against various baselines, such as Internet and social networks usage of the population. We found that (1) drug review websites and health Web forums are dominated by female users, (2) the participants of health-related social outlets are generally older with the exception of the 65+ years bracket, (3) blacks are underrepresented in health-related social networks, (4) users in areas with better access to health care participate more in Web-based health-related social outlets, and (5) the writing level of users in health-related social outlets is significantly lower than the reading level of the population. We identified interesting and actionable disparities in the participation of various demographic groups to various types of health-related social outlets. These disparities are significantly distinct from the disparities in Internet usage or general social outlets participation.
Development of a Web-Based Clinical Decision Support System for Drug Prescription: Non-Interventional Naturalistic Description of the Antipsychotic Prescription Patterns in 4345 Outpatients and Future Applications.

PubMed

Berrouiguet, Sofian; Barrigón, Maria Luisa; Brandt, Sara A; Ovejero-García, Santiago; Álvarez-García, Raquel; Carballo, Juan Jose; Lenca, Philippe; Courtet, Philippe; Baca-García, Enrique

2016-01-01

The emergence of electronic prescribing devices with clinical decision support systems (CDSS) is able to significantly improve management pharmacological treatments. We developed a web application available on smartphones in order to help clinicians monitor prescription and further propose CDSS. A web application (www.MEmind.net) was developed to assess patients and collect data regarding gender, age, diagnosis and treatment. We analyzed antipsychotic prescriptions in 4345 patients attended in five Psychiatric Community Mental Health Centers from June 2014 to October 2014. The web-application reported average daily dose prescribed for antipsychotics, prescribed daily dose (PDD), and the PDD to defined daily dose (DDD) ratio. The MEmind web-application reported that antipsychotics were used in 1116 patients out of the total sample, mostly in 486 (44%) patients with schizophrenia related disorders but also in other diagnoses. Second generation antipsychotics (quetiapine, aripiprazole and long-acting paliperidone) were preferably employed. Low doses were more frequently used than high doses. Long acting paliperidone and ziprasidone however, were the only two antipsychotics used at excessive dosing. Antipsychotic polypharmacy was used in 287 (26%) patients with classic depot drugs, clotiapine, amisulpride and clozapine. In this study we describe the first step of the development of a web application that is able to make polypharmacy, high dose usage and off label usage of antipsychotics visible to clinicians. Current development of the MEmind web application may help to improve prescription security via momentary feedback of prescription and clinical decision support system.
Feasibility and preliminary efficacy of a web-based smoking cessation intervention for HIV-infected smokers: a randomized controlled trial.

PubMed

Shuter, Jonathan; Morales, Daniela A; Considine-Dunn, Shannon E; An, Lawrence C; Stanton, Cassandra A

2014-09-01

To evaluate the feasibility and preliminary efficacy of a Web-based tobacco treatment for persons living with HIV (PLWH). Prospective, randomized controlled trial. HIV-care center in the Bronx, New York. Eligibility criteria included HIV infection, current tobacco usage, interest in quitting, and access to a computer with internet. One hundred thirty-eight subjects enrolled, and 134 completed the study. Positively Smoke Free on the Web (PSFW), an 8-session, 7-week targeted tobacco treatment program for PLWH, was compared with standard care (brief advice to quit and self-help brochure). All subjects were offered nicotine patches. The main feasibility outcomes were number of sessions logged into, number of Web pages visited, number of interactive clicks, and total time logged in. The main efficacy outcome was biochemically verified, 7-day point prevalence abstinence 3 months after intervention. PSFW subjects logged into a mean of 5.5 of 8 sessions and 26.2 of 41 pages. They executed a mean of 10 interactive clicks during a mean total of 59.8 minutes logged in. Most required reminder phone calls to complete the sessions. Educational level, anxiety score, and home access of the Web site were associated with Web site usage. Ten percent of the PSFW group vs. 4.3% of controls achieved the abstinence end point. Among those who completed all 8 sessions, 17.9% were abstinent, and among women completers, 30.8% were abstinent. Web-based treatment is a feasible strategy for PLWH smokers, and preliminary findings suggest therapeutic efficacy.
Development of a Web-Based Clinical Decision Support System for Drug Prescription: Non-Interventional Naturalistic Description of the Antipsychotic Prescription Patterns in 4345 Outpatients and Future Applications

PubMed Central

Berrouiguet, Sofian; Barrigón, Maria Luisa; Brandt, Sara A.; Ovejero-García, Santiago; Álvarez-García, Raquel; Carballo, Juan Jose; Lenca, Philippe; Courtet, Philippe; Baca-García, Enrique

2016-01-01

Purpose The emergence of electronic prescribing devices with clinical decision support systems (CDSS) is able to significantly improve management pharmacological treatments. We developed a web application available on smartphones in order to help clinicians monitor prescription and further propose CDSS. Method A web application (www.MEmind.net) was developed to assess patients and collect data regarding gender, age, diagnosis and treatment. We analyzed antipsychotic prescriptions in 4345 patients attended in five Psychiatric Community Mental Health Centers from June 2014 to October 2014. The web-application reported average daily dose prescribed for antipsychotics, prescribed daily dose (PDD), and the PDD to defined daily dose (DDD) ratio. Results The MEmind web-application reported that antipsychotics were used in 1116 patients out of the total sample, mostly in 486 (44%) patients with schizophrenia related disorders but also in other diagnoses. Second generation antipsychotics (quetiapine, aripiprazole and long-acting paliperidone) were preferably employed. Low doses were more frequently used than high doses. Long acting paliperidone and ziprasidone however, were the only two antipsychotics used at excessive dosing. Antipsychotic polypharmacy was used in 287 (26%) patients with classic depot drugs, clotiapine, amisulpride and clozapine. Conclusions In this study we describe the first step of the development of a web application that is able to make polypharmacy, high dose usage and off label usage of antipsychotics visible to clinicians. Current development of the MEmind web application may help to improve prescription security via momentary feedback of prescription and clinical decision support system. PMID:27764107
PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan

PubMed Central

Kinjo, Akira R.; Yamashita, Reiko; Nakamura, Haruki

2010-01-01

This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/ PMID:20798081
PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan.

PubMed

Kinjo, Akira R; Yamashita, Reiko; Nakamura, Haruki

2010-08-25

This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/
Near-line Archive Data Mining at the Goddard Distributed Active Archive Center

NASA Astrophysics Data System (ADS)

Pham, L.; Mack, R.; Eng, E.; Lynnes, C.

2002-12-01

NASA's Earth Observing System (EOS) is generating immense volumes of data, in some cases too much to provide to users with data-intensive needs. As an alternative to moving the data to the user and his/her research algorithms, we are providing a means to move the algorithms to the data. The Near-line Archive Data Mining (NADM) system is the Goddard Earth Sciences Distributed Active Archive Center's (GES DAAC) web data mining portal to the EOS Data and Information System (EOSDIS) data pool, a 50-TB online disk cache. The NADM web portal enables registered users to submit and execute data mining algorithm codes on the data in the EOSDIS data pool. A web interface allows the user to access the NADM system. The users first develops personalized data mining code on their home platform and then uploads them to the NADM system. The C, FORTRAN and IDL languages are currently supported. The user developed code is automatically audited for any potential security problems before it is installed within the NADM system and made available to the user. Once the code has been installed the user is provided a test environment where he/she can test the execution of the software against data sets of the user's choosing. When the user is satisfied with the results, he/she can promote their code to the "operational" environment. From here the user can interactively run his/her code on the data available in the EOSDIS data pool. The user can also set up a processing subscription. The subscription will automatically process new data as it becomes available in the EOSDIS data pool. The generated mined data products are then made available for FTP pickup. The NADM system uses the GES DAAC-developed Simple Scalable Script-based Science Processor (S4P) to automate tasks and perform the actual data processing. Users will also have the option of selecting a DAAC-provided data mining algorithm and using it to process the data of their choice.
Clustering and Dimensionality Reduction to Discover Interesting Patterns in Binary Data

NASA Astrophysics Data System (ADS)

Palumbo, Francesco; D'Enza, Alfonso Iodice

The attention towards binary data coding increased consistently in the last decade due to several reasons. The analysis of binary data characterizes several fields of application, such as market basket analysis, DNA microarray data, image mining, text mining and web-clickstream mining. The paper illustrates two different approaches exploiting a profitable combination of clustering and dimensionality reduction for the identification of non-trivial association structures in binary data. An application in the Association Rules framework supports the theory with the empirical evidence.
Recent advancements on the development of web-based applications for the implementation of seismic analysis and surveillance systems

NASA Astrophysics Data System (ADS)

Friberg, P. A.; Luis, R. S.; Quintiliani, M.; Lisowski, S.; Hunter, S.

2014-12-01

Recently, a novel set of modules has been included in the Open Source Earthworm seismic data processing system, supporting the use of web applications. These include the Mole sub-system, for storing relevant event data in a MySQL database (see M. Quintiliani and S. Pintore, SRL, 2013), and an embedded webserver, Moleserv, for serving such data to web clients in QuakeML format. These modules have enabled, for the first time using Earthworm, the use of web applications for seismic data processing. These can greatly simplify the operation and maintenance of seismic data processing centers by having one or more servers providing the relevant data as well as the data processing applications themselves to client machines running arbitrary operating systems.Web applications with secure online web access allow operators to work anywhere, without the often cumbersome and bandwidth hungry use of secure shell or virtual private networks. Furthermore, web applications can seamlessly access third party data repositories to acquire additional information, such as maps. Finally, the usage of HTML email brought the possibility of specialized web applications, to be used in email clients. This is the case of EWHTMLEmail, which produces event notification emails that are in fact simple web applications for plotting relevant seismic data.Providing web services as part of Earthworm has enabled a number of other tools as well. One is ISTI's EZ Earthworm, a web based command and control system for an otherwise command line driven system; another is a waveform web service. The waveform web service serves Earthworm data to additional web clients for plotting, picking, and other web-based processing tools. The current Earthworm waveform web service hosts an advanced plotting capability for providing views of event-based waveforms from a Mole database served by Moleserve.The current trend towards the usage of cloud services supported by web applications is driving improvements in JavaScript, css and HTML, as well as faster and more efficient web browsers, including mobile. It is foreseeable that in the near future, web applications are as powerful and efficient as native applications. Hence the work described here has been the first step towards bringing the Open Source Earthworm seismic data processing system to this new paradigm.
TREC Microblog 2012 Track: Real-Time Algorithm for Microblog Ranking Systems

DTIC Science & Technology

2012-11-01

such as information about the tweet and the user profile. We collected those tweets by means of web crawler and extract several features from the raw...Mining Text Data. 2012. [5] D. Feltoni. Twittersa: un sistema per l’analisi del sentimento nelle reti sociali. Master’s thesis, Roma Tre University...Morris. Twittersearch: a comparison of microblog search and web search. Proceedings of the fourth ACM international conference on Web search, 2011
Stress measurements in Kuzbass mines using photoelastic sensors

NASA Astrophysics Data System (ADS)

Schastlivtsev, E.

1996-06-01

The basic amount of known measurements of stressed state in front of development workings' faces was carried out with the use of hydraulic sensors, which give an information about principal stresses without their separation. Besides, the availability of pipe-line and cumbersome equipment make more complicated and sometimes impossible the process of stresses' measurements during works in mining process. In our opinion, the borehole and photoelastic sensors at high degree satisfy with the conditions of stresses' measurements in front of mining workings' faces. The principal idea of the method is in the usage of proper face advancing aiming the estimation of the field stresses in its neighborhood. Borehole and photoelastic sensors, fixed in the advanced boreholes, drilled from the active face react to the field change of stresses or deformation caused by working face advancing. While obtaining this information we may judge about the distribution of additional stresses in rock of face's neighborhood and concentration of stresses in front of face. The usage of cavity (because of face advancing) in the quality of disturbing influence in combination with the properties of ring photoelastic sensor to given an information about magnitude and direction of secondary principle stresses, permits us to obtain rather a simple and not labor consuming method of investigation of field additional stresses in the working's face neighborhood.
Identifying Engineering Students' English Sentence Reading Comprehension Errors: Applying a Data Mining Technique

ERIC Educational Resources Information Center

Tsai, Yea-Ru; Ouyang, Chen-Sen; Chang, Yukon

2016-01-01

The purpose of this study is to propose a diagnostic approach to identify engineering students' English reading comprehension errors. Student data were collected during the process of reading texts of English for science and technology on a web-based cumulative sentence analysis system. For the analysis, the association-rule, data mining technique…
Application of Learning Analytics Using Clustering Data Mining for Students' Disposition Analysis

ERIC Educational Resources Information Center

Bharara, Sanyam; Sabitha, Sai; Bansal, Abhay

2018-01-01

Learning Analytics (LA) is an emerging field in which sophisticated analytic tools are used to improve learning and education. It draws from, and is closely tied to, a series of other fields of study like business intelligence, web analytics, academic analytics, educational data mining, and action analytics. The main objective of this research…
Rare disease diagnosis: A review of web search, social media and large-scale data-mining approaches.

PubMed

Svenstrup, Dan; Jørgensen, Henrik L; Winther, Ole

2015-01-01

Physicians and the general public are increasingly using web-based tools to find answers to medical questions. The field of rare diseases is especially challenging and important as shown by the long delay and many mistakes associated with diagnoses. In this paper we review recent initiatives on the use of web search, social media and data mining in data repositories for medical diagnosis. We compare the retrieval accuracy on 56 rare disease cases with known diagnosis for the web search tools google.com, pubmed.gov, omim.org and our own search tool findzebra.com. We give a detailed description of IBM's Watson system and make a rough comparison between findzebra.com and Watson on subsets of the Doctor's dilemma dataset. The recall@10 and recall@20 (fraction of cases where the correct result appears in top 10 and top 20) for the 56 cases are found to be be 29%, 16%, 27% and 59% and 32%, 18%, 34% and 64%, respectively. Thus, FindZebra has a significantly (p < 0.01) higher recall than the other 3 search engines. When tested under the same conditions, Watson and FindZebra showed similar recall@10 accuracy. However, the tests were performed on different subsets of Doctors dilemma questions. Advances in technology and access to high quality data have opened new possibilities for aiding the diagnostic process. Specialized search engines, data mining tools and social media are some of the areas that hold promise.
Rare disease diagnosis: A review of web search, social media and large-scale data-mining approaches

PubMed Central

Svenstrup, Dan; Jørgensen, Henrik L; Winther, Ole

2015-01-01

Physicians and the general public are increasingly using web-based tools to find answers to medical questions. The field of rare diseases is especially challenging and important as shown by the long delay and many mistakes associated with diagnoses. In this paper we review recent initiatives on the use of web search, social media and data mining in data repositories for medical diagnosis. We compare the retrieval accuracy on 56 rare disease cases with known diagnosis for the web search tools google.com, pubmed.gov, omim.org and our own search tool findzebra.com. We give a detailed description of IBM's Watson system and make a rough comparison between findzebra.com and Watson on subsets of the Doctor's dilemma dataset. The recall@10 and recall@20 (fraction of cases where the correct result appears in top 10 and top 20) for the 56 cases are found to be be 29%, 16%, 27% and 59% and 32%, 18%, 34% and 64%, respectively. Thus, FindZebra has a significantly (p < 0.01) higher recall than the other 3 search engines. When tested under the same conditions, Watson and FindZebra showed similar recall@10 accuracy. However, the tests were performed on different subsets of Doctors dilemma questions. Advances in technology and access to high quality data have opened new possibilities for aiding the diagnostic process. Specialized search engines, data mining tools and social media are some of the areas that hold promise. PMID:26442199
Web Service Distributed Management Framework for Autonomic Server Virtualization

NASA Astrophysics Data System (ADS)

Solomon, Bogdan; Ionescu, Dan; Litoiu, Marin; Mihaescu, Mircea

Virtualization for the x86 platform has imposed itself recently as a new technology that can improve the usage of machines in data centers and decrease the cost and energy of running a high number of servers. Similar to virtualization, autonomic computing and more specifically self-optimization, aims to improve server farm usage through provisioning and deprovisioning of instances as needed by the system. Autonomic systems are able to determine the optimal number of server machines - real or virtual - to use at a given time, and add or remove servers from a cluster in order to achieve optimal usage. While provisioning and deprovisioning of servers is very important, the way the autonomic system is built is also very important, as a robust and open framework is needed. One such management framework is the Web Service Distributed Management (WSDM) system, which is an open standard of the Organization for the Advancement of Structured Information Standards (OASIS). This paper presents an open framework built on top of the WSDM specification, which aims to provide self-optimization for applications servers residing on virtual machines.
A web-based genomic sequence database for the Streptomycetaceae: a tool for systematics and genome mining

USDA-ARS?s Scientific Manuscript database

The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...
Online Persistence in Higher Education Web-Supported Courses

ERIC Educational Resources Information Center

Hershkovitz, Arnon; Nachmias, Rafi

2011-01-01

This research consists of an empirical study of online persistence in Web-supported courses in higher education, using Data Mining techniques. Log files of 58 Moodle websites accompanying Tel Aviv University courses were drawn, recording the activity of 1189 students in 1897 course enrollments during the academic year 2008/9, and were analyzed…
Mining the Human Phenome using Semantic Web Technologies: A Case Study for Type 2 Diabetes

PubMed Central

Pathak, Jyotishman; Kiefer, Richard C.; Bielinski, Suzette J.; Chute, Christopher G.

2012-01-01

The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. However, historically GWAS have been limited by inadequate sample size due to associated costs for genotyping and phenotyping of study subjects. This has prompted several academic medical centers to form “biobanks” where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored on large number of subjects. This provides tremendous opportunities to discover novel genotype-phenotype associations and foster hypothesis generation. In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR diagnoses and procedure data, and enable federated querying via standardized Web protocols to identify subjects genotyped with Type 2 Diabetes for discovering gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries. PMID:23304343

Mining the human phenome using semantic web technologies: a case study for Type 2 Diabetes.

PubMed

Pathak, Jyotishman; Kiefer, Richard C; Bielinski, Suzette J; Chute, Christopher G

2012-01-01

The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. However, historically GWAS have been limited by inadequate sample size due to associated costs for genotyping and phenotyping of study subjects. This has prompted several academic medical centers to form "biobanks" where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored on large number of subjects. This provides tremendous opportunities to discover novel genotype-phenotype associations and foster hypothesis generation. In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR diagnoses and procedure data, and enable federated querying via standardized Web protocols to identify subjects genotyped with Type 2 Diabetes for discovering gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries.
Astroinformatics, data mining and the future of astronomical research

NASA Astrophysics Data System (ADS)

Brescia, Massimo; Longo, Giuseppe

2013-08-01

Astronomy, as many other scientific disciplines, is facing a true data deluge which is bound to change both the praxis and the methodology of every day research work. The emerging field of astroinformatics, while on the one end appears crucial to face the technological challenges, on the other is opening new exciting perspectives for new astronomical discoveries through the implementation of advanced data mining procedures. The complexity of astronomical data and the variety of scientific problems, however, call for innovative algorithms and methods as well as for an extreme usage of ICT technologies.
OSCAR4: a flexible architecture for chemical text-mining.

PubMed

Jessop, David M; Adams, Sam E; Willighagen, Egon L; Hawizy, Lezan; Murray-Rust, Peter

2011-10-14

The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.
Integration of Text- and Data-Mining Technologies for Use in Banking Applications

NASA Astrophysics Data System (ADS)

Maslankowski, Jacek

Unstructured data, most of it in the form of text files, typically accounts for 85% of an organization's knowledge stores, but it's not always easy to find, access, analyze or use (Robb 2004). That is why it is important to use solutions based on text and data mining. This solution is known as duo mining. This leads to improve management based on knowledge owned in organization. The results are interesting. Data mining provides to lead with structuralized data, usually powered from data warehouses. Text mining, sometimes called web mining, looks for patterns in unstructured data — memos, document and www. Integrating text-based information with structured data enriches predictive modeling capabilities and provides new stores of insightful and valuable information for driving business and research initiatives forward.
Mining of high utility-probability sequential patterns from uncertain databases

PubMed Central

Zhang, Binbin; Fournier-Viger, Philippe; Li, Ting

2017-01-01

High-utility sequential pattern mining (HUSPM) has become an important issue in the field of data mining. Several HUSPM algorithms have been designed to mine high-utility sequential patterns (HUPSPs). They have been applied in several real-life situations such as for consumer behavior analysis and event detection in sensor networks. Nonetheless, most studies on HUSPM have focused on mining HUPSPs in precise data. But in real-life, uncertainty is an important factor as data is collected using various types of sensors that are more or less accurate. Hence, data collected in a real-life database can be annotated with existing probabilities. This paper presents a novel pattern mining framework called high utility-probability sequential pattern mining (HUPSPM) for mining high utility-probability sequential patterns (HUPSPs) in uncertain sequence databases. A baseline algorithm with three optional pruning strategies is presented to mine HUPSPs. Moroever, to speed up the mining process, a projection mechanism is designed to create a database projection for each processed sequence, which is smaller than the original database. Thus, the number of unpromising candidates can be greatly reduced, as well as the execution time for mining HUPSPs. Substantial experiments both on real-life and synthetic datasets show that the designed algorithm performs well in terms of runtime, number of candidates, memory usage, and scalability for different minimum utility and minimum probability thresholds. PMID:28742847
EPA Communications Stylebook

EPA Pesticide Factsheets

This currently effective Stylebook developed by the Office of External Affairs and Environmental Education (OEAEE) includes a checklist for communications product development, publication and web writing guide, and graphics and logo usage and policies.
Informal Learning in Work Environments: Training with the Social Web in the Workplace

ERIC Educational Resources Information Center

Garcia-Penalvo, Francisco J.; Colomo-Palacios, Ricardo; Lytras, Miltiadis D.

2012-01-01

The Internet and its increasing usage has changed informal learning in depth. This change has affected young and older adults in both the workplace and in higher education. But, in spite of this, formal and non-formal course-based approaches have not taken full advantage of these new informal learning scenarios and technologies. The Web 2.0 is a…
Exploring the Impact of Individualism and Uncertainty Avoidance in Web-Based Electronic Learning: An Empirical Analysis in European Higher Education

ERIC Educational Resources Information Center

Sanchez-Franco, Manuel J.; Martinez-Lopez, Francisco J.; Martin-Velicia, Felix A.

2009-01-01

Our research specifically focuses on the effects of the national cultural background of educators on the acceptance and usage of ICT, particularly the Web as an extensive and expanding information base that provides the ultimate in resource-rich learning. Most research has been used North Americans as subjects. For this reason, we interviewed…
Use of a Case-Based Hypermedia Resource in an Early Literacy Coaching Intervention with Pre-Kindergarten Teachers

ERIC Educational Resources Information Center

Powell, Douglas R.; Diamond, Karen E.; Koehler, Matthew J.

2010-01-01

Use of a case-based hypermedia resource (HR) was examined in a Web-based early literacy coaching intervention with pre-kindergarten teachers of at-risk children. Web usage logs, written records of coach feedback to teachers on their instruction, and a teacher questionnaire were the primary data sources. Visits to the HR content pages were unevenly…
Responding to User's Expectation in the Library: Innovative Web 2.0 Applications at JUIT Library: A Case Study

ERIC Educational Resources Information Center

Ram, Shri; Anbu K., John Paul; Kataria, Sanjay

2011-01-01

Purpose: This paper seeks to provide an insight into the implementation of some of the innovative Web 2.0 applications at Jaypee University of Information Technology with the aim of exploring the expectations of the users and their awareness and usage of such applications. Design/methodology/approach: The study was undertaken at the Learning…
Adoption of Web 2.0 Technology in Higher Education: A Case Study of Universities in National Capital Region, India

ERIC Educational Resources Information Center

Tyagi, Sunil

2012-01-01

The present study was conducted in six (6) Indian Universities at NCR (National Capital Region) of India to explore the usage analysis of Web 2.0 technologies in learning environment by faculty members. The investigator conducted a survey with the help of structured questionnaire on 300 respondents. A total of 300 self-administered questionnaires…
The State of Wiki Usage in U.S. K-12 Schools: Leveraging Web 2.0 Data Warehouses to Assess Quality and Equity in Online Learning Environments

ERIC Educational Resources Information Center

Reich, Justin; Murnane, Richard; Willett, John

2012-01-01

To document wiki usage in U.S. K-12 settings, this study examined a representative sample drawn from a population of nearly 180,000 wikis. The authors measured the opportunities wikis provide for students to develop 21st-century skills such as expert thinking, complex communication, and new media literacy. The authors found four types of wiki…
Technical note: real-time web-based wireless visual guidance system for radiotherapy.

PubMed

Lee, Danny; Kim, Siyong; Palta, Jatinder R; Kim, Taeho

2017-06-01

Describe a Web-based wireless visual guidance system that mitigates issues associated with hard-wired audio-visual aided patient interactive motion management systems that are cumbersome to use in routine clinical practice. Web-based wireless visual display duplicates an existing visual display of a respiratory-motion management system for visual guidance. The visual display of the existing system is sent to legacy Web clients over a private wireless network, thereby allowing a wireless setting for real-time visual guidance. In this study, active breathing coordinator (ABC) trace was used as an input for visual display, which captured and transmitted to Web clients. Virtual reality goggles require two (left and right eye view) images for visual display. We investigated the performance of Web-based wireless visual guidance by quantifying (1) the network latency of visual displays between an ABC computer display and Web clients of a laptop, an iPad mini 2 and an iPhone 6, and (2) the frame rate of visual display on the Web clients in frames per second (fps). The network latency of visual display between the ABC computer and Web clients was about 100 ms and the frame rate was 14.0 fps (laptop), 9.2 fps (iPad mini 2) and 11.2 fps (iPhone 6). In addition, visual display for virtual reality goggles was successfully shown on the iPhone 6 with 100 ms and 11.2 fps. A high network security was maintained by utilizing the private network configuration. This study demonstrated that a Web-based wireless visual guidance can be a promising technique for clinical motion management systems, which require real-time visual display of their outputs. Based on the results of this study, our approach has the potential to reduce clutter associated with wired-systems, reduce space requirements, and extend the use of medical devices from static usage to interactive and dynamic usage in a radiotherapy treatment vault.
SOAP based web services and their future role in VO projects

NASA Astrophysics Data System (ADS)

Topf, F.; Jacquey, C.; Génot, V.; Cecconi, B.; André, N.; Zhang, T. L.; Kallio, E.; Lammer, H.; Facsko, G.; Stöckler, R.; Khodachenko, M.

2011-10-01

Modern state-of-the-art web services are from crucial importance for the interoperability of different VO tools existing in the planetary community. SOAP based web services assure the interconnectability between different data sources and tools by providing a common protocol for communication. This paper will point out a best practice approach with the Automated Multi-Dataset Analysis Tool (AMDA) developed by CDPP, Toulouse and the provision of VEX/MAG data from a remote database located at IWF, Graz. Furthermore a new FP7 project IMPEx will be introduced with a potential usage example of AMDA web services in conjunction with simulation models.
Changes in host-parasitoid food web structure with elevation.

PubMed

Maunsell, Sarah C; Kitching, Roger L; Burwell, Chris J; Morris, Rebecca J

2015-03-01

Gradients in elevation are increasingly used to investigate how species respond to changes in local climatic conditions. Whilst many studies have shown elevational patterns in species richness and turnover, little is known about how food web structure is affected by elevation. Contrasting responses of predator and prey species to elevation may lead to changes in food web structure. We investigated how the quantitative structure of a herbivore-parasitoid food web changes with elevation in an Australian subtropical rain forest. On four occasions, spread over 1 year, we hand-collected leaf miners at twelve sites, along three elevational gradients (between 493 m and 1159 m a.s.l). A total of 5030 insects, including 603 parasitoids, were reared, and summary food webs were created for each site. We also carried out a replicated manipulative experiment by translocating an abundant leaf-mining weevil Platynotocis sp., which largely escaped parasitism at high elevations (≥ 900 m a.s.l.), to lower, warmer elevations, to test if it would experience higher parasitism pressure. We found strong evidence that the environmental change that occurs with increasing elevation affects food web structure. Quantitative measures of generality, vulnerability and interaction evenness decreased significantly with increasing elevation (and decreasing temperature), whilst elevation did not have a significant effect on connectance. Mined plant composition also had a significant effect on generality and vulnerability, but not on interaction evenness. Several relatively abundant species of leaf miner appeared to escape parasitism at higher elevations, but contrary to our prediction, Platynotocis sp. did not experience greater levels of parasitism when translocated to lower elevations. Our study indicates that leaf-mining herbivores and their parasitoids respond differently to environmental conditions imposed by elevation, thus producing structural changes in their food webs. Increasing temperatures and changes in vegetation communities that are likely to result from climate change may have a restructuring effect on host-parasitoid food webs. Our translocation experiment, however, indicated that leaf miners currently escaping parasitism at high elevations may not automatically experience higher parasitism under warmer conditions and future changes in food web structure may depend on the ability of parasitoids to adapt to novel hosts. © 2014 The Authors. Journal of Animal Ecology © 2014 British Ecological Society.
A demanding web-based PACS supported by web services technology

NASA Astrophysics Data System (ADS)

Costa, Carlos M. A.; Silva, Augusto; Oliveira, José L.; Ribeiro, Vasco G.; Ribeiro, José

2006-03-01

During the last years, the ubiquity of web interfaces have pushed practically all PACS suppliers to develop client applications in which clinical practitioners can receive and analyze medical images, using conventional personal computers and Web browsers. However, due to security and performance issues, the utilization of these software packages has been restricted to Intranets. Paradigmatically, one of the most important advantages of digital image systems is to simplify the widespread sharing and remote access of medical data between healthcare institutions. This paper analyses the traditional PACS drawbacks that contribute to their reduced usage in the Internet and describes a PACS based on Web Services technology that supports a customized DICOM encoding syntax and a specific compression scheme providing all historical patient data in a unique Web interface.
Usage of the www.2aida.org AIDA diabetes software Website: a pilot study.

PubMed

Lehmann, Eldon D

2003-01-01

AIDA is a diabetes-computing program freely available from www.2aida.org on the Web. The software is intended to serve as an educational support tool, and can be used by anyone who has an interest in diabetes, whether they be patients, relatives, health-care professionals, or students. In previous "Diabetes Information Technology & WebWatch" columns various indicators of usage of the AIDA program have been reviewed, and various comments from users of the software have been documented. One aspect of AIDA, though, that has been of considerable interest has been to investigate its Web-based distribution as a wider paradigm for more general medically related usage of the Internet. In this respect we have been keen to understand in general terms: (1) why people are turning to the Web for health-care/diabetes information; (2) more specifically, what sort of people are making use of the AIDA software; and (3) what benefits they feel might accrue from using the program. To answer these types of questions we have been conducting a series of audits/surveys via the AIDA Website, and via the software program itself, to learn as much as possible about who the AIDA end users really are. The rationale for this work is that, in this way, it should be possible to improve the program as well as tailor future versions of the software to the interests and needs of its users. However, a recurring observation is that data collection is easiest if it is as unobtrusive and innocuous as possible. One aspect of learning as much as possible about diabetes Website visitors and users may be to apply techniques that do not necessitate any visitor or user interaction. There are various programs that can monitor what pages visitors are viewing at a site. As these programs do not require visitors to do anything special, over time some interesting insights into Website usage may be obtained. For the current study we have reviewed anonymous logstats data, which are automatically collected at many Websites, to try and establish a baseline level of usage for the AIDA site. For the initial pilot study the analysis was performed from October 1, 2000 to November 1, 2001. The study has yielded an interesting insight into how the AIDA Website is being used. The results also confirm those of previous audits based on different self-reported methodologies, confirming, amongst other things, what countries people are visiting from and what operating systems/computers they are using. These analyses have been informative and useful. Given this, it is proposed to repeat the current pilot survey approach on a routine basis, in the future, as a way of monitoring on-going usage of the AIDA Website.
Analyzing engagement in a web-based intervention platform through visualizing log-data.

PubMed

Morrison, Cecily; Doherty, Gavin

2014-11-13

Engagement has emerged as a significant cross-cutting concern within the development of Web-based interventions. There have been calls to institute a more rigorous approach to the design of Web-based interventions, to increase both the quantity and quality of engagement. One approach would be to use log-data to better understand the process of engagement and patterns of use. However, an important challenge lies in organizing log-data for productive analysis. Our aim was to conduct an initial exploration of the use of visualizations of log-data to enhance understanding of engagement with Web-based interventions. We applied exploratory sequential data analysis to highlight sequential aspects of the log data, such as time or module number, to provide insights into engagement. After applying a number of processing steps, a range of visualizations were generated from the log-data. We then examined the usefulness of these visualizations for understanding the engagement of individual users and the engagement of cohorts of users. The visualizations created are illustrated with two datasets drawn from studies using the SilverCloud Platform: (1) a small, detailed dataset with interviews (n=19) and (2) a large dataset (n=326) with 44,838 logged events. We present four exploratory visualizations of user engagement with a Web-based intervention, including Navigation Graph, Stripe Graph, Start-Finish Graph, and Next Action Heat Map. The first represents individual usage and the last three, specific aspects of cohort usage. We provide examples of each with a discussion of salient features. Log-data analysis through data visualization is an alternative way of exploring user engagement with Web-based interventions, which can yield different insights than more commonly used summative measures. We describe how understanding the process of engagement through visualizations can support the development and evaluation of Web-based interventions. Specifically, we show how visualizations can (1) allow inspection of content or feature usage in a temporal relationship to the overall program at different levels of granularity, (2) detect different patterns of use to consider personalization in the design process, (3) detect usability issues, (4) enable exploratory analysis to support the design of statistical queries to summarize the data, (5) provide new opportunities for real-time evaluation, and (6) examine assumptions about interactivity that underlie many summative measures in this field.
Analyzing Engagement in a Web-Based Intervention Platform Through Visualizing Log-Data

PubMed Central

2014-01-01

Background Engagement has emerged as a significant cross-cutting concern within the development of Web-based interventions. There have been calls to institute a more rigorous approach to the design of Web-based interventions, to increase both the quantity and quality of engagement. One approach would be to use log-data to better understand the process of engagement and patterns of use. However, an important challenge lies in organizing log-data for productive analysis. Objective Our aim was to conduct an initial exploration of the use of visualizations of log-data to enhance understanding of engagement with Web-based interventions. Methods We applied exploratory sequential data analysis to highlight sequential aspects of the log data, such as time or module number, to provide insights into engagement. After applying a number of processing steps, a range of visualizations were generated from the log-data. We then examined the usefulness of these visualizations for understanding the engagement of individual users and the engagement of cohorts of users. The visualizations created are illustrated with two datasets drawn from studies using the SilverCloud Platform: (1) a small, detailed dataset with interviews (n=19) and (2) a large dataset (n=326) with 44,838 logged events. Results We present four exploratory visualizations of user engagement with a Web-based intervention, including Navigation Graph, Stripe Graph, Start–Finish Graph, and Next Action Heat Map. The first represents individual usage and the last three, specific aspects of cohort usage. We provide examples of each with a discussion of salient features. Conclusions Log-data analysis through data visualization is an alternative way of exploring user engagement with Web-based interventions, which can yield different insights than more commonly used summative measures. We describe how understanding the process of engagement through visualizations can support the development and evaluation of Web-based interventions. Specifically, we show how visualizations can (1) allow inspection of content or feature usage in a temporal relationship to the overall program at different levels of granularity, (2) detect different patterns of use to consider personalization in the design process, (3) detect usability issues, (4) enable exploratory analysis to support the design of statistical queries to summarize the data, (5) provide new opportunities for real-time evaluation, and (6) examine assumptions about interactivity that underlie many summative measures in this field. PMID:25406097
Influence of plankton mercury dynamics and trophic pathways on mercury concentrations of top predator fish of a mining-impacted reservoir

USGS Publications Warehouse

Stewart, A.R.; Saiki, M.K.; Kuwabara, J.S.; Alpers, Charles N.; Marvin-DiPasquale, M.; Krabbenhoft, D.P.

2008-01-01

Physical and biogeochemical characteristics of the aquatic environment that affect growth dynamics of phytoplankton and the zooplankton communities that depend on them may also affect uptake of methylmercury (MeHg) into the pelagic food web of oligotrophic reservoirs. We evaluated changes in the quality and quantity of suspended particulate material, zooplankton taxonomy, and MeHg concentrations coincident with seasonal changes in water storage of a mining-impacted reservoir in northern California, USA. MeHg concentrations in bulk zooplankton increased from 4 ng??g-1 at low water to 77 ?? 6.1 ng??g-1 at high water and were positively correlated with cladoceran biomass (r = 0.66) and negatively correlated with rotifer biomass (r = -0.65). Stable isotope analysis revealed overall higher MeHg concentrations in the pelagic-based food web relative to the benthic-based food web. Statistically similar patterns of trophic enrichment of MeHg (slopes) for the pelagic and benthic food webs and slightly higher MeHg concentrations in zooplankton than in benthic invertebrates suggest that the difference in MeHg bioaccumulation among trophic pathways is set at the base of the food webs. These results suggest an important role for plankton dynamics in driving the MeHg content of zooplankton and ultimately MeHg bioaccumulation in top predators in pelagic-based food webs. ?? 2008 NRC.

Public Health and Epidemiology Informatics.

PubMed

Flahault, A; Bar-Hen, A; Paragios, N

2016-11-10

The aim of this manuscript is to provide a brief overview of the scientific challenges that should be addressed in order to unlock the full potential of using data from a general point of view, as well as to present some ideas that could help answer specific needs for data understanding in the field of health sciences and epidemiology. A survey of uses and challenges of big data analyses for medicine and public health was conducted. The first part of the paper focuses on big data techniques, algorithms, and statistical approaches to identify patterns in data. The second part describes some cutting-edge applications of analyses and predictive modeling in public health. In recent years, we witnessed a revolution regarding the nature, collection, and availability of data in general. This was especially striking in the health sector and particularly in the field of epidemiology. Data derives from a large variety of sources, e.g. clinical settings, billing claims, care scheduling, drug usage, web based search queries, and Tweets. The exploitation of the information (data mining, artificial intelligence) relevant to these data has become one of the most promising as well challenging tasks from societal and scientific viewpoints in order to leverage the information available and making public health more efficient.
Text Mining for Adverse Drug Events: the Promise, Challenges, and State of the Art

PubMed Central

Harpaz, Rave; Callahan, Alison; Tamang, Suzanne; Low, Yen; Odgers, David; Finlayson, Sam; Jung, Kenneth; LePendu, Paea; Shah, Nigam H.

2014-01-01

Text mining is the computational process of extracting meaningful information from large amounts of unstructured text. Text mining is emerging as a tool to leverage underutilized data sources that can improve pharmacovigilance, including the objective of adverse drug event detection and assessment. This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining, and discusses several data sources—such as biomedical literature, clinical narratives, product labeling, social media, and Web search logs—that are amenable to text-mining for pharmacovigilance. Given the state of the art, it appears text mining can be applied to extract useful ADE-related information from multiple textual sources. Nonetheless, further research is required to address remaining technical challenges associated with the text mining methodologies, and to conclusively determine the relative contribution of each textual source to improving pharmacovigilance. PMID:25151493
Randomized Controlled Trial of the Combined Effects of Web and Quitline Interventions for Smokeless Tobacco Cessation

PubMed Central

Danaher, Brian G.; Severson, Herbert H.; Zhu, Shu-Hong; Andrews, Judy A.; Cummins, Sharon E.; Lichtenstein, Edward; Tedeschi, Gary J.; Hudkins, Coleen; Widdop, Chris; Crowley, Ryann; Seeley, John R.

2015-01-01

Background Use of smokeless tobacco (moist snuff and chewing tobacco) is a significant public health problem but smokeless tobacco users have few resources to help them quit. Web programs and telephone-based programs (Quitlines) have been shown to be effective for smoking cessation. We evaluate the effectiveness of a Web program, a Quitline, and the combination of the two for smokeless users recruited via the Web. Objectives To test whether offering both a Web and Quitline intervention for smokeless tobacco users results in significantly better long-term tobacco abstinence outcomes than offering either intervention alone; to test whether the offer of Web or Quitline results in better outcome than a self-help manual only Control condition; and to report the usage and satisfaction of the interventions when offered alone or combined. Methods Smokeless tobacco users (N= 1,683) wanting to quit were recruited online and randomly offered one of four treatment conditions in a 2×2 design: Web Only, Quitline Only, Web + Quitline, and Control (printed self-help guide). Point-prevalence all tobacco abstinence was assessed at 3- and 6-months post enrollment. Results 69% of participants completed both the 3- and 6-month assessments. There was no significant additive or synergistic effect of combining the two interventions for Complete Case or the more rigorous Intent To Treat (ITT) analyses. Significant simple effects were detected, individually the interventions were more efficacious than the control in achieving repeated 7-day point prevalence all tobacco abstinence: Web (ITT, OR = 1.41, 95% CI = 1.03, 1.94, p = .033) and Quitline (ITT: OR = 1.54, 95% CI = 1.13, 2.11, p = .007). Participants were more likely to complete a Quitline call when offered only the Quitline intervention (OR = 0.71, 95% CI = .054, .093, p = .013), the number of website visits and duration did not differ when offered alone or in combination with Quitline. Rates of program helpfulness (p <.05) and satisfaction (p <.05) were higher for those offered both interventions versus offered only quitline. Conclusion Combining Web and Quitline interventions did not result in additive or synergistic effects, as have been found for smoking. Both interventions were more effective than a self-help control condition in helping motivated smokeless tobacco users quit tobacco. Intervention usage and satisfaction were related to the amount intervention content offered. Usage of the Quitline intervention decreased when offered in combination, though rates of helpfulness and recommendations were higher when offered in combination. Trial Registration Clinicaltrials.gov NCT00820495; http://clinicaltrials.gov/ct2/show/NCT00820495 PMID:25914872
Combined mine tremors source location and error evaluation in the Lubin Copper Mine (Poland)

NASA Astrophysics Data System (ADS)

Leśniak, Andrzej; Pszczoła, Grzegorz

2008-08-01

A modified method of mine tremors location used in Lubin Copper Mine is presented in the paper. In mines where an intensive exploration is carried out a high accuracy source location technique is usually required. The effect of the flatness of the geophones array, complex geological structure of the rock mass and intense exploitation make the location results ambiguous in such mines. In the present paper an effective method of source location and location's error evaluations are presented, combining data from two different arrays of geophones. The first consists of uniaxial geophones spaced in the whole mine area. The second is installed in one of the mining panels and consists of triaxial geophones. The usage of the data obtained from triaxial geophones allows to increase the hypocenter vertical coordinate precision. The presented two-step location procedure combines standard location methods: P-waves directions and P-waves arrival times. Using computer simulations the efficiency of the created algorithm was tested. The designed algorithm is fully non-linear and was tested on the multilayered rock mass model of the Lubin Copper Mine, showing a computational better efficiency than the traditional P-wave arrival times location algorithm. In this paper we present the complete procedure that effectively solves the non-linear location problems, i.e. the mine tremor location and measurement of the error propagation.
A Clustering Methodology of Web Log Data for Learning Management Systems

ERIC Educational Resources Information Center

Valsamidis, Stavros; Kontogiannis, Sotirios; Kazanidis, Ioannis; Theodosiou, Theodosios; Karakos, Alexandros

2012-01-01

Learning Management Systems (LMS) collect large amounts of data. Data mining techniques can be applied to analyse their web data log files. The instructors may use this data for assessing and measuring their courses. In this respect, we have proposed a methodology for analysing LMS courses and students' activity. This methodology uses a Markov…
Query Classification and Study of University Students' Search Trends

ERIC Educational Resources Information Center

Maabreh, Majdi A.; Al-Kabi, Mohammed N.; Alsmadi, Izzat M.

2012-01-01

Purpose: This study is an attempt to develop an automatic identification method for Arabic web queries and divide them into several query types using data mining. In addition, it seeks to evaluate the impact of the academic environment on using the internet. Design/methodology/approach: The web log files were collected from one of the higher…
Learning System of Web Navigation Patterns through Hypertext Probabilistic Grammars

ERIC Educational Resources Information Center

Cortes Vasquez, Augusto

2015-01-01

One issue of real interest in the area of web data mining is to capture users' activities during connection and extract behavior patterns that help define their preferences in order to improve the design of future pages adapting websites interfaces to individual users. This research is intended to provide, first of all, a presentation of the…
Mining Learning Social Networks for Cooperative Learning with Appropriate Learning Partners in a Problem-Based Learning Environment

ERIC Educational Resources Information Center

Chen, Chih-Ming; Chang, Chia-Cheng

2014-01-01

Many studies have identified web-based cooperative learning as an increasingly popular educational paradigm with potential to increase learner satisfaction and interactions. However, peer-to-peer interaction often suffers barriers owing to a failure to explore useful social interaction information in web-based cooperative learning environments.…
WALK 2.0 - using Web 2.0 applications to promote health-related physical activity: a randomised controlled trial protocol.

PubMed

Kolt, Gregory S; Rosenkranz, Richard R; Savage, Trevor N; Maeder, Anthony J; Vandelanotte, Corneel; Duncan, Mitch J; Caperchione, Cristina M; Tague, Rhys; Hooker, Cindy; Mummery, W Kerry

2013-05-03

Physical inactivity is one of the leading modifiable causes of death and disease in Australia. National surveys indicate less than half of the Australian adult population are sufficiently active to obtain health benefits. The Internet is a potentially important medium for successfully communicating health messages to the general population and enabling individual behaviour change. Internet-based interventions have proven efficacy; however, intervention studies describing website usage objectively have reported a strong decline in usage, and high attrition rate, over the course of the interventions. Web 2.0 applications give users control over web content generated and present innovative possibilities to improve user engagement. There is, however, a need to assess the effectiveness of these applications in the general population. The Walk 2.0 project is a 3-arm randomised controlled trial investigating the effects of "next generation" web-based applications on engagement, retention, and subsequent physical activity behaviour change. 504 individuals will be recruited from two sites in Australia, randomly allocated to one of two web-based interventions (Web 1.0 or Web 2.0) or a control group, and provided with a pedometer to monitor physical activity. The Web 1.0 intervention will provide participants with access to an existing physical activity website with limited interactivity. The Web 2.0 intervention will provide access to a website featuring Web 2.0 content, including social networking, blogs, and virtual walking groups. Control participants will receive a logbook to record their steps. All groups will receive similar educational material on setting goals and increasing physical activity. The primary outcomes are objectively measured physical activity and website engagement and retention. Other outcomes measured include quality of life, psychosocial correlates, and anthropometric measurements. Outcomes will be measured at baseline, 3, 12 and 18 months. The findings of this study will provide increased understanding of the benefit of new web-based technologies and applications in engaging and retaining participants on web-based intervention sites, with the aim of improved health behaviour change outcomes. Australian New Zealand Clinical Trials Registry, ACTRN12611000157976.
InterMine Webservices for Phytozome (Rev2)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carlson, Joseph; Goodstein, David; Rokhsar, Dan

2014-07-10

A datawarehousing framework for information provides a useful infrastructure for providers and users of genomic data. For providers, the infrastructure give them a consistent mechanism for extracting raw data. While for the users, the web services supported by the software allows them to make complex, and often unique, queries of the data. Previously, phytozome.net used BioMart to provide the infrastructure. As the complexity, scale and diversity of the dataset as grown, we decided to implement an InterMine web service on our servers. This change was largely motivated by the ability to have a more complex table structure and richer webmore » reporting mechanism than BioMart. For InterMine to achieve its more complex database schema it requires an XML description of the data and an appropriate loader. Unlimited one-to-many and many-to-many relationship between the tables can be enabled in the schema. We have implemented support for:1.) Genomes and annotations for the data in Phytozome. This set is the 48 organisms currently stored in a back end CHADO datastore. The data loaders are modified versions of the CHADO data adapters from FlyMine. 2.) Interproscan results from all proteins in the Phytozome database. 3.) Clusters of proteins into a grouped heirarchically by similarity. 4.) Cufflinks results from tissue-specific RNA-Seq data of Phytozome organisms. 5.) Diversity data (GATK and SnpEFF results) from a set of individual organism. The last two datatypes are new in this implementation of our web services. We anticipate that the scale of these data will increase considerably in the near future.« less
minepath.org: a free interactive pathway analysis web server.

PubMed

Koumakis, Lefteris; Roussos, Panos; Potamias, George

2017-07-03

( www.minepath.org ) is a web-based platform that elaborates on, and radically extends the identification of differentially expressed sub-paths in molecular pathways. Besides the network topology, the underlying MinePath algorithmic processes exploit exact gene-gene molecular relationships (e.g. activation, inhibition) and are able to identify differentially expressed pathway parts. Each pathway is decomposed into all its constituent sub-paths, which in turn are matched with corresponding gene expression profiles. The highly ranked, and phenotype inclined sub-paths are kept. Apart from the pathway analysis algorithm, the fundamental innovation of the MinePath web-server concerns its advanced visualization and interactive capabilities. To our knowledge, this is the first pathway analysis server that introduces and offers visualization of the underlying and active pathway regulatory mechanisms instead of genes. Other features include live interaction, immediate visualization of functional sub-paths per phenotype and dynamic linked annotations for the engaged genes and molecular relations. The user can download not only the results but also the corresponding web viewer framework of the performed analysis. This feature provides the flexibility to immediately publish results without publishing source/expression data, and get all the functionality of a web based pathway analysis viewer. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
PLAN2L: a web tool for integrated text mining and literature-based bioentity relation extraction.

PubMed

Krallinger, Martin; Rodriguez-Penagos, Carlos; Tendulkar, Ashish; Valencia, Alfonso

2009-07-01

There is an increasing interest in using literature mining techniques to complement information extracted from annotation databases or generated by bioinformatics applications. Here we present PLAN2L, a web-based online search system that integrates text mining and information extraction techniques to access systematically information useful for analyzing genetic, cellular and molecular aspects of the plant model organism Arabidopsis thaliana. Our system facilitates a more efficient retrieval of information relevant to heterogeneous biological topics, from implications in biological relationships at the level of protein interactions and gene regulation, to sub-cellular locations of gene products and associations to cellular and developmental processes, i.e. cell cycle, flowering, root, leaf and seed development. Beyond single entities, also predefined pairs of entities can be provided as queries for which literature-derived relations together with textual evidences are returned. PLAN2L does not require registration and is freely accessible at http://zope.bioinfo.cnio.es/plan2l.
When the Americans with Disabilities Act Goes Online: Application of the ADA to the Internet and the Worldwide Web. Position Paper.

ERIC Educational Resources Information Center

National Council on Disability, Washington, DC.

This paper addresses the issue of how the Americans with Disabilities Act (ADA) applies to commercial and other private sector Web sites. Beginning with a brief discussion of the role electronic communication plays in our lives, the paper then considers the placement of the ADA in the context of current technology and of computer usage in the…
The State of Wiki Usage in U.S. K-12 Schools: Leveraging Web 2.0 Data Warehouses to Study Quality and Equality in Online Learning Environments

ERIC Educational Resources Information Center

Reich, Blair Justin Fire

2012-01-01

In the first part of this dissertation, I document wiki usage in U.S. K-12 settings by analyzing data on a representative sample drawn from a population of nearly 180,000 wikis. My research group, which I lead and managed, measured the opportunities wikis provide for students to develop 21st century skills such as expert thinking, complex…
Use of the computer and Internet among Italian families: first national study.

PubMed

Bricolo, Francesco; Gentile, Douglas A; Smelser, Rachel L; Serpelloni, Giovanni

2007-12-01

Although home Internet access has continued to increase, little is known about actual usage patterns in homes. This nationally representative study of over 4,700 Italian households with children measured computer and Internet use of each family member across 3 months. Data on actual computer and Internet usage were collected by Nielsen//NetRatings service and provide national baseline information on several variables for several age groups separately, including children, adolescents, and adult men and women. National averages are shown for the average amount of time spent using computers and on the Web, the percentage of each age group online, and the types of Web sites viewed. Overall, about one-third of children ages 2 to 11, three-fourths of adolescents and adult women, and over four-fifths of adult men access the Internet each month. Children spend an average of 22 hours/month on the computer, with a jump to 87 hours/month for adolescents. Adult women spend less time (about 60 hours/month), and adult men spend more (over 100). The types of Web sites visited are reported, including the top five for each age group. In general, search engines and Web portals are the top sites visited, regardless of age group. These data provide a baseline for comparisons across time and cultures.
Personalised Information Services Using a Hybrid Recommendation Method Based on Usage Frequency

ERIC Educational Resources Information Center

Kim, Yong; Chung, Min Gyo

2008-01-01

Purpose: This paper seeks to describe a personal recommendation service (PRS) involving an innovative hybrid recommendation method suitable for deployment in a large-scale multimedia user environment. Design/methodology/approach: The proposed hybrid method partitions content and user into segments and executes association rule mining,…
The Study of Contract Drafting Strategy: Exercises in Mine Detection.

ERIC Educational Resources Information Center

Child, Barbara

1992-01-01

A technique to teach the drafting of contracts in legal education is offered. Two form contracts, for real estate sale and purchase, are compared to illustrate the need for careful drafting of contracts. Examination of provisions for repair and maintenance reveals potentially significant differences in language usage. (MSE)
Environmental management in North American mining sector.

PubMed

Asif, Zunaira; Chen, Zhi

2016-01-01

This paper reviews the environmental issues and management practices in the mining sector in the North America. The sustainable measures on waste management are recognized as one of the most serious environmental concerns in the mining industry. For mining activities, it will be no surprise that the metal recovery reagents and acid effluents are a threat to the ecosystem as well as hazards to human health. In addition, poor air quality and ventilation in underground mines can lead to occupational illness and death of workers. Electricity usage and fuel consumption are major factors that contribute to greenhouse gases. On the other hand, many sustainability challenges are faced in the management of tailings and disposal of waste rock. This paper aims to highlight the problems that arise due to poor air quality and acid mine drainage. The paper also addresses some of the advantages and limitations of tailing and waste rock management that still have to be studied in context of the mining sector. This paper suggests that implementation of suitable environmental management tools like life cycle assessment (LCA), cleaner production technologies (CPTs), and multicriteria decision analysis (MCD) are important as it ultimately lead to improve environmental performance and enabling a mine to focus on the next stage of sustainability.
Handling Dynamic Weights in Weighted Frequent Pattern Mining

NASA Astrophysics Data System (ADS)

Ahmed, Chowdhury Farhan; Tanbeer, Syed Khairuzzaman; Jeong, Byeong-Soo; Lee, Young-Koo

Even though weighted frequent pattern (WFP) mining is more effective than traditional frequent pattern mining because it can consider different semantic significances (weights) of items, existing WFP algorithms assume that each item has a fixed weight. But in real world scenarios, the weight (price or significance) of an item can vary with time. Reflecting these changes in item weight is necessary in several mining applications, such as retail market data analysis and web click stream analysis. In this paper, we introduce the concept of a dynamic weight for each item, and propose an algorithm, DWFPM (dynamic weighted frequent pattern mining), that makes use of this concept. Our algorithm can address situations where the weight (price or significance) of an item varies dynamically. It exploits a pattern growth mining technique to avoid the level-wise candidate set generation-and-test methodology. Furthermore, it requires only one database scan, so it is eligible for use in stream data mining. An extensive performance analysis shows that our algorithm is efficient and scalable for WFP mining using dynamic weights.
Efficient Web Vulnerability Detection Tool for Sleeping Giant-Cross Site Request Forgery

NASA Astrophysics Data System (ADS)

Parimala, G.; Sangeetha, M.; AndalPriyadharsini, R.

2018-04-01

Now day’s web applications are very high in the rate of usage due to their user friendly environment and getting any information via internet but these web applications are affected by lot of threats. CSRF attack is one of the serious threats to web applications which is based on the vulnerabilities present in the normal web request and response of HTTP protocol. It is hard to detect but hence still it is present in most of the existing web applications. In CSRF attack, without user knowledge the unwanted actions on a reliable websites are forced to happen. So it is placed in OWASP’s top 10 Web Application attacks list. My proposed work is to do a real time scan of CSRF vulnerability attack in given URL of the web applications as well as local host address for any organization using python language. Client side detection of CSRF is depended on Form count which is presented in that given web site.

Selenium in ecosystems within the mountaintop coal mining and valley-fill region of southern West Virginia-assessment and ecosystem-scale modeling

USGS Publications Warehouse

Presser, Theresa S.

2013-01-01

Investigating the presence and variability of prey and predator species in demographically open systems such as streams also is key to model outcomes given the overall environmental stressors (for example, general landscape change, food-web disruption, recolonization potential) imposed on the composition of biological communities in coal mining and valley-fill affected watersheds
Publications - GMC 273 | Alaska Division of Geological & Geophysical

Science.gov Websites

holes received at the GMC (1 box, holes N1 through N8) of the INEXCO Mining Company Nikolai Project , holes N1 through N8) of the INEXCO Mining Company Nikolai Project, McCarthy, Alaska that consist of core Alaska's Mineral Industry Reports AKGeology.info Rare Earth Elements WebGeochem Engineering Geology Alaska
The Raising Influence of Information Technologies on Professional Training in the Sphere of Automated Driving When Transporting Mined Rock

NASA Astrophysics Data System (ADS)

Kosolapov, Andrey; Krysin, Sergey

2017-11-01

Revolutionary changes in the area of production, holding and exploitation of the automobile as a transport vehicle are analyzed in the article. Current state of the issue is described and the development stages of new approach to driving without human participation are predicted, taking into consideration the usage of automobiles for transportation of mined rock in Kuzbass. The influence of modern information technologies on the development of new sector of automobile industry and on the process of professional and further training of the specialists in the sphere of automobile driving is considered.
OSCAR4: a flexible architecture for chemical text-mining

PubMed Central

2011-01-01

The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed. PMID:21999457
Personal Health Technologies in Employee Health Promotion: Usage Activity, Usefulness, and Health-Related Outcomes in a 1-Year Randomized Controlled Trial

PubMed Central

Orsama, Anna-Leena; Ahtinen, Aino; Hopsu, Leila; Leino, Timo; Korhonen, Ilkka

2013-01-01

Background Common risk factors such as obesity, poor nutrition, physical inactivity, stress, and sleep deprivation threaten the wellness and work ability of employees. Personal health technologies may help improve engagement in health promotion programs and maintenance of their effect. Objective This study investigated personal health technologies in supporting employee health promotion targeting multiple behavioral health risks. We studied the relations of usage activity to demographic and physiological characteristics, health-related outcomes (weight, aerobic fitness, blood pressure and cholesterol), and the perceived usefulness of technologies in wellness management. Methods We conducted a subgroup analysis of the technology group (114 subjects, 33 males, average age 45 years, average BMI 27.1 kg/m2) of a 3-arm randomized controlled trial (N=352). The trial was organized to study the efficacy of a face-to-face group intervention supported by technologies, including Web services, mobile applications, and personal monitoring devices. Technology usage was investigated based on log files and questionnaires. The associations between sustained usage of Web and mobile technologies and demographic and physiological characteristics were analyzed by comparing the baseline data of sustained and non-sustained users. The associations between sustained usage and changes in health-related outcomes were studied by repeated analysis of variance, using data measured by baseline and end questionnaires, and anthropometric and laboratory measurements. The experienced usability, usefulness, motivation, and barriers to using technologies were investigated by 4 questionnaires and 2 interviews. Results 111 subjects (97.4%) used technologies at some point of the study, and 33 (29.9%) were classified as sustained users of Web or mobile technologies. Simple technologies, weight scales and pedometer, attracted the most users. The sustained users were slightly older 47 years (95% CI 44 to 49) versus 44 years (95% CI 42 to 45), P=.034 and had poorer aerobic fitness at baseline (mean difference in maximal metabolic equivalent 1.0, 95% Cl 0.39 to 1.39; P=.013) than non-sustained users. They succeeded better in weight management: their weight decreased -1.2 kg (95% CI -2.38 to -0.01) versus +0.6 kg (95% CI -0.095 to 1.27), P=.006; body fat percentage -0.9%-units (95% CI -1.64 to -0.09) versus +0.3%-units (95% CI -0.28 to 0.73), P=.014; and waist circumference -1.4 cm (95% CI -2.60 to -0.20) versus +0.7 cm (95% CI -0.21 to 1.66), P=.01. They also participated in intervention meetings more actively: median 4 meetings (interquartile range; IQR 4–5) versus 4 meetings (IQR 3–4), P=.009. The key factors in usefulness were: simplicity, integration into daily life, and clear feedback on progress. Conclusions Despite active initial usage, less than 30% of subjects continued using Web or mobile technologies throughout the study. Sustained users achieved better weight-related outcomes than non-sustained users. High non-usage attrition and modest outcomes cast doubt on the potential of technologies to support interventions. PMID:25098385
Personal health technologies in employee health promotion: usage activity, usefulness, and health-related outcomes in a 1-year randomized controlled trial.

PubMed

Mattila, Elina; Orsama, Anna-Leena; Ahtinen, Aino; Hopsu, Leila; Leino, Timo; Korhonen, Ilkka

2013-07-29

Common risk factors such as obesity, poor nutrition, physical inactivity, stress, and sleep deprivation threaten the wellness and work ability of employees. Personal health technologies may help improve engagement in health promotion programs and maintenance of their effect. This study investigated personal health technologies in supporting employee health promotion targeting multiple behavioral health risks. We studied the relations of usage activity to demographic and physiological characteristics, health-related outcomes (weight, aerobic fitness, blood pressure and cholesterol), and the perceived usefulness of technologies in wellness management. We conducted a subgroup analysis of the technology group (114 subjects, 33 males, average age 45 years, average BMI 27.1 kg/m(2)) of a 3-arm randomized controlled trial (N=352). The trial was organized to study the efficacy of a face-to-face group intervention supported by technologies, including Web services, mobile applications, and personal monitoring devices. Technology usage was investigated based on log files and questionnaires. The associations between sustained usage of Web and mobile technologies and demographic and physiological characteristics were analyzed by comparing the baseline data of sustained and non-sustained users. The associations between sustained usage and changes in health-related outcomes were studied by repeated analysis of variance, using data measured by baseline and end questionnaires, and anthropometric and laboratory measurements. The experienced usability, usefulness, motivation, and barriers to using technologies were investigated by 4 questionnaires and 2 interviews. 111 subjects (97.4%) used technologies at some point of the study, and 33 (29.9%) were classified as sustained users of Web or mobile technologies. Simple technologies, weight scales and pedometer, attracted the most users. The sustained users were slightly older 47 years (95% CI 44 to 49) versus 44 years (95% CI 42 to 45), P=.034 and had poorer aerobic fitness at baseline (mean difference in maximal metabolic equivalent 1.0, 95% Cl 0.39 to 1.39; P=.013) than non-sustained users. They succeeded better in weight management: their weight decreased -1.2 kg (95% CI -2.38 to -0.01) versus +0.6 kg (95% CI -0.095 to 1.27), P=.006; body fat percentage -0.9%-units (95% CI -1.64 to -0.09) versus +0.3%-units (95% CI -0.28 to 0.73), P=.014; and waist circumference -1.4 cm (95% CI -2.60 to -0.20) versus +0.7 cm (95% CI -0.21 to 1.66), P=.01. They also participated in intervention meetings more actively: median 4 meetings (interquartile range; IQR 4-5) versus 4 meetings (IQR 3-4), P=.009. The key factors in usefulness were: simplicity, integration into daily life, and clear feedback on progress. Despite active initial usage, less than 30% of subjects continued using Web or mobile technologies throughout the study. Sustained users achieved better weight-related outcomes than non-sustained users. High non-usage attrition and modest outcomes cast doubt on the potential of technologies to support interventions.
WWW Motivation Mining: Finding Treasures for Teaching Evaluation Skills, Grades 7-12. Professional Growth Series.

ERIC Educational Resources Information Center

Small, Ruth V.; Arnone, Marilyn P.

Intended for use by middle or high school teachers and library media specialists, this book describes a World Wide Web evaluation tool developed specifically for use by high school students and designed to provide hands-on experience in critically evaluating the strengths and weaknesses of Web sites. The book uses a workbook format and is…
Web mining for topics defined by complex and precise predicates

NASA Astrophysics Data System (ADS)

Lee, Ching-Cheng; Sampathkumar, Sushma

2004-04-01

The enormous growth of the World Wide Web has made it important to perform resource discovery efficiently for any given topic. Several new techniques have been proposed in the recent years for this kind of topic specific web-mining, and among them a key new technique called focused crawling which is able to crawl topic-specific portions of the web without having to explore all pages. Most existing research on focused crawling considers a simple topic definition that typically consists of one or more keywords connected by an OR operator. However this kind of simple topic definition may result in too many irrelevant pages in which the same keyword appears in a wrong context. In this research we explore new strategies for crawling topic specific portions of the web using complex and precise predicates. A complex predicate will allow the user to precisely specify a topic using Boolean operators such as "AND", "OR" and "NOT". Our work will concentrate on defining a format to specify this kind of a complex topic definition and secondly on devising a crawl strategy to crawl the topic specific portions of the web defined by the complex predicate, efficiently and with minimal overhead. Our new crawl strategy will improve the performance of topic-specific web crawling by reducing the number of irrelevant pages crawled. In order to demonstrate the effectiveness of the above approach, we have built a complete focused crawler called "Eureka" with complex predicate support, and a search engine that indexes and supports end-user searches on the crawled pages.
Bandwidth Constraints to Using Video and Other Rich Media in Behavior Change Websites

PubMed Central

Jazdzewski, Stephen A; McKay, H Garth; Hudson, Clinton R

2005-01-01

Background Web-based behavior change interventions often include rich media (eg, video, audio, and large graphics). The rationale for using rich media includes the need to reach users who are not inclined or able to use text-based website content, encouragement of program engagement, and following the precedent set by news and sports websites. Objectives We describe the development of a bandwidth usage index, which seeks to provide a practical method to gauge the extent to which websites can successfully be used within different Internet access scenarios (eg, dial-up and broadband). Methods We conducted three studies to measure bandwidth consumption. In Study 1, we measured the bandwidth usage index for three video-rich websites (for smoking cessation, for caregivers, and for improving eldercare by family members). We then estimated the number of concurrent users that could be accommodated by each website under various Internet access scenarios. In Study 2, we sought to validate our estimated threshold number of concurrent users by testing the video-rich smoking cessation website with different numbers of concurrent users. In Study 3, we calculated the bandwidth usage index and threshold number of concurrent users for three versions of the smoking cessation website: the video-rich version (tested in Study 1), an audio-rich version, and a Web-enabled CD-ROM version in which all media-rich content was placed on a CD-ROM on the client computer. Results In Study 1, we found that the bandwidth usage index of the video-rich websites ranged from 144 Kbps to 93 Kbps. These results indicated that dial-up modem users would not achieve a “good user experience” with any of the three rich media websites. Results for Study 2 confirmed that usability was compromised when the estimated threshold number of concurrent users was exceeded. Results for Study 3 indicated that changing a website from video- to audio-rich content reduced the bandwidth requirement by almost 50%, but it remained too large to allow satisfactory use in dial-up modem scenarios. The Web-enabled CD-ROM reduced bandwidth requirements such that even a dial-up modem user could have a good user experience with the rich media content. Conclusions We conclude that the bandwidth usage index represents a practical tool that can help developers and researchers to measure the bandwidth requirements of their websites as well as to evaluate the feasibility of certain website designs in terms of specific use cases. These findings are discussed in terms of reaching different groups of users as well accommodating the intended number of concurrent users. We also discuss the promising option of using Web-enabled CD-ROMs to deliver rich media content to users with dial-up Internet access. We introduce a number of researchable themes for improving our ability to develop Web-based behavior change interventions that can better deliver what they promise. PMID:16236701
Bandwidth constraints to using video and other rich media in behavior change websites.

PubMed

Danaher, Brian G; Jazdzewski, Stephen A; McKay, H Garth; Hudson, Clinton R

2005-09-16

Web-based behavior change interventions often include rich media (eg, video, audio, and large graphics). The rationale for using rich media includes the need to reach users who are not inclined or able to use text-based website content, encouragement of program engagement, and following the precedent set by news and sports websites. We describe the development of a bandwidth usage index, which seeks to provide a practical method to gauge the extent to which websites can successfully be used within different Internet access scenarios (eg, dial-up and broadband). We conducted three studies to measure bandwidth consumption. In Study 1, we measured the bandwidth usage index for three video-rich websites (for smoking cessation, for caregivers, and for improving eldercare by family members). We then estimated the number of concurrent users that could be accommodated by each website under various Internet access scenarios. In Study 2, we sought to validate our estimated threshold number of concurrent users by testing the video-rich smoking cessation website with different numbers of concurrent users. In Study 3, we calculated the bandwidth usage index and threshold number of concurrent users for three versions of the smoking cessation website: the video-rich version (tested in Study 1), an audio-rich version, and a Web-enabled CD-ROM version in which all media-rich content was placed on a CD-ROM on the client computer. In Study 1, we found that the bandwidth usage index of the video-rich websites ranged from 144 Kbps to 93 Kbps. These results indicated that dial-up modem users would not achieve a "good user experience" with any of the three rich media websites. Results for Study 2 confirmed that usability was compromised when the estimated threshold number of concurrent users was exceeded. Results for Study 3 indicated that changing a website from video- to audio-rich content reduced the bandwidth requirement by almost 50%, but it remained too large to allow satisfactory use in dial-up modem scenarios. The Web-enabled CD-ROM reduced bandwidth requirements such that even a dial-up modem user could have a good user experience with the rich media content. We conclude that the bandwidth usage index represents a practical tool that can help developers and researchers to measure the bandwidth requirements of their websites as well as to evaluate the feasibility of certain website designs in terms of specific use cases. These findings are discussed in terms of reaching different groups of users as well accommodating the intended number of concurrent users. We also discuss the promising option of using Web-enabled CD-ROMs to deliver rich media content to users with dial-up Internet access. We introduce a number of researchable themes for improving our ability to develop Web-based behavior change interventions that can better deliver what they promise.
Text mining for adverse drug events: the promise, challenges, and state of the art.

PubMed

Harpaz, Rave; Callahan, Alison; Tamang, Suzanne; Low, Yen; Odgers, David; Finlayson, Sam; Jung, Kenneth; LePendu, Paea; Shah, Nigam H

2014-10-01

Text mining is the computational process of extracting meaningful information from large amounts of unstructured text. It is emerging as a tool to leverage underutilized data sources that can improve pharmacovigilance, including the objective of adverse drug event (ADE) detection and assessment. This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining, and discusses several data sources-such as biomedical literature, clinical narratives, product labeling, social media, and Web search logs-that are amenable to text mining for pharmacovigilance. Given the state of the art, it appears text mining can be applied to extract useful ADE-related information from multiple textual sources. Nonetheless, further research is required to address remaining technical challenges associated with the text mining methodologies, and to conclusively determine the relative contribution of each textual source to improving pharmacovigilance.
Bringing Control System User Interfaces to the Web

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Xihui; Kasemir, Kay

With the evolution of web based technologies, especially HTML5 [1], it becomes possible to create web-based control system user interfaces (UI) that are cross-browser and cross-device compatible. This article describes two technologies that facilitate this goal. The first one is the WebOPI [2], which can seamlessly display CSS BOY [3] Operator Interfaces (OPI) in web browsers without modification to the original OPI file. The WebOPI leverages the powerful graphical editing capabilities of BOY and provides the convenience of re-using existing OPI files. On the other hand, it uses generic JavaScript and a generic communication mechanism between the web browser andmore » web server. It is not optimized for a control system, which results in unnecessary network traffic and resource usage. Our second technology is the WebSocket-based Process Data Access (WebPDA) [4]. It is a protocol that provides efficient control system data communication using WebSocket [5], so that users can create web-based control system UIs using standard web page technologies such as HTML, CSS and JavaScript. WebPDA is control system independent, potentially supporting any type of control system.« less
A web-based platform to support an evidence-based mental health intervention: lessons from the CBITS web site.

PubMed

Vona, Pamela; Wilmoth, Pete; Jaycox, Lisa H; McMillen, Janey S; Kataoka, Sheryl H; Wong, Marleen; DeRosier, Melissa E; Langley, Audra K; Kaufman, Joshua; Tang, Lingqi; Stein, Bradley D

2014-11-01

To explore the role of Web-based platforms in behavioral health, the study examined usage of a Web site for supporting training and implementation of an evidence-based intervention. Using data from an online registration survey and Google Analytics, the investigators examined user characteristics and Web site utilization. Site engagement was substantial across user groups. Visit duration differed by registrants' characteristics. Less experienced clinicians spent more time on the Web site. The training section accounted for most page views across user groups. Individuals previously trained in the Cognitive-Behavioral Intervention for Trauma in Schools intervention viewed more implementation assistance and online community pages than did other user groups. Web-based platforms have the potential to support training and implementation of evidence-based interventions for clinicians of varying levels of experience and may facilitate more rapid dissemination. Web-based platforms may be promising for trauma-related interventions, because training and implementation support should be readily available after a traumatic event.
An initial log analysis of usage patterns on a research networking system.

PubMed

Boland, Mary Regina; Trembowelski, Sylvia; Bakken, Suzanne; Weng, Chunhua

2012-08-01

Usage data for research networking systems (RNSs) are valuable but generally unavailable for understanding scientific professionals' information needs and online collaborator seeking behaviors. This study contributes a method for evaluating RNSs and initial usage knowledge of one RNS obtained from using this method. We designed a log for an institutional RNS, defined categories of users and tasks, and analyzed correlations between usage patterns and user and query types. Our results show that scientific professionals spend more time performing deep Web searching on RNSs than generic Google users and we also show that retrieving scientist profiles is faster on an RNS than on Google (3.5 seconds vs. 34.2 seconds) whereas organization-specific browsing on a RNS takes longer than on Google (117.0 seconds vs. 34.2 seconds). Usage patterns vary by user role, e.g., faculty performed more informational queries than administrators, which implies role-specific user support is needed for RNSs. © 2012 Wiley Periodicals, Inc.
An Initial Log Analysis of Usage Patterns on a Research Networking System

PubMed Central

Boland, Mary Regina; Trembowelski, Sylvia; Bakken, Suzanne; Weng, Chunhua

2012-01-01

Abstract Usage data for research networking systems (RNSs) are valuable but generally unavailable for understanding scientific professionals’ information needs and online collaborator seeking behaviors. This study contributes a method for evaluating RNSs and initial usage knowledge of one RNS obtained from using this method. We designed a log for an institutional RNS, defined categories of users and tasks, and analyzed correlations between usage patterns and user and query types. Our results show that scientific professionals spend more time performing deep Web searching on RNSs than generic Google users and we also show that retrieving scientist profiles is faster on an RNS than on Google (3.5 seconds vs. 34.2 seconds) whereas organization‐specific browsing on a RNS takes longer than on Google (117.0 seconds vs. 34.2 seconds). Usage patterns vary by user role, e.g., faculty performed more informational queries than administrators, which implies role‐specific user support is needed for RNSs. Clin Trans Sci 2012; Volume 5: 340–347 PMID:22883612
Stress monitoring versus microseismic ruptures in an active deep mine

NASA Astrophysics Data System (ADS)

Tonnellier, Alice; Bouffier, Christian; Bigarré, Pascal; Nyström, Anders; Österberg, Anders; Fjellström, Peter

2015-04-01

Nowadays, underground mining industry has developed high-technology mass mining methods to optimise the productivity at deep levels. Such massive extraction induces high-level stress redistribution generating seismic events around the mining works, threatening safety and economics. For this reason mining irregular deep ore bodies calls for steadily enhanced scientific practises and technologies to guarantee the mine environment to be safer and stable for the miners and the infrastructures. INERIS, within the framework of the FP7 European project I2Mine and in partnership with the Swedish mining company Boliden, has developed new methodologies in order to monitor both quasi-static stress changes and ruptures in a seismic prone area. To this purpose, a unique local permanent microseismic and stress monitoring network has been installed into the deep-working Garpenberg mine situated to the north of Uppsala (Sweden). In this mine, ore is extracted using sublevel stoping with paste fill production/distribution system and long-hole drilling method. This monitoring network has been deployed between about 1100 and 1250 meter depth. It consists in six 1-component and five 3-component microseismic probes (14-Hz geophones) deployed in the Lappberget area, in addition to three 3D stress monitoring cells that focus on a very local exploited area. Objective is three-fold: to quantify accurately quasi-static stress changes and freshly-induced stress gradients with drift development in the orebody, to study quantitatively those stress changes versus induced detected and located microseismic ruptures, and possibly to identify quasi-static stress transfer from those seismic ruptures. Geophysical and geotechnical data are acquired continuously and automatically transferred to INERIS datacenter through the web. They are made available on a secured web cloud monitoring infrastructure called e.cenaris and completed with mine data. Such interface enables the visualisation of the monitoring data coming from the mine in quasi-real time and facilitates information exchanges and decision making for experts and stakeholders. On the basis of these data acquisition and sharing, preliminary analysis has been started to highlight whether stress variations and seismic sources behaviour might be directly bound with mine working evolution and could improve the knowledge on the equilibrium states inside the mine. Knowing such parameters indeed will be a potential solution to understand better the response of deep mining activities to the exploitation solicitations and to develop, if possible, methods to prevent from major hazards such as rock bursts and other ground failure phenomena.
Australian health professionals' social media (Web 2.0) adoption trends: early 21st century health care delivery and practice promotion.

PubMed

Usher, Wayne T

2012-01-01

This study was concerned with identifying reasons behind patterns of social media (Web 2.0) usage associated with eight of Australia's major health professions. Attention was given to uncovering some of the more significant motivations for the resistance or adoption of Web 2.0 technologies for health care delivery and practice promotion by Australian health professionals. Surveys were developed from a common set of questions with specific variations between professions negotiated with professional health societies. Survey questions were constructed in an attempt to identify Web 2.0 adoption trends. An online survey (www.limesurvey.org) was used to collect data. Initial data preparation involved the development of one integrated SPSS file to incorporate all responses from the eight surveys undertaken. Initial data analysis applied Frequencies and Crosstabs to the identified groups and provided a profile of respondents by key business and demographic characteristics. Of the 935 respondents, 9.5% of participants indicated that they used Web 2.0 for their professional work, 19.1% of them did not use it for work but used it for their personal needs and 71.3% of them did not use Web 2.0 at all. Participants have indicated that the main reason for 'choosing not to adopt' Web 2.0 applications as a way of delivering health care to their patients is due to the health professionals' lack of understanding of Web 2.0 (83.3%), while the main reason for 'choosing to adopt' Web 2.0 applications is the perception of Web 2.0 as a quick and effective method of communication (73.0%). This study has indicated that Australian health professionals 'choose not to adopt' Web 2.0 usage as a way of delivering health care primarily due to 'a lack of understanding as to how social media would be used in health care' (83.3%). This study identifies that Australian health professionals are interacting with Web 2.0 technologies in their private lives but are failing to see how such technologies might be used throughout their professions. Australian health professionals are willing to undertake online educational courses (n=553, 58%) designed to upskill them about how Web 2.0 may be used for practice promotion and health care delivery.
Data Mining of Web-Based Documents on Social Networking Sites That Included Suicide-Related Words Among Korean Adolescents.

PubMed

Song, Juyoung; Song, Tae Min; Seo, Dong-Chul; Jin, Jae Hyun

2016-12-01

To investigate online search activity of suicide-related words in South Korean adolescents through data mining of social media Web sites as the suicide rate in South Korea is one of the highest in the world. Out of more than 2.35 billion posts for 2 years from January 1, 2011 to December 31, 2012 on 163 social media Web sites in South Korea, 99,693 suicide-related documents were retrieved by Crawler and analyzed using text mining and opinion mining. These data were further combined with monthly employment rate, monthly rental prices index, monthly youth suicide rate, and monthly number of reported bully victims to fit multilevel models as well as structural equation models. The link from grade pressure to suicide risk showed the largest standardized path coefficient (beta = .357, p < .001) in structural models and a significant random effect (p < .01) in multilevel models. Depression was a partial mediator between suicide risk and grade pressure, low body image, victims of bullying, and concerns about disease. The largest total effect was observed in the grade pressure to depression to suicide risk. The multilevel models indicate about 27% of the variance in the daily suicide-related word search activity is explained by month-to-month variations. A lower employment rate, a higher rental prices index, and more bullying were associated with an increased suicide-related word search activity. Academic pressure appears to be the biggest contributor to Korean adolescents' suicide risk. Real-time suicide-related word search activity monitoring and response system needs to be developed. Copyright © 2016 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Understanding usage of a hybrid website and smartphone app for weight management: a mixed-methods study.

PubMed

Morrison, Leanne G; Hargood, Charlie; Lin, Sharon Xiaowen; Dennison, Laura; Joseph, Judith; Hughes, Stephanie; Michaelides, Danius T; Johnston, Derek; Johnston, Marie; Michie, Susan; Little, Paul; Smith, Peter Wf; Weal, Mark J; Yardley, Lucy

2014-10-22

Advancements in mobile phone technology offer huge potential for enhancing the timely delivery of health behavior change interventions. The development of smartphone-based health interventions (apps) is a rapidly growing field of research, yet there have been few longitudinal examinations of how people experience and use these apps within their day-to-day routines, particularly within the context of a hybrid Web- and app-based intervention. This study used an in-depth mixed-methods design to examine individual variation in (1) impact on self-reported goal engagement (ie, motivation, self-efficacy, awareness, effort, achievement) of access to a weight management app (POWeR Tracker) when provided alongside a Web-based weight management intervention (POWeR) and (2) usage and views of POWeR Tracker. Thirteen adults were provided access to POWeR and were monitored over a 4-week period. Access to POWeR Tracker was provided in 2 alternate weeks (ie, weeks 1 and 3 or weeks 2 and 4). Participants' goal engagement was measured daily via self-report. Mixed effects models were used to examine change in goal engagement between the weeks when POWeR Tracker was and was not available and whether the extent of change in goal engagement varied between individual participants. Usage of POWeR and POWeR Tracker was automatically recorded for each participant. Telephone interviews were conducted and analyzed using inductive thematic analysis to further explore participants' experiences using POWeR and POWeR Tracker. Access to POWeR Tracker was associated with a significant increase in participants' awareness of their eating (β1=0.31, P=.04) and physical activity goals (β1=0.28, P=.03). The level of increase varied between individual participants. Usage data showed that participants used the POWeR website for similar amounts of time during the weeks when POWeR Tracker was (mean 29 minutes, SD 31 minutes) and was not available (mean 27 minutes, SD 33 minutes). POWeR Tracker was mostly accessed in short bursts (mean 3 minutes, SD 2 minutes) during convenient moments or moments when participants deemed the intervention content most relevant. The qualitative data indicated that nearly all participants agreed that it was more convenient to access information on-the-go via their mobiles compared to a computer. However, participants varied in their views and usage of the Web- versus app-based components and the informational versus tracking tools provided by POWeR Tracker. This study provides evidence that smartphones have the potential to improve individuals' engagement with their health-related goals when used as a supplement to an existing online intervention. The perceived convenience of mobile access to information does not appear to deter use of Web-based interventions or strengthen the impact of app access on goal engagement. A mixed-methods design enabled exploration of individual variation in daily usage of the app-based tools.
Understanding Usage of a Hybrid Website and Smartphone App for Weight Management: A Mixed-Methods Study

PubMed Central

Hargood, Charlie; Lin, Sharon Xiaowen; Dennison, Laura; Joseph, Judith; Hughes, Stephanie; Michaelides, Danius T; Johnston, Derek; Johnston, Marie; Michie, Susan; Little, Paul; Smith, Peter WF; Weal, Mark J; Yardley, Lucy

2014-01-01

Background Advancements in mobile phone technology offer huge potential for enhancing the timely delivery of health behavior change interventions. The development of smartphone-based health interventions (apps) is a rapidly growing field of research, yet there have been few longitudinal examinations of how people experience and use these apps within their day-to-day routines, particularly within the context of a hybrid Web- and app-based intervention. Objective This study used an in-depth mixed-methods design to examine individual variation in (1) impact on self-reported goal engagement (ie, motivation, self-efficacy, awareness, effort, achievement) of access to a weight management app (POWeR Tracker) when provided alongside a Web-based weight management intervention (POWeR) and (2) usage and views of POWeR Tracker. Methods Thirteen adults were provided access to POWeR and were monitored over a 4-week period. Access to POWeR Tracker was provided in 2 alternate weeks (ie, weeks 1 and 3 or weeks 2 and 4). Participants’ goal engagement was measured daily via self-report. Mixed effects models were used to examine change in goal engagement between the weeks when POWeR Tracker was and was not available and whether the extent of change in goal engagement varied between individual participants. Usage of POWeR and POWeR Tracker was automatically recorded for each participant. Telephone interviews were conducted and analyzed using inductive thematic analysis to further explore participants’ experiences using POWeR and POWeR Tracker. Results Access to POWeR Tracker was associated with a significant increase in participants’ awareness of their eating (β1=0.31, P=.04) and physical activity goals (β1=0.28, P=.03). The level of increase varied between individual participants. Usage data showed that participants used the POWeR website for similar amounts of time during the weeks when POWeR Tracker was (mean 29 minutes, SD 31 minutes) and was not available (mean 27 minutes, SD 33 minutes). POWeR Tracker was mostly accessed in short bursts (mean 3 minutes, SD 2 minutes) during convenient moments or moments when participants deemed the intervention content most relevant. The qualitative data indicated that nearly all participants agreed that it was more convenient to access information on-the-go via their mobiles compared to a computer. However, participants varied in their views and usage of the Web- versus app-based components and the informational versus tracking tools provided by POWeR Tracker. Conclusions This study provides evidence that smartphones have the potential to improve individuals’ engagement with their health-related goals when used as a supplement to an existing online intervention. The perceived convenience of mobile access to information does not appear to deter use of Web-based interventions or strengthen the impact of app access on goal engagement. A mixed-methods design enabled exploration of individual variation in daily usage of the app-based tools. PMID:25355131

The potential of text mining in data integration and network biology for plant research: a case study on Arabidopsis.

PubMed

Van Landeghem, Sofie; De Bodt, Stefanie; Drebert, Zuzanna J; Inzé, Dirk; Van de Peer, Yves

2013-03-01

Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies.
Smartphone dependence classification using tensor factorization.

PubMed

Choi, Jingyun; Rho, Mi Jung; Kim, Yejin; Yook, In Hye; Yu, Hwanjo; Kim, Dai-Jin; Choi, In Young

2017-01-01

Excessive smartphone use causes personal and social problems. To address this issue, we sought to derive usage patterns that were directly correlated with smartphone dependence based on usage data. This study attempted to classify smartphone dependence using a data-driven prediction algorithm. We developed a mobile application to collect smartphone usage data. A total of 41,683 logs of 48 smartphone users were collected from March 8, 2015, to January 8, 2016. The participants were classified into the control group (SUC) or the addiction group (SUD) using the Korean Smartphone Addiction Proneness Scale for Adults (S-Scale) and a face-to-face offline interview by a psychiatrist and a clinical psychologist (SUC = 23 and SUD = 25). We derived usage patterns using tensor factorization and found the following six optimal usage patterns: 1) social networking services (SNS) during daytime, 2) web surfing, 3) SNS at night, 4) mobile shopping, 5) entertainment, and 6) gaming at night. The membership vectors of the six patterns obtained a significantly better prediction performance than the raw data. For all patterns, the usage times of the SUD were much longer than those of the SUC. From our findings, we concluded that usage patterns and membership vectors were effective tools to assess and predict smartphone dependence and could provide an intervention guideline to predict and treat smartphone dependence based on usage data.
Lamprey: tracking users on the World Wide Web.

PubMed

Felciano, R M; Altman, R B

1996-01-01

Tracking individual web sessions provides valuable information about user behavior. This information can be used for general purpose evaluation of web-based user interfaces to biomedical information systems. To this end, we have developed Lamprey, a tool for doing quantitative and qualitative analysis of Web-based user interfaces. Lamprey can be used from any conforming browser, and does not require modification of server or client software. By rerouting WWW navigation through a centralized filter, Lamprey collects the sequence and timing of hyperlinks used by individual users to move through the web. Instead of providing marginal statistics, it retains the full information required to recreate a user session. We have built Lamprey as a standard Common Gateway Interface (CGI) that works with all standard WWW browsers and servers. In this paper, we describe Lamprey and provide a short demonstration of this approach for evaluating web usage patterns.
LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes

PubMed Central

Cañada, Andres; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso

2017-01-01

Abstract A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes—CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es PMID:28531339
Mining Available Data from the United States Environmental Protection Agency to Support Rapid Life Cycle Inventory Modeling of Chemical Manufacturing.

PubMed

Cashman, Sarah A; Meyer, David E; Edelen, Ashley N; Ingwersen, Wesley W; Abraham, John P; Barrett, William M; Gonzalez, Michael A; Randall, Paul M; Ruiz-Mercado, Gerardo; Smith, Raymond L

2016-09-06

Demands for quick and accurate life cycle assessments create a need for methods to rapidly generate reliable life cycle inventories (LCI). Data mining is a suitable tool for this purpose, especially given the large amount of available governmental data. These data are typically applied to LCIs on a case-by-case basis. As linked open data becomes more prevalent, it may be possible to automate LCI using data mining by establishing a reproducible approach for identifying, extracting, and processing the data. This work proposes a method for standardizing and eventually automating the discovery and use of publicly available data at the United States Environmental Protection Agency for chemical-manufacturing LCI. The method is developed using a case study of acetic acid. The data quality and gap analyses for the generated inventory found that the selected data sources can provide information with equal or better reliability and representativeness on air, water, hazardous waste, on-site energy usage, and production volumes but with key data gaps including material inputs, water usage, purchased electricity, and transportation requirements. A comparison of the generated LCI with existing data revealed that the data mining inventory is in reasonable agreement with existing data and may provide a more-comprehensive inventory of air emissions and water discharges. The case study highlighted challenges for current data management practices that must be overcome to successfully automate the method using semantic technology. Benefits of the method are that the openly available data can be compiled in a standardized and transparent approach that supports potential automation with flexibility to incorporate new data sources as needed.
The utility of web mining for epidemiological research: studying the association between parity and cancer risk [Web Mining for Epidemiological Research. Assessing its Utility in Exploring the Association Between Parity and Cancer Risk

DOE PAGES

Tourassi, Georgia; Yoon, Hong-Jun; Xu, Songhua; ...

2015-11-27

Background: The World Wide Web has emerged as a powerful data source for epidemiological studies related to infectious disease surveillance. However, its potential for cancer-related epidemiological discoveries is largely unexplored. Methods: Using advanced web crawling and tailored information extraction procedures we automatically collected and analyzed the text content of 79,394 online obituary articles published between 1998 and 2014. The collected data included 51,911 cancer (27,330 breast; 9,470 lung; 6,496 pancreatic; 6,342 ovarian; 2,273 colon) and 27,483 non-cancer cases. With the derived information, we replicated a case-control study design to investigate the association between parity and cancer risk. Age-adjusted odds ratiosmore » (ORs) with 95% confidence intervals (CIs) were calculated for each cancer type and compared to those reported in large-scale epidemiological studies. Results: Parity was found to be associated with a significantly reduced risk of breast cancer (OR=0.78, 95% CI = 0.75 to 0.82), pancreatic cancer (OR=0.78, 95% CI = 0.72 to 0.83), colon cancer (OR=0.67, 95% CI = 0.60 to 0.74), and ovarian cancer (OR=0.58, 95% CI = 0.54 to 0.62). Marginal association was found for lung cancer prevalence (OR=0.87, 95% CI = 0.81 to 0.92). The linear trend between multi-parity and reduced cancer risk was dramatically more pronounced for breast and ovarian cancer than the other cancers included in the analysis. Conclusion: This large web-mining study on parity and cancer risk produced findings very similar to those reported with traditional observational studies. It may be used as a promising strategy to generate study hypotheses for guiding and prioritizing future epidemiological studies.« less
The utility of web mining for epidemiological research: studying the association between parity and cancer risk [Web Mining for Epidemiological Research. Assessing its Utility in Exploring the Association Between Parity and Cancer Risk

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tourassi, Georgia; Yoon, Hong-Jun; Xu, Songhua

Background: The World Wide Web has emerged as a powerful data source for epidemiological studies related to infectious disease surveillance. However, its potential for cancer-related epidemiological discoveries is largely unexplored. Methods: Using advanced web crawling and tailored information extraction procedures we automatically collected and analyzed the text content of 79,394 online obituary articles published between 1998 and 2014. The collected data included 51,911 cancer (27,330 breast; 9,470 lung; 6,496 pancreatic; 6,342 ovarian; 2,273 colon) and 27,483 non-cancer cases. With the derived information, we replicated a case-control study design to investigate the association between parity and cancer risk. Age-adjusted odds ratiosmore » (ORs) with 95% confidence intervals (CIs) were calculated for each cancer type and compared to those reported in large-scale epidemiological studies. Results: Parity was found to be associated with a significantly reduced risk of breast cancer (OR=0.78, 95% CI = 0.75 to 0.82), pancreatic cancer (OR=0.78, 95% CI = 0.72 to 0.83), colon cancer (OR=0.67, 95% CI = 0.60 to 0.74), and ovarian cancer (OR=0.58, 95% CI = 0.54 to 0.62). Marginal association was found for lung cancer prevalence (OR=0.87, 95% CI = 0.81 to 0.92). The linear trend between multi-parity and reduced cancer risk was dramatically more pronounced for breast and ovarian cancer than the other cancers included in the analysis. Conclusion: This large web-mining study on parity and cancer risk produced findings very similar to those reported with traditional observational studies. It may be used as a promising strategy to generate study hypotheses for guiding and prioritizing future epidemiological studies.« less
Monitoring the Earth System Grid Federation through the ESGF Dashboard

NASA Astrophysics Data System (ADS)

Fiore, S.; Bell, G. M.; Drach, B.; Williams, D.; Aloisio, G.

2012-12-01

The Climate Model Intercomparison Project, phase 5 (CMIP5) is a global effort coordinated by the World Climate Research Programme (WCRP) involving tens of modeling groups spanning 19 countries. It is expected the CMIP5 distributed data archive will total upwards of 3.5 petabytes, stored across several ESGF Nodes on four continents (North America, Europe, Asia, and Australia). The Earth System Grid Federation (ESGF) provides the IT infrastructure to support the CMIP5. In this regard, the monitoring of the distributed ESGF infrastructure represents a crucial part carried out by the ESGF Dashboard. The ESGF Dashboard is a software component of the ESGF stack, responsible for collecting key information about the status of the federation in terms of: 1) Network topology (peer-groups composition), 2) Node type (host/services mapping), 3) Registered users (including their Identity Providers), 4) System metrics (e.g., round-trip time, service availability, CPU, memory, disk, processes, etc.), 5) Download metrics (both at the Node and federation level). The last class of information is very important since it provides a strong insight of the CMIP5 experiment: the data usage statistics. In this regard, CMCC and LLNL have developed a data analytics management system for the analysis of both node-level and federation-level data usage statistics. It provides data usage statistics aggregated by project, model, experiment, variable, realm, peer node, time, ensemble, datasetname (including version), etc. The back-end of the system is able to infer the data usage information of the entire federation, by carrying out: - at node level: a 18-step reconciliation process on the peer node databases (i.e. node manager and publisher DB) which provides a 15-dimension datawarehouse with local statistics and - at global level: an aggregation process which federates the data usage statistics into a 16-dimension datawarehouse with federation-level data usage statistics. The front-end of the Dashboard system exploits a web desktop approach, which joins the pervasivity of a web application with the flexibility of a desktop one.
An open data mining framework for the analysis of medical images: application on obstructive nephropathy microscopy images.

PubMed

Doukas, Charalampos; Goudas, Theodosis; Fischer, Simon; Mierswa, Ingo; Chatziioannou, Aristotle; Maglogiannis, Ilias

2010-01-01

This paper presents an open image-mining framework that provides access to tools and methods for the characterization of medical images. Several image processing and feature extraction operators have been implemented and exposed through Web Services. Rapid-Miner, an open source data mining system has been utilized for applying classification operators and creating the essential processing workflows. The proposed framework has been applied for the detection of salient objects in Obstructive Nephropathy microscopy images. Initial classification results are quite promising demonstrating the feasibility of automated characterization of kidney biopsy images.
A novel web informatics approach for automated surveillance of cancer mortality trends✩

PubMed Central

Tourassi, Georgia; Yoon, Hong-Jun; Xu, Songhua

2016-01-01

Cancer surveillance data are collected every year in the United States via the National Program of Cancer Registries (NPCR) and the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute (NCI). General trends are closely monitored to measure the nation's progress against cancer. The objective of this study was to apply a novel web informatics approach for enabling fully automated monitoring of cancer mortality trends. The approach involves automated collection and text mining of online obituaries to derive the age distribution, geospatial, and temporal trends of cancer deaths in the US. Using breast and lung cancer as examples, we mined 23,850 cancer-related and 413,024 general online obituaries spanning the timeframe 2008–2012. There was high correlation between the web-derived mortality trends and the official surveillance statistics reported by NCI with respect to the age distribution (ρ = 0.981 for breast; ρ = 0.994 for lung), the geospatial distribution (ρ = 0.939 for breast; ρ = 0.881 for lung), and the annual rates of cancer deaths (ρ = 0.661 for breast; ρ = 0.839 for lung). Additional experiments investigated the effect of sample size on the consistency of the web-based findings. Overall, our study findings support web informatics as a promising, cost-effective way to dynamically monitor spatiotemporal cancer mortality trends. PMID:27044930
A novel web informatics approach for automated surveillance of cancer mortality trends

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tourassi, Georgia; Yoon, Hong -Jun; Xu, Songhua

Cancer surveillance data are collected every year in the United States via the National Program of Cancer Registries (NPCR) and the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute (NCI). General trends are closely monitored to measure the nation’s progress against cancer. The objective of this study was to apply a novel web informatics approach for enabling fully automated monitoring of cancer mortality trends. The approach involves automated collection and text mining of online obituaries to derive the age distribution, geospatial, and temporal trends of cancer deaths in the US. Using breast and lung cancer asmore » examples, we mined 23,850 cancer-related and 413,024 general online obituaries spanning the timeframe 2008–2012. There was high correlation between the web-derived mortality trends and the official surveillance statistics reported by NCI with respect to the age distribution (ρ = 0.981 for breast; ρ = 0.994 for lung), the geospatial distribution (ρ = 0.939 for breast; ρ = 0.881 for lung), and the annual rates of cancer deaths (ρ = 0.661 for breast; ρ = 0.839 for lung). Additional experiments investigated the effect of sample size on the consistency of the web-based findings. Altogether, our study findings support web informatics as a promising, cost-effective way to dynamically monitor spatiotemporal cancer mortality trends.« less
A novel web informatics approach for automated surveillance of cancer mortality trends

DOE PAGES

Tourassi, Georgia; Yoon, Hong -Jun; Xu, Songhua

2016-04-01

Cancer surveillance data are collected every year in the United States via the National Program of Cancer Registries (NPCR) and the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute (NCI). General trends are closely monitored to measure the nation’s progress against cancer. The objective of this study was to apply a novel web informatics approach for enabling fully automated monitoring of cancer mortality trends. The approach involves automated collection and text mining of online obituaries to derive the age distribution, geospatial, and temporal trends of cancer deaths in the US. Using breast and lung cancer asmore » examples, we mined 23,850 cancer-related and 413,024 general online obituaries spanning the timeframe 2008–2012. There was high correlation between the web-derived mortality trends and the official surveillance statistics reported by NCI with respect to the age distribution (ρ = 0.981 for breast; ρ = 0.994 for lung), the geospatial distribution (ρ = 0.939 for breast; ρ = 0.881 for lung), and the annual rates of cancer deaths (ρ = 0.661 for breast; ρ = 0.839 for lung). Additional experiments investigated the effect of sample size on the consistency of the web-based findings. Altogether, our study findings support web informatics as a promising, cost-effective way to dynamically monitor spatiotemporal cancer mortality trends.« less
TOPSAN: a dynamic web database for structural genomics.

PubMed

Ellrott, Kyle; Zmasek, Christian M; Weekes, Dana; Sri Krishna, S; Bakolitsa, Constantina; Godzik, Adam; Wooley, John

2011-01-01

The Open Protein Structure Annotation Network (TOPSAN) is a web-based collaboration platform for exploring and annotating structures determined by structural genomics efforts. Characterization of those structures presents a challenge since the majority of the proteins themselves have not yet been characterized. Responding to this challenge, the TOPSAN platform facilitates collaborative annotation and investigation via a user-friendly web-based interface pre-populated with automatically generated information. Semantic web technologies expand and enrich TOPSAN's content through links to larger sets of related databases, and thus, enable data integration from disparate sources and data mining via conventional query languages. TOPSAN can be found at http://www.topsan.org.
The development and testing of a linear induction motor being fed from the source with limited electric power

NASA Astrophysics Data System (ADS)

Tiunov, V. V.

2018-02-01

The report provides results of the research related to the tubular linear induction motors’ application. The motors’ design features, a calculation model, a description of test specimens for mining and electric power industry are introduced. The most attention is given to the single-phase motors for high voltage switches drives with the usage of inexpensive standard single-phase transformers for motors’ power supply. The method of the motor’s parameters determination, when the motor is being fed from the transformer, working in the overload mode, was described, and the results of it practical usage were good enough for the engineering practice.
Attachment Style and Internet Addiction: An Online Survey

PubMed Central

Schott, Markus; Decker, Oliver; Sindelar, Brigitte

2017-01-01

Background One of the clinically relevant problems of Internet use is the phenomenon of Internet addiction. Considering the fact that there is ample evidence for the relationship between attachment style and substance abuse, it stands to reason that attachment theory can also make an important contribution to the understanding of the pathogenesis of Internet addiction. Objective The aim of this study was to examine people’s tendency toward pathological Internet usage in relation to their attachment style. Methods An online survey was conducted. Sociodemographic data, attachment style (Bielefeld questionnaire partnership expectations), symptoms of Internet addiction (scale for online addiction for adults), used Web-based services, and online relationship motives (Cyber Relationship Motive Scale, CRMS-D) were assessed. In order to confirm the findings, a study using the Rorschach test was also conducted. Results In total, 245 subjects were recruited. Participants with insecure attachment style showed a higher tendency to pathological Internet usage compared with securely attached participants. An ambivalent attachment style was particularly associated with pathological Internet usage. Escapist and social-compensatory motives played an important role for insecurely attached subjects. However, there were no significant effects with respect to Web-based services and apps used. Results of the analysis of the Rorschach protocol with 16 subjects corroborated these results. Users with pathological Internet use frequently showed signs of infantile relationship structures in the context of social groups. This refers to the results of the Web-based survey, in which interpersonal relationships were the result of an insecure attachment style. Conclusions Pathological Internet use was a function of insecure attachment and limited interpersonal relationships. PMID:28526662
Next-Generation Methods for HIV Partner Services: A Systematic Review.

PubMed

Hochberg, Chad H; Berringer, Kathryn; Schneider, John A

2015-09-01

Partner notification is a widely accepted method whose intent is to limit onward HIV transmission. With increasing use of new technologies such as text messaging, e-mail, and social network sites, there is growing interest in using these techniques for "next-generation" HIV partner services (PS). We conducted a systematic review to assess the use and effectiveness of these technologies in HIV PS. Our literature search resulted in 1343 citations, with 7 meeting inclusion criteria. We found programs in 2 domains: (1) Public Health Department usage of new technologies to augment traditional partner notification (n = 3) and (2) patient or provider-led usage of partner notification Web sites (n = 4) The health department-based efforts showed an ability to find new cases in a previously unreachable population but in the limited comparisons to traditional PS had a lower rate of successful contact. Usage data from the partner notification Web sites revealed a high total number of e-notifications sent, with less than 10% of cards sent for HIV. Clear evidence on outcomes and directly traceable utilization for these Web services was lacking. When given a choice, most clients chose to send e-notifications via text versus e-mail. Although successful notification may be lower overall, use of next-generation services provides an avenue to contact those who would previously have been untraceable. Additional research is needed to determine to what extent technology-enhanced PS improves the identification of newly infected persons as well as the initiation of new prevention interventions for HIV-negative clients within high-risk networks.
SA-Search: a web tool for protein structure mining based on a Structural Alphabet

PubMed Central

Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre

2004-01-01

SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search. PMID:15215446
SA-Search: a web tool for protein structure mining based on a Structural Alphabet.

PubMed

Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre

2004-07-01

SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search.
Interactive text mining with Pipeline Pilot: a bibliographic web-based tool for PubMed.

PubMed

Vellay, S G P; Latimer, N E Miller; Paillard, G

2009-06-01

Text mining has become an integral part of all research in the medical field. Many text analysis software platforms support particular use cases and only those. We show an example of a bibliographic tool that can be used to support virtually any use case in an agile manner. Here we focus on a Pipeline Pilot web-based application that interactively analyzes and reports on PubMed search results. This will be of interest to any scientist to help identify the most relevant papers in a topical area more quickly and to evaluate the results of query refinement. Links with Entrez databases help both the biologist and the chemist alike. We illustrate this application with Leishmaniasis, a neglected tropical disease, as a case study.
Evaluation of longitudinal tracking and data mining for an imaging informatics-based multiple sclerosis e-folder (Conference Presentation)

NASA Astrophysics Data System (ADS)

Ma, Kevin C.; Forsyth, Sydney; Amezcua, Lilyana; Liu, Brent J.

2017-03-01

We have designed and developed a multiple sclerosis eFolder system for patient data storage, image viewing, and automatic lesion quantification results to allow patient tracking. The web-based system aims to be integrated in DICOM-compliant clinical and research environments to aid clinicians in patient treatments and data analysis. The system quantifies lesion volumes, identify and register lesion locations to track shifts in volume and quantity of lesions in a longitudinal study. We aim to evaluate the two most important features of the system, data mining and longitudinal lesion tracking, to demonstrate the MS eFolder's capability in improving clinical workflow efficiency and outcome analysis for research. In order to evaluate data mining capabilities, we have collected radiological and neurological data from 72 patients, 36 Caucasian and 36 Hispanic matched by gender, disease duration, and age. Data analysis on those patients based on ethnicity is performed, and analysis results are displayed by the system's web-based user interface. The data mining module is able to successfully separate Hispanic and Caucasian patients and compare their disease profiles. For longitudinal lesion tracking, we have collected 4 longitudinal cases and simulated different lesion growths over the next year. As a result, the eFolder is able to detect changes in lesion volume and identifying lesions with the most changes. Data mining and lesion tracking evaluation results show high potential of eFolder's usefulness in patientcare and informatics research for multiple sclerosis.

Ecogeochemistry of the subsurface food web at pH 0-2.5 in Iron Mountain, California, U.S.A.

USGS Publications Warehouse

Robbins, E.I.; Rodgers, T.M.; Alpers, Charles N.; Nordstrom, D. Kirk

2000-01-01

Pyrite oxidation in the underground mining environment of Iron Mountain, California, has created the most acidic pH values ever reported in aquatic systems. Sulfate values as high as 120 000 mg l-1 and iron as high as 27 600 mg l-1 have been measured in the mine water, which also carries abundant other dissolved metals including Al, Zn, Cu, Cd, Mn, Sb and Pb. Extreme acidity and high metal concentrations apparently do not preclude the presence of an underground acidophilic food web, which has developed with bacterial biomass at the base and heliozoans as top predators. Slimes, oil-like films, flexible and inflexible stalactites, sediments, water and precipitates were found to have distinctive communities. A variety of filamentous and non-filamentous bacteria grew in slimes in water having pH values < 1.0. Fungal hyphae colonize stalactites dripping pH 1.0 water; they may help to form these drip structures. Motile hypotrichous ciliates and bdelloid rotifers are particularly abundant in slimes having a pH of 1.5. Holdfasts of the iron bacterium Leptothrix discophora attach to biofilms covering pools of standing water having a pH of 2.5 in the mine. The mine is not a closed environment - people, forced air flow and massive flushing during high intensity rainfall provide intermittent contact between the surface and underground habitats, so the mine ecosystem probably is not a restricted one.
Ecogeochemistry of the subsurface food web at pH 0–2.5 in Iron Mountain, California, U.S.A.

USGS Publications Warehouse

Robbins, Eleanora I.; Rodgers , Teresa M.; Alpers, Charles N.; Nordstrom, D. Kirk

2000-01-01

Pyrite oxidation in the underground mining environment of Iron Mountain, California, has created the most acidic pH values ever reported in aquatic systems. Sulfate values as high as 120 000 mg l−1 and iron as high as 27 600 mg l−1 have been measured in the mine water, which also carries abundant other dissolved metals including Al, Zn, Cu, Cd, Mn, Sb and Pb. Extreme acidity and high metal concentrations apparently do not preclude the presence of an underground acidophilic food web, which has developed with bacterial biomass at the base and heliozoans as top predators. Slimes, oil-like films, flexible and inflexible stalactites, sediments, water and precipitates were found to have distinctive communities. A variety of filamentous and non-filamentous bacteria grew in slimes in water having pH values <1.0. Fungal hyphae colonize stalactites dripping pH 1.0 water; they may help to form these drip structures. Motile hypotrichous ciliates and bdelloid rotifers are particularly abundant in slimes having a pH of 1.5. Holdfasts of the iron bacterium Leptothrix discophora attach to biofilms covering pools of standing water having a pH of 2.5 in the mine. The mine is not a closed environment – people, forced air flow and massive flushing during high intensity rainfall provide intermittent contact between the surface and underground habitats, so the mine ecosystem probably is not a restricted one.
Mining large heterogeneous data sets in drug discovery.

PubMed

Wild, David J

2009-10-01

Increasingly, effective drug discovery involves the searching and data mining of large volumes of information from many sources covering the domains of chemistry, biology and pharmacology amongst others. This has led to a proliferation of databases and data sources relevant to drug discovery. This paper provides a review of the publicly-available large-scale databases relevant to drug discovery, describes the kinds of data mining approaches that can be applied to them and discusses recent work in integrative data mining that looks for associations that pan multiple sources, including the use of Semantic Web techniques. The future of mining large data sets for drug discovery requires intelligent, semantic aggregation of information from all of the data sources described in this review, along with the application of advanced methods such as intelligent agents and inference engines in client applications.
The Development of a Web-Based Attendance System with RFID for Higher Education Institution in Binus University

NASA Astrophysics Data System (ADS)

Kurniali, S.; Mayliana

2014-03-01

This study focuses on the development of a web-based attendance system with RFID in a Indonesian higher education institution. The development of this system is motivated due to the fact that the students' attendance records are one of the important elements that reflect their academic achievements. However, the current manual practice implemented is causing such a hassle. Empowering the usage of the new RFID based student card, a new web based-attendance system has been built to cater the recording and reporting of not just the student's' attendances, but also the lecturer's and taught topics in the class. The development of this system is inspired by the senior management. And the system can be easily accessed through the learning management system and can generate a report in real time, This paper will discuss in details the development until the maintaining phase of the system. Result achieved is the innovation of developing the system proved reliable to support related business processes and empowered the intention to maximize the usage of the RFID card. Considered as a successful implementation, this paper will give an input for others who want to implement a similar system.
Traffic-based feedback on the web.

PubMed

Aizen, Jonathan; Huttenlocher, Daniel; Kleinberg, Jon; Novak, Antal

2004-04-06

Usage data at a high-traffic web site can expose information about external events and surges in popularity that may not be accessible solely from analyses of content and link structure. We consider sites that are organized around a set of items available for purchase or download, consider, for example, an e-commerce site or collection of online research papers, and we study a simple indicator of collective user interest in an item, the batting average, defined as the fraction of visits to an item's description that result in an acquisition of that item. We develop a stochastic model for identifying points in time at which an item's batting average experiences significant change. In experiments with usage data from the Internet Archive, we find that such changes often occur in an abrupt, discrete fashion, and that these changes can be closely aligned with events such as the highlighting of an item on the site or the appearance of a link from an active external referrer. In this way, analyzing the dynamics of item popularity at an active web site can help characterize the impact of a range of events taking place both on and off the site.
Traffic-based feedback on the web

PubMed Central

Aizen, Jonathan; Huttenlocher, Daniel; Kleinberg, Jon; Novak, Antal

2004-01-01

Usage data at a high-traffic web site can expose information about external events and surges in popularity that may not be accessible solely from analyses of content and link structure. We consider sites that are organized around a set of items available for purchase or download, consider, for example, an e-commerce site or collection of online research papers, and we study a simple indicator of collective user interest in an item, the batting average, defined as the fraction of visits to an item's description that result in an acquisition of that item. We develop a stochastic model for identifying points in time at which an item's batting average experiences significant change. In experiments with usage data from the Internet Archive, we find that such changes often occur in an abrupt, discrete fashion, and that these changes can be closely aligned with events such as the highlighting of an item on the site or the appearance of a link from an active external referrer. In this way, analyzing the dynamics of item popularity at an active web site can help characterize the impact of a range of events taking place both on and off the site. PMID:14709676
Quantitative Analysis of the Usage of the COSMOS Science Education Portal

NASA Astrophysics Data System (ADS)

Sotiriou, Sofoklis; Bogner, Franz X.; Neofotistos, George

2011-08-01

A quantitative method of mapping the web usage of an innovative educational portal is applied to analyze the behaviour of users of the COSMOS Science Education Portal. The COSMOS Portal contains user-generated resources (that are uploaded by its users). It has been designed to support a science teacher's search, retrieval and access to both, scientific and educational resources. It also aims to introduce in and familiarize teachers with an innovative methodology for designing, expressing and representing educational practices in a commonly understandable way through the use of user-friendly authoring tools that are available through the portal. As a new science education portal that includes user-generated content, the COSMOS Portal encounters the well-known "new product/service challenge": to convince the users to use its tools, which facilitate quite fast lesson planning and lesson preparation activities. To respond to this challenge, the COSMOS Portal operators implemented a validation process by analyzing the usage data of the portal in a 10 month time-period. The data analyzed comprised: (a) the temporal evolution of the number of contributors and the amount of content uploaded to the COSMOS Portal; (b) the number of portal visitors (categorized as all-visitors, new-visitors, and returning-visitors) and (c) visitor loyalty parameters (such as page-views; pages/visit; average time on site; depth of visit; length of visit). The data is augmented with data associated with the usage context (e.g. the time of day when most of the activities in the portal take place). The quantitative results indicate that the exponential growth of the contributors to the COSMOS Portal is followed by an exponential growth of the uploaded content. Furthermore, the web usage statistics demonstrate significant changes in users' behaviour during the period under study, with returning visitors using the COSMOS Portal more frequently, mainly for lesson planning and preparation (in the afternoon hours). The findings demonstrate that the new COSMOS users follow the "law of surfing" behaviour, a common pattern of surfing behaviour in portals. However, users return to the COSMOS Portal: returning users comprise more than 50% of all COSMOS visits, stay longer on site and visit more pages. Returning visitors are benchmarked against the "law of surfing" and outperform it substantially. These quantitative results benchmark the web usage of a portal and provide its operators with maps of value-added patterns of the portal's offering to its users in the science education community.
Data-driven decision support for radiologists: re-using the National Lung Screening Trial dataset for pulmonary nodule management.

PubMed

Morrison, James J; Hostetter, Jason; Wang, Kenneth; Siegel, Eliot L

2015-02-01

Real-time mining of large research trial datasets enables development of case-based clinical decision support tools. Several applicable research datasets exist including the National Lung Screening Trial (NLST), a dataset unparalleled in size and scope for studying population-based lung cancer screening. Using these data, a clinical decision support tool was developed which matches patient demographics and lung nodule characteristics to a cohort of similar patients. The NLST dataset was converted into Structured Query Language (SQL) tables hosted on a web server, and a web-based JavaScript application was developed which performs real-time queries. JavaScript is used for both the server-side and client-side language, allowing for rapid development of a robust client interface and server-side data layer. Real-time data mining of user-specified patient cohorts achieved a rapid return of cohort cancer statistics and lung nodule distribution information. This system demonstrates the potential of individualized real-time data mining using large high-quality clinical trial datasets to drive evidence-based clinical decision-making.
Biomedical data mining in clinical routine: expanding the impact of hospital information systems.

PubMed

Müller, Marcel; Markó, Kornel; Daumke, Philipp; Paetzold, Jan; Roesner, Arnold; Klar, Rüdiger

2007-01-01

In this paper we want to describe how the promising technology of biomedical data mining can improve the use of hospital information systems: a large set of unstructured, narrative clinical data from a dermatological university hospital like discharge letters or other dermatological reports were processed through a morpho-semantic text retrieval engine ("MorphoSaurus") and integrated with other clinical data using a web-based interface and brought into daily clinical routine. The user evaluation showed a very high user acceptance - this system seems to meet the clinicians' requirements for a vertical data mining in the electronic patient records. What emerges is the need for integration of biomedical data mining into hospital information systems for clinical, scientific, educational and economic reasons.
Generic HTML Form Processor: A versatile PHP script to save web-collected data into a MySQL database.

PubMed

Göritz, Anja S; Birnbaum, Michael H

2005-11-01

The customizable PHP script Generic HTML Form Processor is intended to assist researchers and students in quickly setting up surveys and experiments that can be administered via the Web. This script relieves researchers from the burdens of writing new CGI scripts and building databases for each Web study. Generic HTML Form Processor processes any syntactically correct HTML forminput and saves it into a dynamically created open-source database. We describe five modes for usage of the script that allow increasing functionality but require increasing levels of knowledge of PHP and Web servers: The first two modes require no previous knowledge, and the fifth requires PHP programming expertise. Use of Generic HTML Form Processor is free for academic purposes, and its Web address is www.goeritz.net/brmic.
pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts.

PubMed

Rani, Jyoti; Shah, A B Rauf; Ramachandran, Srinivasan

2015-10-01

The PubMed literature database is a valuable source of information for scientific research. It is rich in biomedical literature with more than 24 million citations. Data-mining of voluminous literature is a challenging task. Although several text-mining algorithms have been developed in recent years with focus on data visualization, they have limitations such as speed, are rigid and are not available in the open source. We have developed an R package, pubmed.mineR, wherein we have combined the advantages of existing algorithms, overcome their limitations, and offer user flexibility and link with other packages in Bioconductor and the Comprehensive R Network (CRAN) in order to expand the user capabilities for executing multifaceted approaches. Three case studies are presented, namely, 'Evolving role of diabetes educators', 'Cancer risk assessment' and 'Dynamic concepts on disease and comorbidity' to illustrate the use of pubmed.mineR. The package generally runs fast with small elapsed times in regular workstations even on large corpus sizes and with compute intensive functions. The pubmed.mineR is available at http://cran.rproject. org/web/packages/pubmed.mineR.
Current state of web accessibility of Malaysian ministries websites

NASA Astrophysics Data System (ADS)

Ahmi, Aidi; Mohamad, Rosli

2016-08-01

Despite the fact that Malaysian public institutions have progressed considerably on website and portal usage, web accessibility has been reported as one of the issues deserves special attention. Consistent with the government moves to promote an effective use of web and portal, it is essential for the government institutions to ensure compliance with established standards and guidelines on web accessibility. This paper evaluates accessibility of 25 Malaysian ministries websites using automated tools i.e. WAVE and Achecker. Both tools are designed to objectively evaluate web accessibility in conformance with Web Content Accessibility Guidelines 2.0 (WCAG 2.0) and United States Rehabilitation Act 1973 (Section 508). The findings reported somewhat low compliance to web accessibility standard amongst the ministries. Further enhancement is needed in the aspect of input elements such as label and checkbox to be associated with text as well as image-related elements. This findings could be used as a mechanism for webmasters to locate and rectify errors pertaining to the web accessibility and to ensure equal access of the web information and services to all citizen.
Attitudes and awareness of web-based self-care resources in the military: a preliminary survey study.

PubMed

Luxton, David D; Armstrong, Christina M; Fantelli, Emily E; Thomas, Elissa K

2011-09-01

Web-based self-care resources have a number of potential benefits for military service members (SMs) and their families such as convenience, anonymity, and immediate 24/7 access to useful information. There is limited data available, however, regarding SM and military healthcare provider use of online self-care resources. Our goal with this study was to conduct a preliminary survey assessment of self-care Web site awareness, general attitudes about use, and usage behaviors of Web-based self-care resources among SMs and military healthcare providers. Results show that the majority of SMs and providers use the Internet often, use Internet self-care resources, and are willing to use additional Web-based resources and capabilities. SMs and providers also indicated a preference for Web-based self-care resources as adjunct tools to face-to-face/in-person care. Data from this preliminary study are useful for informing additional research and best practices for integrating Web-based self-care for the military community.
Segmentation of Natural Gas Customers in Industrial Sector Using Self-Organizing Map (SOM) Method

NASA Astrophysics Data System (ADS)

Masbar Rus, A. M.; Pramudita, R.; Surjandari, I.

2018-03-01

The usage of the natural gas which is non-renewable energy, needs to be more efficient. Therefore, customer segmentation becomes necessary to set up a marketing strategy to be right on target or to determine an appropriate fee. This research was conducted at PT PGN using one of data mining method, i.e. Self-Organizing Map (SOM). The clustering process is based on the characteristic of its customers as a reference to create the customer segmentation of natural gas customers. The input variables of this research are variable of area, type of customer, the industrial sector, the average usage, standard deviation of the usage, and the total deviation. As a result, 37 cluster and 9 segment from 838 customer data are formed. These 9 segments then employed to illustrate the general characteristic of the natural gas customer of PT PGN.
Identification of mine rescue equipment reduction gears technical condition

NASA Astrophysics Data System (ADS)

Gerike, B. L.; Klishin, V. I.; Kuzin, E. G.

2017-09-01

The article presents the reasons for adopting intelligent service of mine belt conveyer drives concerning evaluation of their technical condition based on the diagnostic techniques instead of regular preventative maintenance. The article reveals the diagnostic results of belt conveyer drive reduction gears condition taking into account the parameters of lubricating oil, vibration and temperature. Usage of a complex approach to evaluate technical conditions allows reliability of the forecast to be improved, which makes it possible not only to prevent accidental breakdowns and eliminate unscheduled downtime, but also to bring sufficient economic benefits through reduction of the term and scope of work during overhauls.
dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond.

PubMed

Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez; Jeran, Luka; Blenkus, Domen; Shaulsky, Gad; Zupan, Blaz

2017-06-02

Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discovering novel insights. We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase-a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. To encourage more wide usage, the backend is open-source, available for extension and further development by bioinformaticians and data scientists.
A Comparison of Educational Statistics and Data Mining Approaches to Identify Characteristics That Impact Online Learning

ERIC Educational Resources Information Center

Miller, L. Dee; Soh, Leen-Kiat; Samal, Ashok; Kupzyk, Kevin; Nugent, Gwen

2015-01-01

Learning objects (LOs) are important online resources for both learners and instructors and usage for LOs is growing. Automatic LO tracking collects large amounts of metadata about individual students as well as data aggregated across courses, learning objects, and other demographic characteristics (e.g. gender). The challenge becomes identifying…
30 CFR 75.1107-6 - Capacity of fire suppression devices; location and direction of nozzles.

Code of Federal Regulations, 2010 CFR

2010-07-01

... withstand rough usage and vibration when installed on mining equipment. (b) The extinguishant-discharge... electrical cables on the equipment which are subject to flexing or to external damage; and (2) All hydraulic components on the equipment which are exposed directly to or located in the immediate vicinity of electrical...
Accessing Electronic Journals.

ERIC Educational Resources Information Center

McKay, Sharon Cline

1999-01-01

Discusses issues librarians need to consider when providing access to electronic journals. Topics include gateways; index and abstract services; validation and pay-per-view; title selection; integration with OPACs (online public access catalogs)or Web sites; paper availability; ownership versus access; usage restrictions; and services offered…
Resource Management Scheme Based on Ubiquitous Data Analysis

PubMed Central

Lee, Heung Ki; Jung, Jaehee

2014-01-01

Resource management of the main memory and process handler is critical to enhancing the system performance of a web server. Owing to the transaction delay time that affects incoming requests from web clients, web server systems utilize several web processes to anticipate future requests. This procedure is able to decrease the web generation time because there are enough processes to handle the incoming requests from web browsers. However, inefficient process management results in low service quality for the web server system. Proper pregenerated process mechanisms are required for dealing with the clients' requests. Unfortunately, it is difficult to predict how many requests a web server system is going to receive. If a web server system builds too many web processes, it wastes a considerable amount of memory space, and thus performance is reduced. We propose an adaptive web process manager scheme based on the analysis of web log mining. In the proposed scheme, the number of web processes is controlled through prediction of incoming requests, and accordingly, the web process management scheme consumes the least possible web transaction resources. In experiments, real web trace data were used to prove the improved performance of the proposed scheme. PMID:25197692

Participatory visualization with Wordle.

PubMed

Viégas, Fernanda B; Wattenberg, Martin; Feinberg, Jonathan

2009-01-01

We discuss the design and usage of "Wordle," a web-based tool for visualizing text. Wordle creates tag-cloud-like displays that give careful attention to typography, color, and composition. We describe the algorithms used to balance various aesthetic criteria and create the distinctive Wordle layouts. We then present the results of a study of Wordle usage, based both on spontaneous behaviour observed in the wild, and on a large-scale survey of Wordle users. The results suggest that Wordles have become a kind of medium of expression, and that a "participatory culture" has arisen around them.
A systematic review of studies measuring and reporting hearing aid usage in older adults since 1999: a descriptive summary of measurement tools.

PubMed

Perez, Elvira; Edmonds, Barrie A

2012-01-01

A systematic review was conducted to identify and quality assess how studies published since 1999 have measured and reported the usage of hearing aids in older adults. The relationship between usage and other dimensions of hearing aid outcome, age and hearing loss are summarised. Articles were identified through systematic searches in PubMed/MEDLINE, The University of Nottingham Online Catalogue, Web of Science and through reference checking. (1) participants aged fifty years or over with sensori-neural hearing loss, (2) provision of an air conduction hearing aid, (3) inclusion of hearing aid usage measure(s) and (4) published between 1999 and 2011. Of the initial 1933 papers obtained from the searches, a total of 64 were found eligible for review and were quality assessed on six dimensions: study design, choice of outcome instruments, level of reporting (usage, age, and audiometry) and cross validation of usage measures. Five papers were rated as being of high quality (scoring 10-12), 35 papers were rated as being of moderate quality (scoring 7-9), 22 as low quality (scoring 4-6) and two as very low quality (scoring 0-2). Fifteen different methods were identified for assessing the usage of hearing aids. Generally, the usage data reviewed was not well specified. There was a lack of consistency and robustness in the way that usage of hearing aids was assessed and categorised. There is a need for more standardised level of reporting of hearing aid usage data to further understand the relationship between usage and hearing aid outcomes.
Trends in the Evolution of the Public Web, 1998-2002; The Fedora Project: An Open-source Digital Object Repository Management System; State of the Dublin Core Metadata Initiative, April 2003; Preservation Metadata; How Many People Search the ERIC Database Each Day?

ERIC Educational Resources Information Center

O'Neill, Edward T.; Lavoie, Brian F.; Bennett, Rick; Staples, Thornton; Wayland, Ross; Payette, Sandra; Dekkers, Makx; Weibel, Stuart; Searle, Sam; Thompson, Dave; Rudner, Lawrence M.

2003-01-01

Includes five articles that examine key trends in the development of the public Web: size and growth, internationalization, and metadata usage; Flexible Extensible Digital Object and Repository Architecture (Fedora) for use in digital libraries; developments in the Dublin Core Metadata Initiative (DCMI); the National Library of New Zealand Te Puna…
Multiple-Feature Extracting Modules Based Leak Mining System Design

PubMed Central

Cho, Ying-Chiang; Pan, Jen-Yi

2013-01-01

Over the years, human dependence on the Internet has increased dramatically. A large amount of information is placed on the Internet and retrieved from it daily, which makes web security in terms of online information a major concern. In recent years, the most problematic issues in web security have been e-mail address leakage and SQL injection attacks. There are many possible causes of information leakage, such as inadequate precautions during the programming process, which lead to the leakage of e-mail addresses entered online or insufficient protection of database information, a loophole that enables malicious users to steal online content. In this paper, we implement a crawler mining system that is equipped with SQL injection vulnerability detection, by means of an algorithm developed for the web crawler. In addition, we analyze portal sites of the governments of various countries or regions in order to investigate the information leaking status of each site. Subsequently, we analyze the database structure and content of each site, using the data collected. Thus, we make use of practical verification in order to focus on information security and privacy through black-box testing. PMID:24453892
Multiple-feature extracting modules based leak mining system design.

PubMed

Cho, Ying-Chiang; Pan, Jen-Yi

2013-01-01

Over the years, human dependence on the Internet has increased dramatically. A large amount of information is placed on the Internet and retrieved from it daily, which makes web security in terms of online information a major concern. In recent years, the most problematic issues in web security have been e-mail address leakage and SQL injection attacks. There are many possible causes of information leakage, such as inadequate precautions during the programming process, which lead to the leakage of e-mail addresses entered online or insufficient protection of database information, a loophole that enables malicious users to steal online content. In this paper, we implement a crawler mining system that is equipped with SQL injection vulnerability detection, by means of an algorithm developed for the web crawler. In addition, we analyze portal sites of the governments of various countries or regions in order to investigate the information leaking status of each site. Subsequently, we analyze the database structure and content of each site, using the data collected. Thus, we make use of practical verification in order to focus on information security and privacy through black-box testing.
SeMPI: a genome-based secondary metabolite prediction and identification web server.

PubMed

Zierep, Paul F; Padilla, Natàlia; Yonchev, Dimitar G; Telukunta, Kiran K; Klementz, Dennis; Günther, Stefan

2017-07-03

The secondary metabolism of bacteria, fungi and plants yields a vast number of bioactive substances. The constantly increasing amount of published genomic data provides the opportunity for an efficient identification of gene clusters by genome mining. Conversely, for many natural products with resolved structures, the encoding gene clusters have not been identified yet. Even though genome mining tools have become significantly more efficient in the identification of biosynthetic gene clusters, structural elucidation of the actual secondary metabolite is still challenging, especially due to as yet unpredictable post-modifications. Here, we introduce SeMPI, a web server providing a prediction and identification pipeline for natural products synthesized by polyketide synthases of type I modular. In order to limit the possible structures of PKS products and to include putative tailoring reactions, a structural comparison with annotated natural products was introduced. Furthermore, a benchmark was designed based on 40 gene clusters with annotated PKS products. The web server of the pipeline (SeMPI) is freely available at: http://www.pharmaceutical-bioinformatics.de/sempi. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Diamond Eye: a distributed architecture for image data mining

NASA Astrophysics Data System (ADS)

Burl, Michael C.; Fowlkes, Charless; Roden, Joe; Stechert, Andre; Mukhtar, Saleem

1999-02-01

Diamond Eye is a distributed software architecture, which enables users (scientists) to analyze large image collections by interacting with one or more custom data mining servers via a Java applet interface. Each server is coupled with an object-oriented database and a computational engine, such as a network of high-performance workstations. The database provides persistent storage and supports querying of the 'mined' information. The computational engine provides parallel execution of expensive image processing, object recognition, and query-by-content operations. Key benefits of the Diamond Eye architecture are: (1) the design promotes trial evaluation of advanced data mining and machine learning techniques by potential new users (all that is required is to point a web browser to the appropriate URL), (2) software infrastructure that is common across a range of science mining applications is factored out and reused, and (3) the system facilitates closer collaborations between algorithm developers and domain experts.
A Bayesian additive model for understanding public transport usage in special events.

PubMed

Rodrigues, Filipe; Borysov, Stanislav; Ribeiro, Bernardete; Pereira, Francisco

2016-12-02

Public special events, like sports games, concerts and festivals are well known to create disruptions in transportation systems, often catching the operators by surprise. Although these are usually planned well in advance, their impact is difficult to predict, even when organisers and transportation operators coordinate. The problem highly increases when several events happen concurrently. To solve these problems, costly processes, heavily reliant on manual search and personal experience, are usual practice in large cities like Singapore, London or Tokyo. This paper presents a Bayesian additive model with Gaussian process components that combines smart card records from public transport with context information about events that is continuously mined from the Web. We develop an efficient approximate inference algorithm using expectation propagation, which allows us to predict the total number of public transportation trips to the special event areas, thereby contributing to a more adaptive transportation system. Furthermore, for multiple concurrent event scenarios, the proposed algorithm is able to disaggregate gross trip counts into their most likely components related to specific events and routine behavior. Using real data from Singapore, we show that the presented model outperforms the best baseline model by up to 26% in R2 and also has explanatory power for its individual components.
Public Health and Epidemiology Informatics

PubMed Central

Bar-Hen, A.; Paragios, N.

2016-01-01

Summary Objectives The aim of this manuscript is to provide a brief overview of the scientific challenges that should be addressed in order to unlock the full potential of using data from a general point of view, as well as to present some ideas that could help answer specific needs for data understanding in the field of health sciences and epidemiology. Methods A survey of uses and challenges of big data analyses for medicine and public health was conducted. The first part of the paper focuses on big data techniques, algorithms, and statistical approaches to identify patterns in data. The second part describes some cutting-edge applications of analyses and predictive modeling in public health. Results In recent years, we witnessed a revolution regarding the nature, collection, and availability of data in general. This was especially striking in the health sector and particularly in the field of epidemiology. Data derives from a large variety of sources, e.g. clinical settings, billing claims, care scheduling, drug usage, web based search queries, and Tweets. Conclusion The exploitation of the information (data mining, artificial intelligence) relevant to these data has become one of the most promising as well challenging tasks from societal and scientific viewpoints in order to leverage the information available and making public health more efficient. PMID:27830257
Smartphone dependence classification using tensor factorization

PubMed Central

Kim, Yejin; Yook, In Hye; Yu, Hwanjo; Kim, Dai-Jin

2017-01-01

Excessive smartphone use causes personal and social problems. To address this issue, we sought to derive usage patterns that were directly correlated with smartphone dependence based on usage data. This study attempted to classify smartphone dependence using a data-driven prediction algorithm. We developed a mobile application to collect smartphone usage data. A total of 41,683 logs of 48 smartphone users were collected from March 8, 2015, to January 8, 2016. The participants were classified into the control group (SUC) or the addiction group (SUD) using the Korean Smartphone Addiction Proneness Scale for Adults (S-Scale) and a face-to-face offline interview by a psychiatrist and a clinical psychologist (SUC = 23 and SUD = 25). We derived usage patterns using tensor factorization and found the following six optimal usage patterns: 1) social networking services (SNS) during daytime, 2) web surfing, 3) SNS at night, 4) mobile shopping, 5) entertainment, and 6) gaming at night. The membership vectors of the six patterns obtained a significantly better prediction performance than the raw data. For all patterns, the usage times of the SUD were much longer than those of the SUC. From our findings, we concluded that usage patterns and membership vectors were effective tools to assess and predict smartphone dependence and could provide an intervention guideline to predict and treat smartphone dependence based on usage data. PMID:28636614
Calypso: a user-friendly web-server for mining and visualizing microbiome-environment interactions.

PubMed

Zakrzewski, Martha; Proietti, Carla; Ellis, Jonathan J; Hasan, Shihab; Brion, Marie-Jo; Berger, Bernard; Krause, Lutz

2017-03-01

Calypso is an easy-to-use online software suite that allows non-expert users to mine, interpret and compare taxonomic information from metagenomic or 16S rDNA datasets. Calypso has a focus on multivariate statistical approaches that can identify complex environment-microbiome associations. The software enables quantitative visualizations, statistical testing, multivariate analysis, supervised learning, factor analysis, multivariable regression, network analysis and diversity estimates. Comprehensive help pages, tutorials and videos are provided via a wiki page. The web-interface is accessible via http://cgenome.net/calypso/ . The software is programmed in Java, PERL and R and the source code is available from Zenodo ( https://zenodo.org/record/50931 ). The software is freely available for non-commercial users. l.krause@uq.edu.au. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Impacts of gold mine waste disposal on a tropical pelagic ecosystem.

PubMed

Brewer, D T; Morello, E B; Griffiths, S; Fry, G; Heales, D; Apte, S C; Venables, W N; Rothlisberg, P C; Moeseneder, C; Lansdell, M; Pendrey, R; Coman, F; Strzelecki, J; Jarolimek, C V; Jung, R F; Richardson, A J

2012-12-01

We used a comparative approach to investigate the impact of the disposal of gold mine tailings into the ocean near the Lihir mine (Niolam Island, Papua New Guinea). We found abundance and diversity of zooplankton, micronekton and pelagic fish to be similar or higher in the mine region compared to the reference site. We also found relatively high trace metal concentrations in lower trophic level groups, especially zooplankton, near the mine discharge, but few differences in tissue concentrations of micronekton, baitfish and pelagic fish between the two regions. Biomagnification of some trace metals by micronekton, and of mercury by fish was evident in both regions. We conclude that ocean mine waste disposal at Niolam Island has a local impact on the smaller and less mobile pelagic communities in terms of trace metal concentrations, but has little effect on the abundance and biodiversity of the local food web. Crown Copyright © 2012. Published by Elsevier Ltd. All rights reserved.
The Potential of Text Mining in Data Integration and Network Biology for Plant Research: A Case Study on Arabidopsis[C][W

PubMed Central

Van Landeghem, Sofie; De Bodt, Stefanie; Drebert, Zuzanna J.; Inzé, Dirk; Van de Peer, Yves

2013-01-01

Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein–protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies. PMID:23532071
TRFolder-W: a web server for telomerase RNA structure prediction in yeast genomes.

PubMed

Zhang, Dong; Xue, Xingran; Malmberg, Russell L; Cai, Liming

2012-10-15

TRFolder-W is a web server capable of predicting core structures of telomerase RNA (TR) in yeast genomes. TRFolder is a command-line Python toolkit for TR-specific structure prediction. We developed a web-version built on the django web framework, leveraging the work done previously, to include enhancements to increase flexibility of usage. To date, there are five core sub-structures commonly found in TR of fungal species, which are the template region, downstream pseudoknot, boundary element, core-closing stem and triple helix. The aim of TRFolder-W is to use the five core structures as fundamental units to predict potential TR genes for yeast, and to provide a user-friendly interface. Moreover, the application of TRFolder-W can be extended to predict the characteristic structure on species other than fungal species. The web server TRFolder-W is available at http://rna-informatics.uga.edu/?f=software&p=TRFolder-w.
Human dynamics revealed through Web analytics

NASA Astrophysics Data System (ADS)

Gonçalves, Bruno; Ramasco, José J.

2008-08-01

The increasing ubiquity of Internet access and the frequency with which people interact with it raise the possibility of using the Web to better observe, understand, and monitor several aspects of human social behavior. Web sites with large numbers of frequently returning users are ideal for this task. If these sites belong to companies or universities, their usage patterns can furnish information about the working habits of entire populations. In this work, we analyze the properly anonymized logs detailing the access history to Emory University’s Web site. Emory is a medium-sized university located in Atlanta, Georgia. We find interesting structure in the activity patterns of the domain and study in a systematic way the main forces behind the dynamics of the traffic. In particular, we find that linear preferential linking, priority-based queuing, and the decay of interest for the contents of the pages are the essential ingredients to understand the way users navigate the Web.
A study on PubMed search tag usage pattern: association rule mining of a full-day PubMed query log.

PubMed

Mosa, Abu Saleh Mohammad; Yoo, Illhoi

2013-01-09

The practice of evidence-based medicine requires efficient biomedical literature search such as PubMed/MEDLINE. Retrieval performance relies highly on the efficient use of search field tags. The purpose of this study was to analyze PubMed log data in order to understand the usage pattern of search tags by the end user in PubMed/MEDLINE search. A PubMed query log file was obtained from the National Library of Medicine containing anonymous user identification, timestamp, and query text. Inconsistent records were removed from the dataset and the search tags were extracted from the query texts. A total of 2,917,159 queries were selected for this study issued by a total of 613,061 users. The analysis of frequent co-occurrences and usage patterns of the search tags was conducted using an association mining algorithm. The percentage of search tag usage was low (11.38% of the total queries) and only 2.95% of queries contained two or more tags. Three out of four users used no search tag and about two-third of them issued less than four queries. Among the queries containing at least one tagged search term, the average number of search tags was almost half of the number of total search terms. Navigational search tags are more frequently used than informational search tags. While no strong association was observed between informational and navigational tags, six (out of 19) informational tags and six (out of 29) navigational tags showed strong associations in PubMed searches. The low percentage of search tag usage implies that PubMed/MEDLINE users do not utilize the features of PubMed/MEDLINE widely or they are not aware of such features or solely depend on the high recall focused query translation by the PubMed's Automatic Term Mapping. The users need further education and interactive search application for effective use of the search tags in order to fulfill their biomedical information needs from PubMed/MEDLINE.
A Study on Pubmed Search Tag Usage Pattern: Association Rule Mining of a Full-day Pubmed Query Log

PubMed Central

2013-01-01

Background The practice of evidence-based medicine requires efficient biomedical literature search such as PubMed/MEDLINE. Retrieval performance relies highly on the efficient use of search field tags. The purpose of this study was to analyze PubMed log data in order to understand the usage pattern of search tags by the end user in PubMed/MEDLINE search. Methods A PubMed query log file was obtained from the National Library of Medicine containing anonymous user identification, timestamp, and query text. Inconsistent records were removed from the dataset and the search tags were extracted from the query texts. A total of 2,917,159 queries were selected for this study issued by a total of 613,061 users. The analysis of frequent co-occurrences and usage patterns of the search tags was conducted using an association mining algorithm. Results The percentage of search tag usage was low (11.38% of the total queries) and only 2.95% of queries contained two or more tags. Three out of four users used no search tag and about two-third of them issued less than four queries. Among the queries containing at least one tagged search term, the average number of search tags was almost half of the number of total search terms. Navigational search tags are more frequently used than informational search tags. While no strong association was observed between informational and navigational tags, six (out of 19) informational tags and six (out of 29) navigational tags showed strong associations in PubMed searches. Conclusions The low percentage of search tag usage implies that PubMed/MEDLINE users do not utilize the features of PubMed/MEDLINE widely or they are not aware of such features or solely depend on the high recall focused query translation by the PubMed’s Automatic Term Mapping. The users need further education and interactive search application for effective use of the search tags in order to fulfill their biomedical information needs from PubMed/MEDLINE. PMID:23302604
Facilitating Decision Making, Re-Use and Collaboration: A Knowledge Management Approach to Acquisition Program Self-Awareness

DTIC Science & Technology

2009-06-01

capabilities: web-based, relational/multi-dimensional, client/server, and metadata (data about data) inclusion (pp. 39-40). Text mining, on the other...and Organizational Systems ( CASOS ) (Carley, 2005). Although AutoMap can be used to conduct text-mining, it was utilized only for its visualization...provides insight into how the GMCOI is using the terms, and where there might be redundant terms and need for de -confliction and standardization
FlyMine: an integrated database for Drosophila and Anopheles genomics

PubMed Central

Lyne, Rachel; Smith, Richard; Rutherford, Kim; Wakeling, Matthew; Varley, Andrew; Guillier, Francois; Janssens, Hilde; Ji, Wenyan; Mclaren, Peter; North, Philip; Rana, Debashis; Riley, Tom; Sullivan, Julie; Watkins, Xavier; Woodbridge, Mark; Lilley, Kathryn; Russell, Steve; Ashburner, Michael; Mizuguchi, Kenji; Micklem, Gos

2007-01-01

FlyMine is a data warehouse that addresses one of the important challenges of modern biology: how to integrate and make use of the diversity and volume of current biological data. Its main focus is genomic and proteomics data for Drosophila and other insects. It provides web access to integrated data at a number of different levels, from simple browsing to construction of complex queries, which can be executed on either single items or lists. PMID:17615057
LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.

PubMed

Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin

2017-07-03

A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Attachment Style and Internet Addiction: An Online Survey.

PubMed

Eichenberg, Christiane; Schott, Markus; Decker, Oliver; Sindelar, Brigitte

2017-05-17

One of the clinically relevant problems of Internet use is the phenomenon of Internet addiction. Considering the fact that there is ample evidence for the relationship between attachment style and substance abuse, it stands to reason that attachment theory can also make an important contribution to the understanding of the pathogenesis of Internet addiction. The aim of this study was to examine people's tendency toward pathological Internet usage in relation to their attachment style. An online survey was conducted. Sociodemographic data, attachment style (Bielefeld questionnaire partnership expectations), symptoms of Internet addiction (scale for online addiction for adults), used Web-based services, and online relationship motives (Cyber Relationship Motive Scale, CRMS-D) were assessed. In order to confirm the findings, a study using the Rorschach test was also conducted. In total, 245 subjects were recruited. Participants with insecure attachment style showed a higher tendency to pathological Internet usage compared with securely attached participants. An ambivalent attachment style was particularly associated with pathological Internet usage. Escapist and social-compensatory motives played an important role for insecurely attached subjects. However, there were no significant effects with respect to Web-based services and apps used. Results of the analysis of the Rorschach protocol with 16 subjects corroborated these results. Users with pathological Internet use frequently showed signs of infantile relationship structures in the context of social groups. This refers to the results of the Web-based survey, in which interpersonal relationships were the result of an insecure attachment style. Pathological Internet use was a function of insecure attachment and limited interpersonal relationships. ©Christiane Eichenberg, Markus Schott, Oliver Decker, Brigitte Sindelar. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 17.05.2017.
A pilot study investigating the feasibility of symptom assessment manager (SAM), a Web-based real-time tool for monitoring challenging behaviors.

PubMed

Loi, Samantha M; Wanasinghage, Sangeeth; Goh, Anita; Lautenschlager, Nicola T; Darby, David G; Velakoulis, Dennis

2018-04-01

Improving and minimizing challenging behaviors seen in psychiatric conditions, including behavioral and psychological symptoms of dementia are important in the care of people with these conditions. Yet there is a lack of systematic evaluation of these as a part of routine clinical care. The Neuropsychiatric Inventory is a validated and reliable tool for rating the severity and disruptiveness of challenging behaviors. We report on the evaluation of a Web-based symptom assessment manager (SAM), designed to address the limitation of previous tools using some of the Neuropsychiatric Inventory functions, to monitor behaviors by staff caring for people with dementia and other psychiatric conditions in inpatient and residential care settings. The SAM was piloted in an 8-bed inpatient neuropsychiatry unit over 5 months. Eleven nurses and 4 clinicians were trained in usage of SAM. Primary outcomes were usage of SAM and perceived usability, utility, and acceptance of SAM. Secondary outcomes were the frequencies of documented behavior. Usage data were analyzed using chi-square and logistic regression analyses. The SAM was used for all admitted patients regardless of diagnosis, with a usage rate of 64% for nurses regularly employed in the unit. Staff provided positive feedback regarding the utility of SAM. The SAM appeared to offer individualized behavior assessment by providing a quick, structured, and standardized platform for assessing behavior in a real-world setting. Further research would involve trialing SAM with more staff in alternative settings such as in home or residential care settings. Copyright © 2017 John Wiley & Sons, Ltd.
Development of management information system for land in mine area based on MapInfo

NASA Astrophysics Data System (ADS)

Wang, Shi-Dong; Liu, Chuang-Hua; Wang, Xin-Chuang; Pan, Yan-Yu

2008-10-01

MapInfo is current a popular GIS software. This paper introduces characters of MapInfo and GIS second development methods offered by MapInfo, which include three ones based on MapBasic, OLE automation, and MapX control usage respectively. Taking development of land management information system in mine area for example, in the paper, the method of developing GIS applications based on MapX has been discussed, as well as development of land management information system in mine area has been introduced in detail, including development environment, overall design, design and realization of every function module, and simple application of system, etc. The system uses MapX 5.0 and Visual Basic 6.0 as development platform, takes SQL Server 2005 as back-end database, and adopts Matlab 6.5 to calculate number in back-end. On the basis of integrated design, the system develops eight modules including start-up, layer control, spatial query, spatial analysis, data editing, application model, document management, results output. The system can be used in mine area for cadastral management, land use structure optimization, land reclamation, land evaluation, analysis and forecasting for land in mine area and environmental disruption, thematic mapping, and so on.
Study on Personalized Recommendation Model of Internet Advertisement

NASA Astrophysics Data System (ADS)

Zhou, Ning; Chen, Yongyue; Zhang, Huiping

With the rapid development of E-Commerce, the audiences put forward higher requirements on personalized Internet advertisement than before. The main function of Personalized Advertising System is to provide the most suitable advertisements for anonymous users on Web sites. The paper offers a personalized Internet advertisement recommendation model. By mining the audiences' historical and current behavior, and the advertisers' and publisher's web site content, etc, the system can recommend appropriate advertisements to corresponding audiences.
Data Mining Meets HCI: Making Sense of Large Graphs

DTIC Science & Technology

2012-07-01

graph algo- rithms, won the Open Source Software World Challenge, Silver Award. We have released Pegasus as free , open-source software, downloaded by...METIS [77], spectral clustering [108], and the parameter- free “Cross-associations” (CA) [26]. Belief Propagation can also be used for clus- tering, as...number of tools have been developed to support “ landscape ” views of information. These include WebBook and Web- Forager [23], which use a book metaphor
Fast access to the CMS detector condition data employing HTML5 technologies

NASA Astrophysics Data System (ADS)

Pierro, Giuseppe Antonio; Cavallari, Francesca; Di Guida, Salvatore; Innocente, Vincenzo

2011-12-01

This paper focuses on using HTML version 5 (HTML5) for accessing condition data for the CMS experiment, evaluating the benefits and risks posed by the use of this technology. According to the authors of HTML5, this technology attempts to solve issues found in previous iterations of HTML and addresses the needs of web applications, an area previously not adequately covered by HTML. We demonstrate that employing HTML5 brings important benefits in terms of access performance to the CMS condition data. The combined use of web storage and web sockets allows increasing the performance and reducing the costs in term of computation power, memory usage and network bandwidth for client and server. Above all, the web workers allow creating different scripts that can be executed using multi-thread mode, exploiting multi-core microprocessors. Web workers have been employed in order to substantially decrease the web page rendering time to display the condition data stored in the CMS condition database.
77 FR 48007 - Administrative Simplification: Adoption of Operating Rules for Health Care Electronic Funds...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-08-10

... care transactions results in higher use of EDI by health care providers.\\8\\ We expect usage of EFT and..., surveys, and straw polls, and shared updates on the CAQH CORE and NACHA Web sites. On August 1, 2011 CAQH...
Metadata: Pure and Simple, or Is It?

ERIC Educational Resources Information Center

Chalmers, Marilyn

2002-01-01

Discusses issues concerning metadata in Web pages based on experiences in a vocational education center library in Queensland (Australia). Highlights include Dublin Core elements; search engines; controlled vocabulary; performance measurement to assess usage patterns and provide quality control over the vocabulary; and considerations given the…
Digital roadway interactive visualization and evaluation network applications to WSDOT operational data usage.

DOT National Transportation Integrated Search

2016-12-01

DRIVE Net is a region-wide, Web-based transportation decision support system that adopts digital roadway maps as : the base, and provides data layers for integrating and analyzing a variety of data sources (e.g., traffic sensors, incident : records)....
Test of the technology acceptance model for a Web-based information system in a Hong Kong Chinese sample.

PubMed

Cheung, Emily Yee Man; Sachs, John

2006-12-01

The modified technology acceptance model was used to predict actual Blackboard usage (a web-based information system) in a sample of 57 Hong Kong student teachers whose mean age was 27.8 yr. (SD = 6.9). While the general form of the model was supported, Application-specific Self-efficacy was a more powerful predictor of system use than Behavioural Intention as predicted by the theory of reasoned action. Thus in this cultural and educational context, it has been shown that the model does not fully mediate the effect of Self-efficacy on System Use. Also, users' Enjoyment exerted considerable influence on the component variables of Usefulness and Ease of Use and on Application-specific Self-efficacy, thus indirectly influencing system usage. Consequently, efforts to gain students' acceptance and, therefore, use of information systems such as Blackboard must pay adequate attention to users' Self-efficacy and motivational variables such as Enjoyment.
PC-based web authoring: How to learn as little unix as possible while getting on the Web

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gennari, L.T.; Breaux, M.; Minton, S.

1996-09-01

This document is a general guide for creating Web pages, using commonly available word processing and file transfer applications. It is not a full guide to HTML, nor does it provide an introduction to the many WYSIWYG HTML editors available. The viability of the authoring method it describes will not be affected by changes in the HTML specification or the rapid release-and-obsolescence cycles of commercial WYSIWYG HTML editors. This document provides a gentle introduction to HTML for the beginner, and as the user gains confidence and experience, encourages greater familiarity with HTML through continued exposure to and hands-on usage ofmore » HTML code.« less
Mac-based Web authoring: How to learn as little Unix as possible while getting on the Web.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gennari, L.T.

1996-06-01

This document is a general guide for creating Web pages, using commonly available word processing and file transfer applications. It is not a full guide to HTML, nor does it provide an introduction to the many WYSIWYG HTML editors available. The viability of the authoring method it describes will not be affected by changes in the HTML specification or the rapid release-and-obsolescence cycles of commercial WYSIWYG HTML editors. This document provides a gentle introduction to HTML for the beginner and as the user gains confidence and experience, encourages greater familiarity with HTML through continued exposure to and hands-on usage ofmore » HTML code.« less
mORCA: ubiquitous access to life science web services.

PubMed

Diaz-Del-Pino, Sergio; Trelles, Oswaldo; Falgueras, Juan

2018-01-16

Technical advances in mobile devices such as smartphones and tablets have produced an extraordinary increase in their use around the world and have become part of our daily lives. The possibility of carrying these devices in a pocket, particularly mobile phones, has enabled ubiquitous access to Internet resources. Furthermore, in the life sciences world there has been a vast proliferation of data types and services that finish as Web Services. This suggests the need for research into mobile clients to deal with life sciences applications for effective usage and exploitation. Analysing the current features in existing bioinformatics applications managing Web Services, we have devised, implemented, and deployed an easy-to-use web-based lightweight mobile client. This client is able to browse, select, compose parameters, invoke, and monitor the execution of Web Services stored in catalogues or central repositories. The client is also able to deal with huge amounts of data between external storage mounts. In addition, we also present a validation use case, which illustrates the usage of the application while executing, monitoring, and exploring the results of a registered workflow. The software its available in the Apple Store and Android Market and the source code is publicly available in Github. Mobile devices are becoming increasingly important in the scientific world due to their strong potential impact on scientific applications. Bioinformatics should not fall behind this trend. We present an original software client that deals with the intrinsic limitations of such devices and propose different guidelines to provide location-independent access to computational resources in bioinformatics and biomedicine. Its modular design makes it easily expandable with the inclusion of new repositories, tools, types of visualization, etc.
A Systematic Review of Studies Measuring and Reporting Hearing Aid Usage in Older Adults since 1999: A Descriptive Summary of Measurement Tools

PubMed Central

Perez, Elvira; Edmonds, Barrie A.

2012-01-01

Objective A systematic review was conducted to identify and quality assess how studies published since 1999 have measured and reported the usage of hearing aids in older adults. The relationship between usage and other dimensions of hearing aid outcome, age and hearing loss are summarised. Data sources Articles were identified through systematic searches in PubMed/MEDLINE, The University of Nottingham Online Catalogue, Web of Science and through reference checking. Study eligibility criteria: (1) participants aged fifty years or over with sensori-neural hearing loss, (2) provision of an air conduction hearing aid, (3) inclusion of hearing aid usage measure(s) and (4) published between 1999 and 2011. Results Of the initial 1933 papers obtained from the searches, a total of 64 were found eligible for review and were quality assessed on six dimensions: study design, choice of outcome instruments, level of reporting (usage, age, and audiometry) and cross validation of usage measures. Five papers were rated as being of high quality (scoring 10–12), 35 papers were rated as being of moderate quality (scoring 7–9), 22 as low quality (scoring 4–6) and two as very low quality (scoring 0–2). Fifteen different methods were identified for assessing the usage of hearing aids. Conclusions Generally, the usage data reviewed was not well specified. There was a lack of consistency and robustness in the way that usage of hearing aids was assessed and categorised. There is a need for more standardised level of reporting of hearing aid usage data to further understand the relationship between usage and hearing aid outcomes. PMID:22479312
The development and preliminary testing of a multimedia patient-provider survivorship communication module for breast cancer survivors.

PubMed

Wen, Kuang-Yi; Miller, Suzanne M; Stanton, Annette L; Fleisher, Linda; Morra, Marion E; Jorge, Alexandra; Diefenbach, Michael A; Ropka, Mary E; Marcus, Alfred C

2012-08-01

This paper describes the development of a theory-guided and evidence-based multimedia training module to facilitate breast cancer survivors' preparedness for effective communication with their health care providers after active treatment. The iterative developmental process used included: (1) theory and evidence-based content development and vetting; (2) user testing; (3) usability testing; and (4) participant module utilization. Formative evaluation of the training module prototype occurred through user testing (n = 12), resulting in modification of the content and layout. Usability testing (n = 10) was employed to improve module functionality. Preliminary web usage data (n = 256, mean age = 53, 94.5% White, 75% college graduate and above) showed that 59% of the participants accessed the communication module, for an average of 7 min per login. The iterative developmental process was informative in enhancing the relevance of the communication module. Preliminary web usage results demonstrate the potential feasibility of such a program. Our study demonstrates survivors' openness to the use of a web-based communication skills training module and outlines a systematic iterative user and interface program development and testing process, which can serve as a prototype for others considering such an approach. Copyright © 2012. Published by Elsevier Ireland Ltd.
BRepertoire: a user-friendly web server for analysing antibody repertoire data.

PubMed

Margreitter, Christian; Lu, Hui-Chun; Townsend, Catherine; Stewart, Alexander; Dunn-Walters, Deborah K; Fraternali, Franca

2018-04-14

Antibody repertoire analysis by high throughput sequencing is now widely used, but a persisting challenge is enabling immunologists to explore their data to discover discriminating repertoire features for their own particular investigations. Computational methods are necessary for large-scale evaluation of antibody properties. We have developed BRepertoire, a suite of user-friendly web-based software tools for large-scale statistical analyses of repertoire data. The software is able to use data preprocessed by IMGT, and performs statistical and comparative analyses with versatile plotting options. BRepertoire has been designed to operate in various modes, for example analysing sequence-specific V(D)J gene usage, discerning physico-chemical properties of the CDR regions and clustering of clonotypes. Those analyses are performed on the fly by a number of R packages and are deployed by a shiny web platform. The user can download the analysed data in different table formats and save the generated plots as image files ready for publication. We believe BRepertoire to be a versatile analytical tool that complements experimental studies of immune repertoires. To illustrate the server's functionality, we show use cases including differential gene usage in a vaccination dataset and analysis of CDR3H properties in old and young individuals. The server is accessible under http://mabra.biomed.kcl.ac.uk/BRepertoire.
The development and preliminary testing of a multimedia patient–provider survivorship communication module for breast cancer survivors

PubMed Central

Wen, Kuang-Yi; Miller, Suzanne M.; Stanton, Annette L.; Fleisher, Linda; Morra, Marion E.; Jorge, Alexandra; Diefenbach, Michael A.; Ropka, Mary E.; Marcus, Alfred C.

2012-01-01

Objective This paper describes the development of a theory-guided and evidence-based multimedia training module to facilitate breast cancer survivors’ preparedness for effective communication with their health care providers after active treatment. Methods The iterative developmental process used included: (1) theory and evidence-based content development and vetting; (2) user testing; (3) usability testing; and (4) participant module utilization. Results Formative evaluation of the training module prototype occurred through user testing (n = 12), resulting in modification of the content and layout. Usability testing (n = 10) was employed to improve module functionality. Preliminary web usage data (n = 256, mean age = 53, 94.5% White, 75% college graduate and above) showed that 59% of the participants accessed the communication module, for an average of 7 min per login. Conclusion The iterative developmental process was informative in enhancing the relevance of the communication module. Preliminary web usage results demonstrate the potential feasibility of such a program. Practice implications Our study demonstrates survivors’ openness to the use of a web-based communication skills training module and outlines a systematic iterative user and interface program development and testing process, which can serve as a prototype for others considering such an approach. PMID:22770812
Development and alignment of undergraduate medical curricula in a web-based, dynamic Learning Opportunities, Objectives and Outcome Platform (LOOOP).

PubMed

Balzer, Felix; Hautz, Wolf E; Spies, Claudia; Bietenbeck, Andreas; Dittmar, Martin; Sugiharto, Firman; Lehmann, Lars; Eisenmann, Dorothea; Bubser, Florian; Stieg, Markus; Hanfler, Sven; Georg, Waltraud; Tekian, Ara; Ahlers, Olaf

2016-01-01

This study presents a web-based method and its interface ensuring alignment of all parts of a curriculum map including competencies, objectives, teaching and assessment methods, workload and patient availability. Needs, acceptance and effectiveness are shown through a nine-year study. After a comprehensive needs assessment, the curriculum map and a web-based interface "Learning Opportunities, Objectives and Outcome Platform" (LOOOP) were developed according to Harden's conceptual framework of 10-steps for curriculum mapping. The outcome was measured by surveys and results of interdisciplinary MCQ-assessments. The usage rates and functionalities were analysed. The implementation of LOOOP was significantly associated with improved perception of the curriculum structure by teachers and students, quality of defined objectives and their alignment with teaching and assessment, usage by students to prepare examinations and their scores in interdisciplinary MCQ-assessment. Additionally, LOOOP improved the curriculum coordination by faculty, and assisted departments for identifying patient availability for clinical training. LOOOP is well accepted among students and teachers, has positive effect on curriculum development, facilitates effective utilisation of educational resources and improves student's outcomes. Currently, LOOOP is used in five undergraduate medical curricula including 85,000 mapped learning opportunities (lectures, seminars), 5000 registered users (students, teachers) and 380,000 yearly page-visits.
Consolidating drug data on a global scale using Linked Data.

PubMed

Jovanovik, Milos; Trajanov, Dimitar

2017-01-21

Drug product data is available on the Web in a distributed fashion. The reasons lie within the regulatory domains, which exist on a national level. As a consequence, the drug data available on the Web are independently curated by national institutions from each country, leaving the data in varying languages, with a varying structure, granularity level and format, on different locations on the Web. Therefore, one of the main challenges in the realm of drug data is the consolidation and integration of large amounts of heterogeneous data into a comprehensive dataspace, for the purpose of developing data-driven applications. In recent years, the adoption of the Linked Data principles has enabled data publishers to provide structured data on the Web and contextually interlink them with other public datasets, effectively de-siloing them. Defining methodological guidelines and specialized tools for generating Linked Data in the drug domain, applicable on a global scale, is a crucial step to achieving the necessary levels of data consolidation and alignment needed for the development of a global dataset of drug product data. This dataset would then enable a myriad of new usage scenarios, which can, for instance, provide insight into the global availability of different drug categories in different parts of the world. We developed a methodology and a set of tools which support the process of generating Linked Data in the drug domain. Using them, we generated the LinkedDrugs dataset by seamlessly transforming, consolidating and publishing high-quality, 5-star Linked Drug Data from twenty-three countries, containing over 248,000 drug products, over 99,000,000 RDF triples and over 278,000 links to generic drugs from the LOD Cloud. Using the linked nature of the dataset, we demonstrate its ability to support advanced usage scenarios in the drug domain. The process of generating the LinkedDrugs dataset demonstrates the applicability of the methodological guidelines and the supporting tools in transforming drug product data from various, independent and distributed sources, into a comprehensive Linked Drug Data dataset. The presented user-centric and analytical usage scenarios over the dataset show the advantages of having a de-siloed, consolidated and comprehensive dataspace of drug data available via the existing infrastructure of the Web.
Proactive Supply Chain Performance Management with Predictive Analytics

PubMed Central

Stefanovic, Nenad

2014-01-01

Today's business climate requires supply chains to be proactive rather than reactive, which demands a new approach that incorporates data mining predictive analytics. This paper introduces a predictive supply chain performance management model which combines process modelling, performance measurement, data mining models, and web portal technologies into a unique model. It presents the supply chain modelling approach based on the specialized metamodel which allows modelling of any supply chain configuration and at different level of details. The paper also presents the supply chain semantic business intelligence (BI) model which encapsulates data sources and business rules and includes the data warehouse model with specific supply chain dimensions, measures, and KPIs (key performance indicators). Next, the paper describes two generic approaches for designing the KPI predictive data mining models based on the BI semantic model. KPI predictive models were trained and tested with a real-world data set. Finally, a specialized analytical web portal which offers collaborative performance monitoring and decision making is presented. The results show that these models give very accurate KPI projections and provide valuable insights into newly emerging trends, opportunities, and problems. This should lead to more intelligent, predictive, and responsive supply chains capable of adapting to future business environment. PMID:25386605

Proactive supply chain performance management with predictive analytics.

PubMed

Stefanovic, Nenad

2014-01-01

Today's business climate requires supply chains to be proactive rather than reactive, which demands a new approach that incorporates data mining predictive analytics. This paper introduces a predictive supply chain performance management model which combines process modelling, performance measurement, data mining models, and web portal technologies into a unique model. It presents the supply chain modelling approach based on the specialized metamodel which allows modelling of any supply chain configuration and at different level of details. The paper also presents the supply chain semantic business intelligence (BI) model which encapsulates data sources and business rules and includes the data warehouse model with specific supply chain dimensions, measures, and KPIs (key performance indicators). Next, the paper describes two generic approaches for designing the KPI predictive data mining models based on the BI semantic model. KPI predictive models were trained and tested with a real-world data set. Finally, a specialized analytical web portal which offers collaborative performance monitoring and decision making is presented. The results show that these models give very accurate KPI projections and provide valuable insights into newly emerging trends, opportunities, and problems. This should lead to more intelligent, predictive, and responsive supply chains capable of adapting to future business environment.
Advanced Query and Data Mining Capabilities for MaROS

NASA Technical Reports Server (NTRS)

Wang, Paul; Wallick, Michael N.; Allard, Daniel A.; Gladden, Roy E.; Hy, Franklin H.

2013-01-01

The Mars Relay Operational Service (MaROS) comprises a number of tools to coordinate, plan, and visualize various aspects of the Mars Relay network. These levels include a Web-based user interface, a back-end "ReSTlet" built in Java, and databases that store the data as it is received from the network. As part of MaROS, the innovators have developed and implemented a feature set that operates on several levels of the software architecture. This new feature is an advanced querying capability through either the Web-based user interface, or through a back-end REST interface to access all of the data gathered from the network. This software is not meant to replace the REST interface, but to augment and expand the range of available data. The current REST interface provides specific data that is used by the MaROS Web application to display and visualize the information; however, the returned information from the REST interface has typically been pre-processed to return only a subset of the entire information within the repository, particularly only the information that is of interest to the GUI (graphical user interface). The new, advanced query and data mining capabilities allow users to retrieve the raw data and/or to perform their own data processing. The query language used to access the repository is a restricted subset of the structured query language (SQL) that can be built safely from the Web user interface, or entered as freeform SQL by a user. The results are returned in a CSV (Comma Separated Values) format for easy exporting to third party tools and applications that can be used for data mining or user-defined visualization and interpretation. This is the first time that a service is capable of providing access to all cross-project relay data from a single Web resource. Because MaROS contains the data for a variety of missions from the Mars network, which span both NASA and ESA, the software also establishes an access control list (ACL) on each data record in the database repository to enforce user access permissions through a multilayered approach.
MINING ENVIRONMENTAL TOXICOLOGY INFORMATION WEB RESOURCES

EPA Science Inventory

Environmental toxicology is the study of the ecological effects of anthropogenic substances released into the environment. It is a relatively diverse field addressing impacts to aquatic and terrestrial organisms and communities. The determination of potential risk associated with...
Experimental evaluation of the impact of packet capturing tools for web services.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Choe, Yung Ryn; Mohapatra, Prasant; Chuah, Chen-Nee

Network measurement is a discipline that provides the techniques to collect data that are fundamental to many branches of computer science. While many capturing tools and comparisons have made available in the literature and elsewhere, the impact of these packet capturing tools on existing processes have not been thoroughly studied. While not a concern for collection methods in which dedicated servers are used, many usage scenarios of packet capturing now requires the packet capturing tool to run concurrently with operational processes. In this work we perform experimental evaluations of the performance impact that packet capturing process have on web-based services;more » in particular, we observe the impact on web servers. We find that packet capturing processes indeed impact the performance of web servers, but on a multi-core system the impact varies depending on whether the packet capturing and web hosting processes are co-located or not. In addition, the architecture and behavior of the web server and process scheduling is coupled with the behavior of the packet capturing process, which in turn also affect the web server's performance.« less
AMP: A platform for managing and mining data in the treatment of Autism Spectrum Disorder.

PubMed

Linstead, Erik; Burns, Ryan; Duy Nguyen; Tyler, David

2016-08-01

We introduce AMP (Autism Management Platform), an integrated health care information system for capturing, analyzing, and managing data associated with the diagnosis and treatment of Autism Spectrum Disorder in children. AMP's mobile application simplifies the means by which parents, guardians, and clinicians can collect and share multimedia data with one another, facilitating communication and reducing data redundancy, while simplifying retrieval. Additionally, AMP provides an intelligent web interface and analytics platform which allow physicians and specialists to aggregate and mine patient data in real-time, as well as give relevant feedback to automatically learn data filtering preferences over time. Together AMP's mobile app, web client, and analytics engine implement a rich set of features that streamline the data collection and analysis process in the context of a secure and easy-to-use system so that data may be more effectively leveraged to guide treatment.
Assessing the Effects of Participant Preference and Demographics in the Usage of Web-based Survey Questionnaires by Women Attending Screening Mammography in British Columbia.

PubMed

Mlikotic, Rebecca; Parker, Brent; Rajapakshe, Rasika

2016-03-22

Increased usage of Internet applications has allowed for the collection of patient reported outcomes (PROs) and other health data through Web-based communication and questionnaires. While these Web platforms allow for increased speed and scope of communication delivery, there are certain limitations associated with this technology, as survey mode preferences vary across demographic groups. To investigate the impact of demographic factors and participant preferences on the use of a Web-based questionnaire in comparison with more traditional methods (mail and phone) for women participating in screening mammography in British Columbia, Canada. A sample of women attending the Screening Mammography Program of British Columbia (SMPBC) participated in a breast cancer risk assessment project. The study questionnaire was administered through one of three modes (ie, telephone, mail, or website platform). Survey mode preferences and actual methods of response were analyzed for participants recruited from Victoria General Hospital. Both univariate and multivariate analyses were used to investigate the association of demographic factors (ie, age, education level, and ethnicity) with certain survey response types. A total of 1192 women successfully completed the study questionnaire at Victoria General Hospital. Mail was stated as the most preferred survey mode (509/1192, 42.70%), followed by website platform (422/1192, 35.40%), and telephone (147/1192, 12.33%). Over 80% (955/1192) of participants completed the questionnaire in the mode previously specified as their most preferred; mail was the most common method of response (688/1192, 57.72%). Mail was also the most preferred type of questionnaire response method when participants responded in a mode other than their original preference. The average age of participants who responded via the Web-based platform (age 52.9, 95% confidence interval [CI] 52.1-53.7) was significantly lower than those who used mail and telephone methods (age 55.9, 95% CI 55.2-56.5; P<.001); each decade of increased age was associated with a 0.97-fold decrease in the odds of using the website platform (P<.001). Web-based participation was more likely for those who completed higher levels of education; each interval increase leading to a 1.83 increase in the odds of website platform usage (P<.001). Ethnicity was not shown to play a role in participant preference for the website platform (P=.96). It is beneficial to consider participant survey mode preference when planning to collect PROs and other patient health data. Younger participants and those of higher education level were more likely to use the website platform questionnaire; Web-based participation failed to vary across ethnic group. Because mail questionnaires were still the most preferred survey mode, it will be important to employ strategies, such as user-friendly design and Web-based support, to ensure that the patient feedback being collected is representative of the population being served.
Assessing the Effects of Participant Preference and Demographics in the Usage of Web-based Survey Questionnaires by Women Attending Screening Mammography in British Columbia

PubMed Central

2016-01-01

Background Increased usage of Internet applications has allowed for the collection of patient reported outcomes (PROs) and other health data through Web-based communication and questionnaires. While these Web platforms allow for increased speed and scope of communication delivery, there are certain limitations associated with this technology, as survey mode preferences vary across demographic groups. Objective To investigate the impact of demographic factors and participant preferences on the use of a Web-based questionnaire in comparison with more traditional methods (mail and phone) for women participating in screening mammography in British Columbia, Canada. Methods A sample of women attending the Screening Mammography Program of British Columbia (SMPBC) participated in a breast cancer risk assessment project. The study questionnaire was administered through one of three modes (ie, telephone, mail, or website platform). Survey mode preferences and actual methods of response were analyzed for participants recruited from Victoria General Hospital. Both univariate and multivariate analyses were used to investigate the association of demographic factors (ie, age, education level, and ethnicity) with certain survey response types. Results A total of 1192 women successfully completed the study questionnaire at Victoria General Hospital. Mail was stated as the most preferred survey mode (509/1192, 42.70%), followed by website platform (422/1192, 35.40%), and telephone (147/1192, 12.33%). Over 80% (955/1192) of participants completed the questionnaire in the mode previously specified as their most preferred; mail was the most common method of response (688/1192, 57.72%). Mail was also the most preferred type of questionnaire response method when participants responded in a mode other than their original preference. The average age of participants who responded via the Web-based platform (age 52.9, 95% confidence interval [CI] 52.1-53.7) was significantly lower than those who used mail and telephone methods (age 55.9, 95% CI 55.2-56.5; P<.001); each decade of increased age was associated with a 0.97-fold decrease in the odds of using the website platform (P<.001). Web-based participation was more likely for those who completed higher levels of education; each interval increase leading to a 1.83 increase in the odds of website platform usage (P<.001). Ethnicity was not shown to play a role in participant preference for the website platform (P=.96). Conclusions It is beneficial to consider participant survey mode preference when planning to collect PROs and other patient health data. Younger participants and those of higher education level were more likely to use the website platform questionnaire; Web-based participation failed to vary across ethnic group. Because mail questionnaires were still the most preferred survey mode, it will be important to employ strategies, such as user-friendly design and Web-based support, to ensure that the patient feedback being collected is representative of the population being served. PMID:27005707
78 FR 10613 - Proposed Agency Information Collection

Federal Register 2010, 2011, 2012, 2013, 2014

2013-02-14

.... The information collection requests a three-year approval of its Customer Electricity Data Access and... information about customer access to electricity usage data. The information will be shared on the DOE-supported OpenEI Web site where consumers can learn about the access offered by their electricity provider...
Identifying the Computer Competency Levels of Recreation Department Undergraduates

ERIC Educational Resources Information Center

Zorba, Erdal

2011-01-01

Computer-based and web-based applications are as major instructional tools to increase undergraduates' motivation at school. In the recreation field usage of, computer and the internet based recreational applications has become more prevalent in order to present visual and interactive entertainment activities. Recreation department undergraduates…
Exploring Determinants of Patient Adherence to a Portal-Supported Oncology Rehabilitation Program: Interview and Data Log Analyses

PubMed Central

Tabak, Monique; van Velsen, Lex; van der Geest, Thea; Hermens, Hermie

2017-01-01

Background Telemedicine applications often do not live up to their expectations and often fail once they have reached the operational phase. Objective The objective of this study was to explore the determinants of patient adherence to a blended care rehabilitation program, which includes a Web portal, from a patient’s perspective. Methods Patients were enrolled in a 12-week oncology rehabilitation treatment supported by a Web portal that was developed in cooperation with patients and care professionals. Semistructured interviews were used to analyze thought processes and behavior concerning patient adherence and portal use. Interviews were conducted with patients close to the start and the end of the treatment. Besides, usage data from the portal were analyzed to gain insights into actual usage of the portal. Results A total of 12 patients participated in the first interview, whereas 10 participated in the second round of interviews. Furthermore, portal usage of 31 patients was monitored. On average, 11 persons used the portal each week, with a maximum of 20 in the seventh week and a drop toward just one person in the weeks in the follow-up period of the treatment. From the interviews, it was derived that patients’ behavior in the treatment and use of the portal was primarily determined by extrinsic motivation cues (eg, stimulation by care professionals and patient group), perceived severity of the disease (eg, physical and mental condition), perceived ease of use (eg, accessibility of the portal and the ease with which information is found), and perceived usefulness (eg, fit with the treatment). Conclusions The results emphasized the impact that care professionals and fellow patients have on patient adherence and portal usage. For this reason, the success of blended care telemedicine interventions seems highly dependent on the willingness of care professionals to include the technology in their treatment and stimulate usage among patients. PMID:29242173
Uncovering text mining: A survey of current work on web-based epidemic intelligence

PubMed Central

Collier, Nigel

2012-01-01

Real world pandemics such as SARS 2002 as well as popular fiction like the movie Contagion graphically depict the health threat of a global pandemic and the key role of epidemic intelligence (EI). While EI relies heavily on established indicator sources a new class of methods based on event alerting from unstructured digital Internet media is rapidly becoming acknowledged within the public health community. At the heart of automated information gathering systems is a technology called text mining. My contribution here is to provide an overview of the role that text mining technology plays in detecting epidemics and to synthesise my existing research on the BioCaster project. PMID:22783909
SalanderMaps: A rapid overview about felt earthquakes through data mining of web-accesses

NASA Astrophysics Data System (ADS)

Kradolfer, Urs

2013-04-01

While seismological observatories detect and locate earthquakes based on measurements of the ground motion, they neither know a priori whether an earthquake has been felt by the public nor is it known, where it has been felt. Such information is usually gathered by evaluating feedback reported by the public through on-line forms on the web. However, after a felt earthquake in Switzerland, many people visit the webpages of the Swiss Seismological Service (SED) at the ETH Zurich and each such visit leaves traces in the logfiles on our web-servers. Data mining techniques, applied to these logfiles and mining publicly available data bases on the internet open possibilities to obtain previously unknown information about our virtual visitors. In order to provide precise information to authorities and the media, it would be desirable to rapidly know from which locations these web-accesses origin. The method 'Salander' (Seismic Activitiy Linked to Area codes - Nimble Detection of Earthquake Rumbles) will be introduced and it will be explained, how the IP-addresses (each computer or router directly connected to the internet has a unique IP-address; an example would be 129.132.53.5) of a sufficient amount of our virtual visitors were linked to their geographical area. This allows us to unprecedentedly quickly know whether and where an earthquake was felt in Switzerland. It will also be explained, why the method Salander is superior to commercial so-called geolocation products. The corresponding products of the Salander method, animated SalanderMaps, which are routinely generated after each earthquake with a magnitude of M>2 in Switzerland (http://www.seismo.ethz.ch/prod/salandermaps/, available after March 2013), demonstrate how the wavefield of earthquakes propagates through Switzerland and where it was felt. Often, such information is available within less than 60 seconds after origin time, and we always get a clear picture within already five minutes after origin time. Furthermore, the method allows to detect earthquakes solely on the analysis of accesses to our web-servers. Analyzing more than 170 million web-accesses since 2003, all seismic events within or near Switzerland with magnitudes M>4 and most felt events with magnitudes between 3 and 4 were detected. The current system is very robust, as we only had one false alarm while re-processing the web-access logfiles of the past almost 10 years. We anticipate that this method will produce even faster results in the future as the number of both commercial and private internet users is - according to the statistics of our logfiles - still increasing.
Reconsidering the Rhizome: A Textual Analysis of Web Search Engines as Gatekeepers of the Internet

NASA Astrophysics Data System (ADS)

Hess, A.

Critical theorists have often drawn from Deleuze and Guattari's notion of the rhizome when discussing the potential of the Internet. While the Internet may structurally appear as a rhizome, its day-to-day usage by millions via search engines precludes experiencing the random interconnectedness and potential democratizing function. Through a textual analysis of four search engines, I argue that Web searching has grown hierarchies, or "trees," that organize data in tracts of knowledge and place users in marketing niches rather than assist in the development of new knowledge.
ChloroMitoCU: Codon patterns across organelle genomes for functional genomics and evolutionary applications.

PubMed

Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus

2017-06-01

Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Does individual learning styles influence the choice to use a web-based ECG learning programme in a blended learning setting?

PubMed Central

2012-01-01

Background The compressed curriculum in modern knowledge-intensive medicine demands useful tools to achieve approved learning aims in a limited space of time. Web-based learning can be used in different ways to enhance learning. Little is however known regarding its optimal utilisation. Our aim was to investigate if the individual learning styles of medical students influence the choice to use a web-based ECG learning programme in a blended learning setting. Methods The programme, with three types of modules (learning content, self-assessment questions and interactive ECG interpretation training), was offered on a voluntary basis during a face to face ECG learning course for undergraduate medical students. The Index of Learning Styles (ILS) and a general questionnaire including questions about computer and Internet usage, preferred future speciality and prior experience of E-learning were used to explore different factors related to the choice of using the programme or not. Results 93 (76%) out of 123 students answered the ILS instrument and 91 the general questionnaire. 55 students (59%) were defined as users of the web-based ECG-interpretation programme. Cronbach's alpha was analysed with coefficients above 0.7 in all of the four dimensions of ILS. There were no significant differences with regard to learning styles, as assessed by ILS, between the user and non-user groups; Active/Reflective; Visual/Verbal; Sensing/Intuitive; and Sequential/Global (p = 0.56-0.96). Neither did gender, prior experience of E-learning or preference for future speciality differ between groups. Conclusion Among medical students, neither learning styles according to ILS, nor a number of other characteristics seem to influence the choice to use a web-based ECG programme. This finding was consistent also when the usage of the different modules in the programme were considered. Thus, the findings suggest that web-based learning may attract a broad variety of medical students. PMID:22248183
Does individual learning styles influence the choice to use a web-based ECG learning programme in a blended learning setting?

PubMed

Nilsson, Mikael; Östergren, Jan; Fors, Uno; Rickenlund, Anette; Jorfeldt, Lennart; Caidahl, Kenneth; Bolinder, Gunilla

2012-01-16

The compressed curriculum in modern knowledge-intensive medicine demands useful tools to achieve approved learning aims in a limited space of time. Web-based learning can be used in different ways to enhance learning. Little is however known regarding its optimal utilisation. Our aim was to investigate if the individual learning styles of medical students influence the choice to use a web-based ECG learning programme in a blended learning setting. The programme, with three types of modules (learning content, self-assessment questions and interactive ECG interpretation training), was offered on a voluntary basis during a face to face ECG learning course for undergraduate medical students. The Index of Learning Styles (ILS) and a general questionnaire including questions about computer and Internet usage, preferred future speciality and prior experience of E-learning were used to explore different factors related to the choice of using the programme or not. 93 (76%) out of 123 students answered the ILS instrument and 91 the general questionnaire. 55 students (59%) were defined as users of the web-based ECG-interpretation programme. Cronbach's alpha was analysed with coefficients above 0.7 in all of the four dimensions of ILS. There were no significant differences with regard to learning styles, as assessed by ILS, between the user and non-user groups; Active/Reflective; Visual/Verbal; Sensing/Intuitive; and Sequential/Global (p = 0.56-0.96). Neither did gender, prior experience of E-learning or preference for future speciality differ between groups. Among medical students, neither learning styles according to ILS, nor a number of other characteristics seem to influence the choice to use a web-based ECG programme. This finding was consistent also when the usage of the different modules in the programme were considered. Thus, the findings suggest that web-based learning may attract a broad variety of medical students.
Fate and Trophic Transfer of Rare Earth Elements in Temperate Lake Food Webs.

PubMed

Amyot, Marc; Clayden, Meredith G; MacMillan, Gwyneth A; Perron, Tania; Arscott-Gauvin, Alexandre

2017-06-06

Many mining projects targeting rare earth elements (REE) are in development in North America, but the background concentrations and trophic transfer of these elements in natural environments have not been well characterized. We sampled abiotic and food web components in 14 Canadian temperate lakes unaffected by mines to assess the natural ecosystem fate of REE. Individual REE and total REE concentrations (sum of individual element concentrations, ΣREE) were strongly related with each other throughout different components of lake food webs. Dissolved organic carbon and dissolved oxygen in the water column, as well as ΣREE in sediments, were identified as potential drivers of aqueous ΣREE. Log 10 of median bioaccumulation factors ranged from 1.3, 3.7, 4.0, and 4.4 L/kg (wet weight) for fish muscle, zooplankton, predatory invertebrates, and nonpredatory invertebrates, respectively. [ΣREE] in fish, benthic macroinvertebrates, and zooplankton declined as a function of their trophic position, as determined by functional feeding groups and isotopic signatures of nitrogen (δ 15 N), indicating that REE were subject to trophic dilution. Low concentrations of REE in freshwater fish muscle compared to their potential invertebrate prey suggest that fish fillet consumption is unlikely to be a significant source of REE to humans in areas unperturbed by mining activities. However, other fish predators (e.g., piscivorous birds and mammals) may accumulate REE from whole fish as they are more concentrated than muscle. Overall, this study provides key information on the baseline concentrations and trophic patterns for REE in freshwater temperate lakes in Quebec, Canada.
Web-based cognitive behavior therapy: analysis of site usage and changes in depression and anxiety scores.

PubMed

Christensen, Helen; Griffiths, Kathleen M; Korten, Ailsa

2002-01-01

Cognitive behavior therapy is well recognized as an effective treatment and prevention for depression when delivered face-to-face, via self-help books (bibliotherapy), and through computer administration. The public health impact of cognitive behavior therapy has been limited by cost and the lack of trained practitioners. We have developed a free Internet-based cognitive behavior therapy intervention (MoodGYM, http://moodgym.anu.edu.au) designed to treat and prevent depression in young people, available to all Internet users, and targeted to those who may have no formal contact with professional help services. To document site usage, visitor characteristics, and changes in depression and anxiety symptoms among users of MoodGYM, a Web site delivering a cognitive-behavioral-based preventive intervention to the general public. All visitors to the MoodGYM site over about 6 months were investigated, including 2909 registrants of whom 1503 had completed at least one online assessment. Outcomes for 71 university students enrolled in an Abnormal Psychology course who visited the site for educational training were included and examined separately. The main outcome measures were (1) site-usage measures including number of sessions, hits and average time on the server, and number of page views; (2) visitor characteristics including age, gender, and initial Goldberg self-report anxiety and depression scores; and (3) symptom change measures based on Goldberg anxiety and depression scores recorded on up to 5 separate occasions. Over the first almost-6-month period of operation, the server recorded 817284 hits and 17646 separate sessions. Approximately 20% of sessions lasted more than 16 minutes. Registrants who completed at least one assessment reported initial symptoms of depression and anxiety that exceeded those found in population-based surveys and those characterizing a sample of University students. For the Web-based population, both anxiety and depression scores decreased significantly as individuals progressed through the modules. CONCLUSIONS Web sites are a practical and promising means of delivering cognitive behavioral interventions for preventing depression and anxiety to the general public. However, randomized controlled trials are required to establish the effectiveness of these interventions.
Classification of Learning Styles in Virtual Learning Environment Using J48 Decision Tree

ERIC Educational Resources Information Center

Maaliw, Renato R. III; Ballera, Melvin A.

2017-01-01

The usage of data mining has dramatically increased over the past few years and the education sector is leveraging this field in order to analyze and gain intuitive knowledge in terms of the vast accumulated data within its confines. The primary objective of this study is to compare the results of different classification techniques such as Naïve…
2018 Cyber Enabled Emerging Technologies Symposium

DTIC Science & Technology

2018-03-08

Principles • Better data = better outcomes • Training > Programming • AI anxiety?... Think IA (Intelligent Assistant) • Ingest much more information • Make...Local Marketing 7 Usage: “Local” / Specific AI • Healthcare (oncology) • Data Mining/Discovery • Chat bots • Personnel • Finance • Sourcing...cognitive- principles / So, Our Priorities for AI Adoption and Ethics • Purpose: human augmentation versus replacement • Human decision-making • Human

The Comprehensive Microbial Resource.

PubMed

Peterson, J D; Umayam, L A; Dickinson, T; Hickey, E K; White, O

2001-01-01

One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes.
Cadmium Accumulation in Periphyton from an Abandoned Mining District in the Buffalo National River, Arkansas.

PubMed

McCauley, Jacob R; Bouldin, Jennifer L

2016-06-01

The Rush Mining District along the Buffalo River in Arkansas has a significant history of zinc and lead mining operations. The tails and spoils of these operations deposit heavy amounts of raw ore into streams. One element commonly found in the earth's crust that becomes a minor constituent of the deposition is cadmium. Periphyton samples from Rush Creek and Clabber Creek, two creeks within the Rush Mining District were measured for cadmium as well as two creeks with no history of mining, Spring Creek and Water Creek. Periphyton samples from Rush and Clabber Creek contained mean cadmium concentrations of 436.6 ± 67.3 and 93.38 ± 8.67 µg/kg, respectively. Spring Creek and Water Creek had a mean cadmium concentration of 40.49 ± 3.40 and 41.78 ± 3.99 µg/kg within periphyton. The results indicate increased metal concentrations in algal communities from mined areas. As periphyton is the base of the aquatic food chain, it acts as a conduit for movement of cadmium in the food web.
Launching a virtual decision lab: development and field-testing of a web-based patient decision support research platform.

PubMed

Hoffman, Aubri S; Llewellyn-Thomas, Hilary A; Tosteson, Anna N A; O'Connor, Annette M; Volk, Robert J; Tomek, Ivan M; Andrews, Steven B; Bartels, Stephen J

2014-12-12

Over 100 trials show that patient decision aids effectively improve patients' information comprehension and values-based decision making. However, gaps remain in our understanding of several fundamental and applied questions, particularly related to the design of interactive, personalized decision aids. This paper describes an interdisciplinary development process for, and early field testing of, a web-based patient decision support research platform, or virtual decision lab, to address these questions. An interdisciplinary stakeholder panel designed the web-based research platform with three components: a) an introduction to shared decision making, b) a web-based patient decision aid, and c) interactive data collection items. Iterative focus groups provided feedback on paper drafts and online prototypes. A field test assessed a) feasibility for using the research platform, in terms of recruitment, usage, and acceptability; and b) feasibility of using the web-based decision aid component, compared to performance of a videobooklet decision aid in clinical care. This interdisciplinary, theory-based, patient-centered design approach produced a prototype for field-testing in six months. Participants (n = 126) reported that: the decision aid component was easy to use (98%), information was clear (90%), the length was appropriate (100%), it was appropriately detailed (90%), and it held their interest (97%). They spent a mean of 36 minutes using the decision aid and 100% preferred using their home/library computer. Participants scored a mean of 75% correct on the Decision Quality, Knowledge Subscale, and 74 out of 100 on the Preparation for Decision Making Scale. Completing the web-based decision aid reduced mean Decisional Conflict scores from 31.1 to 19.5 (p < 0.01). Combining decision science and health informatics approaches facilitated rapid development of a web-based patient decision support research platform that was feasible for use in research studies in terms of recruitment, acceptability, and usage. Within this platform, the web-based decision aid component performed comparably with the videobooklet decision aid used in clinical practice. Future studies may use this interactive research platform to study patients' decision making processes in real-time, explore interdisciplinary approaches to designing web-based decision aids, and test strategies for tailoring decision support to meet patients' needs and preferences.
Electricity forecasting on the individual household level enhanced based on activity patterns

PubMed Central

Gajowniczek, Krzysztof; Ząbkowski, Tomasz

2017-01-01

Leveraging smart metering solutions to support energy efficiency on the individual household level poses novel research challenges in monitoring usage and providing accurate load forecasting. Forecasting electricity usage is an especially important component that can provide intelligence to smart meters. In this paper, we propose an enhanced approach for load forecasting at the household level. The impacts of residents’ daily activities and appliance usages on the power consumption of the entire household are incorporated to improve the accuracy of the forecasting model. The contributions of this paper are threefold: (1) we addressed short-term electricity load forecasting for 24 hours ahead, not on the aggregate but on the individual household level, which fits into the Residential Power Load Forecasting (RPLF) methods; (2) for the forecasting, we utilized a household specific dataset of behaviors that influence power consumption, which was derived using segmentation and sequence mining algorithms; and (3) an extensive load forecasting study using different forecasting algorithms enhanced by the household activity patterns was undertaken. PMID:28423039
Electricity forecasting on the individual household level enhanced based on activity patterns.

PubMed

Gajowniczek, Krzysztof; Ząbkowski, Tomasz

2017-01-01

Leveraging smart metering solutions to support energy efficiency on the individual household level poses novel research challenges in monitoring usage and providing accurate load forecasting. Forecasting electricity usage is an especially important component that can provide intelligence to smart meters. In this paper, we propose an enhanced approach for load forecasting at the household level. The impacts of residents' daily activities and appliance usages on the power consumption of the entire household are incorporated to improve the accuracy of the forecasting model. The contributions of this paper are threefold: (1) we addressed short-term electricity load forecasting for 24 hours ahead, not on the aggregate but on the individual household level, which fits into the Residential Power Load Forecasting (RPLF) methods; (2) for the forecasting, we utilized a household specific dataset of behaviors that influence power consumption, which was derived using segmentation and sequence mining algorithms; and (3) an extensive load forecasting study using different forecasting algorithms enhanced by the household activity patterns was undertaken.
Reclamation of surface mined lands

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

1979-09-01

A detailed report has recently been published in which the whole subject of mine reclamation has been extensively reviewed and discussed. Part One deals with the technology of reclamation, in which the methods and procedures used have been illustrated by examples taken from the practice of different countries as required by law or by accepted usage. In general, the illustrations used are the most stringent that apply to the procedure under discussion. This serves to show the situation in its most severe light, but it also gives warning of the direction in which the law will move in other countriesmore » that are not so environmentally conscious as the pacesetters. Part Two of the report deals with the law and practice in the major mining nations of the West that have legislation on the subject. This is a field in which much movement is taking place; new laws and regulations are being enacted, and old ones amended and revised. The laws outlined in the report are designed to give the general sense of the law and define the most important regulations. Any mining company contemplating surface mining in a country with which it is unfamiliar will naturally obtain the currently valid legislation. Reclamation of Surface Mined Lands by W.L.G. Muir (price US $225 or equivalent) can be obtained from World Coal, Book Department, 500 Howard Street, San Francisco, California 94105, USA.« less
A Survey and Empirical Study of Virtual Reference Service in Academic Libraries

ERIC Educational Resources Information Center

Mu, Xiangming; Dimitroff, Alexandra; Jordan, Jeanette; Burclaff, Natalie

2011-01-01

Virtual Reference Services (VRS) have high user satisfaction. The main problem is its low usage. We surveyed 100 academic library web sites to understand how VRS are presented. We then conducted a usability study to further test an active VRS model regarding its effectiveness.
The Internet and Librarians: Just Do It!!!!!

ERIC Educational Resources Information Center

Helfer, Doris Small

1998-01-01

Argues that librarians should actively accept their responsibility to ensure and provide useful content to the general Internet public. Discusses change brought on by electric commerce and the need for librarians to fill traditional roles, as well as new roles in providing and encouraging Web usage by the public. (AEF)
WINDS: A Web-Based Intelligent Interactive Course on Data-Structures

ERIC Educational Resources Information Center

Sirohi, Vijayalaxmi

2007-01-01

The Internet has opened new ways of learning and has brought several advantages to computer-aided education. Global access, self-paced learning, asynchronous teaching, interactivity, and multimedia usage are some of these. Along with the advantages comes the challenge of designing the software using the available facilities. Integrating online…
Improving ESL Writing Using an Online Formulaic Sequence Word-Combination Checker

ERIC Educational Resources Information Center

Grami, G. M. A.; Alkazemi, B. Y.

2016-01-01

Writing correct English sentences can be challenging. Furthermore, writing correct formulaic sequences can be especially difficult because accepted combinations do not follow clear rules governing which words appear together in a sequence. One solution is to provide examples of correct usage accompanied by statistical feedback from web-based…
Translating Translations: Selecting and Using Translated Early Childhood Materials.

ERIC Educational Resources Information Center

Santos, Rosa Milagros; Lee, Sung Yoon; Valdivia, Rebeca; Zhang, Chun

2001-01-01

This article provides early intervention professionals with strategies for selecting and using translated materials. It stresses the importance of considering both the intended audience of the material and the quality of the translation itself. The article notes that many Web-based translator programs fail to capture the idiomatic usage or…
ADVICE--Educational System for Teaching Database Courses

ERIC Educational Resources Information Center

Cvetanovic, M.; Radivojevic, Z.; Blagojevic, V.; Bojovic, M.

2011-01-01

This paper presents a Web-based educational system, ADVICE, that helps students to bridge the gap between database management system (DBMS) theory and practice. The usage of ADVICE is presented through a set of laboratory exercises developed to teach students conceptual and logical modeling, SQL, formal query languages, and normalization. While…
Efficacy of a Virtual Teaching Assistant in an Open Laboratory Environment for Electric Circuits

ERIC Educational Resources Information Center

Saleheen, Firdous; Wang, Zicong; Picone, Joseph; Butz, Brian P.; Won, Chang-Hee

2018-01-01

In order to provide an on-demand, open electrical engineering laboratory, we developed an innovative software-based Virtual Open Laboratory Teaching Assistant (VOLTA). This web-based virtual assistant provides laboratory instructions, equipment usage videos, circuit simulation assistance, and hardware implementation diagnostics. VOLTA allows…
Development of Database for Accident Analysis in Indian Mines

NASA Astrophysics Data System (ADS)

Tripathy, Debi Prasad; Guru Raghavendra Reddy, K.

2016-10-01

Mining is a hazardous industry and high accident rates associated with underground mining is a cause of deep concern. Technological developments notwithstanding, rate of fatal accidents and reportable incidents have not shown corresponding levels of decline. This paper argues that adoption of appropriate safety standards by both mine management and the government may result in appreciable reduction in accident frequency. This can be achieved by using the technology in improving the working conditions, sensitising workers and managers about causes and prevention of accidents. Inputs required for a detailed analysis of an accident include information on location, time, type, cost of accident, victim, nature of injury, personal and environmental factors etc. Such information can be generated from data available in the standard coded accident report form. This paper presents a web based application for accident analysis in Indian mines during 2001-2013. An accident database (SafeStat) prototype based on Intranet of the TCP/IP agreement, as developed by the authors, is also discussed.
An appraisal of biological responses and network of environmental interactions in non-mining and mining impacted coastal waters.

PubMed

Fernandes, Christabelle E G; Malik, Ashish; Jineesh, V K; Fernandes, Sheryl O; Das, Anindita; Pandey, Sunita S; Kanolkar, Geeta; Sujith, P P; Velip, Dhillan M; Shaikh, Shagufta; Helekar, Samita; Gonsalves, Maria Judith; Nair, Shanta; LokaBharathi, P A

2015-08-01

The coastal waters of Goa and Ratnagiri lying on the West coast of India are influenced by terrestrial influx. However, Goa is influenced anthropogenically by iron-ore mining, while Ratnagiri is influenced by deposition of heavy minerals containing iron brought from the hinterlands. We hypothesize that there could be a shift in biological response along with changes in network of interactions between environmental and biological variables in these mining and non-mining impacted regions, lying 160 nmi apart. Biological and environmental parameters were analyzed during pre-monsoon season. Except silicates, the measured parameters were higher at Goa and related significantly, suggesting bacteria centric, detritus-driven region. At Ratnagiri, phytoplankton biomass related positively with silicate suggesting a region dominated by primary producers. This dominance perhaps got reflected as a higher tertiary yield. Thus, even though the regions are geographically proximate, the different biological response could be attributed to the differences in the web of interactions between the measured variables.
Corner-cutting mining assembly

DOEpatents

Bradley, J.A.

1981-07-01

This invention resulted from a contract with the United States Department of Energy and relates to a mining tool. More particularly, the invention relates to an assembly capable of drilling a hole having a square cross-sectional shape with radiused corners. In mining operations in which conventional auger-type drills are used to form a series of parallel, cylindrical holes in a coal seam, a large amount of coal remains in place in the seam because the shape of the holes leaves thick webs between the holes. A higher percentage of coal can be mined from a seam by a means capable of drilling holes having a substantially square cross section. It is an object of this invention to provide an improved mining apparatus by means of which the amount of coal recovered from a seam deposit can be increased. Another object of the invention is to provide a drilling assembly which cuts corners in a hole having a circular cross section. These objects and other advantages are attained by a preferred embodiment of the invention.
Mercury flow through an Asian rice-based food web.

PubMed

Abeysinghe, Kasun S; Qiu, Guangle; Goodale, Eben; Anderson, Christopher W N; Bishop, Kevin; Evers, David C; Goodale, Morgan W; Hintelmann, Holger; Liu, Shengjie; Mammides, Christos; Quan, Rui-Chang; Wang, Jin; Wu, Pianpian; Xu, Xiao-Hang; Yang, Xiao-Dong; Feng, Xinbin

2017-10-01

Mercury (Hg) is a globally-distributed pollutant, toxic to humans and animals. Emissions are particularly high in Asia, and the source of exposure for humans there may also be different from other regions, including rice as well as fish consumption, particularly in contaminated areas. Yet the threats Asian wildlife face in rice-based ecosystems are as yet unclear. We sought to understand how Hg flows through rice-based food webs in historic mining and non-mining regions of Guizhou, China. We measured total Hg (THg) and methylmercury (MeHg) in soil, rice, 38 animal species (27 for MeHg) spanning multiple trophic levels, and examined the relationship between stable isotopes and Hg concentrations. Our results confirm biomagnification of THg/MeHg, with a high trophic magnification slope. Invertivorous songbirds had concentrations of THg in their feathers that were 15x and 3x the concentration reported to significantly impair reproduction, at mining and non-mining sites, respectively. High concentrations in specialist rice consumers and in granivorous birds, the later as high as in piscivorous birds, suggest rice is a primary source of exposure. Spiders had the highest THg concentrations among invertebrates and may represent a vector through which Hg is passed to vertebrates, especially songbirds. Our findings suggest there could be significant population level health effects and consequent biodiversity loss in sensitive ecosystems, like agricultural wetlands, across Asia, and invertivorous songbirds would be good subjects for further studies investigating this possibility. Copyright © 2017 Elsevier Ltd. All rights reserved.
Trajectories of 12-Month Usage Patterns for Two Smoking Cessation Websites: Exploring How Users Engage Over Time.

PubMed

Bricker, Jonathan B; Sridharan, Vasundhara; Zhu, Yifan; Mull, Kristin E; Heffner, Jaimee L; Watson, Noreen L; McClure, Jennifer B; Di, Chongzhi

2018-04-20

Little is known about how individuals engage with electronic health (eHealth) interventions over time and whether this engagement predicts health outcomes. The objectives of this study, by using the example of a specific type of eHealth intervention (ie, websites for smoking cessation), were to determine (1) distinct groups of log-in trajectories over a 12-month period, (2) their association with smoking cessation, and (3) baseline user characteristics that predict trajectory group membership. We conducted a functional clustering analysis of 365 consecutive days of log-in data from both arms of a large (N=2637) randomized trial of 2 website interventions for smoking cessation (WebQuit and Smokefree), with a primary outcome of 30-day point prevalence smoking abstinence at 12 months. We conducted analyses for each website separately. A total of 3 distinct trajectory groups emerged for each website. For WebQuit, participants were clustered into 3 groups: 1-week users (682/1240, 55.00% of the sample), 5-week users (399/1240, 32.18%), and 52-week users (159/1240, 12.82%). Compared with the 1-week users, the 5- and 52-week users had 57% higher odds (odds ratio [OR] 1.57, 95% CI 1.13-2.17; P=.007) and 124% higher odds (OR 2.24, 95% CI 1.45-3.43; P<.001), respectively, of being abstinent at 12 months. Smokefree users were clustered into 3 groups: 1-week users (645/1309, 49.27% of the sample), 4-week users (395/1309, 30.18%), and 5-week users (269/1309, 20.55%). Compared with the 1-week users, 5-week users (but not 4-week users; P=.99) had 48% higher odds (OR 1.48, 95% CI 1.05-2.07; P=.02) of being abstinent at 12 months. In general, the WebQuit intervention had a greater number of weekly log-ins within each of the 3 trajectory groups as compared with those of the Smokefree intervention. Baseline characteristics associated with trajectory group membership varied between websites. Patterns of 1-, 4-, and 5-week usage of websites may be common for how people engage in eHealth interventions. The 5-week usage of either website, and 52-week usage only of WebQuit, predicted a higher odds of quitting smoking. Strategies to increase eHealth intervention engagement for 4 more weeks (ie, from 1 week to 5 weeks) could be highly cost effective. ClinicalTrials.gov NCT01812278; https://www.clinicaltrials.gov/ct2/show/NCT01812278 (Archived by WebCite at http://www.webcitation.org/6yPO2OIKR). ©Jonathan B Bricker, Vasundhara Sridharan, Yifan Zhu, Kristin E Mull, Jaimee L Heffner, Noreen L Watson, Jennifer B McClure, Chongzhi Di. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 20.04.2018.
Preliminary Research on Possibilities of Drilling Process Robotization

NASA Astrophysics Data System (ADS)

Pawel, Stefaniak; Jacek, Wodecki; Jakubiak, Janusz; Zimroz, Radoslaw

2017-12-01

Nowadays, drilling & blasting is crucial technique for deposit excavation using in hard rock mining. Unfortunately, such approach requires qualified staff to perform, and consequently there is a serious risk related to rock mechanics when using explosives. Negative influence of explosives usage on safety issues of underground mine is a main cause of mining demands related to elimination of people from production area. Other aspects worth taking into consideration are drilling precision according to drilling pattern, blasting effectiveness, improvement of drilling tool reliability etc. In the literature different drilling support solutions are well-known in terms of positioning support systems, anti-jamming systems or cavity detection systems. For many years, teleoperation of drilling process is also developed. Unfortunately, available technologies have so far not fully met the industries expectation in hard rock. Mine of the future is expected to incorporate robotic system instead of current approaches. In this paper we present preliminary research related to robotization of drilling process and possibilities of its application in underground mine condition. A test rig has been proposed. To simulate drilling process several key assumptions have been accepted. As a result, algorithms for automation of drilling process have been proposed and tested on the test rig. Experiences gathered so far underline that there is a need for further developing robotic system for drilling process.
Effectiveness of a Web 2.0 Intervention to Increase Physical Activity in Real-World Settings: Randomized Ecological Trial

PubMed Central

Kolt, Gregory S; Caperchione, Cristina M; Savage, Trevor N; Rosenkranz, Richard R; Maeder, Anthony J; Van Itallie, Anetta; Tague, Rhys; Oldmeadow, Christopher; Mummery, W Kerry; Duncan, Mitch J

2017-01-01

Background The translation of Web-based physical activity intervention research into the real world is lacking and becoming increasingly important. Objective To compare usage and effectiveness, in real-world settings, of a traditional Web 1.0 Web-based physical activity intervention, providing limited interactivity, to a Web 2.0 Web-based physical activity intervention that includes interactive features, such as social networking (ie, status updates, online “friends,” and personalized profile pages), blogs, and Google Maps mash-ups. Methods Adults spontaneously signing up for the freely available 10,000 Steps website were randomized to the 10,000 Steps website (Web 1.0) or the newly developed WALK 2.0 website (Web 2.0). Physical activity (Active Australia Survey), quality of life (RAND 36), and body mass index (BMI) were assessed at baseline, 3 months, and 12 months. Website usage was measured continuously. Analyses of covariance were used to assess change over time in continuous outcome measures. Multiple imputation was used to deal with missing data. Results A total of 1328 participants completed baseline assessments. Only 3-month outcomes (224 completers) were analyzed due to high attrition at 12 months (77 completers). Web 2.0 group participants increased physical activity by 92.8 minutes per week more than those in the Web 1.0 group (95% CI 28.8-156.8; P=.005); their BMI values also decreased more (–1.03 kg/m2, 95% CI –1.65 to -0.41; P=.001). For quality of life, only the physical functioning domain score significantly improved more in the Web 2.0 group (3.6, 95% CI 1.7-5.5; P<.001). The time between the first and last visit to the website (3.57 vs 2.22 weeks; P<.001) and the mean number of days the website was visited (9.02 vs 5.71 days; P=.002) were significantly greater in the Web 2.0 group compared to the Web 1.0 group. The difference in time-to-nonusage attrition was not statistically significant between groups (Hazard Ratio=0.97, 95% CI 0.86-1.09; P=.59). Only 21.99% (292/1328) of participants (n=292 summed for both groups) were still using either website after 2 weeks and 6.55% (87/1328) were using either website after 10 weeks. Conclusions The website that provided more interactive and social features was more effective in improving physical activity in real-world conditions. While the Web 2.0 website was visited significantly more, both groups nevertheless displayed high nonusage attrition and low intervention engagement. More research is needed to examine the external validity and generalizability of Web-based physical activity interventions. Trial Registration Australian New Zealand Clinical Trials Registry: ACTRN12611000253909; https://anzctr.org.au /Trial/Registration/TrialReview.aspx?id=336588&isReview=true (Archived by WebCite at http://www.webcitation.org/6ufzw 2HxD) PMID:29133282

Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamel Boulos, Maged; Sanfilippo, Antonio P.; Corley, Courtney D.

2010-03-17

This paper explores techno-social predictive analytics (TPA) and related methods for Web “data mining” where users’ posts and queries are garnered from Social Web (“Web 2.0”) tools such as blogs, microblogging and social networking sites to form coherent representations of real-time health events. The paper includes a brief introduction to commonly used Social Web tools such as mashups and aggregators, and maps their exponential growth as an open architecture of participation for the masses and an emerging way to gain insight about people’s collective health status of whole populations. Several health related tool examples are described and demonstrated as practicalmore » means through which health professionals might create clear location specific pictures of epidemiological data such as flu outbreaks.« less
Explanation of fields used in the Alaska Resource Data File of mines, prospects, and mineral occurrences in Alaska

USGS Publications Warehouse

,

1996-01-01

Descriptions of mines, prospects, and mineral occurrences in the Alaska Resource Data File (ARDF) are published for individual U.S. Geological Survey 1:250,000 scale quadrangles in Alaska (see accompanying map) and are available for downloading from USGS World Wide Web site: http://www-rnrs-ak.wr.usgs.gov/ardf.These descriptions are divided into a number of fields which describe features of each mine, prospect, or mineral occurrence. These descriptions were complied from published literature and from unpublished reports and data from industry, the U.S. Bureau of Mines, and the U.S. Geological Survey and other sources. Compilation of this database is an ongoing process and each report is essentially a progress report. The authors of the individual quadrangle reports would appreciate any corrections or additional information that users may be able to contribute.
Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

PubMed Central

Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

2016-01-01

We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564
Visual mining geo-related data using pixel bar charts

NASA Astrophysics Data System (ADS)

Hao, Ming C.; Keim, Daniel A.; Dayal, Umeshwar; Wright, Peter; Schneidewind, Joern

2005-03-01

A common approach to analyze geo-related data is using bar charts or x-y plots. They are intuitive and easy to use. But important information often gets lost. In this paper, we introduce a new interactive visualization technique called Geo Pixel Bar Charts, which combines the advantages of Pixel Bar Charts and interactive maps. This technique allows analysts to visualize large amounts of spatial data without aggregation and shows the geographical regions corresponding to the spatial data attribute at the same time. In this paper, we apply Geo Pixel Bar Charts to visually mining sales transactions and Internet usage from different locations. Our experimental results show the effectiveness of this technique for providing data distribution and exceptions from the map.
COPRED: prediction of fold, GO molecular function and functional residues at the domain level.

PubMed

López, Daniel; Pazos, Florencio

2013-07-15

Only recently the first resources devoted to the functional annotation of proteins at the domain level started to appear. The next step is to develop specific methodologies for predicting function at the domain level based on these resources, and to implement them in web servers to be used by the community. In this work, we present COPRED, a web server for the concomitant prediction of fold, molecular function and functional sites at the domain level, based on a methodology for domain molecular function prediction and a resource of domain functional annotations previously developed and benchmarked. COPRED can be freely accessed at http://csbg.cnb.csic.es/copred. The interface works in all standard web browsers. WebGL (natively supported by most browsers) is required for the in-line preview and manipulation of protein 3D structures. The website includes a detailed help section and usage examples. pazos@cnb.csic.es.
Intelligent web image retrieval system

NASA Astrophysics Data System (ADS)

Hong, Sungyong; Lee, Chungwoo; Nah, Yunmook

2001-07-01

Recently, the web sites such as e-business sites and shopping mall sites deal with lots of image information. To find a specific image from these image sources, we usually use web search engines or image database engines which rely on keyword only retrievals or color based retrievals with limited search capabilities. This paper presents an intelligent web image retrieval system. We propose the system architecture, the texture and color based image classification and indexing techniques, and representation schemes of user usage patterns. The query can be given by providing keywords, by selecting one or more sample texture patterns, by assigning color values within positional color blocks, or by combining some or all of these factors. The system keeps track of user's preferences by generating user query logs and automatically add more search information to subsequent user queries. To show the usefulness of the proposed system, some experimental results showing recall and precision are also explained.
DelPhiForce web server: electrostatic forces and energy calculations and visualization.

PubMed

Li, Lin; Jia, Zhe; Peng, Yunhui; Chakravorty, Arghya; Sun, Lexuan; Alexov, Emil

2017-11-15

Electrostatic force is an essential component of the total force acting between atoms and macromolecules. Therefore, accurate calculations of electrostatic forces are crucial for revealing the mechanisms of many biological processes. We developed a DelPhiForce web server to calculate and visualize the electrostatic forces at molecular level. DelPhiForce web server enables modeling of electrostatic forces on individual atoms, residues, domains and molecules, and generates an output that can be visualized by VMD software. Here we demonstrate the usage of the server for various biological problems including protein-cofactor, domain-domain, protein-protein, protein-DNA and protein-RNA interactions. The DelPhiForce web server is available at: http://compbio.clemson.edu/delphi-force. delphi@clemson.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
ASCOT: a text mining-based web-service for efficient search and assisted creation of clinical trials

PubMed Central

2012-01-01

Clinical trials are mandatory protocols describing medical research on humans and among the most valuable sources of medical practice evidence. Searching for trials relevant to some query is laborious due to the immense number of existing protocols. Apart from search, writing new trials includes composing detailed eligibility criteria, which might be time-consuming, especially for new researchers. In this paper we present ASCOT, an efficient search application customised for clinical trials. ASCOT uses text mining and data mining methods to enrich clinical trials with metadata, that in turn serve as effective tools to narrow down search. In addition, ASCOT integrates a component for recommending eligibility criteria based on a set of selected protocols. PMID:22595088
ASCOT: a text mining-based web-service for efficient search and assisted creation of clinical trials.

PubMed

Korkontzelos, Ioannis; Mu, Tingting; Ananiadou, Sophia

2012-04-30

Clinical trials are mandatory protocols describing medical research on humans and among the most valuable sources of medical practice evidence. Searching for trials relevant to some query is laborious due to the immense number of existing protocols. Apart from search, writing new trials includes composing detailed eligibility criteria, which might be time-consuming, especially for new researchers. In this paper we present ASCOT, an efficient search application customised for clinical trials. ASCOT uses text mining and data mining methods to enrich clinical trials with metadata, that in turn serve as effective tools to narrow down search. In addition, ASCOT integrates a component for recommending eligibility criteria based on a set of selected protocols.
Public health, GIS, and the internet.

PubMed

Croner, Charles M

2003-01-01

Internet access and use of georeferenced public health information for GIS application will be an important and exciting development for the nation's Department of Health and Human Services and other health agencies in this new millennium. Technological progress toward public health geospatial data integration, analysis, and visualization of space-time events using the Web portends eventual robust use of GIS by public health and other sectors of the economy. Increasing Web resources from distributed spatial data portals and global geospatial libraries, and a growing suite of Web integration tools, will provide new opportunities to advance disease surveillance, control, and prevention, and insure public access and community empowerment in public health decision making. Emerging supercomputing, data mining, compression, and transmission technologies will play increasingly critical roles in national emergency, catastrophic planning and response, and risk management. Web-enabled public health GIS will be guided by Federal Geographic Data Committee spatial metadata, OpenGIS Web interoperability, and GML/XML geospatial Web content standards. Public health will become a responsive and integral part of the National Spatial Data Infrastructure.
Geographical Assesment of Results from Preventing the Parameter Tampering in a Web Application

NASA Astrophysics Data System (ADS)

Menemencioğlu, O.; Orak, İ. M.

2017-11-01

The improving usage of internet and attained intensity of usage rate attracts the malicious in around the world. Many preventing systems are offered by researchers with different infrastructures. Very effective preventing system was proposed most recently by the researchers. The previously offered mechanism has prevented the multi-type vulnerabilities after preventing system was put into use. The attack attempts have been recorded. The researchers analysed the results geographically, discussed the obtained results and made some inference of the results. Our assessments show that the geographical findings can be used to retrieve some implication and build an infrastructure which prevents the vulnerabilities by location.
The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews

PubMed Central

Zhang, Kunpeng

2016-01-01

Background Many Web-based health care platforms allow patients to evaluate physicians by posting open-end textual reviews based on their experiences. These reviews are helpful resources for other patients to choose high-quality doctors, especially in countries like China where no doctor referral systems exist. Analyzing such a large amount of user-generated content to understand the voice of health consumers has attracted much attention from health care providers and health care researchers. Objective The aim of this paper is to automatically extract hidden topics from Web-based physician reviews using text-mining techniques to examine what Chinese patients have said about their doctors and whether these topics differ across various specialties. This knowledge will help health care consumers, providers, and researchers better understand this information. Methods We conducted two-fold analyses on the data collected from the “Good Doctor Online” platform, the largest online health community in China. First, we explored all reviews from 2006-2014 using descriptive statistics. Second, we applied the well-known topic extraction algorithm Latent Dirichlet Allocation to more than 500,000 textual reviews from over 75,000 Chinese doctors across four major specialty areas to understand what Chinese health consumers said online about their doctor visits. Results On the “Good Doctor Online” platform, 112,873 out of 314,624 doctors had been reviewed at least once by April 11, 2014. Among the 772,979 textual reviews, we chose to focus on four major specialty areas that received the most reviews: Internal Medicine, Surgery, Obstetrics/Gynecology and Pediatrics, and Chinese Traditional Medicine. Among the doctors who received reviews from those four medical specialties, two-thirds of them received more than two reviews and in a few extreme cases, some doctors received more than 500 reviews. Across the four major areas, the most popular topics reviewers found were the experience of finding doctors, doctors’ technical skills and bedside manner, general appreciation from patients, and description of various symptoms. Conclusions To the best of our knowledge, our work is the first study using an automated text-mining approach to analyze a large amount of unstructured textual data of Web-based physician reviews in China. Based on our analysis, we found that Chinese reviewers mainly concentrate on a few popular topics. This is consistent with the goal of Chinese online health platforms and demonstrates the health care focus in China’s health care system. Our text-mining approach reveals a new research area on how to use big data to help health care providers, health care administrators, and policy makers hear patient voices, target patient concerns, and improve the quality of care in this age of patient-centered care. Also, on the health care consumer side, our text mining technique helps patients make more informed decisions about which specialists to see without reading thousands of reviews, which is simply not feasible. In addition, our comparison analysis of Web-based physician reviews in China and the United States also indicates some cultural differences. PMID:27165558
The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews.

PubMed

Hao, Haijing; Zhang, Kunpeng

2016-05-10

Many Web-based health care platforms allow patients to evaluate physicians by posting open-end textual reviews based on their experiences. These reviews are helpful resources for other patients to choose high-quality doctors, especially in countries like China where no doctor referral systems exist. Analyzing such a large amount of user-generated content to understand the voice of health consumers has attracted much attention from health care providers and health care researchers. The aim of this paper is to automatically extract hidden topics from Web-based physician reviews using text-mining techniques to examine what Chinese patients have said about their doctors and whether these topics differ across various specialties. This knowledge will help health care consumers, providers, and researchers better understand this information. We conducted two-fold analyses on the data collected from the "Good Doctor Online" platform, the largest online health community in China. First, we explored all reviews from 2006-2014 using descriptive statistics. Second, we applied the well-known topic extraction algorithm Latent Dirichlet Allocation to more than 500,000 textual reviews from over 75,000 Chinese doctors across four major specialty areas to understand what Chinese health consumers said online about their doctor visits. On the "Good Doctor Online" platform, 112,873 out of 314,624 doctors had been reviewed at least once by April 11, 2014. Among the 772,979 textual reviews, we chose to focus on four major specialty areas that received the most reviews: Internal Medicine, Surgery, Obstetrics/Gynecology and Pediatrics, and Chinese Traditional Medicine. Among the doctors who received reviews from those four medical specialties, two-thirds of them received more than two reviews and in a few extreme cases, some doctors received more than 500 reviews. Across the four major areas, the most popular topics reviewers found were the experience of finding doctors, doctors' technical skills and bedside manner, general appreciation from patients, and description of various symptoms. To the best of our knowledge, our work is the first study using an automated text-mining approach to analyze a large amount of unstructured textual data of Web-based physician reviews in China. Based on our analysis, we found that Chinese reviewers mainly concentrate on a few popular topics. This is consistent with the goal of Chinese online health platforms and demonstrates the health care focus in China's health care system. Our text-mining approach reveals a new research area on how to use big data to help health care providers, health care administrators, and policy makers hear patient voices, target patient concerns, and improve the quality of care in this age of patient-centered care. Also, on the health care consumer side, our text mining technique helps patients make more informed decisions about which specialists to see without reading thousands of reviews, which is simply not feasible. In addition, our comparison analysis of Web-based physician reviews in China and the United States also indicates some cultural differences.
Mining-related metals in terrestrial food webs of the upper Clark Fork River basin

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pastorok, R.A.; LaTier, A.J.; Butcher, M.K.

1994-12-31

Fluvial deposits of tailings and other mining-related waste in selected riparian habitats of the Upper Clark Fork River basin (Montana) have resulted in metals enriched soils. The significance of metals exposure to selected wildlife species was evaluated by measuring tissue residues of metals (arsenic, cadmium, copper, lead, zinc) in key dietary species, including dominant grasses (tufted hair grass and redtop), willows, alfalfa, barley, invertebrates (grasshoppers, spiders, and beetles), and deer mice. Average metals concentrations in grasses, invertebrates, and deer mice collected from tailings-affected sites were elevated relative to reference to reference levels. Soil-tissue bioconcentration factors for grasses and invertebrates weremore » generally lower than expected based on the range of values in the literature, indicating the reduced bioavailability of metals from mining waste. In general, metals concentrations in willows, alfalfa, and barley were not elevated above reference levels. Using these data and plausible assumptions for other exposure parameters for white-tailed deer, red fox, and American kestrel, metals intake was estimated for soil and diet ingestion pathways. Comparisons of exposure estimates with toxicity reference values indicated that the elevated concentrations of metals in key food web species do not pose a significant risk to wildlife.« less
CrosstalkNet: A Visualization Tool for Differential Co-expression Networks and Communities.

PubMed

Manem, Venkata; Adam, George Alexandru; Gruosso, Tina; Gigoux, Mathieu; Bertos, Nicholas; Park, Morag; Haibe-Kains, Benjamin

2018-04-15

Variations in physiological conditions can rewire molecular interactions between biological compartments, which can yield novel insights into gain or loss of interactions specific to perturbations of interest. Networks are a promising tool to elucidate intercellular interactions, yet exploration of these large-scale networks remains a challenge due to their high dimensionality. To retrieve and mine interactions, we developed CrosstalkNet, a user friendly, web-based network visualization tool that provides a statistical framework to infer condition-specific interactions coupled with a community detection algorithm for bipartite graphs to identify significantly dense subnetworks. As a case study, we used CrosstalkNet to mine a set of 54 and 22 gene-expression profiles from breast tumor and normal samples, respectively, with epithelial and stromal compartments extracted via laser microdissection. We show how CrosstalkNet can be used to explore large-scale co-expression networks and to obtain insights into the biological processes that govern cross-talk between different tumor compartments. Significance: This web application enables researchers to mine complex networks and to decipher novel biological processes in tumor epithelial-stroma cross-talk as well as in other studies of intercompartmental interactions. Cancer Res; 78(8); 2140-3. ©2018 AACR . ©2018 American Association for Cancer Research.
Forget the hype or reality. Big data presents new opportunities in Earth Science.

NASA Astrophysics Data System (ADS)

Lee, T. J.

2015-12-01

Earth science is arguably one of the most mature science discipline which constantly acquires, curates, and utilizes a large volume of data with diverse variety. We deal with big data before there is big data. For example, while developing the EOS program in the 1980s, the EOS data and information system (EOSDIS) was developed to manage the vast amount of data acquired by the EOS fleet of satellites. EOSDIS continues to be a shining example of modern science data systems in the past two decades. With the explosion of internet, the usage of social media, and the provision of sensors everywhere, the big data era has bring new challenges. First, Goggle developed the search algorithm and a distributed data management system. The open source communities quickly followed up and developed Hadoop file system to facility the map reduce workloads. The internet continues to generate tens of petabytes of data every day. There is a significant shortage of algorithms and knowledgeable manpower to mine the data. In response, the federal government developed the big data programs that fund research and development projects and training programs to tackle these new challenges. Meanwhile, comparatively to the internet data explosion, Earth science big data problem has become quite small. Nevertheless, the big data era presents an opportunity for Earth science to evolve. We learned about the MapReduce algorithms, in memory data mining, machine learning, graph analysis, and semantic web technologies. How do we apply these new technologies to our discipline and bring the hype to Earth? In this talk, I will discuss how we might want to apply some of the big data technologies to our discipline and solve many of our challenging problems. More importantly, I will propose new Earth science data system architecture to enable new type of scientific inquires.
Spectral reflectance properties (0.4-2.5 um) of secondary Fe-oxide, Fe-hydroxide, and Fe-sulfate-hydrate minerals associated with sulfide-bearing mine waste

USGS Publications Warehouse

Crowley, J.K.; Williams, D.E.; Hammarstrom1, J.M.; Piatak, N.; Mars, J.C.; Chou, I-Ming

2006-01-01

Fifteen Fe-oxide, Fe-hydroxide, and Fe-sulphate-hydrate mineral species commonly associated with sulphide bearing mine wastes were characterized by using X-ray powder diffraction and scanning electron microscope methods. Diffuse reflectance spectra of the samples show diagnostic absorption features related to electronic processes involving ferric and/or ferrous iron, and to vibrational processes involving water and hydroxyl ions. Such spectral features enable field and remote sensing based studies of the mineral distributions. Because secondary minerals are sensitive indicators of pH, Eh, relative humidity, and other environmental conditions, spectral mapping of these minerals promises to have important applications to mine waste remediation studies. This report releases digital (ascii) spectra (spectral_data_files.zip) of the fifteen mineral samples to facilitate usage of the data with spectral libraries and spectral analysis software. The spectral data are provided in a two-column format listing wavelength (in micrometers) and reflectance, respectively.
77 FR 1728 - Privacy Act of 1974; Publication of Five New Systems of Records; Amendments to Five Existing...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-01-11

... assistance to correspondents; to use Web site based programs; to provide usage statistics associated with the... of individuals for surveys. Among other things, maintaining the names, addresses, etc. of individuals... information in the system. Safeguards: Access by authorized personnel only. Computer security safeguards are...
A Case Study on TRICARE Online Web-enabled Appointing: Improving Utilization Rates at Navy Medical Treatment Facilities

DTIC Science & Technology

2009-10-20

Low usage volume raised concerns about the effectiveness of TOL. In 2004, the eHealth Division of TMA Information Management conducted a study to...Case Study 31 (Version 15.8). Falls Church, VA: Department of Defense, TRICARE Management Activity, Information Management eHealth Division
Web Based Nasal Surgical Simulator Using VRML and Java.

PubMed

Yuan-Yuan, Zhao; Guo-Hong, Zhou; De-Rong, Ye

2005-01-01

This paper describes a nasal surgical simulator that we have designed and implemented to run on the WWW using VRML and Java. In this paper we concentrate on implementation details such as collision detection and the usage of our simulator. At last, we discuss the advantage and disadvantave of the simulator.

Electronic Reading and Digital Library Technologies: Understanding Learner Expectation and Usage Intent for Mobile Learning

ERIC Educational Resources Information Center

Hyman, Jack A.; Moser, Mary T.; Segala, Laura N.

2014-01-01

Mobile information technology is changing the education landscape by offering learners the opportunity to engage in asynchronous, ubiquitous instruction. While there is a proliferation of mobile content management systems being developed for the mobile Web and stand-alone mobile applications, few studies have addressed learner expectations and…
Development of Web-Based Examination System Using Open Source Programming Model

ERIC Educational Resources Information Center

Abass, Olalere A.; Olajide, Samuel A.; Samuel, Babafemi O.

2017-01-01

The traditional method of assessment (examination) is often characterized by examination questions leakages, human errors during marking of scripts and recording of scores. The technological advancement in the field of computer science has necessitated the need for computer usage in majorly all areas of human life and endeavors, education sector…
Success Stories 2000. The Success Stories Series.

ERIC Educational Resources Information Center

Southwest Educational Development Laboratory, Austin, TX. National Center for the Dissemination of Disability Research.

The booklet is the first issue of a series that will highlight a variety of successes realized by the National Institute on Disability and Rehabilitation Research (NIDRR) grantees in their dissemination and utilization efforts. This issue focuses upon the status of World Wide Web (WWW) usage among NIDRR grantees. Information is provided on the…
The Changing Role of the Educational Video in Higher Distance Education

ERIC Educational Resources Information Center

Laaser, Wolfram; Toloza, Eduardo A.

2017-01-01

The article argues that the ongoing usage of audio visual media is falling behind in terms of educational quality compared to prior achievements in the history of distance education. After reviewing some important steps and experiences of audio visual digital media development, we analyse predominant presentation formats on the Web. Special focus…
Identifying University Professors' Information Needs in the Challenging Environment of Information and Communication Technologies

ERIC Educational Resources Information Center

Jankowska, Maria Anna

2004-01-01

A Web-based survey was conducted to determine usage of information and communication technologies by faculty for research and teaching. Respondents expressed their preferences regarding library electronic materials and services. Survey results highlighted solutions to help faculty in this era of information overload and rapid development of…
Student Usage of Instructional Technologies: Differences in Online Learning Styles

ERIC Educational Resources Information Center

Ballenger, Robert M.; Garvis, Dennis M.

2010-01-01

We contribute to the MIS education literature by empirically examining Web log server data generated by undergraduate students enrolled in multiple sections of a MIS course where an online Learning Management System (LMS) was used to complement a traditional classroom environment. We identify online learning styles by investigating differences in…
WISE-MD usage among millennial medical students.

PubMed

Phitayakorn, Roy; Nick, Michael W; Alseidi, Adnan; Lind, David Scott; Sudan, Ranjan; Isenberg, Gerald; Capella, Jeannette; Hopkins, Mary A; Petrusa, Emil R

2015-01-01

E-learning is increasingly common in undergraduate medical education. Internet-based multimedia materials should be designed with millennial learner utilization preferences in mind for maximal impact. Medical students used all 20 Web Initiative for Surgical Education of Medical Doctors modules from July 1, 2013 to October 1, 2013. Data were analyzed for topic frequency, time and week day, and access to questions. Three thousand five hundred eighty-seven students completed 35,848 modules. Students accessed modules for average of 51 minutes. Most frequent use occurred on Sunday (23.1%), Saturday (15.4%), and Monday (14.3%). Friday had the least use (8.2%). A predominance of students accessed the modules between 7 and 10 PM (34.4%). About 80.4% of students accessed questions for at least one module. They completed an average of 40 ± 30 of the questions. Only 827 students (2.3%) repeated the questions. Web Initiative for Surgical Education of Medical Doctors has peak usage during the weekend and evenings. Most frequently used modules reflect core surgical problems. Multiple factors influence the manner module questions are accessed. Copyright © 2015 Elsevier Inc. All rights reserved.
Databases and Web Tools for Cancer Genomics Study

PubMed Central

Yang, Yadong; Dong, Xunong; Xie, Bingbing; Ding, Nan; Chen, Juan; Li, Yongjun; Zhang, Qian; Qu, Hongzhu; Fang, Xiangdong

2015-01-01

Publicly-accessible resources have promoted the advance of scientific discovery. The era of genomics and big data has brought the need for collaboration and data sharing in order to make effective use of this new knowledge. Here, we describe the web resources for cancer genomics research and rate them on the basis of the diversity of cancer types, sample size, omics data comprehensiveness, and user experience. The resources reviewed include data repository and analysis tools; and we hope such introduction will promote the awareness and facilitate the usage of these resources in the cancer research community. PMID:25707591
Usage Analysis for the Identification of Research Trends in Digital Libraries; Keepers of the Crumbling Culture: What Digital Preservation Can Learn from Library History; Patterns of Journal Use by Scientists through Three Evolutionary Phases; Developing a Content Management System-Based Web Site; Exploring Charging Models for Digital Cultural Heritage in Europe; Visions: The Academic Library in 2012.

ERIC Educational Resources Information Center

Bollen, Johan; Vemulapalli, Soma Sekara; Xu, Weining; Luce, Rick; Marcum, Deanna; Friedlander, Amy; Tenopir, Carol; Grayson, Matt; Zhang, Yan; Ebuen, Mercy; King, Donald W.; Boyce, Peter; Rogers, Clare; Kirriemuir, John; Tanner, Simon; Deegan, Marilyn; Marcum, James W.

2003-01-01

Includes six articles that discuss use analysis and research trends in digital libraries; library history and digital preservation; journal use by scientists; a content management system-based Web site for higher education in the United Kingdom; cost studies for transitioning to digitized collections in European cultural institutions; and the…
[Utility of Smartphone in Home Care Medicine - First Trial].

PubMed

Takeshige, Toshiyuki; Hirano, Chiho; Nakagawa, Midori; Yoshioka, Rentaro

2015-12-01

The use of video calls for home care can reduce anxiety and offer patients peace of mind. The most suitable terminals at facilities to support home care have been iPad Air and iPhone with FaceTime software. However, usage has been limited to specific terminals. In order to eliminate the need for special terminals and software, we have developed a program that has been customized to meet the needs of facilities using Web Real Time Communication(WebRTC)in cooperation with the University of Aizu. With this software, video calls can accommodate the large number of home care patients.
Monitoring the performance of the Southern African Large Telescope

NASA Astrophysics Data System (ADS)

Hettlage, Christian; Coetzee, Chris; Väisänen, Petri; Romero Colmenero, Encarni; Crawford, Steven M.; Kotze, Paul; Rabe, Paul; Hulme, Stephen; Brink, Janus; Maartens, Deneys; Browne, Keith; Strydom, Ockert; De Bruyn, David

2016-07-01

The efficient operation of a telescope requires awareness of its performance on a daily and long-term basis. This paper outlines the Fault Tracker, WebSAMMI and the Dashboard used by the Southern African Large Telescope (SALT) to achieve this aim. Faults are mostly logged automatically, but the Fault Tracker allows users to add and edit faults. The SALT Astronomer and SALT Operator record weather conditions and telescope usage with WebSAMMI. Various efficiency metrics are shown for different time periods on the Dashboard. A kiosk mode for displaying on a public screen is included. Possible applications for other telescopes are discussed.
Monitoring of large-scale federated data storage: XRootD and beyond

NASA Astrophysics Data System (ADS)

Andreeva, J.; Beche, A.; Belov, S.; Diguez Arias, D.; Giordano, D.; Oleynik, D.; Petrosyan, A.; Saiz, P.; Tadel, M.; Tuckett, D.; Vukotic, I.

2014-06-01

The computing models of the LHC experiments are gradually moving from hierarchical data models with centrally managed data pre-placement towards federated storage which provides seamless access to data files independently of their location and dramatically improve recovery due to fail-over mechanisms. Construction of the data federations and understanding the impact of the new approach to data management on user analysis requires complete and detailed monitoring. Monitoring functionality should cover the status of all components of the federated storage, measuring data traffic and data access performance, as well as being able to detect any kind of inefficiencies and to provide hints for resource optimization and effective data distribution policy. Data mining of the collected monitoring data provides a deep insight into new usage patterns. In the WLCG context, there are several federations currently based on the XRootD technology. This paper will focus on monitoring for the ATLAS and CMS XRootD federations implemented in the Experiment Dashboard monitoring framework. Both federations consist of many dozens of sites accessed by many hundreds of clients and they continue to grow in size. Handling of the monitoring flow generated by these systems has to be well optimized in order to achieve the required performance. Furthermore, this paper demonstrates the XRootD monitoring architecture is sufficiently generic to be easily adapted for other technologies, such as HTTP/WebDAV dynamic federations.
Assessing usage patterns of electronic clinical documentation templates.

PubMed

Vawdrey, David K

2008-11-06

Many vendors of electronic medical records support structured and free-text entry of clinical documents using configurable templates. At a healthcare institution comprising two large academic medical centers, a documentation management data mart and a custom, Web-accessible business intelligence application were developed to track the availability and usage of electronic documentation templates. For each medical center, template availability and usage trends were measured from November 2007 through February 2008. By February 2008, approximately 65,000 electronic notes were authored per week on the two campuses. One site had 934 available templates, with 313 being used to author at least one note. The other site had 765 templates, of which 480 were used. The most commonly used template at both campuses was a free text note called "Miscellaneous Nursing Note," which accounted for 33.3% of total documents generated at one campus and 15.2% at the other.
Factors influencing the use of a Web-based application for supporting the self-care of patients with type 2 diabetes: a longitudinal study.

PubMed

Nijland, Nicol; van Gemert-Pijnen, Julia E W C; Kelders, Saskia M; Brandenburg, Bart J; Seydel, Erwin R

2011-09-30

The take-up of eHealth applications in general is still rather low and user attrition is often high. Only limited information is available about the use of eHealth technologies among specific patient groups. The aim of this study was to explore the factors that influence the initial and long-term use of a Web-based application (DiabetesCoach) for supporting the self-care of patients with type 2 diabetes. A mixed-methods research design was used for a process analysis of the actual usage of the Web application over a 2-year period and to identify user profiles. Research instruments included log files, interviews, usability tests, and a survey. The DiabetesCoach was predominantly used for interactive features like online monitoring, personal data, and patient-nurse email contact. It was the continuous, personal feedback that particularly appealed to the patients; they felt more closely monitored by their nurse and encouraged to play a more active role in self-managing their disease. Despite the positive outcomes, usage of the Web application was hindered by low enrollment and nonusage attrition. The main barrier to enrollment had to do with a lack of access to the Internet (146/226, 65%). Although 68% (34/50) of the enrollees were continuous users, of whom 32% (16/50) could be defined as hardcore users (highly active), the remaining 32% (16/50) did not continue using the Web application for the full duration of the study period. Barriers to long-term use were primarily due to poor user-friendliness of the Web application (the absence of "push" factors or reminders) and selection of the "wrong" users; the well-regulated patients were not the ones who could benefit the most from system use because of a ceiling effect. Patients with a greater need for care seemed to be more engaged in long-term use; highly active users were significantly more often medication users than low/inactive users (P = .005) and had a longer diabetes duration (P = .03). Innovations in health care will diffuse more rapidly when technology is employed that is simple to use and has applicable components for interactivity. This would foresee the patients' need for continuous and personalized feedback, in particular for patients with a greater need for care. From this study several factors appear to influence increased use of eHealth technologies: (1) avoiding selective enrollment, (2) making use of participatory design methods, and (3) developing push factors for persistence. Further research should focus on the causal relationship between using the system's features and actual usage, as such a view would provide important evidence on how specific technology features can engage and captivate users.
An Extraction Method of an Informative DOM Node from a Web Page by Using Layout Information

NASA Astrophysics Data System (ADS)

Tsuruta, Masanobu; Masuyama, Shigeru

We propose an informative DOM node extraction method from a Web page for preprocessing of Web content mining. Our proposed method LM uses layout data of DOM nodes generated by a generic Web browser, and the learning set consists of hundreds of Web pages and the annotations of informative DOM nodes of those Web pages. Our method does not require large scale crawling of the whole Web site to which the target Web page belongs. We design LM so that it uses the information of the learning set more efficiently in comparison to the existing method that uses the same learning set. By experiments, we evaluate the methods obtained by combining one that consists of the method for extracting the informative DOM node both the proposed method and the existing methods, and the existing noise elimination methods: Heur removes advertisements and link-lists by some heuristics and CE removes the DOM nodes existing in the Web pages in the same Web site to which the target Web page belongs. Experimental results show that 1) LM outperforms other methods for extracting the informative DOM node, 2) the combination method (LM, {CE(10), Heur}) based on LM (precision: 0.755, recall: 0.826, F-measure: 0.746) outperforms other combination methods.
Mining Specific and General Features in Both Positive and Negative Relevance Feedback. QUT E-Discovery Lab at the TREC󈧍 Relevance Feedback Track

DTIC Science & Technology

2009-11-01

relevance feedback algo- rithm. Four methods, εMap [1], MapA , P10A, and StatAP [2], were used in the track to measure the performance of Phase 2 runs...εMap and StatAP were applied to the runs us- ing the testing set of only ClueWeb09 Category-B, whereas MapA and P10A were applied to those using the...whole ClueWeb09 English set. Because our experiments were based on only ClueWeb09 Category-B, measuring our per- formance by MapA and P10A might not
The Comprehensive Microbial Resource

PubMed Central

Peterson, Jeremy D.; Umayam, Lowell A.; Dickinson, Tanja; Hickey, Erin K.; White, Owen

2001-01-01

One challenge presented by large-scale genome sequencing efforts is effective display of uniform information to the scientific community. The Comprehensive Microbial Resource (CMR) contains robust annotation of all complete microbial genomes and allows for a wide variety of data retrievals. The bacterial information has been placed on the Web at http://www.tigr.org/CMR for retrieval using standard web browsing technology. Retrievals can be based on protein properties such as molecular weight or hydrophobicity, GC-content, functional role assignments and taxonomy. The CMR also has special web-based tools to allow data mining using pre-run homology searches, whole genome dot-plots, batch downloading and traversal across genomes using a variety of datatypes. PMID:11125067
Improving entrepreneurial opportunity recognition through web content analytics

NASA Astrophysics Data System (ADS)

Bakar, Muhamad Shahbani Abu; Azmi, Azwiyati

2017-10-01

The ability to recognize and develop an opportunity into a venture defines an entrepreneur. Research in opportunity recognition has been robust and focuses more on explaining the processes involved in opportunity recognition. Factors such as prior knowledge, cognitive and creative capabilities are shown to affect opportunity recognition in entrepreneurs. Prior knowledge in areas such as customer problems, ways to serve the market, and technology has been shows in various studies to be a factor that facilitates entrepreneurs to identify and recognize opportunities. Findings from research also shows that experienced entrepreneurs search and scan for information to discover opportunities. Searching and scanning for information has also been shown to help novice entrepreneurs who lack prior knowledge to narrow this gap and enable them to better identify and recognize opportunities. There is less focus in research on finding empirically proven techniques and methods to develop and enhance opportunity recognition in student entrepreneurs. This is important as the country pushes for more graduate entrepreneurs that can drive the economy. This paper aims to discuss Opportunity Recognition Support System (ORSS), an information support system to help especially student entrepreneurs in identifying and recognizing business opportunities. The ORSS aims to provide the necessary knowledge to student entrepreneurs to be able to better identify and recognize opportunities. Applying design research, theories in opportunity recognition are applied to identify the requirements for the support system and the requirements in turn dictate the design of the support system. The paper proposes the use of web content mining and analytics as two core components and techniques for the support system. Web content mining can mine the vast knowledge repositories available on the internet and analytics can provide entrepreneurs with further insights into the information needed to recognize opportunities in a given market or industry.
CFL3D Version 6.4-General Usage and Aeroelastic Analysis

NASA Technical Reports Server (NTRS)

Bartels, Robert E.; Rumsey, Christopher L.; Biedron, Robert T.

2006-01-01

This document contains the course notes on the computational fluid dynamics code CFL3D version 6.4. It is intended to provide from basic to advanced users the information necessary to successfully use the code for a broad range of cases. Much of the course covers capability that has been a part of previous versions of the code, with material compiled from a CFL3D v5.0 manual and from the CFL3D v6 web site prior to the current release. This part of the material is presented to users of the code not familiar with computational fluid dynamics. There is new capability in CFL3D version 6.4 presented here that has not previously been published. There are also outdated features no longer used or recommended in recent releases of the code. The information offered here supersedes earlier manuals and updates outdated usage. Where current usage supersedes older versions, notation of that is made. These course notes also provides hints for usage, code installation and examples not found elsewhere.
Methodology of development of a Delirium clinical application and initial feasibility results.

PubMed

Zhang, Melvyn W B; Ho, Roger C M; Sockalingam, Sanjeev

2015-01-01

Delirium is a highly prevalent condition in the hospital settings, with prevalence rates ranging from 6% to 56%, based on previous studies. A recent review provides evidence for the need of practice tools at the point of care to increase impact and to improve patient outcomes related to delirium care. The major challenge is to help maintain the skill-sets required by clinicians and allied healthcare workers over time. There have been massive advancements in smartphone technologies, as well as several papers being published recently about how clinicians could be application developers. The following study will serve to illustrate how the authors made use of the latest advances in application creation technologies in designing a Delirium education application, containing protocols that are appropriate to their healthcare setting. The study in itself will serve as a pilot project aimed at implementing smartphone technologies in delirium education, to determine its feasibility as well as user's perspectives towards such an implementation. The Delirium UHN Application was developed between the months of February 2013 to September 2014. Making use of the methodologies shared by Zhang MWB et al., the authors embarked on the development of the web-based and the native application. The web-based application was developed using HTML5 programming language and with the aid of an online application builder. Psychiatry residents and allied health professionals, at the University of Toronto were recruited to help evaluate the pilot web-based version of the application. Since the introduction of the web-based application during the delirium awareness week, there has been a total of 1165 unique access to the online web-based application. Of significance, there is a shift in the confidence levels of the participants with regards to the management of delirium after using the application. The majority of the participants (44.0%) reported being moderately comfortable with managing delirium prior to the usage of the application, but this changed after the implementation of the application, with 39.0% reporting being very confident and 44.0% being extremely confident about managing delirium after using the application. 69.0% of the participants also perceived the smartphone application to be of use to their clinical care for delirious patients. This study is one of the first to demonstrate the potential usage of smartphone innovations in delirium education. The current study demonstrated the added feasibility of smartphone applications, and demonstrated that users perceived that they are more abled with managing delirium after the usage of the smartphone application.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.