Sample records for global gene mining

  1. Systematic Association of Genes to Phenotypes by Genome and Literature Mining

    PubMed Central

    Jensen, Lars J; Perez-Iratxeta, Carolina; Kaczanowski, Szymon; Hooper, Sean D; Andrade, Miguel A

    2005-01-01

    One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene–phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases. PMID:15799710

  2. Microarray data and gene expression statistics for Saccharomyces cerevisiae exposed to simulated asbestos mine drainage.

    PubMed

    Driscoll, Heather E; Murray, Janet M; English, Erika L; Hunter, Timothy C; Pivarski, Kara; Dolci, Elizabeth D

    2017-08-01

    Here we describe microarray expression data (raw and normalized), experimental metadata, and gene-level data with expression statistics from Saccharomyces cerevisiae exposed to simulated asbestos mine drainage from the Vermont Asbestos Group (VAG) Mine on Belvidere Mountain in northern Vermont, USA. For nearly 100 years (between the late 1890s and 1993), chrysotile asbestos fibers were extracted from serpentinized ultramafic rock at the VAG Mine for use in construction and manufacturing industries. Studies have shown that water courses and streambeds nearby have become contaminated with asbestos mine tailings runoff, including elevated levels of magnesium, nickel, chromium, and arsenic, elevated pH, and chrysotile asbestos-laden mine tailings, due to leaching and gradual erosion of massive piles of mine waste covering approximately 9 km 2 . We exposed yeast to simulated VAG Mine tailings leachate to help gain insight on how eukaryotic cells exposed to VAG Mine drainage may respond in the mine environment. Affymetrix GeneChip® Yeast Genome 2.0 Arrays were utilized to assess gene expression after 24-h exposure to simulated VAG Mine tailings runoff. The chemistry of mine-tailings leachate, mine-tailings leachate plus yeast extract peptone dextrose media, and control yeast extract peptone dextrose media is also reported. To our knowledge this is the first dataset to assess global gene expression patterns in a eukaryotic model system simulating asbestos mine tailings runoff exposure. Raw and normalized gene expression data are accessible through the National Center for Biotechnology Information Gene Expression Omnibus (NCBI GEO) Database Series GSE89875 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE89875).

  3. Text Mining in Cancer Gene and Pathway Prioritization

    PubMed Central

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes. PMID:25392685

  4. Text mining in cancer gene and pathway prioritization.

    PubMed

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes.

  5. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    PubMed

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  6. An improved Pearson's correlation proximity-based hierarchical clustering for mining biological association between genes.

    PubMed

    Booma, P M; Prabhakaran, S; Dhanalakshmi, R

    2014-01-01

    Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality.

  7. An Improved Pearson's Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

    PubMed Central

    Booma, P. M.; Prabhakaran, S.; Dhanalakshmi, R.

    2014-01-01

    Microarray gene expression datasets has concerned great awareness among molecular biologist, statisticians, and computer scientists. Data mining that extracts the hidden and usual information from datasets fails to identify the most significant biological associations between genes. A search made with heuristic for standard biological process measures only the gene expression level, threshold, and response time. Heuristic search identifies and mines the best biological solution, but the association process was not efficiently addressed. To monitor higher rate of expression levels between genes, a hierarchical clustering model was proposed, where the biological association between genes is measured simultaneously using proximity measure of improved Pearson's correlation (PCPHC). Additionally, the Seed Augment algorithm adopts average linkage methods on rows and columns in order to expand a seed PCPHC model into a maximal global PCPHC (GL-PCPHC) model and to identify association between the clusters. Moreover, a GL-PCPHC applies pattern growing method to mine the PCPHC patterns. Compared to existing gene expression analysis, the PCPHC model achieves better performance. Experimental evaluations are conducted for GL-PCPHC model with standard benchmark gene expression datasets extracted from UCI repository and GenBank database in terms of execution time, size of pattern, significance level, biological association efficiency, and pattern quality. PMID:25136661

  8. Documenting the global impacts of beach sand mining

    NASA Astrophysics Data System (ADS)

    Young, R.; Griffith, A.

    2009-04-01

    For centuries, beach sand has been mined for use as aggregate in concrete, for heavy minerals, and for construction fill. The global extent and impact of this phenomenon has gone relatively unnoticed by academics, NGOs, and major news sources. Most reports of sand mining activities are found at the very local scale (if the mining is ever documented at all). Yet, sand mining in many localities has resulted in the complete destruction of beach (and related) ecosystems along with severe impacts to coastal protection and tourism. The Program for the Study of Developed Shorelines at Western Carolina University and Beachcare.org have initiated the construction of a global database of beach sand mining activities. The database is being built through a combination of site visits and through the data mining of media resources, peer reviewed papers, and reports from private and governmental entities. Currently, we have documented sand mining in 35 countries on 6 continents representing the removal of millions of cubic meters of sand. Problems extend from Asia where critical infrastructure has been disrupted by sand mining to the Caribbean where policy reform has swiftly followed a highly publicized theft of sand. The Program for the Study of Developed Shorelines recently observed extensive sand mining in Morocco at the regional scale. Tens of kilometers of beach have been stripped of sand and the mining continues southward reducing hope of a thriving tourism-based economy. Problems caused by beach sand mining include: destruction of natural beaches and the ecosystems they protect (e.g. dunes, wetlands), habitat loss for globally important species (e.g. turtles, shorebirds), destruction of nearshore marine ecosystems, increased shoreline erosion rates, reduced protection from storms, tsunamis, and wave events, and economic losses through tourist abandonment and loss of coastal aesthetics. The threats posed by sand mining are made even more critical given the prospect of a

  9. Sustainable Bauxite Mining — A Global Perspective

    NASA Astrophysics Data System (ADS)

    Wagner, Christian

    In 2008 the International Aluminium Institute commissioned its fourth sustainable bauxite mining report with the aim to collect global data on the environmental, social and economic impacts of bauxite mining operations and their rehabilitation programmes. The report shows that bauxite mining has become sustainable and land area footprint neutral;it is a relatively small land use operation when compared to most other types of mining. All operations have clearly defined rehabilitation objectives, fully integrated rehabilitation programmes, and written rehabilitation procedures. The rehabilitation objectives can be summarized as follows: "The bauxite mining operations aim to restore pre-mining environment and the respective conditions; this can be a self-sustaining ecosystem consisting of native flora and fauna or any other land-use to the benefit of the local community".

  10. Mining biological databases for candidate disease genes

    NASA Astrophysics Data System (ADS)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  11. Gene prioritization and clustering by multi-view text mining

    PubMed Central

    2010-01-01

    Background Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. Results We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. Conclusions In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification. PMID:20074336

  12. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    PubMed

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  13. Diversification of the Higher Mining Education Financing in Globalization Era

    NASA Astrophysics Data System (ADS)

    Frolova, Victoria; Dolina, Olga; Shpil'kina, Tatyana

    2017-11-01

    In the current conditions of global competition, the development of new mining technologies, the requirements to labor resources, their skills and creative potential are increasing. The tasks facing the mining industry cannot be solved without highly qualified personnel, especially managers, engineers and technicians, specialists who possess the knowledge and competences necessary for the development of science and technology of mining, and ensuring mining industrial safety. The authors analyze personnel problems and financing of mining higher education, conclude that there is a need to develop social partnership and diversify the sources of funding for training, advanced training and retraining of personnel for mining and processing of solid mineral deposits.

  14. Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions.

    PubMed

    Hur, Junguk; Özgür, Arzucan; Xiang, Zuoshuang; He, Yongqun

    2015-01-01

    Literature mining of gene-gene interactions has been enhanced by ontology-based name classifications. However, in biomedical literature mining, interaction keywords have not been carefully studied and used beyond a collection of keywords. In this study, we report the development of a new Interaction Network Ontology (INO) that classifies >800 interaction keywords and incorporates interaction terms from the PSI Molecular Interactions (PSI-MI) and Gene Ontology (GO). Using INO-based literature mining results, a modified Fisher's exact test was established to analyze significantly over- and under-represented enriched gene-gene interaction types within a specific area. Such a strategy was applied to study the vaccine-mediated gene-gene interactions using all PubMed abstracts. The Vaccine Ontology (VO) and INO were used to support the retrieval of vaccine terms and interaction keywords from the literature. INO is aligned with the Basic Formal Ontology (BFO) and imports terms from 10 other existing ontologies. Current INO includes 540 terms. In terms of interaction-related terms, INO imports and aligns PSI-MI and GO interaction terms and includes over 100 newly generated ontology terms with 'INO_' prefix. A new annotation property, 'has literature mining keywords', was generated to allow the listing of different keywords mapping to the interaction types in INO. Using all PubMed documents published as of 12/31/2013, approximately 266,000 vaccine-associated documents were identified, and a total of 6,116 gene-pairs were associated with at least one INO term. Out of 78 INO interaction terms associated with at least five gene-pairs of the vaccine-associated sub-network, 14 terms were significantly over-represented (i.e., more frequently used) and 17 under-represented based on our modified Fisher's exact test. These over-represented and under-represented terms share some common top-level terms but are distinct at the bottom levels of the INO hierarchy. The analysis of these

  15. Biomedical hypothesis generation by text mining and gene prioritization.

    PubMed

    Petric, Ingrid; Ligeti, Balazs; Gyorffy, Balazs; Pongor, Sandor

    2014-01-01

    Text mining methods can facilitate the generation of biomedical hypotheses by suggesting novel associations between diseases and genes. Previously, we developed a rare-term model called RaJoLink (Petric et al, J. Biomed. Inform. 42(2): 219-227, 2009) in which hypotheses are formulated on the basis of terms rarely associated with a target domain. Since many current medical hypotheses are formulated in terms of molecular entities and molecular mechanisms, here we extend the methodology to proteins and genes, using a standardized vocabulary as well as a gene/protein network model. The proposed enhanced RaJoLink rare-term model combines text mining and gene prioritization approaches. Its utility is illustrated by finding known as well as potential gene-disease associations in ovarian cancer using MEDLINE abstracts and the STRING database.

  16. DISEASES: text mining and data integration of disease-gene associations.

    PubMed

    Pletscher-Frankild, Sune; Pallejà, Albert; Tsafou, Kalliopi; Binder, Janos X; Jensen, Lars Juhl

    2015-03-01

    Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource, which integrates the results from text mining with manually curated disease-gene associations, cancer mutation data, and genome-wide association studies from existing databases. The DISEASES resource is accessible through a web interface at http://diseases.jensenlab.org/, where the text-mining software and all associations are also freely available for download. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  17. Gene Prioritization of Resistant Rice Gene against Xanthomas oryzae pv. oryzae by Using Text Mining Technologies

    PubMed Central

    Xia, Jingbo; Zhang, Xing; Yuan, Daojun; Chen, Lingling; Webster, Jonathan; Fang, Alex Chengyu

    2013-01-01

    To effectively assess the possibility of the unknown rice protein resistant to Xanthomonas oryzae pv. oryzae, a hybrid strategy is proposed to enhance gene prioritization by combining text mining technologies with a sequence-based approach. The text mining technique of term frequency inverse document frequency is used to measure the importance of distinguished terms which reflect biomedical activity in rice before candidate genes are screened and vital terms are produced. Afterwards, a built-in classifier under the chaos games representation algorithm is used to sieve the best possible candidate gene. Our experiment results show that the combination of these two methods achieves enhanced gene prioritization. PMID:24371834

  18. Gene prioritization of resistant rice gene against Xanthomas oryzae pv. oryzae by using text mining technologies.

    PubMed

    Xia, Jingbo; Zhang, Xing; Yuan, Daojun; Chen, Lingling; Webster, Jonathan; Fang, Alex Chengyu

    2013-01-01

    To effectively assess the possibility of the unknown rice protein resistant to Xanthomonas oryzae pv. oryzae, a hybrid strategy is proposed to enhance gene prioritization by combining text mining technologies with a sequence-based approach. The text mining technique of term frequency inverse document frequency is used to measure the importance of distinguished terms which reflect biomedical activity in rice before candidate genes are screened and vital terms are produced. Afterwards, a built-in classifier under the chaos games representation algorithm is used to sieve the best possible candidate gene. Our experiment results show that the combination of these two methods achieves enhanced gene prioritization.

  19. RANWAR: rank-based weighted association rule mining from gene expression and methylation data.

    PubMed

    Mallik, Saurav; Mukhopadhyay, Anirban; Maulik, Ujjwal

    2015-01-01

    Ranking of association rules is currently an interesting topic in data mining and bioinformatics. The huge number of evolved rules of items (or, genes) by association rule mining (ARM) algorithms makes confusion to the decision maker. In this article, we propose a weighted rule-mining technique (say, RANWAR or rank-based weighted association rule-mining) to rank the rules using two novel rule-interestingness measures, viz., rank-based weighted condensed support (wcs) and weighted condensed confidence (wcc) measures to bypass the problem. These measures are basically depended on the rank of items (genes). Using the rank, we assign weight to each item. RANWAR generates much less number of frequent itemsets than the state-of-the-art association rule mining algorithms. Thus, it saves time of execution of the algorithm. We run RANWAR on gene expression and methylation datasets. The genes of the top rules are biologically validated by Gene Ontologies (GOs) and KEGG pathway analyses. Many top ranked rules extracted from RANWAR that hold poor ranks in traditional Apriori, are highly biologically significant to the related diseases. Finally, the top rules evolved from RANWAR, that are not in Apriori, are reported.

  20. Global direct pressures on biodiversity by large-scale metal mining: Spatial distribution and implications for conservation.

    PubMed

    Murguía, Diego I; Bringezu, Stefan; Schaldach, Rüdiger

    2016-09-15

    Biodiversity loss is widely recognized as a serious global environmental change process. While large-scale metal mining activities do not belong to the top drivers of such change, these operations exert or may intensify pressures on biodiversity by adversely changing habitats, directly and indirectly, at local and regional scales. So far, analyses of global spatial dynamics of mining and its burden on biodiversity focused on the overlap between mines and protected areas or areas of high value for conservation. However, it is less clear how operating metal mines are globally exerting pressure on zones of different biodiversity richness; a similar gap exists for unmined but known mineral deposits. By using vascular plants' diversity as a proxy to quantify overall biodiversity, this study provides a first examination of the global spatial distribution of mines and deposits for five key metals across different biodiversity zones. The results indicate that mines and deposits are not randomly distributed, but concentrated within intermediate and high diversity zones, especially bauxite and silver. In contrast, iron, gold, and copper mines and deposits are closer to a more proportional distribution while showing a high concentration in the intermediate biodiversity zone. Considering the five metals together, 63% and 61% of available mines and deposits, respectively, are located in intermediate diversity zones, comprising 52% of the global land terrestrial surface. 23% of mines and 20% of ore deposits are located in areas of high plant diversity, covering 17% of the land. 13% of mines and 19% of deposits are in areas of low plant diversity, comprising 31% of the land surface. Thus, there seems to be potential for opening new mines in areas of low biodiversity in the future. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.

    PubMed

    Hur, Junguk; Özgür, Arzucan; He, Yongqun

    2017-03-14

    Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of

  2. OntoGene web services for biomedical text mining.

    PubMed

    Rinaldi, Fabio; Clematide, Simon; Marques, Hernani; Ellendorff, Tilia; Romacker, Martin; Rodriguez-Esteban, Raul

    2014-01-01

    Text mining services are rapidly becoming a crucial component of various knowledge management pipelines, for example in the process of database curation, or for exploration and enrichment of biomedical data within the pharmaceutical industry. Traditional architectures, based on monolithic applications, do not offer sufficient flexibility for a wide range of use case scenarios, and therefore open architectures, as provided by web services, are attracting increased interest. We present an approach towards providing advanced text mining capabilities through web services, using a recently proposed standard for textual data interchange (BioC). The web services leverage a state-of-the-art platform for text mining (OntoGene) which has been tested in several community-organized evaluation challenges,with top ranked results in several of them.

  3. OntoGene web services for biomedical text mining

    PubMed Central

    2014-01-01

    Text mining services are rapidly becoming a crucial component of various knowledge management pipelines, for example in the process of database curation, or for exploration and enrichment of biomedical data within the pharmaceutical industry. Traditional architectures, based on monolithic applications, do not offer sufficient flexibility for a wide range of use case scenarios, and therefore open architectures, as provided by web services, are attracting increased interest. We present an approach towards providing advanced text mining capabilities through web services, using a recently proposed standard for textual data interchange (BioC). The web services leverage a state-of-the-art platform for text mining (OntoGene) which has been tested in several community-organized evaluation challenges, with top ranked results in several of them. PMID:25472638

  4. Novel approaches to global mining of aberrantly methylated promoter sites in squamous head and neck cancer.

    PubMed

    Worsham, Maria J; Chen, Kang Mei; Stephen, Josena K; Havard, Shaleta; Benninger, Michael S

    2010-07-01

    Promoter hypermethylation is emerging as a promising molecular strategy for early detection of cancer. We examined promoter methylation status of 1143 cancer-associated genes to perform a global but unbiased inspection of methylated regions in head and neck squamous cell carcinoma (HNSCC). Laboratory-based study. Integrated health care system. Five samples, two frozen primary HNSCC biopsies and three HNSCC cell lines, were examined. Whole genomic DNA was interrogated using a combination of DNA immunoprecipitation (IP) and Affymetrix whole-genome tiling arrays. Of the 1143 unique cancer genes on the array, 265 were recorded across five samples. Of the 265 genes, 55 were present in all five samples, and 36 were common to four of five samples, 46 to three of five, 56 to two of five, and 72 to one of five samples. Hypermethylated genes in the five samples were cross-examined against those in PubMeth, a cancer methylation database combining text mining and expert annotation (http://www.pubmeth.org). Of the 441 genes in PubMeth, only 33 are referenced to HNSCC. We matched 34 genes in our samples to the 441 genes in the PubMeth database. Of the 34 genes, eight are reported in PubMeth as HNSCC associated. This pilot study examined the contribution of global DNA hypermethylation to the pathogenesis of HNSCC. The whole-genome methylation approach indicated 231 new genes with methylated promoter regions not yet reported in HNSCC. Examination of this comprehensive gene panel in a larger HNSCC cohort should advance selection of HNSCC-specific candidate genes for further validation as biomarkers in HNSCC. 2010 American Academy of Otolaryngology-Head and Neck Surgery Foundation. Published by Mosby, Inc. All rights reserved.

  5. GEOGLE: context mining tool for the correlation between gene expression and the phenotypic distinction.

    PubMed

    Yu, Yao; Tu, Kang; Zheng, Siyuan; Li, Yun; Ding, Guohui; Ping, Jie; Hao, Pei; Li, Yixue

    2009-08-25

    In the post-genomic era, the development of high-throughput gene expression detection technology provides huge amounts of experimental data, which challenges the traditional pipelines for data processing and analyzing in scientific researches. In our work, we integrated gene expression information from Gene Expression Omnibus (GEO), biomedical ontology from Medical Subject Headings (MeSH) and signaling pathway knowledge from sigPathway entries to develop a context mining tool for gene expression analysis - GEOGLE. GEOGLE offers a rapid and convenient way for searching relevant experimental datasets, pathways and biological terms according to multiple types of queries: including biomedical vocabularies, GDS IDs, gene IDs, pathway names and signature list. Moreover, GEOGLE summarizes the signature genes from a subset of GDSes and estimates the correlation between gene expression and the phenotypic distinction with an integrated p value. This approach performing global searching of expression data may expand the traditional way of collecting heterogeneous gene expression experiment data. GEOGLE is a novel tool that provides researchers a quantitative way to understand the correlation between gene expression and phenotypic distinction through meta-analysis of gene expression datasets from different experiments, as well as the biological meaning behind. The web site and user guide of GEOGLE are available at: http://omics.biosino.org:14000/kweb/workflow.jsp?id=00020.

  6. Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

    PubMed

    Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community.

  7. Identification of Nitrogen-Fixing Genes and Gene Clusters from Metagenomic Library of Acid Mine Drainage

    PubMed Central

    Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

    2014-01-01

    Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417

  8. miRTex: A Text Mining System for miRNA-Gene Relation Extraction

    PubMed Central

    Li, Gang; Ross, Karen E.; Arighi, Cecilia N.; Peng, Yifan; Wu, Cathy H.; Vijay-Shanker, K.

    2015-01-01

    MicroRNAs (miRNAs) regulate a wide range of cellular and developmental processes through gene expression suppression or mRNA degradation. Experimentally validated miRNA gene targets are often reported in the literature. In this paper, we describe miRTex, a text mining system that extracts miRNA-target relations, as well as miRNA-gene and gene-miRNA regulation relations. The system achieves good precision and recall when evaluated on a literature corpus of 150 abstracts with F-scores close to 0.90 on the three different types of relations. We conducted full-scale text mining using miRTex to process all the Medline abstracts and all the full-length articles in the PubMed Central Open Access Subset. The results for all the Medline abstracts are stored in a database for interactive query and file download via the website at http://proteininformationresource.org/mirtex. Using miRTex, we identified genes potentially regulated by miRNAs in Triple Negative Breast Cancer, as well as miRNA-gene relations that, in conjunction with kinase-substrate relations, regulate the response to abiotic stress in Arabidopsis thaliana. These two use cases demonstrate the usefulness of miRTex text mining in the analysis of miRNA-regulated biological processes. PMID:26407127

  9. Text Mining to Support Gene Ontology Curation and Vice Versa.

    PubMed

    Ruch, Patrick

    2017-01-01

    In this chapter, we explain how text mining can support the curation of molecular biology databases dealing with protein functions. We also show how curated data can play a disruptive role in the developments of text mining methods. We review a decade of efforts to improve the automatic assignment of Gene Ontology (GO) descriptors, the reference ontology for the characterization of genes and gene products. To illustrate the high potential of this approach, we compare the performances of an automatic text categorizer and show a large improvement of +225 % in both precision and recall on benchmarked data. We argue that automatic text categorization functions can ultimately be embedded into a Question-Answering (QA) system to answer questions related to protein functions. Because GO descriptors can be relatively long and specific, traditional QA systems cannot answer such questions. A new type of QA system, so-called Deep QA which uses machine learning methods trained with curated contents, is thus emerging. Finally, future advances of text mining instruments are directly dependent on the availability of high-quality annotated contents at every curation step. Databases workflows must start recording explicitly all the data they curate and ideally also some of the data they do not curate.

  10. Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining.

    PubMed

    Kreula, Sanna M; Kaewphan, Suwisa; Ginter, Filip; Jones, Patrik R

    2018-01-01

    The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from 'reading the literature'. The utility of text-mining for well-studied species is obvious though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for Synechocystis sp. PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already 'known', and (2) demanding multiple independent evidences for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to ( i ) discover novel candidate associations between different genes or proteins in the network, and ( ii ) rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource.

  11. Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining

    PubMed Central

    Kreula, Sanna M.; Kaewphan, Suwisa; Ginter, Filip

    2018-01-01

    The increasing move towards open access full-text scientific literature enhances our ability to utilize advanced text-mining methods to construct information-rich networks that no human will be able to grasp simply from ‘reading the literature’. The utility of text-mining for well-studied species is obvious though the utility for less studied species, or those with no prior track-record at all, is not clear. Here we present a concept for how advanced text-mining can be used to create information-rich networks even for less well studied species and apply it to generate an open-access gene-gene association network resource for Synechocystis sp. PCC 6803, a representative model organism for cyanobacteria and first case-study for the methodology. By merging the text-mining network with networks generated from species-specific experimental data, network integration was used to enhance the accuracy of predicting novel interactions that are biologically relevant. A rule-based algorithm (filter) was constructed in order to automate the search for novel candidate genes with a high degree of likely association to known target genes by (1) ignoring established relationships from the existing literature, as they are already ‘known’, and (2) demanding multiple independent evidences for every novel and potentially relevant relationship. Using selected case studies, we demonstrate the utility of the network resource and filter to (i) discover novel candidate associations between different genes or proteins in the network, and (ii) rapidly evaluate the potential role of any one particular gene or protein. The full network is provided as an open-source resource. PMID:29844966

  12. Study of Staphylococcus aureus N315 Pathogenic Genes by Text Mining and Enrichment Analysis of Pathways and Operons.

    PubMed

    Yang, Chun-Feng; Gou, Wei-Hui; Dai, Xin-Lun; Li, Yu-Mei

    2018-06-01

    Staphylococcus aureus (S. aureus) is a versatile pathogen found in many environments and can cause nosocomial infections in the community and hospitals. S. aureus infection is an increasingly serious threat to global public health that requires action across many government bodies, medical and health sectors, and scientific research institutions. In the present study, S. aureus N315 genes that have been shown in the literature to be pathogenic were extracted using a bibliometric method for functional enrichment analysis of pathways and operons to statistically discover novel pathogenic genes associated with S. aureus N315. A total of 383 pathogenic genes were mined from the literature using bibliometrics, and subsequently a few new pathogenic genes of S. aureus N315 were identified by functional enrichment analysis of pathways and operons. The discovery of these novel S. aureus N315 pathogenic genes is of great significance to treat S. aureus induced diseases and identify potential diagnostic markers, thus providing theoretical fundamentals for epidemiological prevention.

  13. Online Analytical Processing (OLAP): A Fast and Effective Data Mining Tool for Gene Expression Databases

    PubMed Central

    2005-01-01

    Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP) can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQLServer2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments is stored in the soybean genomics and microarray database (SGMD). A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB. PMID:16046824

  14. A high-resolution network model for global gene regulation in Mycobacterium tuberculosis

    PubMed Central

    Peterson, Eliza J.R.; Reiss, David J.; Turkarslan, Serdar; Minch, Kyle J.; Rustad, Tige; Plaisier, Christopher L.; Longabaugh, William J.R.; Sherman, David R.; Baliga, Nitin S.

    2014-01-01

    The resilience of Mycobacterium tuberculosis (MTB) is largely due to its ability to effectively counteract and even take advantage of the hostile environments of a host. In order to accelerate the discovery and characterization of these adaptive mechanisms, we have mined a compendium of 2325 publicly available transcriptome profiles of MTB to decipher a predictive, systems-scale gene regulatory network model. The resulting modular organization of 98% of all MTB genes within this regulatory network was rigorously tested using two independently generated datasets: a genome-wide map of 7248 DNA-binding locations for 143 transcription factors (TFs) and global transcriptional consequences of overexpressing 206 TFs. This analysis has discovered specific TFs that mediate conditional co-regulation of genes within 240 modules across 14 distinct environmental contexts. In addition to recapitulating previously characterized regulons, we discovered 454 novel mechanisms for gene regulation during stress, cholesterol utilization and dormancy. Significantly, 183 of these mechanisms act uniquely under conditions experienced during the infection cycle to regulate diverse functions including 23 genes that are essential to host-pathogen interactions. These and other insights underscore the power of a rational, model-driven approach to unearth novel MTB biology that operates under some but not all phases of infection. PMID:25232098

  15. Mining Gene Regulatory Networks by Neural Modeling of Expression Time-Series.

    PubMed

    Rubiolo, Mariano; Milone, Diego H; Stegmayer, Georgina

    2015-01-01

    Discovering gene regulatory networks from data is one of the most studied topics in recent years. Neural networks can be successfully used to infer an underlying gene network by modeling expression profiles as times series. This work proposes a novel method based on a pool of neural networks for obtaining a gene regulatory network from a gene expression dataset. They are used for modeling each possible interaction between pairs of genes in the dataset, and a set of mining rules is applied to accurately detect the subjacent relations among genes. The results obtained on artificial and real datasets confirm the method effectiveness for discovering regulatory networks from a proper modeling of the temporal dynamics of gene expression profiles.

  16. RNA-Seq analysis of yak ovary: improving yak gene structure information and mining reproduction-related genes.

    PubMed

    Lan, DaoLiang; Xiong, XianRong; Wei, YanLi; Xu, Tong; Zhong, JinCheng; Zhi, XiangDong; Wang, Yong; Li, Jian

    2014-09-01

    RNA-Seq, a high-throughput (HT) sequencing technique, has been used effectively in large-scale transcriptomic studies, and is particularly useful for improving gene structure information and mining of new genes. In this study, RNA-Seq HT technology was employed to analyze the transcriptome of yak ovary. After Illumina-Solexa deep sequencing, 26826516 clean reads with a total of 4828772880 bp were obtained from the ovary library. Alignment analysis showed that 16992 yak genes mapped to the yak genome and 3734 of these genes were involved in alternative splicing. Gene structure refinement analysis showed that 7340 genes that were annotated in the yak genome could be extended at the 5' or 3' ends based on the alignments been the transcripts and the genome sequence. Novel transcript prediction analysis identified 6321 new transcripts with lengths ranging from 180 to 14884 bp, and 2267 of them were predicted to code proteins. BLAST analysis of the new transcripts showed that 1200?4933 mapped to the non-redundant (nr), nucleotide (nt) and/or SwissProt sequence databases. Comparative statistical analysis of the new mapped transcripts showed that the majority of them were similar to genes in Bos taurus (41.4%), Bos grunniens mutus (33.0%), Ovis aries (6.3%), Homo sapiens (2.8%), Mus musculus (1.6%) and other species. Functional analysis showed that these expressed genes were involved in various Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes pathways. GO analysis of the new transcripts found that the largest proportion of them was associated with reproduction. The results of this study will provide a basis for describing the normal transcriptome map of yak ovary and for future studies on yak breeding performance. Moreover, the results confirmed that RNA-Seq HT technology is highly advantageous in improving gene structure information and mining of new genes, as well as in providing valuable data to expand the yak genome information.

  17. Analyzing Large Gene Expression and Methylation Data Profiles Using StatBicRM: Statistical Biclustering-Based Rule Mining

    PubMed Central

    Maulik, Ujjwal; Mallik, Saurav; Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra

    2015-01-01

    Microarray and beadchip are two most efficient techniques for measuring gene expression and methylation data in bioinformatics. Biclustering deals with the simultaneous clustering of genes and samples. In this article, we propose a computational rule mining framework, StatBicRM (i.e., statistical biclustering-based rule mining) to identify special type of rules and potential biomarkers using integrated approaches of statistical and binary inclusion-maximal biclustering techniques from the biological datasets. At first, a novel statistical strategy has been utilized to eliminate the insignificant/low-significant/redundant genes in such way that significance level must satisfy the data distribution property (viz., either normal distribution or non-normal distribution). The data is then discretized and post-discretized, consecutively. Thereafter, the biclustering technique is applied to identify maximal frequent closed homogeneous itemsets. Corresponding special type of rules are then extracted from the selected itemsets. Our proposed rule mining method performs better than the other rule mining algorithms as it generates maximal frequent closed homogeneous itemsets instead of frequent itemsets. Thus, it saves elapsed time, and can work on big dataset. Pathway and Gene Ontology analyses are conducted on the genes of the evolved rules using David database. Frequency analysis of the genes appearing in the evolved rules is performed to determine potential biomarkers. Furthermore, we also classify the data to know how much the evolved rules are able to describe accurately the remaining test (unknown) data. Subsequently, we also compare the average classification accuracy, and other related factors with other rule-based classifiers. Statistical significance tests are also performed for verifying the statistical relevance of the comparative results. Here, each of the other rule mining methods or rule-based classifiers is also starting with the same post-discretized data

  18. Analyzing large gene expression and methylation data profiles using StatBicRM: statistical biclustering-based rule mining.

    PubMed

    Maulik, Ujjwal; Mallik, Saurav; Mukhopadhyay, Anirban; Bandyopadhyay, Sanghamitra

    2015-01-01

    Microarray and beadchip are two most efficient techniques for measuring gene expression and methylation data in bioinformatics. Biclustering deals with the simultaneous clustering of genes and samples. In this article, we propose a computational rule mining framework, StatBicRM (i.e., statistical biclustering-based rule mining) to identify special type of rules and potential biomarkers using integrated approaches of statistical and binary inclusion-maximal biclustering techniques from the biological datasets. At first, a novel statistical strategy has been utilized to eliminate the insignificant/low-significant/redundant genes in such way that significance level must satisfy the data distribution property (viz., either normal distribution or non-normal distribution). The data is then discretized and post-discretized, consecutively. Thereafter, the biclustering technique is applied to identify maximal frequent closed homogeneous itemsets. Corresponding special type of rules are then extracted from the selected itemsets. Our proposed rule mining method performs better than the other rule mining algorithms as it generates maximal frequent closed homogeneous itemsets instead of frequent itemsets. Thus, it saves elapsed time, and can work on big dataset. Pathway and Gene Ontology analyses are conducted on the genes of the evolved rules using David database. Frequency analysis of the genes appearing in the evolved rules is performed to determine potential biomarkers. Furthermore, we also classify the data to know how much the evolved rules are able to describe accurately the remaining test (unknown) data. Subsequently, we also compare the average classification accuracy, and other related factors with other rule-based classifiers. Statistical significance tests are also performed for verifying the statistical relevance of the comparative results. Here, each of the other rule mining methods or rule-based classifiers is also starting with the same post-discretized data

  19. Distributed Function Mining for Gene Expression Programming Based on Fast Reduction.

    PubMed

    Deng, Song; Yue, Dong; Yang, Le-chan; Fu, Xiong; Feng, Ya-zhou

    2016-01-01

    For high-dimensional and massive data sets, traditional centralized gene expression programming (GEP) or improved algorithms lead to increased run-time and decreased prediction accuracy. To solve this problem, this paper proposes a new improved algorithm called distributed function mining for gene expression programming based on fast reduction (DFMGEP-FR). In DFMGEP-FR, fast attribution reduction in binary search algorithms (FAR-BSA) is proposed to quickly find the optimal attribution set, and the function consistency replacement algorithm is given to solve integration of the local function model. Thorough comparative experiments for DFMGEP-FR, centralized GEP and the parallel gene expression programming algorithm based on simulated annealing (parallel GEPSA) are included in this paper. For the waveform, mushroom, connect-4 and musk datasets, the comparative results show that the average time-consumption of DFMGEP-FR drops by 89.09%%, 88.85%, 85.79% and 93.06%, respectively, in contrast to centralized GEP and by 12.5%, 8.42%, 9.62% and 13.75%, respectively, compared with parallel GEPSA. Six well-studied UCI test data sets demonstrate the efficiency and capability of our proposed DFMGEP-FR algorithm for distributed function mining.

  20. Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

    PubMed

    Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

    2009-02-01

    High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.

  1. Gene regulatory networks in lactation: identification of global principles using bioinformatics.

    PubMed

    Lemay, Danielle G; Neville, Margaret C; Rudolph, Michael C; Pollard, Katherine S; German, J Bruce

    2007-11-27

    The molecular events underlying mammary development during pregnancy, lactation, and involution are incompletely understood. Mammary gland microarray data, cellular localization data, protein-protein interactions, and literature-mined genes were integrated and analyzed using statistics, principal component analysis, gene ontology analysis, pathway analysis, and network analysis to identify global biological principles that govern molecular events during pregnancy, lactation, and involution. Several key principles were derived: (1) nearly a third of the transcriptome fluctuates to build, run, and disassemble the lactation apparatus; (2) genes encoding the secretory machinery are transcribed prior to lactation; (3) the diversity of the endogenous portion of the milk proteome is derived from fewer than 100 transcripts; (4) while some genes are differentially transcribed near the onset of lactation, the lactation switch is primarily post-transcriptionally mediated; (5) the secretion of materials during lactation occurs not by up-regulation of novel genomic functions, but by widespread transcriptional suppression of functions such as protein degradation and cell-environment communication; (6) the involution switch is primarily transcriptionally mediated; and (7) during early involution, the transcriptional state is partially reverted to the pre-lactation state. A new hypothesis for secretory diminution is suggested - milk production gradually declines because the secretory machinery is not transcriptionally replenished. A comprehensive network of protein interactions during lactation is assembled and new regulatory gene targets are identified. Less than one fifth of the transcriptionally regulated nodes in this lactation network have been previously explored in the context of lactation. Implications for future research in mammary and cancer biology are discussed.

  2. MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways

    PubMed Central

    Koumakis, Lefteris; Kartsaki, Evgenia; Chatzimina, Maria; Zervakis, Michalis; Vassou, Despoina; Marias, Kostas; Moustakis, Vassilis; Potamias, George

    2016-01-01

    Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the

  3. MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways.

    PubMed

    Koumakis, Lefteris; Kanterakis, Alexandros; Kartsaki, Evgenia; Chatzimina, Maria; Zervakis, Michalis; Tsiknakis, Manolis; Vassou, Despoina; Kafetzopoulos, Dimitris; Marias, Kostas; Moustakis, Vassilis; Potamias, George

    2016-11-01

    Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the

  4. Bacteria and Genes Involved in Arsenic Speciation in Sediment Impacted by Long-Term Gold Mining

    PubMed Central

    Costa, Patrícia S.; Scholte, Larissa L. S.; Reis, Mariana P.; Chaves, Anderson V.; Oliveira, Pollyanna L.; Itabayana, Luiza B.; Suhadolnik, Maria Luiza S.; Barbosa, Francisco A. R.; Chartone-Souza, Edmar; Nascimento, Andréa M. A.

    2014-01-01

    The bacterial community and genes involved in geobiocycling of arsenic (As) from sediment impacted by long-term gold mining were characterized through culture-based analysis of As-transforming bacteria and metagenomic studies of the arsC, arrA, and aioA genes. Sediment was collected from the historically gold mining impacted Mina stream, located in one of the world’s largest mining regions known as the “Iron Quadrangle”. A total of 123 As-resistant bacteria were recovered from the enrichment cultures, which were phenotypically and genotypically characterized for As-transformation. A diverse As-resistant bacteria community was found through phylogenetic analyses of the 16S rRNA gene. Bacterial isolates were affiliated with Proteobacteria, Firmicutes, and Actinobacteria and were represented by 20 genera. Most were AsV-reducing (72%), whereas AsIII-oxidizing accounted for 20%. Bacteria harboring the arsC gene predominated (85%), followed by aioA (20%) and arrA (7%). Additionally, we identified two novel As-transforming genera, Thermomonas and Pannonibacter. Metagenomic analysis of arsC, aioA, and arrA sequences confirmed the presence of these genes, with arrA sequences being more closely related to uncultured organisms. Evolutionary analyses revealed high genetic similarity between some arsC and aioA sequences obtained from isolates and clone libraries, suggesting that those isolates may represent environmentally important bacteria acting in As speciation. In addition, our findings show that the diversity of arrA genes is wider than earlier described, once none arrA-OTUs were affiliated with known reference strains. Therefore, the molecular diversity of arrA genes is far from being fully explored deserving further attention. PMID:24755825

  5. Identification of underground mine workings with the use of global positioning system technology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Canty, G.A.; Everett, J.W.; Sharp, M.

    1998-12-31

    Identification of underground mine workings for well drilling is a difficult task given the limited resources available and lack of reliable information. Relic mine maps of questionable accuracy and difficulty in correlating the subsurface to the surface, make the process of locating wells arduous. With the development of global positioning system (GPS), specific locations on the earth can be identified with the aid of satellites. This technology can be applied to mine workings identification given a few necessary, precursory details. For an abandoned mine treatment project conducted by the University of Oklahoma, in conjunction with the Oklahoma Conservation Commission, amore » Trimble ProXL 8 channel GPS receiver was employed to locate specific points on the surface with respect to a mine map. A 1925 mine map was digitized into AutoCAD version 13 software. Surface features identified on the map, such as mine adits, were located and marked in the field using the GPS receiver. These features were than imported into AutoCAD and referenced with the same points drawn on the map. A rubber sheeting program, Multric, was used to tweak the points so the map features correlated with the surface points. The correlation of these features allowed the map to be geo-referenced with the surface. Specific drilling points were located on the digitized map and assigned a latitude and longitude. The GPS receiver, using real time differential correction, was used to locate these points in the field. This method was assumed to be relatively accurate, to within 5 to 15 feet.« less

  6. Integrated pathway-based transcription regulation network mining and visualization based on gene expression profiles.

    PubMed

    Kibinge, Nelson; Ono, Naoaki; Horie, Masafumi; Sato, Tetsuo; Sugiura, Tadao; Altaf-Ul-Amin, Md; Saito, Akira; Kanaya, Shigehiko

    2016-06-01

    Conventionally, workflows examining transcription regulation networks from gene expression data involve distinct analytical steps. There is a need for pipelines that unify data mining and inference deduction into a singular framework to enhance interpretation and hypotheses generation. We propose a workflow that merges network construction with gene expression data mining focusing on regulation processes in the context of transcription factor driven gene regulation. The pipeline implements pathway-based modularization of expression profiles into functional units to improve biological interpretation. The integrated workflow was implemented as a web application software (TransReguloNet) with functions that enable pathway visualization and comparison of transcription factor activity between sample conditions defined in the experimental design. The pipeline merges differential expression, network construction, pathway-based abstraction, clustering and visualization. The framework was applied in analysis of actual expression datasets related to lung, breast and prostrate cancer. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. The Determination of Children's Knowledge of Global Lunar Patterns from Online Essays Using Text Mining Analysis

    ERIC Educational Resources Information Center

    Cheon, Jongpil; Lee, Sangno; Smith, Walter; Song, Jaeki; Kim, Yongjin

    2013-01-01

    The purpose of this study was to use text mining analysis of early adolescents' online essays to determine their knowledge of global lunar patterns. Australian and American students in grades five to seven wrote about global lunar patterns they had discovered by sharing observations with each other via the Internet. These essays were analyzed for…

  8. Clique-based data mining for related genes in a biomedical database.

    PubMed

    Matsunaga, Tsutomu; Yonemori, Chikara; Tomita, Etsuji; Muramatsu, Masaaki

    2009-07-01

    Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comprehensively search from biomedical databases for related genes, such as genes in the same families and genes encoding components of the same pathways. Here we address the extraction of related genes by searching for densely-connected subgraphs, which are modeled as cliques, in a biomedical relational graph. We constructed a graph whose nodes were gene or disease pages, and edges were the hyperlink connections between those pages in the Online Mendelian Inheritance in Man (OMIM) database. We obtained over 20,000 sets of related genes (called 'gene modules') by enumerating cliques computationally. The modules included genes in the same family, genes for proteins that form a complex, and genes for components of the same signaling pathway. The results of experiments using 'metabolic syndrome'-related gene modules show that the gene modules can be used to get a coherent holistic picture helpful for interpreting relations among genes. We presented a data mining approach extracting related genes by enumerating cliques. The extracted gene sets provide a holistic picture useful for comprehending complex disease mechanisms.

  9. Intrinsic limits to gene regulation by global crosstalk

    PubMed Central

    Friedlander, Tamar; Prizak, Roshan; Guet, Călin C.; Barton, Nicholas H.; Tkačik, Gašper

    2016-01-01

    Gene regulation relies on the specificity of transcription factor (TF)–DNA interactions. Limited specificity may lead to crosstalk: a regulatory state in which a gene is either incorrectly activated due to noncognate TF–DNA interactions or remains erroneously inactive. As each TF can have numerous interactions with noncognate cis-regulatory elements, crosstalk is inherently a global problem, yet has previously not been studied as such. We construct a theoretical framework to analyse the effects of global crosstalk on gene regulation. We find that crosstalk presents a significant challenge for organisms with low-specificity TFs, such as metazoans. Crosstalk is not easily mitigated by known regulatory schemes acting at equilibrium, including variants of cooperativity and combinatorial regulation. Our results suggest that crosstalk imposes a previously unexplored global constraint on the functioning and evolution of regulatory networks, which is qualitatively distinct from the known constraints that act at the level of individual gene regulatory elements. PMID:27489144

  10. Resistance Genes in Global Crop Breeding Networks.

    PubMed

    Garrett, K A; Andersen, K F; Asche, F; Bowden, R L; Forbes, G A; Kulakow, P A; Zhou, B

    2017-10-01

    Resistance genes are a major tool for managing crop diseases. The networks of crop breeders who exchange resistance genes and deploy them in varieties help to determine the global landscape of resistance and epidemics, an important system for maintaining food security. These networks function as a complex adaptive system, with associated strengths and vulnerabilities, and implications for policies to support resistance gene deployment strategies. Extensions of epidemic network analysis can be used to evaluate the multilayer agricultural networks that support and influence crop breeding networks. Here, we evaluate the general structure of crop breeding networks for cassava, potato, rice, and wheat. All four are clustered due to phytosanitary and intellectual property regulations, and linked through CGIAR hubs. Cassava networks primarily include public breeding groups, whereas others are more mixed. These systems must adapt to global change in climate and land use, the emergence of new diseases, and disruptive breeding technologies. Research priorities to support policy include how best to maintain both diversity and redundancy in the roles played by individual crop breeding groups (public versus private and global versus local), and how best to manage connectivity to optimize resistance gene deployment while avoiding risks to the useful life of resistance genes. [Formula: see text] Copyright © 2017 The Author(s). This is an open access article distributed under the CC BY 4.0 International license .

  11. DTFP-Growth: Dynamic Threshold-Based FP-Growth Rule Mining Algorithm Through Integrating Gene Expression, Methylation, and Protein-Protein Interaction Profiles.

    PubMed

    Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan; Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan; Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan

    2018-04-01

    Association rule mining is an important technique for identifying interesting relationships between gene pairs in a biological data set. Earlier methods basically work for a single biological data set, and, in maximum cases, a single minimum support cutoff can be applied globally, i.e., across all genesets/itemsets. To overcome this limitation, in this paper, we propose dynamic threshold-based FP-growth rule mining algorithm that integrates gene expression, methylation and protein-protein interaction profiles based on weighted shortest distance to find the novel associations among different pairs of genes in multi-view data sets. For this purpose, we introduce three new thresholds, namely, Distance-based Variable/Dynamic Supports (DVS), Distance-based Variable Confidences (DVC), and Distance-based Variable Lifts (DVL) for each rule by integrating co-expression, co-methylation, and protein-protein interactions existed in the multi-omics data set. We develop the proposed algorithm utilizing these three novel multiple threshold measures. In the proposed algorithm, the values of , , and are computed for each rule separately, and subsequently it is verified whether the support, confidence, and lift of each evolved rule are greater than or equal to the corresponding individual , , and values, respectively, or not. If all these three conditions for a rule are found to be true, the rule is treated as a resultant rule. One of the major advantages of the proposed method compared with other related state-of-the-art methods is that it considers both the quantitative and interactive significance among all pairwise genes belonging to each rule. Moreover, the proposed method generates fewer rules, takes less running time, and provides greater biological significance for the resultant top-ranking rules compared to previous methods.

  12. JAK signaling globally counteracts heterochromatic gene silencing.

    PubMed

    Shi, Song; Calhoun, Healani C; Xia, Fan; Li, Jinghong; Le, Long; Li, Willis X

    2006-09-01

    The JAK/STAT pathway has pleiotropic roles in animal development, and its aberrant activation is implicated in multiple human cancers. JAK/STAT signaling effects have been attributed largely to direct transcriptional regulation by STAT of specific target genes that promote tumor cell proliferation or survival. We show here in a Drosophila melanogaster hematopoietic tumor model, however, that JAK overactivation globally disrupts heterochromatic gene silencing, an epigenetic tumor suppressive mechanism. This disruption allows derepression of genes that are not direct targets of STAT, as evidenced by suppression of heterochromatin-mediated position effect variegation. Moreover, mutations in the genes encoding heterochromatin components heterochromatin protein 1 (HP1) and Su(var)3-9 enhance tumorigenesis induced by an oncogenic JAK kinase without affecting JAK/STAT signaling. Consistently, JAK loss of function enhances heterochromatic gene silencing, whereas overexpressing HP1 suppresses oncogenic JAK-induced tumors. These results demonstrate that the JAK/STAT pathway regulates cellular epigenetic status and that globally disrupting heterochromatin-mediated tumor suppression is essential for tumorigenesis induced by JAK overactivation.

  13. JAK signaling globally counteracts heterochromatic gene silencing

    PubMed Central

    Shi, Song; Calhoun, Healani C; Xia, Fan; Li, Jinghong; Le, Long; Li, Willis X

    2011-01-01

    The JAK/STAT pathway has pleiotropic roles in animal development, and its aberrant activation is implicated in multiple human cancers1–3. JAK/STAT signaling effects have been attributed largely to direct transcriptional regulation by STAT of specific target genes that promote tumor cell proliferation or survival. We show here in a Drosophila melanogaster hematopoietic tumor model, however, that JAK overactivation globally disrupts heterochromatic gene silencing, an epigenetic tumor suppressive mechanism4. This disruption allows derepression of genes that are not direct targets of STAT, as evidenced by suppression of heterochromatin-mediated position effect variegation. Moreover, mutations in the genes encoding heterochromatin components heterochromatin protein 1 (HP1) and Su(var)3-9 enhance tumorigenesis induced by an oncogenic JAK kinase without affecting JAK/STAT signaling. Consistently, JAK loss of function enhances heterochromatic gene silencing, whereas overexpressing HP1 suppresses oncogenic JAK-induced tumors. These results demonstrate that the JAK/STAT pathway regulates cellular epigenetic status and that globally disrupting heterochromatin-mediated tumor suppression is essential for tumorigenesis induced by JAK overactivation. PMID:16892059

  14. Literature mining, gene-set enrichment and pathway analysis for target identification in Behçet's disease.

    PubMed

    Wilson, Paul; Larminie, Christopher; Smith, Rona

    2016-01-01

    To use literature mining to catalogue Behçet's associated genes, and advanced computational methods to improve the understanding of the pathways and signalling mechanisms that lead to the typical clinical characteristics of Behçet's patients. To extend this technique to identify potential treatment targets for further experimental validation. Text mining methods combined with gene enrichment tools, pathway analysis and causal analysis algorithms. This approach identified 247 human genes associated with Behçet's disease and the resulting disease map, comprising 644 nodes and 19220 edges, captured important details of the relationships between these genes and their associated pathways, as described in diverse data repositories. Pathway analysis has identified how Behçet's associated genes are likely to participate in innate and adaptive immune responses. Causal analysis algorithms have identified a number of potential therapeutic strategies for further investigation. Computational methods have captured pertinent features of the prominent disease characteristics presented in Behçet's disease and have highlighted NOD2, ICOS and IL18 signalling as potential therapeutic strategies.

  15. DDMGD: the database of text-mined associations between genes methylated in diseases from different species.

    PubMed

    Bin Raies, Arwa; Mansour, Hicham; Incitti, Roberto; Bajic, Vladimir B

    2015-01-01

    Gathering information about associations between methylated genes and diseases is important for diseases diagnosis and treatment decisions. Recent advancements in epigenetics research allow for large-scale discoveries of associations of genes methylated in diseases in different species. Searching manually for such information is not easy, as it is scattered across a large number of electronic publications and repositories. Therefore, we developed DDMGD database (http://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD's scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we developed earlier and additional post-processing, we extracted associations of genes methylated in different diseases from PubMed Central articles and PubMed abstracts. The accuracy of extracted associations is 82% as estimated on 2500 hand-curated entries. DDMGD provides a user-friendly interface facilitating retrieval of these associations ranked according to confidence scores. Submission of new associations to DDMGD is provided. A comparison analysis of DDMGD with several other databases focused on genes methylated in diseases shows that DDMGD is comprehensive and includes most of the recent information on genes methylated in diseases. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Intrinsic limits to gene regulation by global crosstalk

    NASA Astrophysics Data System (ADS)

    Friedlander, Tamar; Prizak, Roshan; Guet, Calin; Barton, Nicholas H.; Tkacik, Gasper

    Gene activity is mediated by the specificity of binding interactions between special proteins, called transcription factors, and short regulatory sequences on the DNA, where different protein species preferentially bind different DNA targets. Limited interaction specificity may lead to crosstalk: a regulatory state in which a gene is either incorrectly activated due to spurious interactions or remains erroneously inactive. Since each protein can potentially interact with numerous DNA targets, crosstalk is inherently a global problem, yet has previously not been studied as such. We construct a theoretical framework to analyze the effects of global crosstalk on gene regulation, using statistical mechanics. We find that crosstalk in regulatory interactions puts fundamental limits on the reliability of gene regulation that are not easily mitigated by tuning proteins concentrations or by complex regulatory schemes proposed in the literature. Our results suggest that crosstalk imposes a previously unexplored global constraint on the functioning and evolution of regulatory networks, which is qualitatively distinct from the known constraints that act at the level of individual gene regulatory elements. The research leading to these results has received funding from the People Programme (Marie Curie Actions) of the European Union's Seventh Framework Programme (FP7/2007-2013) under REA Grant agreement Nr. 291734 (T.F.) and ERC Grant Nr. 250152 (N.B.).

  17. Nitrifier Gene Abundance and Diversity in Sediments Impacted by Acid Mine Drainage

    PubMed Central

    Ramanathan, Bhargavi; Boddicker, Andrew M.; Roane, Timberley M.; Mosier, Annika C.

    2017-01-01

    Extremely acidic and metal-rich acid mine drainage (AMD) waters can have severe toxicological effects on aquatic ecosystems. AMD has been shown to completely halt nitrification, which plays an important role in transferring nitrogen to higher organisms and in mitigating nitrogen pollution. We evaluated the gene abundance and diversity of nitrifying microbes in AMD-impacted sediments: ammonia-oxidizing archaea (AOA), ammonia-oxidizing bacteria (AOB), and nitrite-oxidizing bacteria (NOB). Samples were collected from the Iron Springs Mining District (Ophir, CO, United States) during early and late summer in 2013 and 2014. Many of the sites were characterized by low pH (<5) and high metal concentrations. Sequence analyses revealed AOA genes related to Nitrososphaera, Nitrosotalea, and Nitrosoarchaeum; AOB genes related to Nitrosomonas and Nitrosospira; and NOB genes related to Nitrospira. The overall abundance of AOA, AOB and NOB was examined using quantitative PCR (qPCR) amplification of the amoA and nxrB functional genes and 16S rRNA genes. Gene copy numbers ranged from 3.2 × 104 – 4.9 × 107 archaeal amoA copies ∗ μg DNA-1, 1.5 × 103 – 5.3 × 105 AOB 16S rRNA copies ∗ μg DNA-1, and 1.3 × 106 – 7.7 × 107 Nitrospira nxrB copies ∗ μg DNA-1. Overall, Nitrospira nxrB genes were found to be more abundant than AOB 16S rRNA and archaeal amoA genes in most of the sample sites across 2013 and 2014. AOB 16S rRNA and Nitrospira nxrB genes were quantified in sediments with pH as low as 3.2, and AOA amoA genes were quantified in sediments as low as 3.5. Though pH varied across all sites (pH 3.2–8.3), pH was not strongly correlated to the overall community structure or relative abundance of individual OTUs for any gene (based on CCA and Spearman correlations). pH was positivity correlated to the total abundance (qPCR) of AOB 16S rRNA genes, but not for any other genes. Metals were not correlated to the overall nitrifier community composition or abundance, but

  18. Development of Biomarkers for Screening Hepatocellular Carcinoma Using Global Data Mining and Multiple Reaction Monitoring

    PubMed Central

    Yu, Su Jong; Jang, Eun Sun; Yu, Jiyoung; Cho, Geunhee; Yoon, Jung-Hwan; Kim, Youngsoo

    2013-01-01

    Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers and is associated with a poor survival rate. Clinically, the level of alpha-fetoprotein (AFP) has been used as a biomarker for the diagnosis of HCC. The discovery of useful biomarkers for HCC, focused solely on the proteome, has been difficult; thus, wide-ranging global data mining of genomic and proteomic databases from previous reports would be valuable in screening biomarker candidates. Further, multiple reaction monitoring (MRM), based on triple quadrupole mass spectrometry, has been effective with regard to high-throughput verification, complementing antibody-based verification pipelines. In this study, global data mining was performed using 5 types of HCC data to screen for candidate biomarker proteins: cDNA microarray, copy number variation, somatic mutation, epigenetic, and quantitative proteomics data. Next, we applied MRM to verify HCC candidate biomarkers in individual serum samples from 3 groups: a healthy control group, patients who have been diagnosed with HCC (Before HCC treatment group), and HCC patients who underwent locoregional therapy (After HCC treatment group). After determining the relative quantities of the candidate proteins by MRM, we compared their expression levels between the 3 groups, identifying 4 potential biomarkers: the actin-binding protein anillin (ANLN), filamin-B (FLNB), complementary C4-A (C4A), and AFP. The combination of 2 markers (ANLN, FLNB) improved the discrimination of the before HCC treatment group from the healthy control group compared with AFP. We conclude that the combination of global data mining and MRM verification enhances the screening and verification of potential HCC biomarkers. This efficacious integrative strategy is applicable to the development of markers for cancer and other diseases. PMID:23717429

  19. Development of biomarkers for screening hepatocellular carcinoma using global data mining and multiple reaction monitoring.

    PubMed

    Kim, Hyunsoo; Kim, Kyunggon; Yu, Su Jong; Jang, Eun Sun; Yu, Jiyoung; Cho, Geunhee; Yoon, Jung-Hwan; Kim, Youngsoo

    2013-01-01

    Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers and is associated with a poor survival rate. Clinically, the level of alpha-fetoprotein (AFP) has been used as a biomarker for the diagnosis of HCC. The discovery of useful biomarkers for HCC, focused solely on the proteome, has been difficult; thus, wide-ranging global data mining of genomic and proteomic databases from previous reports would be valuable in screening biomarker candidates. Further, multiple reaction monitoring (MRM), based on triple quadrupole mass spectrometry, has been effective with regard to high-throughput verification, complementing antibody-based verification pipelines. In this study, global data mining was performed using 5 types of HCC data to screen for candidate biomarker proteins: cDNA microarray, copy number variation, somatic mutation, epigenetic, and quantitative proteomics data. Next, we applied MRM to verify HCC candidate biomarkers in individual serum samples from 3 groups: a healthy control group, patients who have been diagnosed with HCC (Before HCC treatment group), and HCC patients who underwent locoregional therapy (After HCC treatment group). After determining the relative quantities of the candidate proteins by MRM, we compared their expression levels between the 3 groups, identifying 4 potential biomarkers: the actin-binding protein anillin (ANLN), filamin-B (FLNB), complementary C4-A (C4A), and AFP. The combination of 2 markers (ANLN, FLNB) improved the discrimination of the before HCC treatment group from the healthy control group compared with AFP. We conclude that the combination of global data mining and MRM verification enhances the screening and verification of potential HCC biomarkers. This efficacious integrative strategy is applicable to the development of markers for cancer and other diseases.

  20. Global demand for rare earth resources and strategies for green mining

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dutta, Tanushree

    Rare earth elements (REEs) are essential raw materials for emerging renewable energy resources and ‘smart’ electronic devices. Global REE demand is slated to grow at an annual rate of 5% by 2020. This high growth rate will require a steady supply base of REEs in the long run. At present, China is responsible for 85% of global rare earth oxide (REO) production. To overcome this monopolistic supply situation, new strategies and investments are necessary to satisfy domestic supply demands. Concurrently, environmental, economic, and social problems arising from REE mining must be addressed. There is an urgent need to develop efficientmore » REE recycling techniques from end-of-life products, technologies to minimize the amount of REEs required per unit device, and methods to recover them from fly ash or fossil fuel-burning wastes.« less

  1. Global demand for rare earth resources and strategies for green mining.

    PubMed

    Dutta, Tanushree; Kim, Ki-Hyun; Uchimiya, Minori; Kwon, Eilhann E; Jeon, Byong-Hun; Deep, Akash; Yun, Seong-Taek

    2016-10-01

    Rare earth elements (REEs) are essential raw materials for emerging renewable energy resources and 'smart' electronic devices. Global REE demand is slated to grow at an annual rate of 5% by 2020. This high growth rate will require a steady supply base of REEs in the long run. At present, China is responsible for 85% of global rare earth oxide (REO) production. To overcome this monopolistic supply situation, new strategies and investments are necessary to satisfy domestic supply demands. Concurrently, environmental, economic, and social problems arising from REE mining must be addressed. There is an urgent need to develop efficient REE recycling techniques from end-of-life products, technologies to minimize the amount of REEs required per unit device, and methods to recover them from fly ash or fossil fuel-burning wastes. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Stress-Survival Gene Identification From an Acid Mine Drainage Algal Mat Community

    NASA Astrophysics Data System (ADS)

    Urbina-Navarrete, J.; Fujishima, K.; Paulino-Lima, I. G.; Rothschild-Mancinelli, B.; Rothschild, L. J.

    2014-12-01

    Microbial communities from acid mine drainage environments are exposed to multiple stressors to include low pH, high dissolved metal loads, seasonal freezing, and desiccation. The microbial and algal communities that inhabit these niche environments have evolved strategies that allow for their ecological success. Metagenomic analyses are useful in identifying species diversity, however they do not elucidate the mechanisms that allow for the resilience of a community under these extreme conditions. Many known or predicted genes encode for protein products that are unknown, or similarly, many proteins cannot be traced to their gene of origin. This investigation seeks to identify genes that are active in an algal consortium during stress from living in an acid mine drainage environment. Our approach involves using the entire community transcriptome for a functional screen in an Escherichia coli host. This approach directly targets the genes involved in survival, without need for characterizing the members of the consortium.The consortium was harvested and stressed with conditions similar to the native environment it was collected from. Exposure to low pH (< 3.2), high metal load, desiccation, and deep freeze resulted in the expression of stress-induced genes that were transcribed into messenger RNA (mRNA). These mRNA transcripts were harvested to build complementary DNA (cDNA) libraries in E. coli. The transformed E. coli were exposed to the same stressors as the original algal consortium to select for surviving cells. Successful cells incorporated the transcripts that encode survival mechanisms, thus allowing for selection and identification of the gene(s) involved. Initial selection screens for freeze and desiccation tolerance have yielded E. coli that are 1 order of magnitude more resistant to freezing (0.01% survival of control with no transcript, 0.2% survival of E. coli with transcript) and 3 orders of magnitude more resistant to desiccation (0.005% survival of

  3. Mountaintop mining consequences

    Treesearch

    M.A. Palmer; E.S. Bernhardt; W.H. Schlesinger; K.N. Eshleman; E. Foufoula-Georgiou; M.S. Hendryx; A.D. Lemly; G.E. Likens; O.L. Loucks; M.E. Power; P.S. White; P.R. Wilcock

    2010-01-01

    There has been a global, 30-year increase in surface mining (1), which is now the dominant driver of land-use change in the central Appalachian ecoregion of the United States (2). One major form of such mining, mountaintop mining with valley fills (MTM/VF) (3), is widespread throughout eastern Kentucky, West Virginia (WV), and southwestern Virginia. Upper elevation...

  4. Function Clustering Self-Organization Maps (FCSOMs) for mining differentially expressed genes in Drosophila and its correlation with the growth medium.

    PubMed

    Liu, L L; Liu, M J; Ma, M

    2015-09-28

    The central task of this study was to mine the gene-to-medium relationship. Adequate knowledge of this relationship could potentially improve the accuracy of differentially expressed gene mining. One of the approaches to differentially expressed gene mining uses conventional clustering algorithms to identify the gene-to-medium relationship. Compared to conventional clustering algorithms, self-organization maps (SOMs) identify the nonlinear aspects of the gene-to-medium relationships by mapping the input space into another higher dimensional feature space. However, SOMs are not suitable for huge datasets consisting of millions of samples. Therefore, a new computational model, the Function Clustering Self-Organization Maps (FCSOMs), was developed. FCSOMs take advantage of the theory of granular computing as well as advanced statistical learning methodologies, and are built specifically for each information granule (a function cluster of genes), which are intelligently partitioned by the clustering algorithm provided by the DAVID_6.7 software platform. However, only the gene functions, and not their expression values, are considered in the fuzzy clustering algorithm of DAVID. Compared to the clustering algorithm of DAVID, these experimental results show a marked improvement in the accuracy of classification with the application of FCSOMs. FCSOMs can handle huge datasets and their complex classification problems, as each FCSOM (modeled for each function cluster) can be easily parallelized.

  5. System Analysis of LWDH Related Genes Based on Text Mining in Biological Networks

    PubMed Central

    Miao, Yingbo; Zhang, Liangcai; Wang, Yang; Feng, Rennan; Yang, Lei; Zhang, Shihua; Jiang, Yongshuai; Liu, Guiyou

    2014-01-01

    Liuwei-dihuang (LWDH) is widely used in traditional Chinese medicine (TCM), but its molecular mechanism about gene interactions is unclear. LWDH genes were extracted from the existing literatures based on text mining technology. To simulate the complex molecular interactions that occur in the whole body, protein-protein interaction networks (PPINs) were constructed and the topological properties of LWDH genes were analyzed. LWDH genes have higher centrality properties and may play important roles in the complex biological network environment. It was also found that the distances within LWDH genes are smaller than expected, which means that the communication of LWDH genes during the biological process is rapid and effectual. At last, a comprehensive network of LWDH genes, including the related drugs and regulatory pathways at both the transcriptional and posttranscriptional levels, was constructed and analyzed. The biological network analysis strategy used in this study may be helpful for the understanding of molecular mechanism of TCM. PMID:25243143

  6. Global gene expression analysis by combinatorial optimization.

    PubMed

    Ameur, Adam; Aurell, Erik; Carlsson, Mats; Westholm, Jakub Orzechowski

    2004-01-01

    Generally, there is a trade-off between methods of gene expression analysis that are precise but labor-intensive, e.g. RT-PCR, and methods that scale up to global coverage but are not quite as quantitative, e.g. microarrays. In the present paper, we show how how a known method of gene expression profiling (K. Kato, Nucleic Acids Res. 23, 3685-3690 (1995)), which relies on a fairly small number of steps, can be turned into a global gene expression measurement by advanced data post-processing, with potentially little loss of accuracy. Post-processing here entails solving an ancillary combinatorial optimization problem. Validation is performed on in silico experiments generated from the FANTOM data base of full-length mouse cDNA. We present two variants of the method. One uses state-of-the-art commercial software for solving problems of this kind, the other a code developed by us specifically for this purpose, released in the public domain under GPL license.

  7. Gold Mining in the Peruvian Amazon: Global Prices, Deforestation, and Mercury Imports

    PubMed Central

    Swenson, Jennifer J.; Carter, Catherine E.; Domec, Jean-Christophe; Delgado, Cesar I.

    2011-01-01

    Many factors such as poverty, ineffective institutions and environmental regulations may prevent developing countries from managing how natural resources are extracted to meet a strong market demand. Extraction for some resources has reached such proportions that evidence is measurable from space. We present recent evidence of the global demand for a single commodity and the ecosystem destruction resulting from commodity extraction, recorded by satellites for one of the most biodiverse areas of the world. We find that since 2003, recent mining deforestation in Madre de Dios, Peru is increasing nonlinearly alongside a constant annual rate of increase in international gold price (∼18%/yr). We detect that the new pattern of mining deforestation (1915 ha/year, 2006–2009) is outpacing that of nearby settlement deforestation. We show that gold price is linked with exponential increases in Peruvian national mercury imports over time (R2 = 0.93, p = 0.04, 2003–2009). Given the past rates of increase we predict that mercury imports may more than double for 2011 (∼500 t/year). Virtually all of Peru's mercury imports are used in artisanal gold mining. Much of the mining increase is unregulated/artisanal in nature, lacking environmental impact analysis or miner education. As a result, large quantities of mercury are being released into the atmosphere, sediments and waterways. Other developing countries endowed with gold deposits are likely experiencing similar environmental destruction in response to recent record high gold prices. The increasing availability of satellite imagery ought to evoke further studies linking economic variables with land use and cover changes on the ground. PMID:21526143

  8. Gold mining in the Peruvian Amazon: global prices, deforestation, and mercury imports.

    PubMed

    Swenson, Jennifer J; Carter, Catherine E; Domec, Jean-Christophe; Delgado, Cesar I

    2011-04-19

    Many factors such as poverty, ineffective institutions and environmental regulations may prevent developing countries from managing how natural resources are extracted to meet a strong market demand. Extraction for some resources has reached such proportions that evidence is measurable from space. We present recent evidence of the global demand for a single commodity and the ecosystem destruction resulting from commodity extraction, recorded by satellites for one of the most biodiverse areas of the world. We find that since 2003, recent mining deforestation in Madre de Dios, Peru is increasing nonlinearly alongside a constant annual rate of increase in international gold price (∼18%/yr). We detect that the new pattern of mining deforestation (1915 ha/year, 2006-2009) is outpacing that of nearby settlement deforestation. We show that gold price is linked with exponential increases in Peruvian national mercury imports over time (R(2) = 0.93, p = 0.04, 2003-2009). Given the past rates of increase we predict that mercury imports may more than double for 2011 (∼500 t/year). Virtually all of Peru's mercury imports are used in artisanal gold mining. Much of the mining increase is unregulated/artisanal in nature, lacking environmental impact analysis or miner education. As a result, large quantities of mercury are being released into the atmosphere, sediments and waterways. Other developing countries endowed with gold deposits are likely experiencing similar environmental destruction in response to recent record high gold prices. The increasing availability of satellite imagery ought to evoke further studies linking economic variables with land use and cover changes on the ground.

  9. Heat acclimation: Gold mines and genes

    PubMed Central

    Schneider, Suzanne M.

    2016-01-01

    ABSTRACT The underground gold mines of South Africa offer a unique historical setting to study heat acclimation. The early heat stress research was conducted and described by a young medical officer, Dr. Aldo Dreosti. He developed practical and specific protocols to first assess the heat tolerance of thousands of new mining recruits, and then used the screening results as the basis for assigning a heat acclimation protocol. The mines provide an interesting paradigm where the prevention of heat stroke evolved from genetic selection, where only Black natives were recruited due to a false assumption of their intrinsic tolerance to heat, to our current appreciation of the epigenetic and other molecular adaptations that occur with exposure to heat. PMID:28090556

  10. Microbial and geochemical assessment of bauxitic un-mined and post-mined chronosequence soils from Mocho Mountains, Jamaica.

    PubMed

    Lewis, Dawn E; Chauhan, Ashvini; White, John R; Overholt, Will; Green, Stefan J; Jasrotia, Puja; Wafula, Denis; Jagoe, Charles

    2012-10-01

    Microorganisms are very sensitive to environmental change and can be used to gauge anthropogenic impacts and even predict restoration success of degraded environments. Here, we report assessment of bauxite mining activities on soil biogeochemistry and microbial community structure using un-mined and three post-mined sites in Jamaica. The post-mined soils represent a chronosequence, undergoing restoration since 1987, 1997, and 2007. Soils were collected during dry and wet seasons and analyzed for pH, organic matter (OM), total carbon (TC), nitrogen (TN), and phosphorus. The microbial community structure was assessed through quantitative PCR and massively parallel bacterial ribosomal RNA (rRNA) gene sequencing. Edaphic factors and microbial community composition were analyzed using multivariate statistical approaches and revealed a significant, negative impact of mining on soil that persisted even after greater than 20 years of restoration. Seasonal fluctuations contributed to variation in measured soil properties and community composition, but they were minor in comparison to long-term effects of mining. In both seasons, post-mined soils were higher in pH but OM, TC, and TN decreased. Bacterial rRNA gene analyses demonstrated a general decrease in diversity in post-mined soils and up to a 3-log decrease in rRNA gene abundance. Community composition analyses demonstrated that bacteria from the Proteobacteria (α, β, γ, δ), Acidobacteria, and Firmicutes were abundant in all soils. The abundance of Firmicutes was elevated in newer post-mined soils relative to the un-mined soil, and this contrasted a decrease, relative to un-mined soils, in proteobacterial and acidobacterial rRNA gene abundances. Our study indicates long-lasting impacts of mining activities to soil biogeochemical and microbial properties with impending loss in soil productivity.

  11. ChimerDB 3.0: an enhanced database for fusion genes from cancer transcriptome and literature data mining.

    PubMed

    Lee, Myunggyo; Lee, Kyubum; Yu, Namhee; Jang, Insu; Choi, Ikjung; Kim, Pora; Jang, Ye Eun; Kim, Byounggun; Kim, Sunkyu; Lee, Byungwook; Kang, Jaewoo; Lee, Sanghyuk

    2017-01-04

    Fusion gene is an important class of therapeutic targets and prognostic markers in cancer. ChimerDB is a comprehensive database of fusion genes encompassing analysis of deep sequencing data and manual curations. In this update, the database coverage was enhanced considerably by adding two new modules of The Cancer Genome Atlas (TCGA) RNA-Seq analysis and PubMed abstract mining. ChimerDB 3.0 is composed of three modules of ChimerKB, ChimerPub and ChimerSeq. ChimerKB represents a knowledgebase including 1066 fusion genes with manual curation that were compiled from public resources of fusion genes with experimental evidences. ChimerPub includes 2767 fusion genes obtained from text mining of PubMed abstracts. ChimerSeq module is designed to archive the fusion candidates from deep sequencing data. Importantly, we have analyzed RNA-Seq data of the TCGA project covering 4569 patients in 23 cancer types using two reliable programs of FusionScan and TopHat-Fusion. The new user interface supports diverse search options and graphic representation of fusion gene structure. ChimerDB 3.0 is available at http://ercsb.ewha.ac.kr/fusiongene/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Horizontal gene transfer in an acid mine drainage microbial community.

    PubMed

    Guo, Jiangtao; Wang, Qi; Wang, Xiaoqi; Wang, Fumeng; Yao, Jinxian; Zhu, Huaiqiu

    2015-07-04

    Horizontal gene transfer (HGT) has been widely identified in complete prokaryotic genomes. However, the roles of HGT among members of a microbial community and in evolution remain largely unknown. With the emergence of metagenomics, it is nontrivial to investigate such horizontal flow of genetic materials among members in a microbial community from the natural environment. Because of the lack of suitable methods for metagenomics gene transfer detection, microorganisms from a low-complexity community acid mine drainage (AMD) with near-complete genomes were used to detect possible gene transfer events and suggest the biological significance. Using the annotation of coding regions by the current tools, a phylogenetic approach, and an approximately unbiased test, we found that HGTs in AMD organisms are not rare, and we predicted 119 putative transferred genes. Among them, 14 HGT events were determined to be transfer events among the AMD members. Further analysis of the 14 transferred genes revealed that the HGT events affected the functional evolution of archaea or bacteria in AMD, and it probably shaped the community structure, such as the dominance of G-plasma in archaea in AMD through HGT. Our study provides a novel insight into HGT events among microorganisms in natural communities. The interconnectedness between HGT and community evolution is essential to understand microbial community formation and development.

  13. Mining functionally relevant gene sets for analyzing physiologically novel clinical expression data.

    PubMed

    Turcan, Sevin; Vetter, Douglas E; Maron, Jill L; Wei, Xintao; Slonim, Donna K

    2011-01-01

    Gene set analyses have become a standard approach for increasing the sensitivity of transcriptomic studies. However, analytical methods incorporating gene sets require the availability of pre-defined gene sets relevant to the underlying physiology being studied. For novel physiological problems, relevant gene sets may be unavailable or existing gene set databases may bias the results towards only the best-studied of the relevant biological processes. We describe a successful attempt to mine novel functional gene sets for translational projects where the underlying physiology is not necessarily well characterized in existing annotation databases. We choose targeted training data from public expression data repositories and define new criteria for selecting biclusters to serve as candidate gene sets. Many of the discovered gene sets show little or no enrichment for informative Gene Ontology terms or other functional annotation. However, we observe that such gene sets show coherent differential expression in new clinical test data sets, even if derived from different species, tissues, and disease states. We demonstrate the efficacy of this method on a human metabolic data set, where we discover novel, uncharacterized gene sets that are diagnostic of diabetes, and on additional data sets related to neuronal processes and human development. Our results suggest that our approach may be an efficient way to generate a collection of gene sets relevant to the analysis of data for novel clinical applications where existing functional annotation is relatively incomplete.

  14. Text Mining Effectively Scores and Ranks the Literature for Improving Chemical-Gene-Disease Curation at the Comparative Toxicogenomics Database

    PubMed Central

    Johnson, Robin J.; Lay, Jean M.; Lennon-Hopkins, Kelley; Saraceni-Richards, Cynthia; Sciaky, Daniela; Murphy, Cynthia Grondin; Mattingly, Carolyn J.

    2013-01-01

    The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/) is a public resource that curates interactions between environmental chemicals and gene products, and their relationships to diseases, as a means of understanding the effects of environmental chemicals on human health. CTD provides a triad of core information in the form of chemical-gene, chemical-disease, and gene-disease interactions that are manually curated from scientific articles. To increase the efficiency, productivity, and data coverage of manual curation, we have leveraged text mining to help rank and prioritize the triaged literature. Here, we describe our text-mining process that computes and assigns each article a document relevancy score (DRS), wherein a high DRS suggests that an article is more likely to be relevant for curation at CTD. We evaluated our process by first text mining a corpus of 14,904 articles triaged for seven heavy metals (cadmium, cobalt, copper, lead, manganese, mercury, and nickel). Based upon initial analysis, a representative subset corpus of 3,583 articles was then selected from the 14,094 articles and sent to five CTD biocurators for review. The resulting curation of these 3,583 articles was analyzed for a variety of parameters, including article relevancy, novel data content, interaction yield rate, mean average precision, and biological and toxicological interpretability. We show that for all measured parameters, the DRS is an effective indicator for scoring and improving the ranking of literature for the curation of chemical-gene-disease information at CTD. Here, we demonstrate how fully incorporating text mining-based DRS scoring into our curation pipeline enhances manual curation by prioritizing more relevant articles, thereby increasing data content, productivity, and efficiency. PMID:23613709

  15. Global gene mining and the pharmaceutical industry

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Knudsen, Lisbeth E.

    2005-09-01

    Worldwide efforts are ongoing in optimizing medical treatment by searching for the right medicine at the right dose for the individual. Metabolism is regulated by polymorphisms, which may be tested by relatively simple SNP analysis, however requiring DNA from the test individuals. Target genes for the efficiency of a given medicine or predisposition of a given disease are also subject to population studies, e.g., in Iceland, Estonia, Sweden, etc. For hypothesis testing and generation, several bio-banks with samples from patients and healthy persons within the pharmaceutical industry have been established during the past 10 years. Thus, more than 100,000 samplesmore » are stored in the freezers of either the pharmaceutical companies or their contractual partners at universities and test institutions. Ethical issues related to data protection of the individuals providing samples to bio-banks are several: nature and extent of information prior to consent, coverage of the consent given by the study person, labeling and storage of the sample and data (coded or anonymized). In general, genetic test data, once obtained, are permanent and cannot be changed. The test data may imply information that is not beneficial to the patient and his/her family (e.g., employment opportunities, insurance, etc.). Furthermore, there may be a long latency between the analysis of the genetic test and the clinical expression of the disease and wide differences in the disease patterns. Consequently, information about some genetic test data may stigmatize patients leading to poor quality of life. This has raised the issue of 'genetic exceptionalism' justifying specific regulation of use of genetic information. Discussions on how to handle sampling and data are ongoing within the industry and the regulatory sphere, the European Agency for the Evaluation of Medicinal Products (EMEA) having issued a position paper, the Council for International Organizations of Medical Sciences (CIOMS) having a

  16. Examination of Global Methylation and Targeted Imprinted Genes in Prader-Willi Syndrome.

    PubMed

    Manzardo, A M; Butler, M G

    2016-01-01

    Methylation changes observed in Prader-Willi syndrome (PWS) may impact global methylation as well as regional methylation status of imprinted genes on chromosome 15 (in cis) or other imprinted obesity-related genes on other chromosomes (in trans) leading to differential effects on gene expression impacting obesity phenotype unique to (PWS). Characterize the global methylation profiles and methylation status for select imprinted genes associated with obesity phenotype in a well-characterized imprinted, obesity-related syndrome (PWS) relative to a cohort of obese and non-obese individuals. Global methylation was assayed using two methodologies: 1) enriched LINE-1 repeat sequences by EpigenDx and 2) ELISA-based immunoassay method sensitive to genomic 5-methylcytosine by Epigentek. Target gene methylation patterns at selected candidate obesity gene loci were determined using methylation-specific PCR. Study participants were recruited as part of an ongoing research program on obesity-related genomics and Prader-Willi syndrome. Individuals with non-syndromic obesity (N=26), leanness (N=26) and PWS (N=39). A detailed characterization of the imprinting status of select target genes within the critical PWS 15q11-q13 genomic region showed enhanced cis but not trans methylation of imprinted genes. No significant differences in global methylation were found between non-syndromic obese, PWS or non-obese controls. None. Percentage methylation and the methylation index. The methylation abnormality in PWS due to errors of genomic imprinting effects both upstream and downstream effectors in the 15q11-q13 region showing enhanced cis but not trans methylation of imprinted genes. Obesity in our subject cohorts did not appear to impact global methylation levels using the described methodology.

  17. ESTIMATE OF GLOBAL METHANE EMISSIONS FROM COAL MINES

    EPA Science Inventory

    Country-specific emissions of methane (CH4) from underground coal mines, surface coal mines, and coal crushing and transport operations are estimated for 1989. Emissions for individual countries are estimated by using two sets of regression equations (R2 values range from 0.56 to...

  18. Mining the transcriptomes of four commercially important shellfish species for single nucleotide polymorphisms within biomineralization genes.

    PubMed

    Vendrami, David L J; Shah, Abhijeet; Telesca, Luca; Hoffman, Joseph I

    2016-06-01

    Transcriptional profiling not only provides insights into patterns of gene expression, but also generates sequences that can be mined for molecular markers, which in turn can be used for population genetic studies. As part of a large-scale effort to better understand how commercially important European shellfish species may respond to ocean acidification, we therefore mined the transcriptomes of four species (the Pacific oyster Crassostrea gigas, the blue mussel Mytilus edulis, the great scallop Pecten maximus and the blunt gaper Mya truncata) for single nucleotide polymorphisms (SNPs). Illumina data for C. gigas, M. edulis and P. maximus and 454 data for M. truncata were interrogated using GATK and SWAP454 respectively to identify between 8267 and 47,159 high quality SNPs per species (total=121,053 SNPs residing within 34,716 different contigs). We then annotated the transcripts containing SNPs to reveal homology to diverse genes. Finally, as oceanic pH affects the ability of organisms to incorporate calcium carbonate, we honed in on genes implicated in the biomineralization process to identify a total of 1899 SNPs in 157 genes. These provide good candidates for biomarkers with which to study patterns of selection in natural or experimental populations. Copyright © 2016 Elsevier B.V. All rights reserved.

  19. Mining the human gut microbiome for novel stress resistance genes

    PubMed Central

    Culligan, Eamonn P.; Marchesi, Julian R.; Hill, Colin; Sleator, Roy D.

    2012-01-01

    With the rapid advances in sequencing technologies in recent years, the human genome is now considered incomplete without the complementing microbiome, which outnumbers human genes by a factor of one hundred. The human microbiome, and more specifically the gut microbiome, has received considerable attention and research efforts over the past decade. Many studies have identified and quantified “who is there?,” while others have determined some of their functional capacity, or “what are they doing?” In a recent study, we identified novel salt-tolerance loci from the human gut microbiome using combined functional metagenomic and bioinformatics based approaches. Herein, we discuss the identified loci, their role in salt-tolerance and their importance in the context of the gut environment. We also consider the utility and power of functional metagenomics for mining such environments for novel genes and proteins, as well as the implications and possible applications for future research. PMID:22688726

  20. Functional Genome Mining for Metabolites Encoded by Large Gene Clusters through Heterologous Expression of a Whole-Genome Bacterial Artificial Chromosome Library in Streptomyces spp.

    PubMed Central

    Xu, Min; Wang, Yemin; Zhao, Zhilong; Gao, Guixi; Huang, Sheng-Xiong; Kang, Qianjin; He, Xinyi; Lin, Shuangjun; Pang, Xiuhua; Deng, Zixin

    2016-01-01

    ABSTRACT Genome sequencing projects in the last decade revealed numerous cryptic biosynthetic pathways for unknown secondary metabolites in microbes, revitalizing drug discovery from microbial metabolites by approaches called genome mining. In this work, we developed a heterologous expression and functional screening approach for genome mining from genomic bacterial artificial chromosome (BAC) libraries in Streptomyces spp. We demonstrate mining from a strain of Streptomyces rochei, which is known to produce streptothricins and borrelidin, by expressing its BAC library in the surrogate host Streptomyces lividans SBT5, and screening for antimicrobial activity. In addition to the successful capture of the streptothricin and borrelidin biosynthetic gene clusters, we discovered two novel linear lipopeptides and their corresponding biosynthetic gene cluster, as well as a novel cryptic gene cluster for an unknown antibiotic from S. rochei. This high-throughput functional genome mining approach can be easily applied to other streptomycetes, and it is very suitable for the large-scale screening of genomic BAC libraries for bioactive natural products and the corresponding biosynthetic pathways. IMPORTANCE Microbial genomes encode numerous cryptic biosynthetic gene clusters for unknown small metabolites with potential biological activities. Several genome mining approaches have been developed to activate and bring these cryptic metabolites to biological tests for future drug discovery. Previous sequence-guided procedures relied on bioinformatic analysis to predict potentially interesting biosynthetic gene clusters. In this study, we describe an efficient approach based on heterologous expression and functional screening of a whole-genome library for the mining of bioactive metabolites from Streptomyces. The usefulness of this function-driven approach was demonstrated by the capture of four large biosynthetic gene clusters for metabolites of various chemical types, including

  1. Examination of Global Methylation and Targeted Imprinted Genes in Prader-Willi Syndrome

    PubMed Central

    Manzardo, AM; Butler, MG

    2016-01-01

    Context Methylation changes observed in Prader-Willi syndrome (PWS) may impact global methylation as well as regional methylation status of imprinted genes on chromosome 15 (in cis) or other imprinted obesity-related genes on other chromosomes (in trans) leading to differential effects on gene expression impacting obesity phenotype unique to (PWS). Objective Characterize the global methylation profiles and methylation status for select imprinted genes associated with obesity phenotype in a well-characterized imprinted, obesity-related syndrome (PWS) relative to a cohort of obese and non-obese individuals. Design Global methylation was assayed using two methodologies: 1) enriched LINE-1 repeat sequences by EpigenDx and 2) ELISA-based immunoassay method sensitive to genomic 5-methylcytosine by Epigentek. Target gene methylation patterns at selected candidate obesity gene loci were determined using methylation-specific PCR. Setting Study participants were recruited as part of an ongoing research program on obesity-related genomics and Prader-Willi syndrome. Participants Individuals with non-syndromic obesity (N=26), leanness (N=26) and PWS (N=39). Results A detailed characterization of the imprinting status of select target genes within the critical PWS 15q11-q13 genomic region showed enhanced cis but not trans methylation of imprinted genes. No significant differences in global methylation were found between non-syndromic obese, PWS or non-obese controls. Intervention None. Main outcome measures Percentage methylation and the methylation index. Conclusion The methylation abnormality in PWS due to errors of genomic imprinting effects both upstream and downstream effectors in the 15q11-q13 region showing enhanced cis but not trans methylation of imprinted genes. Obesity in our subject cohorts did not appear to impact global methylation levels using the described methodology. PMID:28111641

  2. Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new GO relationships.

    PubMed

    Manda, Prashanti; McCarthy, Fiona; Bridges, Susan M

    2013-10-01

    The Gene Ontology (GO), a set of three sub-ontologies, is one of the most popular bio-ontologies used for describing gene product characteristics. GO annotation data containing terms from multiple sub-ontologies and at different levels in the ontologies is an important source of implicit relationships between terms from the three sub-ontologies. Data mining techniques such as association rule mining that are tailored to mine from multiple ontologies at multiple levels of abstraction are required for effective knowledge discovery from GO annotation data. We present a data mining approach, Multi-ontology data mining at All Levels (MOAL) that uses the structure and relationships of the GO to mine multi-ontology multi-level association rules. We introduce two interestingness measures: Multi-ontology Support (MOSupport) and Multi-ontology Confidence (MOConfidence) customized to evaluate multi-ontology multi-level association rules. We also describe a variety of post-processing strategies for pruning uninteresting rules. We use publicly available GO annotation data to demonstrate our methods with respect to two applications (1) the discovery of co-annotation suggestions and (2) the discovery of new cross-ontology relationships. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Association mining of mutated cancer genes in different clinical stages across 11 cancer types.

    PubMed

    Hu, Wangxiong; Li, Xiaofen; Wang, Tingzhang; Zheng, Shu

    2016-10-18

    Many studies have demonstrated that some genes (e.g. APC, BRAF, KRAS, PTEN, TP53) are frequently mutated in cancer, however, underlying mechanism that contributes to their high mutation frequency remains unclear. Here we used Apriori algorithm to find the frequent mutational gene sets (FMGSs) from 4,904 tumors across 11 cancer types as part of the TCGA Pan-Cancer effort and then mined the hidden association rules (ARs) within these FMGSs. Intriguingly, we found that well-known cancer driver genes such as BRAF, KRAS, PTEN, and TP53 were often co-occurred with other driver genes and FMGSs size peaked at an itemset size of 3~4 genes. Besides, the number and constitution of FMGS and ARs differed greatly among different cancers and stages. In addition, FMGS and ARs were rare in endocrine-related cancers such as breast carcinoma, ovarian cystadenocarcinoma, and thyroid carcinoma, but abundant in cancers contact directly with external environments such as skin melanoma and stomach adenocarcinoma. Furthermore, we observed more rules in stage IV than in other stages, indicating that distant metastasis needed more sophisticated gene regulatory network.

  4. Polymorphisms in genes encoding potential mercury transporters and urine mercury concentrations in populations exposed to mercury vapor from gold mining.

    PubMed

    Engström, Karin; Ameer, Shegufta; Bernaudat, Ludovic; Drasch, Gustav; Baeuml, Jennifer; Skerfving, Staffan; Bose-O'Reilly, Stephan; Broberg, Karin

    2013-01-01

    Elemental mercury (Hg0) is widely used in small-scale gold mining. Persons working or living in mining areas have high urinary concentrations of Hg (U-Hg). Differences in genes encoding potential Hg-transporters may affect uptake and elimination of Hg. We aimed to identify single nucleotide polymorphisms (SNPs) in Hg-transporter genes that modify U-Hg. Men and women (1,017) from Indonesia, the Philippines, Tanzania, and Zimbabwe were classified either as controls (no Hg exposure from gold mining) or as having low (living in a gold-mining area) or high exposure (working as gold miners). U-Hg was analyzed by cold-vapor atomic absorption spectrometry. Eighteen SNPs in eight Hg-transporter genes were analyzed. U-Hg concentrations were higher among ABCC2/MRP2 rs1885301 A-allele carriers than among GG homozygotes in all populations, though differences were not statistically significant in most cases. MRP2 SNPs showed particularly strong associations with U-Hg in the subgroup with highest exposure (miners in Zimbabwe), whereas rs1885301 A-allele carriers had higher U-Hg than GG homozygotes [geometric mean (GM): 36.4 µg/g creatinine vs. 21.9; p = 0.027], rs2273697 GG homozygotes had higher U-Hg than A-allele carriers (GM: 37.4 vs. 16.7; p = 0.001), and rs717620 A-allele carriers had higher U-Hg than GG homozygotes (GM: 83 vs. 28; p = 0.084). The SLC7A5/LAT1 rs33916661 GG genotype was associated with higher U-Hg in all populations (statistically significant for all Tanzanians combined). SNPs in SLC22A6/OAT1 (rs4149170) and SLC22A8/OAT3 (rs4149182) were associated with U-Hg mainly in the Tanzanian study groups. SNPs in putative Hg-transporter genes may influence U-Hg concentrations.

  5. Polymorphisms in Genes Encoding Potential Mercury Transporters and Urine Mercury Concentrations in Populations Exposed to Mercury Vapor from Gold Mining

    PubMed Central

    Ameer, Shegufta; Bernaudat, Ludovic; Drasch, Gustav; Baeuml, Jennifer; Skerfving, Staffan; Bose-O’Reilly, Stephan; Broberg, Karin

    2012-01-01

    Background: Elemental mercury (Hg0) is widely used in small-scale gold mining. Persons working or living in mining areas have high urinary concentrations of Hg (U-Hg). Differences in genes encoding potential Hg-transporters may affect uptake and elimination of Hg. Objective: We aimed to identify single nucleotide polymorphisms (SNPs) in Hg-transporter genes that modify U-Hg. Methods: Men and women (1,017) from Indonesia, the Philippines, Tanzania, and Zimbabwe were classified either as controls (no Hg exposure from gold mining) or as having low (living in a gold-mining area) or high exposure (working as gold miners). U-Hg was analyzed by cold-vapor atomic absorption spectrometry. Eighteen SNPs in eight Hg-transporter genes were analyzed. Results: U-Hg concentrations were higher among ABCC2/MRP2 rs1885301 A–allele carriers than among GG homozygotes in all populations, though differences were not statistically significant in most cases. MRP2 SNPs showed particularly strong associations with U-Hg in the subgroup with highest exposure (miners in Zimbabwe), whereas rs1885301 A–allele carriers had higher U-Hg than GG homozygotes [geometric mean (GM): 36.4 µg/g creatinine vs. 21.9; p = 0.027], rs2273697 GG homozygotes had higher U-Hg than A–allele carriers (GM: 37.4 vs. 16.7; p = 0.001), and rs717620 A–allele carriers had higher U-Hg than GG homozygotes (GM: 83 vs. 28; p = 0.084). The SLC7A5/LAT1 rs33916661 GG genotype was associated with higher U-Hg in all populations (statistically significant for all Tanzanians combined). SNPs in SLC22A6/OAT1 (rs4149170) and SLC22A8/OAT3 (rs4149182) were associated with U-Hg mainly in the Tanzanian study groups. Conclusions: SNPs in putative Hg-transporter genes may influence U-Hg concentrations. PMID:23052037

  6. Growth-rate dependent global effects on gene expression in bacteria

    PubMed Central

    Klumpp, Stefan; Zhang, Zhongge; Hwa, Terence

    2010-01-01

    Summary Bacterial gene expression depends not only on specific regulations but also directly on bacterial growth, because important global parameters such as the abundance of RNA polymerases and ribosomes are all growth-rate dependent. Understanding these global effects is necessary for a quantitative understanding of gene regulation and for the robust design of synthetic genetic circuits. The observed growth-rate dependence of constitutive gene expression can be explained by a simple model using the measured growth-rate dependence of the relevant cellular parameters. More complex growth dependences for genetic circuits involving activators, repressors and feedback control were analyzed, and salient features were verified experimentally using synthetic circuits. The results suggest a novel feedback mechanism mediated by general growth-dependent effects and not requiring explicit gene regulation, if the expressed protein affects cell growth. This mechanism can lead to growth bistability and promote the acquisition of important physiological functions such as antibiotic resistance and tolerance (persistence). PMID:20064380

  7. Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents.

    PubMed

    Usie, Anabel; Karathia, Hiren; Teixidó, Ivan; Alves, Rui; Solsona, Francesc

    2014-01-01

    One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this 'up-to-dateness' came at the cost of large execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, Pathways, or any combination of the three types of entities and graphically represent those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is as least as good as that of other comparable applications (STRING and iHOP). In addition, we also show that the identification of GO processes is on par to that reported in the latest BioCreAtIvE challenge. Finally, we also report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains 'up-to-dateness' of the results. http://metres.udl.cat/index.php/downloads, metres.cmb@gmail.com.

  8. Local and global responses in complex gene regulation networks

    NASA Astrophysics Data System (ADS)

    Tsuchiya, Masa; Selvarajoo, Kumar; Piras, Vincent; Tomita, Masaru; Giuliani, Alessandro

    2009-04-01

    An exacerbated sensitivity to apparently minor stimuli and a general resilience of the entire system stay together side-by-side in biological systems. This apparent paradox can be explained by the consideration of biological systems as very strongly interconnected network systems. Some nodes of these networks, thanks to their peculiar location in the network architecture, are responsible for the sensitivity aspects, while the large degree of interconnection is at the basis of the resilience properties of the system. One relevant feature of the high degree of connectivity of gene regulation networks is the emergence of collective ordered phenomena influencing the entire genome and not only a specific portion of transcripts. The great majority of existing gene regulation models give the impression of purely local ‘hard-wired’ mechanisms disregarding the emergence of global ordered behavior encompassing thousands of genes while the general, genome wide, aspects are less known. Here we address, on a data analysis perspective, the discrimination between local and global scale regulations, this goal was achieved by means of the examination of two biological systems: innate immune response in macrophages and oscillating growth dynamics in yeast. Our aim was to reconcile the ‘hard-wired’ local view of gene regulation with a global continuous and scalable one borrowed from statistical physics. This reconciliation is based on the network paradigm in which the local ‘hard-wired’ activities correspond to the activation of specific crucial nodes in the regulation network, while the scalable continuous responses can be equated to the collective oscillations of the network after a perturbation.

  9. Mining high-throughput experimental data to link gene and function.

    PubMed

    Blaby-Haas, Crysten E; de Crécy-Lagard, Valérie

    2011-04-01

    Nearly 2200 genomes that encode around 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function, even when function is loosely and minimally defined as 'belonging to a superfamily'. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these unknowns with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function. Copyright © 2011 Elsevier Ltd. All rights reserved.

  10. Mining high-throughput experimental data to link gene and function

    PubMed Central

    Blaby-Haas, Crysten E.; de Crécy-Lagard, Valérie

    2011-01-01

    Nearly 2200 genomes encoding some 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function even when function is loosely and minimally defined as “belonging to a superfamily”. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these “unknowns” with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function. PMID:21310501

  11. GeoChip-Based Analysis of the Functional Gene Diversity and Metabolic Potential of Microbial Communities in Acid Mine Drainage▿ †

    PubMed Central

    Xie, Jianping; He, Zhili; Liu, Xinxing; Liu, Xueduan; Van Nostrand, Joy D.; Deng, Ye; Wu, Liyou; Zhou, Jizhong; Qiu, Guanzhou

    2011-01-01

    Acid mine drainage (AMD) is an extreme environment, usually with low pH and high concentrations of metals. Although the phylogenetic diversity of AMD microbial communities has been examined extensively, little is known about their functional gene diversity and metabolic potential. In this study, a comprehensive functional gene array (GeoChip 2.0) was used to analyze the functional diversity, composition, structure, and metabolic potential of AMD microbial communities from three copper mines in China. GeoChip data indicated that these microbial communities were functionally diverse as measured by the number of genes detected, gene overlapping, unique genes, and various diversity indices. Almost all key functional gene categories targeted by GeoChip 2.0 were detected in the AMD microbial communities, including carbon fixation, carbon degradation, methane generation, nitrogen fixation, nitrification, denitrification, ammonification, nitrogen reduction, sulfur metabolism, metal resistance, and organic contaminant degradation, which suggested that the functional gene diversity was higher than was previously thought. Mantel test results indicated that AMD microbial communities are shaped largely by surrounding environmental factors (e.g., S, Mg, and Cu). Functional genes (e.g., narG and norB) and several key functional processes (e.g., methane generation, ammonification, denitrification, sulfite reduction, and organic contaminant degradation) were significantly (P < 0.10) correlated with environmental variables. This study presents an overview of functional gene diversity and the structure of AMD microbial communities and also provides insights into our understanding of metabolic potential in AMD ecosystems. PMID:21097602

  12. Large Omnivore Movements in Response to Surface Mining and Mine Reclamation

    PubMed Central

    Cristescu, Bogdan; Stenhouse, Gordon B.; Boyce, Mark S.

    2016-01-01

    Increasing global demands have resulted in widespread proliferation of resource extraction. Scientists are challenged to develop environmental mitigation strategies that meet societal expectations of resource supply, while achieving minimal disruption to sensitive “wilderness” species. We used GPS collar data from a 9-year study on grizzly bears (Ursus arctos) (n = 18) in Alberta, Canada to assess movements and associated space use during versus after mining. Grizzly bear home range overlap with mined areas was lower during active mining except for females with cubs, that also had shortest movements on active mines. However, both females with cubs and males made shorter steps when on/close to mines following mine closure and reclamation. Our results show differences in bear movement and space-use strategies, with individuals from a key population segment (females with cubs) appearing most adaptable to mining disturbance. Preserving patches of original habitat, reclaiming the landscape and minimizing the risk of direct human-induced mortality during and after development can help conserve bears and other wildlife on industrially modified landscapes. PMID:26750094

  13. Large Omnivore Movements in Response to Surface Mining and Mine Reclamation.

    PubMed

    Cristescu, Bogdan; Stenhouse, Gordon B; Boyce, Mark S

    2016-01-11

    Increasing global demands have resulted in widespread proliferation of resource extraction. Scientists are challenged to develop environmental mitigation strategies that meet societal expectations of resource supply, while achieving minimal disruption to sensitive "wilderness" species. We used GPS collar data from a 9-year study on grizzly bears (Ursus arctos) (n = 18) in Alberta, Canada to assess movements and associated space use during versus after mining. Grizzly bear home range overlap with mined areas was lower during active mining except for females with cubs, that also had shortest movements on active mines. However, both females with cubs and males made shorter steps when on/close to mines following mine closure and reclamation. Our results show differences in bear movement and space-use strategies, with individuals from a key population segment (females with cubs) appearing most adaptable to mining disturbance. Preserving patches of original habitat, reclaiming the landscape and minimizing the risk of direct human-induced mortality during and after development can help conserve bears and other wildlife on industrially modified landscapes.

  14. Evidence for host genetic regulation of altered lipid metabolism in experimental toxoplasmosis supported with gene data mining results

    PubMed Central

    2017-01-01

    Toxoplasma gondii is one of the most successful parasites on Earth, infecting a wide array of mammals including one third of the global human population. The obligate intracellular protozoon is not capable of synthesizing cholesterol (Chl), and thus depends on uptake of host Chl for its own development. To explore the genetic regulation of previously observed lipid metabolism alterations during acute murine T. gondii infection, we here assessed total Chl and its fractions in serum and selected tissues at the pathophysiological and molecular level, and integrated the observed gene expression of selected molecules relevant for Chl metabolism, including its biosynthetic and export KEGG pathways, with the results of published transcriptomes obtained in similar murine models of T. gondii infection. The serum lipid status as well as the transcript levels of relevant genes in the brain and the liver were assessed in experimental models of acute and chronic toxoplasmosis in wild-type mice. The results showed that acute infection was associated with a decrease in Chl content in both the liver and periphery (brain, peripheral lymphocytes), and a decrease in Chl reverse transport. In contrast, in chronic infection, a return to normal levels of Chl metabolism has been noted. These changes corresponded to the brain and liver gene expression results as well as to data obtained via mining. We propose that the observed changes in Chl metabolism are part of the host defense response. Further insight into the lipid metabolism in T. gondii infection may provide novel targets for therapeutic agents. PMID:28459857

  15. MINE: Module Identification in Networks

    PubMed Central

    2011-01-01

    Background Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. Results MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the C. elegans protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. Conclusions MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both S. cerevisiae and C. elegans. PMID:21605434

  16. Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets.

    PubMed

    Salem, Saeed; Ozcaglar, Cagri

    2014-01-01

    Advances in genomic technologies have enabled the accumulation of vast amount of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression data, which suffers from spurious coexpression. We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on Human gene expression datasets show that the reported modules are functionally homogeneous as evident by their enrichment with biological process GO terms and KEGG pathways.

  17. Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm.

    PubMed

    Tchagang, Alain B; Phan, Sieu; Famili, Fazel; Shearer, Heather; Fobert, Pierre; Huang, Yi; Zou, Jitao; Huang, Daiqing; Cutler, Adrian; Liu, Ziying; Pan, Youlian

    2012-04-04

    Nowadays, it is possible to collect expression levels of a set of genes from a set of biological samples during a series of time points. Such data have three dimensions: gene-sample-time (GST). Thus they are called 3D microarray gene expression data. To take advantage of the 3D data collected, and to fully understand the biological knowledge hidden in the GST data, novel subspace clustering algorithms have to be developed to effectively address the biological problem in the corresponding space. We developed a subspace clustering algorithm called Order Preserving Triclustering (OPTricluster), for 3D short time-series data mining. OPTricluster is able to identify 3D clusters with coherent evolution from a given 3D dataset using a combinatorial approach on the sample dimension, and the order preserving (OP) concept on the time dimension. The fusion of the two methodologies allows one to study similarities and differences between samples in terms of their temporal expression profile. OPTricluster has been successfully applied to four case studies: immune response in mice infected by malaria (Plasmodium chabaudi), systemic acquired resistance in Arabidopsis thaliana, similarities and differences between inner and outer cotyledon in Brassica napus during seed development, and to Brassica napus whole seed development. These studies showed that OPTricluster is robust to noise and is able to detect the similarities and differences between biological samples. Our analysis showed that OPTricluster generally outperforms other well known clustering algorithms such as the TRICLUSTER, gTRICLUSTER and K-means; it is robust to noise and can effectively mine the biological knowledge hidden in the 3D short time-series gene expression data.

  18. Text mining and network analysis to find functional associations of genes in high altitude diseases.

    PubMed

    Bhasuran, Balu; Subramanian, Devika; Natarajan, Jeyakumar

    2018-05-02

    Travel to elevations above 2500 m is associated with the risk of developing one or more forms of acute altitude illness such as acute mountain sickness (AMS), high altitude cerebral edema (HACE) or high altitude pulmonary edema (HAPE). Our work aims to identify the functional association of genes involved in high altitude diseases. In this work we identified the gene networks responsible for high altitude diseases by using the principle of gene co-occurrence statistics from literature and network analysis. First, we mined the literature data from PubMed on high-altitude diseases, and extracted the co-occurring gene pairs. Next, based on their co-occurrence frequency, gene pairs were ranked. Finally, a gene association network was created using statistical measures to explore potential relationships. Network analysis results revealed that EPO, ACE, IL6 and TNF are the top five genes that were found to co-occur with 20 or more genes, while the association between EPAS1 and EGLN1 genes is strongly substantiated. The network constructed from this study proposes a large number of genes that work in-toto in high altitude conditions. Overall, the result provides a good reference for further study of the genetic relationships in high altitude diseases. Copyright © 2018 Elsevier Ltd. All rights reserved.

  19. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data.

    PubMed

    Hettne, Kristina M; Boorsma, André; van Dartel, Dorien A M; Goeman, Jelle J; de Jong, Esther; Piersma, Aldert H; Stierum, Rob H; Kleinjans, Jos C; Kors, Jan A

    2013-01-29

    Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity

  20. Functions and Unique Diversity of Genes and Microorganisms Involved in Arsenite Oxidation from the Tailings of a Realgar Mine

    PubMed Central

    E, Guoji; Wang, Jianing; Wang, Nian; Chen, Xiaoming; Mu, Yao; Li, Hao; Yang, Ye; Liu, Yichen; Wang, Yanxin

    2016-01-01

    ABSTRACT The tailings of the Shimen realgar mine have unique geochemical features. Arsenite oxidation is one of the major biogeochemical processes that occurs in the tailings. However, little is known about the functional and molecular aspects of the microbial community involved in arsenite oxidation. Here, we fully explored the functional and molecular features of the microbial communities from the tailings of the Shimen realgar mine. We collected six samples of tailings from sites A, B, C, D, E, and F. Microcosm assays indicated that all of the six sites contain both chemoautotrophic and heterotrophic arsenite-oxidizing microorganisms; their activities differed considerably from each other. The microbial arsenite-oxidizing activities show a positive correlation with soluble arsenic concentrations. The microbial communities of the six sites contain 40 phyla of bacteria and 2 phyla of archaea that show extremely high diversity. Soluble arsenic, sulfate, pH, and total organic carbon (TOC) are the key environmental factors that shape the microbial communities. We further identified 114 unique arsenite oxidase genes from the samples; all of them code for new or new-type arsenite oxidases. We also isolated 10 novel arsenite oxidizers from the samples, of which 4 are chemoautotrophic and 6 are heterotrophic. These data highlight the unique diversities of the arsenite-oxidizing microorganisms and their oxidase genes from the tailings of the Shimen realgar mine. To the best of our knowledge, this is the first report describing the functional and molecular features of microbial communities from the tailings of a realgar mine. IMPORTANCE This study focused on the functional and molecular characterizations of microbial communities from the tailings of the Shimen realgar mine. We fully explored, for the first time, the arsenite-oxidizing activities and the functional gene diversities of microorganisms from the tailings, as well as the correlation of the microbial activities

  1. Functions and Unique Diversity of Genes and Microorganisms Involved in Arsenite Oxidation from the Tailings of a Realgar Mine.

    PubMed

    Zeng, Xian-Chun; E, Guoji; Wang, Jianing; Wang, Nian; Chen, Xiaoming; Mu, Yao; Li, Hao; Yang, Ye; Liu, Yichen; Wang, Yanxin

    2016-12-15

    The tailings of the Shimen realgar mine have unique geochemical features. Arsenite oxidation is one of the major biogeochemical processes that occurs in the tailings. However, little is known about the functional and molecular aspects of the microbial community involved in arsenite oxidation. Here, we fully explored the functional and molecular features of the microbial communities from the tailings of the Shimen realgar mine. We collected six samples of tailings from sites A, B, C, D, E, and F. Microcosm assays indicated that all of the six sites contain both chemoautotrophic and heterotrophic arsenite-oxidizing microorganisms; their activities differed considerably from each other. The microbial arsenite-oxidizing activities show a positive correlation with soluble arsenic concentrations. The microbial communities of the six sites contain 40 phyla of bacteria and 2 phyla of archaea that show extremely high diversity. Soluble arsenic, sulfate, pH, and total organic carbon (TOC) are the key environmental factors that shape the microbial communities. We further identified 114 unique arsenite oxidase genes from the samples; all of them code for new or new-type arsenite oxidases. We also isolated 10 novel arsenite oxidizers from the samples, of which 4 are chemoautotrophic and 6 are heterotrophic. These data highlight the unique diversities of the arsenite-oxidizing microorganisms and their oxidase genes from the tailings of the Shimen realgar mine. To the best of our knowledge, this is the first report describing the functional and molecular features of microbial communities from the tailings of a realgar mine. This study focused on the functional and molecular characterizations of microbial communities from the tailings of the Shimen realgar mine. We fully explored, for the first time, the arsenite-oxidizing activities and the functional gene diversities of microorganisms from the tailings, as well as the correlation of the microbial activities/diversities with

  2. Gene Mining for Proline Based Signaling Proteins in Cell Wall of Arabidopsis thaliana

    PubMed Central

    Ihsan, Muhammad Z.; Ahmad, Samina J. N.; Shah, Zahid Hussain; Rehman, Hafiz M.; Aslam, Zubair; Ahuja, Ishita; Bones, Atle M.; Ahmad, Jam N.

    2017-01-01

    The cell wall (CW) as a first line of defense against biotic and abiotic stresses is of primary importance in plant biology. The proteins associated with cell walls play a significant role in determining a plant's sustainability to adverse environmental conditions. In this work, the genes encoding cell wall proteins (CWPs) in Arabidopsis were identified and functionally classified using geneMANIA and GENEVESTIGATOR with published microarrays data. This yielded 1605 genes, out of which 58 genes encoded proline-rich proteins (PRPs) and glycine-rich proteins (GRPs). Here, we have focused on the cellular compartmentalization, biological processes, and molecular functioning of proline-rich CWPs along with their expression at different plant developmental stages. The mined genes were categorized into five classes on the basis of the type of PRPs encoded in the cell wall of Arabidopsis thaliana. We review the domain structure and function of each class of protein, many with respect to the developmental stages of the plant. We have then used networks, hierarchical clustering and correlations to analyze co-expression, co-localization, genetic, and physical interactions and shared protein domains of these PRPs. This has given us further insight into these functionally important CWPs and identified a number of potentially new cell-wall related proteins in A. thaliana. PMID:28289422

  3. RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage

    PubMed Central

    Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili

    2013-01-01

    The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is always detected as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we presented a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conservative genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778

  4. Hybrid coexpression link similarity graph clustering for mining biological modules from multiple gene expression datasets

    PubMed Central

    2014-01-01

    Background Advances in genomic technologies have enabled the accumulation of vast amount of genomic data, including gene expression data for multiple species under various biological and environmental conditions. Integration of these gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on a single gene expression data, which suffers from spurious coexpression. Results We propose a joint mining algorithm that constructs a weighted hybrid similarity graph whose nodes are the coexpression links. The weight of an edge between two coexpression links in this hybrid graph is a linear combination of the topological similarities and co-appearance similarities of the corresponding two coexpression links. Clustering the weighted hybrid similarity graph yields recurrent coexpression link clusters (modules). Experimental results on Human gene expression datasets show that the reported modules are functionally homogeneous as evident by their enrichment with biological process GO terms and KEGG pathways. PMID:25221624

  5. Urban Mining of E-Waste is Becoming More Cost-Effective Than Virgin Mining.

    PubMed

    Zeng, Xianlai; Mathews, John A; Li, Jinhui

    2018-04-17

    Stocks of virgin-mined materials utilized in linear economic flows continue to present enormous challenges. E-waste is one of the fastest growing waste streams, and threatens to grow into a global problem of unmanageable proportions. An effective form of management of resource recycling and environmental improvement is available, in the form of extraction and purification of precious metals taken from waste streams, in a process known as urban mining. In this work, we demonstrate utilizing real cost data from e-waste processors in China that ingots of pure copper and gold could be recovered from e-waste streams at costs that are comparable to those encountered in virgin mining of ores. Our results are confined to the cases of copper and gold extracted and processed from e-waste streams made up of recycled TV sets, but these results indicate a trend and potential if applied across a broader range of e-waste sources and metals extracted. If these results can be extended to other metals and countries, they promise to have positive impact on waste disposal and mining activities globally, as the circular economy comes to displace linear economic pathways.

  6. A systematic review of lost-time injuries in the global mining industry.

    PubMed

    Nowrouzi-Kia, Behdin; Gohar, Basem; Casole, Jennifer; Chidu, Carla; Dumond, Jennifer; McDougall, Alicia; Nowrouzi-Kia, Behnam

    2018-05-01

    Mining is a hazardous occupation with elevated rates of lost-time injury and disability. The purpose of this study is twofold: 1) To identify the type of lost-time injuries in the mining workforce, regardless of the kind of mining and 2) To examine the antecedent factors to the occupational injury (lost-time injuries). We identified and extracted primary papers related to lost-time injuries in the mining sector by conducting a systematic search of the electronic literature in the eight health and related databases. We critically reviewed nine articles in the mining sector that examined lost-time injuries. Musculoskeletal injuries (hand, back, limbs, fractures, lacerations and muscle contusions), slips and falls were identified as types of lost-time injuries. The review identified the following antecedent factors related to lost-time injuries: the mining work environment (underground mining), being male, age, working with mining equipment, organizational size, falling objects, disease status, job training and lack of occupational safety management teams, recovery time, social supports, access to health services, pre-injury health status and susceptibility to injury. The mining sector is a hazardous environment that increases workers' susceptibility to occupational injuries. There is a need to create and implement monitoring systems of lost-time injuries to implement prevention programs.

  7. Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

    PubMed Central

    2013-01-01

    Background Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants

  8. MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures.

    PubMed

    Vazquez, Miguel; Nogales-Cadenas, Ruben; Arroyo, Javier; Botías, Pedro; García, Raul; Carazo, Jose M; Tirado, Francisco; Pascual-Montano, Alberto; Carmona-Saez, Pedro

    2010-07-01

    The enormous amount of data available in public gene expression repositories such as Gene Expression Omnibus (GEO) offers an inestimable resource to explore gene expression programs across several organisms and conditions. This information can be used to discover experiments that induce similar or opposite gene expression patterns to a given query, which in turn may lead to the discovery of new relationships among diseases, drugs or pathways, as well as the generation of new hypotheses. In this work, we present MARQ, a web-based application that allows researchers to compare a query set of genes, e.g. a set of over- and under-expressed genes, against a signature database built from GEO datasets for different organisms and platforms. MARQ offers an easy-to-use and integrated environment to mine GEO, in order to identify conditions that induce similar or opposite gene expression patterns to a given experimental condition. MARQ also includes additional functionalities for the exploration of the results, including a meta-analysis pipeline to find genes that are differentially expressed across different experiments. The application is freely available at http://marq.dacya.ucm.es.

  9. Mining subspace clusters from DNA microarray data using large itemset techniques.

    PubMed

    Chang, Ye-In; Chen, Jiun-Rung; Tsai, Yueh-Chi

    2009-05-01

    Mining subspace clusters from the DNA microarrays could help researchers identify those genes which commonly contribute to a disease, where a subspace cluster indicates a subset of genes whose expression levels are similar under a subset of conditions. Since in a DNA microarray, the number of genes is far larger than the number of conditions, those previous proposed algorithms which compute the maximum dimension sets (MDSs) for any two genes will take a long time to mine subspace clusters. In this article, we propose the Large Itemset-Based Clustering (LISC) algorithm for mining subspace clusters. Instead of constructing MDSs for any two genes, we construct only MDSs for any two conditions. Then, we transform the task of finding the maximal possible gene sets into the problem of mining large itemsets from the condition-pair MDSs. Since we are only interested in those subspace clusters with gene sets as large as possible, it is desirable to pay attention to those gene sets which have reasonable large support values in the condition-pair MDSs. From our simulation results, we show that the proposed algorithm needs shorter processing time than those previous proposed algorithms which need to construct gene-pair MDSs.

  10. Global Identification of Disease-Associated Genes in Fragile X Cells

    DTIC Science & Technology

    2017-03-01

    identify those specific gene substrates of FMRP, particularly those expressed in the brain , that are implicated in FXS progression. Moreover, we use...the co-localized R-loop formation and chromosome fragility in Fragile X cells, particularly at the brain -expressed genes, by ChIP-seq (detecting...X mental retardation protein February 2016, NGS Data Analysis & Informatics Conference, San Diego, California (Poster presentation) Title: Global

  11. bc-GenExMiner 3.0: new mining module computes breast cancer gene expression correlation analyses.

    PubMed

    Jézéquel, Pascal; Frénel, Jean-Sébastien; Campion, Loïc; Guérin-Charbonnel, Catherine; Gouraud, Wilfried; Ricolleau, Gabriel; Campone, Mario

    2013-01-01

    We recently developed a user-friendly web-based application called bc-GenExMiner (http://bcgenex.centregauducheau.fr), which offered the possibility to evaluate prognostic informativity of genes in breast cancer by means of a 'prognostic module'. In this study, we develop a new module called 'correlation module', which includes three kinds of gene expression correlation analyses. The first one computes correlation coefficient between 2 or more (up to 10) chosen genes. The second one produces two lists of genes that are most correlated (positively and negatively) to a 'tested' gene. A gene ontology (GO) mining function is also proposed to explore GO 'biological process', 'molecular function' and 'cellular component' terms enrichment for the output lists of most correlated genes. The third one explores gene expression correlation between the 15 telomeric and 15 centromeric genes surrounding a 'tested' gene. These correlation analyses can be performed in different groups of patients: all patients (without any subtyping), in molecular subtypes (basal-like, HER2+, luminal A and luminal B) and according to oestrogen receptor status. Validation tests based on published data showed that these automatized analyses lead to results consistent with studies' conclusions. In brief, this new module has been developed to help basic researchers explore molecular mechanisms of breast cancer. DATABASE URL: http://bcgenex.centregauducheau.fr

  12. Seq-ing answers: uncovering the unexpected in global gene regulation.

    PubMed

    Otto, George Maxwell; Brar, Gloria Ann

    2018-04-19

    The development of techniques for measuring gene expression globally has greatly expanded our understanding of gene regulatory mechanisms in depth and scale. We can now quantify every intermediate and transition in the canonical pathway of gene expression-from DNA to mRNA to protein-genome-wide. Employing such measurements in parallel can produce rich datasets, but extracting the most information requires careful experimental design and analysis. Here, we argue for the value of genome-wide studies that measure multiple outputs of gene expression over many timepoints during the course of a natural developmental process. We discuss our findings from a highly parallel gene expression dataset of meiotic differentiation, and those of others, to illustrate how leveraging these features can provide new and surprising insight into fundamental mechanisms of gene regulation.

  13. From data towards knowledge: revealing the architecture of signaling systems by unifying knowledge mining and data mining of systematic perturbation data.

    PubMed

    Lu, Songjian; Jin, Bo; Cowart, L Ashley; Lu, Xinghua

    2013-01-01

    Genetic and pharmacological perturbation experiments, such as deleting a gene and monitoring gene expression responses, are powerful tools for studying cellular signal transduction pathways. However, it remains a challenge to automatically derive knowledge of a cellular signaling system at a conceptual level from systematic perturbation-response data. In this study, we explored a framework that unifies knowledge mining and data mining towards the goal. The framework consists of the following automated processes: 1) applying an ontology-driven knowledge mining approach to identify functional modules among the genes responding to a perturbation in order to reveal potential signals affected by the perturbation; 2) applying a graph-based data mining approach to search for perturbations that affect a common signal; and 3) revealing the architecture of a signaling system by organizing signaling units into a hierarchy based on their relationships. Applying this framework to a compendium of yeast perturbation-response data, we have successfully recovered many well-known signal transduction pathways; in addition, our analysis has led to many new hypotheses regarding the yeast signal transduction system; finally, our analysis automatically organized perturbed genes as a graph reflecting the architecture of the yeast signaling system. Importantly, this framework transformed molecular findings from a gene level to a conceptual level, which can be readily translated into computable knowledge in the form of rules regarding the yeast signaling system, such as "if genes involved in the MAPK signaling are perturbed, genes involved in pheromone responses will be differentially expressed."

  14. The impact of endurance exercise on global and AMPK gene-specific DNA methylation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    King-Himmelreich, Tanya S.; Schramm, Stefanie; Wolters, Miriam C.

    Alterations in gene expression as a consequence of physical exercise are frequently described. The mechanism of these regulations might depend on epigenetic changes in global or gene-specific DNA methylation levels. The AMP-activated protein kinase (AMPK) plays a key role in maintenance of energy homeostasis and is activated by increases in the AMP/ATP ratio as occurring in skeletal muscles after sporting activity. To analyze whether exercise has an impact on the methylation status of the AMPK promoter, we determined the AMPK methylation status in human blood samples from patients before and after sporting activity in the context of rehabilitation as wellmore » as in skeletal muscles of trained and untrained mice. Further, we examined long interspersed nuclear element 1 (LINE-1) as indicator of global DNA methylation changes. Our results revealed that light sporting activity in mice and humans does not alter global DNA methylation but has an effect on methylation of specific CpG sites in the AMPKα2 gene. These regulations were associated with a reduced AMPKα2 mRNA and protein expression in muscle tissue, pointing at a contribution of the methylation status to AMPK expression. Taken together, these results suggest that exercise influences AMPKα2 gene methylation in human blood and eminently in the skeletal muscle of mice and therefore might repress AMPKα2 gene expression. -- Highlights: •AMPK gene methylation increases after moderate endurance exercise in humans and mice. •AMPKα mRNA and protein decrease after moderate endurance exercise in mice. •Global DNA methylation is not affected under the same conditions.« less

  15. DynGO: a tool for visualizing and mining of Gene Ontology and its associations

    PubMed Central

    Liu, Hongfang; Hu, Zhang-Zhi; Wu, Cathy H

    2005-01-01

    Background A large volume of data and information about genes and gene products has been stored in various molecular biology databases. A major challenge for knowledge discovery using these databases is to identify related genes and gene products in disparate databases. The development of Gene Ontology (GO) as a common vocabulary for annotation allows integrated queries across multiple databases and identification of semantically related genes and gene products (i.e., genes and gene products that have similar GO annotations). Meanwhile, dozens of tools have been developed for browsing, mining or editing GO terms, their hierarchical relationships, or their "associated" genes and gene products (i.e., genes and gene products annotated with GO terms). Tools that allow users to directly search and inspect relations among all GO terms and their associated genes and gene products from multiple databases are needed. Results We present a standalone package called DynGO, which provides several advanced functionalities in addition to the standard browsing capability of the official GO browsing tool (AmiGO). DynGO allows users to conduct batch retrieval of GO annotations for a list of genes and gene products, and semantic retrieval of genes and gene products sharing similar GO annotations. The result are shown in an association tree organized according to GO hierarchies and supported with many dynamic display options such as sorting tree nodes or changing orientation of the tree. For GO curators and frequent GO users, DynGO provides fast and convenient access to GO annotation data. DynGO is generally applicable to any data set where the records are annotated with GO terms, as illustrated by two examples. Conclusion We have presented a standalone package DynGO that provides functionalities to search and browse GO and its association databases as well as several additional functions such as batch retrieval and semantic retrieval. The complete documentation and software are

  16. MyWEST: my Web Extraction Software Tool for effective mining of annotations from web-based databanks.

    PubMed

    Masseroli, Marco; Stella, Andrea; Meani, Natalia; Alcalay, Myriam; Pinciroli, Francesco

    2004-12-12

    High-throughput technologies create the necessity to mine large amounts of gene annotations from diverse databanks, and to integrate the resulting data. Most databanks can be interrogated only via Web, for a single gene at a time, and query results are generally available only in the HTML format. Although some databanks provide batch retrieval of data via FTP, this requires expertise and resources for locally reimplementing the databank. We developed MyWEST, a tool aimed at researchers without extensive informatics skills or resources, which exploits user-defined templates to easily mine selected annotations from different Web-interfaced databanks, and aggregates and structures results in an automatically updated database. Using microarray results from a model system of retinoic acid-induced differentiation, MyWEST effectively gathered relevant annotations from various biomolecular databanks, highlighted significant biological characteristics and supported a global approach to the understanding of complex cellular mechanisms. MyWEST is freely available for non-profit use at http://www.medinfopoli.polimi.it/MyWEST/

  17. LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes

    PubMed Central

    Cañada, Andres; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso

    2017-01-01

    Abstract A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes—CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es PMID:28531339

  18. Global analysis of genes involved in freshwater adaptation in threespine sticklebacks (Gasterosteus aculeatus).

    PubMed

    DeFaveri, Jacquelin; Shikano, Takahito; Shimada, Yukinori; Goto, Akira; Merilä, Juha

    2011-06-01

    Examples of parallel evolution of phenotypic traits have been repeatedly demonstrated in threespine sticklebacks (Gasterosteus aculeatus) across their global distribution. Using these as a model, we performed a targeted genome scan--focusing on physiologically important genes potentially related to freshwater adaptation--to identify genetic signatures of parallel physiological evolution on a global scale. To this end, 50 microsatellite loci, including 26 loci within or close to (<6 kb) physiologically important genes, were screened in paired marine and freshwater populations from six locations across the Northern Hemisphere. Signatures of directional selection were detected in 24 loci, including 17 physiologically important genes, in at least one location. Although no loci showed consistent signatures of selection in all divergent population pairs, several outliers were common in multiple locations. In particular, seven physiologically important genes, as well as reference ectodysplasin gene (EDA), showed signatures of selection in three or more locations. Hence, although these results give some evidence for consistent parallel molecular evolution in response to freshwater colonization, they suggest that different evolutionary pathways may underlie physiological adaptation to freshwater habitats within the global distribution of the threespine stickleback. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.

  19. A global evolutionary and metabolic analysis of human obesity gene risk variants.

    PubMed

    Castillo, Joseph J; Hazlett, Zachary S; Orlando, Robert A; Garver, William S

    2017-09-05

    It is generally accepted that the selection of gene variants during human evolution optimized energy metabolism that now interacts with our obesogenic environment to increase the prevalence of obesity. The purpose of this study was to perform a global evolutionary and metabolic analysis of human obesity gene risk variants (110 human obesity genes with 127 nearest gene risk variants) identified using genome-wide association studies (GWAS) to enhance our knowledge of early and late genotypes. As a result of determining the mean frequency of these obesity gene risk variants in 13 available populations from around the world our results provide evidence for the early selection of ancestral risk variants (defined as selection before migration from Africa) and late selection of derived risk variants (defined as selection after migration from Africa). Our results also provide novel information for association of these obesity genes or encoded proteins with diverse metabolic pathways and other human diseases. The overall results indicate a significant differential evolutionary pattern for the selection of obesity gene ancestral and derived risk variants proposed to optimize energy metabolism in varying global environments and complex association with metabolic pathways and other human diseases. These results are consistent with obesity genes that encode proteins possessing a fundamental role in maintaining energy metabolism and survival during the course of human evolution. Copyright © 2017. Published by Elsevier B.V.

  20. A data mining paradigm for identifying key factors in biological processes using gene expression data.

    PubMed

    Li, Jin; Zheng, Le; Uchiyama, Akihiko; Bin, Lianghua; Mauro, Theodora M; Elias, Peter M; Pawelczyk, Tadeusz; Sakowicz-Burkiewicz, Monika; Trzeciak, Magdalena; Leung, Donald Y M; Morasso, Maria I; Yu, Peng

    2018-06-13

    A large volume of biological data is being generated for studying mechanisms of various biological processes. These precious data enable large-scale computational analyses to gain biological insights. However, it remains a challenge to mine the data efficiently for knowledge discovery. The heterogeneity of these data makes it difficult to consistently integrate them, slowing down the process of biological discovery. We introduce a data processing paradigm to identify key factors in biological processes via systematic collection of gene expression datasets, primary analysis of data, and evaluation of consistent signals. To demonstrate its effectiveness, our paradigm was applied to epidermal development and identified many genes that play a potential role in this process. Besides the known epidermal development genes, a substantial proportion of the identified genes are still not supported by gain- or loss-of-function studies, yielding many novel genes for future studies. Among them, we selected a top gene for loss-of-function experimental validation and confirmed its function in epidermal differentiation, proving the ability of this paradigm to identify new factors in biological processes. In addition, this paradigm revealed many key genes in cold-induced thermogenesis using data from cold-challenged tissues, demonstrating its generalizability. This paradigm can lead to fruitful results for studying molecular mechanisms in an era of explosive accumulation of publicly available biological data.

  1. ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery.

    PubMed

    Krishnakumar, Vivek; Contrino, Sergio; Cheng, Chia-Yi; Belyaeva, Irina; Ferlanti, Erik S; Miller, Jason R; Vaughn, Matthew W; Micklem, Gos; Town, Christopher D; Chan, Agnes P

    2017-01-01

    ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana. The data collection currently includes the latest structural and functional annotation from the Araport11 update, the Col-0 genome sequence, RNA-seq and array expression, co-expression, protein interactions, homologs, pathways, publications, alleles, germplasm and phenotypes. The data are collected from a wide variety of public resources. Users can browse gene-specific data through Gene Report pages, identify and create gene lists based on experiments or indexed keywords, and run GO enrichment analysis to investigate the biological significance of selected gene sets. Developed by the Arabidopsis Information Portal project (Araport, https://www.araport.org/), ThaleMine uses the InterMine software framework, which builds well-structured data, and provides powerful data query and analysis functionality. The warehoused data can be accessed by users via graphical interfaces, as well as programmatically via web-services. Here we describe recent developments in ThaleMine including new features and extensions, and discuss future improvements. InterMine has been broadly adopted by the model organism research community including nematode, rat, mouse, zebrafish, budding yeast, the modENCODE project, as well as being used for human data. ThaleMine is the first InterMine developed for a plant model. As additional new plant InterMines are developed by the legume and other plant research communities, the potential of cross-organism integrative data analysis will be further enabled. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  2. Prior knowledge based mining functional modules from Yeast PPI networks with gene ontology

    PubMed Central

    2010-01-01

    Background In the literature, there are fruitful algorithmic approaches for identification functional modules in protein-protein interactions (PPI) networks. Because of accumulation of large-scale interaction data on multiple organisms and non-recording interaction data in the existing PPI database, it is still emergent to design novel computational techniques that can be able to correctly and scalably analyze interaction data sets. Indeed there are a number of large scale biological data sets providing indirect evidence for protein-protein interaction relationships. Results The main aim of this paper is to present a prior knowledge based mining strategy to identify functional modules from PPI networks with the aid of Gene Ontology. Higher similarity value in Gene Ontology means that two gene products are more functionally related to each other, so it is better to group such gene products into one functional module. We study (i) to encode the functional pairs into the existing PPI networks; and (ii) to use these functional pairs as pairwise constraints to supervise the existing functional module identification algorithms. Topology-based modularity metric and complex annotation in MIPs will be used to evaluate the identified functional modules by these two approaches. Conclusions The experimental results on Yeast PPI networks and GO have shown that the prior knowledge based learning methods perform better than the existing algorithms. PMID:21172053

  3. Heavy metals in wild house mice from coal-mining areas of Colombia and expression of genes related to oxidative stress, DNA damage and exposure to metals.

    PubMed

    Guerrero-Castilla, Angélica; Olivero-Verbel, Jesús; Marrugo-Negrete, José

    2014-03-01

    Coal mining is a source of pollutants that impact on environmental and human health. This study examined the metal content and the transcriptional status of gene markers associated with oxidative stress, metal transport and DNA damage in livers of feral mice collected near coal-mining operations, in comparison with mice obtained from a reference site. Mus musculus specimens were caught from La Loma and La Jagua, two coal-mining sites in the north of Colombia, as well as from Valledupar (Cesar Department), a city located 100km north of the mines. Concentrations in liver tissue of Hg, Zn, Pb, Cd, Cu and As were determined by differential stripping voltammetry, and real-time PCR was used to measure gene expression. Compared with the reference group (Valledupar), hepatic concentrations of Cd, Cu and Zn were significantly higher in animals living near mining areas. In exposed animals, the mRNA expression of NQ01, MT1, SOD1, MT2, and DDIT3 was 4.2-, 7.3-, 2.5-, 4.6- and 3.4-fold greater in coal mining sites, respectively, than in animals from the reference site (p<0.05). These results suggest that activities related to coal mining may generate pollutants that could affect the biota, inducing the transcription of biochemical markers related to oxidative stress, metal exposure, and DNA damage. These changes may be in part linked to metal toxicity, and could have implications for the development of chronic disease. Therefore, it is essential to implement preventive measures to minimize the effects of coal mining on its nearby environment, in order to protect human health. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. Socially Responsible Mining: the Relationship between Mining and Poverty, Human Health and the Environment

    PubMed Central

    Maier, Raina M.; Díaz-Barriga, Fernando; Field, James A.; Hopkins, James; Klein, Bern; Poulton, Mary M.

    2016-01-01

    Increasing global demand for metals is straining the ability of the mining industry to physically keep up with demand (physical scarcity). On the other hand, social issues including the environmental and human health consequences of mining as well as the disparity in income distribution from mining revenues are disproportionately felt at the local community level. This has created social rifts, particularly in the developing world, between affected communities and both industry and governments. Such rifts can result in a disruption of the steady supply of metals (situational scarcity). Here we discuss the importance of mining in relationship to poverty, identify steps that have been taken to create a framework for socially responsible mining, and then discuss the need for academia to work in partnership with communities, government, and industry to develop trans-disciplinary research-based step change solutions to the intertwined problems of physical and situational scarcity. PMID:24552962

  5. Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

    2011-01-01

    Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less

  6. An Empirical Model for Mine-Blast Loading

    DTIC Science & Technology

    2014-10-17

    fledged experimental program. The numerical approach however suffers from several drawbacks in the mine blast simulations. First, it is a very...Suffield consisted in a pendulum type device to measure global impulse of buried mine [15]. One of the main purposes of the ONAGER pendulum was to study...TP-1 Terminal effects, KTA 1-34 report, 2004. [15] Bues, R., Hlady, S.L. and Bergeron, D.M., Pendulum Measurement of Land Mine Blast Output, Volume

  7. Polymorphisms in metabolism and repair genes affects DNA damage caused by open-cast coal mining exposure.

    PubMed

    Espitia-Pérez, Lyda; Sosa, Milton Quintana; Salcedo-Arteaga, Shirley; León-Mejía, Grethel; Hoyos-Giraldo, Luz Stella; Brango, Hugo; Kvitko, Katia; da Silva, Juliana; Henriques, João A P

    2016-09-15

    Increasing evidence suggest that occupational exposure to open-cast coal mining residues like dust particles, heavy metals and Polycyclic Aromatic Hydrocarbons (PAHs) may cause a wide range of DNA damage and genomic instability that could be associated to initial steps in cancer development and other work-related diseases. The aim of our study was to evaluate if key polymorphisms in metabolism genes CYP1A1Msp1, GSTM1Null, GSTT1Null and DNA repair genes XRCC1Arg194Trp and hOGG1Ser326Cys could modify individual susceptibility to adverse coal exposure effects, considering the DNA damage (Comet assay) and micronucleus formation in lymphocytes (CBMN) and buccal mucosa cells (BMNCyt) as endpoints for genotoxicity. The study population is comprised of 200 healthy male subjects, 100 open-cast coal-mining workers from "El Cerrejón" (world's largest open-cast coal mine located in Guajira - Colombia) and 100 non-exposed referents from general population. The data revealed a significant increase of CBMN frequency in peripheral lymphocytes of occupationally exposed workers carrying the wild-type variant of GSTT1 (+) gene. Exposed subjects carrying GSTT1null polymorphism showed a lower micronucleus frequency compared with their positive counterparts (FR: 0.83; P=0.04), while BMNCyt, frequency and Comet assay parameters in lymphocytes: Damage Index (DI) and percentage of DNA in the tail (Tail % DNA) were significantly higher in exposed workers with the GSTM1Null polymorphism. Other exfoliated buccal mucosa abnormalities related to cell death (Karyorrhexis and Karyolysis) were increased in GSTT/M1Null carriers. Nuclear buds were significantly higher in workers carrying the CYP1A1Msp1 (m1/m2, m2/m2) allele. Moreover, BMNCyt frequency and Comet assay parameters were significantly lower in exposed carriers of XRCC1Arg194Trp (Arg/Trp, Trp/Trp) and hOGG1Ser326Cys (Ser/Cys, Cys/Cys), thereby providing new data to the increasing evidence about the protective role of these polymorphisms

  8. Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining

    PubMed Central

    2012-01-01

    network. Since multiple TLRs were found in the generic fever network, it is reasonable to hypothesize that vaccine-TLR interactions may play an important role in inducing fever response, which deserves a further investigation. Conclusions This study demonstrated that ontology-based literature mining is a powerful method for analyzing gene interaction networks and generating new scientific hypotheses. PMID:23256563

  9. Ionospheric Signature of Surface Mine Blasts from Global Positioning System Measurements

    NASA Technical Reports Server (NTRS)

    Calais, Eric; Minster, J. Bernard; Hofton, Michelle A.; Hedlin, Michael A. H.

    1998-01-01

    Sources such as atmospheric or buried explosions and shallow earthquakes are known to produce infrasonic pressure waves in the atmosphere. Because of the coupling between neutral particles and electrons at ionospheric altitudes, these acoustic and gravity waves induce variations of the ionospheric electron density. The Global Positioning System (GPS) provides a way of directly measuring the total electron content in the ionosphere and, therefore, of detecting such perturbations in the upper atmosphere. In July and August 1996, three large surface mine blasts (1.5 Kt each) were detonated at the Black Thunder coal mine in eastern Wyoming. As part of a seismic and acoustic monitoring- experiment, we deployed five dual-frequency GPS receivers at distances ranging from 50 to 200 km from the mine and were able to detect the ionospheric perturbation caused by the blasts. The perturbation starts 10 to 15 min after the blast, lasts for about 30 min, and propagates with an apparent horizontal velocity of 1200 meters per second. Its amplitude reaches 3 x 10 (exp 14) el per square meters in the 7-3 min period band, a value close to the ionospheric perturbation caused by the M = 6.7 Northridge earthquake. The small signal-to-noise ratio of the perturbation can be improved by slant-stacking the electron content time-series recorded by the different GPS receivers taking into account the horizontal propagation of the perturbation. The energy of the perturbation is concentrated in the 200 to 300 second period band, a result consistent with previous observations and numerical model predictions. The 300 second band probably corresponds to gravity modes and shorter periods to acoustic modes, respectively. Using a 1-D stratified velocity model of the atmosphere we show that linear acoustic ray tracing fits arrival times at all GPS receivers. We interpret the perturbation as a direct acoustic wave caused by the explosion itself. This study shows that even relatively small subsurface

  10. LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.

    PubMed

    Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin

    2017-07-03

    A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Genotoxic effects and gene expression in Danio rerio (Hamilton 1822) (Cypriniformes: Cyprinidae) exposed to mining-impacted tributaries in Manizales, Colombia.

    PubMed

    Ossa-López, Paula A; Castaño-Villa, Gabriel J; Rivera-Páez, Fredy A

    2017-09-25

    The zebrafish (Danio rerio) is one of the most studied aquatic organisms for water biomonitoring, due to its sensitivity to environmental degradation and resistance to toxic substances. This study determined the presence of micronuclei and nuclear abnormalities in peripheral blood erythrocytes, and assessed the gene expression of caspase-3 (CASP-3) and metallothionein 1 (MT-1) in the gills and liver of D. rerio. The study fish (n = 45) were exposed to water collected from two stations with mining impact (E2 and E3) and a reference station without evident mining contamination (E1), all located in La Elvira stream (Manizales-Colombia). In addition, a positive control (PC) with HgCl 2 (50 μg/L) and negative control (NC) with tap water were included. The fish from the PC and E2 and E3 treatments displayed genotoxic effects and changes in gene expression, with significant differences in micronuclei formation and the presence of blebbed nuclei. The cytochrome oxidase subunit I (COI) gene was used as reference and proved to be stable compared to the β-actin and 28S ribosomal RNA (28S) genes. In gills, CASP-3 expression was higher in the PC, and MT-1 expression was higher in the PC and E3 treatment. In liver, CASP-3 was expressed in the E2 treatment, and MT-1 expression was low. These results show that the genotoxic effects and differential gene expression observed in fish exposed to water from La Elvira stream could also be affecting the organisms present in this habitat.

  12. Defining global neuroendocrine gene expression patterns associated with reproductive seasonality in fish.

    PubMed

    Zhang, Dapeng; Xiong, Huiling; Mennigen, Jan A; Popesku, Jason T; Marlatt, Vicki L; Martyniuk, Christopher J; Crump, Kate; Cossins, Andrew R; Xia, Xuhua; Trudeau, Vance L

    2009-06-05

    Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABA(A) gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Using both theoretical and experimental strategies, we report for the first time global gene expression

  13. Defining Global Neuroendocrine Gene Expression Patterns Associated with Reproductive Seasonality in Fish

    PubMed Central

    Mennigen, Jan A.; Popesku, Jason T.; Marlatt, Vicki L.; Martyniuk, Christopher J.; Crump, Kate; Cossins, Andrew R.; Xia, Xuhua; Trudeau, Vance L.

    2009-01-01

    Background Many vertebrates, including the goldfish, exhibit seasonal reproductive rhythms, which are a result of interactions between external environmental stimuli and internal endocrine systems in the hypothalamo-pituitary-gonadal axis. While it is long believed that differential expression of neuroendocrine genes contributes to establishing seasonal reproductive rhythms, no systems-level investigation has yet been conducted. Methodology/Principal Findings In the present study, by analyzing multiple female goldfish brain microarray datasets, we have characterized global gene expression patterns for a seasonal cycle. A core set of genes (873 genes) in the hypothalamus were identified to be differentially expressed between May, August and December, which correspond to physiologically distinct stages that are sexually mature (prespawning), sexual regression, and early gonadal redevelopment, respectively. Expression changes of these genes are also shared by another brain region, the telencephalon, as revealed by multivariate analysis. More importantly, by examining one dataset obtained from fish in October who were kept under long-daylength photoperiod (16 h) typical of the springtime breeding season (May), we observed that the expression of identified genes appears regulated by photoperiod, a major factor controlling vertebrate reproductive cyclicity. Gene ontology analysis revealed that hormone genes and genes functionally involved in G-protein coupled receptor signaling pathway and transmission of nerve impulses are significantly enriched in an expression pattern, whose transition is located between prespawning and sexually regressed stages. The existence of seasonal expression patterns was verified for several genes including isotocin, ependymin II, GABAA gamma2 receptor, calmodulin, and aromatase b by independent samplings of goldfish brains from six seasonal time points and real-time PCR assays. Conclusions/Significance Using both theoretical and experimental

  14. Global patterns of diversity and selection in human tyrosinase gene.

    PubMed

    Hudjashov, Georgi; Villems, Richard; Kivisild, Toomas

    2013-01-01

    Global variation in skin pigmentation is one of the most striking examples of environmental adaptation in humans. More than two hundred loci have been identified as candidate genes in model organisms and a few tens of these have been found to be significantly associated with human skin pigmentation in genome-wide association studies. However, the evolutionary history of different pigmentation genes is rather complex: some loci have been subjected to strong positive selection, while others evolved under the relaxation of functional constraints in low UV environment. Here we report the results of a global study of the human tyrosinase gene, which is one of the key enzymes in melanin production, to assess the role of its variation in the evolution of skin pigmentation differences among human populations. We observe a higher rate of non-synonymous polymorphisms in the European sample consistent with the relaxation of selective constraints. A similar pattern was previously observed in the MC1R gene and concurs with UV radiation-driven model of skin color evolution by which mutations leading to lower melanin levels and decreased photoprotection are subject to purifying selection at low latitudes while being tolerated or even favored at higher latitudes because they facilitate UV-dependent vitamin D production. Our coalescent date estimates suggest that the non-synonymous variants, which are frequent in Europe and North Africa, are recent and have emerged after the separation of East and West Eurasian populations.

  15. Mining drives extensive deforestation in the Brazilian Amazon.

    PubMed

    Sonter, Laura J; Herrera, Diego; Barrett, Damian J; Galford, Gillian L; Moran, Chris J; Soares-Filho, Britaldo S

    2017-10-18

    Mining poses significant and potentially underestimated risks to tropical forests worldwide. In Brazil's Amazon, mining drives deforestation far beyond operational lease boundaries, yet the full extent of these impacts is unknown and thus neglected in environmental licensing. Here we quantify mining-induced deforestation and investigate the aspects of mining operations, which most likely contribute. We find mining significantly increased Amazon forest loss up to 70 km beyond mining lease boundaries, causing 11,670 km 2 of deforestation between 2005 and 2015. This extent represents 9% of all Amazon forest loss during this time and 12 times more deforestation than occurred within mining leases alone. Pathways leading to such impacts include mining infrastructure establishment, urban expansion to support a growing workforce, and development of mineral commodity supply chains. Mining-induced deforestation is not unique to Brazil; to mitigate adverse impacts of mining and conserve tropical forests globally, environmental assessments and licensing must considered both on- and off-lease sources of deforestation.

  16. Microbial diversity at the moderate acidic stage in three different sulfidic mine tailings dumps generating acid mine drainage.

    PubMed

    Korehi, Hananeh; Blöthe, Marco; Schippers, Axel

    2014-11-01

    In freshly deposited sulfidic mine tailings the pH is alkaline or circumneutral. Due to pyrite or pyrrhotite oxidation the pH is dropping over time to pH values <3 at which acidophilic iron- and sulfur-oxidizing prokaryotes prevail and accelerate the oxidation processes, well described for several mine waste sites. The microbial communities at the moderate acidic stage in mine tailings are only scarcely studied. Here we investigated the microbial diversity via 16S rRNA gene sequence analysis in eight samples (pH range 3.2-6.5) from three different sulfidic mine tailings dumps in Botswana, Germany and Sweden. In total 701 partial 16S rRNA gene sequences revealed a divergent microbial community between the three sites and at different tailings depths. Proteobacteria and Firmicutes were overall the most abundant phyla in the clone libraries. Acidobacteria, Actinobacteria, Bacteroidetes, and Nitrospira occurred less frequently. The found microbial communities were completely different to microbial communities in tailings at

  17. Global gene expression analyses of hematopoietic stem cell-like cell lines with inducible Lhx2 expression

    PubMed Central

    Richter, Karin; Wirta, Valtteri; Dahl, Lina; Bruce, Sara; Lundeberg, Joakim; Carlsson, Leif; Williams, Cecilia

    2006-01-01

    Background Expression of the LIM-homeobox gene Lhx2 in murine hematopoietic cells allows for the generation of hematopoietic stem cell (HSC)-like cell lines. To address the molecular basis of Lhx2 function, we generated HSC-like cell lines where Lhx2 expression is regulated by a tet-on system and hence dependent on the presence of doxycyclin (dox). These cell lines efficiently down-regulate Lhx2 expression upon dox withdrawal leading to a rapid differentiation into various myeloid cell types. Results Global gene expression of these cell lines cultured in dox was compared to different time points after dox withdrawal using microarray technology. We identified 267 differentially expressed genes. The majority of the genes overlapping with HSC-specific databases were those down-regulated after turning off Lhx2 expression and a majority of the genes overlapping with those defined as late progenitor-specific genes were the up-regulated genes, suggesting that these cell lines represent a relevant model system for normal HSCs also at the level of global gene expression. Moreover, in situ hybridisations of several genes down-regulated after dox withdrawal showed overlapping expression patterns with Lhx2 in various tissues during embryonic development. Conclusion Global gene expression analysis of HSC-like cell lines with inducible Lhx2 expression has identified genes putatively linked to self-renewal / differentiation of HSCs, and function of Lhx2 in organ development and stem / progenitor cells of non-hematopoietic origin. PMID:16600034

  18. Global gene expression in channel catfish after vaccination with an attenuated Edwardsiella ictaluri

    USDA-ARS?s Scientific Manuscript database

    To understand the global gene expression in channel catfish after immersion vaccination with an attenuated Edwardsiella ictaluri (AquaVac ESCTM), microarray analysis of 65,182 UniGene transcripts were performed. With a filter of false-discovery rate less than 0.05 and fold change greater than 2, a t...

  19. A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants

    PubMed Central

    Borowsky, Alexander T.

    2017-01-01

    Plants produce diverse specialized metabolites (SMs), but the genes responsible for their production and regulation remain largely unknown, hindering efforts to tap plant pharmacopeia. Given that genes comprising SM pathways exhibit environmentally dependent coregulation, we hypothesized that genes within a SM pathway would form tight associations (modules) with each other in coexpression networks, facilitating their identification. To evaluate this hypothesis, we used 10 global coexpression data sets, each a meta-analysis of hundreds to thousands of experiments, across eight plant species to identify hundreds of coexpressed gene modules per data set. In support of our hypothesis, 15.3 to 52.6% of modules contained two or more known SM biosynthetic genes, and module genes were enriched in SM functions. Moreover, modules recovered many experimentally validated SM pathways, including all six known to form biosynthetic gene clusters (BGCs). In contrast, bioinformatically predicted BGCs (i.e., those lacking an associated metabolite) were no more coexpressed than the null distribution for neighboring genes. These results suggest that most predicted plant BGCs are not genuine SM pathways and argue that BGCs are not a hallmark of plant specialized metabolism. We submit that global gene coexpression is a rich, largely untapped resource for discovering the genetic basis and architecture of plant natural products. PMID:28408660

  20. Genes Involved in the Evolution of Herbivory by a Leaf-Mining, Drosophilid Fly

    PubMed Central

    Whiteman, Noah K.; Gloss, Andrew D.; Sackton, Timothy B.; Groen, Simon C.; Humphrey, Parris T.; Lapoint, Richard T.; Sønderby, Ida E.; Halkier, Barbara A.; Kocks, Christine; Ausubel, Frederick M.; Pierce, Naomi E.

    2012-01-01

    Herbivorous insects are among the most successful radiations of life. However, we know little about the processes underpinning the evolution of herbivory. We examined the evolution of herbivory in the fly, Scaptomyza flava, whose larvae are leaf miners on species of Brassicaceae, including the widely studied reference plant, Arabidopsis thaliana (Arabidopsis). Scaptomyza flava is phylogenetically nested within the paraphyletic genus Drosophila, and the whole genome sequences available for 12 species of Drosophila facilitated phylogenetic analysis and assembly of a transcriptome for S. flava. A time-calibrated phylogeny indicated that leaf mining in Scaptomyza evolved between 6 and 16 million years ago. Feeding assays showed that biosynthesis of glucosinolates, the major class of antiherbivore chemical defense compounds in mustard leaves, was upregulated by S. flava larval feeding. The presence of glucosinolates in wild-type (WT) Arabidopsis plants reduced S. flava larval weight gain and increased egg–adult development time relative to flies reared in glucosinolate knockout (GKO) plants. An analysis of gene expression differences in 5-day-old larvae reared on WT versus GKO plants showed a total of 341 transcripts that were differentially regulated by glucosinolate uptake in larval S. flava. Of these, approximately a third corresponded to homologs of Drosophila melanogaster genes associated with starvation, dietary toxin-, heat-, oxidation-, and aging-related stress. The upregulated transcripts exhibited elevated rates of protein evolution compared with unregulated transcripts. The remaining differentially regulated transcripts also contained a higher proportion of novel genes than the unregulated transcripts. Thus, the transition to herbivory in Scaptomyza appears to be coupled with the evolution of novel genes and the co-option of conserved stress-related genes. PMID:22813779

  1. Asturian mercury mining district (Spain) and the environment: a review.

    PubMed

    Ordóñez, A; Álvarez, R; Loredo, J

    2013-11-01

    Mercury is of particular concern amongst global environmental pollutants, with abundant contaminated sites worldwide, many of which are associated with mining activities. Asturias (Northwest of Spain) can be considered an Hg metallogenic province with abundant epithermal-type deposits, whose paragenetic sequences include also As-rich minerals. These mines were abandoned long before the introduction of any environmental regulations to control metal release from these sources. Consequently, the environment is globally affected, as high metal concentrations have been found in soils, waters, sediments, plants, and air. In this paper, a characterization of the environmental affection caused by Hg mining in nine Asturian mine sites is presented, with particular emphasis in Hg and As contents. Hg concentrations found in the studied milieu are similar and even higher than those reported in previous studies for other mercury mining districts (mainly Almadén and Idrija). Furthermore, the potential adverse health effects of exposure to these elements in the considered sites in this district have been assessed.

  2. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

    PubMed

    Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano

    2013-01-01

    The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. Different

  3. Global and gene specific DNA methylation changes during zebrafish development

    USDA-ARS?s Scientific Manuscript database

    DNA methylation is dynamic through the life of an organism. In this study, we measured the global and gene specific DNA methylation changes in zebrafish at different developmental stages. We found that the methylation percentage of cytosines was 11.75 ± 0.96% in 3.3 hour post fertilization (hpf) zeb...

  4. Global Burden of Disease of Mercury Used in Artisanal Small-Scale Gold Mining.

    PubMed

    Steckling, Nadine; Tobollik, Myriam; Plass, Dietrich; Hornberg, Claudia; Ericson, Bret; Fuller, Richard; Bose-O'Reilly, Stephan

    Artisanal small-scale gold mining (ASGM) is the world's largest anthropogenic source of mercury emission. Gold miners are highly exposed to metallic mercury and suffer occupational mercury intoxication. The global disease burden as a result of this exposure is largely unknown because the informal character of ASGM restricts the availability of reliable data. To estimate the prevalence of occupational mercury intoxication and the disability-adjusted life years (DALYs) attributable to chronic metallic mercury vapor intoxication (CMMVI) among ASGM gold miners globally and in selected countries. Estimates of the number of artisanal small-scale gold (ASG) miners were extracted from reviews supplemented by a literature search. Prevalence of moderate CMMVI among miners was determined by compiling a dataset of available studies that assessed frequency of intoxication in gold miners using a standardized diagnostic tool and biomonitoring data on mercury in urine. Severe cases of CMMVI were not included because it was assumed that these persons can no longer be employed as miners. Cases in workers' families and communities were not considered. Years lived with disability as a result of CMMVI among ASG miners were quantified by multiplying the number of prevalent cases of CMMVI by the appropriate disability weight. No deaths are expected to result from CMMVI and therefore years of life lost were not calculated. Disease burden was calculated by multiplying the prevalence rate with the number of miners for each country and the disability weight. Sensitivity analyses were performed using different assumptions on the number of miners and the intoxication prevalence rate. Globally, 14-19 million workers are employed as ASG miners. Based on human biomonitoring data, between 25% and 33% of these miners-3.3-6.5 million miners globally-suffer from moderate CMMVI. The resulting global burden of disease is estimated to range from 1.22 (uncertainty interval [UI] 0.87-1.61) to 2.39 (UI 1

  5. Literature Mining for the Discovery of Hidden Connections between Drugs, Genes and Diseases

    PubMed Central

    Frijters, Raoul; van Vugt, Marianne; Smeets, Ruben; van Schaik, René; de Vlieg, Jacob; Alkema, Wynand

    2010-01-01

    The scientific literature represents a rich source for retrieval of knowledge on associations between biomedical concepts such as genes, diseases and cellular processes. A commonly used method to establish relationships between biomedical concepts from literature is co-occurrence. Apart from its use in knowledge retrieval, the co-occurrence method is also well-suited to discover new, hidden relationships between biomedical concepts following a simple ABC-principle, in which A and C have no direct relationship, but are connected via shared B-intermediates. In this paper we describe CoPub Discovery, a tool that mines the literature for new relationships between biomedical concepts. Statistical analysis using ROC curves showed that CoPub Discovery performed well over a wide range of settings and keyword thesauri. We subsequently used CoPub Discovery to search for new relationships between genes, drugs, pathways and diseases. Several of the newly found relationships were validated using independent literature sources. In addition, new predicted relationships between compounds and cell proliferation were validated and confirmed experimentally in an in vitro cell proliferation assay. The results show that CoPub Discovery is able to identify novel associations between genes, drugs, pathways and diseases that have a high probability of being biologically valid. This makes CoPub Discovery a useful tool to unravel the mechanisms behind disease, to find novel drug targets, or to find novel applications for existing drugs. PMID:20885778

  6. Global expression analysis of gene regulatory pathways during endocrine pancreatic development.

    PubMed

    Gu, Guoqiang; Wells, James M; Dombkowski, David; Preffer, Fred; Aronow, Bruce; Melton, Douglas A

    2004-01-01

    To define genetic pathways that regulate development of the endocrine pancreas, we generated transcriptional profiles of enriched cells isolated from four biologically significant stages of endocrine pancreas development: endoderm before pancreas specification, early pancreatic progenitor cells, endocrine progenitor cells and adult islets of Langerhans. These analyses implicate new signaling pathways in endocrine pancreas development, and identified sets of known and novel genes that are temporally regulated, as well as genes that spatially define developing endocrine cells from their neighbors. The differential expression of several genes from each time point was verified by RT-PCR and in situ hybridization. Moreover, we present preliminary functional evidence suggesting that one transcription factor encoding gene (Myt1), which was identified in our screen, is expressed in endocrine progenitors and may regulate alpha, beta and delta cell development. In addition to identifying new genes that regulate endocrine cell fate, this global gene expression analysis has uncovered informative biological trends that occur during endocrine differentiation.

  7. A Global Coexpression Network Approach for Connecting Genes to Specialized Metabolic Pathways in Plants.

    PubMed

    Wisecaver, Jennifer H; Borowsky, Alexander T; Tzin, Vered; Jander, Georg; Kliebenstein, Daniel J; Rokas, Antonis

    2017-05-01

    Plants produce diverse specialized metabolites (SMs), but the genes responsible for their production and regulation remain largely unknown, hindering efforts to tap plant pharmacopeia. Given that genes comprising SM pathways exhibit environmentally dependent coregulation, we hypothesized that genes within a SM pathway would form tight associations (modules) with each other in coexpression networks, facilitating their identification. To evaluate this hypothesis, we used 10 global coexpression data sets, each a meta-analysis of hundreds to thousands of experiments, across eight plant species to identify hundreds of coexpressed gene modules per data set. In support of our hypothesis, 15.3 to 52.6% of modules contained two or more known SM biosynthetic genes, and module genes were enriched in SM functions. Moreover, modules recovered many experimentally validated SM pathways, including all six known to form biosynthetic gene clusters (BGCs). In contrast, bioinformatically predicted BGCs (i.e., those lacking an associated metabolite) were no more coexpressed than the null distribution for neighboring genes. These results suggest that most predicted plant BGCs are not genuine SM pathways and argue that BGCs are not a hallmark of plant specialized metabolism. We submit that global gene coexpression is a rich, largely untapped resource for discovering the genetic basis and architecture of plant natural products. © 2017 American Society of Plant Biologists. All rights reserved.

  8. Mechanical Strain Promotes Oligodendrocyte Differentiation by Global Changes of Gene Expression

    PubMed Central

    Jagielska, Anna; Lowe, Alexis L.; Makhija, Ekta; Wroblewska, Liliana; Guck, Jochen; Franklin, Robin J. M.; Shivashankar, G. V.; Van Vliet, Krystyn J.

    2017-01-01

    Differentiation of oligodendrocyte progenitor cells (OPC) to oligodendrocytes and subsequent axon myelination are critical steps in vertebrate central nervous system (CNS) development and regeneration. Growing evidence supports the significance of mechanical factors in oligodendrocyte biology. Here, we explore the effect of mechanical strains within physiological range on OPC proliferation and differentiation, and strain-associated changes in chromatin structure, epigenetics, and gene expression. Sustained tensile strain of 10–15% inhibited OPC proliferation and promoted differentiation into oligodendrocytes. This response to strain required specific interactions of OPCs with extracellular matrix ligands. Applied strain induced changes in nuclear shape, chromatin organization, and resulted in enhanced histone deacetylation, consistent with increased oligodendrocyte differentiation. This response was concurrent with increased mRNA levels of the epigenetic modifier histone deacetylase Hdac11. Inhibition of HDAC proteins eliminated the strain-mediated increase of OPC differentiation, demonstrating a role of HDACs in mechanotransduction of strain to chromatin. RNA sequencing revealed global changes in gene expression associated with strain. Specifically, expression of multiple genes associated with oligodendrocyte differentiation and axon-oligodendrocyte interactions was increased, including cell surface ligands (Ncam, ephrins), cyto- and nucleo-skeleton genes (Fyn, actinins, myosin, nesprin, Sun1), transcription factors (Sox10, Zfp191, Nkx2.2), and myelin genes (Cnp, Plp, Mag). These findings show how mechanical strain can be transmitted to the nucleus to promote oligodendrocyte differentiation, and identify the global landscape of signaling pathways involved in mechanotransduction. These data provide a source of potential new therapeutic avenues to enhance OPC differentiation in vivo. PMID:28473753

  9. Quantitative Analysis of Critical Factors for the Climate Impact of Landfill Mining.

    PubMed

    Laner, David; Cencic, Oliver; Svensson, Niclas; Krook, Joakim

    2016-07-05

    Landfill mining has been proposed as an innovative strategy to mitigate environmental risks associated with landfills, to recover secondary raw materials and energy from the deposited waste, and to enable high-valued land uses at the site. The present study quantitatively assesses the importance of specific factors and conditions for the net contribution of landfill mining to global warming using a novel, set-based modeling approach and provides policy recommendations for facilitating the development of projects contributing to global warming mitigation. Building on life-cycle assessment, scenario modeling and sensitivity analysis methods are used to identify critical factors for the climate impact of landfill mining. The net contributions to global warming of the scenarios range from -1550 (saving) to 640 (burden) kg CO2e per Mg of excavated waste. Nearly 90% of the results' total variation can be explained by changes in four factors, namely the landfill gas management in the reference case (i.e., alternative to mining the landfill), the background energy system, the composition of the excavated waste, and the applied waste-to-energy technology. Based on the analyses, circumstances under which landfill mining should be prioritized or not are identified and sensitive parameters for the climate impact assessment of landfill mining are highlighted.

  10. Data-mining analysis of the global distribution of soil carbon in observational databases and Earth system models

    NASA Astrophysics Data System (ADS)

    Hashimoto, Shoji; Nanko, Kazuki; Ťupek, Boris; Lehtonen, Aleksi

    2017-03-01

    Future climate change will dramatically change the carbon balance in the soil, and this change will affect the terrestrial carbon stock and the climate itself. Earth system models (ESMs) are used to understand the current climate and to project future climate conditions, but the soil organic carbon (SOC) stock simulated by ESMs and those of observational databases are not well correlated when the two are compared at fine grid scales. However, the specific key processes and factors, as well as the relationships among these factors that govern the SOC stock, remain unclear; the inclusion of such missing information would improve the agreement between modeled and observational data. In this study, we sought to identify the influential factors that govern global SOC distribution in observational databases, as well as those simulated by ESMs. We used a data-mining (machine-learning) (boosted regression trees - BRT) scheme to identify the factors affecting the SOC stock. We applied BRT scheme to three observational databases and 15 ESM outputs from the fifth phase of the Coupled Model Intercomparison Project (CMIP5) and examined the effects of 13 variables/factors categorized into five groups (climate, soil property, topography, vegetation, and land-use history). Globally, the contributions of mean annual temperature, clay content, carbon-to-nitrogen (CN) ratio, wetland ratio, and land cover were high in observational databases, whereas the contributions of the mean annual temperature, land cover, and net primary productivity (NPP) were predominant in the SOC distribution in ESMs. A comparison of the influential factors at a global scale revealed that the most distinct differences between the SOCs from the observational databases and ESMs were the low clay content and CN ratio contributions, and the high NPP contribution in the ESMs. The results of this study will aid in identifying the causes of the current mismatches between observational SOC databases and ESM outputs

  11. The influence of the scale of mining activity and mine site remediation on the contamination legacy of historical metal mining activity.

    PubMed

    Bird, Graham

    2016-12-01

    Globally, thousands of kilometres of rivers are degraded due to the presence of elevated concentrations of potentially harmful elements (PHEs) sourced from historical metal mining activity. In many countries, the presence of contaminated water and river sediment creates a legal requirement to address such problems. Remediation of mining-associated point sources has often been focused upon improving river water quality; however, this study evaluates the contaminant legacy present within river sediments and attempts to assess the influence of the scale of mining activity and post-mining remediation upon the magnitude of PHE contamination found within contemporary river sediments. Data collected from four exemplar catchments indicates a strong relationship between the scale of historical mining, as measured by ore output, and maximum PHE enrichment factors, calculated versus environmental quality guidelines. The use of channel slope as a proxy measure for the degree of channel-floodplain coupling indicates that enrichment factors for PHEs in contemporary river sediments may also be the highest where channel-floodplain coupling is the greatest. Calculation of a metric score for mine remediation activity indicates no clear influence of the scale of remediation activity and PHE enrichment factors for river sediments. It is suggested that whilst exemplars of significant successes at improving post-remediation river water quality can be identified; river sediment quality is a much more long-lasting environmental problem. In addition, it is suggested that improvements to river sediment quality do not occur quickly or easily as a result of remediation actions focused a specific mining point sources. Data indicate that PHEs continue to be episodically dispersed through river catchments hundreds of years after the cessation of mining activity, especially during flood flows. The high PHE loads of flood sediments in mining-affected river catchments and the predicted changes to

  12. Challenges for modeling global gene regulatory networks during development: insights from Drosophila.

    PubMed

    Wilczynski, Bartek; Furlong, Eileen E M

    2010-04-15

    Development is regulated by dynamic patterns of gene expression, which are orchestrated through the action of complex gene regulatory networks (GRNs). Substantial progress has been made in modeling transcriptional regulation in recent years, including qualitative "coarse-grain" models operating at the gene level to very "fine-grain" quantitative models operating at the biophysical "transcription factor-DNA level". Recent advances in genome-wide studies have revealed an enormous increase in the size and complexity or GRNs. Even relatively simple developmental processes can involve hundreds of regulatory molecules, with extensive interconnectivity and cooperative regulation. This leads to an explosion in the number of regulatory functions, effectively impeding Boolean-based qualitative modeling approaches. At the same time, the lack of information on the biophysical properties for the majority of transcription factors within a global network restricts quantitative approaches. In this review, we explore the current challenges in moving from modeling medium scale well-characterized networks to more poorly characterized global networks. We suggest to integrate coarse- and find-grain approaches to model gene regulatory networks in cis. We focus on two very well-studied examples from Drosophila, which likely represent typical developmental regulatory modules across metazoans. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  13. Assessment of global ischemic/reperfusion and Tacrolimus administration on CA1 region of hippocampus: gene expression profiles of BAX and BCL2 genes.

    PubMed

    Badr, R; Hashemi, M; Javadi, G; Movafagh, A; Mahdian, R

    2016-01-01

    It is well known that hippocampus has a pivotal role in learning, formation and consolidation of memory. Global cerebral ischemia causes severe damage to pyramidal neurons of the CA1 region and usually results in residual neurological deficits following a recovery from ischemia. Scientists investigate to find the molecular mechanism of apoptosis and to use this cell death for clinical treatment. In this investigation, we evaluated the molecular mechanism of FK-506 in apoptosis using gene expression quantification of BAX and BCL-2 genes in hippocampus following global ischemic/reperfusion. In the present experimental study, adult male Wistar rats were obtained and housed under standard conditions. Each experimental group consisted of five rats and was equally distributed in the normal control, ischemia/reperfusion, ischemia/reperfusion followed by FK-506. Global ischemia was induced for animals in ischemia and drug groups. In the drug group, moreover, two doses of FK-506 were injected as IV injection and intra-peritoneal (IP) injection after 48 h. Then, hippocampus tissue was removed. Consequently, RNA isolated, cDNA was synthesized and Real-Time PCR was performed. Finally, the obtained data was analyzed statistically (p<0.05). The quantitative results showed the BAX expression ratio increased approximately 3-times in ischemia/reperfusion (3.11 ± 0.28) group compared to the untreated (NR) and the drug group (p<0.001). The statistical analysis showed a significant difference for BCL-2 expression between the experimental groups (p<0.001). The mRNA level of BCL-2 decreased in the ischemia/reperfusion group, while it was without alteration in the other groups. The results showed that global cerebral ischemia/reperfusion induced BAX as pro-apoptotic gene and tacrolimus a neuroprotective drug inhibited BAX gene expression and induced BCL-2 gene expression as anti-apoptotic gene (Tab. 2, Fig. 3, Ref. 21).

  14. Advances in genetic circuit design: novel biochemistries, deep part mining, and precision gene expression.

    PubMed

    Nielsen, Alec A K; Segall-Shapiro, Thomas H; Voigt, Christopher A

    2013-12-01

    Cells use regulatory networks to perform computational operations to respond to their environment. Reliably manipulating such networks would be valuable for many applications in biotechnology; for example, in having genes turn on only under a defined set of conditions or implementing dynamic or temporal control of expression. Still, building such synthetic regulatory circuits remains one of the most difficult challenges in genetic engineering and as a result they have not found widespread application. Here, we review recent advances that address the key challenges in the forward design of genetic circuits. First, we look at new design concepts, including the construction of layered digital and analog circuits, and new approaches to control circuit response functions. Second, we review recent work to apply part mining and computational design to expand the number of regulators that can be used together within one cell. Finally, we describe new approaches to obtain precise gene expression and to reduce context dependence that will accelerate circuit design by more reliably balancing regulators while reducing toxicity. Copyright © 2013. Published by Elsevier Ltd.

  15. Rapid Evaluation of Radioactive Contamination in Rare Earth Mine Mining

    NASA Astrophysics Data System (ADS)

    Wang, N.

    2017-12-01

    In order to estimate the current levels of environmental radioactivity in Bayan Obo rare earth mine and to study the rapid evaluation methods of radioactivity contamination in the rare earth mine, the surveys of the in-situ gamma-ray spectrometry and gamma dose rate measurement were carried out around the mining area and living area. The in-situ gamma-ray spectrometer was composed of a scintillation detector of NaI(Tl) (Φ75mm×75mm) and a multichannel analyzer. Our survey results in Bayan Obo Mine display: (1) Thorium-232 is the radioactive contamination source of this region, and uranium-238 and potassium - 40 is at the background level. (2) The average content of thorium-232 in the slag of the tailings dam in Bayan Obo is as high as 276 mg/kg, which is 37 times as the global average value of thorium content. (3) We found that the thorium-232 content in the soil in the living area near the mining is higher than that in the local soil in Guyang County. The average thorium-232 concentrations in the mining areas of the Bayan Obo Mine and the living areas of the Bayan Obo Town were 18.7±7.5 and 26.2±9.1 mg/kg, respectively. (4) It was observed that thorium-232 was abnormal distributed in the contaminated area near the tailings dam. Our preliminary research results show that the in-situ gamma-ray spectrometry is an effective approach of fast evaluating rare earths radioactive pollution, not only can the scene to determine the types of radioactive contamination source, but also to measure the radioactivity concentration of thorium and uranium in soil. The environmental radioactive evaluation of rare earth ore and tailings dam in open-pit mining is also needed. The research was supported by National Natural Science Foundation of China (No. 41674111).

  16. Global transcriptome analysis of Halolamina sp. to decipher the salt tolerance in extremely halophilic archaea.

    PubMed

    Kurt-Kızıldoğan, Aslıhan; Abanoz, Büşra; Okay, Sezer

    2017-02-15

    Extremely halophilic archaea survive in the hypersaline environments such as salt lakes or salt mines. Therefore, these microorganisms are good sources to investigate the molecular mechanisms underlying the tolerance to high salt concentrations. In this study, a global transcriptome analysis was conducted in an extremely halophilic archaeon, Halolamina sp. YKT1, isolated from a salt mine in Turkey. A comparative RNA-seq analysis was performed using YKT1 isolate grown either at 2.7M NaCl or 5.5M NaCl concentrations. A total of 2149 genes were predicted to be up-regulated and 1638 genes were down-regulated in the presence of 5.5M NaCl. The salt tolerance of Halolamina sp. YKT1 involves the up-regulation of genes related with membrane transporters, CRISPR-Cas systems, osmoprotectant solutes, oxidative stress proteins, and iron metabolism. On the other hand, the genes encoding the proteins involved in DNA replication, transcription, translation, mismatch and nucleotide excision repair were down-regulated. The RNA-seq data were verified for seven up-regulated genes as well as six down-regulated genes via qRT-PCR analysis. This comprehensive transcriptome analysis showed that the halophilic archaeon canalizes its energy towards keeping the intracellular osmotic balance minimizing the production of nucleic acids and peptides. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Environmental considerations related to mining of nonfuel minerals

    USGS Publications Warehouse

    Seal, Robert R.; Piatak, Nadine M.; Kimball, Bryn E.; Hammarstrom, Jane M.; Schulz, Klaus J.; DeYoung,, John H.; Seal, Robert R.; Bradley, Dwight C.

    2017-12-19

    Throughout most of human history, environmental stewardship during mining has not been a priority partly because of the lack of applicable laws and regulations and partly because of ignorance about the effects that mining can have on the environment. In the United States, the National Environmental Policy Act of 1969, in conjunction with related laws, codified a more modern approach to mining, including the responsibility for environmental stewardship, and provided a framework for incorporating environmental protection into mine planning. Today, similar frameworks are in place in the other developed countries of the world, and international mining companies generally follow similar procedures wherever they work in the world. The regulatory guidance has fostered an international effort among all stakeholders to identify best practices for environmental stewardship.The modern approach to mining using best practices involves the following: (a) establishment of a pre-mining baseline from which to monitor environmental effects during mining and help establish geologically reasonable closure goals; (b) identification of environmental risks related to mining through standardized approaches; and (c) formulation of an environmental closure plan before the start of mining. A key aspect of identifying the environmental risks and mitigating those risks is understanding how the risks vary from one deposit type to another—a concept that forms the basis for geoenvironmental mineral-deposit models.Accompanying the quest for best practices is the goal of making mining sustainable into the future. Sustainable mine development is generally considered to be development that meets the needs of the present generation without compromising the ability of future generations to meet their own needs. The concept extends beyond the availability of nonrenewable mineral commodities and includes the environmental and social effects of mine development.Global population growth, meanwhile, has

  18. Mining virulence genes using metagenomics.

    PubMed

    Belda-Ferre, Pedro; Cabrera-Rubio, Raúl; Moya, Andrés; Mira, Alex

    2011-01-01

    When a bacterial genome is compared to the metagenome of an environment it inhabits, most genes recruit at high sequence identity. In free-living bacteria (for instance marine bacteria compared against the ocean metagenome) certain genomic regions are totally absent in recruitment plots, representing therefore genes unique to individual bacterial isolates. We show that these Metagenomic Islands (MIs) are also visible in bacteria living in human hosts when their genomes are compared to sequences from the human microbiome, despite the compartmentalized structure of human-related environments such as the gut. From an applied point of view, MIs of human pathogens (e.g. those identified in enterohaemorragic Escherichia coli against the gut metagenome or in pathogenic Neisseria meningitidis against the oral metagenome) include virulence genes that appear to be absent in related strains or species present in the microbiome of healthy individuals. We propose that this strategy (i.e. recruitment analysis of pathogenic bacteria against the metagenome of healthy subjects) can be used to detect pathogenicity regions in species where the genes involved in virulence are poorly characterized. Using this approach, we detect well-known pathogenicity islands and identify new potential virulence genes in several human pathogens.

  19. antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters.

    PubMed

    Weber, Tilmann; Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko; Medema, Marnix H

    2015-07-01

    Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

    PubMed

    Singhal, Ayush; Simmons, Michael; Lu, Zhiyong

    2016-11-01

    The practice of precision medicine will ultimately require databases of genes and mutations for healthcare providers to reference in order to understand the clinical implications of each patient's genetic makeup. Although the highest quality databases require manual curation, text mining tools can facilitate the curation process, increasing accuracy, coverage, and productivity. However, to date there are no available text mining tools that offer high-accuracy performance for extracting such triplets from biomedical literature. In this paper we propose a high-performance machine learning approach to automate the extraction of disease-gene-variant triplets from biomedical literature. Our approach is unique because we identify the genes and protein products associated with each mutation from not just the local text content, but from a global context as well (from the Internet and from all literature in PubMed). Our approach also incorporates protein sequence validation and disease association using a novel text-mining-based machine learning approach. We extract disease-gene-variant triplets from all abstracts in PubMed related to a set of ten important diseases (breast cancer, prostate cancer, pancreatic cancer, lung cancer, acute myeloid leukemia, Alzheimer's disease, hemochromatosis, age-related macular degeneration (AMD), diabetes mellitus, and cystic fibrosis). We then evaluate our approach in two ways: (1) a direct comparison with the state of the art using benchmark datasets; (2) a validation study comparing the results of our approach with entries in a popular human-curated database (UniProt) for each of the previously mentioned diseases. In the benchmark comparison, our full approach achieves a 28% improvement in F1-measure (from 0.62 to 0.79) over the state-of-the-art results. For the validation study with UniProt Knowledgebase (KB), we present a thorough analysis of the results and errors. Across all diseases, our approach returned 272 triplets (disease-gene

  1. Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna

    Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated themore » identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.« less

  2. Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products from Salinispora Species

    DOE PAGES

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; ...

    2015-04-09

    Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. In this paper, we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains, including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated themore » identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. Finally, these efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches.« less

  3. Molecular Networking and Pattern-Based Genome Mining Improves discovery of biosynthetic gene clusters and their products from Salinispora species

    PubMed Central

    Duncan, Katherine R.; Crüsemann, Max; Lechner, Anna; Sarkar, Anindita; Li, Jie; Ziemert, Nadine; Wang, Mingxun; Bandeira, Nuno; Moore, Bradley S.; Dorrestein, Pieter C.; Jensen, Paul R.

    2015-01-01

    Summary Genome sequencing has revealed that bacteria contain many more biosynthetic gene clusters than predicted based on the number of secondary metabolites discovered to date. While this biosynthetic reservoir has fostered interest in new tools for natural product discovery, there remains a gap between gene cluster detection and compound discovery. Here we apply molecular networking and the new concept of pattern-based genome mining to 35 Salinispora strains including 30 for which draft genome sequences were either available or obtained for this study. The results provide a method to simultaneously compare large numbers of complex microbial extracts, which facilitated the identification of media components, known compounds and their derivatives, and new compounds that could be prioritized for structure elucidation. These efforts revealed considerable metabolite diversity and led to several molecular family-gene cluster pairings, of which the quinomycin-type depsipeptide retimycin A was characterized and linked to gene cluster NRPS40 using pattern-based bioinformatic approaches. PMID:25865308

  4. Institutional challenges for mining and sustainability in Peru.

    PubMed

    Bebbington, Anthony J; Bury, Jeffrey T

    2009-10-13

    Global consumption continues to generate growth in mining. In lesser developed economies, this growth offers the potential to generate new resources for development, but also creates challenges to sustainability in the regions in which extraction occurs. This context leads to debate on the institutional arrangements most likely to build synergies between mining, livelihoods, and development, and on the socio-political conditions under which such institutions can emerge. Building from a multiyear, three-country program of research projects, Peru, a global center of mining expansion, serves as an exemplar for analyzing the effects of extractive industry on livelihoods and the conditions under which arrangements favoring local sustainability might emerge. This program is guided by three emergent hypotheses in human-environmental sciences regarding the relationships among institutions, knowledge, learning, and sustainability. The research combines in-depth and comparative case study analysis, and uses mapping and spatial analysis, surveys, in-depth interviews, participant observation, and our own direct participation in public debates on the regulation of mining for development. The findings demonstrate the pressures that mining expansion has placed on water resources, livelihood assets, and social relationships. These pressures are a result of institutional conditions that separate the governance of mineral expansion, water resources, and local development, and of relationships of power that prioritize large scale investment over livelihood and environment. A further problem is the poor communication between mining sector knowledge systems and those of local populations. These results are consistent with themes recently elaborated in sustainability science.

  5. antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

    PubMed Central

    Blin, Kai; Duddela, Srikanth; Krug, Daniel; Kim, Hyun Uk; Bruccoleri, Robert; Lee, Sang Yup; Fischbach, Michael A; Müller, Rolf; Wohlleben, Wolfgang; Breitling, Rainer; Takano, Eriko

    2015-01-01

    Abstract Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software. PMID:25948579

  6. The future of Yellowcake: a global assessment of uranium resources and mining.

    PubMed

    Mudd, Gavin M

    2014-02-15

    Uranium (U) mining remains controversial in many parts of the world, especially in a post-Fukushima context, and often in areas with significant U resources. Although nuclear proponents point to the relatively low carbon intensity of nuclear power compared to fossil fuels, opponents argue that this will be eroded in the future as ore grades decline and energy and greenhouse gas emissions (GGEs) intensity increases as a result. Invariably both sides fail to make use of the increasingly available data reported by some U mines through sustainability reporting - allowing a comprehensive assessment of recent trends in the energy and GGE intensity of U production, as well as combining this with reported mineral resources to allow more comprehensive modelling of future energy and GGEs intensity. In this study, detailed data sets are compiled on reported U resources by deposit type, as well as mine production, energy and GGE intensity. Some important aspects included are the relationship between ore grade, deposit type and recovery, which are crucial in future projections of U mining. Overall, the paper demonstrates that there are extensive U resources known to meet potential short to medium term demand, although the future of U mining remains uncertain due to the doubt about the future of nuclear power as well as a range of complex social, environmental, economic and some site-specific technical issues. Copyright © 2013 Elsevier B.V. All rights reserved.

  7. Feasibility of lunar Helium-3 mining

    NASA Astrophysics Data System (ADS)

    Kleinschneider, Andreas; Van Overstraeten, Dmitry; Van der Reijnst, Roy; Van Hoorn, Niels; Lamers, Marvin; Hubert, Laurent; Dijk, Bert; Blangé, Joey; Hogeveen, Joel; De Boer, Lennaert; Noomen, Ron

    With fossil fuels running out and global energy demand increasing, the need for alternative energy sources is apparent. Nuclear fusion using Helium-3 may be a solution. Helium-3 is a rare isotope on Earth, but it is abundant on the Moon. Throughout the space community lunar Helium-3 is often cited as a major reason to return to the Moon. Despite the potential of lunar Helium-3 mining, little research has been conducted on a full end-to-end mission. This abstract presents the results of a feasibility study conducted by students from Delft University of Technology. The goal of the study was to assess whether a continuous end-to-end mission to mine Helium-3 on the Moon and return it to Earth is a viable option for the future energy market. The set requirements for the representative end-to-end mission were to provide 10% of the global energy demand in the year 2040. The mission elements have been selected with multiple trade-offs among both conservative and novel concepts. A mission architecture with multiple decoupled elements for each transportation segment (LEO, transfer, lunar surface) was found to be the best option. It was found that the most critical element is the lunar mining operation itself. To supply 10% of the global energy demand in 2040, 200 tons of Helium-3 would be required per year. The resulting regolith mining rate would be 630 tons per second, based on an optimistic concentration of 20 ppb Helium-3 in lunar regolith. Between 1,700 to 2,000 Helium-3 mining vehicles would be required, if using University of Wisconsin’s Mark III miner. The required heating power, if mining both day and night, would add up to 39 GW. The resulting power system mass for the lunar operations would be in the order of 60,000 to 200,000 tons. A fleet of three lunar ascent/descent vehicles and 22 continuous-thrust vehicles for orbit transfer would be required. The costs of the mission elements have been spread out over expected lifetimes. The resulting profits from Helium

  8. Analysis of Bonds as an Instrument for Financing Mining Investments

    NASA Astrophysics Data System (ADS)

    Ranosz, Robert

    2017-06-01

    The purpose of this article is to examine the structure of financing for mining enterprises in the years 2007-2013, with particular emphasis on bonds. The document pays special attention to Polish mining enterprises. The financing structure analysis was based on data collected from financial statements (cash flows) of the largest mining companies in Poland, and their comparison with the results of global mining enterprises pursuant to reports prepared by international advisory firms. The article takes into account capital sources such as: corporate bonds, bank loans and issue of shares. As indicated by the performed analysis, mining enterprises both around the world and in Poland are increasingly eager to take advantage of obtaining business financing from issue of corporate bonds. It should also be recognized that in the analyzed period, both global and Polish mining enterprises deviate from forms of financing such as issue of shares. This may be caused by the fact that the bonds market in Poland is becoming increasingly popular, mainly due to interest rate on bonds being lower in comparison with bank loans. Another reason may be that banks and potential buyers of shares are less eager to finance this type of investment due to a relatively substantial risk acceptable to bondholders.

  9. Suspended sediment load below open-cast mines for ungauged river basin

    NASA Astrophysics Data System (ADS)

    Kuksina, L.

    2011-12-01

    Placer mines are located in river valleys along river benches or river ancient channels. Frequently the existing mining sites are characterized by low contribution of the environmental technologies. Therefore open-pit mining alters stream hydrology and sediment processes and enhances sediment transport. The most serious environmental consequences of the sediment yield increase occur in the rivers populated by salmon fish community because salmon species prefer clean water with low turbidity. For instance, placer mining located in Kamchatka peninsula (Far East of Russia) which is regarded to be the last global gene pool of wild salmon Oncorhynchus threatens rivers ecosystems significantly. Impact assessment is limited by the hydrological observations scarcity. Gauging network is rare and in many cases whole basins up to 200 km length miss any hydrological data. The main purpose of the work is elaboration of methods for sediment yield estimation in rivers under mining impact and implementation of corresponding calculations. Subjects of the study are rivers of the Vivenka river basin where open-cast platinum mine is situated. It's one of the largest platinum mines in Russian Federation and in the world. This mine is the most well-studied in Kamchatka (research covers a period from 2003 to 2011). Empirical - analytical model of suspended sediment yield estimation was elaborated for rivers draining mine's territories. Sediment delivery at the open-cast mine happens due to the following sediment processes: - erosion in the channel diversions; - soil erosion on the exposed hillsides; - effluent from settling ponds; - mine waste water inflow; - accident mine waste water escape into rivers. Sediment washout caused by erosion was estimated by repeated measurements of the channel profiles in 2003, 2006 and 2008. Estimation of horizontal deformation rates was carried out on the basis of erosion dependence on water discharge rates, slopes and composition of sediments. Soil

  10. Mutual information estimation reveals global associations between stimuli and biological processes

    PubMed Central

    Suzuki, Taiji; Sugiyama, Masashi; Kanamori, Takafumi; Sese, Jun

    2009-01-01

    Background Although microarray gene expression analysis has become popular, it remains difficult to interpret the biological changes caused by stimuli or variation of conditions. Clustering of genes and associating each group with biological functions are often used methods. However, such methods only detect partial changes within cell processes. Herein, we propose a method for discovering global changes within a cell by associating observed conditions of gene expression with gene functions. Results To elucidate the association, we introduce a novel feature selection method called Least-Squares Mutual Information (LSMI), which computes mutual information without density estimaion, and therefore LSMI can detect nonlinear associations within a cell. We demonstrate the effectiveness of LSMI through comparison with existing methods. The results of the application to yeast microarray datasets reveal that non-natural stimuli affect various biological processes, whereas others are no significant relation to specific cell processes. Furthermore, we discover that biological processes can be categorized into four types according to the responses of various stimuli: DNA/RNA metabolism, gene expression, protein metabolism, and protein localization. Conclusion We proposed a novel feature selection method called LSMI, and applied LSMI to mining the association between conditions of yeast and biological processes through microarray datasets. In fact, LSMI allows us to elucidate the global organization of cellular process control. PMID:19208155

  11. Self-Organizing Global Gene Expression Regulated through Criticality: Mechanism of the Cell-Fate Change

    PubMed Central

    Tsuchiya, Masa; Giuliani, Alessandro; Hashimoto, Midori; Erenpreisa, Jekaterina; Yoshikawa, Kenichi

    2016-01-01

    Background A fundamental issue in bioscience is to understand the mechanism that underlies the dynamic control of genome-wide expression through the complex temporal-spatial self-organization of the genome to regulate the change in cell fate. We address this issue by elucidating a physically motivated mechanism of self-organization. Principal Findings Building upon transcriptome experimental data for seven distinct cell fates, including early embryonic development, we demonstrate that self-organized criticality (SOC) plays an essential role in the dynamic control of global gene expression regulation at both the population and single-cell levels. The novel findings are as follows: i) Mechanism of cell-fate changes: A sandpile-type critical transition self-organizes overall expression into a few transcription response domains (critical states). A cell-fate change occurs by means of a dissipative pulse-like global perturbation in self-organization through the erasure of initial-state critical behaviors (criticality). Most notably, the reprogramming of early embryo cells destroys the zygote SOC control to initiate self-organization in the new embryonal genome, which passes through a stochastic overall expression pattern. ii) Mechanism of perturbation of SOC controls: Global perturbations in self-organization involve the temporal regulation of critical states. Quantitative evaluation of this perturbation in terminal cell fates reveals that dynamic interactions between critical states determine the critical-state coherent regulation. The occurrence of a temporal change in criticality perturbs this between-states interaction, which directly affects the entire genomic system. Surprisingly, a sub-critical state, corresponding to an ensemble of genes that shows only marginal changes in expression and consequently are considered to be devoid of any interest, plays an essential role in generating a global perturbation in self-organization directed toward the cell-fate change

  12. Integrating text mining into the MGI biocuration workflow

    PubMed Central

    Dowell, K.G.; McAndrews-Hill, M.S.; Hill, D.P.; Drabkin, H.J.; Blake, J.A.

    2009-01-01

    A major challenge for functional and comparative genomics resource development is the extraction of data from the biomedical literature. Although text mining for biological data is an active research field, few applications have been integrated into production literature curation systems such as those of the model organism databases (MODs). Not only are most available biological natural language (bioNLP) and information retrieval and extraction solutions difficult to adapt to existing MOD curation workflows, but many also have high error rates or are unable to process documents available in those formats preferred by scientific journals. In September 2008, Mouse Genome Informatics (MGI) at The Jackson Laboratory initiated a search for dictionary-based text mining tools that we could integrate into our biocuration workflow. MGI has rigorous document triage and annotation procedures designed to identify appropriate articles about mouse genetics and genome biology. We currently screen ∼1000 journal articles a month for Gene Ontology terms, gene mapping, gene expression, phenotype data and other key biological information. Although we do not foresee that curation tasks will ever be fully automated, we are eager to implement named entity recognition (NER) tools for gene tagging that can help streamline our curation workflow and simplify gene indexing tasks within the MGI system. Gene indexing is an MGI-specific curation function that involves identifying which mouse genes are being studied in an article, then associating the appropriate gene symbols with the article reference number in the MGI database. Here, we discuss our search process, performance metrics and success criteria, and how we identified a short list of potential text mining tools for further evaluation. We provide an overview of our pilot projects with NCBO's Open Biomedical Annotator and Fraunhofer SCAI's ProMiner. In doing so, we prove the potential for the further incorporation of semi

  13. Integrating text mining into the MGI biocuration workflow.

    PubMed

    Dowell, K G; McAndrews-Hill, M S; Hill, D P; Drabkin, H J; Blake, J A

    2009-01-01

    A major challenge for functional and comparative genomics resource development is the extraction of data from the biomedical literature. Although text mining for biological data is an active research field, few applications have been integrated into production literature curation systems such as those of the model organism databases (MODs). Not only are most available biological natural language (bioNLP) and information retrieval and extraction solutions difficult to adapt to existing MOD curation workflows, but many also have high error rates or are unable to process documents available in those formats preferred by scientific journals.In September 2008, Mouse Genome Informatics (MGI) at The Jackson Laboratory initiated a search for dictionary-based text mining tools that we could integrate into our biocuration workflow. MGI has rigorous document triage and annotation procedures designed to identify appropriate articles about mouse genetics and genome biology. We currently screen approximately 1000 journal articles a month for Gene Ontology terms, gene mapping, gene expression, phenotype data and other key biological information. Although we do not foresee that curation tasks will ever be fully automated, we are eager to implement named entity recognition (NER) tools for gene tagging that can help streamline our curation workflow and simplify gene indexing tasks within the MGI system. Gene indexing is an MGI-specific curation function that involves identifying which mouse genes are being studied in an article, then associating the appropriate gene symbols with the article reference number in the MGI database.Here, we discuss our search process, performance metrics and success criteria, and how we identified a short list of potential text mining tools for further evaluation. We provide an overview of our pilot projects with NCBO's Open Biomedical Annotator and Fraunhofer SCAI's ProMiner. In doing so, we prove the potential for the further incorporation of semi

  14. Global land-use change hidden behind nickel consumption.

    PubMed

    Nakajima, Kenichi; Nansai, Keisuke; Matsubae, Kazuyo; Tomita, Makoto; Takayanagi, Wataru; Nagasaka, Tetsuya

    2017-05-15

    Economic growth is associated with a rapid rise in the use of natural resources within the economy, and has potential environmental impacts at local and/or global scales. In today's globalized economy, each country has indirect flows supporting its economic activities, and natural resource consumption through supply chains influences environmental impacts far removed from the place of consumption. One way to control environmental impacts associated with consumption of natural resources is to identify the consumption of natural resources and the associated environmental impacts through the global supply chain. In this study, we used a global link input-output model (GLIO, a hybrid multiregional input-output model) to detect the linkages between national nickel consumption and mining-associated global land-use changes. We focused on nickel, whose global demand has risen rapidly in recent years, as a case study. The estimated area of land-use change around the world caused by nickel mining in 2005 was 1.9km 2 , and that induced by Japanese final demand for nickel was 0.38km 2 . Our modeling also revealed that the areas of greatest land-use change associated with nickel mining were concentrated in only a few countries and regions far removed from the place of consumption. For example, 57.7% of the world's land-use changes caused by nickel mining were concentrated in five countries in 2005: Australia, 13.7%; Russia, 12.9%; Indonesia, 12.5%; New Caledonia, 10.4%; and Colombia, 8.2%. The mining-associated land-use change induced by Japanese final demand accounted for 19.5% of the total area affected by land-use change caused by nickel mining. The top three countries accounted for 70.6% (Indonesia: 47.0%, New Caledonia: 16.0%, and Australia: 7.7%), and the top five accounted for 82.4% (the Philippines: 7.5%, and Canada: 4.3%, in addition to the top three countries and regions). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  15. Expression of immunoregulatory genes and its relationship to lead exposure and lead-mediated oxidative stress in wild ungulates from an abandoned mining area.

    PubMed

    Rodríguez-Estival, Jaime; de la Lastra, José M Pérez; Ortiz-Santaliestra, Manuel E; Vidal, Dolors; Mateo, Rafael

    2013-04-01

    Lead (Pb) is a highly toxic metal that can induce oxidative stress and affect the immune system by modifying the expression of immunomodulator-related genes. The aim of the present study was to investigate the association between Pb exposure and the transcriptional profiles of some cytokines, as well as the relationship between Pb exposure and changes in oxidative stress biomarkers observed in the spleen of wild ungulates exposed to mining pollution. Red deer and wild boar from the mining area studied had higher spleen, liver, and bone Pb levels than controls, indicating a chronic exposure to Pb pollution. Such exposure caused a depletion of spleen glutathione levels in both species and disrupted the activity of antioxidant enzymes, suggesting the generation of oxidative stress conditions. Deer from the mining area also showed an induced T-helper (Th )-dependent immune response toward the Th 2 pathway, whereas boar from the mining area showed a cytokine profile suggesting an inclination of the immune response toward the Th 1 pathway. These results indicate that environmental exposure to Pb may alter immune responses in wild ungulates exposed to mining pollution. However, evidence of direct relationships between Pb-mediated oxidative stress and the changes detected in immune responses were not found. Further research is needed to evaluate the immunotoxic potential of Pb pollution, also considering the prevalence of chronic infectious diseases in wildlife in environments affected by mining activities. Copyright © 2013 SETAC.

  16. Global mining risk footprint of critical metals necessary for low-carbon technologies: the case of neodymium, cobalt, and platinum in Japan.

    PubMed

    Nansai, Keisuke; Nakajima, Kenichi; Kagawa, Shigemi; Kondo, Yasushi; Shigetomi, Yosuke; Suh, Sangwon

    2015-02-17

    Meeting the 2-degree global warming target requires wide adoption of low-carbon energy technologies. Many such technologies rely on the use of precious metals, however, increasing the dependence of national economies on these resources. Among such metals, those with supply security concerns are referred to as critical metals. Using the Policy Potential Index developed by the Fraser Institute, this study developed a new footprint indicator, the mining risk footprint (MRF), to quantify the mining risk directly and indirectly affecting a national economy through its consumption of critical metals. We formulated the MRF as a product of the material footprint (MF) of the consuming country and the mining risks of the countries where the materials are mined. A case study was conducted for the 2005 Japanese economy to determine the MF and MRF for three critical metals essential for emerging energy technologies: neodymium, cobalt and platinum. The results indicate that in 2005 the MFs generated by Japanese domestic final demand, that is, the consumption-based metal output of Japan, were 1.0 × 10(3) t for neodymium, 9.4 × 10(3) t for cobalt, and 2.1 × 10 t for platinum. Export demand contributes most to the MF, accounting for 3.0 × 10(3) t, 1.3 × 10(5) t, and 3.1 × 10 t, respectively. The MRFs of Japanese total final demand (domestic plus export) were calculated to be 1.7 × 10 points for neodymium, 4.5 × 10(-2) points for cobalt, and 5.6 points for platinum, implying that the Japanese economy is incurring a high mining risk through its use of neodymium. This country's MRFs are all dominated by export demand. The paper concludes by discussing the policy implications and future research directions for measuring the MFs and MRFs of critical metals. For countries poorly endowed with mineral resources, adopting low-carbon energy technologies may imply a shifting of risk from carbon resources to other natural resources, in particular critical metals, and a trade

  17. Global Gene Expression Profiling in Lung Tissues of Rat Exposed to Lunar Dust Particles

    NASA Technical Reports Server (NTRS)

    Yeshitla, Samrawit A.; Lam, Chiu-Wing; Kidane, Yared H.; Feiveson, Alan H.; Ploutz-Snyder, Robert; Wu, Honglu; James, John T.; Meyers, Valerie E.; Zhang, Ye

    2014-01-01

    The Moon's surface is covered by a layer of fine, potential reactive dust. Lunar dust contain about 1-2% respirable very fine dust (less than 3 micrometers). The habitable area of any lunar landing vehicle and outpost would inevitably be contaminated with lunar dust that could pose a health risk. The purpose of the study is to analyze the dynamics of global gene expression changes in lung tissues of rats exposed to lunar dust particles. F344 rats were exposed for 4 weeks (6h/d; 5d/wk) in nose-only inhalation chambers to concentrations of 0 (control air), 2.1, 6.8, 21, and 61 mg/m3 of lunar dust. Animals were euthanized at 1 day and 13 weeks after the last inhalation exposure. After being lavaged, lung tissue from each animal was collected and total RNA was isolated. Four samples of each dose group were analyzed using Agilent Rat GE v3 microarray to profile global gene expression of 44K transcripts. After background subtraction, normalization, and log transformation, t tests were used to compare the mean expression levels of each exposed group to the control group. Correction for multiple testing was made using the method of Benjamini, Krieger, and Yekuteli (1) to control the false discovery rate. Genes with significant changes of at least 1.75 fold were identified as genes of interest. Both low and high doses of lunar dust caused dramatic, dose-dependent global gene expression changes in the lung tissues. However, the responses of lung tissue to low dose lunar dust are distinguished from those of high doses, especially those associated with 61mg/m3 dust exposure. The data were further integrated into the Ingenuity system to analyze the gene ontology (GO), pathway distribution and putative upstream regulators and gene targets. Multiple pathways, functions, and upstream regulators have been identified in response to lunar dust induced damage in the lung tissue.

  18. Global transcriptome analysis of eukaryotic genes affected by gromwell extract.

    PubMed

    Bang, Soohyun; Lee, Dohyun; Kim, Hanhe; Park, Jiyong; Bahn, Yong-Sun

    2014-02-01

    Gromwell is known to have diverse pharmacological, cosmetic and nutritional benefits for humans. Nevertheless, the biological influence of gromwell extract (GE) on the general physiology of eukaryotic cells remains unknown. In this study a global transcriptome analysis was performed to identify genes affected by the addition of GE with Cryptococcus neoformans as the model system. In response to GE treatment, genes involved in signal transduction were immediately regulated, and the evolutionarily conserved sets of genes involved in the core cellular functions, including DNA replication, RNA transcription/processing and protein translation/processing, were generally up-regulated. In contrast, a number of genes involved in carbohydrate metabolism and transport, inorganic ion transport and metabolism, post-translational modification/protein turnover/chaperone functions and signal transduction were down-regulated. Among the GE-responsive genes that are also evolutionarily conserved in the human genome, the expression patterns of YSA1, TPO2, CFO1 and PZF1 were confirmed by northern blot analysis. Based on the functional characterization of some GE-responsive genes, it was found that GE treatment may promote cellular tolerance against a variety of environmental stresses in eukaryotes. GE treatment affects the expression levels of a significant portion of the Cryptococcus genome, implying that GE significantly affects the general physiology of eukaryotic cells. © 2013 Society of Chemical Industry.

  19. Institutional challenges for mining and sustainability in Peru

    PubMed Central

    Bebbington, Anthony J.; Bury, Jeffrey T.

    2009-01-01

    Global consumption continues to generate growth in mining. In lesser developed economies, this growth offers the potential to generate new resources for development, but also creates challenges to sustainability in the regions in which extraction occurs. This context leads to debate on the institutional arrangements most likely to build synergies between mining, livelihoods, and development, and on the socio-political conditions under which such institutions can emerge. Building from a multiyear, three-country program of research projects, Peru, a global center of mining expansion, serves as an exemplar for analyzing the effects of extractive industry on livelihoods and the conditions under which arrangements favoring local sustainability might emerge. This program is guided by three emergent hypotheses in human-environmental sciences regarding the relationships among institutions, knowledge, learning, and sustainability. The research combines in-depth and comparative case study analysis, and uses mapping and spatial analysis, surveys, in-depth interviews, participant observation, and our own direct participation in public debates on the regulation of mining for development. The findings demonstrate the pressures that mining expansion has placed on water resources, livelihood assets, and social relationships. These pressures are a result of institutional conditions that separate the governance of mineral expansion, water resources, and local development, and of relationships of power that prioritize large scale investment over livelihood and environment. A further problem is the poor communication between mining sector knowledge systems and those of local populations. These results are consistent with themes recently elaborated in sustainability science. PMID:19805172

  20. RESEARCH PAPERS : Ionospheric signature of surface mine blasts from Global Positioning System measurements

    NASA Astrophysics Data System (ADS)

    Calais, Eric; Bernard Minster, J.; Hofton, Michelle; Hedlin, Michael

    1998-01-01

    Sources such as atmospheric or buried explosions and shallow earthquakes are known to produce infrasonic pressure waves in the atmosphere Because of the coupling between neutral particles and electrons at ionospheric altitudes, these acoustic and gravity waves induce variations of the ionospheric electron density. The Global Positioning System (GPS) provides a way of directly measuring the total electron content in the ionosphere and, therefore, of detecting such perturbations in the upper atmosphere. In July and August 1996, three large surface mine blasts (1.5 Kt each) were detonated at the Black Thunder coal mine in eastern Wyoming. As part of a seismic and acoustic monitoring experiment, we deployed five dual-frequency GPS receivers at distances ranging from 50 to 200 km from the mine and were able to detect the ionospheric perturbation caused by the blasts. The perturbation starts 10 to 15 min after the blast, lasts for about 30 min, and propagates with an apparent horizontal velocity of 1200 m s- 1. Its amplitude reaches 3 × 1014 el m- 2 in the 7-3 min period band, a value close to the ionospheric perturbation caused by the M=6.7 Northridge earthquake (Calais & Minster 1995). The small signal-to-noise ratio of the perturbation can be improved by slant-stacking the electron content time-series recorded by the different GPS receivers taking into account the horizontal propagation of the perturbation. The energy of the perturbation is concentrated in the 200 to 300 s period band, a result consistent with previous observations and numerical model predictions. The 300 s band probably corresponds to gravity modes and shorter periods to acoustic modes, respectively. Using a 1-D stratified velocity model of the atmosphere we show that linear acoustic ray tracing fits arrival times at all GPS receivers. We interpret the perturbation as a direct acoustic wave caused by the explosion itself. This study shows that even relatively small subsurface events can produce

  1. Frontiers of biomedical text mining: current progress

    PubMed Central

    Zweigenbaum, Pierre; Demner-Fushman, Dina; Yu, Hong; Cohen, Kevin B.

    2008-01-01

    It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and others, such as identification of gene mentions in text, seem likely to be solved soon. However, a number of problems at the frontiers of biomedical text mining continue to present interesting challenges and opportunities for great improvements and interesting research. In this article we review the current state of the art in biomedical text mining or ‘BioNLP’ in general, focusing primarily on papers published within the past year. PMID:17977867

  2. Impact of Peat Mining and Restoration on Methane Turnover Potential and Methane-Cycling Microorganisms in a Northern Bog.

    PubMed

    Reumer, Max; Harnisz, Monika; Lee, Hyo Jung; Reim, Andreas; Grunert, Oliver; Putkinen, Anuliina; Fritze, Hannu; Bodelier, Paul L E; Ho, Adrian

    2018-02-01

    Ombrotrophic peatlands are a recognized global carbon reservoir. Without restoration and peat regrowth, harvested peatlands are dramatically altered, impairing their carbon sink function, with consequences for methane turnover. Previous studies determined the impact of commercial mining on the physicochemical properties of peat and the effects on methane turnover. However, the response of the underlying microbial communities catalyzing methane production and oxidation have so far received little attention. We hypothesize that with the return of Sphagnum spp. postharvest, methane turnover potential and the corresponding microbial communities will converge in a natural and restored peatland. To address our hypothesis, we determined the potential methane production and oxidation rates in natural (as a reference), actively mined, abandoned, and restored peatlands over two consecutive years. In all sites, the methanogenic and methanotrophic population sizes were enumerated using quantitative PCR (qPCR) assays targeting the mcrA and pmoA genes, respectively. Shifts in the community composition were determined using Illumina MiSeq sequencing of the mcrA gene and a pmoA -based terminal restriction fragment length polymorphism (t-RFLP) analysis, complemented by cloning and sequence analysis of the mmoX gene. Peat mining adversely affected methane turnover potential, but the rates recovered in the restored site. The recovery in potential activity was reflected in the methanogenic and methanotrophic abundances. However, the microbial community composition was altered, being more pronounced for the methanotrophs. Overall, we observed a lag between the recovery of the methanogenic/methanotrophic activity and the return of the corresponding microbial communities, suggesting that a longer duration (>15 years) is needed to reverse mining-induced effects on the methane-cycling microbial communities. IMPORTANCE Ombrotrophic peatlands are a crucial carbon sink, but this environment

  3. Contaminant Attenuation Processes at Mining Sites

    EPA Science Inventory

    Monitored natural attenuation is sometimes used in combination with active treatment technologies to achieve site-specific remediation objectives. The global imprint of acid drainage problems at mining sites, however, is a clear reminder that in most cases natural processes are ...

  4. Text mining patents for biomedical knowledge.

    PubMed

    Rodriguez-Esteban, Raul; Bundschus, Markus

    2016-06-01

    Biomedical text mining of scientific knowledge bases, such as Medline, has received much attention in recent years. Given that text mining is able to automatically extract biomedical facts that revolve around entities such as genes, proteins, and drugs, from unstructured text sources, it is seen as a major enabler to foster biomedical research and drug discovery. In contrast to the biomedical literature, research into the mining of biomedical patents has not reached the same level of maturity. Here, we review existing work and highlight the associated technical challenges that emerge from automatically extracting facts from patents. We conclude by outlining potential future directions in this domain that could help drive biomedical research and drug discovery. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Haloperidol induces pharmacoepigenetic response by modulating miRNA expression, global DNA methylation and expression profiles of methylation maintenance genes and genes involved in neurotransmission in neuronal cells.

    PubMed

    Swathy, Babu; Banerjee, Moinak

    2017-01-01

    Haloperidol has been extensively used in various psychiatric conditions. It has also been reported to induce severe side effects. We aimed to evaluate whether haloperidol can influence host methylome, and if so what are the possible mechanisms for it in neuronal cells. Impact on host methylome and miRNAs can have wide spread alterations in gene expression, which might possibly help in understanding how haloperidol may impact treatment response or induce side effects. SK-N-SH, a neuroblasoma cell line was treated with haloperidol at 10μm concentration for 24 hours and global DNA methylation was evaluated. Methylation at global level is maintained by methylation maintenance machinery and certain miRNAs. Therefore, the expression of methylation maintenance genes and their putative miRNA expression profiles were assessed. These global methylation alterations could result in gene expression changes. Therefore genes expressions for neurotransmitter receptors, regulators, ion channels and transporters were determined. Subsequently, we were also keen to identify a strong candidate miRNA based on biological and in-silico approach which can reflect on the pharmacoepigenetic trait of haloperidol and can also target the altered neuroscience panel of genes used in the study. Haloperidol induced increase in global DNA methylation which was found to be associated with corresponding increase in expression of various epigenetic modifiers that include DNMT1, DNMT3A, DNMT3B and MBD2. The expression of miR-29b that is known to putatively regulate the global methylation by modulating the expression of epigenetic modifiers was observed to be down regulated by haloperidol. In addition to miR-29b, miR-22 was also found to be downregulated by haloperidol treatment. Both these miRNA are known to putatively target several genes associated with various epigenetic modifiers, pharmacogenes and neurotransmission. Interestingly some of these putative target genes involved in neurotransmission

  6. Haloperidol induces pharmacoepigenetic response by modulating miRNA expression, global DNA methylation and expression profiles of methylation maintenance genes and genes involved in neurotransmission in neuronal cells

    PubMed Central

    Swathy, Babu

    2017-01-01

    Introduction Haloperidol has been extensively used in various psychiatric conditions. It has also been reported to induce severe side effects. We aimed to evaluate whether haloperidol can influence host methylome, and if so what are the possible mechanisms for it in neuronal cells. Impact on host methylome and miRNAs can have wide spread alterations in gene expression, which might possibly help in understanding how haloperidol may impact treatment response or induce side effects. Methods SK-N-SH, a neuroblasoma cell line was treated with haloperidol at 10μm concentration for 24 hours and global DNA methylation was evaluated. Methylation at global level is maintained by methylation maintenance machinery and certain miRNAs. Therefore, the expression of methylation maintenance genes and their putative miRNA expression profiles were assessed. These global methylation alterations could result in gene expression changes. Therefore genes expressions for neurotransmitter receptors, regulators, ion channels and transporters were determined. Subsequently, we were also keen to identify a strong candidate miRNA based on biological and in-silico approach which can reflect on the pharmacoepigenetic trait of haloperidol and can also target the altered neuroscience panel of genes used in the study. Results Haloperidol induced increase in global DNA methylation which was found to be associated with corresponding increase in expression of various epigenetic modifiers that include DNMT1, DNMT3A, DNMT3B and MBD2. The expression of miR-29b that is known to putatively regulate the global methylation by modulating the expression of epigenetic modifiers was observed to be down regulated by haloperidol. In addition to miR-29b, miR-22 was also found to be downregulated by haloperidol treatment. Both these miRNA are known to putatively target several genes associated with various epigenetic modifiers, pharmacogenes and neurotransmission. Interestingly some of these putative target genes

  7. Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data.

    PubMed

    Lee, Hyeonjeong; Shin, Miyoung

    2017-01-01

    The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into

  8. Global Gene Expression Change Induced by Major Thoracoabdominal Surgery.

    PubMed

    Allen, Casey J; Griswold, Anthony J; Schulman, Carl I; Sleeman, Danny; Levi, Joe U; Livingstone, Alan S; Proctor, Kenneth G

    2017-12-01

    To test the hypothesis that major thoracoabdominal surgery induces gene expression changes associated with adverse outcomes. Widely different traumatic injuries evoke surprisingly similar gene expression profiles, but there is limited information on whether the iatrogenic injury caused by major surgery is associated with similar patterns. With informed consent, blood samples were obtained from 50 patients before and after open transhiatal esophagectomy or pancreaticoduodenectomy. Twelve cases with complicated recoveries (death, infection, venous thromboembolism) were matched with 12 cases with uneventful recoveries. Global gene expression was assayed using human microarray chips. A 2-fold change with a corrected P < 0.05 was considered differentially expressed. In these 24 patients, 522 genes were differentially expressed after surgery; 248 (48%) were upregulated (innate immunity and inflammation) and 274 (52%) were downregulated [adaptive immunity (antigen presentation, T-cell function)]. Hierarchical clustering of the profile reliably predicted pre- and postoperative status. The within-patient change was 3.08 ± 0.91-fold. There was no measurable association with age, malignancy, procedure, surgery length, operative blood loss, or transfusion requirements, but was positively associated with postoperative infection (3.81 ± 0.97 vs 2.79 ± 0.73; P = 0.009) and hospital length of stay (r = 0.583, P = 0.003). Venous thromboembolism and mortality each occurred in one patient, thus no associations were possible. Major surgery induces a quantifiable pattern of gene expression change that is associated with adverse outcome. This could reflect early impaired adaptive immunity and suggests potential therapeutic targets to improve postoperative recovery.

  9. METAL ATTENUATION PROCESSES AT MINING SITES

    EPA Science Inventory

    The purpose of this Issue Paper is to provide scientists and engineers responsible for assessing remediation technologies with background information on MNA processes at mining-impacted sites. The global magnitude of the acid drainage problem is clear evidence that in most cases...

  10. Unsupervised text mining for assessing and augmenting GWAS results.

    PubMed

    Ailem, Melissa; Role, François; Nadif, Mohamed; Demenais, Florence

    2016-04-01

    Text mining can assist in the analysis and interpretation of large-scale biomedical data, helping biologists to quickly and cheaply gain confirmation of hypothesized relationships between biological entities. We set this question in the context of genome-wide association studies (GWAS), an actively emerging field that contributed to identify many genes associated with multifactorial diseases. These studies allow to identify groups of genes associated with the same phenotype, but provide no information about the relationships between these genes. Therefore, our objective is to leverage unsupervised text mining techniques using text-based cosine similarity comparisons and clustering applied to candidate and random gene vectors, in order to augment the GWAS results. We propose a generic framework which we used to characterize the relationships between 10 genes reported associated with asthma by a previous GWAS. The results of this experiment showed that the similarities between these 10 genes were significantly stronger than would be expected by chance (one-sided p-value<0.01). The clustering of observed and randomly selected gene also allowed to generate hypotheses about potential functional relationships between these genes and thus contributed to the discovery of new candidate genes for asthma. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. An Outbreak of Lymphocutaneous Sporotrichosis among Mine-Workers in South Africa.

    PubMed

    Govender, Nelesh P; Maphanga, Tsidiso G; Zulu, Thokozile G; Patel, Jaymati; Walaza, Sibongile; Jacobs, Charlene; Ebonwu, Joy I; Ntuli, Sindile; Naicker, Serisha D; Thomas, Juno

    2015-09-01

    The largest outbreak of sporotrichosis occurred between 1938 and 1947 in the gold mines of Witwatersrand in South Africa. Here, we describe an outbreak of lymphocutaneous sporotrichosis that was investigated in a South African gold mine in 2011. Employees working at a reopened section of the mine were recruited for a descriptive cross-sectional study. Informed consent was sought for interview, clinical examination and medical record review. Specimens were collected from participants with active or partially-healed lymphocutaneous lesions. Environmental samples were collected from underground mine levels. Sporothrix isolates were identified by sequencing of the internal transcribed spacer region of the ribosomal gene and the nuclear calmodulin gene. Of 87 male miners, 81 (93%) were interviewed and examined, of whom 29 (36%) had skin lesions; specimens were collected from 17 (59%). Sporotrichosis was laboratory-confirmed among 10 patients and seven had clinically-compatible lesions. Of 42 miners with known HIV status, 11 (26%) were HIV-infected. No cases of disseminated disease were detected. Participants with ≤ 3 years' mining experience had a four times greater odds of developing sporotrichosis than those who had been employed for >3 years (adjusted OR 4.0, 95% CI 1.2-13.1). Isolates from 8 patients were identified as Sporothrix schenckii sensu stricto by calmodulin gene sequencing while environmental isolates were identified as Sporothrix mexicana. S. schenckii sensu stricto was identified as the causative pathogen. Although genetically distinct species were isolated from clinical and environmental sources, it is likely that the source was contaminated soil and untreated wood underground. No cases occurred following recommendations to close sections of the mine, treat timber and encourage consistent use of personal protective equipment. Sporotrichosis is a potentially re-emerging disease where traditional, rather than heavily mechanised, mining techniques are

  12. An Outbreak of Lymphocutaneous Sporotrichosis among Mine-Workers in South Africa

    PubMed Central

    Govender, Nelesh P.; Maphanga, Tsidiso G.; Zulu, Thokozile G.; Patel, Jaymati; Walaza, Sibongile; Jacobs, Charlene; Ebonwu, Joy I.; Ntuli, Sindile; Naicker, Serisha D.; Thomas, Juno

    2015-01-01

    Background The largest outbreak of sporotrichosis occurred between 1938 and 1947 in the gold mines of Witwatersrand in South Africa. Here, we describe an outbreak of lymphocutaneous sporotrichosis that was investigated in a South African gold mine in 2011. Methodology Employees working at a reopened section of the mine were recruited for a descriptive cross-sectional study. Informed consent was sought for interview, clinical examination and medical record review. Specimens were collected from participants with active or partially-healed lymphocutaneous lesions. Environmental samples were collected from underground mine levels. Sporothrix isolates were identified by sequencing of the internal transcribed spacer region of the ribosomal gene and the nuclear calmodulin gene. Principal Findings Of 87 male miners, 81 (93%) were interviewed and examined, of whom 29 (36%) had skin lesions; specimens were collected from 17 (59%). Sporotrichosis was laboratory-confirmed among 10 patients and seven had clinically-compatible lesions. Of 42 miners with known HIV status, 11 (26%) were HIV-infected. No cases of disseminated disease were detected. Participants with ≤3 years’ mining experience had a four times greater odds of developing sporotrichosis than those who had been employed for >3 years (adjusted OR 4.0, 95% CI 1.2–13.1). Isolates from 8 patients were identified as Sporothrix schenckii sensu stricto by calmodulin gene sequencing while environmental isolates were identified as Sporothrix mexicana. Conclusions/Significance S. schenckii sensu stricto was identified as the causative pathogen. Although genetically distinct species were isolated from clinical and environmental sources, it is likely that the source was contaminated soil and untreated wood underground. No cases occurred following recommendations to close sections of the mine, treat timber and encourage consistent use of personal protective equipment. Sporotrichosis is a potentially re-emerging disease where

  13. PlanMine--a mineable resource of planarian biology and biodiversity.

    PubMed

    Brandl, Holger; Moon, HongKee; Vila-Farré, Miquel; Liu, Shang-Yun; Henry, Ian; Rink, Jochen C

    2016-01-04

    Planarian flatworms are in the midst of a renaissance as a model system for regeneration and stem cells. Besides two well-studied model species, hundreds of species exist worldwide that present a fascinating diversity of regenerative abilities, tissue turnover rates, reproductive strategies and other life history traits. PlanMine (http://planmine.mpi-cbg.de/) aims to accomplish two primary missions: First, to provide an easily accessible platform for sharing, comparing and value-added mining of planarian sequence data. Second, to catalyze the comparative analysis of the phenotypic diversity amongst planarian species. Currently, PlanMine houses transcriptomes independently assembled by our lab and community contributors. Detailed assembly/annotation statistics, a custom-developed BLAST viewer and easy export options enable comparisons at the contig and assembly level. Consistent annotation of all transcriptomes by an automated pipeline, the integration of published gene expression information and inter-relational query tools provide opportunities for mining planarian gene sequences and functions. For inter-species comparisons, we include transcriptomes of, so far, six planarian species, along with images, expert-curated information on their biology and pre-calculated cross-species sequence homologies. PlanMine is based on the popular InterMine system in order to make the rich biology of planarians accessible to the general life sciences research community. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. No immediate relief for large mining tire shortage

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fiscor, S.

    2006-05-15

    Inventories are low and they will not get better anytime soon, but mine operators do have a few options. The three main manufacturers supplying tires to the US mining industry, Bridgestone Firestone Off Road Tire Company, Goodyear and Michelin are struggling to keep up with demand, but are unlikely to restore inventories to manageable levels by 2009. Meanwhile Yokohama and Global Tyres have stepped in to help. The article reports plans for expansions to plant in Japan. In the meantime, mine operators need to plan accordingly and pay increased attention to tire maintenance. The larger the tyre, the less availablemore » it is. 3 photos.« less

  15. Gene Expression Patterns Associated With Histopathology in Toxic Liver Fibrosis.

    PubMed

    Ippolito, Danielle L; AbdulHameed, Mohamed Diwan M; Tawa, Gregory J; Baer, Christine E; Permenter, Matthew G; McDyre, Bonna C; Dennis, William E; Boyle, Molly H; Hobbs, Cheryl A; Streicker, Michael A; Snowden, Bobbi S; Lewis, John A; Wallqvist, Anders; Stallings, Jonathan D

    2016-01-01

    Toxic industrial chemicals induce liver injury, which is difficult to diagnose without invasive procedures. Identifying indicators of end organ injury can complement exposure-based assays and improve predictive power. A multiplexed approach was used to experimentally evaluate a panel of 67 genes predicted to be associated with the fibrosis pathology by computationally mining DrugMatrix, a publicly available repository of gene microarray data. Five-day oral gavage studies in male Sprague Dawley rats dosed with varying concentrations of 3 fibrogenic compounds (allyl alcohol, carbon tetrachloride, and 4,4'-methylenedianiline) and 2 nonfibrogenic compounds (bromobenzene and dexamethasone) were conducted. Fibrosis was definitively diagnosed by histopathology. The 67-plex gene panel accurately diagnosed fibrosis in both microarray and multiplexed-gene expression assays. Necrosis and inflammatory infiltration were comorbid with fibrosis. ANOVA with contrasts identified that 51 of the 67 predicted genes were significantly associated with the fibrosis phenotype, with 24 of these specific to fibrosis alone. The protein product of the gene most strongly correlated with the fibrosis phenotype PCOLCE (Procollagen C-Endopeptidase Enhancer) was dose-dependently elevated in plasma from animals administered fibrogenic chemicals (P < .05). Semiquantitative global mass spectrometry analysis of the plasma identified an additional 5 protein products of the gene panel which increased after fibrogenic toxicant administration: fibronectin, ceruloplasmin, vitronectin, insulin-like growth factor binding protein, and α2-macroglobulin. These results support the data mining approach for identifying gene and/or protein panels for assessing liver injury and may suggest bridging biomarkers for molecular mediators linked to histopathology. Published by Oxford University Press on behalf of the Society of Toxicology 2015. This work is written by US Government employees and is in the public

  16. Differential Connectivity in Colorectal Cancer Gene Expression Network

    PubMed

    Izadi, Fereshteh

    2018-05-30

    Colorectal cancer (CRC) is one of the challenging types of cancers; thus, exploring effective biomarkers related to colorectal could lead to significant progresses toward the treatment of this disease. In the present study, CRC gene expression datasets have been reanalyzed. Mutual differentially expressed genes across 294 normal mucosa and adjacent tumoral samples were then utilized in order to build two independent transcriptional regulatory networks. By analyzing the networks topologically, genes with differential global connectivity related to cancer state were determined for which the potential transcriptional regulators including transcription factors were identified. The majority of differentially connected genes (DCGs) were up-regulated in colorectal transcriptome experiments. Moreover, a number of these genes have been experimentally validated as cancer or CRC-associated genes. The DCGs, including GART, TGFB1, ITGA2, SLC16A5, SOX9, and MMP7, were investigated across 12 cancer types. Functional enrichment analysis followed by detailed data mining exhibited that these candidate genes could be related to CRC by mediating in metastatic cascade in addition to shared pathways with 12 cancer types by triggering the inflammatory events Our study uncovered correlated alterations in gene expression related to CRC susceptibility and progression that the potent candidate biomarkers could provide a link to disease.

  17. Gene expression profiling--Opening the black box of plant ecosystem responses to global change

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Leakey, A.D.B.; Ainsworth, E.A.; Bernard, S.M.

    The use of genomic techniques to address ecological questions is emerging as the field of genomic ecology. Experimentation under environmentally realistic conditions to investigate the molecular response of plants to meaningful changes in growth conditions and ecological interactions is the defining feature of genomic ecology. Since the impact of global change factors on plant performance are mediated by direct effects at the molecular, biochemical and physiological scales, gene expression analysis promises important advances in understanding factors that have previously been consigned to the 'black box' of unknown mechanism. Various tools and approaches are available for assessing gene expression in modelmore » and non-model species as part of global change biology studies. Each approach has its own unique advantages and constraints. A first generation of genomic ecology studies in managed ecosystems and mesocosms have provided a testbed for the approach and have begun to reveal how the experimental design and data analysis of gene expression studies can be tailored for use in an ecological context.« less

  18. Text-mining and information-retrieval services for molecular biology

    PubMed Central

    Krallinger, Martin; Valencia, Alfonso

    2005-01-01

    Text-mining in molecular biology - defined as the automatic extraction of information about genes, proteins and their functional relationships from text documents - has emerged as a hybrid discipline on the edges of the fields of information science, bioinformatics and computational linguistics. A range of text-mining applications have been developed recently that will improve access to knowledge for biologists and database annotators. PMID:15998455

  19. Global gene expression in cotton (Gossypium hirsutum L.) leaves to waterlogging stress.

    PubMed

    Zhang, Yanjun; Kong, Xiangqiang; Dai, Jianlong; Luo, Zhen; Li, Zhenhuai; Lu, Hequan; Xu, Shizhen; Tang, Wei; Zhang, Dongmei; Li, Weijiang; Xin, Chengsong; Dong, Hezhong

    2017-01-01

    Cotton is sensitive to waterlogging stress, which usually results in stunted growth and yield loss. To date, the molecular mechanisms underlying the responses to waterlogging in cotton remain elusive. Cotton was grown in a rain-shelter and subjected to 0 (control)-, 10-, 15- and 20-d waterlogging at flowering stage. The fourth-leaves on the main-stem from the top were sampled and immediately frozen in liquid nitrogen for physiological measurement. Global gene transcription in the leaves of 15-d waterlogged plants was analyzed by RNA-Seq. Seven hundred and ninety four genes were up-regulated and 1018 genes were down-regulated in waterlogged cotton leaves compared with non-waterlogged control. The differentially expressed genes were mainly related to photosynthesis, nitrogen metabolism, starch and sucrose metabolism, glycolysis and plant hormone signal transduction. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis indicated that most genes related to flavonoid biosynthesis, oxidative phosphorylation, amino acid metabolism and biosynthesis as well as circadian rhythm pathways were differently expressed. Waterlogging increased the expression of anaerobic fermentation related genes, such as alcohol dehydrogenase (ADH), but decreased the leaf chlorophyll concentration and photosynthesis by down-regulating the expression of photosynthesis related genes. Many genes related to plant hormones and transcription factors were differently expressed under waterlogging stress. Most of the ethylene related genes and ethylene-responsive factor-type transcription factors were up-regulated under water-logging stress, suggesting that ethylene may play key roles in the survival of cotton under waterlogging stress.

  20. Global gene expression in cotton (Gossypium hirsutum L.) leaves to waterlogging stress

    PubMed Central

    Zhang, Yanjun; Kong, Xiangqiang; Dai, Jianlong; Luo, Zhen; Li, Zhenhuai; Lu, Hequan; Xu, Shizhen; Tang, Wei; Zhang, Dongmei; Li, Weijiang; Xin, Chengsong

    2017-01-01

    Cotton is sensitive to waterlogging stress, which usually results in stunted growth and yield loss. To date, the molecular mechanisms underlying the responses to waterlogging in cotton remain elusive. Cotton was grown in a rain-shelter and subjected to 0 (control)-, 10-, 15- and 20-d waterlogging at flowering stage. The fourth-leaves on the main-stem from the top were sampled and immediately frozen in liquid nitrogen for physiological measurement. Global gene transcription in the leaves of 15-d waterlogged plants was analyzed by RNA-Seq. Seven hundred and ninety four genes were up-regulated and 1018 genes were down-regulated in waterlogged cotton leaves compared with non-waterlogged control. The differentially expressed genes were mainly related to photosynthesis, nitrogen metabolism, starch and sucrose metabolism, glycolysis and plant hormone signal transduction. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis indicated that most genes related to flavonoid biosynthesis, oxidative phosphorylation, amino acid metabolism and biosynthesis as well as circadian rhythm pathways were differently expressed. Waterlogging increased the expression of anaerobic fermentation related genes, such as alcohol dehydrogenase (ADH), but decreased the leaf chlorophyll concentration and photosynthesis by down-regulating the expression of photosynthesis related genes. Many genes related to plant hormones and transcription factors were differently expressed under waterlogging stress. Most of the ethylene related genes and ethylene-responsive factor-type transcription factors were up-regulated under water-logging stress, suggesting that ethylene may play key roles in the survival of cotton under waterlogging stress. PMID:28953908

  1. Systematic analysis of molecular mechanisms for HCC metastasis via text mining approach.

    PubMed

    Zhen, Cheng; Zhu, Caizhong; Chen, Haoyang; Xiong, Yiru; Tan, Junyuan; Chen, Dong; Li, Jin

    2017-02-21

    To systematically explore the molecular mechanism for hepatocellular carcinoma (HCC) metastasis and identify regulatory genes with text mining methods. Genes with highest frequencies and significant pathways related to HCC metastasis were listed. A handful of proteins such as EGFR, MDM2, TP53 and APP, were identified as hub nodes in PPI (protein-protein interaction) network. Compared with unique genes for HBV-HCCs, genes particular to HCV-HCCs were less, but may participate in more extensive signaling processes. VEGFA, PI3KCA, MAPK1, MMP9 and other genes may play important roles in multiple phenotypes of metastasis. Genes in abstracts of HCC-metastasis literatures were identified. Word frequency analysis, KEGG pathway and PPI network analysis were performed. Then co-occurrence analysis between genes and metastasis-related phenotypes were carried out. Text mining is effective for revealing potential regulators or pathways, but the purpose of it should be specific, and the combination of various methods will be more useful.

  2. QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

    PubMed Central

    Dhapola, Parashar; Chowdhury, Shantanu

    2016-01-01

    DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890

  3. Genomic Evidence of Chemotrophic Metabolisms in Deep-Dwelling Chloroflexi Conferred by Ancient Horizontal Gene Transfer Events

    NASA Astrophysics Data System (ADS)

    Momper, L. M.; Magnabosco, C.; Amend, J.; Osburn, M. R.; Fournier, G. P.

    2017-12-01

    The marine and terrestrial subsurface biospheres represent quite likely the largest reservoirs for life on Earth, directly impacting surface processes and global cycles throughout Earth's history. In the deep subsurface biosphere (DSB) organic carbon and energy are often extremely scarce. However, archaea and bacteria are able to persist in the DSB to at least 3.5 km below surface [1]. Understanding how they persist, and by what metabolisms they subsist, are key questions in this biosphere. To address these questions we investigated 5 global DSB environments: one legacy mine in South Dakota, USA, 3 mines in South Africa and marine fluids circulating beneath the Juan de Fuca Ridge. Boreholes within these mines provided access to fluids buried beneath the earth's surface and sampled depths down to 3.1 km. Geochemical data were collected concomitantly with DNA for metagenomic sequencing. We examined genomes of the ancient and deeply branching Chloroflexi for metabolic capabilities and interrogated the geochemical drivers behind those metabolisms with in situ thermodynamic modeling of reaction energetics. In total, 23 Chloroflexi genomes were identified and analyzed from the 5 subsurface sites. Genes for nitrate reduction (nar) and sulfite reduction (dsr) were found in many of the South Africa Chloroflexi but were absent from genomes collected in South Dakota. Indeed, nitrate reduction was among the most energetically favorable reactions in South African fluids (10-14 kJ cell-1 sec -1 per mol of reactant) and sulfur reduction with Fe2+ or H2 was also exergonic [2]. Conversely, genes for nitrite and nitrous oxide reduction (nrf, nir and nos) were found in genomes collected in South Dakota and Juan de Fuca, but not South Africa. We examined the origin of genes conferring these metabolisms in the Chloroflexi genomes. We discovered evidence for horizontal gene transfer (HGT) for all of these putative metabolisms. Retention of these genes in Chloroflexi lineages indicates

  4. The structure and infrastructure of the global nanotechnology literature

    NASA Astrophysics Data System (ADS)

    Kostoff, Ronald N.; Stump, Jesse A.; Johnson, Dustin; Murday, James S.; Lau, Clifford G. Y.; Tolles, William M.

    2006-08-01

    Text mining is the extraction of useful information from large volumes of text. A text mining analysis of the global open nanotechnology literature was performed. Records from the Science Citation Index (SCI)/Social SCI were analyzed to provide the infrastructure of the global nanotechnology literature (prolific authors/journals/institutions/countries, most cited authors/papers/journals) and the thematic structure (taxonomy) of the global nanotechnology literature, from a science perspective. Records from the Engineering Compendex (EC) were analyzed to provide a taxonomy from a technology perspective. The Far Eastern countries have expanded nanotechnology publication output dramatically in the past decade.

  5. ClusterMine360: a database of microbial PKS/NRPS biosynthesis

    PubMed Central

    Conway, Kyle R.; Boddy, Christopher N.

    2013-01-01

    ClusterMine360 (http://www.clustermine360.ca/) is a database of microbial polyketide and non-ribosomal peptide gene clusters. It takes advantage of crowd-sourcing by allowing members of the community to make contributions while automation is used to help achieve high data consistency and quality. The database currently has >200 gene clusters from >185 compound families. It also features a unique sequence repository containing >10 000 polyketide synthase/non-ribosomal peptide synthetase domains. The sequences are filterable and downloadable as individual or multiple sequence FASTA files. We are confident that this database will be a useful resource for members of the polyketide synthases/non-ribosomal peptide synthetases research community, enabling them to keep up with the growing number of sequenced gene clusters and rapidly mine these clusters for functional information. PMID:23104377

  6. An index for drought induced financial risk in the mining industry

    NASA Astrophysics Data System (ADS)

    Bonnafous, L.; Lall, U.; Siegel, J.

    2017-02-01

    Water scarcity has emerged as a potential risk for mining operations. High capital spending for desalination and water conflicts leading to asset stranding have recently occurred. Investors in mining companies are interested in the exposure to such risks across portfolios of mining assets (whether the practical at-site consequences are foregone production, higher OPEX and CAPEX and ensuing lost revenues, or asset-stranding). In this paper, an index of the potential financial exposure of a portfolio is developed and its application is illustrated. Since the likely loss at each mine is hard to estimate a priori, one needs a proxy for potential loss. The index considers drought duration, severity and frequency (defined by a return-level in years) at each mining asset, and provides a measure of financial exposure through weighing of production or Net Asset Value. Changes in human needs are not considered, but are relevant, and could be incorporated if global data on mine and other water use were available at the appropriate resolution. Potential for contemporaneous drought incidence across sites in a portfolio is considered specifically. Through an appropriate choice of drought thresholds, an analyst can customize a scenario to assess potential losses in production value or profits, or whether conflicts could emerge that would lead to stranded assets or capital expenditure to secure alternate water supplies. Global climate data sets that allow a customized development of such an index are identified, and selected mining company portfolios are scored as to the risk associated with one publicly available drought index.

  7. Uranium from Africa - An overview on past and current mining activities: Re-appraising associated risks and chances in a global context

    NASA Astrophysics Data System (ADS)

    Winde, Frank; Brugge, Doug; Nidecker, Andreas; Ruegg, Urs

    2017-05-01

    In 2003, nuclear power received renewed interest as a perceived climate-neutral way to meet high energy demands of large industrialized countries, such as China, India, Russia and the USA. It triggered a growing demand for uranium (U) as nuclear fuel. Dubbed the 'nuclear renaissance', the U-price rose over tenfold before the global credit crisis dampend the rush. Many efforts to capitalise on the renewed demand focused on Africa. This paper provides an overview on the type and extent of uranium mining, production and exploration on the African continent and discusses the economic benefits as well as the potential environmental and health risks and the long-term needs for remediation of legacy sites. The actual historical results of uranium mining activities in more than thirty African countries provide data against which to assess the existing risks of uranium development. The already existing uraniferous waste in several African countries threatens scarce water resources and the health of adjacent residents. Responsibility should rest with the governments and the companies to ensure that these threats are not realized.

  8. Global gene expression during stringent response in Corynebacterium glutamicum in presence and absence of the rel gene encoding (p)ppGpp synthase

    PubMed Central

    Brockmann-Gretza, Olaf; Kalinowski, Jörn

    2006-01-01

    Background The stringent response is the initial reaction of microorganisms to nutritional stress. During stringent response the small nucleotides (p)ppGpp act as global regulators and reprogram bacterial transcription. In this work, the genetic network controlled by the stringent response was characterized in the amino acid-producing Corynebacterium glutamicum. Results The transcriptome of a C. glutamicum rel gene deletion mutant, unable to synthesize (p)ppGpp and to induce the stringent response, was compared with that of its rel-proficient parent strain by microarray analysis. A total of 357 genes were found to be transcribed differentially in the rel-deficient mutant strain. In a second experiment, the stringent response was induced by addition of DL-serine hydroxamate (SHX) in early exponential growth phase. The time point of the maximal effect on transcription was determined by real-time RT-PCR using the histidine and serine biosynthetic genes. Transcription of all of these genes reached a maximum at 10 minutes after SHX addition. Microarray experiments were performed comparing the transcriptomes of SHX-induced cultures of the rel-proficient strain and the rel mutant. The differentially expressed genes were grouped into three classes. Class A comprises genes which are differentially regulated only in the presence of an intact rel gene. This class includes the non-essential sigma factor gene sigB which was upregulated and a large number of genes involved in nitrogen metabolism which were downregulated. Class B comprises genes which were differentially regulated in response to SHX in both strains, independent of the rel gene. A large number of genes encoding ribosomal proteins fall into this class, all being downregulated. Class C comprises genes which were differentially regulated in response to SHX only in the rel mutant. This class includes genes encoding putative stress proteins and global transcriptional regulators that might be responsible for the complex

  9. Data mining reveals a network of early-response genes as a consensus signature of drug-induced in vitro and in vivo toxicity.

    PubMed

    Zhang, J D; Berntenis, N; Roth, A; Ebeling, M

    2014-06-01

    Gene signatures of drug-induced toxicity are of broad interest, but they are often identified from small-scale, single-time point experiments, and are therefore of limited applicability. To address this issue, we performed multivariate analysis of gene expression, cell-based assays, and histopathological data in the TG-GATEs (Toxicogenomics Project-Genomics Assisted Toxicity Evaluation system) database. Data mining highlights four genes-EGR1, ATF3, GDF15 and FGF21-that are induced 2 h after drug administration in human and rat primary hepatocytes poised to eventually undergo cytotoxicity-induced cell death. Modelling and simulation reveals that these early stress-response genes form a functional network with evolutionarily conserved structure and intrinsic dynamics. This is underlined by the fact that early induction of this network in vivo predicts drug-induced liver and kidney pathology with high accuracy. Our findings demonstrate the value of early gene-expression signatures in predicting and understanding compound-induced toxicity. The identified network can empower first-line tests that reduce animal use and costs of safety evaluation.

  10. Mine Water Treatment in Hongai Coal Mines

    NASA Astrophysics Data System (ADS)

    Dang, Phuong Thao; Dang, Vu Chi

    2018-03-01

    Acid mine drainage (AMD) is recognized as one of the most serious environmental problem associated with mining industry. Acid water, also known as acid mine drainage forms when iron sulfide minerals found in the rock of coal seams are exposed to oxidizing conditions in coal mining. Until 2009, mine drainage in Hongai coal mines was not treated, leading to harmful effects on humans, animals and aquatic ecosystem. This report has examined acid mine drainage problem and techniques for acid mine drainage treatment in Hongai coal mines. In addition, selection and criteria for the design of the treatment systems have been presented.

  11. WGSSAT: A High-Throughput Computational Pipeline for Mining and Annotation of SSR Markers From Whole Genomes.

    PubMed

    Pandey, Manmohan; Kumar, Ravindra; Srivastava, Prachi; Agarwal, Suyash; Srivastava, Shreya; Nagpure, Naresh S; Jena, Joy K; Kushwaha, Basdeo

    2018-03-16

    Mining and characterization of Simple Sequence Repeat (SSR) markers from whole genomes provide valuable information about biological significance of SSR distribution and also facilitate development of markers for genetic analysis. Whole genome sequencing (WGS)-SSR Annotation Tool (WGSSAT) is a graphical user interface pipeline developed using Java Netbeans and Perl scripts which facilitates in simplifying the process of SSR mining and characterization. WGSSAT takes input in FASTA format and automates the prediction of genes, noncoding RNA (ncRNA), core genes, repeats and SSRs from whole genomes followed by mapping of the predicted SSRs onto a genome (classified according to genes, ncRNA, repeats, exonic, intronic, and core gene region) along with primer identification and mining of cross-species markers. The program also generates a detailed statistical report along with visualization of mapped SSRs, genes, core genes, and RNAs. The features of WGSSAT were demonstrated using Takifugu rubripes data. This yielded a total of 139 057 SSR, out of which 113 703 SSR primer pairs were uniquely amplified in silico onto a T. rubripes (fugu) genome. Out of 113 703 mined SSRs, 81 463 were from coding region (including 4286 exonic and 77 177 intronic), 7 from RNA, 267 from core genes of fugu, whereas 105 641 SSR and 601 SSR primer pairs were uniquely mapped onto the medaka genome. WGSSAT is tested under Ubuntu Linux. The source code, documentation, user manual, example dataset and scripts are available online at https://sourceforge.net/projects/wgssat-nbfgr.

  12. Mobile genes in the human microbiome are structured from global to individual scales

    PubMed Central

    Brito, IL; Jupiter, SD; Jenkins, AP; Naisilisili, W; Tamminen, M; Smillie, CS; Wortman, JR; Birren, BW; Xavier, RJ; Blainey, PC; Singh, AK; Gevers, D; Alm, EJ

    2016-01-01

    Recent work has underscored the importance of the microbiome in human health, largely attributing differences in phenotype to differences in the species present across individuals1,2,3,4,5. But mobile genes can confer profoundly different phenotypes on different strains of the same species. Little is known about the function and distribution of mobile genes in the human microbiome, and in particular whether the gene pool is globally homogenous or constrained by human population structure. Here, we investigate this question by comparing the mobile genes found in the microbiomes of 81 metropolitan North Americans with that of 172 agrarian Fiji islanders using a combination of single-cell genomics and metagenomics. We find large differences in mobile gene content between the Fijian and North American microbiomes, with functional variation that mirrors known dietary differences such as the excess of plant-based starch degradation genes. Remarkably, differences are also observed between the mobile gene pools of proximal Fijian villages, even though microbiome composition across villages is similar. Finally, we observe high rates of recombination leading to individual-specific mobile elements, suggesting that the abundance of some genes may reflect environmental selection rather than dispersal limitation. Together, these data support the hypothesis that human activities and behaviors provide selective pressures that shape mobile gene pools, and that acquisition of mobile genes is important to colonizing specific human populations. PMID:27409808

  13. Constraining Modern and Historic Mercury Emissions From Gold Mining

    NASA Astrophysics Data System (ADS)

    Strode, S. A.; Jaeglé, L.; Selin, N. E.; Sunderland, E.

    2007-12-01

    Mercury emissions from both historic gold and silver mining and modern small-scale gold mining are highly uncertain. Historic mercury emissions can affect the modern atmosphere through reemission from land and ocean, and quantifying mercury emissions from historic gold and silver mining can help constrain modern mining sources. While estimates of mercury emissions during historic gold rushes exceed modern anthropogenic mercury emissions in North America, sediment records in many regions do not show a strong gold rush signal. We use the GEOS-Chem chemical transport model to determine the spatial footprint of mercury emissions from mining and compare model runs from gold rush periods to sediment and ice core records of historic mercury deposition. Based on records of gold and silver production, we include mercury emissions from North and South American mining of 1900 Mg/year in 1880, compared to modern global anthropogenic emissions of 3400 Mg/year. Including this large mining source in GEOS-Chem leads to an overestimate of the modeled 1880 to preindustrial enhancement ratio compared to the sediment core record. We conduct sensitivity studies to constrain the level of mercury emissions from modern and historic mining that is consistent with the deposition records for different regions.

  14. Mercury Mining in Mexico: I. Community Engagement to Improve Health Outcomes from Artisanal Mining.

    PubMed

    Camacho, Andrea; Van Brussel, Evelyn; Carrizales, Leticia; Flores-Ramírez, Rogelio; Verduzco, Beatriz; Huerta, Selene Ruvalcaba-Aranda; Leon, Mauricio; Díaz-Barriga, Fernando

    2016-01-01

    Mercury is an element that cannot be destroyed and is a global threat to human and environmental health. In Latin America and the Caribbean, artisanal and small-scale gold mining represents the main source of mercury emissions, releases, and consumption. However, another source of concern is the primary production of mercury. In the case of Mexico, in the past 2 years the informal production of mercury mining has increased 10-fold. Considering this scenario, an intervention program was initiated to reduce health risks in the mining communities. The program's final goal is to introduce different alternatives in line to stop the mining of mercury, but introducing at the same time, a community-based development program. The aim of this study was to present results from a preliminary study in the community of Plazuela, located in the municipality of Peñamiller in the State of Queretaro, Mexico. Total mercury was measured in urine and environmental samples using atomic absorption spectrometry by cold vapor technique. Urine samples were collected from children aged 6-14 years and who had lived in the selected area from birth. Urine samples were also collected from miners who were currently working in the mine. To confirm the presence of mercury in the community, mining waste, water, soil, and sediment samples were collected from those high-risk areas identified by members of the community. Children, women, and miners were heavily exposed to mercury (urine samples); and in agreement, we registered high concentrations of mercury in soils and sediments. Considering these results and taking into account that the risk perception toward mercury toxicity is very low in the community (mining is the only economic activity), an integral intervention program has started. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  15. An Integrative data mining approach to identifying Adverse ...

    EPA Pesticide Factsheets

    The Adverse Outcome Pathway (AOP) framework is a tool for making biological connections and summarizing key information across different levels of biological organization to connect biological perturbations at the molecular level to adverse outcomes for an individual or population. Computational approaches to explore and determine these connections can accelerate the assembly of AOPs. By leveraging the wealth of publicly available data covering chemical effects on biological systems, computationally-predicted AOPs (cpAOPs) were assembled via data mining of high-throughput screening (HTS) in vitro data, in vivo data and other disease phenotype information. Frequent Itemset Mining (FIM) was used to find associations between the gene targets of ToxCast HTS assays and disease data from Comparative Toxicogenomics Database (CTD) by using the chemicals as the common aggregators between datasets. The method was also used to map gene expression data to disease data from CTD. A cpAOP network was defined by considering genes and diseases as nodes and FIM associations as edges. This network contained 18,283 gene to disease associations for the ToxCast data and 110,253 for CTD gene expression. Two case studies show the value of the cpAOP network by extracting subnetworks focused either on fatty liver disease or the Aryl Hydrocarbon Receptor (AHR). The subnetwork surrounding fatty liver disease included many genes known to play a role in this disease. When querying the cpAOP

  16. Phylogeny-guided (meta)genome mining approach for the targeted discovery of new microbial natural products.

    PubMed

    Kang, Hahk-Soo

    2017-02-01

    Genomics-based methods are now commonplace in natural products research. A phylogeny-guided mining approach provides a means to quickly screen a large number of microbial genomes or metagenomes in search of new biosynthetic gene clusters of interest. In this approach, biosynthetic genes serve as molecular markers, and phylogenetic trees built with known and unknown marker gene sequences are used to quickly prioritize biosynthetic gene clusters for their metabolites characterization. An increase in the use of this approach has been observed for the last couple of years along with the emergence of low cost sequencing technologies. The aim of this review is to discuss the basic concept of a phylogeny-guided mining approach, and also to provide examples in which this approach was successfully applied to discover new natural products from microbial genomes and metagenomes. I believe that the phylogeny-guided mining approach will continue to play an important role in genomics-based natural products research.

  17. Text mining for the biocuration workflow

    PubMed Central

    Hirschman, Lynette; Burns, Gully A. P. C; Krallinger, Martin; Arighi, Cecilia; Cohen, K. Bretonnel; Valencia, Alfonso; Wu, Cathy H.; Chatr-Aryamontri, Andrew; Dowell, Karen G.; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G.

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on ‘Text Mining for the BioCuration Workflow’ at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community. PMID:22513129

  18. Text mining for the biocuration workflow.

    PubMed

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  19. On the Role of Mining Exposure in Epigenetic Effects in Parkinson's Disease.

    PubMed

    Castillo, Sebastian; Muñoz, Patricia; Behrens, Maria Isabel; Diaz-Grez, Fernando; Segura-Aguilar, Juan

    2017-08-01

    To explore the possible influence of heavy metal mining on incidence of Parkinson's disease (PD), global DNA methylation was assessed in blood samples from a population of PD patients (n = 45) and control subjects (n = 52) in Antofagasta neighborhood, a Chilean city built for exclusive use of mining companies. Comparisons were made with PD subjects (n = 52) and control subjects (n = 59) from Santiago Chile, a city having little association with mining. All subjects were assessed by two neurologists and PD diagnosis was based on UK Parkinson's Disease Society Brain Bank Clinical Diagnostic Criteria. From blood samples obtained from each individual, a decrease in global DNA methylation was observed in PD patients either exposed (49% of control, P < 0.001) or not exposed (47% of control, P < 0.001) to mining activity. Although there was no difference in levels of DNA methylation between PD patients from the two cities, there was a lower level of DNA methylation in control subjects from Santiago versus Antofagasta.

  20. Managing biological networks by using text mining and computer-aided curation

    NASA Astrophysics Data System (ADS)

    Yu, Seok Jong; Cho, Yongseong; Lee, Min-Ho; Lim, Jongtae; Yoo, Jaesoo

    2015-11-01

    In order to understand a biological mechanism in a cell, a researcher should collect a huge number of protein interactions with experimental data from experiments and the literature. Text mining systems that extract biological interactions from papers have been used to construct biological networks for a few decades. Even though the text mining of literature is necessary to construct a biological network, few systems with a text mining tool are available for biologists who want to construct their own biological networks. We have developed a biological network construction system called BioKnowledge Viewer that can generate a biological interaction network by using a text mining tool and biological taggers. It also Boolean simulation software to provide a biological modeling system to simulate the model that is made with the text mining tool. A user can download PubMed articles and construct a biological network by using the Multi-level Knowledge Emergence Model (KMEM), MetaMap, and A Biomedical Named Entity Recognizer (ABNER) as a text mining tool. To evaluate the system, we constructed an aging-related biological network that consist 9,415 nodes (genes) by using manual curation. With network analysis, we found that several genes, including JNK, AP-1, and BCL-2, were highly related in aging biological network. We provide a semi-automatic curation environment so that users can obtain a graph database for managing text mining results that are generated in the server system and can navigate the network with BioKnowledge Viewer, which is freely available at http://bioknowledgeviewer.kisti.re.kr.

  1. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE PAGES

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less

  2. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less

  3. Fungal diversity in major oil-shale mines in China.

    PubMed

    Jiang, Shaoyan; Wang, Wenxing; Xue, Xiangxin; Cao, Chengyou; Zhang, Ying

    2016-03-01

    As an insufficiently utilized energy resource, oil shale is conducive to the formation of characteristic microbial communities due to its special geological origins. However, little is known about fungal diversity in oil shale. Polymerase chain reaction cloning was used to construct the fungal ribosomal deoxyribonucleic acid internal transcribed spacer (rDNA ITS) clone libraries of Huadian Mine in Jilin Province, Maoming Mine in Guangdong Province, and Fushun Mine in Liaoning Province. Pure culture and molecular identification were applied for the isolation of cultivable fungi in fresh oil shale of each mine. Results of clone libraries indicated that each mine had over 50% Ascomycota (58.4%-98.9%) and 1.1%-13.5% unidentified fungi. Fushun Mine and Huadian Mine had 5.9% and 28.1% Basidiomycota, respectively. Huadian Mine showed the highest fungal diversity, followed by Fushun Mine and Maoming Mine. Jaccard indexes showed that the similarities between any two of three fungal communities at the genus level were very low, indicating that fungi in each mine developed independently during the long geological adaptation and formed a community composition fitting the environment. In the fresh oil-shale samples of the three mines, cultivable fungal phyla were consistent with the results of clone libraries. Fifteen genera and several unidentified fungi were identified as Ascomycota and Basidiomycota using pure culture. Penicillium was the only genus found in all three mines. These findings contributed to gaining a clear understanding of current fungal resources in major oil-shale mines in China and provided useful information for relevant studies on isolation of indigenous fungi carrying functional genes from oil shale. Copyright © 2015. Published by Elsevier B.V.

  4. Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends.

    PubMed

    Jurca, Gabriela; Addam, Omar; Aksac, Alper; Gao, Shang; Özyer, Tansel; Demetrick, Douglas; Alhajj, Reda

    2016-04-26

    Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community. Thus, biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast amount of scientific publications on breast cancer make this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer. We utilized PubMed for the testing. We investigated gene-gene interactions, as well as novel interactions such as gene-year, gene-country, and abstract-country to find out how the discoveries varied over time and how overlapping/diverse are the discoveries and the interest of various research groups in different countries. Interesting trends have been identified and discussed, e.g., different genes are highlighted in relationship to different countries though the various genes were found to share functionality. Some text analysis based results have been validated against results from other tools that predict gene-gene relations and gene functions.

  5. Shared control of gene expression in bacteria by transcription factors and global physiology of the cell

    PubMed Central

    Berthoumieux, Sara; de Jong, Hidde; Baptist, Guillaume; Pinel, Corinne; Ranquet, Caroline; Ropers, Delphine; Geiselmann, Johannes

    2013-01-01

    Gene expression is controlled by the joint effect of (i) the global physiological state of the cell, in particular the activity of the gene expression machinery, and (ii) DNA-binding transcription factors and other specific regulators. We present a model-based approach to distinguish between these two effects using time-resolved measurements of promoter activities. We demonstrate the strength of the approach by analyzing a circuit involved in the regulation of carbon metabolism in E. coli. Our results show that the transcriptional response of the network is controlled by the physiological state of the cell and the signaling metabolite cyclic AMP (cAMP). The absence of a strong regulatory effect of transcription factors suggests that they are not the main coordinators of gene expression changes during growth transitions, but rather that they complement the effect of global physiological control mechanisms. This change of perspective has important consequences for the interpretation of transcriptome data and the design of biological networks in biotechnology and synthetic biology. PMID:23340840

  6. Differential global gene expression in red and white skeletal muscle

    NASA Technical Reports Server (NTRS)

    Campbell, W. G.; Gordon, S. E.; Carlson, C. J.; Pattison, J. S.; Hamilton, M. T.; Booth, F. W.

    2001-01-01

    The differences in gene expression among the fiber types of skeletal muscle have long fascinated scientists, but for the most part, previous experiments have only reported differences of one or two genes at a time. The evolving technology of global mRNA expression analysis was employed to determine the potential differential expression of approximately 3,000 mRNAs between the white quad (white muscle) and the red soleus muscle (mixed red muscle) of female ICR mice (30-35 g). Microarray analysis identified 49 mRNA sequences that were differentially expressed between white and mixed red skeletal muscle, including newly identified differential expressions between muscle types. For example, the current findings increase the number of known, differentially expressed mRNAs for transcription factors/coregulators by nine and signaling proteins by three. The expanding knowledge of the diversity of mRNA expression between white and mixed red muscle suggests that there could be quite a complex regulation of phenotype between muscles of different fiber types.

  7. The BET protein FSH functionally interacts with ASH1 to orchestrate global gene activity in Drosophila

    PubMed Central

    2013-01-01

    Background The question of how cells re-establish gene expression states after cell division is still poorly understood. Genetic and molecular analyses have indicated that Trithorax group (TrxG) proteins are critical for the long-term maintenance of active gene expression states in many organisms. A generally accepted model suggests that TrxG proteins contribute to maintenance of transcription by protecting genes from inappropriate Polycomb group (PcG)-mediated silencing, instead of directly promoting transcription. Results and discussion Here we report a physical and functional interaction in Drosophila between two members of the TrxG, the histone methyltransferase ASH1 and the bromodomain and extraterminal family protein FSH. We investigated this interface at the genome level, uncovering a widespread co-localization of both proteins at promoters and PcG-bound intergenic elements. Our integrative analysis of chromatin maps and gene expression profiles revealed that the observed ASH1-FSH binding pattern at promoters is a hallmark of active genes. Inhibition of FSH-binding to chromatin resulted in global down-regulation of transcription. In addition, we found that genes displaying marks of robust PcG-mediated repression also have ASH1 and FSH bound to their promoters. Conclusions Our data strongly favor a global coactivator function of ASH1 and FSH during transcription, as opposed to the notion that TrxG proteins impede inappropriate PcG-mediated silencing, but are dispensable elsewhere. Instead, our results suggest that PcG repression needs to overcome the transcription-promoting function of ASH1 and FSH in order to silence genes. PMID:23442797

  8. Epigenetic genome mining of an endophytic fungus leads to the pleiotropic biosynthesis of natural products.

    PubMed

    Mao, Xu-Ming; Xu, Wei; Li, Dehai; Yin, Wen-Bing; Chooi, Yit-Heng; Li, Yong-Quan; Tang, Yi; Hu, Youcai

    2015-06-22

    The small-molecule biosynthetic potential of most filamentous fungi has remained largely unexplored and represents an attractive source for the discovery of new compounds. Genome sequencing of Calcarisporium arbuscula, a mushroom-endophytic fungus, revealed 68 core genes that are involved in natural product biosynthesis. This is in sharp contrast to the predominant production of the ATPase inhibitors aurovertin B and D in the wild-type fungus. Inactivation of a histone H3 deacetylase led to pleiotropic activation and overexpression of more than 75 % of the biosynthetic genes. Sampling of the overproduced compounds led to the isolation of ten compounds of which four contained new structures, including the cyclic peptides arbumycin and arbumelin, the diterpenoid arbuscullic acid A, and the meroterpenoid arbuscullic acid B. Such epigenetic modifications therefore provide a rapid and global approach to mine the chemical diversity of endophytic fungi. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Impacts of surface gold mining on land use systems in Western Ghana.

    PubMed

    Schueler, Vivian; Kuemmerle, Tobias; Schröder, Hilmar

    2011-07-01

    Land use conflicts are becoming increasingly apparent from local to global scales. Surface gold mining is an extreme source of such a conflict, but mining impacts on local livelihoods often remain unclear. Our goal here was to assess land cover change due to gold surface mining in Western Ghana, one of the world's leading gold mining regions, and to study how these changes affected land use systems. We used Landsat satellite images from 1986-2002 to map land cover change and field interviews with farmers to understand the livelihood implications of mining-related land cover change. Our results showed that surface mining resulted in deforestation (58%), a substantial loss of farmland (45%) within mining concessions, and widespread spill-over effects as relocated farmers expand farmland into forests. This points to rapidly eroding livelihood foundations, suggesting that the environmental and social costs of Ghana's gold boom may be much higher than previously thought.

  10. Building a glaucoma interaction network using a text mining approach.

    PubMed

    Soliman, Maha; Nasraoui, Olfa; Cooper, Nigel G F

    2016-01-01

    The volume of biomedical literature and its underlying knowledge base is rapidly expanding, making it beyond the ability of a single human being to read through all the literature. Several automated methods have been developed to help make sense of this dilemma. The present study reports on the results of a text mining approach to extract gene interactions from the data warehouse of published experimental results which are then used to benchmark an interaction network associated with glaucoma. To the best of our knowledge, there is, as yet, no glaucoma interaction network derived solely from text mining approaches. The presence of such a network could provide a useful summative knowledge base to complement other forms of clinical information related to this disease. A glaucoma corpus was constructed from PubMed Central and a text mining approach was applied to extract genes and their relations from this corpus. The extracted relations between genes were checked using reference interaction databases and classified generally as known or new relations. The extracted genes and relations were then used to construct a glaucoma interaction network. Analysis of the resulting network indicated that it bears the characteristics of a small world interaction network. Our analysis showed the presence of seven glaucoma linked genes that defined the network modularity. A web-based system for browsing and visualizing the extracted glaucoma related interaction networks is made available at http://neurogene.spd.louisville.edu/GlaucomaINViewer/Form1.aspx. This study has reported the first version of a glaucoma interaction network using a text mining approach. The power of such an approach is in its ability to cover a wide range of glaucoma related studies published over many years. Hence, a bigger picture of the disease can be established. To the best of our knowledge, this is the first glaucoma interaction network to summarize the known literature. The major findings were a set of

  11. Sediment processes modelling below hydraulic mining: towards environmental impact mitigation

    NASA Astrophysics Data System (ADS)

    Chalov, Sergey R.

    2010-05-01

    Placer mining sites are located in the river valleys so the rivers are influenced by mining operations. Frequently the existing mining sites are characterized by low contribution to the environmental technologies. Therefore hydraulic mining alters stream hydrology and sediment processes and increases water turbidity. The most serious environmental sequences of the sediment yield increase occur in the rivers populated by salmon fish community because salmon species prefer clean water with low turbidity. For instance, the placer mining in Kamchatka peninsula (Far East of Russia) which is regarded to be the last global gene pool of wild salmon Oncorhynchus threatens the rivers ecosystems. System of man-made impact mitigation could be done through the exact recognition of the human role in hydrological processes and sediment transport especially. Sediment budget of rivers below mining sites is transformed according to the appearance of the man-made non-point and point sediment sources. Non-point source pollution occurs due to soil erosion on the exposed hillsides and erosion in the channel diversions. Slope wash on the hillsides is absent during summer days without rainfalls and is many times increased during rainfalls and snow melting. The nearness of the sources of material and the rivers leads to the small time of suspended load increase after rainfalls. The average time of material intake from exposed hillsides to the rivers is less than 1 hour. The main reason of the incision in the channel diversion is river-channel straightening. The increase of channel slopes and transport capacity leads to the intensive incision of flow. Point source pollution is performed by effluents both from mining site (mainly brief effluents) and from settling ponds (permanent effluents), groundwater seepage from tailing pits or from quarries. High rate of groundwater runoff is the main reason of the technological ponds overfilling. Intensive filtration from channel to ponds because of

  12. Elevated rates of gold mining in the Amazon revealed through high-resolution monitoring.

    PubMed

    Asner, Gregory P; Llactayo, William; Tupayachi, Raul; Luna, Ernesto Ráez

    2013-11-12

    Gold mining has rapidly increased in western Amazonia, but the rates and ecological impacts of mining remain poorly known and potentially underestimated. We combined field surveys, airborne mapping, and high-resolution satellite imaging to assess road- and river-based gold mining in the Madre de Dios region of the Peruvian Amazon from 1999 to 2012. In this period, the geographic extent of gold mining increased 400%. The average annual rate of forest loss as a result of gold mining tripled in 2008 following the global economic recession, closely associated with increased gold prices. Small clandestine operations now comprise more than half of all gold mining activities throughout the region. These rates of gold mining are far higher than previous estimates that were based on traditional satellite mapping techniques. Our results prove that gold mining is growing more rapidly than previously thought, and that high-resolution monitoring approaches are required to accurately quantify human impacts on tropical forests.

  13. Elevated rates of gold mining in the Amazon revealed through high-resolution monitoring

    PubMed Central

    Asner, Gregory P.; Llactayo, William; Tupayachi, Raul; Luna, Ernesto Ráez

    2013-01-01

    Gold mining has rapidly increased in western Amazonia, but the rates and ecological impacts of mining remain poorly known and potentially underestimated. We combined field surveys, airborne mapping, and high-resolution satellite imaging to assess road- and river-based gold mining in the Madre de Dios region of the Peruvian Amazon from 1999 to 2012. In this period, the geographic extent of gold mining increased 400%. The average annual rate of forest loss as a result of gold mining tripled in 2008 following the global economic recession, closely associated with increased gold prices. Small clandestine operations now comprise more than half of all gold mining activities throughout the region. These rates of gold mining are far higher than previous estimates that were based on traditional satellite mapping techniques. Our results prove that gold mining is growing more rapidly than previously thought, and that high-resolution monitoring approaches are required to accurately quantify human impacts on tropical forests. PMID:24167281

  14. A systems-genetics approach and data mining tool to assist in the discovery of genes underlying complex traits in Oryza sativa.

    PubMed

    Ficklin, Stephen P; Feltus, Frank Alex

    2013-01-01

    Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with

  15. StemTextSearch: Stem cell gene database with evidence from abstracts.

    PubMed

    Chen, Chou-Cheng; Ho, Chung-Liang

    2017-05-01

    Previous studies have used many methods to find biomarkers in stem cells, including text mining, experimental data and image storage. However, no text-mining methods have yet been developed which can identify whether a gene plays a positive or negative role in stem cells. StemTextSearch identifies the role of a gene in stem cells by using a text-mining method to find combinations of gene regulation, stem-cell regulation and cell processes in the same sentences of biomedical abstracts. The dataset includes 5797 genes, with 1534 genes having positive roles in stem cells, 1335 genes having negative roles, 1654 genes with both positive and negative roles, and 1274 with an uncertain role. The precision of gene role in StemTextSearch is 0.66, and the recall is 0.78. StemTextSearch is a web-based engine with queries that specify (i) gene, (ii) category of stem cell, (iii) gene role, (iv) gene regulation, (v) cell process, (vi) stem-cell regulation, and (vii) species. StemTextSearch is available through http://bio.yungyun.com.tw/StemTextSearch.aspx. Copyright © 2017. Published by Elsevier Inc.

  16. A methodology for multivariate phenotype-based genome-wide association studies to mine pleiotropic genes.

    PubMed

    Park, Sung Hee; Lee, Ji Young; Kim, Sangsoo

    2011-01-01

    Current Genome-Wide Association Studies (GWAS) are performed in a single trait framework without considering genetic correlations between important disease traits. Hence, the GWAS have limitations in discovering genetic risk factors affecting pleiotropic effects. This work reports a novel data mining approach to discover patterns of multiple phenotypic associations over 52 anthropometric and biochemical traits in KARE and a new analytical scheme for GWAS of multivariate phenotypes defined by the discovered patterns. This methodology applied to the GWAS for multivariate phenotype highLDLhighTG derived from the predicted patterns of the phenotypic associations. The patterns of the phenotypic associations were informative to draw relations between plasma lipid levels with bone mineral density and a cluster of common traits (Obesity, hypertension, insulin resistance) related to Metabolic Syndrome (MS). A total of 15 SNPs in six genes (PAK7, C20orf103, NRIP1, BCL2, TRPM3, and NAV1) were identified for significant associations with highLDLhighTG. Noteworthy findings were that the significant associations included a mis-sense mutation (PAK7:R335P), a frame shift mutation (C20orf103) and SNPs in splicing sites (TRPM3). The six genes corresponded to rat and mouse quantitative trait loci (QTLs) that had shown associations with the common traits such as the well characterized MS and even tumor susceptibility. Our findings suggest that the six genes may play important roles in the pleiotropic effects on lipid metabolism and the MS, which increase the risk of Type 2 Diabetes and cardiovascular disease. The use of the multivariate phenotypes can be advantageous in identifying genetic risk factors, accounting for the pleiotropic effects when the multivariate phenotypes have a common etiological pathway.

  17. Large Mine Permitting - Div. of Mining, Land, and Water

    Science.gov Websites

    Pebble Project Pogo Mine Red Dog Mine Rock Creek Project True North Mine OPMP Canadian Large Projects Pebble Project Pogo Mine Red Dog Mine Rock Creek Project True North Mine Contact: Kyle Moselle Large Mine

  18. Gene mutations in Mycobacterium tuberculosis: multidrug-resistant TB as an emerging global public health crisis.

    PubMed

    Mishra, Rahul; Shukla, Priyanka; Huang, Wei; Hu, Ning

    2015-01-01

    Against a constant background of established infections, epidemics of new and old infectious diseases periodically emerge, greatly magnifying the global burden of infections. TB poses formidable challenges to the global health at the public health and scientific level by acquiring gene mutation into anti TB drugs specially rifampin and isoniazid which leads resistant to drug regime and treatment forms. Our tools to combat MDR (multidrug resistant) TB are dangerously out of date and ineffective. Besides new tools (TB drugs, vaccines, diagnostics), we also need new strategies to identify key Mycobacterium tuberculosis and human host interaction. It is all equally important that we build up high quality clinical trial capacity and bio banks for TB biomarkers identification. But most important is global commitment at all levels to roll back TB before it expose us again. Rapid development of drug resistance caused by M. tuberculosis has lead to measure resistance accurately and easily. This knowledge will certainly help us to understand how to prevent the occurrence of drug resistance as well as identifying genes associated with new drug resistance. Copyright © 2014 Elsevier Ltd. All rights reserved.

  19. Global gene expression and systems biology analysis of bovine monocyte-derived macrophages in response to in vitro challenge with Mycobacterium bovis.

    PubMed

    Magee, David A; Taraktsoglou, Maria; Killick, Kate E; Nalpas, Nicolas C; Browne, John A; Park, Stephen D E; Conlon, Kevin M; Lynn, David J; Hokamp, Karsten; Gordon, Stephen V; Gormley, Eamonn; MacHugh, David E

    2012-01-01

    Mycobacterium bovis, the causative agent of bovine tuberculosis, is a major cause of mortality in global cattle populations. Macrophages are among the first cell types to encounter M. bovis following exposure and the response elicited by these cells is pivotal in determining the outcome of infection. Here, a functional genomics approach was undertaken to investigate global gene expression profiles in bovine monocyte-derived macrophages (MDM) purified from seven age-matched non-related females, in response to in vitro challenge with M. bovis (multiplicity of infection 2:1). Total cellular RNA was extracted from non-challenged control and M. bovis-challenged MDM for all animals at intervals of 2 hours, 6 hours and 24 hours post-challenge and prepared for global gene expression analysis using the Affymetrix® GeneChip® Bovine Genome Array. Comparison of M. bovis-challenged MDM gene expression profiles with those from the non-challenged MDM controls at each time point identified 3,064 differentially expressed genes 2 hours post-challenge, with 4,451 and 5,267 differentially expressed genes detected at the 6 hour and 24 hour time points, respectively (adjusted P-value threshold ≤ 0.05). Notably, the number of downregulated genes exceeded the number of upregulated genes in the M. bovis-challenged MDM across all time points; however, the fold-change in expression for the upregulated genes was markedly higher than that for the downregulated genes. Systems analysis revealed enrichment for genes involved in: (1) the inflammatory response; (2) cell signalling pathways, including Toll-like receptors and intracellular pathogen recognition receptors; and (3) apoptosis. The increased number of downregulated genes is consistent with previous studies showing that M. bovis infection is associated with the repression of host gene expression. The results also support roles for MyD88-independent signalling and intracellular PRRs in mediating the host response to M. bovis.

  20. Global Landscape of a Co-Expressed Gene Network in Barley and its Application to Gene Discovery in Triticeae Crops

    PubMed Central

    Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

    2011-01-01

    Accumulated transcriptome data can be used to investigate regulatory networks of genes involved in various biological systems. Co-expression analysis data sets generated from comprehensively collected transcriptome data sets now represent efficient resources that are capable of facilitating the discovery of genes with closely correlated expression patterns. In order to construct a co-expression network for barley, we analyzed 45 publicly available experimental series, which are composed of 1,347 sets of GeneChip data for barley. On the basis of a gene-to-gene weighted correlation coefficient, we constructed a global barley co-expression network and classified it into clusters of subnetwork modules. The resulting clusters are candidates for functional regulatory modules in the barley transcriptome. To annotate each of the modules, we performed comparative annotation using genes in Arabidopsis and Brachypodium distachyon. On the basis of a comparative analysis between barley and two model species, we investigated functional properties from the representative distributions of the gene ontology (GO) terms. Modules putatively involved in drought stress response and cellulose biogenesis have been identified. These modules are discussed to demonstrate the effectiveness of the co-expression analysis. Furthermore, we applied the data set of co-expressed genes coupled with comparative analysis in attempts to discover potentially Triticeae-specific network modules. These results demonstrate that analysis of the co-expression network of the barley transcriptome together with comparative analysis should promote the process of gene discovery in barley. Furthermore, the insights obtained should be transferable to investigations of Triticeae plants. The associated data set generated in this analysis is publicly accessible at http://coexpression.psc.riken.jp/barley/. PMID:21441235

  1. Adult mouse brain gene expression patterns bear an embryologic imprint

    PubMed Central

    Zapala, Matthew A.; Hovatta, Iiris; Ellison, Julie A.; Wodicka, Lisa; Del Rio, Jo A.; Tennant, Richard; Tynan, Wendy; Broide, Ron S.; Helton, Rob; Stoveken, Barbara S.; Winrow, Christopher; Lockhart, Daniel J.; Reilly, John F.; Young, Warren G.; Bloom, Floyd E.; Lockhart, David J.; Barlow, Carrolee

    2005-01-01

    The current model to explain the organization of the mammalian nervous system is based on studies of anatomy, embryology, and evolution. To further investigate the molecular organization of the adult mammalian brain, we have built a gene expression-based brain map. We measured gene expression patterns for 24 neural tissues covering the mouse central nervous system and found, surprisingly, that the adult brain bears a transcriptional “imprint” consistent with both embryological origins and classic evolutionary relationships. Embryonic cellular position along the anterior–posterior axis of the neural tube was shown to be closely associated with, and possibly a determinant of, the gene expression patterns in adult structures. We also observed a significant number of embryonic patterning and homeobox genes with region-specific expression in the adult nervous system. The relationships between global expression patterns for different anatomical regions and the nature of the observed region-specific genes suggest that the adult brain retains a degree of overall gene expression established during embryogenesis that is important for regional specificity and the functional relationships between regions in the adult. The complete collection of extensively annotated gene expression data along with data mining and visualization tools have been made available on a publicly accessible web site (www.barlow-lockhart-brainmapnimhgrant.org). PMID:16002470

  2. Ontology-based meta-analysis of global collections of high-throughput public data.

    PubMed

    Kupershmidt, Ilya; Su, Qiaojuan Jane; Grewal, Anoop; Sundaresh, Suman; Halperin, Inbal; Flynn, James; Shekar, Mamatha; Wang, Helen; Park, Jenny; Cui, Wenwu; Wall, Gregory D; Wisotzkey, Robert; Alag, Satnam; Akhtari, Saeid; Ronaghi, Mostafa

    2010-09-29

    The investigation of the interconnections between the molecular and genetic events that govern biological systems is essential if we are to understand the development of disease and design effective novel treatments. Microarray and next-generation sequencing technologies have the potential to provide this information. However, taking full advantage of these approaches requires that biological connections be made across large quantities of highly heterogeneous genomic datasets. Leveraging the increasingly huge quantities of genomic data in the public domain is fast becoming one of the key challenges in the research community today. We have developed a novel data mining framework that enables researchers to use this growing collection of public high-throughput data to investigate any set of genes or proteins. The connectivity between molecular states across thousands of heterogeneous datasets from microarrays and other genomic platforms is determined through a combination of rank-based enrichment statistics, meta-analyses, and biomedical ontologies. We address data quality concerns through dataset replication and meta-analysis and ensure that the majority of the findings are derived using multiple lines of evidence. As an example of our strategy and the utility of this framework, we apply our data mining approach to explore the biology of brown fat within the context of the thousands of publicly available gene expression datasets. Our work presents a practical strategy for organizing, mining, and correlating global collections of large-scale genomic data to explore normal and disease biology. Using a hypothesis-free approach, we demonstrate how a data-driven analysis across very large collections of genomic data can reveal novel discoveries and evidence to support existing hypothesis.

  3. Achieving Carbon Neutrality in the Global Aluminum Industry

    NASA Astrophysics Data System (ADS)

    Das, Subodh

    2012-02-01

    In the 21st century, sustainability is widely regarded as the new corporate culture, and leading manufacturing companies (Toyota, GE, and Alcoa) and service companies (Google and Federal Express) are striving towards carbon neutrality. The current carbon footprint of the global aluminum industry is estimated at 500 million metric tonnes carbon dioxide equivalent (CO2eq), representing about 1.7% of global emissions from all sources. For the global aluminum industry, carbon neutrality is defined as a state where the total "in-use" CO2eq saved from all products in current use, including incremental process efficiency improvements, recycling, and urban mining activities, equals the CO2eq expended to produce the global output of aluminum. This paper outlines an integrated and quantifiable plan for achieving "carbon neutrality" in the global aluminum industry by advocating five actionable steps: (1) increase use of "green" electrical energy grid by 8%, (2) reduce process energy needs by 16%, (3) deploy 35% of products in "in-use" energy saving applications, (4) divert 6.1 million metric tonnes/year from landfills, and (5) mine 4.5 million metric tonnes/year from aluminum-rich "urban mines." Since it takes 20 times more energy to make aluminum from bauxite ore than to recycle it from scrap, the global aluminum industry could set a reasonable, self-imposed energy/carbon neutrality goal to incrementally increase the supply of recycled aluminum by at least 1.05 metric tonnes for every tonne of incremental production via primary aluminum smelter capacity. Furthermore, the aluminum industry can and should take a global leadership position by actively developing internationally accepted and approved carbon footprint credit protocols.

  4. Global gene expression analysis of the heat shock response in the phytopathogen Xylella fastidiosa.

    PubMed

    Koide, Tie; Vêncio, Ricardo Z N; Gomes, Suely L

    2006-08-01

    Xylella fastidiosa is a phytopathogenic bacterium that is responsible for diseases in many economically important crops. Although different strains have been studied, little is known about X. fastidiosa stress responses. One of the better characterized stress responses in bacteria is the heat shock response, which induces the expression of specific genes to prevent protein misfolding and aggregation and to promote degradation of the irreversibly denatured polypeptides. To investigate X. fastidiosa genes involved in the heat shock response, we performed a whole-genome microarray analysis in a time course experiment. Globally, 261 genes were induced (9.7%) and 222 genes were repressed (8.3%). The expression profiles of the differentially expressed genes were grouped, and their expression patterns were validated by quantitative reverse transcription-PCR experiments. We determined the transcription start sites of six heat shock-inducible genes and analyzed their promoter regions, which allowed us to propose a putative consensus for sigma(32) promoters in Xylella and to suggest additional genes as putative members of this regulon. Besides the induction of classical heat shock protein genes, we observed the up-regulation of virulence-associated genes such as vapD and of genes for hemagglutinins, hemolysin, and xylan-degrading enzymes, which may indicate the importance of heat stress to bacterial pathogenesis. In addition, we observed the repression of genes related to fimbriae, aerobic respiration, and protein biosynthesis and the induction of genes related to the extracytoplasmic stress response and some phage-related genes, revealing the complex network of genes that work together in response to heat shock.

  5. Mining of the Uncharacterized Cytochrome P450 Genes Involved in Alkaloid Biosynthesis in California Poppy Using a Draft Genome Sequence

    PubMed Central

    Hori, Kentaro; Yamada, Yasuyuki; Purwanto, Ratmoyo; Minakuchi, Yohei; Toyoda, Atsushi; Hirakawa, Hideki

    2018-01-01

    Abstract Land plants produce specialized low molecular weight metabolites to adapt to various environmental stressors, such as UV radiation, pathogen infection, wounding and animal feeding damage. Due to the large variety of stresses, plants produce various chemicals, particularly plant species-specific alkaloids, through specialized biosynthetic pathways. In this study, using a draft genome sequence and querying known biosynthetic cytochrome P450 (P450) enzyme-encoding genes, we characterized the P450 genes involved in benzylisoquinoline alkaloid (BIA) biosynthesis in California poppy (Eschscholzia californica), as P450s are key enzymes involved in the diversification of specialized metabolism. Our in silico studies showed that all identified enzyme-encoding genes involved in BIA biosynthesis were found in the draft genome sequence of approximately 489 Mb, which covered approximately 97% of the whole genome (502 Mb). Further analyses showed that some P450 families involved in BIA biosynthesis, i.e. the CYP80, CYP82 and CYP719 families, were more enriched in the genome of E. californica than in the genome of Arabidopsis thaliana, a plant that does not produce BIAs. CYP82 family genes were highly abundant, so we measured the expression of CYP82 genes with respect to alkaloid accumulation in different plant tissues and two cell lines whose BIA production differs to estimate the functions of the genes. Further characterization revealed two highly homologous P450s (CYP82P2 and CYP82P3) that exhibited 10-hydroxylase activities with different substrate specificities. Here, we discuss the evolution of the P450 genes and the potential for further genome mining of the genes encoding the enzymes involved in BIA biosynthesis. PMID:29301019

  6. Exploring patterns of epigenetic information with data mining techniques.

    PubMed

    Aguiar-Pulido, Vanessa; Seoane, José A; Gestal, Marcos; Dorado, Julián

    2013-01-01

    Data mining, a part of the Knowledge Discovery in Databases process (KDD), is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Analyses of epigenetic data have evolved towards genome-wide and high-throughput approaches, thus generating great amounts of data for which data mining is essential. Part of these data may contain patterns of epigenetic information which are mitotically and/or meiotically heritable determining gene expression and cellular differentiation, as well as cellular fate. Epigenetic lesions and genetic mutations are acquired by individuals during their life and accumulate with ageing. Both defects, either together or individually, can result in losing control over cell growth and, thus, causing cancer development. Data mining techniques could be then used to extract the previous patterns. This work reviews some of the most important applications of data mining to epigenetics.

  7. Mining disease state converters for medical intervention of diseases.

    PubMed

    Dong, Guozhu; Duan, Lei; Tang, Changjie

    2010-02-01

    In applications such as gene therapy and drug design, a key goal is to convert the disease state of diseased objects from an undesirable state into a desirable one. Such conversions may be achieved by changing the values of some attributes of the objects. For example, in gene therapy one may convert cancerous cells to normal ones by changing some genes' expression level from low to high or from high to low. In this paper, we define the disease state conversion problem as the discovery of disease state converters; a disease state converter is a small set of attribute value changes that may change an object's disease state from undesirable into desirable. We consider two variants of this problem: personalized disease state converter mining mines disease state converters for a given individual patient with a given disease, and universal disease state converter mining mines disease state converters for all samples with a given disease. We propose a DSCMiner algorithm to discover small and highly effective disease state converters. Since real-life medical experiments on living diseased instances are expensive and time consuming, we use classifiers trained from the datasets of given diseases to evaluate the quality of discovered converter sets. The effectiveness of a disease state converter is measured by the percentage of objects that are successfully converted from undesirable state into desirable state as deemed by state-of-the-art classifiers. We use experiments to evaluate the effectiveness of our algorithm and to show its effectiveness. We also discuss possible research directions for extensions and improvements. We note that the disease state conversion problem also has applications in customer retention, criminal rehabilitation, and company turn-around, where the goal is to convert class membership of objects whose class is an undesirable class.

  8. The impacts of neutralized acid mine drainage contaminated water on the expression of selected endocrine-linked genes in juvenile Mozambique tilapia Oreochromis mossambicus exposed in vivo.

    PubMed

    Truter, Johannes Christoff; va Wyk, Johannes Hendrik; Oberholster, Paul Johan; Botha, Anna-Maria

    2014-02-01

    Acid mine drainage (AMD) is a global environmental concern due to detrimental impacts on river ecosystems. Little is however known regarding the biological impacts of neutralized AMD on aquatic vertebrates despite excessive discharge into watercourses. The aim of this investigation was to evaluate the endocrine modulatory potential of neutralized AMD, using molecular biomarkers in the teleost fish Oreochromis mossambicus in exposure studies. Surface water was collected from six locations downstream of a high density sludge (HDS) AMD treatment plant and a reference site unimpacted by AMD. The concentrations of 28 elements, including 22 metals, were quantified in the exposure water in order to identify potential links to altered gene expression. Relatively high concentrations of manganese (~ 10mg/l), nickel (~ 0.1mg/l) and cobalt (~ 0.03 mg/l) were detected downstream of the HDS plant. The expression of thyroid receptor-α (trα), trβ, androgen receptor-1 (ar1), ar2, glucocorticoid receptor-1 (gr1), gr2, mineralocorticoid receptor (mr) and aromatase (cyp19a1b) was quantified in juvenile fish after 48 h exposure. Slight but significant changes were observed in the expression of gr1 and mr in fish exposed to water collected directly downstream of the HDS plant, consisting of approximately 95 percent neutralized AMD. The most pronounced alterations in gene expression (i.e. trα, trβ, gr1, gr2, ar1 and mr) was associated with water collected further downstream at a location with no other apparent contamination vectors apart from the neutralized AMD. The altered gene expression associated with the "downstream" locality coincided with higher concentrations of certain metals relative to the locality adjacent to the HDS plant which may indicate a causative link. The current study provides evidence of endocrine disruptive activity associated with neutralized AMD contamination in regard to alterations in the expression of key genes linked to the thyroid, interrenal and

  9. Global gene profiling of aging lungs in Atp8b1 mutant mice.

    PubMed

    Soundararajan, Ramani; Stearns, Timothy M; Czachor, Alexander; Fukumoto, Jutaro; Turn, Christina; Westermann-Clark, Emma; Breitzig, Mason; Tan, Lee; Lockey, Richard F; King, Benjamin L; Kolliputi, Narasaiah

    2016-09-29

    Recent studies implicate cardiolipin oxidation in several age-related diseases. Atp8b1 encoding Type 4 P-type ATPases is a cardiolipin transporter. Mutation in Atp8b1 gene or inflammation of the lungs impairs the capacity of Atp8b1 to clear cardiolipin from lung fluid. However, the link between Atp8b1 mutation and age-related gene alteration is unknown. Therefore, we investigated how Atp8b1 mutation alters age-related genes. We performed Affymetrix gene profiling of lungs isolated from young (7-9 wks, n=6) and aged (14 months, 14 M, n=6) C57BL/6 and Atp8b1 mutant mice. In addition, Ingenuity Pathway Analysis (IPA) was performed. Differentially expressed genes were validated by quantitative real-time PCR (qRT-PCR). Global transcriptome analysis revealed 532 differentially expressed genes in Atp8b1 lungs, 157 differentially expressed genes in C57BL/6 lungs, and 37 overlapping genes. IPA of age-related genes in Atp8b1 lungs showed enrichment of Xenobiotic metabolism and Nrf2-mediated signaling pathways. The increase in Adamts2 and Mmp13 transcripts in aged Atp8b1 lungs was validated by qRT-PCR. Similarly, the decrease in Col1a1 and increase in Cxcr6 transcripts was confirmed in both Atp8b1 mutant and C57BL/6 lungs. Based on transcriptome profiling, our study indicates that Atp8b1 mutant mice may be susceptible to age-related lung diseases.

  10. 500 years of mercury production: global annual inventory by region until 2000 and associated emissions.

    PubMed

    Hylander, Lars D; Meili, Markus

    2003-03-20

    Since pre-industrial times, anthropogenic emissions of Hg have at least doubled global atmospheric Hg deposition rates. In order to minimize environmental and human health effects, efforts have been made to reduce Hg emissions from industries and power plants, while less attention has been paid to Hg mining. This paper is a compilation of available data on primary Hg production and associated emissions with regional and annual resolution since colonial times. Globally, approximately one million tons of metallic Hg has been extracted from cinnabar and other ores during the past five centuries, half already before 1925. Roughly half has been used for mining of gold and silver, but the annual Hg production peaked during a short period of recent industrial uses. Comparison with total historic Hg deposition from global anthropogenic emissions (0.1-0.2 Mtons) suggests that only a few percent of all mined Hg have escaped to the atmosphere thus far. While production of primary Hg has changed dramatically over time and among mines, the global production has always been dominant in the region of the mercuriferous belt between the western Mediterranean and central Asia, but appears to be shifting to the east. Roughly half of the registered Hg has been extracted in Europe, where Spanish mines alone have contributed one third of the world's mined Hg. Approximately one fourth has been mined in the Americas, and most of the remaining registered Hg in Asia. However, the Asian figures may be largely underestimated. Presently, the dominant Hg mines are in Almadén in Spain (236 t of Hg produced in 2000), Khaydarkan in Kyrgyzstan (550 t), Algeria (estimated 240 t) and China (ca. 200 t). Mercury by-production from mining of other metals (e.g. copper, zinc, gold, silver) in 2000 includes 48 t from Peru, 45 t from Finland and at least 15 t from the USA. Since 1970, the recorded production of primary Hg has been reduced by almost an order of magnitude to approximately 2000 t in the year

  11. Global Gene-Expression Analysis to Identify Differentially Expressed Genes Critical for the Heat Stress Response in Brassica rapa

    PubMed Central

    Dong, Xiangshu; Yi, Hankuil; Lee, Jeongyeo; Nou, Ill-Sup; Han, Ching-Tack; Hur, Yoonkang

    2015-01-01

    Genome-wide dissection of the heat stress response (HSR) is necessary to overcome problems in crop production caused by global warming. To identify HSR genes, we profiled gene expression in two Chinese cabbage inbred lines with different thermotolerances, Chiifu and Kenshin. Many genes exhibited >2-fold changes in expression upon exposure to 0.5– 4 h at 45°C (high temperature, HT): 5.2% (2,142 genes) in Chiifu and 3.7% (1,535 genes) in Kenshin. The most enriched GO (Gene Ontology) items included ‘response to heat’, ‘response to reactive oxygen species (ROS)’, ‘response to temperature stimulus’, ‘response to abiotic stimulus’, and ‘MAPKKK cascade’. In both lines, the genes most highly induced by HT encoded small heat shock proteins (Hsps) and heat shock factor (Hsf)-like proteins such as HsfB2A (Bra029292), whereas high-molecular weight Hsps were constitutively expressed. Other upstream HSR components were also up-regulated: ROS-scavenging genes like glutathione peroxidase 2 (BrGPX2, Bra022853), protein kinases, and phosphatases. Among heat stress (HS) marker genes in Arabidopsis, only exportin 1A (XPO1A) (Bra008580, Bra006382) can be applied to B. rapa for basal thermotolerance (BT) and short-term acquired thermotolerance (SAT) gene. CYP707A3 (Bra025083, Bra021965), which is involved in the dehydration response in Arabidopsis, was associated with membrane leakage in both lines following HS. Although many transcription factors (TF) genes, including DREB2A (Bra005852), were involved in HS tolerance in both lines, Bra024224 (MYB41) and Bra021735 (a bZIP/AIR1 [Anthocyanin-Impaired-Response-1]) were specific to Kenshin. Several candidate TFs involved in thermotolerance were confirmed as HSR genes by real-time PCR, and these assignments were further supported by promoter analysis. Although some of our findings are similar to those obtained using other plant species, clear differences in Brassica rapa reveal a distinct HSR in this species. Our data

  12. Divergence and gene flow in the globally distributed blue-winged ducks

    USGS Publications Warehouse

    Nelson, Joel; Wilson, Robert E.; McCracken, Kevin G.; Cumming, Graeme; Joseph, Leo; Guay, Patrick-Jean; Peters, Jeffrey

    2017-01-01

    The ability to disperse over long distances can result in a high propensity for colonizing new geographic regions, including uninhabited continents, and lead to lineage diversification via allopatric speciation. However, high vagility can also result in gene flow between otherwise allopatric populations, and in some cases, parapatric or divergence-with-gene-flow models might be more applicable to widely distributed lineages. Here, we use five nuclear introns and the mitochondrial control region along with Bayesian models of isolation with migration to examine divergence, gene flow, and phylogenetic relationships within a cosmopolitan lineage comprising six species, the blue-winged ducks (genus Anas), which inhabit all continents except Antarctica. We found two primary sub-lineages, the globally-distributed shoveler group and the New World blue-winged/cinnamon teal group. The blue-winged/cinnamon sub-lineage is composed of sister taxa from North America and South America, and taxa with parapatric distributions are characterized by low to moderate levels of gene flow. In contrast, our data support strict allopatry for most comparisons within the shovelers. However, we found evidence of gene flow from the migratory, Holarctic northern shoveler (A. clypeata) and the more sedentary, African Cape shoveler (A. smithii) into the Australasian shoveler (A. rhynchotis), although we could not reject strict allopatry. Given the diverse mechanisms of speciation within this complex, the shovelers and blue-winged/cinnamon teals can serve as an effective model system for examining how the genome diverges under different evolutionary processes and how genetic variation is partitioned among highly dispersive taxa.

  13. Developing integrated crop knowledge networks to advance candidate gene discovery.

    PubMed

    Hassani-Pak, Keywan; Castellote, Martin; Esch, Maria; Hindle, Matthew; Lysenko, Artem; Taubert, Jan; Rawlings, Christopher

    2016-12-01

    The chances of raising crop productivity to enhance global food security would be greatly improved if we had a complete understanding of all the biological mechanisms that underpinned traits such as crop yield, disease resistance or nutrient and water use efficiency. With more crop genomes emerging all the time, we are nearer having the basic information, at the gene-level, to begin assembling crop gene catalogues and using data from other plant species to understand how the genes function and how their interactions govern crop development and physiology. Unfortunately, the task of creating such a complete knowledge base of gene functions, interaction networks and trait biology is technically challenging because the relevant data are dispersed in myriad databases in a variety of data formats with variable quality and coverage. In this paper we present a general approach for building genome-scale knowledge networks that provide a unified representation of heterogeneous but interconnected datasets to enable effective knowledge mining and gene discovery. We describe the datasets and outline the methods, workflows and tools that we have developed for creating and visualising these networks for the major crop species, wheat and barley. We present the global characteristics of such knowledge networks and with an example linking a seed size phenotype to a barley WRKY transcription factor orthologous to TTG2 from Arabidopsis, we illustrate the value of integrated data in biological knowledge discovery. The software we have developed (www.ondex.org) and the knowledge resources (http://knetminer.rothamsted.ac.uk) we have created are all open-source and provide a first step towards systematic and evidence-based gene discovery in order to facilitate crop improvement.

  14. From IHE Audit Trails to XES Event Logs Facilitating Process Mining.

    PubMed

    Paster, Ferdinand; Helm, Emmanuel

    2015-01-01

    Recently Business Intelligence approaches like process mining are applied to the healthcare domain. The goal of process mining is to gain process knowledge, compliance and room for improvement by investigating recorded event data. Previous approaches focused on process discovery by event data from various specific systems. IHE, as a globally recognized basis for healthcare information systems, defines in its ATNA profile how real-world events must be recorded in centralized event logs. The following approach presents how audit trails collected by the means of ATNA can be transformed to enable process mining. Using the standardized audit trails provides the ability to apply these methods to all IHE based information systems.

  15. Global transcriptional responses of Acidithiobacillus ferrooxidans Wenelen under different sulfide minerals.

    PubMed

    Latorre, Mauricio; Ehrenfeld, Nicole; Cortés, María Paz; Travisany, Dante; Budinich, Marko; Aravena, Andrés; González, Mauricio; Bobadilla-Fazzini, Roberto A; Parada, Pilar; Maass, Alejandro

    2016-01-01

    In order to provide new information about the adaptation of Acidithiobacillus ferrooxidans during the bioleaching process, the current analysis presents the first report of the global transcriptional response of the native copper mine strain Wenelen (DSM 16786) oxidized under different sulfide minerals. Microarrays were used to measure the response of At. ferrooxidans Wenelen to shifts from iron supplemented liquid cultures (reference state) to the addition of solid substrates enriched in pyrite or chalcopyrite. Genes encoding for energy metabolism showed a similar transcriptional profile for the two sulfide minerals. Interestingly, four operons related to sulfur metabolism were over-expressed during growth on a reduced sulfur source. Genes associated with metal tolerance (RND and ATPases type P) were up-regulated in the presence of pyrite or chalcopyrite. These results suggest that At. ferrooxidans Wenelen presents an efficient transcriptional system developed to respond to environmental conditions, namely the ability to withstand high copper concentrations. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. On the reported ionospheric precursor of the Hector Mine, California earthquake

    USGS Publications Warehouse

    Thomas, J.N.; Love, J.J.; Komjathy, A.; Verkhoglyadova, O.P.; Butala, M.; Rivera, N.

    2012-01-01

    Using Global Positioning System (GPS) data from sites near the 16 Oct. 1999 Hector Mine, California earthquake, Pulinets et al. (2007) identified anomalous changes in the ionospheric total electron content (TEC) starting one week prior to the earthquake. Pulinets (2007) suggested that precursory phenomena of this type could be useful for predicting earthquakes. On the other hand, and in a separate analysis, Afraimovich et al. (2004) concluded that TEC variations near the epicenter were controlled by solar and geomagnetic activity that were unrelated to the earthquake. In an investigation of these very different results, we examine TEC time series of long duration from GPS stations near and far from the epicenter of the Hector Mine earthquake, and long before and long after the earthquake. While we can reproduce the essential time series results of Pulinets et al., we find that the signal they identified as being anomalous is not actually anomalous. Instead, it is just part of normal global-scale TEC variation. We conclude that the TEC anomaly reported by Pulinets et al. is unrelated to the Hector Mine earthquake.

  17. A planetary nervous system for social mining and collective awareness

    NASA Astrophysics Data System (ADS)

    Giannotti, F.; Pedreschi, D.; Pentland, A.; Lukowicz, P.; Kossmann, D.; Crowley, J.; Helbing, D.

    2012-11-01

    We present a research roadmap of a Planetary Nervous System (PNS), capable of sensing and mining the digital breadcrumbs of human activities and unveiling the knowledge hidden in the big data for addressing the big questions about social complexity. We envision the PNS as a globally distributed, self-organizing, techno-social system for answering analytical questions about the status of world-wide society, based on three pillars: social sensing, social mining and the idea of trust networks and privacy-aware social mining. We discuss the ingredients of a science and a technology necessary to build the PNS upon the three mentioned pillars, beyond the limitations of their respective state-of-art. Social sensing is aimed at developing better methods for harvesting the big data from the techno-social ecosystem and make them available for mining, learning and analysis at a properly high abstraction level. Social mining is the problem of discovering patterns and models of human behaviour from the sensed data across the various social dimensions by data mining, machine learning and social network analysis. Trusted networks and privacy-aware social mining is aimed at creating a new deal around the questions of privacy and data ownership empowering individual persons with full awareness and control on own personal data, so that users may allow access and use of their data for their own good and the common good. The PNS will provide a goal-oriented knowledge discovery framework, made of technology and people, able to configure itself to the aim of answering questions about the pulse of global society. Given an analytical request, the PNS activates a process composed by a variety of interconnected tasks exploiting the social sensing and mining methods within the transparent ecosystem provided by the trusted network. The PNS we foresee is the key tool for individual and collective awareness for the knowledge society. We need such a tool for everyone to become fully aware of how

  18. Poker Flats Mine - Div. of Mining, Land, and Water

    Science.gov Websites

    Lands Coal Regulatory Program Large Mine Permits Mineral Property and Rights Mining Index Land Fishery Water Resources Factsheets Forms banner image of landscape Poker Flats Mine Home Mining Coal Regulatory Program Poker Flats Mine Mining Coal Regulatory Program Info Chickaloon Chuit Watershed Chuitna

  19. Mining of Ruminant Microbial Phytase (RPHY1) from Metagenomic Data of Mehsani Buffalo Breed: Identification, Gene Cloning, and Characterization.

    PubMed

    Mootapally, Chandra Shekar; Nathani, Neelam M; Patel, Amrutlal K; Jakhesara, Subhash J; Joshi, Chaitanya G

    2016-01-01

    Phytases have been widely used as animal feed supplements to increase the availability of digestible phosphorus, especially in monogastric animals fed cereal grains. The present study describes the identification of a full-length phytase gene of Prevotella species present in Mehsani buffalo rumen. The gene, designated as RPHY1, consists of 1,251 bp and is expressed into protein with 417 amino acids. A homology search of the deduced amino acid sequence of the RPHY1 phytase gene in a nonredundant protein database showed that it shares 92% similarity with the histidine acid phosphatase domain. Subsequently, the RPHY1 gene was expressed using a pET32a expression vector in Escherichia coli BL21 and purified using a His60 Ni-NTA gravity column. The mass of the purified RPHY1 was estimated to be approximately 63 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). The optimal RPHY1 enzyme activity was observed at 55°C (pH 5) and exhibited good stability at 5°C and within the acidic pH range. Significant inhibition of RPHY1 activity was observed for Mg2+ and K+ metal ions, while Ca2+, Mn2+, and Na+ slightly inhibited enzyme activity. The RPHY1 phytase was susceptible to SDS, and it was highly stimulated in the presence of EDTA. Overall, the observed comparatively high enzyme activity levels and characteristics of the RPHY1 gene mined from rumen prove its promising candidature as a feed supplement enzyme in animal farming. © 2016 S. Karger AG, Basel.

  20. Remediation strategies for historical mining and smelting sites.

    PubMed

    Dybowska, Agnieszka; Farago, Margaret; Valsami-Jones, Eugenia; Thornton, Iain

    2006-01-01

    The environmental, social and economic problems associated with abandoned mine sites are serious and global. Environmental damage arising from polluted waters and dispersal of contaminated waste is a feature characteristic of many old mines in North America, Australia, Europe and elsewhere. Today, because of the efficiency of mining operations and legal requirements in many countries for prevention of environmental damage from mining operations, the release of metals to the environment from modern mining is low. However, many mineralized areas that were extensively worked in the 18th and 19th centuries and left abandoned after mining had ceased, have left a legacy of metal contaminated land. Unlike organic chemicals and plastics, metals cannot be degraded chemically or biologically into non-toxic and environmentally neutral constituents. Thus sites contaminated with toxic metals present a particular challenge for remediation. Soil remediation has been the subject of a significant amount of research work in the past decade; this has resulted in a number of remediation options currently available or being developed. Remediation strategies for metal/metalloid contaminated historical mining sites are reviewed and summarized in this article. It focuses on the current applications of in situ remediation with the use of soil amendments (adsorption and precipitation based methods are discussed) and phytoremediation (in situ plant based technology for environmental clean up and restoration). These are promising alternative technologies to traditional options of excavation and ex situ treatment, offering an advantage of being non-invasive and low cost. In particular, they have been shown to be effective in remediation of mining and smelting contaminated sites, although the long-term durability of these treatments cannot be predicted.

  1. GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains

    PubMed Central

    Lu, Zhiyong

    2015-01-01

    The automatic recognition of gene names and their associated database identifiers from biomedical text has been widely studied in recent years, as these tasks play an important role in many downstream text-mining applications. Despite significant previous research, only a small number of tools are publicly available and these tools are typically restricted to detecting only mention level gene names or only document level gene identifiers. In this work, we report GNormPlus: an end-to-end and open source system that handles both gene mention and identifier detection. We created a new corpus of 694 PubMed articles to support our development of GNormPlus, containing manual annotations for not only gene names and their identifiers, but also closely related concepts useful for gene name disambiguation, such as gene families and protein domains. GNormPlus integrates several advanced text-mining techniques, including SimConcept for resolving composite gene names. As a result, GNormPlus compares favorably to other state-of-the-art methods when evaluated on two widely used public benchmarking datasets, achieving 86.7% F1-score on the BioCreative II Gene Normalization task dataset and 50.1% F1-score on the BioCreative III Gene Normalization task dataset. The GNormPlus source code and its annotated corpus are freely available, and the results of applying GNormPlus to the entire PubMed are freely accessible through our web-based tool PubTator. PMID:26380306

  2. Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining.

    PubMed

    Tang, Xiaoyu; Li, Jie; Millán-Aguiñaga, Natalie; Zhang, Jia Jia; O'Neill, Ellis C; Ugalde, Juan A; Jensen, Paul R; Mantovani, Simone M; Moore, Bradley S

    2015-12-18

    Recent genome sequencing efforts have led to the rapid accumulation of uncharacterized or "orphaned" secondary metabolic biosynthesis gene clusters (BGCs) in public databases. This increase in DNA-sequenced big data has given rise to significant challenges in the applied field of natural product genome mining, including (i) how to prioritize the characterization of orphan BGCs and (ii) how to rapidly connect genes to biosynthesized small molecules. Here, we show that by correlating putative antibiotic resistance genes that encode target-modified proteins with orphan BGCs, we predict the biological function of pathway specific small molecules before they have been revealed in a process we call target-directed genome mining. By querying the pan-genome of 86 Salinispora bacterial genomes for duplicated house-keeping genes colocalized with natural product BGCs, we prioritized an orphan polyketide synthase-nonribosomal peptide synthetase hybrid BGC (tlm) with a putative fatty acid synthase resistance gene. We employed a new synthetic double-stranded DNA-mediated cloning strategy based on transformation-associated recombination to efficiently capture tlm and the related ttm BGCs directly from genomic DNA and to heterologously express them in Streptomyces hosts. We show the production of a group of unusual thiotetronic acid natural products, including the well-known fatty acid synthase inhibitor thiolactomycin that was first described over 30 years ago, yet never at the genetic level in regards to biosynthesis and autoresistance. This finding not only validates the target-directed genome mining strategy for the discovery of antibiotic producing gene clusters without a priori knowledge of the molecule synthesized but also paves the way for the investigation of novel enzymology involved in thiotetronic acid natural product biosynthesis.

  3. Text mining-based in silico drug discovery in oral mucositis caused by high-dose cancer therapy.

    PubMed

    Kirk, Jon; Shah, Nirav; Noll, Braxton; Stevens, Craig B; Lawler, Marshall; Mougeot, Farah B; Mougeot, Jean-Luc C

    2018-08-01

    Oral mucositis (OM) is a major dose-limiting side effect of chemotherapy and radiation used in cancer treatment. Due to the complex nature of OM, currently available drug-based treatments are of limited efficacy. Our objectives were (i) to determine genes and molecular pathways associated with OM and wound healing using computational tools and publicly available data and (ii) to identify drugs formulated for topical use targeting the relevant OM molecular pathways. OM and wound healing-associated genes were determined by text mining, and the intersection of the two gene sets was selected for gene ontology analysis using the GeneCodis program. Protein interaction network analysis was performed using STRING-db. Enriched gene sets belonging to the identified pathways were queried against the Drug-Gene Interaction database to find drug candidates for topical use in OM. Our analysis identified 447 genes common to both the "OM" and "wound healing" text mining concepts. Gene enrichment analysis yielded 20 genes representing six pathways and targetable by a total of 32 drugs which could possibly be formulated for topical application. A manual search on ClinicalTrials.gov confirmed no relevant pathway/drug candidate had been overlooked. Twenty-five of the 32 drugs can directly affect the PTGS2 (COX-2) pathway, the pathway that has been targeted in previous clinical trials with limited success. Drug discovery using in silico text mining and pathway analysis tools can facilitate the identification of existing drugs that have the potential of topical administration to improve OM treatment.

  4. Accelerated losses of protected forests from gold mining in the Peruvian Amazon

    NASA Astrophysics Data System (ADS)

    Asner, Gregory P.; Tupayachi, Raul

    2016-09-01

    Gold mining in Amazonia involves forest removal, soil excavation, and the use of liquid mercury, which together pose a major threat to biodiversity, water quality, forest carbon stocks, and human health. Within the global biodiversity hotspot of Madre de Dios, Peru, gold mining has continued despite numerous 2012 government decrees and enforcement actions against it. Mining is now also thought to have entered federally protected areas, but the rates of miner encroachment are unknown. Here, we utilize high-resolution remote sensing to assess annual changes in gold mining extent from 1999 to 2016 throughout the Madre de Dios region, including the high-diversity Tambopata National Reserve and buffer zone. Regionally, gold mining-related losses of forest averaged 4437 ha yr-1. A temporary downward inflection in the annual growth rate of mining-related forest loss following 2012 government action was followed by a near doubling of the deforestation rate from mining in 2013-2014. The total estimated area of gold mining throughout the region increased about 40% between 2012 and 2016, including in the Tambopata National Reserve. Our results reveal an urgent need for more socio-environmental effort and law enforcement action to combat illegal gold mining in the Peruvian Amazon.

  5. Study on the transformed strategy of “life field” for aged in coal mine community——A case sstudy of ccommunity rrenewal ddesign of Sihe coal mine in Jincheng, Shanxi

    NASA Astrophysics Data System (ADS)

    Xue, Minghui; Wang, Chenghao; Zhang, Shanshan

    2017-06-01

    Coal mine community is driven by the coal mine industry, and it mainly relies on coal mining enterprises to provide benefits for residents. Under the background of increasing serious global aging problem, the problems in the field of elderly people’s health, life, entertainment, communication, retirement and re-employment and other aspects become more acute and urgently to be solved. So it is necessary to make a more detailed study on how to transform the coal mine community according to the special needs of the elderly miners. This article takes renewal design of SiHe coal mine in JinCheng of ShanXi province as an example and takes the community’s “life field” as a clue, trying to put forward the transformed strategy of “life field” for aged in coal mine community and to come up with a method to update the community throughout the whole atmosphere to the personal space.

  6. Isolation and characterisation of mineral-oxidising "Acidibacillus" spp. from mine sites and geothermal environments in different global locations.

    PubMed

    Holanda, Roseanne; Hedrich, Sabrina; Ňancucheo, Ivan; Oliveira, Guilherme; Grail, Barry M; Johnson, D Barrie

    2016-09-01

    Eight strains of acidophilic bacteria, isolated from mine-impacted and geothermal sites from different parts of the world, were shown to form a distinct clade (proposed genus "Acidibacillus") within the phylum Firmicutes, well separated from the acidophilic genera Sulfobacillus and Alicyclobacillus. Two of the strains (both isolated from sites in Yellowstone National Park, USA) were moderate thermophiles that oxidised both ferrous iron and elemental sulphur, while the other six were mesophiles that also oxidised ferrous iron, but not sulphur. All eight isolates reduced ferric iron to varying degrees. The two groups shared <95% similarity of their 16S rRNA genes and were therefore considered to be distinct species: "Acidibacillus sulfuroxidans" (moderately thermophilic isolates) and "Acidibacillus ferrooxidans" (mesophilic isolates). Both species were obligate heterotrophs; none of the eight strains grew in the absence of organic carbon. "Acidibacillus" spp. were generally highly tolerant of elevated concentrations of cationic transition metals, though "A. sulfuroxidans" strains were more sensitive to some (e.g. nickel and zinc) than those of "A. ferrooxidans". Initial annotation of the genomes of two strains of "A. ferrooxidans" revealed the presence of genes (cbbL) involved in the RuBisCO pathway for CO2 assimilation and iron oxidation (rus), though with relatively low sequence identities. Copyright © 2016. Published by Elsevier Masson SAS.

  7. SparkText: Biomedical Text Mining on Big Data Framework.

    PubMed

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  8. Ecological restoration alters microbial communities in mine tailings profiles

    NASA Astrophysics Data System (ADS)

    Li, Yang; Jia, Zhongjun; Sun, Qingye; Zhan, Jing; Yang, Yang; Wang, Dan

    2016-04-01

    Ecological restoration of mine tailings have impact on soil physiochemical properties and microbial communities. The surface soil has been a primary concern in the past decades, however it remains poorly understood about the adaptive response of microbial communities along the profile during ecological restoration of the tailings. In this study, microbial communities along a 60-cm profile were investigated in a mine tailing pond during ecological restoration of the bare waste tailings (BW) with two vegetated soils of Imperata cylindrica (IC) and Chrysopogon zizanioides (CZ) plants. Revegetation of both IC and CZ could retard soil degradation of mine tailing by stimulation of soil pH at 0-30 cm soils and altered the bacterial communities at 0-20 cm depths of the mine tailings. Significant differences existed in the relative abundance of the phyla Alphaproteobacteria, Deltaproteobacteria, Acidobacteria, Firmicutes and Nitrospira. Slight difference of bacterial communities were found at 30-60 cm depths of mine tailings. Abundance and activity analysis of nifH genes also explained the elevated soil nitrogen contents at the surface 0-20 cm of the vegetated soils. These results suggest that microbial succession occurred primarily at surface tailings and vegetation of pioneering plants might have promoted ecological restoration of mine tailings.

  9. Ecological restoration alters microbial communities in mine tailings profiles.

    PubMed

    Li, Yang; Jia, Zhongjun; Sun, Qingye; Zhan, Jing; Yang, Yang; Wang, Dan

    2016-04-29

    Ecological restoration of mine tailings have impact on soil physiochemical properties and microbial communities. The surface soil has been a primary concern in the past decades, however it remains poorly understood about the adaptive response of microbial communities along the profile during ecological restoration of the tailings. In this study, microbial communities along a 60-cm profile were investigated in a mine tailing pond during ecological restoration of the bare waste tailings (BW) with two vegetated soils of Imperata cylindrica (IC) and Chrysopogon zizanioides (CZ) plants. Revegetation of both IC and CZ could retard soil degradation of mine tailing by stimulation of soil pH at 0-30 cm soils and altered the bacterial communities at 0-20 cm depths of the mine tailings. Significant differences existed in the relative abundance of the phyla Alphaproteobacteria, Deltaproteobacteria, Acidobacteria, Firmicutes and Nitrospira. Slight difference of bacterial communities were found at 30-60 cm depths of mine tailings. Abundance and activity analysis of nifH genes also explained the elevated soil nitrogen contents at the surface 0-20 cm of the vegetated soils. These results suggest that microbial succession occurred primarily at surface tailings and vegetation of pioneering plants might have promoted ecological restoration of mine tailings.

  10. Ecological restoration alters microbial communities in mine tailings profiles

    PubMed Central

    Li, Yang; Jia, Zhongjun; Sun, Qingye; Zhan, Jing; Yang, Yang; Wang, Dan

    2016-01-01

    Ecological restoration of mine tailings have impact on soil physiochemical properties and microbial communities. The surface soil has been a primary concern in the past decades, however it remains poorly understood about the adaptive response of microbial communities along the profile during ecological restoration of the tailings. In this study, microbial communities along a 60-cm profile were investigated in a mine tailing pond during ecological restoration of the bare waste tailings (BW) with two vegetated soils of Imperata cylindrica (IC) and Chrysopogon zizanioides (CZ) plants. Revegetation of both IC and CZ could retard soil degradation of mine tailing by stimulation of soil pH at 0–30 cm soils and altered the bacterial communities at 0–20 cm depths of the mine tailings. Significant differences existed in the relative abundance of the phyla Alphaproteobacteria, Deltaproteobacteria, Acidobacteria, Firmicutes and Nitrospira. Slight difference of bacterial communities were found at 30–60 cm depths of mine tailings. Abundance and activity analysis of nifH genes also explained the elevated soil nitrogen contents at the surface 0–20 cm of the vegetated soils. These results suggest that microbial succession occurred primarily at surface tailings and vegetation of pioneering plants might have promoted ecological restoration of mine tailings. PMID:27126064

  11. Report of investigation on underground limestone mines in the Ohio region. [Jonathan Mine, Alpha Portland Cement Mine, and Lewisburg Mine

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Byerly, D.W.

    1976-06-01

    The following is a report of investigation on the geologic setting of several underground limestone mines in Ohio other than the PPG mine at Barberton, Ohio. Due to the element of available time, the writer is only able to deliver a brief synopsis of the geology of three sites visited. These three sites and the Barberton, Ohio site are the only underground limestone mines in Ohio to the best of the writer's knowledge. The sites visited include: (1) the Jonathan Mine located near Zanesville, Ohio, and currently operated by the Columbia Cement Corporation; (2) the abandoned Alpha Portland Cement Minemore » located near Ironton, Ohio; and (3) the Lewisburg Mine located at Lewisburg, Ohio, and currently being utilized as an underground storage facility. Other remaining possibilities where limestone is being mined underground are located in middle Ordovician strata near Carntown and Maysville, Kentucky. These are drift mines into a thick sequence of carbonates. The writer predicts, however, that these mines would have some problems with water due to the preponderance of carbonate rocks and the proximity of the mines to the Ohio River. None of the sites visited nor the sites in Kentucky have conditions comparable to the deep mine at Barberton, Ohio.« less

  12. Mining gene link information for survival pathway hunting.

    PubMed

    Jing, Gao-Jian; Zhang, Zirui; Wang, Hong-Qiang; Zheng, Hong-Mei

    2015-08-01

    This study proposes a gene link-based method for survival time-related pathway hunting. In this method, the authors incorporate gene link information to estimate how a pathway is associated with cancer patient's survival time. Specifically, a gene link-based Cox proportional hazard model (Link-Cox) is established, in which two linked genes are considered together to represent a link variable and the association of the link with survival time is assessed using Cox proportional hazard model. On the basis of the Link-Cox model, the authors formulate a new statistic for measuring the association of a pathway with survival time of cancer patients, referred to as pathway survival score (PSS), by summarising survival significance over all the gene links in the pathway, and devise a permutation test to test the significance of an observed PSS. To evaluate the proposed method, the authors applied it to simulation data and two publicly available real-world gene expression data sets. Extensive comparisons with previous methods show the effectiveness and efficiency of the proposed method for survival pathway hunting.

  13. A Systems-Genetics Approach and Data Mining Tool to Assist in the Discovery of Genes Underlying Complex Traits in Oryza sativa

    PubMed Central

    Ficklin, Stephen P.; Feltus, Frank Alex

    2013-01-01

    Many traits of biological and agronomic significance in plants are controlled in a complex manner where multiple genes and environmental signals affect the expression of the phenotype. In Oryza sativa (rice), thousands of quantitative genetic signals have been mapped to the rice genome. In parallel, thousands of gene expression profiles have been generated across many experimental conditions. Through the discovery of networks with real gene co-expression relationships, it is possible to identify co-localized genetic and gene expression signals that implicate complex genotype-phenotype relationships. In this work, we used a knowledge-independent, systems genetics approach, to discover a high-quality set of co-expression networks, termed Gene Interaction Layers (GILs). Twenty-two GILs were constructed from 1,306 Affymetrix microarray rice expression profiles that were pre-clustered to allow for improved capture of gene co-expression relationships. Functional genomic and genetic data, including over 8,000 QTLs and 766 phenotype-tagged SNPs (p-value < = 0.001) from genome-wide association studies, both covering over 230 different rice traits were integrated with the GILs. An online systems genetics data-mining resource, the GeneNet Engine, was constructed to enable dynamic discovery of gene sets (i.e. network modules) that overlap with genetic traits. GeneNet Engine does not provide the exact set of genes underlying a given complex trait, but through the evidence of gene-marker correspondence, co-expression, and functional enrichment, site visitors can identify genes with potential shared causality for a trait which could then be used for experimental validation. A set of 2 million SNPs was incorporated into the database and serve as a potential set of testable biomarkers for genes in modules that overlap with genetic traits. Herein, we describe two modules found using GeneNet Engine, one with significant overlap with the trait amylose content and another with

  14. SSH gene expression profile of Eisenia andrei exposed in situ to a naturally contaminated soil from an abandoned uranium mine.

    PubMed

    Lourenço, Joana; Pereira, Ruth; Gonçalves, Fernando; Mendo, Sónia

    2013-02-01

    The effects of the exposure of earthworms (Eisenia andrei) to contaminated soil from an abandoned uranium mine, were assessed through gene expression profile evaluation by Suppression Subtractive Hybridization (SSH). Organisms were exposed in situ for 56 days, in containers placed both in a contaminated and in a non-contaminated site (reference). Organisms were sampled after 14 and 56 days of exposure. Results showed that the main physiological functions affected by the exposure to metals and radionuclides were: metabolism, oxireductase activity, redox homeostasis and response to chemical stimulus and stress. The relative expression of NADH dehydrogenase subunit 1 and elongation factor 1 alpha was also affected, since the genes encoding these enzymes were significantly up and down-regulated, after 14 and 56 days of exposure, respectively. Also, an EST with homology for SET oncogene was found to be up-regulated. To the best of our knowledge, this is the first time that this gene was identified in earthworms and thus, further studies are required, to clarify its involvement in the toxicity of metals and radionuclides. Considering the results herein presented, gene expression profiling proved to be a very useful tool to detect earthworms underlying responses to metals and radionuclides exposure, pointing out for the detection and development of potential new biomarkers. Copyright © 2012 Elsevier Inc. All rights reserved.

  15. Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression.

    PubMed

    Li, Shuyu; Li, Yiqun Helen; Wei, Tao; Su, Eric Wen; Duffin, Kevin; Liao, Birong

    2006-10-25

    The tissue expression pattern of a gene often provides an important clue to its potential role in a biological process. A vast amount of gene expression data have been and are being accumulated in public repository through different technology platforms. However, exploitations of these rich data sources remain limited in part due to issues of technology standardization. Our objective is to test the data comparability between SAGE and microarray technologies, through examining the expression pattern of genes under normal physiological states across variety of tissues. There are 42-54% of genes showing significant correlations in tissue expression patterns between SAGE and GeneChip, with 30-40% of genes whose expression patterns are positively correlated and 10-15% of genes whose expression patterns are negatively correlated at a statistically significant level (p = 0.05). Our analysis suggests that the discrepancy on the expression patterns derived from technology platforms is not likely from the heterogeneity of tissues used in these technologies, or other spurious correlations resulting from microarray probe design, abundance of genes, or gene function. The discrepancy can be partially explained by errors in the original assignment of SAGE tags to genes due to the evolution of sequence databases. In addition, sequence analysis has indicated that many SAGE tags and Affymetrix array probe sets are mapped to different splice variants or different sequence regions although they represent the same gene, which also contributes to the observed discrepancies between SAGE and array expression data. To our knowledge, this is the first report attempting to mine gene expression patterns across tissues using public data from different technology platforms. Unlike previous similar studies that only demonstrated the discrepancies between the two gene expression platforms, we carried out in-depth analysis to further investigate the cause for such discrepancies. Our study shows

  16. Implementation of Paste Backfill Mining Technology in Chinese Coal Mines

    PubMed Central

    Chang, Qingliang; Zhou, Huaqiang; Bai, Jianbiao

    2014-01-01

    Implementation of clean mining technology at coal mines is crucial to protect the environment and maintain balance among energy resources, consumption, and ecology. After reviewing present coal clean mining technology, we introduce the technology principles and technological process of paste backfill mining in coal mines and discuss the components and features of backfill materials, the constitution of the backfill system, and the backfill process. Specific implementation of this technology and its application are analyzed for paste backfill mining in Daizhuang Coal Mine; a practical implementation shows that paste backfill mining can improve the safety and excavation rate of coal mining, which can effectively resolve surface subsidence problems caused by underground mining activities, by utilizing solid waste such as coal gangues as a resource. Therefore, paste backfill mining is an effective clean coal mining technology, which has widespread application. PMID:25258737

  17. Implementation of paste backfill mining technology in Chinese coal mines.

    PubMed

    Chang, Qingliang; Chen, Jianhang; Zhou, Huaqiang; Bai, Jianbiao

    2014-01-01

    Implementation of clean mining technology at coal mines is crucial to protect the environment and maintain balance among energy resources, consumption, and ecology. After reviewing present coal clean mining technology, we introduce the technology principles and technological process of paste backfill mining in coal mines and discuss the components and features of backfill materials, the constitution of the backfill system, and the backfill process. Specific implementation of this technology and its application are analyzed for paste backfill mining in Daizhuang Coal Mine; a practical implementation shows that paste backfill mining can improve the safety and excavation rate of coal mining, which can effectively resolve surface subsidence problems caused by underground mining activities, by utilizing solid waste such as coal gangues as a resource. Therefore, paste backfill mining is an effective clean coal mining technology, which has widespread application.

  18. Global differential gene expression in response to growth temperature alteration in group A Streptococcus.

    PubMed

    Smoot, L M; Smoot, J C; Graham, M R; Somerville, G A; Sturdevant, D E; Migliaccio, C A; Sylva, G L; Musser, J M

    2001-08-28

    Pathogens are exposed to different temperatures during an infection cycle and must regulate gene expression accordingly. However, the extent to which virulent bacteria alter gene expression in response to temperatures encountered in the host is unknown. Group A Streptococcus (GAS) is a human-specific pathogen that is responsible for illnesses ranging from superficial skin infections and pharyngitis to severe invasive infections such as necrotizing fasciitis and streptococcal toxic shock syndrome. GAS survives and multiplies at different temperatures during human infection. DNA microarray analysis was used to investigate the influence of temperature on global gene expression in a serotype M1 strain grown to exponential phase at 29 degrees C and 37 degrees C. Approximately 9% of genes were differentially expressed by at least 1.5-fold at 29 degrees C relative to 37 degrees C, including genes encoding transporter proteins, proteins involved in iron homeostasis, transcriptional regulators, phage-associated proteins, and proteins with no known homologue. Relatively few known virulence genes were differentially expressed at this threshold. However, transcription of 28 genes encoding proteins with predicted secretion signal sequences was altered, indicating that growth temperature substantially influences the extracellular proteome. TaqMan real-time reverse transcription-PCR assays confirmed the microarray data. We also discovered that transcription of genes encoding hemolysins, and proteins with inferred roles in iron regulation, transport, and homeostasis, was influenced by growth at 40 degrees C. Thus, GAS profoundly alters gene expression in response to temperature. The data delineate the spectrum of temperature-regulated gene expression in an important human pathogen and provide many unforeseen lines of pathogenesis investigation.

  19. Microbial genotype-phenotype mapping by class association rule mining.

    PubMed

    Tamura, Makio; D'haeseleer, Patrik

    2008-07-01

    Microbial phenotypes are typically due to the concerted action of multiple gene functions, yet the presence of each gene may have only a weak correlation with the observed phenotype. Hence, it may be more appropriate to examine co-occurrence between sets of genes and a phenotype (multiple-to-one) instead of pairwise relations between a single gene and the phenotype. Here, we propose an efficient class association rule mining algorithm, netCAR, in order to extract sets of COGs (clusters of orthologous groups of proteins) associated with a phenotype from COG phylogenetic profiles and a phenotype profile. netCAR takes into account the phylogenetic co-occurrence graph between COGs to restrict hypothesis space, and uses mutual information to evaluate the biconditional relation. We examined the mining capability of pairwise and multiple-to-one association by using netCAR to extract COGs relevant to six microbial phenotypes (aerobic, anaerobic, facultative, endospore, motility and Gram negative) from 11,969 unique COG profiles across 155 prokaryotic organisms. With the same level of false discovery rate, multiple-to-one association can extract about 10 times more relevant COGs than one-to-one association. We also reveal various topologies of association networks among COGs (modules) from extracted multiple-to-one correlation rules relevant with the six phenotypes; including a well-connected network for motility, a star-shaped network for aerobic and intermediate topologies for the other phenotypes. netCAR outperforms a standard CAR mining algorithm, CARapriori, while requiring several orders of magnitude less computational time for extracting 3-COG sets. Source code of the Java implementation is available as Supplementary Material at the Bioinformatics online website, or upon request to the author. Supplementary data are available at Bioinformatics online.

  20. Global Occurrence of Archaeal amoA Genes in Terrestrial Hot Springs▿

    PubMed Central

    Zhang, Chuanlun L.; Ye, Qi; Huang, Zhiyong; Li, WenJun; Chen, Jinquan; Song, Zhaoqi; Zhao, Weidong; Bagwell, Christopher; Inskeep, William P.; Ross, Christian; Gao, Lei; Wiegel, Juergen; Romanek, Christopher S.; Shock, Everett L.; Hedlund, Brian P.

    2008-01-01

    transcribed in situ in one spring and the transcripts were closely related to the amoA genes amplified from the same spring. Our study demonstrates the global occurrence of putative archaeal amoA genes in a wide variety of terrestrial hot springs and suggests that geography may play an important role in selecting different assemblages of AOA. PMID:18676703

  1. Global occurrence of archaeal amoA genes in terrestrial hot springs.

    PubMed

    Zhang, Chuanlun L; Ye, Qi; Huang, Zhiyong; Li, Wenjun; Chen, Jinquan; Song, Zhaoqi; Zhao, Weidong; Bagwell, Christopher; Inskeep, William P; Ross, Christian; Gao, Lei; Wiegel, Juergen; Romanek, Christopher S; Shock, Everett L; Hedlund, Brian P

    2008-10-01

    transcribed in situ in one spring and the transcripts were closely related to the amoA genes amplified from the same spring. Our study demonstrates the global occurrence of putative archaeal amoA genes in a wide variety of terrestrial hot springs and suggests that geography may play an important role in selecting different assemblages of AOA.

  2. Long-Term Improvement of Neurological Signs and Metabolic Dysfunction in a Mouse Model of Krabbe's Disease after Global Gene Therapy.

    PubMed

    Marshall, Michael S; Issa, Yazan; Jakubauskas, Benas; Stoskute, Monika; Elackattu, Vince; Marshall, Jeffrey N; Bogue, Wil; Nguyen, Duc; Hauck, Zane; Rue, Emily; Karumuthil-Melethil, Subha; Zaric, Violeta; Bosland, Maarten; van Breemen, Richard B; Givogri, Maria I; Gray, Steven J; Crocker, Stephen J; Bongarzone, Ernesto R

    2018-03-07

    We report a global adeno-associated virus (AAV)9-based gene therapy protocol to deliver therapeutic galactosylceramidase (GALC), a lysosomal enzyme that is deficient in Krabbe's disease. When globally administered via intrathecal, intracranial, and intravenous injections to newborn mice affected with GALC deficiency (twitcher mice), this approach largely surpassed prior published benchmarks of survival and metabolic correction, showing long-term protection of demyelination, neuroinflammation, and motor function. Bone marrow transplantation, performed in this protocol without immunosuppressive preconditioning, added minimal benefits to the AAV9 gene therapy. Contrasting with other proposed pre-clinical therapies, these results demonstrate that achieving nearly complete correction of GALC's metabolic deficiencies across the entire nervous system via gene therapy can have a significant improvement to behavioral deficits, pathophysiological changes, and survival. These results are an important consideration for determining the safest and most effective manner for adapting gene therapy to treat this leukodystrophy in the clinic. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  3. Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II.

    PubMed

    Lu, Zhiyong; Hirschman, Lynette

    2012-01-01

    Manual curation of data from the biomedical literature is a rate-limiting factor for many expert curated databases. Despite the continuing advances in biomedical text mining and the pressing needs of biocurators for better tools, few existing text-mining tools have been successfully integrated into production literature curation systems such as those used by the expert curated databases. To close this gap and better understand all aspects of literature curation, we invited submissions of written descriptions of curation workflows from expert curated databases for the BioCreative 2012 Workshop Track II. We received seven qualified contributions, primarily from model organism databases. Based on these descriptions, we identified commonalities and differences across the workflows, the common ontologies and controlled vocabularies used and the current and desired uses of text mining for biocuration. Compared to a survey done in 2009, our 2012 results show that many more databases are now using text mining in parts of their curation workflows. In addition, the workshop participants identified text-mining aids for finding gene names and symbols (gene indexing), prioritization of documents for curation (document triage) and ontology concept assignment as those most desired by the biocurators. DATABASE URL: http://www.biocreative.org/tasks/bc-workshop-2012/workflow/.

  4. Mining genes involved in insecticide resistance of Liposcelis bostrychophila Badonnel by transcriptome and expression profile analysis.

    PubMed

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids.

  5. Mining Genes Involved in Insecticide Resistance of Liposcelis bostrychophila Badonnel by Transcriptome and Expression Profile Analysis

    PubMed Central

    Dou, Wei; Shen, Guang-Mao; Niu, Jin-Zhi; Ding, Tian-Bo; Wei, Dan-Dan; Wang, Jin-Jun

    2013-01-01

    Background Recent studies indicate that infestations of psocids pose a new risk for global food security. Among the psocids species, Liposcelis bostrychophila Badonnel has gained recognition in importance because of its parthenogenic reproduction, rapid adaptation, and increased worldwide distribution. To date, the molecular data available for L. bostrychophila is largely limited to genes identified through homology. Also, no transcriptome data relevant to psocids infection is available. Methodology and Principal Findings In this study, we generated de novo assembly of L. bostrychophila transcriptome performed through the short read sequencing technology (Illumina). In a single run, we obtained more than 51 million sequencing reads that were assembled into 60,012 unigenes (mean size = 711 bp) by Trinity. The transcriptome sequences from different developmental stages of L. bostrychophila including egg, nymph and adult were annotated with non-redundant (Nr) protein database, gene ontology (GO), cluster of orthologous groups of proteins (COG), and KEGG orthology (KO). The analysis revealed three major enzyme families involved in insecticide metabolism as differentially expressed in the L. bostrychophila transcriptome. A total of 49 P450-, 31 GST- and 21 CES-specific genes representing the three enzyme families were identified. Besides, 16 transcripts were identified to contain target site sequences of resistance genes. Furthermore, we profiled gene expression patterns upon insecticide (malathion and deltamethrin) exposure using the tag-based digital gene expression (DGE) method. Conclusion The L. bostrychophila transcriptome and DGE data provide gene expression data that would further our understanding of molecular mechanisms in psocids. In particular, the findings of this investigation will facilitate identification of genes involved in insecticide resistance and designing of new compounds for control of psocids. PMID:24278202

  6. On the reported ionospheric precursor of the 1999 Hector Mine, California earthquake

    USGS Publications Warehouse

    Thomas, Jeremy N.; Love, Jeffrey J.; Komjathy, Attila; Verkhoglyadova, Olga P.; Butala, Mark; Rivera, Nicholas

    2012-01-01

    Using Global Positioning System (GPS) data from sites near the 16 Oct. 1999 Hector Mine, California earthquake, Pulinets et al. (2007) identified anomalous changes in the ionospheric total electron content (TEC) starting one week prior to the earthquake. Pulinets (2007) suggested that precursory phenomena of this type could be useful for predicting earthquakes. On the other hand, and in a separate analysis, Afraimovich et al. (2004) concluded that TEC variations near the epicenter were controlled by solar and geomagnetic activity that were unrelated to the earthquake. In an investigation of these very different results, we examine TEC time series of long duration from GPS stations near and far from the epicenter of the Hector Mine earthquake, and long before and long after the earthquake. While we can reproduce the essential time series results of Pulinets et al., we find that the signal they identify as anomalous is not actually anomalous. Instead, it is just part of normal global-scale TEC variation. We conclude that the TEC anomaly reported by Pulinets et al. is unrelated to the Hector Mine earthquake.

  7. Economics of mining law

    USGS Publications Warehouse

    Long, K.R.

    1995-01-01

    Modern mining law, by facilitating socially and environmentally acceptable exploration, development, and production of mineral materials, helps secure the benefits of mineral production while minimizing environmental harm and accounting for increasing land-use competition. Mining investments are sunk costs, irreversibly tied to a particular mineral site, and require many years to recoup. Providing security of tenure is the most critical element of a practical mining law. Governments owning mineral rights have a conflict of interest between their roles as a profit-maximizing landowner and as a guardian of public welfare. As a monopoly supplier, governments have considerable power to manipulate mineral-rights markets. To avoid monopoly rent-seeking by governments, a competitive market for government-owned mineral rights must be created by artifice. What mining firms will pay for mineral rights depends on expected exploration success and extraction costs. Landowners and mining firms will negotlate respective shares of anticipated differential rents, usually allowing for some form of risk sharing. Private landowners do not normally account for external benefits or costs of minerals use. Government ownership of mineral rights allows for direct accounting of social prices for mineral-bearing lands and external costs. An equitable and efficient method is to charge an appropriate reservation price for surface land use, net of the value of land after reclamation, and to recover all or part of differential rents through a flat income or resource-rent tax. The traditional royalty on gross value of production, essentially a regressive income tax, cannot recover as much rent as a flat income tax, causes arbitrary mineral-reserve sterilization, and creates a bias toward development on the extensive margin where marginal environmental costs are higher. Mitigating environmental costs and resolving land-use conflicts require local evaluation and planning. National oversight ensures

  8. pGenN, a gene normalization tool for plant genes and proteins in scientific literature.

    PubMed

    Ding, Ruoyao; Arighi, Cecilia N; Lee, Jung-Youn; Wu, Cathy H; Vijay-Shanker, K

    2015-01-01

    Automatically detecting gene/protein names in the literature and connecting them to databases records, also known as gene normalization, provides a means to structure the information buried in free-text literature. Gene normalization is critical for improving the coverage of annotation in the databases, and is an essential component of many text mining systems and database curation pipelines. In this manuscript, we describe a gene normalization system specifically tailored for plant species, called pGenN (pivot-based Gene Normalization). The system consists of three steps: dictionary-based gene mention detection, species assignment, and intra species normalization. We have developed new heuristics to improve each of these phases. We evaluated the performance of pGenN on an in-house expertly annotated corpus consisting of 104 plant relevant abstracts. Our system achieved an F-value of 88.9% (Precision 90.9% and Recall 87.2%) on this corpus, outperforming state-of-art systems presented in BioCreative III. We have processed over 440,000 plant-related Medline abstracts using pGenN. The gene normalization results are stored in a local database for direct query from the pGenN web interface (proteininformationresource.org/pgenn/). The annotated literature corpus is also publicly available through the PIR text mining portal (proteininformationresource.org/iprolink/).

  9. 30 CFR 819.21 - Auger mining: Protection of underground mining.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 3 2011-07-01 2011-07-01 false Auger mining: Protection of underground mining. 819.21 Section 819.21 Mineral Resources OFFICE OF SURFACE MINING RECLAMATION AND ENFORCEMENT... STANDARDS-AUGER MINING § 819.21 Auger mining: Protection of underground mining. Auger holes shall not extend...

  10. Mining Security Pipe(TSM)with Underground GPS Global(RSPG)Escape Security Device in Underground Mining

    NASA Astrophysics Data System (ADS)

    Giménez, Rafael Barrionuevo

    2016-06-01

    TSM is escape pipe in case of collapse of terrain. The TSM is a passive security tool placed underground to connect the work area with secure area (mining gallery mainly). TSM is light and hand able pipe made with aramid (Kevlar), carbon fibre, or other kind of new material. The TSM will be placed as a pipe line network with many in/out entrances/exits to rich and connect problem work areas with another parts in a safe mode. Different levels of instrumentation could be added inside such as micro-led escape way suggested, temperature, humidity, level of oxygen, etc.). The open hardware and software like Arduino will be the heart of control and automation system.

  11. Reconstructing disturbance history for an intensively mined region by time-series analysis of Landsat imagery.

    PubMed

    Li, Jing; Zipper, Carl E; Donovan, Patricia F; Wynne, Randolph H; Oliphant, Adam J

    2015-09-01

    Surface mining disturbances have attracted attention globally due to extensive influence on topography, land use, ecosystems, and human populations in mineral-rich regions. We analyzed a time series of Landsat satellite imagery to produce a 28-year disturbance history for surface coal mining in a segment of eastern USA's central Appalachian coalfield, southwestern Virginia. The method was developed and applied as a three-step sequence: vegetation index selection, persistent vegetation identification, and mined-land delineation by year of disturbance. The overall classification accuracy and kappa coefficient were 0.9350 and 0.9252, respectively. Most surface coal mines were identified correctly by location and by time of initial disturbance. More than 8 % of southwestern Virginia's >4000-km(2) coalfield area was disturbed by surface coal mining over the 28-year period. Approximately 19.5 % of the Appalachian coalfield surface within the most intensively mined county (Wise County) has been disturbed by mining. Mining disturbances expanded steadily and progressively over the study period. Information generated can be applied to gain further insight concerning mining influences on ecosystems and other essential environmental features.

  12. Data Mining in Earth System Science (DMESS 2011)

    Treesearch

    Forrest M. Hoffman; J. Walter Larson; Richard Tran Mills; Bhorn-Gustaf Brooks; Auroop R. Ganguly; William Hargrove; et al

    2011-01-01

    From field-scale measurements to global climate simulations and remote sensing, the growing body of very large and long time series Earth science data are increasingly difficult to analyze, visualize, and interpret. Data mining, information theoretic, and machine learning techniques—such as cluster analysis, singular value decomposition, block entropy, Fourier and...

  13. Benchmarking infrastructure for mutation text mining

    PubMed Central

    2014-01-01

    Background Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. Results We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. Conclusion We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption. PMID:24568600

  14. Benchmarking infrastructure for mutation text mining.

    PubMed

    Klein, Artjom; Riazanov, Alexandre; Hindle, Matthew M; Baker, Christopher Jo

    2014-02-25

    Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption.

  15. Global demand for gold is another threat for tropical forests

    NASA Astrophysics Data System (ADS)

    Alvarez-Berríos, Nora L.; Aide, T. Mitchell

    2015-01-01

    The current global gold rush, driven by increasing consumption in developing countries and uncertainty in financial markets, is an increasing threat for tropical ecosystems. Gold mining causes significant alteration to the environment, yet mining is often overlooked in deforestation analyses because it occupies relatively small areas. As a result, we lack a comprehensive assessment of the spatial extent of gold mining impacts on tropical forests. In this study, we provide a regional assessment of gold mining deforestation in the tropical moist forest biome of South America. Specifically, we analyzed the patterns of forest change in gold mining sites between 2001 and 2013, and evaluated the proximity of gold mining deforestation to protected areas (PAs). The forest cover maps were produced using the Land Mapper web application and images from the MODIS satellite MOD13Q1 vegetation indices 250 m product. Annual maps of forest cover were used to model the incremental change in forest in ˜1600 potential gold mining sites between 2001-2006 and 2007-2013. Approximately 1680 km2 of tropical moist forest was lost in these mining sites between 2001 and 2013. Deforestation was significantly higher during the 2007-2013 period, and this was associated with the increase in global demand for gold after the international financial crisis. More than 90% of the deforestation occurred in four major hotspots: Guianan moist forest ecoregion (41%), Southwest Amazon moist forest ecoregion (28%), Tapajós-Xingú moist forest ecoregion (11%), and Magdalena Valley montane forest and Magdalena-Urabá moist forest ecoregions (9%). In addition, some of the more active zones of gold mining deforestation occurred inside or within 10 km of ˜32 PAs. There is an urgent need to understand the ecological and social impacts of gold mining because it is an important cause of deforestation in the most remote forests in South America, and the impacts, particularly in aquatic systems, spread well

  16. Global Expression Profiling in Atopic Eczema Reveals Reciprocal Expression of Inflammatory and Lipid Genes

    PubMed Central

    Sääf, Annika M.; Tengvall-Linder, Maria; Chang, Howard Y.; Adler, Adam S.; Wahlgren, Carl-Fredrik; Scheynius, Annika; Nordenskjöld, Magnus; Bradley, Maria

    2008-01-01

    Background Atopic eczema (AE) is a common chronic inflammatory skin disorder. In order to dissect the genetic background several linkage and genetic association studies have been performed. Yet very little is known about specific genes involved in this complex skin disease, and the underlying molecular mechanisms are not fully understood. Methodology/Findings We used human DNA microarrays to identify a molecular picture of the programmed responses of the human genome to AE. The transcriptional program was analyzed in skin biopsy samples from lesional and patch-tested skin from AE patients sensitized to Malassezia sympodialis (M. sympodialis), and corresponding biopsies from healthy individuals. The most notable feature of the global gene-expression pattern observed in AE skin was a reciprocal expression of induced inflammatory genes and repressed lipid metabolism genes. The overall transcriptional response in M. sympodialis patch-tested AE skin was similar to the gene-expression signature identified in lesional AE skin. In the constellation of genes differentially expressed in AE skin compared to healthy control skin, we have identified several potential susceptibility genes that may play a critical role in the pathological condition of AE. Many of these genes, including genes with a role in immune responses, lipid homeostasis, and epidermal differentiation, are localized on chromosomal regions previously linked to AE. Conclusions/Significance Through genome-wide expression profiling, we were able to discover a distinct reciprocal expression pattern of induced inflammatory genes and repressed lipid metabolism genes in skin from AE patients. We found a significant enrichment of differentially expressed genes in AE with cytobands associated to the disease, and furthermore new chromosomal regions were found that could potentially guide future region-specific linkage mapping in AE. The full data set is available at http://microarray-pubs.stanford.edu/eczema. PMID

  17. A Long-Term Mathematical Model for Mining Industries

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Achdou, Yves, E-mail: achdou@ljll.univ-paris-diderot.fr; Giraud, Pierre-Noel; Lasry, Jean-Michel

    A parcimonious long term model is proposed for a mining industry. Knowing the dynamics of the global reserve, the strategy of each production unit consists of an optimal control problem with two controls, first the flux invested into prospection and the building of new extraction facilities, second the production rate. In turn, the dynamics of the global reserve depends on the individual strategies of the producers, so the models leads to an equilibrium, which is described by low dimensional systems of partial differential equations. The dimensionality depends on the number of technologies that a mining producer can choose. In somemore » cases, the systems may be reduced to a Hamilton–Jacobi equation which is degenerate at the boundary and whose right hand side may blow up at the boundary. A mathematical analysis is supplied. Then numerical simulations for models with one or two technologies are described. In particular, a numerical calibration of the model in order to fit the historical data is carried out.« less

  18. Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

    PubMed

    Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

    2018-01-09

    Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of

  19. Operational Monitoring of Mines by COSMO-SkyMed PSP SAR Interferometry

    NASA Astrophysics Data System (ADS)

    Costantini, Mario; Malvarosa, Fabio; Miniati, Federico; de Assis, Luciano Mozer

    2016-08-01

    Synthetic aperture radar (SAR) interferometry is a powerful technology for detection and monitoring of slow ground surface movements. Monitoring of ground deformations in mining structures is an important application, particularly difficult because the scene changes with time. The persistent scatterer pair (PSP) approach, recently proposed to overcome some limitations of standard persistent scatter interferometry, proved to be effective also for mine monitoring. In this work, after resuming the main ideas of the PSP method, we describe the PSP measurements obtained from high- resolution X-band COSMO-SkyMed data over a large mining area in Minas Gerais state, Brazil. The outcomes demonstrate that dense and accurate ground deformation measurements can be obtained on the mining area and its structures (such as open pits, waste dumps, conveyor belts, water and tailings dams, etc.), achieving a consistent global view including also areas where field instruments are not installed.

  20. Ammonia-Oligotrophic and Diazotrophic Heavy Metal-Resistant Serratia liquefaciens Strains from Pioneer Plants and Mine Tailings.

    PubMed

    Zelaya-Molina, Lily X; Hernández-Soto, Luis M; Guerra-Camacho, Jairo E; Monterrubio-López, Ricardo; Patiño-Siciliano, Alfredo; Villa-Tanaca, Lourdes; Hernández-Rodríguez, César

    2016-08-01

    Mine tailings are man-made environments characterized by low levels of organic carbon and assimilable nitrogen, as well as moderate concentrations of heavy metals. For the introduction of nitrogen into these environments, a key role is played by ammonia-oligotrophic/diazotrophic heavy metal-resistant guilds. In mine tailings from Zacatecas, Mexico, Serratia liquefaciens was the dominant heterotrophic culturable species isolated in N-free media from bulk mine tailings as well as the rhizosphere, roots, and aerial parts of pioneer plants. S. liquefaciens strains proved to be a meta-population with high intraspecific genetic diversity and a potential to respond to these extreme conditions. The phenotypic and genotypic features of these strains reveal the potential adaptation of S. liquefaciens to oligotrophic and nitrogen-limited mine tailings with high concentrations of heavy metals. These features include ammonia-oligotrophic growth, nitrogen fixation, siderophore and indoleacetic acid production, phosphate solubilization, biofilm formation, moderate tolerance to heavy metals under conditions of diverse nitrogen availability, and the presence of zntA, amtB, and nifH genes. The acetylene reduction assay suggests low nitrogen-fixing activity. The nifH gene was harbored in a plasmid of ∼60 kb and probably was acquired by a horizontal gene transfer event from Klebsiella variicola.

  1. Mine Waste at The Kherzet Youcef Mine : Environmental Characterization

    NASA Astrophysics Data System (ADS)

    Issaad, Mouloud; Boutaleb, Abdelhak; Kolli, Omar

    2017-04-01

    Mining activity in Algeria has existed since antiquity. But it was very important since the 20th century. This activity has virtually ceased since the beginning of the 1990s, leaving many mine sites abandoned (so-called orphan mines). The abandonment of mining today poses many environmental problems (soil pollution, contamination of surface water, mining collapses...). The mining wastes often occupy large volumes that can be hazardous to the environment and human health, often neglected in the past: Faulting geotechnical implementation, acid mine drainage (AMD), alkalinity, presence of pollutants and toxic substances (heavy metals, cyanide...). The study started already six years ago and it covers all mines located in NE Algeria, almost are stopped for more than thirty years. So the most important is to have an overview of all the study area. After the inventory job of the abandoned mines, the rock drainage prediction will help us to classify sites according to their acid generating potential.

  2. SparkText: Biomedical Text Mining on Big Data Framework

    PubMed Central

    He, Karen Y.; Wang, Kai

    2016-01-01

    Background Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652

  3. A Global Survey and Interactive Map Suite of Deep Underground Facilities; Examples of Geotechnical and Engineering Capabilities, Achievements, Challenges: (Mines, Shafts, Tunnels, Boreholes, Sites and Underground Facilities for Nuclear Waste and Physics R&D)

    NASA Astrophysics Data System (ADS)

    Tynan, M. C.; Russell, G. P.; Perry, F.; Kelley, R.; Champenois, S. T.

    2017-12-01

    This global survey presents a synthesis of some notable geotechnical and engineering information reflected in four interactive layer maps for selected: 1) deep mines and shafts; 2) existing, considered or planned radioactive waste management deep underground studies, sites, or disposal facilities; 3) deep large diameter boreholes, and 4) physics underground laboratories and facilities from around the world. These data are intended to facilitate user access to basic information and references regarding deep underground "facilities", history, activities, and plans. In general, the interactive maps and database [http://gis.inl.gov/globalsites/] provide each facility's approximate site location, geology, and engineered features (e.g.: access, geometry, depth, diameter, year of operations, groundwater, lithology, host unit name and age, basin; operator, management organization, geographic data, nearby cultural features, other). Although the survey is not all encompassing, it is a comprehensive review of many of the significant existing and historical underground facilities discussed in the literature addressing radioactive waste management and deep mined geologic disposal safety systems. The global survey is intended to support and to inform: 1) interested parties and decision makers; 2) radioactive waste disposal and siting option evaluations, and 3) safety case development as a communication tool applicable to any mined geologic disposal facility as a demonstration of historical and current engineering and geotechnical capabilities available for use in deep underground facility siting, planning, construction, operations and monitoring.

  4. Uses of antimicrobial genes from microbial genome

    DOEpatents

    Sorek, Rotem; Rubin, Edward M.

    2013-08-20

    We describe a method for mining microbial genomes to discover antimicrobial genes and proteins having broad spectrum of activity. Also described are antimicrobial genes and their expression products from various microbial genomes that were found using this method. The products of such genes can be used as antimicrobial agents or as tools for molecular biology.

  5. Analysis of global and absorption, distribution, metabolism, and elimination gene expression in the progressive stages of human nonalcoholic fatty liver disease.

    PubMed

    Lake, April D; Novak, Petr; Fisher, Craig D; Jackson, Jonathan P; Hardwick, Rhiannon N; Billheimer, D Dean; Klimecki, Walter T; Cherrington, Nathan J

    2011-10-01

    Nonalcoholic fatty liver disease (NAFLD) is characterized by a series of pathological changes that range from simple fatty liver to nonalcoholic steatohepatitis (NASH). The objective of this study is to describe changes in global gene expression associated with the progression of human NAFLD. This study is focused on the expression levels of genes responsible for the absorption, distribution, metabolism, and elimination (ADME) of drugs. Differential gene expression between three clinically defined pathological groups-normal, steatosis, and NASH-was analyzed. Genome-wide mRNA levels in samples of human liver tissue were assayed with Affymetrix GeneChip Human 1.0ST arrays. A total of 11,633 genes exhibited altered expression out of 33,252 genes at a 5% false discovery rate. Most gene expression changes occurred in the progression from steatosis to NASH. Principal component analysis revealed that hepatic disease status was the major determinant of differential ADME gene expression rather than age or sex of sample donors. Among the 515 drug transporters and 258 drug-metabolizing enzymes (DMEs) examined, uptake transporters but not efflux transporters or DMEs were significantly over-represented in the number of genes down-regulated. These results suggest that uptake transporter genes are coordinately targeted for down-regulation at the global level during the pathological development of NASH and that these patients may have decreased drug uptake capacity. This coordinated regulation of uptake transporter genes is indicative of a hepatoprotective mechanism acting to prevent accumulation of toxic intermediates in disease-compromised hepatocytes.

  6. Different Temporal Effects of Ebola Virus VP35 and VP24 Proteins on Global Gene Expression in Human Dendritic Cells.

    PubMed

    Ilinykh, Philipp A; Lubaki, Ndongala M; Widen, Steven G; Renn, Lynnsey A; Theisen, Terence C; Rabin, Ronald L; Wood, Thomas G; Bukreyev, Alexander

    2015-08-01

    Ebola virus (EBOV) causes a severe hemorrhagic fever with a deficient immune response, lymphopenia, and lymphocyte apoptosis. Dendritic cells (DC), which trigger the adaptive response, do not mature despite EBOV infection. We recently demonstrated that DC maturation is unblocked by disabling the innate response antagonizing domains (IRADs) in EBOV VP35 and VP24 by the mutations R312A and K142A, respectively. Here we analyzed the effects of VP35 and VP24 with the IRADs disabled on global gene expression in human DC. Human monocyte-derived DC were infected by wild-type (wt) EBOV or EBOVs carrying the mutation in VP35 (EBOV/VP35m), VP24 (EBOV/VP24m), or both (EBOV/VP35m/VP24m). Global gene expression at 8 and 24 h was analyzed by deep sequencing, and the expression of interferon (IFN) subtypes up to 5 days postinfection was analyzed by quantitative reverse transcription-PCR (qRT-PCR). wt EBOV induced a weak global gene expression response, including markers of DC maturation, cytokines, chemokines, chemokine receptors, and multiple IFNs. The VP35 mutation unblocked the expression, resulting in a dramatic increase in expression of these transcripts at 8 and 24 h. Surprisingly, DC infected with EBOV/VP24m expressed lower levels of many of these transcripts at 8 h after infection, compared to wt EBOV. In contrast, at 24 h, expression of the transcripts increased in DC infected with any of the three mutants, compared to wt EBOV. Moreover, sets of genes affected by the two mutations only partially overlapped. Pathway analysis demonstrated that the VP35 mutation unblocked pathways involved in antigen processing and presentation and IFN signaling. These data suggest that EBOV IRADs have profound effects on the host adaptive immune response through massive transcriptional downregulation of DC. This study shows that infection of DC with EBOV, but not its mutant forms with the VP35 IRAD and/or VP24 IRAD disabled, causes a global block in expression of host genes. The temporal

  7. Innovative Competencies of Mining engineers in Transition to the Sustainable Development

    NASA Astrophysics Data System (ADS)

    Krechetov, Andrey; Khoreshok, Alexey; Blumenstein, Valery

    2017-11-01

    The transition to the sustainable development posed new challenges to the system of mining higher education. They are determined by the acceleration of scientific and technological progress and widespread introduction of innovations, convergence of technologies from various industries. On the one hand, globalization and rapid technology development are constantly increasing quality requirements for the labor resources of the mineral and raw materials complex and constant improvement of their skills. On the other hand, the transition to the sustainable development provides the necessity for rational use of raw materials and environmental protection. This requires the improvement of staff support system for mining operations and the interaction of enterprises with universities training mining engineers, aimed at the innovative competencies development of future miners.

  8. Contextualising the topographic signature of historic mining, a scaling analysis

    NASA Astrophysics Data System (ADS)

    Reinhardt, Liam

    2017-04-01

    Mining is globally one of the most significant means by which humans alter landscapes; we do so through erosion (mining), transport, and deposition of extracted sediments (waste). The iconic Dartmoor mountain landscape of SW England ( 700km2) has experienced over 1000 years of shallow (Cu & Sn) mining that has left a pervasive imprint on the landscape. The availability of high resolution digital elevation models (<=1m) and aerial photographs @12.5 cm resolution) combined with historic records of mining activity and output make this an ideal location to investigate the topographic signature of mining. Conceptually I ask the question: how much (digital elevation model) smoothing is required to remove the human imprint from this landscape ? While we may have entered the Anthropocene other gravity driven process have imparted distinct scale-dependant signatures. How might the human signature differ from these processes and how pervasive is it at the landscape scale? Spatial scaling analysis (curvature & semi-variance) was used to quantify the topographic signature of historic mining and to determine how it differs to a) natural landforms such as bedrock tors; and b) the morphology of biological activity (e.g. peat formation). Other forms of historic activity such as peat cutting and quarrying were also investigated. The existence of 400 years of mine activity archives also makes it possible to distinguish between the imprint of differing forms of mine technology and their spatio-temporal signature. Interestingly the higher technology 19th C mines have left a much smaller topographic legacy than Medieval miners; though the former had a much greater impact in terms of heavy metal contamination.

  9. POST-MINING DEVELOPMENT USING RESOURCES FROM FLOODED UNDERGROUND MINE WORKINGS

    EPA Science Inventory

    Post-mining issues of land and surface utilization now serve to accentuate how important it is to incorporate sustainable development aspects into hard rock mining. In an effort to revitalize lands degraded by historic mining, 10 acres of mine tailings near the Belmont Mine have...

  10. Genomic and transcriptomic analyses reveal adaptation mechanisms of an Acidithiobacillus ferrivorans strain YL15 to alpine acid mine drainage.

    PubMed

    Peng, Tangjian; Ma, Liyuan; Feng, Xue; Tao, Jiemeng; Nan, Meihua; Liu, Yuandong; Li, Jiaokun; Shen, Li; Wu, Xueling; Yu, Runlan; Liu, Xueduan; Qiu, Guanzhou; Zeng, Weimin

    2017-01-01

    Acidithiobacillus ferrivorans is an acidophile that often occurs in low temperature acid mine drainage, e.g., that located at high altitude. Being able to inhabit the extreme environment, the bacterium must possess strategies to copy with the survival stress. Nonetheless, information on the strategies is in demand. Here, genomic and transcriptomic assays were performed to illuminate the adaptation mechanisms of an A. ferrivorans strain YL15, to the alpine acid mine drainage environment in Yulong copper mine in southwest China. Genomic analysis revealed that strain has a gene repertoire for metal-resistance, e.g., genes coding for the mer operon and a variety of transporters/efflux proteins, and for low pH adaptation, such as genes for hopanoid-synthesis and the sodium:proton antiporter. Genes for various DNA repair enzymes and synthesis of UV-absorbing mycosporine-like amino acids precursor indicated hypothetical UV radiation-resistance mechanisms in strain YL15. In addition, it has two types of the acquired immune system-type III-B and type I-F CRISPR/Cas modules against invasion of foreign genetic elements. RNA-seq based analysis uncovered that strain YL15 uses a set of mechanisms to adapt to low temperature. Genes involved in protein synthesis, transmembrane transport, energy metabolism and chemotaxis showed increased levels of RNA transcripts. Furthermore, a bacterioferritin Dps gene had higher RNA transcript counts at 6°C, possibly implicated in protecting DNA against oxidative stress at low temperature. The study represents the first to comprehensively unveil the adaptation mechanisms of an acidophilic bacterium to the acid mine drainage in alpine regions.

  11. Genomic and transcriptomic analyses reveal adaptation mechanisms of an Acidithiobacillus ferrivorans strain YL15 to alpine acid mine drainage

    PubMed Central

    Ma, Liyuan; Feng, Xue; Tao, Jiemeng; Nan, Meihua; Liu, Yuandong; Li, Jiaokun; Shen, Li; Wu, Xueling; Yu, Runlan; Liu, Xueduan; Qiu, Guanzhou; Zeng, Weimin

    2017-01-01

    Acidithiobacillus ferrivorans is an acidophile that often occurs in low temperature acid mine drainage, e.g., that located at high altitude. Being able to inhabit the extreme environment, the bacterium must possess strategies to copy with the survival stress. Nonetheless, information on the strategies is in demand. Here, genomic and transcriptomic assays were performed to illuminate the adaptation mechanisms of an A. ferrivorans strain YL15, to the alpine acid mine drainage environment in Yulong copper mine in southwest China. Genomic analysis revealed that strain has a gene repertoire for metal-resistance, e.g., genes coding for the mer operon and a variety of transporters/efflux proteins, and for low pH adaptation, such as genes for hopanoid-synthesis and the sodium:proton antiporter. Genes for various DNA repair enzymes and synthesis of UV-absorbing mycosporine-like amino acids precursor indicated hypothetical UV radiation—resistance mechanisms in strain YL15. In addition, it has two types of the acquired immune system–type III-B and type I-F CRISPR/Cas modules against invasion of foreign genetic elements. RNA-seq based analysis uncovered that strain YL15 uses a set of mechanisms to adapt to low temperature. Genes involved in protein synthesis, transmembrane transport, energy metabolism and chemotaxis showed increased levels of RNA transcripts. Furthermore, a bacterioferritin Dps gene had higher RNA transcript counts at 6°C, possibly implicated in protecting DNA against oxidative stress at low temperature. The study represents the first to comprehensively unveil the adaptation mechanisms of an acidophilic bacterium to the acid mine drainage in alpine regions. PMID:28542527

  12. Lvr, a Signaling System That Controls Global Gene Regulation and Virulence in Pathogenic Leptospira.

    PubMed

    Adhikarla, Haritha; Wunder, Elsio A; Mechaly, Ariel E; Mehta, Sameet; Wang, Zheng; Santos, Luciane; Bisht, Vimla; Diggle, Peter; Murray, Gerald; Adler, Ben; Lopez, Francesc; Townsend, Jeffrey P; Groisman, Eduardo; Picardeau, Mathieu; Buschiazzo, Alejandro; Ko, Albert I

    2018-01-01

    Leptospirosis is an emerging zoonotic disease with more than 1 million cases annually. Currently there is lack of evidence for signaling pathways involved during the infection process of Leptospira . In our comprehensive genomic analysis of 20 Leptospira spp. we identified seven pathogen-specific Two-Component System (TCS) proteins. Disruption of two these TCS genes in pathogenic Leptospira strain resulted in loss-of-virulence in a hamster model of leptospirosis. Corresponding genes lvrA and lvrB (leptospira virulence regulator ) are juxtaposed in an operon and are predicted to encode a hybrid histidine kinase and a hybrid response regulator, respectively. Transcriptome analysis of lvr mutant strains with disruption of one ( lvrB ) or both genes ( lvrA/B ) revealed global transcriptional regulation of 850 differentially expressed genes. Phosphotransfer assays demonstrated that LvrA phosphorylates LvrB and predicted further signaling downstream to one or more DNA-binding response regulators, suggesting that it is a branched pathway. Phylogenetic analyses indicated that lvrA and lvrB evolved independently within different ecological lineages in Leptospira via gene duplication. This study uncovers a novel-signaling pathway that regulates virulence in pathogenic Leptospira (Lvr), providing a framework to understand the molecular bases of regulation in this life-threatening bacterium.

  13. Lvr, a Signaling System That Controls Global Gene Regulation and Virulence in Pathogenic Leptospira

    PubMed Central

    Adhikarla, Haritha; Wunder, Elsio A.; Mechaly, Ariel E.; Mehta, Sameet; Wang, Zheng; Santos, Luciane; Bisht, Vimla; Diggle, Peter; Murray, Gerald; Adler, Ben; Lopez, Francesc; Townsend, Jeffrey P.; Groisman, Eduardo; Picardeau, Mathieu; Buschiazzo, Alejandro; Ko, Albert I.

    2018-01-01

    Leptospirosis is an emerging zoonotic disease with more than 1 million cases annually. Currently there is lack of evidence for signaling pathways involved during the infection process of Leptospira. In our comprehensive genomic analysis of 20 Leptospira spp. we identified seven pathogen-specific Two-Component System (TCS) proteins. Disruption of two these TCS genes in pathogenic Leptospira strain resulted in loss-of-virulence in a hamster model of leptospirosis. Corresponding genes lvrA and lvrB (leptospira virulence regulator) are juxtaposed in an operon and are predicted to encode a hybrid histidine kinase and a hybrid response regulator, respectively. Transcriptome analysis of lvr mutant strains with disruption of one (lvrB) or both genes (lvrA/B) revealed global transcriptional regulation of 850 differentially expressed genes. Phosphotransfer assays demonstrated that LvrA phosphorylates LvrB and predicted further signaling downstream to one or more DNA-binding response regulators, suggesting that it is a branched pathway. Phylogenetic analyses indicated that lvrA and lvrB evolved independently within different ecological lineages in Leptospira via gene duplication. This study uncovers a novel-signaling pathway that regulates virulence in pathogenic Leptospira (Lvr), providing a framework to understand the molecular bases of regulation in this life-threatening bacterium. PMID:29600195

  14. BGDMdocker: a Docker workflow for data mining and visualization of bacterial pan-genomes and biosynthetic gene clusters.

    PubMed

    Cheng, Gong; Lu, Quan; Ma, Ling; Zhang, Guocai; Xu, Liang; Zhou, Zongshan

    2017-01-01

    Recently, Docker technology has received increasing attention throughout the bioinformatics community. However, its implementation has not yet been mastered by most biologists; accordingly, its application in biological research has been limited. In order to popularize this technology in the field of bioinformatics and to promote the use of publicly available bioinformatics tools, such as Dockerfiles and Images from communities, government sources, and private owners in the Docker Hub Registry and other Docker-based resources, we introduce here a complete and accurate bioinformatics workflow based on Docker. The present workflow enables analysis and visualization of pan-genomes and biosynthetic gene clusters of bacteria. This provides a new solution for bioinformatics mining of big data from various publicly available biological databases. The present step-by-step guide creates an integrative workflow through a Dockerfile to allow researchers to build their own Image and run Container easily.

  15. BGDMdocker: a Docker workflow for data mining and visualization of bacterial pan-genomes and biosynthetic gene clusters

    PubMed Central

    Cheng, Gong; Zhang, Guocai; Xu, Liang

    2017-01-01

    Recently, Docker technology has received increasing attention throughout the bioinformatics community. However, its implementation has not yet been mastered by most biologists; accordingly, its application in biological research has been limited. In order to popularize this technology in the field of bioinformatics and to promote the use of publicly available bioinformatics tools, such as Dockerfiles and Images from communities, government sources, and private owners in the Docker Hub Registry and other Docker-based resources, we introduce here a complete and accurate bioinformatics workflow based on Docker. The present workflow enables analysis and visualization of pan-genomes and biosynthetic gene clusters of bacteria. This provides a new solution for bioinformatics mining of big data from various publicly available biological databases. The present step-by-step guide creates an integrative workflow through a Dockerfile to allow researchers to build their own Image and run Container easily. PMID:29204317

  16. AAV-PHP.B-Mediated Global-Scale Expression in the Mouse Nervous System Enables GBA1 Gene Therapy for Wide Protection from Synucleinopathy.

    PubMed

    Morabito, Giuseppe; Giannelli, Serena G; Ordazzo, Gabriele; Bido, Simone; Castoldi, Valerio; Indrigo, Marzia; Cabassi, Tommaso; Cattaneo, Stefano; Luoni, Mirko; Cancellieri, Cinzia; Sessa, Alessandro; Bacigaluppi, Marco; Taverna, Stefano; Leocani, Letizia; Lanciego, José L; Broccoli, Vania

    2017-12-06

    The lack of technology for direct global-scale targeting of the adult mouse nervous system has hindered research on brain processing and dysfunctions. Currently, gene transfer is normally achieved by intraparenchymal viral injections, but these injections target a restricted brain area. Herein, we demonstrated that intravenous delivery of adeno-associated virus (AAV)-PHP.B viral particles permeated and diffused throughout the neural parenchyma, targeting both the central and the peripheral nervous system in a global pattern. We then established multiple procedures of viral transduction to control gene expression or inactivate gene function exclusively in the adult nervous system and assessed the underlying behavioral effects. Building on these results, we established an effective gene therapy strategy to counteract the widespread accumulation of α-synuclein deposits throughout the forebrain in a mouse model of synucleinopathy. Transduction of A53T-SCNA transgenic mice with AAV-PHP.B-GBA1 restored physiological levels of the enzyme, reduced α-synuclein pathology, and produced significant behavioral recovery. Finally, we provided evidence that AAV-PHP.B brain penetration does not lead to evident dysfunctions in blood-brain barrier integrity or permeability. Altogether, the AAV-PHP.B viral platform enables non-invasive, widespread, and long-lasting global neural expression of therapeutic genes, such as GBA1, providing an invaluable approach to treat neurodegenerative diseases with diffuse brain pathology such as synucleinopathies. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  17. 30 CFR 77.1712 - Reopening mines; notification; inspection prior to mining.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... to mining. 77.1712 Section 77.1712 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION... prior to mining. Prior to reopening any surface coal mine after it has been abandoned or declared... an authorized representative of the Secretary before any mining operations in such mine are...

  18. 30 CFR 77.1712 - Reopening mines; notification; inspection prior to mining.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... to mining. 77.1712 Section 77.1712 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION... prior to mining. Prior to reopening any surface coal mine after it has been abandoned or declared... an authorized representative of the Secretary before any mining operations in such mine are...

  19. Genome mining of the sordarin biosynthetic gene cluster from Sordaria araneosa Cain ATCC 36386: characterization of cycloaraneosene synthase and GDP-6-deoxyaltrose transferase.

    PubMed

    Kudo, Fumitaka; Matsuura, Yasunori; Hayashi, Takaaki; Fukushima, Masayuki; Eguchi, Tadashi

    2016-07-01

    Sordarin is a glycoside antibiotic with a unique tetracyclic diterpene aglycone structure called sordaricin. To understand its intriguing biosynthetic pathway that may include a Diels-Alder-type [4+2]cycloaddition, genome mining of the gene cluster from the draft genome sequence of the producer strain, Sordaria araneosa Cain ATCC 36386, was carried out. A contiguous 67 kb gene cluster consisting of 20 open reading frames encoding a putative diterpene cyclase, a glycosyltransferase, a type I polyketide synthase, and six cytochrome P450 monooxygenases were identified. In vitro enzymatic analysis of the putative diterpene cyclase SdnA showed that it catalyzes the transformation of geranylgeranyl diphosphate to cycloaraneosene, a known biosynthetic intermediate of sordarin. Furthermore, a putative glycosyltransferase SdnJ was found to catalyze the glycosylation of sordaricin in the presence of GDP-6-deoxy-d-altrose to give 4'-O-demethylsordarin. These results suggest that the identified sdn gene cluster is responsible for the biosynthesis of sordarin. Based on the isolated potential biosynthetic intermediates and bioinformatics analysis, a plausible biosynthetic pathway for sordarin is proposed.

  20. Endeavour update: a web resource for gene prioritization in multiple species

    PubMed Central

    Tranchevent, Léon-Charles; Barriot, Roland; Yu, Shi; Van Vooren, Steven; Van Loo, Peter; Coessens, Bert; De Moor, Bart; Aerts, Stein; Moreau, Yves

    2008-01-01

    Endeavour (http://www.esat.kuleuven.be/endeavourweb; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes. Using a training set of genes known to be involved in a biological process of interest, our approach consists of (i) inferring several models (based on various genomic data sources), (ii) applying each model to the candidate genes to rank those candidates against the profile of the known genes and (iii) merging the several rankings into a global ranking of the candidate genes. In the present article, we describe the latest developments of Endeavour. First, we provide a web-based user interface, besides our Java client, to make Endeavour more universally accessible. Second, we support multiple species: in addition to Homo sapiens, we now provide gene prioritization for three major model organisms: Mus musculus, Rattus norvegicus and Caenorhabditis elegans. Third, Endeavour makes use of additional data sources and is now including numerous databases: ontologies and annotations, protein–protein interactions, cis-regulatory information, gene expression data sets, sequence information and text-mining data. We tested the novel version of Endeavour on 32 recent disease gene associations from the literature. Additionally, we describe a number of recent independent studies that made use of Endeavour to prioritize candidate genes for obesity and Type II diabetes, cleft lip and cleft palate, and pulmonary fibrosis. PMID:18508807

  1. pGenN, a Gene Normalization Tool for Plant Genes and Proteins in Scientific Literature

    PubMed Central

    Ding, Ruoyao; Arighi, Cecilia N.; Lee, Jung-Youn; Wu, Cathy H.; Vijay-Shanker, K.

    2015-01-01

    Background Automatically detecting gene/protein names in the literature and connecting them to databases records, also known as gene normalization, provides a means to structure the information buried in free-text literature. Gene normalization is critical for improving the coverage of annotation in the databases, and is an essential component of many text mining systems and database curation pipelines. Methods In this manuscript, we describe a gene normalization system specifically tailored for plant species, called pGenN (pivot-based Gene Normalization). The system consists of three steps: dictionary-based gene mention detection, species assignment, and intra species normalization. We have developed new heuristics to improve each of these phases. Results We evaluated the performance of pGenN on an in-house expertly annotated corpus consisting of 104 plant relevant abstracts. Our system achieved an F-value of 88.9% (Precision 90.9% and Recall 87.2%) on this corpus, outperforming state-of-art systems presented in BioCreative III. We have processed over 440,000 plant-related Medline abstracts using pGenN. The gene normalization results are stored in a local database for direct query from the pGenN web interface (proteininformationresource.org/pgenn/). The annotated literature corpus is also publicly available through the PIR text mining portal (proteininformationresource.org/iprolink/). PMID:26258475

  2. Association mining of dependency between time series

    NASA Astrophysics Data System (ADS)

    Hafez, Alaaeldin

    2001-03-01

    Time series analysis is considered as a crucial component of strategic control over a broad variety of disciplines in business, science and engineering. Time series data is a sequence of observations collected over intervals of time. Each time series describes a phenomenon as a function of time. Analysis on time series data includes discovering trends (or patterns) in a time series sequence. In the last few years, data mining has emerged and been recognized as a new technology for data analysis. Data Mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In this paper, we adapt and innovate data mining techniques to analyze time series data. By using data mining techniques, maximal frequent patterns are discovered and used in predicting future sequences or trends, where trends describe the behavior of a sequence. In order to include different types of time series (e.g. irregular and non- systematic), we consider past frequent patterns of the same time sequences (local patterns) and of other dependent time sequences (global patterns). We use the word 'dependent' instead of the word 'similar' for emphasis on real life time series where two time series sequences could be completely different (in values, shapes, etc.), but they still react to the same conditions in a dependent way. In this paper, we propose the Dependence Mining Technique that could be used in predicting time series sequences. The proposed technique consists of three phases: (a) for all time series sequences, generate their trend sequences, (b) discover maximal frequent trend patterns, generate pattern vectors (to keep information of frequent trend patterns), use trend pattern vectors to predict future time series sequences.

  3. Changes in global gene expression profiles induced by HPV 16 E6 oncoprotein variants in cervical carcinoma C33-A cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zacapala-Gómez, Ana Elvira, E-mail: zak_ana@yahoo.com.mx; Del Moral-Hernández, Oscar, E-mail: odelmoralh@gmail.com; Villegas-Sepúlveda, Nicolás, E-mail: nvillega@cinvestav.mx

    We analyzed the effects of the expression of HPV 16 E6 oncoprotein variants (AA-a, AA-c, E-A176/G350, E-C188/G350, E-G350), and the E-Prototype in global gene expression profiles in an in vitro model. E6 gene was cloned into an expression vector fused to GFP and was transfected in C33-A cells. Affymetrix GeneChip Human Transcriptome Array 2.0 platform was used to analyze the expression of over 245,000 coding transcripts. We found that HPV16 E6 variants altered the expression of 387 different genes in comparison with E-Prototype. The altered genes are involved in cellular processes related to the development of cervical carcinoma, such asmore » adhesion, angiogenesis, apoptosis, differentiation, cell cycle, proliferation, transcription and protein translation. Our results show that polymorphic changes in HPV16 E6 natural variants are sufficient to alter the overall gene expression profile in C33-A cells, explaining in part the observed differences in oncogenic potential of HPV16 variants. - Highlights: • Amino acid changes in HPV16 E6 variants modulate the transciption of specific genes. • This is the first comparison of global gene expression profile of HPV 16 E6 variants. • Each HPV 16 E6 variant appears to have its own molecular signature.« less

  4. Global analysis of epigenetic regulation of gene expression in response to drought stress in Sorghum.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Reddy, Anireddy; Ben-Hur, Asa

    Abiotic stresses including drought are major limiting factors of crop yields and cause significant crop losses. Acquisition of stress tolerance to abiotic stresses requires coordinated regulation of a multitude of biochemical and physiological changes, and most of these changes depend on alterations in gene expression. The goal of this work is to perform global analysis of differential regulation of gene expression and alternative splicing, and their relationship with chromatin landscape in drought sensitive and tolerant cultivars. our Iso-Seq study revealed transcriptome-wide full-length isoforms at an unprecedented scale with over 11000 novel splice isoforms. Additionally, we uncovered alternative polyadenylation sites ofmore » ~11000 expressed genes and many novel genes. Overall, Iso-Seq results greatly enhanced sorghum gene annotations that are not only useful in analyzing all our RNA-seq, ChIP-seq and ATAC-seq data but also serve as a great resource to the plant biology community. Our studies identified differentially expressed genes and splicing events that are correlated with the drought-resistant phenotype. An association between alternative splicing and chromatin accessibility was also revealed. Several computational tools developed here (TAPIS and iDiffIR) have been made freely available to the research community in analyzing alternative splicing and differential alternative splicing.« less

  5. Biomedical Information Extraction: Mining Disease Associated Genes from Literature

    ERIC Educational Resources Information Center

    Huang, Zhong

    2014-01-01

    Disease associated gene discovery is a critical step to realize the future of personalized medicine. However empirical and clinical validation of disease associated genes are time consuming and expensive. In silico discovery of disease associated genes from literature is therefore becoming the first essential step for biomarker discovery to…

  6. Global Analysis of WRKY Genes and Their Response to Dehydration and Salt Stress in Soybean.

    PubMed

    Song, Hui; Wang, Pengfei; Hou, Lei; Zhao, Shuzhen; Zhao, Chuanzhi; Xia, Han; Li, Pengcheng; Zhang, Ye; Bian, Xiaotong; Wang, Xingjun

    2016-01-01

    WRKY proteins are plant specific transcription factors involved in various developmental and physiological processes, especially in biotic and abiotic stress resistance. Although previous studies suggested that WRKY proteins in soybean (Glycine max var. Williams 82) involved in both abiotic and biotic stress responses, the global information of WRKY proteins in the latest version of soybean genome (Wm82.a2v1) and their response to dehydration and salt stress have not been reported. In this study, we identified 176 GmWRKY proteins from soybean Wm82.a2v1 genome. These proteins could be classified into three groups, namely group I (32 proteins), group II (120 proteins), and group III (24 proteins). Our results showed that most GmWRKY genes were located on Chromosome 6, while chromosome 11, 12, and 20 contained the least number of this gene family. More GmWRKY genes were distributed on the ends of chromosomes to compare with other regions. The cis-acting elements analysis suggested that GmWRKY genes were transcriptionally regulated upon dehydration and salt stress. RNA-seq data analysis indicated that three GmWRKY genes responded negatively to dehydration, and 12 genes positively responded to salt stress at 1, 6, and 12 h, respectively. We confirmed by qRT-PCR that the expression of GmWRKY47 and GmWRKY 58 genes was decreased upon dehydration, and the expression of GmWRKY92, 144 and 165 genes was increased under salt treatment.

  7. Altered Global Gene Expression in First Trimester Placentas of Women Destined to Develop Preeclampsia

    PubMed Central

    Founds, Sandra A.; Conley, Yvette P.; Lyons-Weiler, James F.; Jeyabalan, Arun; Hogge, W. Allen; Conrad, Kirk P.

    2009-01-01

    Background Preeclampsia is a pregnancy-specific disorder that remains a leading cause of maternal, fetal and neonatal morbidity and mortality, and is associated with risk for future cardiovascular disease. There are no reliable predictors, specific preventative measures or treatments other than delivery. A widely-held view is that the antecedents of preeclampsia lie with impaired placentation in early pregnancy. Accordingly, we hypothesized dysregulation of global gene expression in first trimester placentas of women who later manifested preeclampsia. Methods Surplus chorionic villus sampling (CVS) tissues were collected at 10–12 weeks gestation in 160 patients with singleton fetuses. Four patients developed preeclampsia, and their banked CVS specimens were matched to 8 control samples from patients with unaffected pregnancies. Affymetrix HG-U133 Plus 2.0 GeneChips were utilized for microarray analysis. Naïve Bayes prediction modeling and pathway analysis were conducted. qRT-PCR examined three of the dysregulated genes. Results Thirty-six differentially expressed genes were identified in the preeclampsia placentas. qRT-PCR verified the microarray analysis. Thirty-one genes were down-regulated. Many were related to inflammation/immunoregulation and cell motility. Decidual gene dysregulation was prominent. No evidence was found for alterations in hypoxia and oxidative stress regulated genes. Conclusions To our knowledge, this is the first study to show dysregulation of gene expression in the early placentas of women ~6 months before developing preeclampsia, thereby reinforcing a placental origin of the disorder. We hypothesize that placentation in preeclampsia is compromised in the first trimester by maternal and fetal immune dysregulation, abnormal decidualization, or both, thereby impairing trophoblast invasion. Several of the genes provide potential targets for the development of clinical biomarkers in maternal blood during the first trimester. Supplementary

  8. Escherichia coli global gene expression in urine from women with urinary tract infection.

    PubMed

    Hagan, Erin C; Lloyd, Amanda L; Rasko, David A; Faerber, Gary J; Mobley, Harry L T

    2010-11-11

    Murine models of urinary tract infection (UTI) have provided substantial data identifying uropathogenic E. coli (UPEC) virulence factors and assessing their expression in vivo. However, it is unclear how gene expression in these animal models compares to UPEC gene expression during UTI in humans. To address this, we used a UPEC strain CFT073-specific microarray to measure global gene expression in eight E. coli isolates monitored directly from the urine of eight women presenting at a clinic with bacteriuria. The resulting gene expression profiles were compared to those of the same E. coli isolates cultured statically to exponential phase in pooled, sterilized human urine ex vivo. Known fitness factors, including iron acquisition and peptide transport systems, were highly expressed during human UTI and support a model in which UPEC replicates rapidly in vivo. While these findings were often consistent with previous data obtained from the murine UTI model, host-specific differences were observed. Most strikingly, expression of type 1 fimbrial genes, which are among the most highly expressed genes during murine experimental UTI and encode an essential virulence factor for this experimental model, was undetectable in six of the eight E. coli strains from women with UTI. Despite the lack of type 1 fimbrial expression in the urine samples, these E. coli isolates were generally capable of expressing type 1 fimbriae in vitro and highly upregulated fimA upon experimental murine infection. The findings presented here provide insight into the metabolic and pathogenic profile of UPEC in urine from women with UTI and represent the first transcriptome analysis for any pathogenic E. coli during a naturally occurring infection in humans.

  9. Analysis of Mining Terrain Deformation Characteristics with Deformation Information System

    NASA Astrophysics Data System (ADS)

    Blachowski, Jan; Milczarek, Wojciech; Grzempowski, Piotr

    2014-05-01

    Mapping and prediction of mining related deformations of the earth surface is an important measure for minimising threat to surface infrastructure, human population, the environment and safety of the mining operation itself arising from underground extraction of useful minerals. The number of methods and techniques used for monitoring and analysis of mining terrain deformations is wide and increasing with the development of geographical information technologies. These include for example: terrestrial geodetic measurements, global positioning systems, remote sensing, spatial interpolation, finite element method modelling, GIS based modelling, geological modelling, empirical modelling using the Knothe theory, artificial neural networks, fuzzy logic calculations and other. The aim of this paper is to introduce the concept of an integrated Deformation Information System (DIS) developed in geographic information systems environment for analysis and modelling of various spatial data related to mining activity and demonstrate its applications for mapping and visualising, as well as identifying possible mining terrain deformation areas with various spatial modelling methods. The DIS concept is based on connected modules that include: the spatial database - the core of the system, the spatial data collection module formed by: terrestrial, satellite and remote sensing measurements of the ground changes, the spatial data mining module for data discovery and extraction, the geological modelling module, the spatial data modeling module with data processing algorithms for spatio-temporal analysis and mapping of mining deformations and their characteristics (e.g. deformation parameters: tilt, curvature and horizontal strain), the multivariate spatial data classification module and the visualization module allowing two-dimensional interactive and static mapping and three-dimensional visualizations of mining ground characteristics. The Systems's functionality has been presented on

  10. Standardized Plant Disease Evaluations will Enhance Resistance Gene Discovery

    USDA-ARS?s Scientific Manuscript database

    Gene discovery and marker development using DNA based tools require plant populations with well-documented phenotypes. Related crops such as apples and pears may share a number of genes, for example resistance to common diseases, and data mining in one crop may reveal genes for the other. However, u...

  11. Mining for Nonribosomal Peptide Synthetase and Polyketide Synthase Genes Revealed a High Level of Diversity in the Sphagnum Bog Metagenome

    PubMed Central

    Müller, Christina A.; Oberauner-Wappis, Lisa; Peyman, Armin; Amos, Gregory C. A.; Wellington, Elizabeth M. H.

    2015-01-01

    Sphagnum bog ecosystems are among the oldest vegetation forms harboring a specific microbial community and are known to produce an exceptionally wide variety of bioactive substances. Although the Sphagnum metagenome shows a rich secondary metabolism, the genes have not yet been explored. To analyze nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs), the diversity of NRPS and PKS genes in Sphagnum-associated metagenomes was investigated by in silico data mining and sequence-based screening (PCR amplification of 9,500 fosmid clones). The in silico Illumina-based metagenomic approach resulted in the identification of 279 NRPSs and 346 PKSs, as well as 40 PKS-NRPS hybrid gene sequences. The occurrence of NRPS sequences was strongly dominated by the members of the Protebacteria phylum, especially by species of the Burkholderia genus, while PKS sequences were mainly affiliated with Actinobacteria. Thirteen novel NRPS-related sequences were identified by PCR amplification screening, displaying amino acid identities of 48% to 91% to annotated sequences of members of the phyla Proteobacteria, Actinobacteria, and Cyanobacteria. Some of the identified metagenomic clones showed the closest similarity to peptide synthases from Burkholderia or Lysobacter, which are emerging bacterial sources of as-yet-undescribed bioactive metabolites. This report highlights the role of the extreme natural ecosystems as a promising source for detection of secondary compounds and enzymes, serving as a source for biotechnological applications. PMID:26002894

  12. The role of wildlife (wild birds) in the global transmission of antimicrobial resistance genes

    PubMed Central

    Wang, Jing; Ma, Zhen-Bao; Zeng, Zhen-Ling; Yang, Xue-Wen; Huang, Ying; Liu, Jian-Hua

    2017-01-01

    Antimicrobial resistance is an urgent global health challenge in human and veterinary medicine. Wild animals are not directly exposed to clinically relevant antibiotics; however, antibacterial resistance in wild animals has been increasingly reported worldwide in parallel to the situation in human and veterinary medicine. This underlies the complexity of bacterial resistance in wild animals and the possible interspecies transmission between humans, domestic animals, the environment, and wildlife. This review summarizes the current data on expanded-spectrum β-lactamase (ESBL), AmpC β-lactamase, carbapenemase, and colistin resistance genes in Enterobacteriaceae isolates of wildlife origin. The aim of this review is to better understand the important role of wild animals as reservoirs and vectors in the global dissemination of crucial clinical antibacterial resistance. In this regard, continued surveillance is urgently needed worldwide.

  13. Knowledge Exchange between Poland and Vietnam in Mining and Geology - the Status Quo and Future Development

    NASA Astrophysics Data System (ADS)

    Nguyen, Nga; Pham, Nguyet

    2018-03-01

    From the beginning of the 21st century, knowledge exchange between Poland and Vietnam in mining and geology has been focusing in technology, education and training. Since years, Polish academic and commercial partners have been developing a close collaboration with Vietnam National Coal - Mineral Industries Holding Corporation Limited. Major outcomes of the collaboration are installations and operation of mining equipments and machines in Vietnamese mining companies, and excellent training programs for graduate and post graduate students and mining staff for both countries, etc. From aspects of knowledge management in globalization, the article highlights the outstanding outcomes of knowledge exchanges between the two countries, outlines cultural and economic challenges for the exchange and proposes some improvement in the future.

  14. Identification of Mouse Serum miRNA Endogenous References by Global Gene Expression Profiles

    PubMed Central

    Mi, Qing-Sheng; Weiland, Matthew; Qi, Rui-Qun; Gao, Xing-Hua; Poisson, Laila M.; Zhou, Li

    2012-01-01

    MicroRNAs (miRNAs) are recently discovered small non-coding RNAs and can serve as serum biomarkers for disease diagnosis and prognoses. Lack of reliable serum miRNA endogenous references for normalization in miRNA gene expression makes single miRNA assays inaccurate. Using TaqMan® real-time PCR miRNA arrays with a global gene expression normalization strategy, we have analyzed serum miRNA expression profiles of 20 female mice of NOD/ShiLtJ (n = 8), NOR/LtJ (n = 6), and C57BL/6J (n = 6) at different ages and disease conditions. We identified five miRNAs, miR-146a, miR-16, miR-195, miR-30e and miR-744, to be stably expressed in all strains, which could serve as mouse serum miRNA endogenous references for single assay experiments. PMID:22348064

  15. Trust Mines

    EPA Pesticide Factsheets

    The United States and the Navajo Nation entered into settlement agreements that provide funds to conduct investigations and any needed cleanup at 16 of the 46 priority mines, including six mines in the Northern Abandoned Uranium Mine Region.

  16. Mining microarray data at NCBI's Gene Expression Omnibus (GEO)*.

    PubMed

    Barrett, Tanya; Edgar, Ron

    2006-01-01

    The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.

  17. Global monitoring of autumn gene expression within and among phenotypically divergent populations of Sitka spruce (Picea sitchensis).

    PubMed

    Holliday, Jason A; Ralph, Steven G; White, Richard; Bohlmann, Jörg; Aitken, Sally N

    2008-01-01

    Cold acclimation in conifers is a complex process, the timing and extent of which reflects local adaptation and varies widely along latitudinal gradients for many temperate and boreal tree species. Despite their ecological and economic importance, little is known about the global changes in gene expression that accompany autumn cold acclimation in conifers. Using three populations of Sitka spruce (Picea sitchensis) spanning the species range, and a Picea cDNA microarray with 21,840 unique elements, within- and among-population gene expression was monitored during the autumn. Microarray data were validated for selected genes using real-time PCR. Similar numbers of genes were significantly twofold upregulated (1257) and downregulated (967) between late summer and early winter. Among those upregulated were dehydrins, pathogenesis-related/antifreeze genes, carbohydrate and lipid metabolism genes, and genes involved in signal transduction and transcriptional regulation. Among-population microarray hybridizations at early and late autumn time points revealed substantial variation in the autumn transcriptome, some of which may reflect local adaptation. These results demonstrate the complexity of cold acclimation in conifers, highlight similarities and differences to cold tolerance in annual plants, and provide a solid foundation for functional and genetic studies of this important adaptive process.

  18. The potential of text mining in data integration and network biology for plant research: a case study on Arabidopsis.

    PubMed

    Van Landeghem, Sofie; De Bodt, Stefanie; Drebert, Zuzanna J; Inzé, Dirk; Van de Peer, Yves

    2013-03-01

    Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies.

  19. Identification of a new gene regulatory circuit involving B cell receptor activated signaling using a combined analysis of experimental, clinical and global gene expression data

    PubMed Central

    Schrader, Alexandra; Meyer, Katharina; Walther, Neele; Stolz, Ailine; Feist, Maren; Hand, Elisabeth; von Bonin, Frederike; Evers, Maurits; Kohler, Christian; Shirneshan, Katayoon; Vockerodt, Martina; Klapper, Wolfram; Szczepanowski, Monika; Murray, Paul G.; Bastians, Holger; Trümper, Lorenz; Spang, Rainer; Kube, Dieter

    2016-01-01

    To discover new regulatory pathways in B lymphoma cells, we performed a combined analysis of experimental, clinical and global gene expression data. We identified a specific cluster of genes that was coherently expressed in primary lymphoma samples and suppressed by activation of the B cell receptor (BCR) through αIgM treatment of lymphoma cells in vitro. This gene cluster, which we called BCR.1, includes numerous cell cycle regulators. A reduced expression of BCR.1 genes after BCR activation was observed in different cell lines and also in CD10+ germinal center B cells. We found that BCR activation led to a delayed entry to and progression of mitosis and defects in metaphase. Cytogenetic changes were detected upon long-term αIgM treatment. Furthermore, an inverse correlation of BCR.1 genes with c-Myc co-regulated genes in distinct groups of lymphoma patients was observed. Finally, we showed that the BCR.1 index discriminates activated B cell-like and germinal centre B cell-like diffuse large B cell lymphoma supporting the functional relevance of this new regulatory circuit and the power of guided clustering for biomarker discovery. PMID:27166259

  20. Data Mining Approaches for Genomic Biomarker Development: Applications Using Drug Screening Data from the Cancer Genome Project and the Cancer Cell Line Encyclopedia.

    PubMed

    Covell, David G

    2015-01-01

    Developing reliable biomarkers of tumor cell drug sensitivity and resistance can guide hypothesis-driven basic science research and influence pre-therapy clinical decisions. A popular strategy for developing biomarkers uses characterizations of human tumor samples against a range of cancer drug responses that correlate with genomic change; developed largely from the efforts of the Cancer Cell Line Encyclopedia (CCLE) and Sanger Cancer Genome Project (CGP). The purpose of this study is to provide an independent analysis of this data that aims to vet existing and add novel perspectives to biomarker discoveries and applications. Existing and alternative data mining and statistical methods will be used to a) evaluate drug responses of compounds with similar mechanism of action (MOA), b) examine measures of gene expression (GE), copy number (CN) and mutation status (MUT) biomarkers, combined with gene set enrichment analysis (GSEA), for hypothesizing biological processes important for drug response, c) conduct global comparisons of GE, CN and MUT as biomarkers across all drugs screened in the CGP dataset, and d) assess the positive predictive power of CGP-derived GE biomarkers as predictors of drug response in CCLE tumor cells. The perspectives derived from individual and global examinations of GEs, MUTs and CNs confirm existing and reveal unique and shared roles for these biomarkers in tumor cell drug sensitivity and resistance. Applications of CGP-derived genomic biomarkers to predict the drug response of CCLE tumor cells finds a highly significant ROC, with a positive predictive power of 0.78. The results of this study expand the available data mining and analysis methods for genomic biomarker development and provide additional support for using biomarkers to guide hypothesis-driven basic science research and pre-therapy clinical decisions.

  1. Global prevalence and distribution of genes and microorganisms involved in mercury methylation

    DOE PAGES

    Podar, Mircea; Gilmour, C. C.; Brandt, Craig C.; ...

    2015-10-09

    Mercury methylation produces the neurotoxic, highly bioaccumulative methylmercury (MeHg). Recent identification of the methylation genes (hgcAB) provides the foundation for broadly evaluating microbial Hg-methylation potential in nature without making explicit rate measurements. We first queried hgcAB diversity and distribution in all available microbial metagenomes, encompassing most environments. The genes were found in nearly all anaerobic, but not in aerobic, environments including oxygenated layers of the open ocean. Critically, hgcAB was effectively absent in ~1500 human microbiomes, suggesting a low risk of endogenous MeHg production. New potential methylation habitats were identified, including invertebrate guts, thawing permafrost, coastal dead zones, soils, sediments,more » and extreme environments, suggesting multiple routes for MeHg entry into food webs. Several new taxonomic groups potentially capable of Hg-methylation emerged, including lineages having no cultured representatives. We then begin to address long-standing evolutionary questions about Hg-methylation and ancient carbon fixation mechanisms while generating a new global view of Hg-methylation potential.« less

  2. Study of formation of green eggshell color in ducks through global gene expression.

    PubMed

    Xu, Fa Qiong; Li, Ang; Lan, Jing Jing; Wang, Yue Ming; Yan, Mei Jiao; Lian, Sen Yang; Wu, Xu

    2018-01-01

    The green eggshell color produced by ducks is a threshold trait that can be influenced by various factors, such as hereditary, environment and nutrition. The aim of this study was to investigate the genetic regulation of the formation of eggs with green shells in Youxian ducks. We performed integrative analysis of mRNAs and miRNAs expression profiling in the shell gland samples from ducks by RNA-Seq. We found 124 differentially expressed genes that were associated with various pathways, such as the ATP-binding cassette (ABC) transporter and solute carrier supper family pathways. A total of 31 differentially expressed miRNAs were found between ducks laying green eggs and white eggs. KEGG pathway analysis of the predicted miRNA target genes also indicated the functional characteristics of these miRNAs; they were involved in the ABC transporter pathway and the solute carrier (SLC) supper family. Analysis with qRT-PCR was applied to validate the results of global gene expression, which showed a correlation between results obtained by RNA-seq and RT-qPCR. Moreover, a miRNA-mRNA interaction network was established using correlation analysis of differentially expressed mRNA and miRNA. Compared to ducks that lay white eggs, ducks that lay green eggs include six up-regulated miRNAs that had regulatory effects on 35 down-regulated genes, and seven down-regulated miRNAs which influenced 46 up-regulated genes. For example, the ABC transporter pathway could be regulated by expressing gga-miR-144-3p (up-regulated) with ABCG2 (up-regulated) and other miRNAs and genes. This study provides valuable information about mRNA and miRNA regulation in duck shell gland tissues, and provides foundational information for further study on the eggshell color formation and marker-assisted selection for Youxian duck breeding.

  3. In silico mining of putative microsatellite markers from whole genome sequence of water buffalo (Bubalus bubalis) and development of first BuffSatDB

    PubMed Central

    2013-01-01

    Background Though India has sequenced water buffalo genome but its draft assembly is based on cattle genome BTau 4.0, thus de novo chromosome wise assembly is a major pending issue for global community. The existing radiation hybrid of buffalo and these reported STR can be used further in final gap plugging and “finishing” expected in de novo genome assembly. QTL and gene mapping needs mining of putative STR from buffalo genome at equal interval on each and every chromosome. Such markers have potential role in improvement of desirable characteristics, such as high milk yields, resistance to diseases, high growth rate. The STR mining from whole genome and development of user friendly database is yet to be done to reap the benefit of whole genome sequence. Description By in silico microsatellite mining of whole genome, we have developed first STR database of water buffalo, BuffSatDb (Buffalo MicroSatellite Database (http://cabindb.iasri.res.in/buffsatdb/) which is a web based relational database of 910529 microsatellite markers, developed using PHP and MySQL database. Microsatellite markers have been generated using MIcroSAtellite tool. It is simple and systematic web based search for customised retrieval of chromosome wise and genome-wide microsatellites. Search has been enabled based on chromosomes, motif type (mono-hexa), repeat motif and repeat kind (simple and composite). The search may be customised by limiting location of STR on chromosome as well as number of markers in that range. This is a novel approach and not been implemented in any of the existing marker database. This database has been further appended with Primer3 for primer designing of the selected markers enabling researcher to select markers of choice at desired interval over the chromosome. The unique add-on of degenerate bases further helps in resolving presence of degenerate bases in current buffalo assembly. Conclusion Being first buffalo STR database in the world , this would not only

  4. In silico mining of putative microsatellite markers from whole genome sequence of water buffalo (Bubalus bubalis) and development of first BuffSatDB.

    PubMed

    Sarika; Arora, Vasu; Iquebal, Mir Asif; Rai, Anil; Kumar, Dinesh

    2013-01-19

    Though India has sequenced water buffalo genome but its draft assembly is based on cattle genome BTau 4.0, thus de novo chromosome wise assembly is a major pending issue for global community. The existing radiation hybrid of buffalo and these reported STR can be used further in final gap plugging and "finishing" expected in de novo genome assembly. QTL and gene mapping needs mining of putative STR from buffalo genome at equal interval on each and every chromosome. Such markers have potential role in improvement of desirable characteristics, such as high milk yields, resistance to diseases, high growth rate. The STR mining from whole genome and development of user friendly database is yet to be done to reap the benefit of whole genome sequence. By in silico microsatellite mining of whole genome, we have developed first STR database of water buffalo, BuffSatDb (Buffalo MicroSatellite Database (http://cabindb.iasri.res.in/buffsatdb/) which is a web based relational database of 910529 microsatellite markers, developed using PHP and MySQL database. Microsatellite markers have been generated using MIcroSAtellite tool. It is simple and systematic web based search for customised retrieval of chromosome wise and genome-wide microsatellites. Search has been enabled based on chromosomes, motif type (mono-hexa), repeat motif and repeat kind (simple and composite). The search may be customised by limiting location of STR on chromosome as well as number of markers in that range. This is a novel approach and not been implemented in any of the existing marker database. This database has been further appended with Primer3 for primer designing of the selected markers enabling researcher to select markers of choice at desired interval over the chromosome. The unique add-on of degenerate bases further helps in resolving presence of degenerate bases in current buffalo assembly. Being first buffalo STR database in the world , this would not only pave the way in resolving current

  5. Global gene expression profiling of brown to white adipose tissue transformation in sheep reveals novel transcriptional components linked to adipose remodeling.

    PubMed

    Basse, Astrid L; Dixen, Karen; Yadav, Rachita; Tygesen, Malin P; Qvortrup, Klaus; Kristiansen, Karsten; Quistorff, Bjørn; Gupta, Ramneek; Wang, Jun; Hansen, Jacob B

    2015-03-19

    Large mammals are capable of thermoregulation shortly after birth due to the presence of brown adipose tissue (BAT). The majority of BAT disappears after birth and is replaced by white adipose tissue (WAT). We analyzed the postnatal transformation of adipose in sheep with a time course study of the perirenal adipose depot. We observed changes in tissue morphology, gene expression and metabolism within the first two weeks of postnatal life consistent with the expected transition from BAT to WAT. The transformation was characterized by massively decreased mitochondrial abundance and down-regulation of gene expression related to mitochondrial function and oxidative phosphorylation. Global gene expression profiling demonstrated that the time points grouped into three phases: a brown adipose phase, a transition phase and a white adipose phase. Between the brown adipose and the transition phase 170 genes were differentially expressed, and 717 genes were differentially expressed between the transition and the white adipose phase. Thirty-eight genes were shared among the two sets of differentially expressed genes. We identified a number of regulated transcription factors, including NR1H3, MYC, KLF4, ESR1, RELA and BCL6, which were linked to the overall changes in gene expression during the adipose tissue remodeling. Finally, the perirenal adipose tissue expressed both brown and brite/beige adipocyte marker genes at birth, the expression of which changed substantially over time. Using global gene expression profiling of the postnatal BAT to WAT transformation in sheep, we provide novel insight into adipose tissue plasticity in a large mammal, including identification of novel transcriptional components linked to adipose tissue remodeling. Moreover, our data set provides a useful resource for further studies in adipose tissue plasticity.

  6. The Ocean Gene Atlas: exploring the biogeography of plankton genes online.

    PubMed

    Villar, Emilie; Vannier, Thomas; Vernette, Caroline; Lescot, Magali; Cuenca, Miguelangel; Alexandre, Aurélien; Bachelerie, Paul; Rosnet, Thomas; Pelletier, Eric; Sunagawa, Shinichi; Hingamp, Pascal

    2018-05-21

    The Ocean Gene Atlas is a web service to explore the biogeography of genes from marine planktonic organisms. It allows users to query protein or nucleotide sequences against global ocean reference gene catalogs. With just one click, the abundance and location of target sequences are visualized on world maps as well as their taxonomic distribution. Interactive results panels allow for adjusting cutoffs for alignment quality and displaying the abundances of genes in the context of environmental features (temperature, nutrients, etc.) measured at the time of sampling. The ease of use enables non-bioinformaticians to explore quantitative and contextualized information on genes of interest in the global ocean ecosystem. Currently the Ocean Gene Atlas is deployed with (i) the Ocean Microbial Reference Gene Catalog (OM-RGC) comprising 40 million non-redundant mostly prokaryotic gene sequences associated with both Tara Oceans and Global Ocean Sampling (GOS) gene abundances and (ii) the Marine Atlas of Tara Ocean Unigenes (MATOU) composed of >116 million eukaryote unigenes. Additional datasets will be added upon availability of further marine environmental datasets that provide the required complement of sequence assemblies, raw reads and contextual environmental parameters. Ocean Gene Atlas is a freely-available web service at: http://tara-oceans.mio.osupytheas.fr/ocean-gene-atlas/.

  7. An open-source framework for large-scale, flexible evaluation of biomedical text mining systems.

    PubMed

    Baumgartner, William A; Cohen, K Bretonnel; Hunter, Lawrence

    2008-01-29

    Improved evaluation methodologies have been identified as a necessary prerequisite to the improvement of text mining theory and practice. This paper presents a publicly available framework that facilitates thorough, structured, and large-scale evaluations of text mining technologies. The extensibility of this framework and its ability to uncover system-wide characteristics by analyzing component parts as well as its usefulness for facilitating third-party application integration are demonstrated through examples in the biomedical domain. Our evaluation framework was assembled using the Unstructured Information Management Architecture. It was used to analyze a set of gene mention identification systems involving 225 combinations of system, evaluation corpus, and correctness measure. Interactions between all three were found to affect the relative rankings of the systems. A second experiment evaluated gene normalization system performance using as input 4,097 combinations of gene mention systems and gene mention system-combining strategies. Gene mention system recall is shown to affect gene normalization system performance much more than does gene mention system precision, and high gene normalization performance is shown to be achievable with remarkably low levels of gene mention system precision. The software presented in this paper demonstrates the potential for novel discovery resulting from the structured evaluation of biomedical language processing systems, as well as the usefulness of such an evaluation framework for promoting collaboration between developers of biomedical language processing technologies. The code base is available as part of the BioNLP UIMA Component Repository on SourceForge.net.

  8. Assessment of global and gene-specific DNA methylation in rat liver and kidney in response to non-genotoxic carcinogen exposure

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ozden, Sibel, E-mail: stopuz@istanbul.edu.tr; Turgut Kara, Neslihan; Sezerman, Osman Ugur

    Altered expression of tumor suppressor genes and oncogenes, which is regulated in part at the level of DNA methylation, is an important event involved in non-genotoxic carcinogenesis. This may serve as a marker for early detection of non-genotoxic carcinogens. Therefore, we evaluated the effects of non-genotoxic hepatocarcinogens, 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), hexachlorobenzene (HCB), methapyrilene (MPY) and male rat kidney carcinogens, d-limonene, p-dichlorobenzene (DCB), chloroform and ochratoxin A (OTA) on global and CpG island promoter methylation in their respective target tissues in rats. No significant dose-related effects on global DNA hypomethylation were observed in tissues of rats compared to vehicle controls using LC–MS/MSmore » in response to short-term non-genotoxic carcinogen exposure. Initial experiments investigating gene-specific methylation using methylation-specific PCR and bisulfite sequencing, revealed partial methylation of p16 in the liver of rats treated with HCB and TCDD. However, no treatment related effects on the methylation status of Cx32, e-cadherin, VHL, c-myc, Igfbp2, and p15 were observed. We therefore applied genome-wide DNA methylation analysis using methylated DNA immunoprecipitation combined with microarrays to identify alterations in gene-specific methylation. Under the conditions of our study, some genes were differentially methylated in response to MPY and TCDD, whereas d-limonene, DCB and chloroform did not induce any methylation changes. 90-day OTA treatment revealed enrichment of several categories of genes important in protein kinase activity and mTOR cell signaling process which are related to OTA nephrocarcinogenicity. - Highlights: • Studied non-genotoxic carcinogens caused no change on global DNA hypomethylation. • d-Limonene, DCB and chloroform did not show any genome-wide methylation changes. • Some genes were differentially methylated in response to MPY, TCDD and OTA. • Protein kinase

  9. Biomarkers of metals exposure in fish from lead-zinc mining areas of Southeastern Missouri, USA

    USGS Publications Warehouse

    Schmitt, C.J.; Whyte, J.J.; Roberts, A.P.; Annis, M.L.; May, T.W.; Tillitt, D.E.

    2007-01-01

    The potential effects of proposed lead-zinc mining in an ecologically sensitive area were assessed by studying a nearby mining district that has been exploited for about 30 y under contemporary environmental regulations and with modern technology. Blood and liver samples representing fish of three species (largescale stoneroller, Campostoma oligolepis, n=91; longear sunfish, Lepomis megalotis, n=105; and northern hog sucker, Hypentelium nigricans, n=20) from 16 sites representing a range of conditions relative to mining activities were collected. Samples were analyzed for metals (also reported in a companion paper) and for biomarkers of metals exposure [erythrocyte ??-aminolevulinic acid dehydratase (ALA-D) activity; concentrations of zinc protoporphyrin (ZPP), iron, and hemoglobin (Hb) in blood; and hepatic metallothionein (MT) gene expression and lipid peroxidation]. Blood lead concentrations were significantly higher and ALA-D activity significantly lower in all species at sites nearest to active lead-zinc mines and in a stream contaminated by historical mining than at reference or downstream sites. ALA-D activity was also negatively correlated with blood lead concentrations in all three species but not with other metals. Iron and Hb concentrations were positively correlated in all three species, but were not correlated with any other metals in blood or liver in any species. MT gene expression was positively correlated with liver zinc concentrations, but neither MT nor lipid peroxidase differences among fish grouped according to lead concentrations were statistically significant. ZPP was not detected by hematofluorometry in most fish, but fish with detectable ZPP were from sites affected by mining. Collectively, these results confirm that metals are released to streams from active lead-zinc mining sites and are accumulated by fish. ?? 2007 Elsevier Inc. All rights reserved.

  10. Determinants of Interest Rates on Corporate Bonds of Mining Enterprises

    NASA Astrophysics Data System (ADS)

    Ranosz, Robert

    2017-09-01

    This article is devoted to the determinants of interest rates on corporate bonds of mining enterprises. The study includes a comparison between the cost of foreign capital as resulting from the issue of debt instruments in different sectors of the economy in relation to the mining industry. The article also depicts the correlation between the rating scores published by the three largest rating agencies: S&P, Moody's, and Fitch. The test was based on simple statistical methods. The analysis performed indicated that there is a dependency between the factors listed and the amount of interest rates on corporate bonds of global mining enterprises. Most significant factors include the rating level and the period for which the given series of bonds was issued. Additionally, it is not without significance whether the given bond has additional options. Pursuant to the obtained results, is should be recognized that in order to reduce the interest rate on bonds, mining enterprises should pay particular attention to the rating and attempt to include additional options in issued bonds. Such additional options may comprise, for example, an ability to exchange bonds to shares or raw materials.

  11. Global Deletion of TSPO Does Not Affect the Viability and Gene Expression Profile

    PubMed Central

    Wang, Huaishan; Yang, Jia; Yang, Qi; Fu, Yi; Hu, Yu; Liu, Fang; Wang, Weiqing; Cui, Lianxian; Chen, Hui; Zhang, Jianmin; He, Wei

    2016-01-01

    Translocator Protein (18kDa, TSPO) is a mitochondrial outer membrane transmembrane protein. Its expression is elevated during inflammation and injury. However, the function of TSPO in vivo is still controversial. Here, we constructed a TSPO global knockout (KO) mouse with a Cre-LoxP system that abolished TSPO protein expression in all tissues and showed normal phenotypes in the physiological condition. The birth rates of TSPO heterozygote (Het) x Het or KO x KO breeding were consistent with Mendel’s Law, suggesting a normal viability of TSPO KO mice at birth. RNA-seq analysis showed no significant difference in the gene expression profile of lung tissues from TSPO KO mice compared with wild type mice, including the genes associated with bronchial alveoli immune homeostasis. The alveolar macrophage population was not affected by TSPO deletion in the physiological condition. Our findings contradict the results of Papadopoulos, but confirmed Selvaraj’s findings. This study confirms TSPO deficiency does not affect viability and bronchial alveolar immune homeostasis. PMID:27907096

  12. Global Expression Profiling of Low Temperature Induced Genes in the Chilling Tolerant Japonica Rice Jumli Marshi

    PubMed Central

    Chawade, Aakash; Lindlöf, Angelica; Olsson, Björn; Olsson, Olof

    2013-01-01

    Low temperature is a key factor that limits growth and productivity of many important agronomical crops worldwide. Rice (Oryza sativa L.) is negatively affected already at temperatures below +10°C and is therefore denoted as chilling sensitive. However, chilling tolerant rice cultivars exist and can be commercially cultivated at altitudes up to 3,050 meters with temperatures reaching as low as +4°C. In this work, the global transcriptional response to cold stress (+4°C) was studied in the Nepalese highland variety Jumli Marshi (spp. japonica) and 4,636 genes were identified as significantly differentially expressed within 24 hours of cold stress. Comparison with previously published microarray data from one chilling tolerant and two sensitive rice cultivars identified 182 genes differentially expressed (DE) upon cold stress in all four rice cultivars and 511 genes DE only in the chilling tolerant rice. Promoter analysis of the 182 genes suggests a complex cross-talk between ABRE and CBF regulons. Promoter analysis of the 511 genes identified over-represented ABRE motifs but not DRE motifs, suggesting a role for ABA signaling in cold tolerance. Moreover, 2,101 genes were DE in Jumli Marshi alone. By chromosomal localization analysis, 473 of these cold responsive genes were located within 13 different QTLs previously identified as cold associated. PMID:24349120

  13. Global analysis of gene expression profiles in developing physic nut (Jatropha curcas L.) seeds.

    PubMed

    Jiang, Huawu; Wu, Pingzhi; Zhang, Sheng; Song, Chi; Chen, Yaping; Li, Meiru; Jia, Yongxia; Fang, Xiaohua; Chen, Fan; Wu, Guojiang

    2012-01-01

    Physic nut (Jatropha curcas L.) is an oilseed plant species with high potential utility as a biofuel. Furthermore, following recent sequencing of its genome and the availability of expressed sequence tag (EST) libraries, it is a valuable model plant for studying carbon assimilation in endosperms of oilseed plants. There have been several transcriptomic analyses of developing physic nut seeds using ESTs, but they have provided limited information on the accumulation of stored resources in the seeds. We applied next-generation Illumina sequencing technology to analyze global gene expression profiles of developing physic nut seeds 14, 19, 25, 29, 35, 41, and 45 days after pollination (DAP). The acquired profiles reveal the key genes, and their expression timeframes, involved in major metabolic processes including: carbon flow, starch metabolism, and synthesis of storage lipids and proteins in the developing seeds. The main period of storage reserves synthesis in the seeds appears to be 29-41 DAP, and the fatty acid composition of the developing seeds is consistent with relative expression levels of different isoforms of acyl-ACP thioesterase and fatty acid desaturase genes. Several transcription factor genes whose expression coincides with storage reserve deposition correspond to those known to regulate the process in Arabidopsis. The results will facilitate searches for genes that influence de novo lipid synthesis, accumulation and their regulatory networks in developing physic nut seeds, and other oil seeds. Thus, they will be helpful in attempts to modify these plants for efficient biofuel production.

  14. HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease.

    PubMed

    Ward, Lucas D; Kellis, Manolis

    2016-01-04

    More than 90% of common variants associated with complex traits do not affect proteins directly, but instead the circuits that control gene expression. This has increased the urgency of understanding the regulatory genome as a key component for translating genetic results into mechanistic insights and ultimately therapeutics. To address this challenge, we developed HaploReg (http://compbio.mit.edu/HaploReg) to aid the functional dissection of genome-wide association study (GWAS) results, the prediction of putative causal variants in haplotype blocks, the prediction of likely cell types of action, and the prediction of candidate target genes by systematic mining of comparative, epigenomic and regulatory annotations. Since first launching the website in 2011, we have greatly expanded HaploReg, increasing the number of chromatin state maps to 127 reference epigenomes from ENCODE 2012 and Roadmap Epigenomics, incorporating regulator binding data, expanding regulatory motif disruption annotations, and integrating expression quantitative trait locus (eQTL) variants and their tissue-specific target genes from GTEx, Geuvadis, and other recent studies. We present these updates as HaploReg v4, and illustrate a use case of HaploReg for attention deficit hyperactivity disorder (ADHD)-associated SNPs with putative brain regulatory mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Data Mining.

    ERIC Educational Resources Information Center

    Benoit, Gerald

    2002-01-01

    Discusses data mining (DM) and knowledge discovery in databases (KDD), taking the view that KDD is the larger view of the entire process, with DM emphasizing the cleaning, warehousing, mining, and visualization of knowledge discovery in databases. Highlights include algorithms; users; the Internet; text mining; and information extraction.…

  16. Mechanisms of crystalline silica-induced pulmonary toxicity revealed by global gene expression profiling

    PubMed Central

    Sellamuthu, Rajendran; Umbright, Christina; Li, Shengqiao; Kashon, Michael; Joseph, Pius

    2015-01-01

    A proper understanding of the mechanisms underlying crystalline silica-induced pulmonary toxicity has implications in the management and potential prevention of the adverse health effects associated with silica exposure including silicosis, cancer and several auto-immune diseases. Human lung type II epithelial cells and rat lungs exposed to crystalline silica were employed as experimental models to determine global gene expression changes in order to understand the molecular mechanisms underlying silica-induced pulmonary toxicity. The differential gene expression profile induced by silica correlated with its toxicity in the A549 cells. The biological processes perturbed by silica exposure in the A549 cells and rat lungs, as identified by the bioinformatics analysis of the differentially expressed genes, demonstrated significant similarity. Functional categorization of the differentially expressed genes identified cancer, cellular movement, cellular growth and proliferation, cell death, inflammatory response, cell cycle, cellular development, and genetic disorder as top ranking biological functions perturbed by silica exposure in A549 cells and rat lungs. Results of our study, in addition to confirming several previously identified molecular targets and mechanisms involved in silica toxicity, identified novel molecular targets and mechanisms potentially involved in silica-induced pulmonary toxicity. Further investigations, including those focused on the novel molecular targets and mechanisms identified in the current study may result in better management and, possibly, reduction and/or prevention of the potential adverse health effects associated with crystalline silica exposure. PMID:22087542

  17. Machine Learning for Detecting Gene-Gene Interactions

    PubMed Central

    McKinney, Brett A.; Reif, David M.; Ritchie, Marylyn D.; Moore, Jason H.

    2011-01-01

    Complex interactions among genes and environmental factors are known to play a role in common human disease aetiology. There is a growing body of evidence to suggest that complex interactions are ‘the norm’ and, rather than amounting to a small perturbation to classical Mendelian genetics, interactions may be the predominant effect. Traditional statistical methods are not well suited for detecting such interactions, especially when the data are high dimensional (many attributes or independent variables) or when interactions occur between more than two polymorphisms. In this review, we discuss machine-learning models and algorithms for identifying and characterising susceptibility genes in common, complex, multifactorial human diseases. We focus on the following machine-learning methods that have been used to detect gene-gene interactions: neural networks, cellular automata, random forests, and multifactor dimensionality reduction. We conclude with some ideas about how these methods and others can be integrated into a comprehensive and flexible framework for data mining and knowledge discovery in human genetics. PMID:16722772

  18. 30 CFR 49.4 - Alternative mine rescue capability for special mining conditions.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 30 Mineral Resources 1 2014-07-01 2014-07-01 false Alternative mine rescue capability for special mining conditions. 49.4 Section 49.4 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Metal and...

  19. 30 CFR 49.4 - Alternative mine rescue capability for special mining conditions.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 30 Mineral Resources 1 2012-07-01 2012-07-01 false Alternative mine rescue capability for special mining conditions. 49.4 Section 49.4 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Metal and...

  20. 30 CFR 49.4 - Alternative mine rescue capability for special mining conditions.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 30 Mineral Resources 1 2013-07-01 2013-07-01 false Alternative mine rescue capability for special mining conditions. 49.4 Section 49.4 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Metal and...

  1. Global transgenerational gene expression dynamics in two newly synthesized allohexaploid wheat (Triticum aestivum) lines

    PubMed Central

    2012-01-01

    genes showing non-additive expression exhibited a significant enrichment for vesicle-function. Conclusions Our results show that two patterns of global alteration in gene expression are conditioned by allohexaploidization in wheat, that is, parental dominance expression and non-additive expression. Both altered patterns of gene expression but not the identity of the genes involved are likely to play functional roles in stabilization and establishment of the newly formed allohexaploid plants, and hence, relevant to speciation and evolution of T. aestivum. PMID:22277161

  2. 30 CFR 49.4 - Alternative mine rescue capability for special mining conditions.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Alternative mine rescue capability for special mining conditions. 49.4 Section 49.4 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS § 49.4 Alternative mine rescue capability for...

  3. 30 CFR 49.4 - Alternative mine rescue capability for special mining conditions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 1 2011-07-01 2011-07-01 false Alternative mine rescue capability for special mining conditions. 49.4 Section 49.4 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS § 49.4 Alternative mine rescue capability for...

  4. GoGene: gene annotation in the fast lane.

    PubMed

    Plake, Conrad; Royer, Loic; Winnenburg, Rainer; Hakenberg, Jörg; Schroeder, Michael

    2009-07-01

    High-throughput screens such as microarrays and RNAi screens produce huge amounts of data. They typically result in hundreds of genes, which are often further explored and clustered via enriched GeneOntology terms. The strength of such analyses is that they build on high-quality manual annotations provided with the GeneOntology. However, the weakness is that annotations are restricted to process, function and location and that they do not cover all known genes in model organisms. GoGene addresses this weakness by complementing high-quality manual annotation with high-throughput text mining extracting co-occurrences of genes and ontology terms from literature. GoGene contains over 4,000,000 associations between genes and gene-related terms for 10 model organisms extracted from more than 18,000,000 PubMed entries. It does not cover only process, function and location of genes, but also biomedical categories such as diseases, compounds, techniques and mutations. By bringing it all together, GoGene provides the most recent and most complete facts about genes and can rank them according to novelty and importance. GoGene accepts keywords, gene lists, gene sequences and protein sequences as input and supports search for genes in PubMed, EntrezGene and via BLAST. Since all associations of genes to terms are supported by evidence in the literature, the results are transparent and can be verified by the user. GoGene is available at http://gopubmed.org/gogene.

  5. Arabidopsis Gene Family Profiler (aGFP)--user-oriented transcriptomic database with easy-to-use graphic interface.

    PubMed

    Dupl'áková, Nikoleta; Renák, David; Hovanec, Patrik; Honysová, Barbora; Twell, David; Honys, David

    2007-07-23

    Microarray technologies now belong to the standard functional genomics toolbox and have undergone massive development leading to increased genome coverage, accuracy and reliability. The number of experiments exploiting microarray technology has markedly increased in recent years. In parallel with the rapid accumulation of transcriptomic data, on-line analysis tools are being introduced to simplify their use. Global statistical data analysis methods contribute to the development of overall concepts about gene expression patterns and to query and compose working hypotheses. More recently, these applications are being supplemented with more specialized products offering visualization and specific data mining tools. We present a curated gene family-oriented gene expression database, Arabidopsis Gene Family Profiler (aGFP; http://agfp.ueb.cas.cz), which gives the user access to a large collection of normalised Affymetrix ATH1 microarray datasets. The database currently contains NASC Array and AtGenExpress transcriptomic datasets for various tissues at different developmental stages of wild type plants gathered from nearly 350 gene chips. The Arabidopsis GFP database has been designed as an easy-to-use tool for users needing an easily accessible resource for expression data of single genes, pre-defined gene families or custom gene sets, with the further possibility of keyword search. Arabidopsis Gene Family Profiler presents a user-friendly web interface using both graphic and text output. Data are stored at the MySQL server and individual queries are created in PHP script. The most distinguishable features of Arabidopsis Gene Family Profiler database are: 1) the presentation of normalized datasets (Affymetrix MAS algorithm and calculation of model-based gene-expression values based on the Perfect Match-only model); 2) the choice between two different normalization algorithms (Affymetrix MAS4 or MAS5 algorithms); 3) an intuitive interface; 4) an interactive "virtual

  6. PREVENTION OF ACID MINE DRAINAGE GENERATION FROM OPEN-PIT MINE HIGHWALLS

    EPA Science Inventory



    Exposed, open pit mine highwalls contribute significantly to the production of acid mine

    drainage (AMD) thus causing environmental concerns upon closure of an operating mine. Available information on the generation of AMD from open-pit mine highwalls is very limit...

  7. 30 CFR 780.27 - Reclamation plan: Surface mining near underground mining.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 30 Mineral Resources 3 2011-07-01 2011-07-01 false Reclamation plan: Surface mining near underground mining. 780.27 Section 780.27 Mineral Resources OFFICE OF SURFACE MINING RECLAMATION AND ENFORCEMENT, DEPARTMENT OF THE INTERIOR SURFACE COAL MINING AND RECLAMATION OPERATIONS PERMITS AND COAL...

  8. Orapa Diamond Mine, Botswana

    NASA Image and Video Library

    2015-11-16

    This image from NASA Terra spacecraft shows the Orapa diamond mine, the world largest diamond mine by area. The mine is located in Botswana. It is the oldest of four mines operated by the same company, having begun operations in 1971. Orapa is an open pit style of mine, located on two kimberlite pipes. Currently, the Orapa mine annually produces approximately 11 million carats (2200 kg) of diamonds. The Letlhakane diamond mine is also an open pit construction. In 2003, the Letlhakane mine produced 1.06 million carats of diamonds. The Damtshaa diamond mine is the newest of four mines, located on top of four distinct kimberlite pipes of varying ore grade. The mine is forecast to produce about 5 million carats of diamond over the projected 31 year life of the mine. The image was acquired October 5, 2014, covers an area of 28 by 45 km, and is located at 21.3 degrees south, 25.4 degrees east. http://photojournal.jpl.nasa.gov/catalog/PIA20104

  9. Controls on bacterial and archaeal community structure and greenhouse gas production in natural, mined, and restored Canadian peatlands

    PubMed Central

    Basiliko, Nathan; Henry, Kevin; Gupta, Varun; Moore, Tim R.; Driscoll, Brian T.; Dunfield, Peter F.

    2013-01-01

    Northern peatlands are important global C reservoirs, largely because of their slow rates of microbial C mineralization. Particularly in sites that are heavily influenced by anthropogenic disturbances, there is scant information about microbial ecology and whether or not microbial community structure influences greenhouse gas production. This work characterized communities of bacteria and archaea using terminal restriction fragment length polymorphism (T-RFLP) and sequence analysis of 16S rRNA and functional genes across eight natural, mined, or restored peatlands in two locations in eastern Canada. Correlations were explored among chemical properties of peat, bacterial and archaeal community structure, and carbon dioxide (CO2) and methane (CH4) production rates under oxic and anoxic conditions. Bacteria and archaea similar to those found in other peat soil environments were detected. In contrast to other reports, methanogen diversity was low in our study, with only 2 groups of known or suspected methanogens. Although mining and restoration affected substrate availability and microbial activity, these land-uses did not consistently affect bacterial or archaeal community composition. In fact, larger differences were observed between the two locations and between oxic and anoxic peat samples than between natural, mined, and restored sites, with anoxic samples characterized by less detectable bacterial diversity and stronger dominance by members of the phylum Acidobacteria. There were also no apparent strong linkages between prokaryote community structure and CH4 or CO2 production, suggesting that different organisms exhibit functional redundancy and/or that the same taxa function at very different rates when exposed to different peat substrates. In contrast to other earlier work focusing on fungal communities across similar mined and restored peatlands, bacterial and archaeal communities appeared to be more resistant or resilient to peat substrate changes brought

  10. A Global Survey of Deep Underground Facilities; Examples of Geotechnical and Engineering Capabilities, Achievements, Challenges (Mines, Shafts, Tunnels, Boreholes, Sites and Underground Facilities for Nuclear Waste and Physics R&D): A Guide to Interactive Global Map Layers, Table Database, References and Notes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tynan, Mark C.; Russell, Glenn P.; Perry, Frank V.

    These associated tables, references, notes, and report present a synthesis of some notable geotechnical and engineering information used to create four interactive layer maps for selected: 1) deep mines and shafts; 2) existing, considered or planned radioactive waste management deep underground studies or disposal facilities 3) deep large diameter boreholes, and 4) physics underground laboratories and facilities from around the world. These data are intended to facilitate user access to basic information and references regarding “deep underground” facilities, history, activities, and plans. In general, the interactive maps and database provide each facility’s approximate site location, geology, and engineered features (e.g.:more » access, geometry, depth, diameter, year of operations, groundwater, lithology, host unit name and age, basin; operator, management organization, geographic data, nearby cultural features, other). Although the survey is not comprehensive, it is representative of many of the significant existing and historical underground facilities discussed in the literature addressing radioactive waste management and deep mined geologic disposal safety systems. The global survey is intended to support and to inform: 1) interested parties and decision makers; 2) radioactive waste disposal and siting option evaluations, and 3) safety case development applicable to any mined geologic disposal facility as a demonstration of historical and current engineering and geotechnical capabilities available for use in deep underground facility siting, planning, construction, operations and monitoring.« less

  11. Text Mining.

    ERIC Educational Resources Information Center

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  12. Surface mining

    Treesearch

    Robert Leopold; Bruce Rowland; Reed Stalder

    1979-01-01

    The surface mining process consists of four phases: (1) exploration; (2) development; (3) production; and (4) reclamation. A variety of surface mining methods has been developed, including strip mining, auger, area strip, open pit, dredging, and hydraulic. Sound planning and design techniques are essential to implement alternatives to meet the myriad of laws,...

  13. 30 CFR 780.27 - Reclamation plan: Surface mining near underground mining.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... RECLAMATION AND OPERATION PLAN § 780.27 Reclamation plan: Surface mining near underground mining. For surface... 30 Mineral Resources 3 2010-07-01 2010-07-01 false Reclamation plan: Surface mining near... ENFORCEMENT, DEPARTMENT OF THE INTERIOR SURFACE COAL MINING AND RECLAMATION OPERATIONS PERMITS AND COAL...

  14. International SUSMIN-project aims at sustainable gold mining in EU

    NASA Astrophysics Data System (ADS)

    Backnäs, Soile; Neitola, Raisa; Turunen, Kaisa; Lima, Alexandre; Fiúza, António; Szlachta, Malgorzata; Wójtowicz, Patryk; Maftei, Raluca; Munteanu, Marian; Alakangas, Lena; Baciu, Calin; Fernández, Dámaris

    2015-04-01

    research partners from six EU member states Finland, Sweden, Portugal, Romania, Poland and Ireland. Additionally eight globally on mining industry working industry partners will contribute in the SUSMIN consortium, so implementation of results from the project will translate into direct and significant economic benefits.

  15. Mining for Nonribosomal Peptide Synthetase and Polyketide Synthase Genes Revealed a High Level of Diversity in the Sphagnum Bog Metagenome.

    PubMed

    Müller, Christina A; Oberauner-Wappis, Lisa; Peyman, Armin; Amos, Gregory C A; Wellington, Elizabeth M H; Berg, Gabriele

    2015-08-01

    Sphagnum bog ecosystems are among the oldest vegetation forms harboring a specific microbial community and are known to produce an exceptionally wide variety of bioactive substances. Although the Sphagnum metagenome shows a rich secondary metabolism, the genes have not yet been explored. To analyze nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs), the diversity of NRPS and PKS genes in Sphagnum-associated metagenomes was investigated by in silico data mining and sequence-based screening (PCR amplification of 9,500 fosmid clones). The in silico Illumina-based metagenomic approach resulted in the identification of 279 NRPSs and 346 PKSs, as well as 40 PKS-NRPS hybrid gene sequences. The occurrence of NRPS sequences was strongly dominated by the members of the Protebacteria phylum, especially by species of the Burkholderia genus, while PKS sequences were mainly affiliated with Actinobacteria. Thirteen novel NRPS-related sequences were identified by PCR amplification screening, displaying amino acid identities of 48% to 91% to annotated sequences of members of the phyla Proteobacteria, Actinobacteria, and Cyanobacteria. Some of the identified metagenomic clones showed the closest similarity to peptide synthases from Burkholderia or Lysobacter, which are emerging bacterial sources of as-yet-undescribed bioactive metabolites. This report highlights the role of the extreme natural ecosystems as a promising source for detection of secondary compounds and enzymes, serving as a source for biotechnological applications. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  16. Sustainable rehabilitation of mining waste and acid mine drainage using geochemistry, mine type, mineralogy, texture, ore extraction and climate knowledge.

    PubMed

    Anawar, Hossain Md

    2015-08-01

    The oxidative dissolution of sulfidic minerals releases the extremely acidic leachate, sulfate and potentially toxic elements e.g., As, Ag, Cd, Cr, Cu, Hg, Ni, Pb, Sb, Th, U, Zn, etc. from different mine tailings and waste dumps. For the sustainable rehabilitation and disposal of mining waste, the sources and mechanisms of contaminant generation, fate and transport of contaminants should be clearly understood. Therefore, this study has provided a critical review on (1) recent insights in mechanisms of oxidation of sulfidic minerals, (2) environmental contamination by mining waste, and (3) remediation and rehabilitation techniques, and (4) then developed the GEMTEC conceptual model/guide [(bio)-geochemistry-mine type-mineralogy- geological texture-ore extraction process-climatic knowledge)] to provide the new scientific approach and knowledge for remediation of mining wastes and acid mine drainage. This study has suggested the pre-mining geological, geochemical, mineralogical and microtextural characterization of different mineral deposits, and post-mining studies of ore extraction processes, physical, geochemical, mineralogical and microbial reactions, natural attenuation and effect of climate change for sustainable rehabilitation of mining waste. All components of this model should be considered for effective and integrated management of mining waste and acid mine drainage. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Stakeholders' Engagement Methods for the Mining Social Responsibility Practice: Determination of Local Issues and Concerns Related to the Mines Operations in Northwest of the US.

    NASA Astrophysics Data System (ADS)

    Masaitis, A.

    2014-12-01

    Every year, all around the world, global environmental change affects the human habitat. This is effect enhanced by the mining operation, and creates new challenges in relationship between the mining and local community. The purpose of this project are developed the Stakeholders engagement evaluation plan which is currently developed in University of Nevada, Reno for the Emigrant mining project, located in the central Nevada, USA, and belong to the Newmont Mining Corporation, one of the gold production leader worldwide. The needs for this project is to create the open dialog between Newmont mining company and all interested parties which have social or environmental impacts from the Emigrant mine. Identification of the stakeholders list is first and one of the most difficult steps in the developing of mine social responsibility. Stakeholders' engagement evaluation plan must be based on the timing and available resources of the mining company, understanding the goals for the engagement, and on analyzes of the possible risks from engagement. In conclusion, the Stakeholders engagement evaluation plan includes: first, determinations of the stakeholders list, which must include any interested or effected by the mine projects groups, for example: state and local government representatives, people from local communities, business partners, environmental NGOs, indigenous people, and academic groups. The contacts and availability for communication is critical for Stakeholders engagement. Next, is to analyze characteristics of all these parties and determinate the level of interest and level of their influence on the project. The next step includes the Stakeholders matrix and mapping development, where all these information will be put together.After that, must be chosen the methods for stakeholders' engagement. The methods usually depends from the goals of engagement (create the dialog lines, collect the data, determinations of the local issues and concerns, or establish

  18. The Interaction Network Ontology-supported modeling and mining of complex interactions represented with multiple keywords in biomedical literature.

    PubMed

    Özgür, Arzucan; Hur, Junguk; He, Yongqun

    2016-01-01

    The Interaction Network Ontology (INO) logically represents biological interactions, pathways, and networks. INO has been demonstrated to be valuable in providing a set of structured ontological terms and associated keywords to support literature mining of gene-gene interactions from biomedical literature. However, previous work using INO focused on single keyword matching, while many interactions are represented with two or more interaction keywords used in combination. This paper reports our extension of INO to include combinatory patterns of two or more literature mining keywords co-existing in one sentence to represent specific INO interaction classes. Such keyword combinations and related INO interaction type information could be automatically obtained via SPARQL queries, formatted in Excel format, and used in an INO-supported SciMiner, an in-house literature mining program. We studied the gene interaction sentences from the commonly used benchmark Learning Logic in Language (LLL) dataset and one internally generated vaccine-related dataset to identify and analyze interaction types containing multiple keywords. Patterns obtained from the dependency parse trees of the sentences were used to identify the interaction keywords that are related to each other and collectively represent an interaction type. The INO ontology currently has 575 terms including 202 terms under the interaction branch. The relations between the INO interaction types and associated keywords are represented using the INO annotation relations: 'has literature mining keywords' and 'has keyword dependency pattern'. The keyword dependency patterns were generated via running the Stanford Parser to obtain dependency relation types. Out of the 107 interactions in the LLL dataset represented with two-keyword interaction types, 86 were identified by using the direct dependency relations. The LLL dataset contained 34 gene regulation interaction types, each of which associated with multiple keywords. A

  19. Global analysis of gene expression reveals mRNA superinduction is required for the inducible immune response to a bacterial pathogen

    PubMed Central

    Barry, Kevin C; Ingolia, Nicholas T; Vance, Russell E

    2017-01-01

    The inducible innate immune response to infection requires a concerted process of gene expression that is regulated at multiple levels. Most global analyses of the innate immune response have focused on transcription induced by defined immunostimulatory ligands, such as lipopolysaccharide. However, the response to pathogens involves additional complexity, as pathogens interfere with virtually every step of gene expression. How cells respond to pathogen-mediated disruption of gene expression to nevertheless initiate protective responses remains unclear. We previously discovered that a pathogen-mediated blockade of host protein synthesis provokes the production of specific pro-inflammatory cytokines. It remains unclear how these cytokines are produced despite the global pathogen-induced block of translation. We addressed this question by using parallel RNAseq and ribosome profiling to characterize the response of macrophages to infection with the intracellular bacterial pathogen Legionella pneumophila. Our results reveal that mRNA superinduction is required for the inducible immune response to a bacterial pathogen. DOI: http://dx.doi.org/10.7554/eLife.22707.001 PMID:28383283

  20. The Mechanization of Mining.

    ERIC Educational Resources Information Center

    Marovelli, Robert L.; Karhnak, John M.

    1982-01-01

    Mechanization of mining is explained in terms of its effect on the mining of coal, focusing on, among others, types of mining, productivity, machinery, benefits to retired miners, fatality rate in underground coal mines, and output of U.S. mining industry. (Author/JN)

  1. Clustering Algorithms: Their Application to Gene Expression Data

    PubMed Central

    Oyelade, Jelili; Isewon, Itunuoluwa; Oladipupo, Funke; Aromolaran, Olufemi; Uwoghiren, Efosa; Ameh, Faridah; Achas, Moses; Adebiyi, Ezekiel

    2016-01-01

    Gene expression data hide vital information required to understand the biological process that takes place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data proffers a prodigious preference to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpretation of the resulting mass of data, which consists of millions of measurements; these data also inhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has been proven to be useful in making known the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to the gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and high degree of accuracy in its analysis procedure. PMID:27932867

  2. Global gene expression profiling in infants with acute respiratory syncytial virus broncholitis demonstrates systemic activation of interferon signaling networks

    USDA-ARS?s Scientific Manuscript database

    Respiratory syncytial virus (RSV) is a leading cause of pediatric lower respiratory tract infections and has a high impact on pediatric emergency department utilization. Variation in host response may influence the pathogenesis and disease severity. We evaluated global gene expression profiles to be...

  3. Post-weaning selenium and folate supplementation affects gene and protein expression and global DNA methylation in mice fed high-fat diets.

    PubMed

    Bermingham, Emma N; Bassett, Shalome A; Young, Wayne; Roy, Nicole C; McNabb, Warren C; Cooney, Janine M; Brewster, Di T; Laing, William A; Barnett, Matthew P G

    2013-03-05

    Consumption of high-fat diets has negative impacts on health and well-being, some of which may be epigenetically regulated. Selenium and folate are two compounds which influence epigenetic mechanisms. We investigated the hypothesis that post-weaning supplementation with adequate levels of selenium and folate in offspring of female mice fed a high-fat, low selenium and folate diet during gestation and lactation will lead to epigenetic changes of potential importance for long-term health. Female offspring of mothers fed the experimental diet were either maintained on this diet (HF-low-low), or weaned onto a high-fat diet with sufficient levels of selenium and folate (HF-low-suf), for 8 weeks. Gene and protein expression, DNA methylation, and histone modifications were measured in colon and liver of female offspring. Adequate levels of selenium and folate post-weaning affected gene expression in colon and liver of offspring, including decreasing Slc2a4 gene expression. Protein expression was only altered in the liver. There was no effect of adequate levels of selenium and folate on global histone modifications in the liver. Global liver DNA methylation was decreased in mice switched to adequate levels of selenium and folate, but there was no effect on methylation of specific CpG sites within the Slc2a4 gene in liver. Post-weaning supplementation with adequate levels of selenium and folate in female offspring of mice fed high-fat diets inadequate in selenium and folate during gestation and lactation can alter global DNA methylation in liver. This may be one factor through which the negative effects of a poor diet during early life can be ameliorated. Further research is required to establish what role epigenetic changes play in mediating observed changes in gene and protein expression, and the relevance of these changes to health.

  4. Post-weaning selenium and folate supplementation affects gene and protein expression and global DNA methylation in mice fed high-fat diets

    PubMed Central

    2013-01-01

    Background Consumption of high-fat diets has negative impacts on health and well-being, some of which may be epigenetically regulated. Selenium and folate are two compounds which influence epigenetic mechanisms. We investigated the hypothesis that post-weaning supplementation with adequate levels of selenium and folate in offspring of female mice fed a high-fat, low selenium and folate diet during gestation and lactation will lead to epigenetic changes of potential importance for long-term health. Methods Female offspring of mothers fed the experimental diet were either maintained on this diet (HF-low-low), or weaned onto a high-fat diet with sufficient levels of selenium and folate (HF-low-suf), for 8 weeks. Gene and protein expression, DNA methylation, and histone modifications were measured in colon and liver of female offspring. Results Adequate levels of selenium and folate post-weaning affected gene expression in colon and liver of offspring, including decreasing Slc2a4 gene expression. Protein expression was only altered in the liver. There was no effect of adequate levels of selenium and folate on global histone modifications in the liver. Global liver DNA methylation was decreased in mice switched to adequate levels of selenium and folate, but there was no effect on methylation of specific CpG sites within the Slc2a4 gene in liver. Conclusions Post-weaning supplementation with adequate levels of selenium and folate in female offspring of mice fed high-fat diets inadequate in selenium and folate during gestation and lactation can alter global DNA methylation in liver. This may be one factor through which the negative effects of a poor diet during early life can be ameliorated. Further research is required to establish what role epigenetic changes play in mediating observed changes in gene and protein expression, and the relevance of these changes to health. PMID:23497688

  5. An open-source framework for large-scale, flexible evaluation of biomedical text mining systems

    PubMed Central

    Baumgartner, William A; Cohen, K Bretonnel; Hunter, Lawrence

    2008-01-01

    Background Improved evaluation methodologies have been identified as a necessary prerequisite to the improvement of text mining theory and practice. This paper presents a publicly available framework that facilitates thorough, structured, and large-scale evaluations of text mining technologies. The extensibility of this framework and its ability to uncover system-wide characteristics by analyzing component parts as well as its usefulness for facilitating third-party application integration are demonstrated through examples in the biomedical domain. Results Our evaluation framework was assembled using the Unstructured Information Management Architecture. It was used to analyze a set of gene mention identification systems involving 225 combinations of system, evaluation corpus, and correctness measure. Interactions between all three were found to affect the relative rankings of the systems. A second experiment evaluated gene normalization system performance using as input 4,097 combinations of gene mention systems and gene mention system-combining strategies. Gene mention system recall is shown to affect gene normalization system performance much more than does gene mention system precision, and high gene normalization performance is shown to be achievable with remarkably low levels of gene mention system precision. Conclusion The software presented in this paper demonstrates the potential for novel discovery resulting from the structured evaluation of biomedical language processing systems, as well as the usefulness of such an evaluation framework for promoting collaboration between developers of biomedical language processing technologies. The code base is available as part of the BioNLP UIMA Component Repository on SourceForge.net. PMID:18230184

  6. Overview of mine drainage geochemistry at historical mines, Humboldt River basin and adjacent mining areas, Nevada. Chapter E.

    USGS Publications Warehouse

    Nash, J. Thomas; Stillings, Lisa L.

    2004-01-01

    Reconnaissance hydrogeochemical studies of the Humboldt River basin and adjacent areas of northern Nevada have identified local sources of acidic waters generated by historical mine workings and mine waste. The mine-related acidic waters are rare and generally flow less than a kilometer before being neutralized by natural processes. Where waters have a pH of less than about 3, particularly in the presence of sulfide minerals, the waters take on high to extremely high concentrations of many potentially toxic metals. The processes that create these acidic, metal-rich waters in Nevada are the same as for other parts of the world, but the scale of transport and the fate of metals are much more localized because of the ubiquitous presence of caliche soils. Acid mine drainage is rare in historical mining districts of northern Nevada, and the volume of drainage rarely exceeds about 20 gpm. My findings are in close agreement with those of Price and others (1995) who estimated that less than 0.05 percent of inactive and abandoned mines in Nevada are likely to be a concern for acid mine drainage. Most historical mining districts have no draining mines. Only in two districts (Hilltop and National) does water affected by mining flow into streams of significant size and length (more than 8 km). Water quality in even the worst cases is naturally attenuated to meet water-quality standards within about 1 km of the source. Only a few historical mines release acidic water with elevated metal concentrations to small streams that reach the Humboldt River, and these contaminants and are not detectable in the Humboldt. These reconnaissance studies offer encouraging evidence that abandoned mines in Nevada create only minimal and local water-quality problems. Natural attenuation processes are sufficient to compensate for these relatively small sources of contamination. These results may provide useful analogs for future mining in the Humboldt River basin, but attention must be given to

  7. Triterpenoid Saponin Biosynthetic Pathway Profiling and Candidate Gene Mining of the Ilex asprella Root Using RNA-Seq

    PubMed Central

    Zheng, Xiasheng; Xu, Hui; Ma, Xinye; Zhan, Ruoting; Chen, Weiwen

    2014-01-01

    Ilex asprella, which contains abundant α-amyrin type triterpenoid saponins, is an anti-influenza herbal drug widely used in south China. In this work, we first analysed the transcriptome of the I. asprella root using RNA-Seq, which provided a dataset for functional gene mining. mRNA was isolated from the total RNA of the I. asprella root and reverse-transcribed into cDNA. Then, the cDNA library was sequenced using an Illumina HiSeq™ 2000, which generated 55,028,452 clean reads. De novo assembly of these reads generated 51,865 unigenes, in which 39,269 unigenes were annotated (75.71% yield). According to the structures of the triterpenoid saponins of I. asprella, a putative biosynthetic pathway downstream of 2,3-oxidosqualene was proposed and candidate unigenes in the transcriptome data that were potentially involved in the pathway were screened using homology-based BLAST and phylogenetic analysis. Further amplification and functional analysis of these putative unigenes will provide insight into the biosynthesis of Ilex triterpenoid saponins. PMID:24722569

  8. Hydrogeochemical assessment of mine-impacted water and sediment of iron ore mining

    NASA Astrophysics Data System (ADS)

    Nur Atirah Affandi, Fatin; Kusin, Faradiella Mohd; Aqilah Sulong, Nur; Madzin, Zafira

    2018-04-01

    This study was carried out to evaluate the hydrogeochemical behaviour of mine-impacted water and sediment of a former iron ore mining area. Sampling of mine water and sediment were carried out at selected locations within the mine including the former mining ponds, mine tailings and the nearby stream. The water samples were analysed for their hydrochemical facies, major and trace elements including heavy metals. The water in the mining ponds and the mine tailings was characterised as highly acidic (pH 2.54-3.07), but has near-neutral pH in the nearby stream. Results indicated that Fe and Mn in water have exceeded the recommended guidelines values and was also supported by the results of geochemical modelling. The results also indicated that sediments in the mining area were contaminated with Cd and As as shown by the potential ecological risk index values. The total risk index of heavy metals in the sediment were ranked in the order of Cd>As>Pb>Cu>Zn>Cr. Overall, the extent of potential ecological risks of the mining area were categorised as having low to moderate ecological risk.

  9. Nome Offshore Mining Information

    Science.gov Websites

    Lands Coal Regulatory Program Large Mine Permits Mineral Property and Rights Mining Index Land potential safety concerns, prevent overcrowding, and provide for efficient processing of the permits and Regulatory Program Large Mine Permitting Mineral Property Management Mining Fact Sheets Mining Forms APMA

  10. TCGA4U: A Web-Based Genomic Analysis Platform To Explore And Mine TCGA Genomic Data For Translational Research.

    PubMed

    Huang, Zhenzhen; Duan, Huilong; Li, Haomin

    2015-01-01

    Large-scale human cancer genomics projects, such as TCGA, generated large genomics data for further study. Exploring and mining these data to obtain meaningful analysis results can help researchers find potential genomics alterations that intervene the development and metastasis of tumors. We developed a web-based gene analysis platform, named TCGA4U, which used statistics methods and models to help translational investigators explore, mine and visualize human cancer genomic characteristic information from the TCGA datasets. Furthermore, through Gene Ontology (GO) annotation and clinical data integration, the genomic data were transformed into biological process, molecular function, cellular component and survival curves to help researchers identify potential driver genes. Clinical researchers without expertise in data analysis will benefit from such a user-friendly genomic analysis platform.

  11. Collaborative Data Mining

    NASA Astrophysics Data System (ADS)

    Moyle, Steve

    Collaborative Data Mining is a setting where the Data Mining effort is distributed to multiple collaborating agents - human or software. The objective of the collaborative Data Mining effort is to produce solutions to the tackled Data Mining problem which are considered better by some metric, with respect to those solutions that would have been achieved by individual, non-collaborating agents. The solutions require evaluation, comparison, and approaches for combination. Collaboration requires communication, and implies some form of community. The human form of collaboration is a social task. Organizing communities in an effective manner is non-trivial and often requires well defined roles and processes. Data Mining, too, benefits from a standard process. This chapter explores the standard Data Mining process CRISP-DM utilized in a collaborative setting.

  12. The Potential of Text Mining in Data Integration and Network Biology for Plant Research: A Case Study on Arabidopsis[C][W

    PubMed Central

    Van Landeghem, Sofie; De Bodt, Stefanie; Drebert, Zuzanna J.; Inzé, Dirk; Van de Peer, Yves

    2013-01-01

    Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein–protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies. PMID:23532071

  13. 76 FR 70075 - Proximity Detection Systems for Continuous Mining Machines in Underground Coal Mines

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-11-10

    ... Detection Systems for Continuous Mining Machines in Underground Coal Mines AGENCY: Mine Safety and Health... proposed rule addressing Proximity Detection Systems for Continuous Mining Machines in Underground Coal... Detection Systems for Continuous Mining Machines in Underground Coal Mines. MSHA conducted hearings on...

  14. 76 FR 63238 - Proximity Detection Systems for Continuous Mining Machines in Underground Coal Mines

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-10-12

    ... Detection Systems for Continuous Mining Machines in Underground Coal Mines AGENCY: Mine Safety and Health... Agency's proposed rule addressing Proximity Detection Systems for Continuous Mining Machines in... proposed rule for Proximity Detection Systems on Continuous Mining Machines in Underground Coal Mines. Due...

  15. GExplore: a web server for integrated queries of protein domains, gene expression and mutant phenotypes

    PubMed Central

    2009-01-01

    Background The majority of the genes even in well-studied multi-cellular model organisms have not been functionally characterized yet. Mining the numerous genome wide data sets related to protein function to retrieve potential candidate genes for a particular biological process remains a challenge. Description GExplore has been developed to provide a user-friendly database interface for data mining at the gene expression/protein function level to help in hypothesis development and experiment design. It supports combinatorial searches for proteins with certain domains, tissue- or developmental stage-specific expression patterns, and mutant phenotypes. GExplore operates on a stand-alone database and has fast response times, which is essential for exploratory searches. The interface is not only user-friendly, but also modular so that it accommodates additional data sets in the future. Conclusion GExplore is an online database for quick mining of data related to gene and protein function, providing a multi-gene display of data sets related to the domain composition of proteins as well as expression and phenotype data. GExplore is publicly available at: http://genome.sfu.ca/gexplore/ PMID:19917126

  16. Microbial sulfate reduction and metal attenuation in pH 4 acid mine water

    USGS Publications Warehouse

    Church, C.D.; Wilkin, R.T.; Alpers, Charles N.; Rye, R.O.; Blaine, R.B.

    2007-01-01

    Sediments recovered from the flooded mine workings of the Penn Mine, a Cu-Zn mine abandoned since the early 1960s, were cultured for anaerobic bacteria over a range of pH (4.0 to 7.5). The molecular biology of sediments and cultures was studied to determine whether sulfate-reducing bacteria (SRB) were active in moderately acidic conditions present in the underground mine workings. Here we document multiple, independent analyses and show evidence that sulfate reduction and associated metal attenuation are occurring in the pH-4 mine environment. Water-chemistry analyses of the mine water reveal: (1) preferential complexation and precipitation by H2S of Cu and Cd, relative to Zn; (2) stable isotope ratios of 34S/32S and 18O/16O in dissolved SO4 that are 2-3 ??? heavier in the mine water, relative to those in surface waters; (3) reduction/oxidation conditions and dissolved gas concentrations consistent with conditions to support anaerobic processes such as sulfate reduction. Scanning electron microscope (SEM) analyses of sediment show 1.5-micrometer, spherical ZnS precipitates. Phospholipid fatty acid (PLFA) and denaturing gradient gel electrophoresis (DGGE) analyses of Penn Mine sediment show a high biomass level with a moderately diverse community structure composed primarily of iron- and sulfate-reducing bacteria. Cultures of sediment from the mine produced dissolved sulfide at pH values near 7 and near 4, forming precipitates of either iron sulfide or elemental sulfur. DGGE coupled with sequence and phylogenetic analysis of 16S rDNA gene segments showed populations of Desulfosporosinus and Desulfitobacterium in Penn Mine sediment and laboratory cultures. ?? 2007 Church et al; licensee BioMed Central Ltd.

  17. Microbial sulfate reduction and metal attenuation in pH 4 acid mine water

    PubMed Central

    Church, Clinton D; Wilkin, Richard T; Alpers, Charles N; Rye, Robert O; McCleskey, R Blaine

    2007-01-01

    Sediments recovered from the flooded mine workings of the Penn Mine, a Cu-Zn mine abandoned since the early 1960s, were cultured for anaerobic bacteria over a range of pH (4.0 to 7.5). The molecular biology of sediments and cultures was studied to determine whether sulfate-reducing bacteria (SRB) were active in moderately acidic conditions present in the underground mine workings. Here we document multiple, independent analyses and show evidence that sulfate reduction and associated metal attenuation are occurring in the pH-4 mine environment. Water-chemistry analyses of the mine water reveal: (1) preferential complexation and precipitation by H2S of Cu and Cd, relative to Zn; (2) stable isotope ratios of 34S/32S and 18O/16O in dissolved SO4 that are 2–3 ‰ heavier in the mine water, relative to those in surface waters; (3) reduction/oxidation conditions and dissolved gas concentrations consistent with conditions to support anaerobic processes such as sulfate reduction. Scanning electron microscope (SEM) analyses of sediment show 1.5-micrometer, spherical ZnS precipitates. Phospholipid fatty acid (PLFA) and denaturing gradient gel electrophoresis (DGGE) analyses of Penn Mine sediment show a high biomass level with a moderately diverse community structure composed primarily of iron- and sulfate-reducing bacteria. Cultures of sediment from the mine produced dissolved sulfide at pH values near 7 and near 4, forming precipitates of either iron sulfide or elemental sulfur. DGGE coupled with sequence and phylogenetic analysis of 16S rDNA gene segments showed populations of Desulfosporosinus and Desulfitobacterium in Penn Mine sediment and laboratory cultures. PMID:17956615

  18. Global Analysis of Gene Expression Profiles in Developing Physic Nut (Jatropha curcas L.) Seeds

    PubMed Central

    Jiang, Huawu; Wu, Pingzhi; Zhang, Sheng; Song, Chi; Chen, Yaping; Li, Meiru; Jia, Yongxia; Fang, Xiaohua; Chen, Fan; Wu, Guojiang

    2012-01-01

    Background Physic nut (Jatropha curcas L.) is an oilseed plant species with high potential utility as a biofuel. Furthermore, following recent sequencing of its genome and the availability of expressed sequence tag (EST) libraries, it is a valuable model plant for studying carbon assimilation in endosperms of oilseed plants. There have been several transcriptomic analyses of developing physic nut seeds using ESTs, but they have provided limited information on the accumulation of stored resources in the seeds. Methodology/Principal Findings We applied next-generation Illumina sequencing technology to analyze global gene expression profiles of developing physic nut seeds 14, 19, 25, 29, 35, 41, and 45 days after pollination (DAP). The acquired profiles reveal the key genes, and their expression timeframes, involved in major metabolic processes including: carbon flow, starch metabolism, and synthesis of storage lipids and proteins in the developing seeds. The main period of storage reserves synthesis in the seeds appears to be 29–41 DAP, and the fatty acid composition of the developing seeds is consistent with relative expression levels of different isoforms of acyl-ACP thioesterase and fatty acid desaturase genes. Several transcription factor genes whose expression coincides with storage reserve deposition correspond to those known to regulate the process in Arabidopsis. Conclusions/Significance The results will facilitate searches for genes that influence de novo lipid synthesis, accumulation and their regulatory networks in developing physic nut seeds, and other oil seeds. Thus, they will be helpful in attempts to modify these plants for efficient biofuel production. PMID:22574177

  19. Approaches to Post-Mining Land Reclamation in Polish Open-Cast Lignite Mining

    NASA Astrophysics Data System (ADS)

    Kasztelewicz, Zbigniew

    2014-06-01

    The paper presents the situation regarding the reclamation of post-mining land in the case of particular lignite mines in Poland until 2012 against the background of the whole opencast mining. It discusses the process of land purchase for mining operations and its sales after reclamation. It presents the achievements of mines in the reclamation and regeneration of post-mining land as a result of which-after development processes carried out according to European standards-it now serves the inhabitants as a recreational area that increases the attractiveness of the regions.

  20. Mercury Pollution from Small-Scale Gold Mining Can Be Stopped by Implementing the Gravity-Borax Method--A Two-Year Follow-Up Study from Two Mining Communities in the Philippines.

    PubMed

    Køster-Rasmussen, Rasmus; Westergaard, Maria L; Brasholt, Marie; Gutierrez, Richard; Jørs, Erik; Thomsen, Jane F

    2016-02-01

    Mercury is used globally to extract gold in artisanal and small-scale gold mining. The mercury-free gravity-borax method for gold extraction was introduced in two mining communities using mercury in the provinces Kalinga and Camarines Norte. This article describes project activities and quantitative changes in mercury consumption and analyzes the implementation with diffusion of innovations theory. Activities included miner-to-miner training; seminars for health-care workers, school teachers, and children; and involvement of community leaders. Baseline (2011) and follow-up (2013) data were gathered on mining practices and knowledge about mercury toxicology. Most miners in Kalinga converted to the gravity-borax method, whereas only a few did so in Camarines Norte. Differences in the nature of the social systems impacted the success of the implementation, and involvement of the tribal organization facilitated the shift in Kalinga. In conclusion, the gravity-borax method is a doable alternative to mercury use in artisanal and small-scale gold mining, but support from the civil society is needed. © The Author(s) 2016.

  1. Numerical Study on 4-1 Coal Seam of Xiaoming Mine in Ascending Mining

    PubMed Central

    Tianwei, Lan; Hongwei, Zhang; Sheng, Li; Weihua, Song; Batugin, A. C.; Guoshui, Tang

    2015-01-01

    Coal seams ascending mining technology is very significant, since it influences the safety production and the liberation of dull coal, speeds up the construction of energy, improves the stability of stope, and reduces or avoids deep hard rock mining induced mine disaster. Combined with the Xiaoming ascending mining mine 4-1, by numerical calculation, the paper analyses ascending mining 4-1 factors, determines the feasibility of ascending mining 4-1 coalbed, and proposes roadway layout program about working face, which has broad economic and social benefits. PMID:25866840

  2. Thin seam miner/trench mining concepts for Illinois Basin surface coal mines

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Caudle, R.D.; Lall, V.

    1985-07-01

    A hybrid surface/underground mining concept, trench-auger mining is an attempt to increase the depth to which coal seams can be surface mined economically by reducing the amount of overburden which must be removed and reclaimed. In this concept the coal seam is first exposed by digging a series of parallel trenches 400 to 1200 ft apart with conventional surface mining equipment. After surface mining the coal from the bottom of the trench, the coal under the surface between the trenches would be extracted with extended-depth augers, operating from the bottoms of the trenches. The RSV Mining Equipment Co. of Hollandmore » has developed a Thin Seam Miner (TSM). The TSM is essentially a remotely controlled, continuous underground mining machine. The hydraulically driven drum cutter head and coal handling auger flights can be operated from a distance outside the underground mine workings. The purpose of this study is to develop and evaluate Thin Seam Miner/Trench Mining (TSM/TM) concepts for use under conditions existing in the Illinois Coal Basin.« less

  3. Dynamic association rules for gene expression data analysis.

    PubMed

    Chen, Shu-Chuan; Tsai, Tsung-Hsien; Chung, Cheng-Han; Li, Wen-Hsiung

    2015-10-14

    The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed

  4. Mining microarray datasets in nutrition: expression of the GPR120 (n-3 fatty acid receptor/sensor) gene is down-regulated in human adipocytes by macrophage secretions.

    PubMed

    Trayhurn, Paul; Denyer, Gareth

    2012-01-01

    Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity.

  5. Global gene expression of Prochlorococcus ecotypes in response to changes in nitrogen availability

    PubMed Central

    Tolonen, Andrew C; Aach, John; Lindell, Debbie; Johnson, Zackary I; Rector, Trent; Steen, Robert; Church, George M; Chisholm, Sallie W

    2006-01-01

    Nitrogen (N) often limits biological productivity in the oceanic gyres where Prochlorococcus is the most abundant photosynthetic organism. The Prochlorococcus community is composed of strains, such as MED4 and MIT9313, that have different N utilization capabilities and that belong to ecotypes with different depth distributions. An interstrain comparison of how Prochlorococcus responds to changes in ambient nitrogen is thus central to understanding its ecology. We quantified changes in MED4 and MIT9313 global mRNA expression, chlorophyll fluorescence, and photosystem II photochemical efficiency (Fv/Fm) along a time series of increasing N starvation. In addition, the global expression of both strains growing in ammonium-replete medium was compared to expression during growth on alternative N sources. There were interstrain similarities in N regulation such as the activation of a putative NtcA regulon during N stress. There were also important differences between the strains such as in the expression patterns of carbon metabolism genes, suggesting that the two strains integrate N and C metabolism in fundamentally different ways. PMID:17016519

  6. Global gene expression in muscle from fasted/refed trout reveals up-regulation of genes promoting myofibre hypertrophy but not myofibre production.

    PubMed

    Rescan, Pierre-Yves; Le Cam, Aurelie; Rallière, Cécile; Montfort, Jérôme

    2017-06-07

    Compensatory growth is a phase of rapid growth, greater than the growth rate of control animals, that occurs after a period of growth-stunting conditions. Fish show a capacity for compensatory growth after alleviation of dietary restriction, but the underlying cellular mechanisms are unknown. To learn more about the contribution of genes regulating hypertrophy (an increase in muscle fibre size) and hyperplasia (the generation of new muscle fibres) in the compensatory muscle growth response in fish, we used high-density microarray analysis to investigate the global gene expression in muscle of trout during a fasting-refeeding schedule and in muscle of control-fed trout displaying normal growth. The compensatory muscle growth signature, as defined by genes up-regulated in muscles of refed trout compared with control-fed trout, showed enrichment in functional categories related to protein biosynthesis and maturation, such as RNA processing, ribonucleoprotein complex biogenesis, ribosome biogenesis, translation and protein folding. This signature was also enriched in chromatin-remodelling factors of the protein arginine N-methyl transferase family. Unexpectedly, functional categories related to cell division and DNA replication were not inferred from the molecular signature of compensatory muscle growth, and this signature contained virtually none of the genes previously reported to be up-regulated in hyperplastic growth zones of the late trout embryo myotome and to potentially be involved in production of new myofibres, notably genes encoding myogenic regulatory factors, transmembrane receptors essential for myoblast fusion or myofibrillar proteins predominant in nascent myofibres. Genes promoting myofibre growth, but not myofibre formation, were up-regulated in muscles of refed trout compared with continually fed trout. This suggests that a compensatory muscle growth response, resulting from the stimulation of hypertrophy but not the stimulation of hyperplasia

  7. Mining the archives: a cross-platform analysis of gene ...

    EPA Pesticide Factsheets

    Formalin-fixed paraffin-embedded (FFPE) tissue samples represent a potentially invaluable resource for genomic research into the molecular basis of disease. However, use of FFPE samples in gene expression studies has been limited by technical challenges resulting from degradation of nucleic acids. Here we evaluated gene expression profiles derived from fresh-frozen (FRO) and FFPE mouse liver tissues using two DNA microarray protocols and two whole transcriptome sequencing (RNA-seq) library preparation methodologies. The ribo-depletion protocol outperformed the other three methods by having the highest correlations of differentially expressed genes (DEGs) and best overlap of pathways between FRO and FFPE groups. We next tested the effect of sample time in formalin (18 hours or 3 weeks) on gene expression profiles. Hierarchical clustering of the datasets indicated that test article treatment, and not preservation method, was the main driver of gene expression profiles. Meta- and pathway analyses indicated that biological responses were generally consistent for 18-hour and 3-week FFPE samples compared to FRO samples. However, clear erosion of signal intensity with time in formalin was evident, and DEG numbers differed by platform and preservation method. Lastly, we investigated the effect of age in FFPE block on genomic profiles. RNA-seq analysis of 8-, 19-, and 26-year-old control blocks using the ribo-depletion protocol resulted in comparable quality metrics, inc

  8. Design risk assessment for burst-prone mines: Application in a Canadian mine

    NASA Astrophysics Data System (ADS)

    Cheung, David J.

    A proactive stance towards improving the effectiveness and consistency of risk assessments has been adopted recently by mining companies and industry. The next 10-20 years forecasts that ore deposits accessible using shallow mining techniques will diminish. The industry continues to strive for success in "deeper" mining projects in order to keep up with the continuing demand for raw materials. Although the returns are quite profitable, many projects have been sidelined due to high uncertainty and technical risk in the mining of the mineral deposit. Several hardrock mines have faced rockbursting and seismicity problems. Within those reported, mines in countries like South Africa, Australia and Canada have documented cases of severe rockburst conditions attributed to the mining depth. Severe rockburst conditions known as "burst-prone" can be effectively managed with design. Adopting a more robust design can ameliorate the exposure of workers and equipment to adverse conditions and minimize the economic consequences, which can hinder the bottom line of an operation. This thesis presents a methodology created for assessing the design risk in burst-prone mines. The methodology includes an evaluation of relative risk ratings for scenarios with options of risk reduction through several design principles. With rockbursts being a hazard of seismic events, the methodology is based on research in the area of mining seismicity factoring in rockmass failure mechanisms, which results from a combination of mining induced stress, geological structures, rockmass properties and mining influences. The methodology was applied to case studies at Craig Mine of Xstrata Nickel in Sudbury, Ontario, which is known to contain seismically active fault zones. A customized risk assessment was created and applied to rockburst case studies, evaluating the seismic vulnerability and consequence for each case. Application of the methodology to Craig Mine demonstrates that changes in the design can

  9. Impact of Neutron Exposure on Global Gene Expression in a Human Peripheral Blood Model

    PubMed Central

    Broustas, Constantinos G.; Xu, Yanping; Harken, Andrew D.; Chowdhury, Mashkura; Garty, Guy; Amundson, Sally A.

    2017-01-01

    The detonation of an improvised nuclear device would produce prompt radiation consisting of both photons (gamma rays) and neutrons. While much effort in recent years has gone into the development of radiation biodosimetry methods suitable for mass triage, the possible effect of neutrons on the endpoints studied has remained largely uninvestigated. We have used a novel neutron irradiator with an energy spectrum based on that 1–1.5 km from the epicenter of the Hiroshima blast to begin examining the effect of neutrons on global gene expression, and the impact this may have on the development of gene expression signatures for radiation biodosimetry. We have exposed peripheral blood from healthy human donors to 0.1, 0.3, 0.5 or 1 Gy of neutrons ex vivo using our neutron irradiator, and compared the transcriptomic response 24 h later to that resulting from sham exposure or exposure to 0.1, 0.3, 0.5, 1, 2 or 4 Gy of photons (X rays). We identified 125 genes that responded significantly to both radiation qualities as a function of dose, with the magnitude of response to neutrons generally being greater than that seen after X-ray exposure. Gene ontology analysis suggested broad involvement of the p53 signaling pathway and general DNA damage response functions across all doses of both radiation qualities. Regulation of immune response and chromatin-related functions were implicated only following the highest doses of neutrons, suggesting a physiological impact of greater DNA damage. We also identified several genes that seem to respond primarily as a function of dose, with less effect of radiation quality. We confirmed this pattern of response by quantitative real-time RT-PCR for BAX, TNFRSF10B, ITLN2 and AEN and suggest that gene expression may provide a means to differentiate between total dose and a neutron component. PMID:28140791

  10. Impact of Neutron Exposure on Global Gene Expression in a Human Peripheral Blood Model.

    PubMed

    Broustas, Constantinos G; Xu, Yanping; Harken, Andrew D; Chowdhury, Mashkura; Garty, Guy; Amundson, Sally A

    2017-04-01

    The detonation of an improvised nuclear device would produce prompt radiation consisting of both photons (gamma rays) and neutrons. While much effort in recent years has gone into the development of radiation biodosimetry methods suitable for mass triage, the possible effect of neutrons on the endpoints studied has remained largely uninvestigated. We have used a novel neutron irradiator with an energy spectrum based on that 1-1.5 km from the epicenter of the Hiroshima blast to begin examining the effect of neutrons on global gene expression, and the impact this may have on the development of gene expression signatures for radiation biodosimetry. We have exposed peripheral blood from healthy human donors to 0.1, 0.3, 0.5 or 1 Gy of neutrons ex vivo using our neutron irradiator, and compared the transcriptomic response 24 h later to that resulting from sham exposure or exposure to 0.1, 0.3, 0.5, 1, 2 or 4 Gy of photons (X rays). We identified 125 genes that responded significantly to both radiation qualities as a function of dose, with the magnitude of response to neutrons generally being greater than that seen after X-ray exposure. Gene ontology analysis suggested broad involvement of the p53 signaling pathway and general DNA damage response functions across all doses of both radiation qualities. Regulation of immune response and chromatin-related functions were implicated only following the highest doses of neutrons, suggesting a physiological impact of greater DNA damage. We also identified several genes that seem to respond primarily as a function of dose, with less effect of radiation quality. We confirmed this pattern of response by quantitative real-time RT-PCR for BAX, TNFRSF10B, ITLN2 and AEN and suggest that gene expression may provide a means to differentiate between total dose and a neutron component.

  11. Geochemical Characterization of Mine Waste, Mine Drainage, and Stream Sediments at the Pike Hill Copper Mine Superfund Site, Orange County, Vermont

    USGS Publications Warehouse

    Piatak, Nadine M.; Seal, Robert R.; Hammarstrom, Jane M.; Kiah, Richard G.; Deacon, Jeffrey R.; Adams, Monique; Anthony, Michael W.; Briggs, Paul H.; Jackson, John C.

    2006-01-01

    The Pike Hill Copper Mine Superfund Site in the Vermont copper belt consists of the abandoned Smith, Eureka, and Union mines, all of which exploited Besshi-type massive sulfide deposits. The site was listed on the U.S. Environmental Protection Agency (USEPA) National Priorities List in 2004 due to aquatic ecosystem impacts. This study was intended to be a precursor to a formal remedial investigation by the USEPA, and it focused on the characterization of mine waste, mine drainage, and stream sediments. A related study investigated the effects of the mine drainage on downstream surface waters. The potential for mine waste and drainage to have an adverse impact on aquatic ecosystems, on drinking- water supplies, and to human health was assessed on the basis of mineralogy, chemical concentrations, acid generation, and potential for metals to be leached from mine waste and soils. The results were compared to those from analyses of other Vermont copper belt Superfund sites, the Elizabeth Mine and Ely Copper Mine, to evaluate if the waste material at the Pike Hill Copper Mine was sufficiently similar to that of the other mine sites that USEPA can streamline the evaluation of remediation technologies. Mine-waste samples consisted of oxidized and unoxidized sulfidic ore and waste rock, and flotation-mill tailings. These samples contained as much as 16 weight percent sulfides that included chalcopyrite, pyrite, pyrrhotite, and sphalerite. During oxidation, sulfides weather and may release potentially toxic trace elements and may produce acid. In addition, soluble efflorescent sulfate salts were identified at the mines; during rain events, the dissolution of these salts contributes acid and metals to receiving waters. Mine waste contained concentrations of cadmium, copper, and iron that exceeded USEPA Preliminary Remediation Goals. The concentrations of selenium in mine waste were higher than the average composition of eastern United States soils. Most mine waste was

  12. Alchemy and mining: metallogenesis and prospecting in early mining books.

    PubMed

    Dym, Warren Alexander

    2008-11-01

    Historians have assumed that alchemy had a close association with mining, but exactly how and why miners were interested in alchemy remains unclear. This paper argues that alchemical theory began to be synthesised with classical and Christian theories of the earth in mining books after 1500, and served an important practical function. The theory of metals that mining officials addressed spoke of mineral vapours (Witterungen) that left visible markings on the earth's surface. The prospector searched for mineral ore in part by studying these indications. Mineral vapours also explained the functioning of the dowsing rod, which prospectors applied to the discovery of ore. Historians of early chemistry and mining have claimed that mining had a modernising influence by stripping alchemy of its theoretical component, but this paper shows something quite to the contrary: mining officials may have been sceptical of the possibility of artificial transmutation, but they were interested in a theory of the earth that could translate into prospecting knowledge.

  13. Isolation and identification of a Candida digboiensis strain from an extreme acid mine drainage of the Lignite Mine, Gujarat.

    PubMed

    Patel, Mitesh J; Tipre, Devayani R; Dave, Shailesh R

    2009-12-01

    An extremely acidic mine drainage (AMD) water sample was collected in 1998 and 2008 from Panandhro lignite mine, Gujarat, India. The yeast isolated from this sample was identified using mini API identification system, as a member of genus Candida. The major cellular fatty acids detected by FAME from the isolate are C(16:0) and C(18:2) (cis 9,12)/C(18:0alpha) as 25.23 and 19.5%, respectively. The isolate was identified as Candida digboiensis by 18S rRNA gene sequence analysis and designated as Candida digboiensis SRDyeast1. Phylogenetic analysis using D1/D2 variable domains showed that the closest relative of this strain is Candida blankii with 3% divergence. This organism has been reported for the first time from the lignite mine AMD sample, and for cellular fatty acid analysis. This yeast is able to survive in the AMD sample preserved at 10-42 degrees C temperature since last 10 years along with iron oxidizing microorganisms. It can grow in the presence of 40% glucose, 10% NaCl and in the pH range of 1 to 10. The isolate is capable of producing enzymes like protease and lipase. This isolate differs from the type strain Candida digboiensis in as many as six physiological and metabolic characteristics.

  14. Coastal mining

    NASA Astrophysics Data System (ADS)

    Bell, Peter M.

    The Exclusive Economic Zone (EEZ) declared by President Reagan in March 1983 has met with a mixed response from those who would benefit from a guaranteed, 200-nautical-mile (370-km) protected underwater mining zone off the coasts of the United States and its possessions. On the one hand, the U.S. Department of the Interior is looking ahead and has been very successful in safeguarding important natural resources that will be needed in the coming decades. On the other hand, the mining industry is faced with a depressed metals and mining market.A report of the Exclusive Economic Zone Symposium held in November 1983 by the U.S. Geological Survey, the Mineral Management Service, and the Bureau of Mines described the mixed response as: “ … The Department of Interior … raring to go into promotion of deep-seal mining but industrial consortia being very pessimistic about the program, at least for the next 30 or so years.” (Chemical & Engineering News, February 5, 1983).

  15. Mining candidate genes associated with powdery mildew resistance in cucumber via super-BSA by specific length amplified fragment (SLAF) sequencing.

    PubMed

    Zhang, Peng; Zhu, Yuqiang; Wang, Lili; Chen, Liping; Zhou, Shengjun

    2015-12-14

    Powdery mildew (PM) is the most common fungal disease of cucumber and other cucurbit crops, while breeding the PM-resistant materials is the effective way to defense this disease, and the recent development of modern genetics and genomics make us aware of that studying the resistance genes is the essential way to breed the PM high-resistance plant. With the ever increasing throughput of next-generation sequencing (NGS), the development of specific length amplified fragment sequencing (SLAF-seq) as a high-resolution strategy for large-scale de novo SNP discovery is gradually applied for functional gene mining. Here we combined the bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with PM resistance in cucumber. A segregating population comprising 251 F2 individuals was developed using H136 (female parent) as susceptible parent and BK2 (male parent) as resistance donor. After PMR test, total genomic DNA was prepared from each plant. Systemic genomic analysis of the GC content, repeat sequence, etc. was carried out by prediction software SLAF_Predict to establish condition to ensure the uniformity and density of the molecular markers. After samples were gel purified, SLAFs were generated at Biomarker Technologies Corporation in Beijing. Based on SLAF tags and the PMR test result, the hot region were annotated. A total of 73,100 high-quality SLAF tags with an average depth of 99.11× were sequenced. Among these, 5,355 polymorphic tags were identified with a polymorphism rate of 7.34 %, including 7.09 % SNPs and other polymorphism types. Finally, 140 associated SLAFs were identified, and two main Hot Regions were detected on chromosome 1 and 6, which contained five genes invovled in defense response, toxin metabolism, cell stress response, and injury response in cucumber. Associated markers identified by super-BSA in this study, could not only speed up the study of the PMR genes, but also provide a feasible solution for breeding the

  16. Global gene transcriptome analysis in vaccinated cattle revealed a dominant role of IL-22 for protection against bovine tuberculosis.

    PubMed

    Bhuju, Sabin; Aranday-Cortes, Elihu; Villarreal-Ramos, Bernardo; Xing, Zhou; Singh, Mahavir; Vordermeier, H Martin

    2012-12-01

    Bovine tuberculosis (bTB) is a chronic disease of cattle caused by Mycobacterium bovis, a member of the Mycobacterium tuberculosis complex group of bacteria. Vaccination of cattle might offer a long-term solution for controlling the disease and priority has been given to the development of a cattle vaccine against bTB. Identification of biomarkers in tuberculosis research remains elusive and the goal is to identify host correlates of protection. We hypothesized that by studying global gene expression we could identify in vitro predictors of protection that could help to facilitate vaccine development. Calves were vaccinated with BCG or with a heterologous BCG prime adenovirally vectored subunit boosting protocol. Protective efficacy was determined after M. bovis challenge. RNA was prepared from PPD-stimulated PBMC prepared from vaccinated-protected, vaccinated-unprotected and unvaccinated control cattle prior to M. bovis challenge and global gene expression determined by RNA-seq. 668 genes were differentially expressed in vaccinated-protected cattle compared with vaccinated-unprotected and unvaccinated control cattle. Cytokine-cytokine receptor interaction was the most significant pathway related to this dataset with IL-22 expression identified as the dominant surrogate of protection besides INF-γ. Finally, the expression of these candidate genes identified by RNA-seq was evaluated by RT-qPCR in an independent set of PBMC samples from BCG vaccinated and unvaccinated calves. This experiment confirmed the importance of IL-22 as predictor of vaccine efficacy.

  17. Microfilming maps of abandoned anthracite mines: mines in the southern anthracite field

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gait, G.B.

    1978-01-01

    This report is the fifth in a series concerning the Bureau of Mines program for microfilming maps of abandoned mines in the Pennsylvania anthracite region. A catalog of the microfilmed maps of 47 of 49 major mines and 18 independent mines in the Southern field is presented. Previous reports included catalogs of microfilmed maps of mines in the Eastern Middle field, the Wyoming and Lackawanna Basins of the Northern field, and the Western Middle anthracite field.

  18. Global Gene Expression Profiling in PAI-1 Knockout Murine Heart and Kidney: Molecular Basis of Cardiac-Selective Fibrosis

    PubMed Central

    Ghosh, Asish K.; Murphy, Sheila B.; Kishore, Raj; Vaughan, Douglas E.

    2013-01-01

    Fibrosis is defined as an abnormal matrix remodeling due to excessive synthesis and accumulation of extracellular matrix proteins in tissues during wound healing or in response to chemical, mechanical and immunological stresses. At present, there is no effective therapy for organ fibrosis. Previous studies demonstrated that aged plasminogen activator inhibitor-1(PAI-1) knockout mice develop spontaneously cardiac-selective fibrosis without affecting any other organs. We hypothesized that differential expressions of profibrotic and antifibrotic genes in PAI-1 knockout hearts and unaffected organs lead to cardiac selective fibrosis. In order to address this prediction, we have used a genome-wide gene expression profiling of transcripts derived from aged PAI-1 knockout hearts and kidneys. The variations of global gene expression profiling were compared within four groups: wildtype heart vs. knockout heart; wildtype kidney vs. knockout kidney; knockout heart vs. knockout kidney and wildtype heart vs. wildtype kidney. Analysis of illumina-based microarray data revealed that several genes involved in different biological processes such as immune system processing, response to stress, cytokine signaling, cell proliferation, adhesion, migration, matrix organization and transcriptional regulation were affected in hearts and kidneys by the absence of PAI-1, a potent inhibitor of urokinase and tissue-type plasminogen activator. Importantly, the expressions of a number of genes, involved in profibrotic pathways including Ankrd1, Pi16, Egr1, Scx, Timp1, Timp2, Klf6, Loxl1 and Klotho, were deregulated in PAI-1 knockout hearts compared to wildtype hearts and PAI-1 knockout kidneys. While the levels of Ankrd1, Pi16 and Timp1 proteins were elevated during EndMT, the level of Timp4 protein was decreased. To our knowledge, this is the first comprehensive report on the influence of PAI-1 on global gene expression profiling in the heart and kidney and its implication in fibrogenesis and

  19. Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications.

    PubMed

    Iddamalgoda, Lahiru; Das, Partha S; Aponso, Achala; Sundararajan, Vijayaraghava S; Suravajhala, Prashanth; Valadi, Jayaraman K

    2016-01-01

    Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation.

  20. Roles for text mining in protein function prediction.

    PubMed

    Verspoor, Karin M

    2014-01-01

    The Human Genome Project has provided science with a hugely valuable resource: the blueprints for life; the specification of all of the genes that make up a human. While the genes have all been identified and deciphered, it is proteins that are the workhorses of the human body: they are essential to virtually all cell functions and are the primary mechanism through which biological function is carried out. Hence in order to fully understand what happens at a molecular level in biological organisms, and eventually to enable development of treatments for diseases where some aspect of a biological system goes awry, we must understand the functions of proteins. However, experimental characterization of protein function cannot scale to the vast amount of DNA sequence data now available. Computational protein function prediction has therefore emerged as a problem at the forefront of modern biology (Radivojac et al., Nat Methods 10(13):221-227, 2013).Within the varied approaches to computational protein function prediction that have been explored, there are several that make use of biomedical literature mining. These methods take advantage of information in the published literature to associate specific proteins with specific protein functions. In this chapter, we introduce two main strategies for doing this: association of function terms, represented as Gene Ontology terms (Ashburner et al., Nat Genet 25(1):25-29, 2000), to proteins based on information in published articles, and a paradigm called LEAP-FS (Literature-Enhanced Automated Prediction of Functional Sites) in which literature mining is used to validate the predictions of an orthogonal computational protein function prediction method.

  1. Mining Metatranscriptomic Data of a Cyanobacterial Bloom for Patterns of Secondary Metabolism Gene Expression

    NASA Astrophysics Data System (ADS)

    Penn, K.; Wang, J.; Thompson, J. R.

    2012-12-01

    The secondary metabolism of bacterial cells produces small molecules that can have both medicinal properties and toxigenic effects. This study focuses on mining metatranscriptomes from a tropical eutrophic water reservoir in Singapore experiencing a cyanobacterial Harmful Algal Bloom dominated by Microcystis, to identify the types of secondary metabolites genes being expressed and by what taxa. A phylogenomic approach as implemented in the online tool Natural Product Domain Seeker (NaPDoS) was used. NaPDoS was recently developed to classify ketosynthase and condensation domains from polyketide synthases and non-ribosomal peptide synthetases, respectively, to provide insight into potential types of pathway products. Water samples from the reservoir were collected six times over a day/night cycle. Total RNA was extracted and subjected to ribosomal depletion followed by cDNA synthesis and next-generation Illumina DNA sequencing, generating 493,468 to 678,064 95-101 base pairs post-quality control reads per sample. Evidence for expression of PKS and NRPS type genes based on identification of a ketosynthase and condensation domains are present in all time points. KS domains fall into to two main phylogenetic groups, type I and type II, within the type II group of domains are domains for fatty acid biosynthesis (fab), which is considered a part of primary metabolism. Type I KS domains are part of the classic PKS natural product biosynthetic genes that make things such as antibiotics and other toxins such as microcystin. 2849 KS domains were detected in the combined reservoir samples, of these 1141 were likely from fatty acid biosynthesis and 1708 were related to secondary metabolism type KS domains. The most abundant KS domains (485) besides the fab genes are closely related to a KS domain that is not currently experimentally linked to a known secondary metabolite but the domain is found in four Microcystis genomes along with two other species of cyanobacteria. The three

  2. Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage.

    PubMed

    Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng

    2015-07-01

    The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments.

  3. Text mining in livestock animal science: introducing the potential of text mining to animal sciences.

    PubMed

    Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M

    2012-10-01

    In biological research, establishing the prior art by searching and collecting information already present in the domain has equal importance as the experiments done. To obtain a complete overview about the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured and condensed. The information content in scientific literature is vastly unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics. The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from

  4. 30 CFR 49.13 - Alternative mine rescue capability for small and remote mines.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ..., DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Coal Mines... the operator as to the number of miners willing to serve on a mine rescue team; (8) The operator's...

  5. 30 CFR 49.13 - Alternative mine rescue capability for small and remote mines.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ..., DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Coal Mines... the operator as to the number of miners willing to serve on a mine rescue team; (8) The operator's...

  6. 30 CFR 49.13 - Alternative mine rescue capability for small and remote mines.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ..., DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Coal Mines... the operator as to the number of miners willing to serve on a mine rescue team; (8) The operator's...

  7. 30 CFR 49.13 - Alternative mine rescue capability for small and remote mines.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ..., DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Coal Mines... the operator as to the number of miners willing to serve on a mine rescue team; (8) The operator's...

  8. 30 CFR 49.13 - Alternative mine rescue capability for small and remote mines.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ..., DEPARTMENT OF LABOR EDUCATION AND TRAINING MINE RESCUE TEAMS Mine Rescue Teams for Underground Coal Mines... the operator as to the number of miners willing to serve on a mine rescue team; (8) The operator's...

  9. Engineers of the Future: The Colorado School of Mines' McBride Honors Program.

    ERIC Educational Resources Information Center

    Olds, Barbara M.

    1988-01-01

    More educators argue that science and technology students must be more liberally educated. The McBride Honors Program at Colorado School of Mines addresses the needs of a global society by preparing engineers to be technically competent, with strong communication skills, and knowledge of societal issues. (MLW)

  10. Sustainable Remediation of Legacy Mine Drainage: A Case Study of the Flight 93 National Memorial.

    PubMed

    Emili, Lisa A; Pizarchik, Joseph; Mahan, Carolyn G

    2016-03-01

    Pollution from mining activities is a global environmental concern, not limited to areas of current resource extraction, but including a broader geographic area of historic (legacy) and abandoned mines. The pollution of surface waters from acid mine drainage is a persistent problem and requires a holistic and sustainable approach to addressing the spatial and temporal complexity of mining-specific problems. In this paper, we focus on the environmental, socio-economic, and legal challenges associated with the concurrent activities to remediate a coal mine site and to develop a national memorial following a catastrophic event. We provide a conceptual construct of a socio-ecological system defined at several spatial, temporal, and organizational scales and a critical synthesis of the technical and social learning processes necessary to achieving sustainable environmental remediation. Our case study is an example of a multi-disciplinary management approach, whereby collaborative interaction of stakeholders, the emergence of functional linkages for information exchange, and mediation led to scientifically informed decision making, creative management solutions, and ultimately environmental policy change.

  11. Genomic Location of the Major Ribosomal Protein Gene Locus Determines Vibrio cholerae Global Growth and Infectivity

    PubMed Central

    Soler-Bistué, Alfonso; Mondotte, Juan A.; Bland, Michael Jason; Val, Marie-Eve; Saleh, María-Carla; Mazel, Didier

    2015-01-01

    The effects on cell physiology of gene order within the bacterial chromosome are poorly understood. In silico approaches have shown that genes involved in transcription and translation processes, in particular ribosomal protein (RP) genes, localize near the replication origin (oriC) in fast-growing bacteria suggesting that such a positional bias is an evolutionarily conserved growth-optimization strategy. Such genomic localization could either provide a higher dosage of these genes during fast growth or facilitate the assembly of ribosomes and transcription foci by keeping physically close the many components of these macromolecular machines. To explore this, we used novel recombineering tools to create a set of Vibrio cholerae strains in which S10-spec-α (S10), a locus bearing half of the ribosomal protein genes, was systematically relocated to alternative genomic positions. We show that the relative distance of S10 to the origin of replication tightly correlated with a reduction of S10 dosage, mRNA abundance and growth rate within these otherwise isogenic strains. Furthermore, this was accompanied by a significant reduction in the host-invasion capacity in Drosophila melanogaster. Both phenotypes were rescued in strains bearing two S10 copies highly distal to oriC, demonstrating that replication-dependent gene dosage reduction is the main mechanism behind these alterations. Hence, S10 positioning connects genome structure to cell physiology in Vibrio cholerae. Our results show experimentally for the first time that genomic positioning of genes involved in the flux of genetic information conditions global growth control and hence bacterial physiology and potentially its evolution. PMID:25875621

  12. A study of acid and ferruginous mine water in coal mining operations

    NASA Astrophysics Data System (ADS)

    Atkins, A. S.; Singh, R. N.

    1982-06-01

    The paper describes a bio-chemical investigation in the laboratory to identify various factors which promote the formation of acidic and ferruginous mine water. Biochemical reactions responsible for bacterial oxidation of Iron pyrites are described. The acidic and ferruginous mine water are not only responsible for the corrosion of mine plant and equipment and formation of scales in the delivery pipe range, but also pollution of the mine surface environment, thus affecting the surface ecology. Control measures to mitigate the adverse effects of acid mine discharge include the protection of mining equipment and prevention of formation of acid and ferruginous water. Various control measures discussed in the paper are blending with alkaline or spring water, use of neutralising agents and bactericides, and various types of seals for preventing water and air coming into contact with pyrites in caved mine workings.

  13. Global Identification of Genes Affecting Iron-Sulfur Cluster Biogenesis and Iron Homeostasis

    PubMed Central

    Hidese, Ryota; Kurihara, Tatsuo; Esaki, Nobuyoshi

    2014-01-01

    Iron-sulfur (Fe-S) clusters are ubiquitous cofactors that are crucial for many physiological processes in all organisms. In Escherichia coli, assembly of Fe-S clusters depends on the activity of the iron-sulfur cluster (ISC) assembly and sulfur mobilization (SUF) apparatus. However, the underlying molecular mechanisms and the mechanisms that control Fe-S cluster biogenesis and iron homeostasis are still poorly defined. In this study, we performed a global screen to identify the factors affecting Fe-S cluster biogenesis and iron homeostasis using the Keio collection, which is a library of 3,815 single-gene E. coli knockout mutants. The approach was based on radiolabeling of the cells with [2-14C]dihydrouracil, which entirely depends on the activity of an Fe-S enzyme, dihydropyrimidine dehydrogenase. We identified 49 genes affecting Fe-S cluster biogenesis and/or iron homeostasis, including 23 genes important only under microaerobic/anaerobic conditions. This study defines key proteins associated with Fe-S cluster biogenesis and iron homeostasis, which will aid further understanding of the cellular mechanisms that coordinate the processes. In addition, we applied the [2-14C]dihydrouracil-labeling method to analyze the role of amino acid residues of an Fe-S cluster assembly scaffold (IscU) as a model of the Fe-S cluster assembly apparatus. The analysis showed that Cys37, Cys63, His105, and Cys106 are essential for the function of IscU in vivo, demonstrating the potential of the method to investigate in vivo function of proteins involved in Fe-S cluster assembly. PMID:24415728

  14. Transcriptome meta-analysis reveals common differential and global gene expression profiles in cystic fibrosis and other respiratory disorders and identifies CFTR regulators.

    PubMed

    Clarke, Luka A; Botelho, Hugo M; Sousa, Lisete; Falcao, Andre O; Amaral, Margarida D

    2015-11-01

    A meta-analysis of 13 independent microarray data sets was performed and gene expression profiles from cystic fibrosis (CF), similar disorders (COPD: chronic obstructive pulmonary disease, IPF: idiopathic pulmonary fibrosis, asthma), environmental conditions (smoking, epithelial injury), related cellular processes (epithelial differentiation/regeneration), and non-respiratory "control" conditions (schizophrenia, dieting), were compared. Similarity among differentially expressed (DE) gene lists was assessed using a permutation test, and a clustergram was constructed, identifying common gene markers. Global gene expression values were standardized using a novel approach, revealing that similarities between independent data sets run deeper than shared DE genes. Correlation of gene expression values identified putative gene regulators of the CF transmembrane conductance regulator (CFTR) gene, of potential therapeutic significance. Our study provides a novel perspective on CF epithelial gene expression in the context of other lung disorders and conditions, and highlights the contribution of differentiation/EMT and injury to gene signatures of respiratory disease. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Archaeal Diversity in Waters from Deep South African Gold Mines

    PubMed Central

    Takai, Ken; Moser, Duane P.; DeFlaun, Mary; Onstott, Tullis C.; Fredrickson, James K.

    2001-01-01

    A culture-independent molecular analysis of archaeal communities in waters collected from deep South African gold mines was performed by performing a PCR-mediated terminal restriction fragment length polymorphism (T-RFLP) analysis of rRNA genes (rDNA) in conjunction with a sequencing analysis of archaeal rDNA clone libraries. The water samples used represented various environments, including deep fissure water, mine service water, and water from an overlying dolomite aquifer. T-RFLP analysis revealed that the ribotype distribution of archaea varied with the source of water. The archaeal communities in the deep gold mine environments exhibited great phylogenetic diversity; the majority of the members were most closely related to uncultivated species. Some archaeal rDNA clones obtained from mine service water and dolomite aquifer water samples were most closely related to environmental rDNA clones from surface soil (soil clones) and marine environments (marine group I [MGI]). Other clones exhibited intermediate phylogenetic affiliation between soil clones and MGI in the Crenarchaeota. Fissure water samples, derived from active or dormant geothermal environments, yielded archaeal sequences that exhibited novel phylogeny, including a novel lineage of Euryarchaeota. These results suggest that deep South African gold mines harbor novel archaeal communities distinct from those observed in other environments. Based on the phylogenetic analysis of archaeal strains and rDNA clones, including the newly discovered archaeal rDNA clones, the evolutionary relationship and the phylogenetic organization of the domain Archaea are reevaluated. PMID:11722932

  16. Global map of physical interactions among differentially expressed genes in multiple sclerosis relapses and remissions.

    PubMed

    Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat

    2011-09-15

    Multiple sclerosis (MS) is a central nervous system autoimmune inflammatory T-cell-mediated disease with a relapsing-remitting course in the majority of patients. In this study, we performed a high-resolution systems biology analysis of gene expression and physical interactions in MS relapse and remission. To this end, we integrated 164 large-scale measurements of gene expression in peripheral blood mononuclear cells of MS patients in relapse or remission and healthy subjects, with large-scale information about the physical interactions between these genes obtained from public databases. These data were analyzed with a variety of computational methods. We find that there is a clear and significant global network-level signal that is related to the changes in gene expression of MS patients in comparison to healthy subjects. However, despite the clear differences in the clinical symptoms of MS patients in relapse versus remission, the network level signal is weaker when comparing patients in these two stages of the disease. This result suggests that most of the genes have relatively similar expression levels in the two stages of the disease. In accordance with previous studies, we found that the pathways related to regulation of cell death, chemotaxis and inflammatory response are differentially expressed in the disease in comparison to healthy subjects, while pathways related to cell adhesion, cell migration and cell-cell signaling are activated in relapse in comparison to remission. However, the current study includes a detailed report of the exact set of genes involved in these pathways and the interactions between them. For example, we found that the genes TP53 and IL1 are 'network-hub' that interacts with many of the differentially expressed genes in MS patients versus healthy subjects, and the epidermal growth factor receptor is a 'network-hub' in the case of MS patients with relapse versus remission. The statistical approaches employed in this study enabled us

  17. GeneChip Expression Profiling Reveals the Alterations of Energy Metabolism Related Genes in Osteocytes under Large Gradient High Magnetic Fields

    PubMed Central

    Wang, Yang; Chen, Zhi-Hao; Yin, Chun; Ma, Jian-Hua; Li, Di-Jie; Zhao, Fan; Sun, Yu-Long; Hu, Li-Fang; Shang, Peng; Qian, Ai-Rong

    2015-01-01

    The diamagnetic levitation as a novel ground-based model for simulating a reduced gravity environment has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF), which can provide three apparent gravity levels (μ-g, 1-g, and 2-g), was used to simulate a space-like gravity environment. Osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating the mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs) and data mining were further analyzed by using bioinfomatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84) were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notablly. Our results demostrated that LG-HMF affected the expression of energy metabolism related genes in osteocyte. The identification of sensitive genes to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis. PMID:25635858

  18. GeneChip expression profiling reveals the alterations of energy metabolism related genes in osteocytes under large gradient high magnetic fields.

    PubMed

    Wang, Yang; Chen, Zhi-Hao; Yin, Chun; Ma, Jian-Hua; Li, Di-Jie; Zhao, Fan; Sun, Yu-Long; Hu, Li-Fang; Shang, Peng; Qian, Ai-Rong

    2015-01-01

    The diamagnetic levitation as a novel ground-based model for simulating a reduced gravity environment has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF), which can provide three apparent gravity levels (μ-g, 1-g, and 2-g), was used to simulate a space-like gravity environment. Osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating the mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs) and data mining were further analyzed by using bioinfomatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84) were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notablly. Our results demostrated that LG-HMF affected the expression of energy metabolism related genes in osteocyte. The identification of sensitive genes to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis.

  19. A new genome-mining tool redefines the lasso peptide biosynthetic landscape

    PubMed Central

    Tietz, Jonathan I.; Schwalen, Christopher J.; Patel, Parth S.; Maxson, Tucker; Blair, Patricia M.; Tai, Hua-Chia; Zakai, Uzma I.; Mitchell, Douglas A.

    2016-01-01

    Ribosomally synthesized and post-translationally modified peptide (RiPP) natural products are attractive for genome-driven discovery and re-engineering, but limitations in bioinformatic methods and exponentially increasing genomic data make large-scale mining difficult. We report RODEO (Rapid ORF Description and Evaluation Online), which combines hidden Markov model-based analysis, heuristic scoring, and machine learning to identify biosynthetic gene clusters and predict RiPP precursor peptides. We initially focused on lasso peptides, which display intriguing physiochemical properties and bioactivities, but their hypervariability renders them challenging prospects for automated mining. Our approach yielded the most comprehensive mapping of lasso peptide space, revealing >1,300 compounds. We characterized the structures and bioactivities of six lasso peptides, prioritized based on predicted structural novelty, including an unprecedented handcuff-like topology and another with a citrulline modification exceptionally rare among bacteria. These combined insights significantly expand the knowledge of lasso peptides, and more broadly, provide a framework for future genome-mining efforts. PMID:28244986

  20. Mining Microarray Data at NCBI’s Gene Expression Omnibus (GEO)*

    PubMed Central

    Barrett, Tanya; Edgar, Ron

    2006-01-01

    Summary The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) has emerged as the leading fully public repository for gene expression data. This chapter describes how to use Web-based interfaces, applications, and graphics to effectively explore, visualize, and interpret the hundreds of microarray studies and millions of gene expression patterns stored in GEO. Data can be examined from both experiment-centric and gene-centric perspectives using user-friendly tools that do not require specialized expertise in microarray analysis or time-consuming download of massive data sets. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo. PMID:16888359

  1. Plant growth-promoting bacteria for phytostabilization of mine tailings.

    PubMed

    Grandlic, Christopher J; Mendez, Monica O; Chorover, Jon; Machado, Blenda; Maier, Raina M

    2008-03-15

    Eolian dispersion of mine tailings in arid and semiarid environments is an emerging global issue for which economical remediation alternatives are needed. Phytostabilization, the revegetation of these sites with native plants, is one such alternative. Revegetation often requires the addition of bulky amendments such as compost which greatly increases cost. We report the use of plant growth-promoting bacteria (PGPB) to enhance the revegetation of mine tailings and minimize the need for compost amendment. Twenty promising PGPB isolates were used as seed inoculants in a series of greenhouse studies to examine revegetation of an extremely acidic, high metal contenttailings sample previously shown to require 15% compost amendment for normal plant growth. Several isolates significantly enhanced growth of two native species, quailbush and buffalo grass, in tailings. In this study, PGPB/compost outcomes were plant specific; for quailbush, PGPB were most effective in combination with 10% compost addition while for buffalo grass, PGPB enhanced growth in the complete absence of compost. Results indicate that selected PGPB can improve plant establishment and reduce the need for compost amendment. Further, PGPB activities necessary for aiding plant growth in mine tailings likely include tolerance to acidic pH and metals.

  2. Survey of nine surface mines in North America. [Nine different mines in USA and Canada

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hayes, L.G.; Brackett, R.D.; Floyd, F.D.

    This report presents the information gathered by three mining engineers in a 1980 survey of nine surface mines in the United States and Canada. The mines visited included seven coal mines, one copper mine, and one tar sands mine selected as representative of present state of the art in open pit, strip, and terrace pit mining. The purpose of the survey was to investigate mining methods, equipment requirements, operating costs, reclamation procedures and costs, and other aspects of current surface mining practices in order to acquire basic data for a study comparing conventional and terrace pit mining methods, particularly inmore » deeper overburdens. The survey was conducted as part of a project under DOE Contract No. DE-AC01-79ET10023 titled The Development of Optimal Terrace Pit Coal Mining Systems.« less

  3. Mining the pharmacogenomics literature--a survey of the state of the art.

    PubMed

    Hahn, Udo; Cohen, K Bretonnel; Garten, Yael; Shah, Nigam H

    2012-07-01

    This article surveys efforts on text mining of the pharmacogenomics literature, mainly from the period 2008 to 2011. Pharmacogenomics (or pharmacogenetics) is the field that studies how human genetic variation impacts drug response. Therefore, publications span the intersection of research in genotypes, phenotypes and pharmacology, a topic that has increasingly become a focus of active research in recent years. This survey covers efforts dealing with the automatic recognition of relevant named entities (e.g. genes, gene variants and proteins, diseases and other pathological phenomena, drugs and other chemicals relevant for medical treatment), as well as various forms of relations between them. A wide range of text genres is considered, such as scientific publications (abstracts, as well as full texts), patent texts and clinical narratives. We also discuss infrastructure and resources needed for advanced text analytics, e.g. document corpora annotated with corresponding semantic metadata (gold standards and training data), biomedical terminologies and ontologies providing domain-specific background knowledge at different levels of formality and specificity, software architectures for building complex and scalable text analytics pipelines and Web services grounded to them, as well as comprehensive ways to disseminate and interact with the typically huge amounts of semiformal knowledge structures extracted by text mining tools. Finally, we consider some of the novel applications that have already been developed in the field of pharmacogenomic text mining and point out perspectives for future research.

  4. Visual information mining in remote sensing image archives

    NASA Astrophysics Data System (ADS)

    Pelizzari, Andrea; Descargues, Vincent; Datcu, Mihai P.

    2002-01-01

    The present article focuses on the development of interactive exploratory tools for visually mining the image content in large remote sensing archives. Two aspects are treated: the iconic visualization of the global information in the archive and the progressive visualization of the image details. The proposed methods are integrated in the Image Information Mining (I2M) system. The images and image structure in the I2M system are indexed based on a probabilistic approach. The resulting links are managed by a relational data base. Both the intrinsic complexity of the observed images and the diversity of user requests result in a great number of associations in the data base. Thus new tools have been designed to visualize, in iconic representation the relationships created during a query or information mining operation: the visualization of the query results positioned on the geographical map, quick-looks gallery, visualization of the measure of goodness of the query, visualization of the image space for statistical evaluation purposes. Additionally the I2M system is enhanced with progressive detail visualization in order to allow better access for operator inspection. I2M is a three-tier Java architecture and is optimized for the Internet.

  5. Monitoring Metal Pollution Levels in Mine Wastes around a Coal Mine Site Using GIS

    NASA Astrophysics Data System (ADS)

    Sanliyuksel Yucel, D.; Yucel, M. A.; Ileri, B.

    2017-11-01

    In this case study, metal pollution levels in mine wastes at a coal mine site in Etili coal mine (Can coal basin, NW Turkey) are evaluated using geographical information system (GIS) tools. Etili coal mine was operated since the 1980s as an open pit. Acid mine drainage is the main environmental problem around the coal mine. The main environmental contamination source is mine wastes stored around the mine site. Mine wastes were dumped over an extensive area along the riverbeds, and are now abandoned. Mine waste samples were homogenously taken at 10 locations within the sampling area of 102.33 ha. The paste pH and electrical conductivity values of mine wastes ranged from 2.87 to 4.17 and 432 to 2430 μS/cm, respectively. Maximum Al, Fe, Mn, Pb, Zn and Ni concentrations of wastes were measured as 109300, 70600, 309.86, 115.2, 38 and 5.3 mg/kg, respectively. The Al, Fe and Pb concentrations of mine wastes are higher than world surface rock average values. The geochemical analysis results from the study area were presented in the form of maps. The GIS based environmental database will serve as a reference study for our future work.

  6. Global Analysis of Gene Expression Profiles in Physic Nut (Jatropha curcas L.) Seedlings Exposed to Salt Stress

    PubMed Central

    Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2014-01-01

    Background Salt stress interferes with plant growth and production. Plants have evolved a series of molecular and morphological adaptations to cope with this abiotic stress, and overexpression of salt response genes reportedly enhances the productivity of various crops. However, little is known about the salt responsive genes in the energy plant physic nut (Jatropha curcas L.). Thus, excavate salt responsive genes in this plant are informative in uncovering the molecular mechanisms for the salt response in physic nut. Methodology/Principal Findings We applied next-generation Illumina sequencing technology to analyze global gene expression profiles of physic nut plants (roots and leaves) 2 hours, 2 days and 7 days after the onset of salt stress. A total of 1,504 and 1,115 genes were significantly up and down-regulated in roots and leaves, respectively, under salt stress condition. Gene ontology (GO) analysis of physiological process revealed that, in the physic nut, many “biological processes” were affected by salt stress, particular those categories belong to “metabolic process”, such as “primary metabolism process”, “cellular metabolism process” and “macromolecule metabolism process”. The gene expression profiles indicated that the associated genes were responsible for ABA and ethylene signaling, osmotic regulation, the reactive oxygen species scavenging system and the cell structure in physic nut. Conclusions/Significance The major regulated genes detected in this transcriptomic data were related to trehalose synthesis and cell wall structure modification in roots, while related to raffinose synthesis and reactive oxygen scavenger in leaves. The current study shows a comprehensive gene expression profile of physic nut under salt stress. The differential expression genes detected in this study allows the underling the salt responsive mechanism in physic nut with the aim of improving its salt resistance in the future. PMID:24837971

  7. Global analysis of gene expression profiles in physic nut (Jatropha curcas L.) seedlings exposed to salt stress.

    PubMed

    Zhang, Lin; Zhang, Chao; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2014-01-01

    Salt stress interferes with plant growth and production. Plants have evolved a series of molecular and morphological adaptations to cope with this abiotic stress, and overexpression of salt response genes reportedly enhances the productivity of various crops. However, little is known about the salt responsive genes in the energy plant physic nut (Jatropha curcas L.). Thus, excavate salt responsive genes in this plant are informative in uncovering the molecular mechanisms for the salt response in physic nut. We applied next-generation Illumina sequencing technology to analyze global gene expression profiles of physic nut plants (roots and leaves) 2 hours, 2 days and 7 days after the onset of salt stress. A total of 1,504 and 1,115 genes were significantly up and down-regulated in roots and leaves, respectively, under salt stress condition. Gene ontology (GO) analysis of physiological process revealed that, in the physic nut, many "biological processes" were affected by salt stress, particular those categories belong to "metabolic process", such as "primary metabolism process", "cellular metabolism process" and "macromolecule metabolism process". The gene expression profiles indicated that the associated genes were responsible for ABA and ethylene signaling, osmotic regulation, the reactive oxygen species scavenging system and the cell structure in physic nut. The major regulated genes detected in this transcriptomic data were related to trehalose synthesis and cell wall structure modification in roots, while related to raffinose synthesis and reactive oxygen scavenger in leaves. The current study shows a comprehensive gene expression profile of physic nut under salt stress. The differential expression genes detected in this study allows the underling the salt responsive mechanism in physic nut with the aim of improving its salt resistance in the future.

  8. Abandoned Uranium Mine (AUM) Trust Mine Points, Navajo Nation, 2016, US EPA Region 9

    EPA Pesticide Factsheets

    This GIS dataset contains point features that represent mines included in the Navajo Environmental Response Trust. This mine category also includes Priority mines. USEPA and NNEPA prioritized mines based on gamma radiation levels, proximity to homes and potential for water contamination identified in the preliminary assessments. Attributes include mine names, reclaimed status, links to US EPA AUM reports, and the region in which the mine is located. This dataset contains 19 features.

  9. Text Mining for Precision Medicine: Bringing structure to EHRs and biomedical literature to understand genes and health

    PubMed Central

    Simmons, Michael; Singhal, Ayush; Lu, Zhiyong

    2018-01-01

    The key question of precision medicine is whether it is possible to find clinically actionable granularity in diagnosing disease and classifying patient risk. The advent of next generation sequencing and the widespread adoption of electronic health records (EHRs) have provided clinicians and researchers a wealth of data and made possible the precise characterization of individual patient genotypes and phenotypes. Unstructured text — found in biomedical publications and clinical notes — is an important component of genotype and phenotype knowledge. Publications in the biomedical literature provide essential information for interpreting genetic data. Likewise, clinical notes contain the richest source of phenotype information in EHRs. Text mining can render these texts computationally accessible and support information extraction and hypothesis generation. This chapter reviews the mechanics of text mining in precision medicine and discusses several specific use cases, including database curation for personalized cancer medicine, patient outcome prediction from EHR-derived cohorts, and pharmacogenomic research. Taken as a whole, these use cases demonstrate how text mining enables effective utilization of existing knowledge sources and thus promotes increased value for patients and healthcare systems. Text mining is an indispensable tool for translating genotype-phenotype data into effective clinical care that will undoubtedly play an important role in the eventual realization of precision medicine. PMID:27807747

  10. Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.

    PubMed

    Simmons, Michael; Singhal, Ayush; Lu, Zhiyong

    2016-01-01

    The key question of precision medicine is whether it is possible to find clinically actionable granularity in diagnosing disease and classifying patient risk. The advent of next-generation sequencing and the widespread adoption of electronic health records (EHRs) have provided clinicians and researchers a wealth of data and made possible the precise characterization of individual patient genotypes and phenotypes. Unstructured text-found in biomedical publications and clinical notes-is an important component of genotype and phenotype knowledge. Publications in the biomedical literature provide essential information for interpreting genetic data. Likewise, clinical notes contain the richest source of phenotype information in EHRs. Text mining can render these texts computationally accessible and support information extraction and hypothesis generation. This chapter reviews the mechanics of text mining in precision medicine and discusses several specific use cases, including database curation for personalized cancer medicine, patient outcome prediction from EHR-derived cohorts, and pharmacogenomic research. Taken as a whole, these use cases demonstrate how text mining enables effective utilization of existing knowledge sources and thus promotes increased value for patients and healthcare systems. Text mining is an indispensable tool for translating genotype-phenotype data into effective clinical care that will undoubtedly play an important role in the eventual realization of precision medicine.

  11. A systems level predictive model for global gene regulation of methanogenesis in a hydrogenotrophic methanogen

    PubMed Central

    Yoon, Sung Ho; Turkarslan, Serdar; Reiss, David J.; Pan, Min; Burn, June A.; Costa, Kyle C.; Lie, Thomas J.; Slagel, Joseph; Moritz, Robert L.; Hackett, Murray; Leigh, John A.; Baliga, Nitin S.

    2013-01-01

    Methanogens catalyze the critical methane-producing step (called methanogenesis) in the anaerobic decomposition of organic matter. Here, we present the first predictive model of global gene regulation of methanogenesis in a hydrogenotrophic methanogen, Methanococcus maripaludis. We generated a comprehensive list of genes (protein-coding and noncoding) for M. maripaludis through integrated analysis of the transcriptome structure and a newly constructed Peptide Atlas. The environment and gene-regulatory influence network (EGRIN) model of the strain was constructed from a compendium of transcriptome data that was collected over 58 different steady-state and time-course experiments that were performed in chemostats or batch cultures under a spectrum of environmental perturbations that modulated methanogenesis. Analyses of the EGRIN model have revealed novel components of methanogenesis that included at least three additional protein-coding genes of previously unknown function as well as one noncoding RNA. We discovered that at least five regulatory mechanisms act in a combinatorial scheme to intercoordinate key steps of methanogenesis with different processes such as motility, ATP biosynthesis, and carbon assimilation. Through a combination of genetic and environmental perturbation experiments we have validated the EGRIN-predicted role of two novel transcription factors in the regulation of phosphate-dependent repression of formate dehydrogenase—a key enzyme in the methanogenesis pathway. The EGRIN model demonstrates regulatory affiliations within methanogenesis as well as between methanogenesis and other cellular functions. PMID:24089473

  12. On the classification techniques in data mining for microarray data classification

    NASA Astrophysics Data System (ADS)

    Aydadenta, Husna; Adiwijaya

    2018-03-01

    Cancer is one of the deadly diseases, according to data from WHO by 2015 there are 8.8 million more deaths caused by cancer, and this will increase every year if not resolved earlier. Microarray data has become one of the most popular cancer-identification studies in the field of health, since microarray data can be used to look at levels of gene expression in certain cell samples that serve to analyze thousands of genes simultaneously. By using data mining technique, we can classify the sample of microarray data thus it can be identified with cancer or not. In this paper we will discuss some research using some data mining techniques using microarray data, such as Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5, and simulation of Random Forest algorithm with technique of reduction dimension using Relief. The result of this paper show performance measure (accuracy) from classification algorithm (SVM, ANN, Naive Bayes, kNN, C4.5, and Random Forets).The results in this paper show the accuracy of Random Forest algorithm higher than other classification algorithms (Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5). It is hoped that this paper can provide some information about the speed, accuracy, performance and computational cost generated from each Data Mining Classification Technique based on microarray data.

  13. Drakelands Mine, England

    NASA Image and Video Library

    2015-08-21

    The Drakelands Mine (previously known as the Hemerdon Mine) is a historic tungsten and tin mine located northeast of Plymouth, England. Tin and tungsten deposits were discovered in 1867, and the mine operated until 1944. Last year work started to re-open the mine, as it hosts the fourth-largest tungsten and tin deposits in the world. Tungsten has innumerable uses due to its incredible density and high melting temperature. Yet more than 80% of world supply is controlled by China, who has imposed restriction on export of the metal. The image covers an area of 17 by 18.9 km, was acquired June 5, 2013, and is located at 50.4 degrees north, 4 degrees west. http://photojournal.jpl.nasa.gov/catalog/PIA19757

  14. PRB mines mature

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buchsbaum, L.

    2007-08-15

    Already seeing the results of reclamation efforts, America's largest surface mines advance as engineers prepare for the future. 30 years after the signing of the Surface Mining Control and Reclamation Act by Jimmy Carter, western strip mines in the USA, especially in the Powder River Basin, are producing more coal than ever. The article describes the construction and installation of a $38.5 million near-pit crusher and overland belt conveyor system at Foundation Coal West's (FCW) Belle Ayr surface mine in Wyoming, one of the earliest PRB mines. It goes on to describe the development by Rio Tinto of an elkmore » conservatory, the Rochelle Hill Conservation Easement, on reclaimed land at Jacobs Ranch, adjacent to the Rochelle Hills. 4 photos.« less

  15. A baseline lunar mine

    NASA Technical Reports Server (NTRS)

    Gertsch, Richard E.

    1992-01-01

    A models lunar mining method is proposed that illustrates the problems to be expected in lunar mining and how they might be solved. While the method is quite feasible, it is, more importantly, a useful baseline system against which to test other, possible better, methods. Our study group proposed the slusher to stimulate discussion of how a lunar mining operation might be successfully accomplished. Critics of the slusher system were invited to propose better methods. The group noted that while nonterrestrial mining has been a vital part of past space manufacturing proposals, no one has proposed a lunar mining system in any real detail. The group considered it essential that the design of actual, workable, and specific lunar mining methods begin immediately. Based on an earlier proposal, the method is a three-drum slusher, also known as a cable-operated drag scraper. Its terrestrial application is quite limited, as it is relatively inefficient and inflexible. The method usually finds use in underwater mining from the shore and in moving small amounts of ore underground. When lunar mining scales up, the lunarized slusher will be replaced by more efficient, high-volume methods. Other aspects of lunar mining are discussed.

  16. Graph mining for next generation sequencing: leveraging the assembly graph for biological insights.

    PubMed

    Warnke-Sommer, Julia; Ali, Hesham

    2016-05-06

    The assembly of Next Generation Sequencing (NGS) reads remains a challenging task. This is especially true for the assembly of metagenomics data that originate from environmental samples potentially containing hundreds to thousands of unique species. The principle objective of current assembly tools is to assemble NGS reads into contiguous stretches of sequence called contigs while maximizing for both accuracy and contig length. The end goal of this process is to produce longer contigs with the major focus being on assembly only. Sequence read assembly is an aggregative process, during which read overlap relationship information is lost as reads are merged into longer sequences or contigs. The assembly graph is information rich and capable of capturing the genomic architecture of an input read data set. We have developed a novel hybrid graph in which nodes represent sequence regions at different levels of granularity. This model, utilized in the assembly and analysis pipeline Focus, presents a concise yet feature rich view of a given input data set, allowing for the extraction of biologically relevant graph structures for graph mining purposes. Focus was used to create hybrid graphs to model metagenomics data sets obtained from the gut microbiomes of five individuals with Crohn's disease and eight healthy individuals. Repetitive and mobile genetic elements are found to be associated with hybrid graph structure. Using graph mining techniques, a comparative study of the Crohn's disease and healthy data sets was conducted with focus on antibiotics resistance genes associated with transposase genes. Results demonstrated significant differences in the phylogenetic distribution of categories of antibiotics resistance genes in the healthy and diseased patients. Focus was also evaluated as a pure assembly tool and produced excellent results when compared against the Meta-velvet, Omega, and UD-IDBA assemblers. Mining the hybrid graph can reveal biological phenomena captured

  17. Mutations in TET2 and DNMT3A genes are associated with changes in global and gene-specific methylation in acute myeloid leukemia.

    PubMed

    Ponciano-Gómez, Alberto; Martínez-Tovar, Adolfo; Vela-Ojeda, Jorge; Olarte-Carrillo, Irma; Centeno-Cruz, Federico; Garrido, Efraín

    2017-10-01

    Acute myeloid leukemia is characterized by its high biological and clinical heterogeneity, which represents an important barrier for a precise disease classification and accurate therapy. While epigenetic aberrations play a pivotal role in acute myeloid leukemia pathophysiology, molecular signatures such as change in the DNA methylation patterns and genetic mutations in enzymes needed to the methylation process can also be helpful for classifying acute myeloid leukemia. Our study aims to unveil the relevance of DNMT3A and TET2 genes in global and specific methylation patterns in acute myeloid leukemia. Peripheral blood samples from 110 untreated patients with acute myeloid leukemia and 15 healthy control individuals were collected. Global 5-methylcytosine and 5-hydroxymethylcytosine in genomic DNA from peripheral blood leukocytes were measured by using the MethylFlashTM Quantification kits. DNMT3A and TET2 expression levels were evaluated by real-time quantitative polymerase chain reaction. The R882A hotspot of DNMT3A and exons 6-10 of TET2 were amplified by polymerase chain reaction and sequenced using the Sanger method. Methylation patterns of 16 gene promoters were evaluated by pyrosequencing after treating DNA with sodium bisulfite, and their transcriptional products were measured by real-time quantitative polymerase chain reaction.Here, we demonstrate altered levels of 5-methylcytosine and 5-hydroxymethylcytosine and highly variable transcript levels of DNMT3A and TET2 in peripheral blood leukocytes from acute myeloid leukemia patients. We found a mutation prevalence of 2.7% for DNMT3A and 11.8% for TET2 in the Mexican population with this disease. The average overall survival of acute myeloid leukemia patients with DNMT3A mutations was only 4 months. In addition, we showed that mutations in DNMT3A and TET2 may cause irregular DNA methylation patterns and transcriptional expression levels in 16 genes known to be involved in acute myeloid leukemia pathogenesis

  18. Lunar vertical-shaft mining system

    NASA Technical Reports Server (NTRS)

    Introne, Steven D. (Editor); Krause, Roy; Williams, Erik; Baskette, Keith; Martich, Frederick; Weaver, Brad; Meve, Jeff; Alexander, Kyle; Dailey, Ron; White, Matt

    1994-01-01

    This report proposes a method that will allow lunar vertical-shaft mining. Lunar mining allows the exploitation of mineral resources imbedded within the surface. The proposed lunar vertical-shaft mining system is comprised of five subsystems: structure, materials handling, drilling, mining, and planning. The structure provides support for the exploration and mining equipment in the lunar environment. The materials handling subsystem moves mined material outside the structure and mining and drilling equipment inside the structure. The drilling process bores into the surface for the purpose of collecting soil samples, inserting transducer probes, or locating ore deposits. Once the ore deposits are discovered and pinpointed, mining operations bring the ore to the surface. The final subsystem is planning, which involves the construction of the mining structure.

  19. Environmental impact assessment of european non-ferro mining industries through life-cycle assessment

    NASA Astrophysics Data System (ADS)

    Hisan Farjana, Shahjadi; Huda, Nazmul; Parvez Mahmud, M. A.

    2018-05-01

    European mining industries are the vast industrial sector which contributes largely on their economy which constitutes of ferro and non-ferro metals and minerals industries. The non-ferro metals extraction and processing industries require focus of attention due to sustainability concerns as their manufacturing processes are highly energy intensive and impacts globally on environment. This paper analyses major environmental effects caused by European metal industries based on the life-cycle impact analysis technologies. This research work is the first work in considering the comparative environmental impact analysis of European non-ferro metal industries which will reveal their technological similarities and dissimilarities to assess their environmental loads. The life-cycle inventory datasets are collected from the EcoInvent database while the analysis is done using the CML baseline and ReCipe endpoint method using SimaPro software version 8.4. The CML and ReCipe method are chosen because they are specialized impact assessment methods for European continent. The impact categories outlined for discussion here are human health, global warming and ecotoxicity. The analysis results reveal that the gold industry is vulnerable for the environment due to waste emission and similar result retained by silver mines a little bit. But copper, lead, manganese and zinc mining processes and industries are environment friendly in terms of metal extraction technologies and waste emissions.

  20. Mining Critical Metals and Elements from Seawater: Opportunities and Challenges.

    PubMed

    Diallo, Mamadou S; Kotte, Madhusudhana Rao; Cho, Manki

    2015-08-18

    The availability and sustainable supply of technology metals and valuable elements is critical to the global economy. There is a growing realization that the development and deployment of the clean energy technologies and sustainable products and manufacturing industries of the 21st century will require large amounts of critical metals and valuable elements including rare-earth elements (REEs), platinum group metals (PGMs), lithium, copper, cobalt, silver, and gold. Advances in industrial ecology, water purification, and resource recovery have established that seawater is an important and largely untapped source of technology metals and valuable elements. This feature article discusses the opportunities and challenges of mining critical metals and elements from seawater. We highlight recent advances and provide an outlook of the future of metal mining and resource recovery from seawater.

  1. Mine design: Long term effects of high extraction mining

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jeran, P.W.

    1996-12-31

    A consideration when designing a high extraction coal mine is the effects that mining will have on the ground above the mine. This becomes particularly important when the surface has been improved or is inhabited. Surface owners are concerned about; when the effects will begin? how large will they be? and how long they will last? Each of these should be addressed by the designer. For more than a decade, the US Bureau of Mines (USBM) has been monitoring subsidence at various sites. Based upon the data gathered, some inferences may be made regarding the above stated questions. Essentially surfacemore » movement begins with undermining. The magnitude of the movements are proportional to the thickness extracted and the width of the mined area, and inversely proportional to the depth of the mine below surface. The duration of the subsidence process in the northern Appalachian Basin is approximately one year. The USBM has developed a computer model which predicts the final subsidence profile across a longwall panel in the northern Appalachian Coal Basin. USBM studies on the dynamic development of subsidence have shown that the magnitude of the deformations developed during the subsidence process never exceed those exhibited in the final subsidence profile. Use of the model will provide engineers with a starting point in the design process.« less

  2. Global analysis of transcriptome responses and gene expression profiles to cold stress of Jatropha curcas L.

    PubMed

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance

  3. Acid-base accounting to predict post-mining drainage quality on surface mines.

    PubMed

    Skousen, J; Simmons, J; McDonald, L M; Ziemkiewicz, P

    2002-01-01

    Acid-base accounting (ABA) is an analytical procedure that provides values to help assess the acid-producing and acid-neutralizing potential of overburden rocks prior to coal mining and other large-scale excavations. This procedure was developed by West Virginia University scientists during the 1960s. After the passage of laws requiring an assessment of surface mining on water quality, ABA became a preferred method to predict post-mining water quality, and permitting decisions for surface mines are largely based on the values determined by ABA. To predict the post-mining water quality, the amount of acid-producing rock is compared with the amount of acid-neutralizing rock, and a prediction of the water quality at the site (whether acid or alkaline) is obtained. We gathered geologic and geographic data for 56 mined sites in West Virginia, which allowed us to estimate total overburden amounts, and values were determined for maximum potential acidity (MPA), neutralization potential (NP), net neutralization potential (NNP), and NP to MPA ratios for each site based on ABA. These values were correlated to post-mining water quality from springs or seeps on the mined property. Overburden mass was determined by three methods, with the method used by Pennsylvania researchers showing the most accurate results for overburden mass. A poor relationship existed between MPA and post-mining water quality, NP was intermediate, and NNP and the NP to MPA ratio showed the best prediction accuracy. In this study, NNP and the NP to MPA ratio gave identical water quality prediction results. Therefore, with NP to MPA ratios, values were separated into categories: <1 should produce acid drainage, between 1 and 2 can produce either acid or alkaline water conditions, and >2 should produce alkaline water. On our 56 surface mined sites, NP to MPA ratios varied from 0.1 to 31, and six sites (11%) did not fit the expected pattern using this category approach. Two sites with ratios <1 did not

  4. Seismo-ionospheric Precursors in the GPS Total Electron Content of the 16 October 1999 Mw7.1 Hector Mine Earthquake

    NASA Astrophysics Data System (ADS)

    Tsai, H.; Su, Y.; Liu, J. G.; Chen, S.; Chen, M.

    2013-12-01

    In this paper, temporal and spatial analyses are employed to detect seismo-ionospheric precursors (SIPs) in the ionospheric total electron content (TEC) during 16 October 1999 Mw7.1 Hector Mine earthquake. To discriminate anomalies caused by global effects, such as solar radiations, magnetic storms, etc., and local effects, such as earthquake, we cross-examine the GPS TECs and their gradients in the eastward and northward directions at epicenter/centers of the Hector Mine area and the other two reference areas at similar magnetic latitudes in Europe and Japan. Temporal variations of the northward TEC gradient suggest SIPs most likely appearing day 6-5 before the earthquake. A global search by using the TEC of GIM (global ionosphere map) shows that the TEC increase and decrease anomalies continuously and specifically appear around the epicenter day 5 before the earthquake.

  5. Atmospheric carbon mineralization in an industrial-scale chrysotile mining waste pile.

    PubMed

    Nowamooz, Ali; Dupuis, J Christian; Beaudoin, Georges; Molson, John; Lemieux, Jean-Michel; Horswill, Micha; Fortier, Richard; Larachi, Faïçal; Maldague, Xavier; Constantin, Marc; Duchesne, Josee; Therrien, René

    2018-06-12

    Magnesium rich minerals that are abundant in ultramafic mining waste have the potential to be used as a safe and permanent sequestration solution for carbon dioxide (CO2). Our understanding of thermo-hydro-chemical regimes that govern this reaction at an industrial scale, however, has remained an important challenge to its widespread implementation. Through a year-long monitoring experiment performed at a 110Mt chrysotile waste pile, we have documented the existence of two distinct thermo-hydro-chemical regimes that control the ingress of CO2 and the subsequent mineral carbonation of the waste. The experimental results are supported by coupled free-air/porous media numerical flow and transport model that provides insights into optimization strategies to increase the efficiency of mineral sequestration at an industrial-scale. Although functioning passively under less than optimal conditions compared to lab-scale experiments, the 110Mt Thetford Mines pile is nevertheless estimated to be sequestering up to 100 tonnes of CO2 per year, with a potential total carbon capture capacity under optimal conditions of 3 Mt. Yearly, over 100 Mt of ultramafic mine waste suitable for mineral carbonation are generated by the global mining industry. Our results show that this waste material could become a safe and permanent carbon sink for diffuse sources of CO2.

  6. Abundance and activity of 16S rRNA, amoA and nifH bacterial genes during assisted phytostabilization of mine tailings

    PubMed Central

    Nelson, Karis N.; Neilson, Julia W.; Root, Robert A.; Chorover, Jon; Maier, Raina M.

    2014-01-01

    Mine tailings in semiarid regions are highly susceptible to erosion and are sources of dust pollution and potential avenues of human exposure to toxic metals. One constraint to revegetation of tailings by phytostabilization is the absence of microbial communities critical for biogeochemical cycling of plant nutrients. The objective of this study was to evaluate specific genes as in situ indicators of biological soil response during phytoremediation. The abundance and activity of 16S rRNA, nifH, and amoA were monitored during a nine month phytostabilization study using buffalo grass and quailbush grown in compost-amended, metalliferous tailings. The compost amendment provided a greater than 5-log increase in bacterial abundance, and survival of this compost-inoculum was more stable in planted treatments. Despite increased abundance, the activity of the introduced community was low, and significant increases were not detected until six and nine months in quailbush, and unplanted compost and buffalo grass treatments, respectively. In addition, increased abundances of nitrogen-fixation (nifH) and ammonia-oxidizing (amoA) genes were observed in rhizospheres of buffalo grass and quailbush, respectively. Thus, plant establishment facilitated the short term stabilization of introduced bacterial biomass and supported the growth of two key nitrogen-cycling populations in compost-amended tailings. PMID:25495940

  7. Abundance and Activity of 16S rRNA, AmoA and NifH Bacterial Genes During Assisted Phytostabilization of Mine Tailings.

    PubMed

    Nelson, Karis N; Neilson, Julia W; Root, Robert A; Chorover, Jon; Maier, Raina M

    2015-01-01

    Mine tailings in semiarid regions are highly susceptible to erosion and are sources of dust pollution and potential avenues of human exposure to toxic metals. One constraint to revegetation of tailings by phytostabilization is the absence of microbial communities critical for biogeochemical cycling of plant nutrients. The objective of this study was to evaluate specific genes as in situ indicators of biological soil response during phytoremediation. The abundance and activity of 16S rRNA, nifH, and amoA were monitored during a nine month phytostabilization study using buffalo grass and quailbush grown in compost-amended, metalliferous tailings. The compost amendment provided a greater than 5-log increase in bacterial abundance, and survival of this compost-inoculum was more stable in planted treatments. Despite increased abundance, the activity of the introduced community was low, and significant increases were not detected until six and nine months in quailbush, and unplanted compost and buffalo grass treatments, respectively. In addition, increased abundances of nitrogen-fixation (nifH) and ammonia-oxidizing (amoA) genes were observed in rhizospheres of buffalo grass and quailbush, respectively. Thus, plant establishment facilitated the short term stabilization of introduced bacterial biomass and supported the growth of two key nitrogen-cycling populations in compost-amended tailings.

  8. PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more

    PubMed Central

    Liu, Yifeng; Liang, Yongjie; Wishart, David

    2015-01-01

    PolySearch2 (http://polysearch.ca) is an online text-mining system for identifying relationships between biomedical entities such as human diseases, genes, SNPs, proteins, drugs, metabolites, toxins, metabolic pathways, organs, tissues, subcellular organelles, positive health effects, negative health effects, drug actions, Gene Ontology terms, MeSH terms, ICD-10 medical codes, biological taxonomies and chemical taxonomies. PolySearch2 supports a generalized ‘Given X, find all associated Ys’ query, where X and Y can be selected from the aforementioned biomedical entities. An example query might be: ‘Find all diseases associated with Bisphenol A’. To find its answers, PolySearch2 searches for associations against comprehensive collections of free-text collections, including local versions of MEDLINE abstracts, PubMed Central full-text articles, Wikipedia full-text articles and US Patent application abstracts. PolySearch2 also searches 14 widely used, text-rich biological databases such as UniProt, DrugBank and Human Metabolome Database to improve its accuracy and coverage. PolySearch2 maintains an extensive thesaurus of biological terms and exploits the latest search engine technology to rapidly retrieve relevant articles and databases records. PolySearch2 also generates, ranks and annotates associative candidates and present results with relevancy statistics and highlighted key sentences to facilitate user interpretation. PMID:25925572

  9. The SAGA/TREX-2 subunit Sus1 binds widely to transcribed genes and affects mRNA turnover globally.

    PubMed

    García-Molinero, Varinia; García-Martínez, José; Reja, Rohit; Furió-Tarí, Pedro; Antúnez, Oreto; Vinayachandran, Vinesh; Conesa, Ana; Pugh, B Franklin; Pérez-Ortín, José E; Rodríguez-Navarro, Susana

    2018-03-29

    Eukaryotic transcription is regulated through two complexes, the general transcription factor IID (TFIID) and the coactivator Spt-Ada-Gcn5 acetyltransferase (SAGA). Recent findings confirm that both TFIID and SAGA contribute to the synthesis of nearly all transcripts and are recruited genome-wide in yeast. However, how this broad recruitment confers selectivity under specific conditions remains an open question. Here we find that the SAGA/TREX-2 subunit Sus1 associates with upstream regulatory regions of many yeast genes and that heat shock drastically changes Sus1 binding. While Sus1 binding to TFIID-dominated genes is not affected by temperature, its recruitment to SAGA-dominated genes and RP genes is significantly disturbed under heat shock, with Sus1 relocated to environmental stress-responsive genes in these conditions. Moreover, in contrast to recent results showing that SAGA deubiquitinating enzyme Ubp8 is dispensable for RNA synthesis, genomic run-on experiments demonstrate that Sus1 contributes to synthesis and stability of a wide range of transcripts. Our study provides support for a model in which SAGA/TREX-2 factor Sus1 acts as a global transcriptional regulator in yeast but has differential activity at yeast genes as a function of their transcription rate or during stress conditions.

  10. Optimizing longwall mine layouts

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Minkel, M.J.

    1996-12-31

    Before spending the time to design an underground mine in detail, the mining engineer should be assured of the economic viability of the location of the layout. This has historically been a trial-and-error, iterative process. Traditional underground mine planning usually bases the layout on the geological characteristics of a deposit such as minimum seam height, quality, and the absence of faults. Whether one attempts to make a decision manually. or use traditional mine planning software, the process works something like this: First you build geological model. Then you impose a {open_quotes}best guess{close_quotes} as to which geological layers will become partmore » of the mined product, or will influence mining. Next you place your design where you believe is the best location to make a mine. Then you select equipment which you believe will cost-effectively mine the area. Finally, you schedule your equipment selection through the design over the mine life, run financial analyses and see if the rate of return is acceptable. If the NPV is acceptable, the design is accepted. If the NPV is not acceptable, the engineer has to restart the cycle of redesigning the layout, rescheduling the equipment, and restudying the economics again.« less

  11. Closedure - Mine Closure Technologies Resource

    NASA Astrophysics Data System (ADS)

    Kauppila, Päivi; Kauppila, Tommi; Pasanen, Antti; Backnäs, Soile; Liisa Räisänen, Marja; Turunen, Kaisa; Karlsson, Teemu; Solismaa, Lauri; Hentinen, Kimmo

    2015-04-01

    Closure of mining operations is an essential part of the development of eco-efficient mining and the Green Mining concept in Finland to reduce the environmental footprint of mining. Closedure is a 2-year joint research project between Geological Survey of Finland and Technical Research Centre of Finland that aims at developing accessible tools and resources for planning, executing and monitoring mine closure. The main outcome of the Closedure project is an updatable wiki technology-based internet platform (http://mineclosure.gtk.fi) in which comprehensive guidance on the mine closure is provided and main methods and technologies related to mine closure are evaluated. Closedure also provides new data on the key issues of mine closure, such as performance of passive water treatment in Finland, applicability of test methods for evaluating cover structures for mining wastes, prediction of water effluents from mine wastes, and isotopic and geophysical methods to recognize contaminant transport paths in crystalline bedrock.

  12. Uncovering text mining: A survey of current work on web-based epidemic intelligence

    PubMed Central

    Collier, Nigel

    2012-01-01

    Real world pandemics such as SARS 2002 as well as popular fiction like the movie Contagion graphically depict the health threat of a global pandemic and the key role of epidemic intelligence (EI). While EI relies heavily on established indicator sources a new class of methods based on event alerting from unstructured digital Internet media is rapidly becoming acknowledged within the public health community. At the heart of automated information gathering systems is a technology called text mining. My contribution here is to provide an overview of the role that text mining technology plays in detecting epidemics and to synthesise my existing research on the BioCaster project. PMID:22783909

  13. Diversity in global gene expression and morphology across a watercress (Nasturtium officinale R. Br.) germplasm collection: first steps to breeding

    PubMed Central

    Payne, Adrienne C.; Clarkson, Graham J.J.; Rothwell, Steve; Taylor, Gail

    2015-01-01

    Watercress (Nasturtium officinale R. Br.) is a nutrient intense, leafy crop that is consumed raw or in soups across the globe, but for which, currently no genomic resources or breeding programme exists. Promising morphological, biochemical and functional genomic variation was identified for the first time in a newly established watercress germplasm collection, consisting of 48 watercress accessions sourced from contrasting global locations. Stem length, stem diameter and anti-oxidant (AO) potential varied across the accessions. This variation was used to identify three extreme contrasting accessions for further analysis. Variation in global gene expression was investigated using an Affymetrix Arabidopsis ATH1 microarray gene chip, using the commercial control (C), an accession selected for dwarf phenotype with a high AO potential (dwarfAO, called ‘Boldrewood’) and one with high AO potential alone. A set of transcripts significantly differentially expressed between these three accessions, were identified, including transcripts involved in the regulation of growth and development and those involved in secondary metabolism. In particular, when differential gene expression was compared between C and dwarfAO, the dwarfAO was characterised by increased expression of genes encoding glucosinolates, which are known precursors of phenethyl isothiocyanate, linked to the anti-carcinogenic effects well-documented in watercress. This study provides the first analysis of natural variation across the watercress genome and has identified important underpinning information for future breeding for enhanced anti-carcinogenic properties and morphology traits in this nutrient-intense crop. PMID:26504575

  14. Mining microarray datasets in nutrition: expression of the GPR120 (n-3 fatty acid receptor/sensor) gene is down-regulated in human adipocytes by macrophage secretions

    PubMed Central

    Trayhurn, Paul; Denyer, Gareth

    2012-01-01

    Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity. PMID:25191551

  15. Mine wastes and human health

    USGS Publications Warehouse

    Plumlee, Geoffrey S.; Morman, Suzette A.

    2011-01-01

    Historical mining and mineral processing have been linked definitively to health problems resulting from occupational and environmental exposures to mine wastes. Modern mining and processing methods, when properly designed and implemented, prevent or greatly reduce potential environmental health impacts. However, particularly in developing countries, there are examples of health problems linked to recent mining. In other cases, recent mining has been blamed for health problems but no clear links have been found. The types and abundances of potential toxicants in mine wastes are predictably influenced by the geologic characteristics of the deposit being mined. Hence, Earth scientists can help understand, anticipate, and mitigate potential health issues associated with mining and mineral processing.

  16. Cyanotrophic and arsenic oxidizing activities of Pseudomonas mendocina P6115 isolated from mine tailings containing high cyanide concentration.

    PubMed

    Miranda-Carrazco, Alejandra; Vigueras-Cortés, Juan M; Villa-Tanaca, Lourdes; Hernández-Rodríguez, César

    2018-04-11

    Mine tailings and wastewater generate man-made environments with several selective pressures, including the presence of heavy metals, arsenic and high cyanide concentrations, but severe nutritional limitations. Some oligotrophic and pioneer bacteria can colonise and grow in mine wastes containing a low concentration of organic matter and combined nitrogen sources. In this study, Pseudomonas mendocina P6115 was isolated from mine tailings in Durango, Mexico, and identified through a phylogenetic approach of 16S rRNA, gyrB, rpoB, and rpoD genes. Cell growth, cyanide consumption, and ammonia production kinetics in a medium with cyanide as sole nitrogen source showed that at the beginning, the strain grew assimilating cyanide, when cyanide was removed, ammonium was produced and accumulated in the culture medium. However, no clear stoichiometric relationship between both nitrogen sources was observed. Also, cyanide complexes were assimilated as nitrogen sources. Other phenotypic tasks that contribute to the strain's adaptation to a mine tailing environment included siderophores production in media with moderate amounts of heavy metals, arsenite and arsenate tolerance, and the capacity of oxidizing arsenite. P. mendocina P6115 harbours cioA/cioB and aoxB genes encoding for a cyanide-insensitive oxidase and an arsenite oxidase, respectively. This is the first report where P. mendocina is described as a cyanotrophic and arsenic oxidizing species. Genotypic and phenotypic tasks of P. mendocina P6115 autochthonous from mine wastes are potentially relevant for biological treatment of residues contaminated with cyanide and arsenic.

  17. Mercury Contamination and Biogeochemical Cycling Associated with the Historic Idrija Mining Area of Slovenia

    NASA Astrophysics Data System (ADS)

    Hines, M. E.; Bonzongo, J. J.; Barkay, T.; Horvat, M.; Faganeli, J.

    2001-12-01

    The Idrija Mine is the second largest Hg mine in the world, which operated for 500 years before recently closing. More than five million tons of ore were mined with only 73% recovered. Hg-laden tailings still line the banks. Exhausts from stacks and mineshafts caused elevated levels of airborne Hg, most of which was deposited in the Idrija basin leading to elevated Hg levels in surficial soils. Hg is continually being transported downstream with approximately 1,500 kg per year entering the northern Adriatic Sea 100 km away. Multidisciplinary studies were conducted on samples collected throughout the Idrija and Soca River systems and waters and sediments in the Gulf of Trieste including Hg speciation, Hg transformation activities in sediments and soils, and the presence and expression of bacterial Hg resistance (mer) genes. Total Hg in the Idrija River increased from <3 to >300 ng/L with MeHg accounting for about 0.5%. Concentrations decreased downstream, but increased again in the Soca River and in the estuary with MeHg accounting for nearly 1.5% of the total. However, while bacteria upstream of the mine did not contain mer genes, such genes were detected in bacteria collected downstream for nearly 40 km, and these genes were transcribed. Total Hg levels decreased offshore, but values over 30 ng/L were noted in bottom waters. MeHg concentrations in the Gulf were highest in bottom waters. Sediments near the river mouth contained 40 micro-g/g total Hg with MeHg concentrations of about 3 ng/g. Sediments several km into the Gulf contained 50-fold less total Hg but only 10-fold less MeHg that decreased with depth in the sediment. Hg in sediment pore waters varied between 1 and 8 ng/L, with MeHg accounting for about 30%. Hg methylation and MeHg demethylation were active in Gulf sediments with highest activities near the surface. MeHg was degraded by an oxidative pathway with >97% of the C released from MeHg as carbon dioxide. Hg methylation depth profiles resembled

  18. A Review of Mine Rescue Ensembles for Underground Coal Mining in the United States.

    PubMed

    Kilinc, F Selcen; Monaghan, William D; Powell, Jeffrey B

    The mining industry is among the top ten industries nationwide with high occupational injury and fatality rates, and mine rescue response may be considered one of the most hazardous activities in mining operations. In the aftermath of an underground mine fire, explosion or water inundation, specially equipped and trained teams have been sent underground to fight fires, rescue entrapped miners, test atmospheric conditions, investigate the causes of the disaster, or recover the dead. Special personal protective ensembles are used by the team members to improve the protection of rescuers against the hazards of mine rescue and recovery. Personal protective ensembles used by mine rescue teams consist of helmet, cap lamp, hood, gloves, protective clothing, boots, kneepads, facemask, breathing apparatus, belt, and suspenders. While improved technology such as wireless warning and communication systems, lifeline pulleys, and lighted vests have been developed for mine rescuers over the last 100 years, recent research in this area of personal protective ensembles has been minimal due to the trending of reduced exposure of rescue workers. In recent years, the exposure of mine rescue teams to hazardous situations has been changing. However, it is vital that members of the teams have the capability and proper protection to immediately respond to a wide range of hazardous situations. Currently, there are no minimum requirements, best practice documents, or nationally recognized consensus standards for protective clothing used by mine rescue teams in the United States (U.S.). The following review provides a summary of potential issues that can be addressed by rescue teams and industry to improve potential exposures to rescue team members should a disaster situation occur. However, the continued trending in the mining industry toward non-exposure to potential hazards for rescue workers should continue to be the primary goal. To assist in continuing this trend, the mining industry

  19. A Review of Mine Rescue Ensembles for Underground Coal Mining in the United States

    PubMed Central

    Kilinc, F. Selcen; Monaghan, William D.; Powell, Jeffrey B.

    2016-01-01

    The mining industry is among the top ten industries nationwide with high occupational injury and fatality rates, and mine rescue response may be considered one of the most hazardous activities in mining operations. In the aftermath of an underground mine fire, explosion or water inundation, specially equipped and trained teams have been sent underground to fight fires, rescue entrapped miners, test atmospheric conditions, investigate the causes of the disaster, or recover the dead. Special personal protective ensembles are used by the team members to improve the protection of rescuers against the hazards of mine rescue and recovery. Personal protective ensembles used by mine rescue teams consist of helmet, cap lamp, hood, gloves, protective clothing, boots, kneepads, facemask, breathing apparatus, belt, and suspenders. While improved technology such as wireless warning and communication systems, lifeline pulleys, and lighted vests have been developed for mine rescuers over the last 100 years, recent research in this area of personal protective ensembles has been minimal due to the trending of reduced exposure of rescue workers. In recent years, the exposure of mine rescue teams to hazardous situations has been changing. However, it is vital that members of the teams have the capability and proper protection to immediately respond to a wide range of hazardous situations. Currently, there are no minimum requirements, best practice documents, or nationally recognized consensus standards for protective clothing used by mine rescue teams in the United States (U.S.). The following review provides a summary of potential issues that can be addressed by rescue teams and industry to improve potential exposures to rescue team members should a disaster situation occur. However, the continued trending in the mining industry toward non-exposure to potential hazards for rescue workers should continue to be the primary goal. To assist in continuing this trend, the mining industry

  20. Network-based prediction and knowledge mining of disease genes

    PubMed Central

    2015-01-01

    Background In recent years, high-throughput protein interaction identification methods have generated a large amount of data. When combined with the results from other in vivo and in vitro experiments, a complex set of relationships between biological molecules emerges. The growing popularity of network analysis and data mining has allowed researchers to recognize indirect connections between these molecules. Due to the interdependent nature of network entities, evaluating proteins in this context can reveal relationships that may not otherwise be evident. Methods We examined the human protein interaction network as it relates to human illness using the Disease Ontology. After calculating several topological metrics, we trained an alternating decision tree (ADTree) classifier to identify disease-associated proteins. Using a bootstrapping method, we created a tree to highlight conserved characteristics shared by many of these proteins. Subsequently, we reviewed a set of non-disease-associated proteins that were misclassified by the algorithm with high confidence and searched for evidence of a disease relationship. Results Our classifier was able to predict disease-related genes with 79% area under the receiver operating characteristic (ROC) curve (AUC), which indicates the tradeoff between sensitivity and specificity and is a good predictor of how a classifier will perform on future data sets. We found that a combination of several network characteristics including degree centrality, disease neighbor ratio, eccentricity, and neighborhood connectivity help to distinguish between disease- and non-disease-related proteins. Furthermore, the ADTree allowed us to understand which combinations of strongly predictive attributes contributed most to protein-disease classification. In our post-processing evaluation, we found several examples of potential novel disease-related proteins and corresponding literature evidence. In addition, we showed that first- and second

  1. Network-based prediction and knowledge mining of disease genes.

    PubMed

    Carson, Matthew B; Lu, Hui

    2015-01-01

    In recent years, high-throughput protein interaction identification methods have generated a large amount of data. When combined with the results from other in vivo and in vitro experiments, a complex set of relationships between biological molecules emerges. The growing popularity of network analysis and data mining has allowed researchers to recognize indirect connections between these molecules. Due to the interdependent nature of network entities, evaluating proteins in this context can reveal relationships that may not otherwise be evident. We examined the human protein interaction network as it relates to human illness using the Disease Ontology. After calculating several topological metrics, we trained an alternating decision tree (ADTree) classifier to identify disease-associated proteins. Using a bootstrapping method, we created a tree to highlight conserved characteristics shared by many of these proteins. Subsequently, we reviewed a set of non-disease-associated proteins that were misclassified by the algorithm with high confidence and searched for evidence of a disease relationship. Our classifier was able to predict disease-related genes with 79% area under the receiver operating characteristic (ROC) curve (AUC), which indicates the tradeoff between sensitivity and specificity and is a good predictor of how a classifier will perform on future data sets. We found that a combination of several network characteristics including degree centrality, disease neighbor ratio, eccentricity, and neighborhood connectivity help to distinguish between disease- and non-disease-related proteins. Furthermore, the ADTree allowed us to understand which combinations of strongly predictive attributes contributed most to protein-disease classification. In our post-processing evaluation, we found several examples of potential novel disease-related proteins and corresponding literature evidence. In addition, we showed that first- and second-order neighbors in the PPI network

  2. Atmospheric particulate matter size distribution and concentration in West Virginia coal mining and non-mining areas.

    PubMed

    Kurth, Laura M; McCawley, Michael; Hendryx, Michael; Lusk, Stephanie

    2014-07-01

    People who live in Appalachian areas where coal mining is prominent have increased health problems compared with people in non-mining areas of Appalachia. Coal mines and related mining activities result in the production of atmospheric particulate matter (PM) that is associated with human health effects. There is a gap in research regarding particle size concentration and distribution to determine respiratory dose around coal mining and non-mining areas. Mass- and number-based size distributions were determined with an Aerodynamic Particle Size and Scanning Mobility Particle Sizer to calculate lung deposition around mining and non-mining areas of West Virginia. Particle number concentrations and deposited lung dose were significantly greater around mining areas compared with non-mining areas, demonstrating elevated risks to humans. The greater dose was correlated with elevated disease rates in the West Virginia mining areas. Number concentrations in the mining areas were comparable to a previously documented urban area where number concentration was associated with respiratory and cardiovascular disease.

  3. Alternate Bearing in Citrus: Changes in the Expression of Flowering Control Genes and in Global Gene Expression in ON- versus OFF-Crop Trees

    PubMed Central

    Shalom, Liron; Samuels, Sivan; Zur, Naftali; Shlizerman, Lyudmila; Zemach, Hanita; Weissberg, Mira; Ophir, Ron; Blumwald, Eduardo; Sadka, Avi

    2012-01-01

    Alternate bearing (AB) is the process in fruit trees by which cycles of heavy yield (ON crop) one year are followed by a light yield (OFF crop) the next. Heavy yield usually reduces flowering intensity the following year. Despite its agricultural importance, how the developing crop influences the following year's return bloom and yield is not fully understood. It might be assumed that an ‘AB signal’ is generated in the fruit, or in another organ that senses fruit presence, and moves into the bud to determine its fate—flowering or vegetative growth. The bud then responds to fruit presence by altering regulatory and metabolic pathways. Determining these pathways, and when they are altered, might indicate the nature of this putative AB signal. We studied bud morphology, the expression of flowering control genes, and global gene expression in ON- and OFF-crop buds. In May, shortly after flowering and fruit set, OFF-crop buds were already significantly longer than ON-crop buds. The number of differentially expressed genes was higher in May than at the other tested time points. Processes differentially expressed between ON- and OFF-crop trees included key metabolic and regulatory pathways, such as photosynthesis and secondary metabolism. The expression of genes of trehalose metabolism and flavonoid metabolism was validated by nCounter technology, and the latter was confirmed by metabolomic analysis. Among genes induced in OFF-crop trees was one homologous to SQUAMOSA PROMOTER BINDING-LIKE (SPL), which controls juvenile-to-adult and annual phase transitions, regulated by miR156. The expression pattern of SPL-like, miR156 and other flowering control genes suggested that fruit load affects bud fate, and therefore development and metabolism, a relatively long time before the flowering induction period. Results shed light on some of the metabolic and regulatory processes that are altered in ON and OFF buds. PMID:23071667

  4. Alternate bearing in citrus: changes in the expression of flowering control genes and in global gene expression in ON- versus OFF-crop trees.

    PubMed

    Shalom, Liron; Samuels, Sivan; Zur, Naftali; Shlizerman, Lyudmila; Zemach, Hanita; Weissberg, Mira; Ophir, Ron; Blumwald, Eduardo; Sadka, Avi

    2012-01-01

    Alternate bearing (AB) is the process in fruit trees by which cycles of heavy yield (ON crop) one year are followed by a light yield (OFF crop) the next. Heavy yield usually reduces flowering intensity the following year. Despite its agricultural importance, how the developing crop influences the following year's return bloom and yield is not fully understood. It might be assumed that an 'AB signal' is generated in the fruit, or in another organ that senses fruit presence, and moves into the bud to determine its fate-flowering or vegetative growth. The bud then responds to fruit presence by altering regulatory and metabolic pathways. Determining these pathways, and when they are altered, might indicate the nature of this putative AB signal. We studied bud morphology, the expression of flowering control genes, and global gene expression in ON- and OFF-crop buds. In May, shortly after flowering and fruit set, OFF-crop buds were already significantly longer than ON-crop buds. The number of differentially expressed genes was higher in May than at the other tested time points. Processes differentially expressed between ON- and OFF-crop trees included key metabolic and regulatory pathways, such as photosynthesis and secondary metabolism. The expression of genes of trehalose metabolism and flavonoid metabolism was validated by nCounter technology, and the latter was confirmed by metabolomic analysis. Among genes induced in OFF-crop trees was one homologous to SQUAMOSA PROMOTER BINDING-LIKE (SPL), which controls juvenile-to-adult and annual phase transitions, regulated by miR156. The expression pattern of SPL-like, miR156 and other flowering control genes suggested that fruit load affects bud fate, and therefore development and metabolism, a relatively long time before the flowering induction period. Results shed light on some of the metabolic and regulatory processes that are altered in ON and OFF buds.

  5. Mining Deployment Optimization

    NASA Astrophysics Data System (ADS)

    Čech, Jozef

    2016-09-01

    The deployment problem, researched primarily in the military sector, is emerging in some other industries, mining included. The principal decision is how to deploy some activities in space and time to achieve desired outcome while complying with certain requirements or limits. Requirements and limits are on the side constraints, while minimizing costs or maximizing some benefits are on the side of objectives. A model with application to mining of polymetallic deposit is presented. To obtain quick and immediate decision solutions for a mining engineer with experimental possibilities is the main intention of a computer-based tool. The task is to determine strategic deployment of mining activities on a deposit, meeting planned output from the mine and at the same time complying with limited reserves and haulage capacities. Priorities and benefits can be formulated by the planner.

  6. Public data mining plus domestic experimental study defined involvement of the old-yet-uncharacterized gene matrix-remodeling associated 7 (MXRA7) in physiopathology of the eye.

    PubMed

    Jia, Changkai; Zhang, Feng; Zhu, Ying; Qi, Xia; Wang, Yiqiang

    2017-10-20

    Matrix-remodeling associated 7 (MXRA7) gene was first reported in 2002 and named so for its co-expression with several genes known to relate with matrix-remodeling. However, not any studies had been intentionally performed to characterize this gene. We started defining the functions of MXRA7 by integrating bioinformatics analysis and experimental study. Data mining of MXRA7 expression in BioGPS, Gene Expression Omnibus and EurExpress platforms highlighted high level expression of Mxra7 in murine ocular tissues. Real-time PCR was employed to measure Mxra7 mRNA in tissues of adult C57BL/6 mice and demonstrated that Mxra7 was preferentially expressed at higher level in retina, corneas and lens than in other tissues. Then the inflammatory corneal neovascularization (CorNV) model and fungal corneal infections were induced in Balb/c mice, and mRNA levels of Mxra7 as well as several matrix-remodeling related genes (Mmp3, Mmp13, Ecm1, Timp1) were monitored with RT-PCR. The results demonstrated a time-dependent Mxra7 under-expression pattern (U-shape curve along timeline), while all other matrix-remodeling related genes manifested an opposite changes pattern (dome-shape curve). When limited data from BioGPS concerning human MXRA7 gene expression in human tissues were looked at, it was found that ocular tissue was also the one expressing highest level of MXRA7. To conclude, integrative assay of MXRA7 gene expression in public databank as well as domestic animal models revealed a selective high expression MXRA7 in murine and human ocular tissues, and its change patterns in two corneal disease models implied that MXRA7 might play a role in pathological processes or diseases involving injury, neovascularization and would healing. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Global preamplification simplifies targeted mRNA quantification

    PubMed Central

    Kroneis, Thomas; Jonasson, Emma; Andersson, Daniel; Dolatabadi, Soheila; Ståhlberg, Anders

    2017-01-01

    The need to perform gene expression profiling using next generation sequencing and quantitative real-time PCR (qPCR) on small sample sizes and single cells is rapidly expanding. However, to analyse few molecules, preamplification is required. Here, we studied global and target-specific preamplification using 96 optimised qPCR assays. To evaluate the preamplification strategies, we monitored the reactions in real-time using SYBR Green I detection chemistry followed by melting curve analysis. Next, we compared yield and reproducibility of global preamplification to that of target-specific preamplification by qPCR using the same amount of total RNA. Global preamplification generated 9.3-fold lower yield and 1.6-fold lower reproducibility than target-specific preamplification. However, the performance of global preamplification is sufficient for most downstream applications and offers several advantages over target-specific preamplification. To demonstrate the potential of global preamplification we analysed the expression of 15 genes in 60 single cells. In conclusion, we show that global preamplification simplifies targeted gene expression profiling of small sample sizes by a flexible workflow. We outline the pros and cons for global preamplification compared to target-specific preamplification. PMID:28332609

  8. Data mining in radiology

    PubMed Central

    Kharat, Amit T; Singh, Amarjit; Kulkarni, Vilas M; Shah, Digish

    2014-01-01

    Data mining facilitates the study of radiology data in various dimensions. It converts large patient image and text datasets into useful information that helps in improving patient care and provides informative reports. Data mining technology analyzes data within the Radiology Information System and Hospital Information System using specialized software which assesses relationships and agreement in available information. By using similar data analysis tools, radiologists can make informed decisions and predict the future outcome of a particular imaging finding. Data, information and knowledge are the components of data mining. Classes, Clusters, Associations, Sequential patterns, Classification, Prediction and Decision tree are the various types of data mining. Data mining has the potential to make delivery of health care affordable and ensure that the best imaging practices are followed. It is a tool for academic research. Data mining is considered to be ethically neutral, however concerns regarding privacy and legality exists which need to be addressed to ensure success of data mining. PMID:25024513

  9. A gossip based information fusion protocol for distributed frequent itemset mining

    NASA Astrophysics Data System (ADS)

    Sohrabi, Mohammad Karim

    2018-07-01

    The computational complexity, huge memory space requirement, and time-consuming nature of frequent pattern mining process are the most important motivations for distribution and parallelization of this mining process. On the other hand, the emergence of distributed computational and operational environments, which causes the production and maintenance of data on different distributed data sources, makes the parallelization and distribution of the knowledge discovery process inevitable. In this paper, a gossip based distributed itemset mining (GDIM) algorithm is proposed to extract frequent itemsets, which are special types of frequent patterns, in a wireless sensor network environment. In this algorithm, local frequent itemsets of each sensor are extracted using a bit-wise horizontal approach (LHPM) from the nodes which are clustered using a leach-based protocol. Heads of clusters exploit a gossip based protocol in order to communicate each other to find the patterns which their global support is equal to or more than the specified support threshold. Experimental results show that the proposed algorithm outperforms the best existing gossip based algorithm in term of execution time.

  10. Biogeochemical interactions between of coal mine water and gas well cement

    NASA Astrophysics Data System (ADS)

    Gulliver, D. M.; Gardiner, J. B.; Kutchko, B. G.; Hakala, A.; Spaulding, R.; Tkach, M. K.; Ross, D.

    2017-12-01

    Unconventional natural gas wells drilled in Northern Appalachia often pass through abandoned coal mines before reaching the Marcellus or Utica formations. Biogeochemical interactions between coal mine waters and gas well cements have the potential to alter the cement and compromise its sealing integrity. This study investigates the mineralogical, geochemical, and microbial changes of cement cores exposed to natural coal mine waters. Static reactors with Class H Portland cement cores and water samples from an abandoned bituminous Pittsburgh coal mine simulated the cement-fluid interactions at relevant temperature for time periods of 1, 2, 4, and 6 weeks. Fluids were analyzed for cation and anion concentrations and extracted DNA was analyzed by 16S rRNA gene sequencing and shotgun sequencing. Cement core material was evaluated via scanning electron microscope. Results suggest that the sampled coal mine water altered the permeability and matrix mineralogy of the cement cores. Scanning electron microscope images display an increase in mineral precipitates inside the cement matrix over the course of the experiment. Chemistry results from the reaction vessels' effluent waters display decreases in dissolved calcium, iron, silica, chloride, and sulfate. The microbial community decreased in diversity over the 6-week experiment, with Hydrogenophaga emerging as dominant. These results provide insight in the complex microbial-fluid-mineral interactions of these environments. This study begins to characterize the rarely documented biogeochemical impacts that coal waters may have on unconventional gas well integrity.

  11. An integrative data mining approach to identifying adverse outcome pathway signatures.

    PubMed

    Oki, Noffisat O; Edwards, Stephen W

    2016-03-28

    The Adverse Outcome Pathway (AOP) framework is a tool for making biological connections and summarizing key information across different levels of biological organization to connect biological perturbations at the molecular level to adverse outcomes for an individual or population. Computational approaches to explore and determine these connections can accelerate the assembly of AOPs. By leveraging the wealth of publicly available data covering chemical effects on biological systems, computationally-predicted AOPs (cpAOPs) were assembled via data mining of high-throughput screening (HTS) in vitro data, in vivo data and other disease phenotype information. Frequent Itemset Mining (FIM) was used to find associations between the gene targets of ToxCast HTS assays and disease data from Comparative Toxicogenomics Database (CTD) by using the chemicals as the common aggregators between datasets. The method was also used to map gene expression data to disease data from CTD. A cpAOP network was defined by considering genes and diseases as nodes and FIM associations as edges. This network contained 18,283 gene to disease associations for the ToxCast data and 110,253 for CTD gene expression. Two case studies show the value of the cpAOP network by extracting subnetworks focused either on fatty liver disease or the Aryl Hydrocarbon Receptor (AHR). The subnetwork surrounding fatty liver disease included many genes known to play a role in this disease. When querying the cpAOP network with the AHR gene, an interesting subnetwork including glaucoma was identified. While substantial literature exists to support the potential for AHR ligands to elicit glaucoma, it was not explicitly captured in the public annotation information in CTD. The subnetwork from this analysis suggests a cpAOP that includes changes in CYP1B1 expression, which has been previously established in the literature as a primary cause of glaucoma. These case studies highlight the value in integrating multiple data

  12. DrugQuest - a text mining workflow for drug association discovery.

    PubMed

    Papanikolaou, Nikolas; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Vizirianakis, Ioannis S; Iliopoulos, Ioannis

    2016-06-06

    Text mining and data integration methods are gaining ground in the field of health sciences due to the exponential growth of bio-medical literature and information stored in biological databases. While such methods mostly try to extract bioentity associations from PubMed, very few of them are dedicated in mining other types of repositories such as chemical databases. Herein, we apply a text mining approach on the DrugBank database in order to explore drug associations based on the DrugBank "Description", "Indication", "Pharmacodynamics" and "Mechanism of Action" text fields. We apply Name Entity Recognition (NER) techniques on these fields to identify chemicals, proteins, genes, pathways, diseases, and we utilize the TextQuest algorithm to find additional biologically significant words. Using a plethora of similarity and partitional clustering techniques, we group the DrugBank records based on their common terms and investigate possible scenarios why these records are clustered together. Different views such as clustered chemicals based on their textual information, tag clouds consisting of Significant Terms along with the terms that were used for clustering are delivered to the user through a user-friendly web interface. DrugQuest is a text mining tool for knowledge discovery: it is designed to cluster DrugBank records based on text attributes in order to find new associations between drugs. The service is freely available at http://bioinformatics.med.uoc.gr/drugquest .

  13. Long Term Analysis of Deformations in Salt Mines: Kłodawa Salt Mine Case Study, Central Poland

    NASA Astrophysics Data System (ADS)

    Cała, Marek; Tajduś, Antoni; Andrusikiewicz, Wacław; Kowalski, Michał; Kolano, Malwina; Stopkowicz, Agnieszka; Cyran, Katarzyna; Jakóbczyk, Joanna

    2017-09-01

    Located in central Poland, the Kłodawa salt dome is 26 km long and about 2 km wide. Exploitation of the dome started in 1956, currently rock salt extraction is carried out in 7 mining fields and the 12 mining levels at the depth from 322 to 625 meters below sea level (m.b.s.l.). It is planned to maintain the mining activity till 2052 and extend rock salt extraction to deeper levels. The dome is characterised by complex geological structure resulted from halokinetic and tectonic processes. Projection of the 3D numerical analysis took into account the following factors: mine working distribution within the Kłodawa mine (about 1000 rooms, 350 km of galleries), complex geological structure of the salt dome, complicated structure and geometry of mine workings and distinction in rocks mechanical properties e.g. rock salt and anhydrite. Analysis of past mine workings deformation and prediction of future rock mass behaviour was divided into four stages: building of the 3D model (state of mine workings in year 2014), model extension of the future mine workings planned for extraction in years 2015-2052, the 3D model calibration and stability analysis of all mine workings. The 3D numerical model of Kłodawa salt mine included extracted and planned mine workings in 7 mining fields and 14 mining levels (about 2000 mine workings). The dimensions of the model were 4200 m × 4700 m × 1200 m what was simulated by 33 million elements. The 3D model was calibrated on the grounds of convergence measurements and laboratory tests. Stability assessment of mine workings was based on analysis of the strength/stress ratio and vertical stress. The strength/stress ratio analysis enabled to indicate endangered area in mine workings and can be defined as the factor of safety. Mine workings in state close to collapse are indicated by the strength/stress ratio equals 1. Analysis of the vertical stress in mine workings produced the estimation of current state of stress in comparison to initial

  14. Influence of Genetic Variations in Selenoprotein Genes on the Pattern of Gene Expression after Supplementation with Brazil Nuts

    PubMed Central

    Rogero, Marcelo M.; Hesketh, John

    2017-01-01

    Selenium (Se) is an essential micronutrient for human health. Its beneficial effects are exerted by selenoproteins, which can be quantified in blood and used as molecular biomarkers of Se status. We hypothesize that the presence of genetic polymorphisms in selenoprotein genes may: (1) influence the gene expression of specific selenoproteins and (2) influence the pattern of global gene expression after Brazil nut supplementation. The study was conducted with 130 healthy volunteers in Sao Paulo, Brazil, who consumed one Brazil nut (300 μg/Se) a day for eight weeks. Gene expression of GPX1 and SELENOP and genotyping were measured by real-time PCR using TaqMan Assays. Global gene expression was assessed by microarray using Illumina HumanHT-12 v4 BeadChips. Brazil nut supplementation significantly increased GPX1 mRNA expression only in subjects with CC genotype at rs1050450 (p < 0.05). SELENOP mRNA expression was significantly higher in A-carriers at rs7579 either before or after supplementation (p < 0.05). Genotype for rs713041 in GPX4 affected the pattern of blood cell global gene expression. Genetic variations in selenoprotein genes modulated both GPX1 and SELENOP selenoprotein gene expression and global gene expression in response to Brazil nut supplementation. PMID:28696394

  15. Global Regulatory Pathways in the Alphaproteobacteria

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    none

    A major goal for microbiologists in the twenty-first century is to develop an understanding of the microbial cell in all its complexity. In addition to understanding the function of individual gene products we need to focus on how the cell regulates gene expression at a global level to respond to different environmental parameters. Development of genomic technologies such as complete genome sequencing, proteomics, and global comparisons of mRNA expression patterns allows us to begin to address this issue. This proposal focuses on a number of phylogenetically related bacteria that are involved in environmentally important processes such as carbon sequestration andmore » bioremediation. Genome sequencing projects of a number of these bacteria have revealed the presence of a small family of regulatory genes found thus far only in the alpha-proteobacteria. These genes encode proteins that are related to the global regulatory protein RosR in Rhizobium etli, which is involved in determining nodulation competitiveness in this bacterium. Our goal is to examine the function of the proteins encoded by this gene family in several of the bacteria containing homologs to RosR. We will construct gene disruption mutations in a number of these bacteria and characterize the resulting mutant strains using two-dimensional gel electrophoresis and genetic and biochemical techniques. We will thus determine if the other proteins also function as global regulators of gene expression. Using proteomics methods we will identify the specific proteins whose expression varies depending on the presence or absence of the RosR homolog. Over fifty loci regulated by RosR have been identified in R. etli using transposon mutagenesis; this will serve as out benchmark to which we will compare the other regulons. We expect to identify genes regulated by RosR homologs in several bacterial species, including, but not limited to Rhodopseudomonas palustris and Sphingomonas aromaticivorans. In this way we

  16. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins.

    PubMed

    van Heel, Auke J; de Jong, Anne; Song, Chunxu; Viel, Jakob H; Kok, Jan; Kuipers, Oscar P

    2018-05-21

    Interest in secondary metabolites such as RiPPs (ribosomally synthesized and posttranslationally modified peptides) is increasing worldwide. To facilitate the research in this field we have updated our mining web server. BAGEL4 is faster than its predecessor and is now fully independent from ORF-calling. Gene clusters of interest are discovered using the core-peptide database and/or through HMM motifs that are present in associated context genes. The databases used for mining have been updated and extended with literature references and links to UniProt and NCBI. Additionally, we have included automated promoter and terminator prediction and the option to upload RNA expression data, which can be displayed along with the identified clusters. Further improvements include the annotation of the context genes, which is now based on a fast blast against the prokaryote part of the UniRef90 database, and the improved web-BLAST feature that dynamically loads structural data such as internal cross-linking from UniProt. Overall BAGEL4 provides the user with more information through a user-friendly web-interface which simplifies data evaluation. BAGEL4 is freely accessible at http://bagel4.molgenrug.nl.

  17. Rescue complex for coal mines

    NASA Astrophysics Data System (ADS)

    Yungmeyster, D. A.; Urazbakhtin, R. Yu

    2017-10-01

    The mining industry was potentially dangerous at all times, even with the use of modern equipment in mines, accidents continue to occur, including catastrophic ones. Accidents in mines are due to the presence of specific features in the conduct of mining operations. These include the inconsistency of mining and geological conditions, the contamination of the mine atmosphere due to the release of gases from minerals, the presence of self-igniting coal strata, which creates the danger of underground fires, gas explosions. The main cause of accidents is the irresponsibility of both the manager and the personnel who violate the safety rules during mining operations.

  18. Information and communication technology and climate change adaptation: Evidence from selected mining companies in South Africa

    PubMed Central

    Nhamo, Godwell

    2016-01-01

    The mining sector is a significant contributor to the gross domestic product of many global economies. Given the increasing trends in climate-induced disasters and the growing desire to find lasting solutions, information and communication technology (ICT) has been introduced into the climate change adaptation mix. Climate change-induced extreme weather events such as flooding, drought, excessive fog, and cyclones have compounded the environmental challenges faced by the mining sector. This article presents the adoption of ICT innovation as part of the adaptation strategies towards reducing the mining sector’s vulnerability and exposure to climate change disaster risks. Document analysis and systematic literature review were adopted as the methodology. Findings from the study reflect how ICT intervention orchestrated changes in communication patterns which are tailored towards the reduction in climate change vulnerability and exposure. The research concludes with a proposition that ICT intervention must be part of the bigger and ongoing climate change adaptation agenda in the mining sector.

  19. The Global Error Assessment (GEA) model for the selection of differentially expressed genes in microarray data.

    PubMed

    Mansourian, Robert; Mutch, David M; Antille, Nicolas; Aubert, Jerome; Fogel, Paul; Le Goff, Jean-Marc; Moulin, Julie; Petrov, Anton; Rytz, Andreas; Voegel, Johannes J; Roberts, Matthew-Alan

    2004-11-01

    Microarray technology has become a powerful research tool in many fields of study; however, the cost of microarrays often results in the use of a low number of replicates (k). Under circumstances where k is low, it becomes difficult to perform standard statistical tests to extract the most biologically significant experimental results. Other more advanced statistical tests have been developed; however, their use and interpretation often remain difficult to implement in routine biological research. The present work outlines a method that achieves sufficient statistical power for selecting differentially expressed genes under conditions of low k, while remaining as an intuitive and computationally efficient procedure. The present study describes a Global Error Assessment (GEA) methodology to select differentially expressed genes in microarray datasets, and was developed using an in vitro experiment that compared control and interferon-gamma treated skin cells. In this experiment, up to nine replicates were used to confidently estimate error, thereby enabling methods of different statistical power to be compared. Gene expression results of a similar absolute expression are binned, so as to enable a highly accurate local estimate of the mean squared error within conditions. The model then relates variability of gene expression in each bin to absolute expression levels and uses this in a test derived from the classical ANOVA. The GEA selection method is compared with both the classical and permutational ANOVA tests, and demonstrates an increased stability, robustness and confidence in gene selection. A subset of the selected genes were validated by real-time reverse transcription-polymerase chain reaction (RT-PCR). All these results suggest that GEA methodology is (i) suitable for selection of differentially expressed genes in microarray data, (ii) intuitive and computationally efficient and (iii) especially advantageous under conditions of low k. The GEA code for R

  20. Extracting Cross-Ontology Weighted Association Rules from Gene Ontology Annotations.

    PubMed

    Agapito, Giuseppe; Milano, Marianna; Guzzi, Pietro Hiram; Cannataro, Mario

    2016-01-01

    Gene Ontology (GO) is a structured repository of concepts (GO Terms) that are associated to one or more gene products through a process referred to as annotation. The analysis of annotated data is an important opportunity for bioinformatics. There are different approaches of analysis, among those, the use of association rules (AR) which provides useful knowledge, discovering biologically relevant associations between terms of GO, not previously known. In a previous work, we introduced GO-WAR (Gene Ontology-based Weighted Association Rules), a methodology for extracting weighted association rules from ontology-based annotated datasets. We here adapt the GO-WAR algorithm to mine cross-ontology association rules, i.e., rules that involve GO terms present in the three sub-ontologies of GO. We conduct a deep performance evaluation of GO-WAR by mining publicly available GO annotated datasets, showing how GO-WAR outperforms current state of the art approaches.