Sample records for harvesting community annotations

  1. Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees.

    PubMed

    Wang, Qinghua; Arighi, Cecilia N; King, Benjamin L; Polson, Shawn W; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F; Page, Shallee T; Rendino, Marc Farnum; Thomas, William Kelley; Udwary, Daniel W; Wu, Cathy H

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.

  2. Community annotation and bioinformatics workforce development in concert—Little Skate Genome Annotation Workshops and Jamborees

    PubMed Central

    Wang, Qinghua; Arighi, Cecilia N.; King, Benjamin L.; Polson, Shawn W.; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F.; Page, Shallee T.; Farnum Rendino, Marc; Thomas, William Kelley; Udwary, Daniel W.; Wu, Cathy H.

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome. PMID:22434832

  3. The Co-regulation Data Harvester: Automating gene annotation starting from a transcriptome database

    NASA Astrophysics Data System (ADS)

    Tsypin, Lev M.; Turkewitz, Aaron P.

    Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing per se. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the Tetrahymena transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.
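
    The ortholog step in the abstract above can be illustrated with a small sketch. It assumes the common "reciprocal best hit" reading of reciprocal BLAST searches and works on precomputed hit tables instead of calling BLAST; the CDH itself may use different criteria. All names here (`best_hit`, `reciprocal_best_hits`, the gene identifiers and scores) are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical hit table: query ID -> list of (subject ID, bit score).
Hits = Dict[str, List[Tuple[str, float]]]


def best_hit(hits: Hits, query: str) -> Optional[str]:
    """Return the highest-scoring subject for `query`, or None if it has no hits."""
    candidates = hits.get(query, [])
    if not candidates:
        return None
    return max(candidates, key=lambda pair: pair[1])[0]


def reciprocal_best_hits(forward: Hits, reverse: Hits) -> Dict[str, str]:
    """Keep query -> subject pairs where each is the other's best hit."""
    pairs: Dict[str, str] = {}
    for query in forward:
        subject = best_hit(forward, query)
        if subject is not None and best_hit(reverse, subject) == query:
            pairs[query] = subject
    return pairs


# Toy data (invented): Tetrahymena genes searched against another proteome,
# then the reverse search from that proteome back to Tetrahymena.
forward = {
    "TTHERM_A": [("HUMAN_X", 250.0), ("HUMAN_Y", 90.0)],
    "TTHERM_B": [("HUMAN_Y", 120.0)],
}
reverse = {
    "HUMAN_X": [("TTHERM_A", 240.0)],
    "HUMAN_Y": [("TTHERM_C", 300.0), ("TTHERM_B", 110.0)],
}

orthologs = reciprocal_best_hits(forward, reverse)
print(orthologs)
```

    In this toy data only TTHERM_A and HUMAN_X are each other's best hit, so TTHERM_B is dropped; annotations would then be collated only for the retained pairs.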

  4. Community annotation experiment for ground truth generation for the i2b2 medication challenge

    PubMed Central

    Solti, Imre; Xia, Fei; Cadag, Eithon

    2010-01-01

    Objective: Within the context of the Third i2b2 Workshop on Natural Language Processing Challenges for Clinical Records, the authors (also referred to as ‘the i2b2 medication challenge team’ or ‘the i2b2 team’ for short) organized a community annotation experiment. Design: For this experiment, the authors released annotation guidelines and a small set of annotated discharge summaries. They asked the participants of the Third i2b2 Workshop to annotate 10 discharge summaries per person; each discharge summary was annotated by two annotators from two different teams, and a third annotator from a third team resolved disagreements. Measurements: In order to evaluate the reliability of the annotations thus produced, the authors measured community inter-annotator agreement and compared it with the inter-annotator agreement of expert annotators when both the community and the expert annotators generated ground truth based on pooled system outputs. For this purpose, the pool consisted of the three most densely populated automatic annotations of each record. The authors also compared the community inter-annotator agreement with expert inter-annotator agreement when the experts annotated raw records without using the pool. Finally, they measured the quality of the community ground truth by comparing it with the expert ground truth. Results and conclusions: The authors found that the community annotators achieved comparable inter-annotator agreement to expert annotators, regardless of whether the experts annotated from the pool. Furthermore, the ground truth generated by the community obtained F-measures above 0.90 against the ground truth of the experts, indicating the value of the community as a source of high-quality ground truth even on intricate and domain-specific annotation tasks. PMID:20819855
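
    The F-measure comparison mentioned in the abstract above can be sketched as follows. This is a minimal illustration that treats each ground truth as a set of exact-match (record, text) annotations; the matching rules actually used in the i2b2 challenge are not given here and may differ. The function name `f_measure` and the sample annotations are invented for the example.

```python
from typing import Set, Tuple

Annotation = Tuple[str, str]  # hypothetical (record ID, annotated text) pair


def f_measure(predicted: Set[Annotation], gold: Set[Annotation]) -> float:
    """Balanced F-measure (F1) of `predicted` against `gold`, using exact matches."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Invented example: community ground truth versus expert ground truth.
community = {
    ("rec1", "aspirin"),
    ("rec1", "81 mg"),
    ("rec2", "metformin"),
    ("rec2", "daily"),
}
expert = {
    ("rec1", "aspirin"),
    ("rec1", "81 mg"),
    ("rec2", "metformin"),
    ("rec2", "twice daily"),
}

score = f_measure(community, expert)
print(f"F-measure: {score:.2f}")
```

    Here three of the four annotations on each side match exactly, so precision and recall are both 0.75 and the F-measure is 0.75.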

  5. EuCAP, a Eukaryotic Community Annotation Package, and its application to the rice genome

    PubMed Central

    Thibaud-Nissen, Françoise; Campbell, Matthew; Hamilton, John P; Zhu, Wei; Buell, C Robin

    2007-01-01

    Background: Despite the improvements of tools for automated annotation of genome sequences, manual curation at the structural and functional level can provide an increased level of refinement to genome annotation. The Institute for Genomic Research Rice Genome Annotation (hereafter named the Osa1 Genome Annotation) is the product of an automated pipeline and, for this reason, will benefit from the input of biologists with expertise in rice and/or particular gene families. Leveraging knowledge from a dispersed community of scientists is a demonstrated way of improving a genome annotation. This requires tools that facilitate 1) the submission of gene annotation to an annotation project, 2) the review of the submitted models by project annotators, and 3) the incorporation of the submitted models in the ongoing annotation effort. Results: We have developed the Eukaryotic Community Annotation Package (EuCAP), an annotation tool, and have applied it to the rice genome. The primary level of curation by community annotators (CA) has been the annotation of gene families. Annotation can be submitted by email or through the EuCAP Web Tool. The CA models are aligned to the rice pseudomolecules and the coordinates of these alignments, along with functional annotation, are stored in the MySQL EuCAP Gene Model database. Web pages displaying the alignments of the CA models to the Osa1 Genome models are automatically generated from the EuCAP Gene Model database. The alignments are reviewed by the project annotators (PAs) in the context of experimental evidence. Upon approval by the PAs, the CA models, along with the corresponding functional annotations, are integrated into the Osa1 Genome Annotation. The CA annotations, grouped by family, are displayed on the Community Annotation pages of the project website, as well as in the Community Annotation track of the Genome Browser. Conclusion: We have applied EuCAP to rice. As of July 2007, the structural and/or functional annotation of 1…

  6. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system

    DOE PAGES

    Chen, I-Min A.; Markowitz, Victor M.; Palaniappan, Krishna; ...

    2016-04-26

    Background: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. Results: Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. Conclusion: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.

  7. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, I-Min A.; Markowitz, Victor M.; Palaniappan, Krishna

    Background: The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. Results: Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. Conclusion: By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.

  8. Supporting community annotation and user collaboration in the integrated microbial genomes (IMG) system.

    PubMed

    Chen, I-Min A; Markowitz, Victor M; Palaniappan, Krishna; Szeto, Ernest; Chu, Ken; Huang, Jinghua; Ratner, Anna; Pillay, Manoj; Hadjithomas, Michalis; Huntemann, Marcel; Mikhailova, Natalia; Ovchinnikova, Galina; Ivanova, Natalia N; Kyrpides, Nikos C

    2016-04-26

    The exponential growth of genomic data from next generation technologies renders traditional manual expert curation effort unsustainable. Many genomic systems have included community annotation tools to address the problem. Most of these systems adopted a "Wiki-based" approach to take advantage of existing wiki technologies, but encountered obstacles in issues such as usability, authorship recognition, information reliability and incentive for community participation. Here, we present a different approach, relying on tightly integrated method rather than "Wiki-based" method, to support community annotation and user collaboration in the Integrated Microbial Genomes (IMG) system. The IMG approach allows users to use existing IMG data warehouse and analysis tools to add gene, pathway and biosynthetic cluster annotations, to analyze/reorganize contigs, genes and functions using workspace datasets, and to share private user annotations and workspace datasets with collaborators. We show that the annotation effort using IMG can be part of the research process to overcome the user incentive and authorship recognition problems thus fostering collaboration among domain experts. The usability and reliability issues are addressed by the integration of curated information and analysis tools in IMG, together with DOE Joint Genome Institute (JGI) expert review. By incorporating annotation operations into IMG, we provide an integrated environment for users to perform deeper and extended data analysis and annotation in a single system that can lead to publications and community knowledge sharing as shown in the case studies.

  9. Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome

    PubMed Central

    2010-01-01

    Background: A goal of the Bovine Genome Database (BGD; http://BovineGenome.org) has been to support the Bovine Genome Sequencing and Analysis Consortium (BGSAC) in the annotation and analysis of the bovine genome. We were faced with several challenges, including the need to maintain consistent quality despite diversity in annotation expertise in the research community, the need to maintain consistent data formats, and the need to minimize the potential duplication of annotation effort. With new sequencing technologies allowing many more eukaryotic genomes to be sequenced, the demand for collaborative annotation is likely to increase. Here we present our approach, challenges and solutions facilitating a large distributed annotation project. Results and Discussion: BGD has provided annotation tools that supported 147 members of the BGSAC in contributing 3,871 gene models over a fifteen-week period, and these annotations have been integrated into the bovine Official Gene Set. Our approach has been to provide an annotation system, which includes a BLAST site, multiple genome browsers, an annotation portal, and the Apollo Annotation Editor configured to connect directly to our Chado database. In addition to implementing and integrating components of the annotation system, we have performed computational analyses to create gene evidence tracks and a consensus gene set, which can be viewed on individual gene pages at BGD. Conclusions: We have provided annotation tools that alleviate challenges associated with distributed annotation. Our system provides a consistent set of data to all annotators and eliminates the need for annotators to format data. Involving the bovine research community in genome annotation has allowed us to leverage expertise in various areas of bovine biology to provide biological insight into the genome sequence. PMID:21092105

  10. Environment and the Community: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Department of Housing and Urban Development, Washington, DC.

    Three hundred and nine citations of books, reports, and articles dating from 1964 to 1971 are included in this annotated bibliography, intended as a selection tool for concerned citizens, architects, builders, and city planners emphasizing the environment of American cities and communities. It is topically arranged into sixteen broad sections with…

  11. The consequences of balanced harvesting of fish communities

    PubMed Central

    Jacobsen, Nis S.; Gislason, Henrik; Andersen, Ken H.

    2014-01-01

    Balanced harvesting, where species or individuals are exploited in accordance with their productivity, has been proposed as a way to minimize the effects of fishing on marine fish communities and ecosystems. This calls for a thorough examination of the consequences balanced harvesting has on fish community structure and yield. We use a size- and trait-based model that resolves individual interactions through competition and predation to compare balanced harvesting with traditional selective harvesting, which protects juvenile fish from fishing. Four different exploitation patterns, generated by combining selective or unselective harvesting with balanced or unbalanced fishing, are compared. We find that unselective balanced fishing, where individuals are exploited in proportion to their productivity, produces a slightly larger total maximum sustainable yield than the other exploitation patterns and, for a given yield, the least change in the relative biomass composition of the fish community. Because fishing reduces competition, predation and cannibalism within the community, the total maximum sustainable yield is achieved at high exploitation rates. The yield from unselective balanced fishing is dominated by small individuals, whereas selective fishing produces a much higher proportion of large individuals in the yield. Although unselective balanced fishing is predicted to produce the highest total maximum sustainable yield and the lowest impact on trophic structure, it is effectively a fishery predominantly targeting small forage fish. PMID:24307676

  12. Calling on a million minds for community annotation in WikiProteins

    PubMed Central

    Mons, Barend; Ashburner, Michael; Chichester, Christine; van Mulligen, Erik; Weeber, Marc; den Dunnen, Johan; van Ommen, Gert-Jan; Musen, Mark; Cockerill, Matthew; Hermjakob, Henning; Mons, Albert; Packer, Abel; Pacheco, Roberto; Lewis, Suzanna; Berkeley, Alfred; Melton, William; Barris, Nickolas; Wales, Jimmy; Meijssen, Gerard; Moeller, Erik; Roes, Peter Jan; Borner, Katy; Bairoch, Amos

    2008-01-01

    WikiProteins enables community annotation in a Wiki-based system. Extracts of major data sources have been fused into an editable environment that links out to the original sources. Data from community edits create automatic copies of the original data. Semantic technology captures concepts co-occurring in one sentence and thus potential factual statements. In addition, indirect associations between concepts have been calculated. We call on a 'million minds' to annotate a 'million concepts' and to collect facts from the literature with the reward of collaborative knowledge discovery. The system is available for beta testing at . PMID:18507872

  13. Genome Annotation in a Community College Cell Biology Lab

    ERIC Educational Resources Information Center

    Beagley, C. Timothy

    2013-01-01

    The Biology Department at Salt Lake Community College has used the IMG-ACT toolbox to introduce a genome mapping and annotation exercise into the laboratory portion of its Cell Biology course. This project provides students with an authentic inquiry-based learning experience while introducing them to computational biology and contemporary learning…

  14. Pine Forest Harvest Leads to Decade-Scale Alterations in Soil Fungal Communities

    NASA Astrophysics Data System (ADS)

    Boutton, T. W.; Mushinski, R. M.; Gentry, T. J.

    2016-12-01

    Forestlands provide a multitude of ecosystem services, and sustainable management is crucial to maintaining the benefits of these ecosystems. Intensive organic matter removal (OMR) of logging residues and forest litter during forest harvest may result in long-term alterations to soil properties and processes. Because fungal activity regulates essential biogeochemical processes in forestlands, changes in soil fungal community structure following OMR may translate into altered soil function. Using a replicated field experiment in southern pine forest in eastern Texas, USA, we sampled soil to a depth of 1 m to assess the impact of intensive OMR on soil fungal communities. Soils were collected from replicated (n = 3) loblolly pine (Pinus taeda L.) stands subjected to 3 different harvest intensities (i.e., unharvested old growth stands, bole-only harvest stands, and whole-tree harvest + forest floor removal stands) in 1997. Nearly two decades after trees were harvested and replanted, next generation sequencing of the fungal internal transcribed spacer showed the diversity and community structure of the entire fungal community was altered relative to the unharvested stands. The relative abundance of Ascomycetes increased as OMR intensity increased and was positively correlated to concurrent changes in soil pH. The community composition of fungal functional groups (e.g., ecto- and arbuscular mycorrhizal, saprophytic fungi) was also altered by OMR. The most abundant taxon, Russula, exhibited significant reductions in response to increasing intensity of OMR. Results of this study illustrate a linkage between anthropogenically-induced aboveground perturbation, edaphic factors, and belowground soil fungal communities of southern pine forests. Also, these results indicate that tree harvesting effects on soil fungal communities can persist for decades post-harvest, with potential implications for soil functional characteristics.

  15. Chado controller: advanced annotation management with a community annotation system.

    PubMed

    Guignon, Valentin; Droc, Gaëtan; Alaux, Michael; Baurens, Franc-Christophe; Garsmeur, Olivier; Poiron, Claire; Carver, Tim; Rouard, Mathieu; Bocs, Stéphanie

    2012-04-01

    We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc . The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form . Contact: valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr. Supplementary data are available at Bioinformatics online.

  16. Chado Controller: advanced annotation management with a community annotation system

    PubMed Central

    Guignon, Valentin; Droc, Gaëtan; Alaux, Michael; Baurens, Franc-Christophe; Garsmeur, Olivier; Poiron, Claire; Carver, Tim; Rouard, Mathieu; Bocs, Stéphanie

    2012-01-01

    Summary: We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. Availability: The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc . The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form . Contact: valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22285827

  17. Long-term effects of timber harvesting on forest soil communities and their catabolic capacity

    NASA Astrophysics Data System (ADS)

    Mohn, W. W.

    2016-12-01

    We examined the effect of forest harvesting on metagenomes of soil communities in ecozones across North America. The overall effect of harvesting on community composition was very small relative to major differences between soil horizons and among geographically distinct ecozones. However, in some ecozones, harvesting substantially altered bacterial and fungal community composition and diminished the genetic potential for biomass decomposition while increasing the potential for nitrogen cycling. Stable isotope probing identified populations involved in hemicellulose and cellulose decomposition. Known cellulolytic organisms were found in the organic soil layer, while novel cellulolytic organisms were identified in the mineral soil layer. Lignolytic populations identified were mainly bacterial, and metagenomics analysis identified lignin degradation enzymes in the genomes of some of these populations. In some ecozones, cellulolytic and hemicellulolytic populations were substantially impacted by harvesting. Soil carbon, nitrogen and pH were related to the relative susceptibility of forest soil communities in the different ecozones to harvesting impacts.

  18. Teaching and Learning Communities through Online Annotation

    NASA Astrophysics Data System (ADS)

    van der Pluijm, B.

    2016-12-01

    What do colleagues do with your assigned textbook? What do they say or think about the material? Want students to be more engaged in their learning experience? If so, online materials that complement standard lecture format provide new opportunity through managed, online group annotation that leverages the ubiquity of internet access, while personalizing learning. The concept is illustrated with the new online textbook "Processes in Structural Geology and Tectonics", by Ben van der Pluijm and Stephen Marshak, which offers a platform for sharing of experiences, supplementary materials and approaches, including readings, mathematical applications, exercises, challenge questions, quizzes, alternative explanations, and more. The annotation framework used is Hypothes.is, which offers a free, open platform markup environment for annotation of websites and PDF postings. The annotations can be public, grouped or individualized, as desired, including export access and download of annotations. A teacher group, hosted by a moderator/owner, limits access to members of a user group of teachers, so that its members can use, copy or transcribe annotations for their own lesson material. Likewise, an instructor can host a student group that encourages sharing of observations, questions and answers among students and instructor. Also, the instructor can create one or more closed groups that offer study help and hints to students. Options galore, all of which aim to engage students and to promote greater responsibility for their learning experience. Beyond new capacity, the ability to analyze student annotation supports individual learners and their needs. For example, student notes can be analyzed for key phrases and concepts, and to identify misunderstandings, omissions and problems. Also, example annotations can be shared to enhance notetaking skills and to help with studying. Lastly, online annotation allows active application to lecture posted slides, supporting real-time notetaking…

  19. The Chicago American Indian Community, 1893-1988. Annotated Bibliography and Guide to Sources in Chicago.

    ERIC Educational Resources Information Center

    Beck, David

    This annotated bibliography identifies and describes documentary evidence of Chicago's American Indian population since the 1893 World's Columbian Exposition. Sources include studies and reports generated by Indian community organizations and agencies, community newsletters, newspapers, oral histories, grant applications, personal papers, and…

  20. Apollo: a community resource for genome annotation editing.

    PubMed

    Lee, Ed; Harris, Nomi; Gibson, Mark; Chetty, Raymond; Lewis, Suzanna

    2009-07-15

    Apollo is a genome annotation-editing tool with an easy to use graphical interface. It is a component of the GMOD project, with ongoing development driven by the community. Recent additions to the software include support for the generic feature format version 3 (GFF3), continuous transcriptome data, a full Chado database interface, integration with remote services for on-the-fly BLAST and Primer BLAST analyses, graphical interfaces for configuring user preferences and full undo of all edit operations. Apollo's user community continues to grow, including its use as an educational tool for college and high-school students. Apollo is a Java application distributed under a free and open source license. Installers for Windows, Linux, Unix, Solaris and Mac OS X are available at http://apollo.berkeleybop.org, and the source code is available from the SourceForge CVS repository at http://gmod.cvs.sourceforge.net/gmod/apollo.

  1. Community-based Ontology Development, Annotation and Discussion with MediaWiki extension Ontokiwi and Ontokiwi-based Ontobedia

    PubMed Central

    Ong, Edison; He, Yongqun

    2016-01-01

    Hundreds of biological and biomedical ontologies have been developed to support data standardization, integration and analysis. Although ontologies are typically developed for community usage, community efforts in ontology development are limited. To support ontology visualization, distribution, and community-based annotation and development, we have developed Ontokiwi, an ontology extension to the MediaWiki software. Ontokiwi displays hierarchical classes and ontological axioms. Ontology classes and axioms can be edited and added using Ontokiwi form or MediaWiki source editor. Ontokiwi also inherits MediaWiki features such as Wikitext editing and version control. Based on the Ontokiwi/MediaWiki software package, we have developed Ontobedia, which targets to support community-based development and annotations of biological and biomedical ontologies. As demonstrations, we have loaded the Ontology of Adverse Events (OAE) and the Cell Line Ontology (CLO) into Ontobedia. Our studies showed that Ontobedia was able to achieve expected Ontokiwi features. PMID:27570653

  2. Waterbird communities in rice fields subjected to different post-harvest treatments

    USGS Publications Warehouse

    Day, J.H.; Colwell, M.A.

    1998-01-01

    In California's Sacramento Valley, the potential value of rice fields as habitat for waterbirds may vary with harvest method, post-harvest treatment of rice straw (chopped, burned, plowed), and extent of flooding. Recent changes in rice harvesting methods (i.e., use of stripper-headers) and a legislative mandate to decrease burning of rice straw after harvest may alter habitat availability and use. Thus, we investigated species richness and community composition of nonbreeding waterbirds during October-March 1993-94 and 1994-95 in rice fields of the northern Sacramento Valley. Most (85-91% of land area) rice was conventionally harvested (i.e., cutter bar), and the remainder was stripped. Rice straw was left untreated in more than half of fields (52% in 1994 and 54% in 1995), especially in stripped fields (56-70%). In fields where farmers treated straw, the most common management methods were plowing (15-21%), burning (19-24%), and chopping (3-5%). Fields became increasingly wet from October through March as seasonal precipitation accumulated and farmers flooded fields to facilitate straw decomposition and provide habitat for ducks. Species richness of waterbirds was greater (P 0.23). Species richness in stripped fields probably was low because foraging opportunities were limited by tall dense straw, decreased grain density, and infrequent flooding. We recommend that land managers wishing to provide habitat for a diverse waterbird community harvest rice using conventional methods and flood fields shallowly.

  3. Apollo: a community resource for genome annotation editing

    PubMed Central

    Lee, Ed; Harris, Nomi; Gibson, Mark; Chetty, Raymond; Lewis, Suzanna

    2009-01-01

    Summary: Apollo is a genome annotation-editing tool with an easy to use graphical interface. It is a component of the GMOD project, with ongoing development driven by the community. Recent additions to the software include support for the generic feature format version 3 (GFF3), continuous transcriptome data, a full Chado database interface, integration with remote services for on-the-fly BLAST and Primer BLAST analyses, graphical interfaces for configuring user preferences and full undo of all edit operations. Apollo's user community continues to grow, including its use as an educational tool for college and high-school students. Availability: Apollo is a Java application distributed under a free and open source license. Installers for Windows, Linux, Unix, Solaris and Mac OS X are available at http://apollo.berkeleybop.org, and the source code is available from the SourceForge CVS repository at http://gmod.cvs.sourceforge.net/gmod/apollo. Contact: elee@berkeleybop.org PMID:19439563
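    The abstract above mentions support for GFF3. As a rough illustration of what that format looks like to a program, the following minimal sketch parses one GFF3 feature line into its nine tab-separated columns. It is not Apollo's own code (Apollo is a Java application); the names `Gff3Feature` and `parse_gff3_line` are hypothetical, and the sketch ignores directives, comments, and multi-line features.

```python
from typing import Dict, List, NamedTuple, Optional
from urllib.parse import unquote


class Gff3Feature(NamedTuple):
    """One GFF3 feature line (hypothetical container, for illustration only)."""
    seqid: str
    source: str
    feature_type: str
    start: int                 # 1-based, inclusive
    end: int                   # 1-based, inclusive
    score: Optional[float]     # None when the column is "."
    strand: str
    phase: Optional[int]       # None when the column is "."
    attributes: Dict[str, List[str]]


def parse_gff3_line(line: str) -> Gff3Feature:
    """Split a single feature line into its nine tab-separated columns."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) != 9:
        raise ValueError(f"expected 9 columns, found {len(cols)}")
    seqid, source, ftype, start, end, score, strand, phase, attrs = cols

    # Column 9 holds semicolon-separated tag=value pairs; values may be
    # comma-separated lists and are percent-encoded.
    attributes: Dict[str, List[str]] = {}
    if attrs != ".":
        for pair in attrs.split(";"):
            if not pair:
                continue
            key, _, value = pair.partition("=")
            attributes[unquote(key)] = [unquote(v) for v in value.split(",")]

    return Gff3Feature(
        seqid=unquote(seqid),
        source=source,
        feature_type=ftype,
        start=int(start),
        end=int(end),
        score=None if score == "." else float(score),
        strand=strand,
        phase=None if phase == "." else int(phase),
        attributes=attributes,
    )


feature = parse_gff3_line("ctg123\t.\tgene\t1000\t9000\t.\t+\t.\tID=gene00001;Name=EDEN")
```

    Keeping attribute values as lists is a design choice made here so that multi-valued tags (for example several parents) need no special case.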

  4. 76 FR 79212 - Agency Information Collection Activities: Proposed Information Collection for Community Harvest...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-12-21

    ... parklands, the NPS needs information on harvest patterns among residents of communities with subsistence... been designated as resident zone communities for the respective park in recognition that many residents...

  5. Community attachment and resource harvesting in rural Denmark

    Treesearch

    Rodney R. Zwick; David Solan

    2002-01-01

    Community attachment has been related to "sense of place," and by extension to factors such as the natural resource base of a local geographic area and the utilitarian uses of those resources-a functional attachment that helps root people to a place. The purpose of this study was to examine the resource harvest activities of residents of three modern rural...

  6. The distributed annotation system.

    PubMed

    Dowell, R D; Jokerst, R M; Day, A; Eddy, S R; Stein, L

    2001-01-01

    Currently, most genome annotation is curated by centralized groups with limited resources. Efforts to share annotations transparently among multiple groups have not yet been satisfactory. Here we introduce a concept called the Distributed Annotation System (DAS). DAS allows sequence annotations to be decentralized among multiple third-party annotators and integrated on an as-needed basis by client-side software. The communication between client and servers in DAS is defined by the DAS XML specification. Annotations are displayed in layers, one per server. Any client or server adhering to the DAS XML specification can participate in the system; we describe a simple prototype client and server example. The DAS specification is being used experimentally by Ensembl, WormBase, and the Berkeley Drosophila Genome Project. Continued success will depend on the readiness of the research community to adopt DAS and provide annotations. All components are freely available from the project website http://www.biodas.org/.
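    The client-side integration described above, with annotations from several servers shown as one layer per server, can be sketched roughly as follows. This is an illustrative assumption only: it uses in-memory data rather than the DAS XML protocol, and `Annotation`, `overlaps`, and `build_layers` are hypothetical names that do not come from the DAS specification.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Annotation:
    """A sequence annotation as a client might hold it (hypothetical type)."""
    start: int   # inclusive
    end: int     # inclusive
    label: str


def overlaps(annotation: Annotation, start: int, end: int) -> bool:
    """True if the annotation shares at least one position with [start, end]."""
    return annotation.start <= end and annotation.end >= start


def build_layers(
    sources: Dict[str, List[Annotation]], start: int, end: int
) -> List[Tuple[str, List[Annotation]]]:
    """Return one layer per annotation source for the requested region.

    Each layer keeps only annotations overlapping the region, sorted by
    position; layers are ordered by source name so output is reproducible.
    """
    layers = []
    for name in sorted(sources):
        hits = sorted(
            (a for a in sources[name] if overlaps(a, start, end)),
            key=lambda a: (a.start, a.end),
        )
        layers.append((name, hits))
    return layers


# Two hypothetical annotation servers, queried for the region 100..200.
sources = {
    "server_b": [Annotation(50, 150, "repeat")],
    "server_a": [Annotation(500, 600, "far_gene"), Annotation(90, 120, "geneA")],
}
layers = build_layers(sources, 100, 200)
```

    Keeping the layers separate instead of merging them into one list mirrors the abstract's point that each third-party annotator remains identifiable.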

  7. A Survey of Timber Harvesting Simulation Models for Use in the South

    Treesearch

    Daniel V. Goulet; Donald L. Sirois; Ronald H. Iff

    1979-01-01

    Reviews literature about nine forest harvesting simulation models with potential for simulating southern operations. From the nine, five appear useful enough to warrant further analysis. An annotated bibliography of associated forest harvesting simulation literature is appended.

  8. Limited Effects of Variable-Retention Harvesting on Fungal Communities Decomposing Fine Roots in Coastal Temperate Rainforests.

    PubMed

    Philpott, Timothy J; Barker, Jason S; Prescott, Cindy E; Grayston, Sue J

    2018-02-01

    Fine root litter is the principal source of carbon stored in forest soils and a dominant source of carbon for fungal decomposers. Differences in decomposer capacity between fungal species may be important determinants of fine-root decomposition rates. Variable-retention harvesting (VRH) provides refuge for ectomycorrhizal fungi, but its influence on fine-root decomposers is unknown, as are the effects of functional shifts in these fungal communities on carbon cycling. We compared fungal communities decomposing fine roots (in litter bags) under VRH, clear-cut, and uncut stands at two sites (6 and 13 years postharvest) and two decay stages (43 days and 1 year after burial) in Douglas fir forests in coastal British Columbia, Canada. Fungal species and guilds were identified from decomposed fine roots using high-throughput sequencing. Variable retention had short-term effects on β-diversity; harvest treatment modified the fungal community composition at the 6-year-postharvest site, but not at the 13-year-postharvest site. Ericoid and ectomycorrhizal guilds were not more abundant under VRH, but stand age significantly structured species composition. Guild composition varied by decay stage, with ruderal species later replaced by saprotrophs and ectomycorrhizae. Ectomycorrhizal abundance on decomposing fine roots may partially explain why fine roots typically decompose more slowly than surface litter. Our results indicate that stand age structures fine-root decomposers but that decay stage is more important in structuring the fungal community than shifts caused by harvesting. The rapid postharvest recovery of fungal communities decomposing fine roots suggests resiliency within this community, at least in these young regenerating stands in coastal British Columbia. IMPORTANCE Globally, fine roots are a dominant source of carbon in forest soils, yet the fungi that decompose this material and that drive the sequestration or respiration of this carbon remain largely

  9. The influence of partial timber harvesting in riparian buffers on macroinvertebrate and fish communities in small streams in Minnesota, USA

    USGS Publications Warehouse

    Chizinski, Christopher J.; Vondracek, Bruce C.; Blinn, Charles R.; Newman, Raymond M.; Atuke, Dickson M.; Fredricks, Keith; Hemstad, Nathaniel A.; Merten, Eric C.; Schlesser, Nicholas

    2010-01-01

    Relatively few evaluations of aquatic macroinvertebrate and fish communities have been published in peer-reviewed literature detailing the effect of varying residual basal area (RBA) after timber harvesting in riparian buffers. Our analysis investigated the effects of partial harvesting within riparian buffers on aquatic macroinvertebrate and fish communities in small streams from two experiments in northern Minnesota northern hardwood-aspen forests. Each experiment evaluated partial harvesting within riparian buffers. In both experiments, benthic macroinvertebrates and fish were collected 1 year prior to harvest and in each of 3 years after harvest. We observed interannual variation for the macroinvertebrate abundance, diversity and taxon richness in the single-basin study and abundance and diversity in the multiple-basin study, but few effects related to harvest treatments in either study. However, interannual variation was not evident in the fish communities and we detected no significant changes in the stream fish communities associated with partially harvested riparian buffers in either study. This would suggest that timber harvesting in riparian management zones along reaches ≤200 m in length on both sides of the stream that retains RBA ≥ 12.4 ± 1.3 m2 ha−1 or on a single side of the stream that retains RBA ≥ 8.7 ± 1.6 m2 ha−1 may be adequate to protect macroinvertebrate and fish communities in our Minnesota study systems given these specific timber harvesting techniques.

  10. Effects of oyster harvest activities on Louisiana reef habitat and resident nekton communities

    USGS Publications Warehouse

    Beck, Steve; LaPeyre, Megan K.

    2015-01-01

    Oysters are often cited as “ecosystem engineers” because they modify their environment. Coastal Louisiana contains extensive oyster reef areas that have been harvested for decades, and whether differences in habitat functions exist between those areas and nonharvested reefs is unclear. We compared reef physical structure and resident community metrics between these 2 subtidal reef types. Harvested reefs were more fragmented and had lower densities of live eastern oysters (Crassostrea virginica) and hooked mussels (Ischadium recurvum) than the nonharvested reefs. Stable isotope values (13C and 15N) of dominant nekton species and basal food sources were used to compare food web characteristics. Nonpelagic source contributions and trophic positions of dominant species were slightly elevated at harvested sites. Oyster harvesting appeared to have decreased the number of large oysters and to have increased the percentage of reefs that were nonliving by decreasing water column filtration and benthopelagic coupling. The differences in reef matrix composition, however, had little effect on resident nekton communities. Understanding the thresholds of reef habitat areas, the oyster density or oyster size distribution below which ecosystem services may be compromised, remains key to sustainable management.

  11. Genome annotation in a community college cell biology lab.

    PubMed

    Beagley, C Timothy

    2013-01-01

    The Biology Department at Salt Lake Community College has used the IMG-ACT toolbox to introduce a genome mapping and annotation exercise into the laboratory portion of its Cell Biology course. This project provides students with an authentic inquiry-based learning experience while introducing them to computational biology and contemporary learning skills. Additionally, the project strengthens student understanding of the scientific method and contributes to student learning gains in curricular objectives centered around basic molecular biology, specifically, the Central Dogma. Importantly, inclusion of this project in the laboratory course provides students with a positive learning environment and allows for the use of cooperative learning strategies to increase overall student success. Copyright © 2012 International Union of Biochemistry and Molecular Biology, Inc.

  12. Continuity and change in subsistence harvests in five Bering Sea communities: Akutan, Emmonak, Savoonga, St. Paul, and Togiak

    NASA Astrophysics Data System (ADS)

    Fall, James A.; Braem, Nicole S.; Brown, Caroline L.; Hutchinson-Scarbrough, Lisa B.; Koster, David S.; Krieg, Theodore M.

    2013-10-01

    To document and quantify subsistence harvests of fish and wildlife resources, and provide topics for subsequent key respondent interviews to collect local and traditional knowledge (LTK) about the Bering Sea ecosystem, comprehensive household harvest surveys were conducted in four Bering Sea Alaska Native communities: Akutan, Emmonak, Savoonga, and Togiak. In a fifth community, St. Paul, annual programs to document two key subsistence resources, fur seals and sea lions, continued. Surveys documented relatively high and diverse subsistence harvests, consistent with earlier research that demonstrated the continuing economic, social, and cultural importance of subsistence uses of wild resources. The research also found differences in subsistence use patterns compared to previous years' studies, such as harvest levels, harvest composition, and diversity of resources used, although differences between study years were not uniform across communities. Survey respondents, as well as key respondents in subsequent interviews, identified a complex range of personal, economic, and environmental factors when comparing subsistence uses in the study year with other years, such as increasing costs of fuel and purchased food, commercial fisheries harvests and bycatch, more persistent storms and less predictable winds, and reduced sea ice. Such conditions affect resource abundance and locations as well as access to fish and wildlife populations, and may shape long-term trends. So far, as in the past, families and communities have adapted to changing economic, social, and environmental conditions, but the future is less clear if such changes intensify or accelerate. Local community residents should be essential partners in future efforts to understand these complex processes that affect the natural resources of the Bering Sea.

  13. Resequencing and annotation of the Nostoc punctiforme ATTC 29133 genome: facilitating biofuel and high-value chemical production

    DOE PAGES

    Moraes, Luis E.; Blow, Matthew J.; Hawley, Erik R.; ...

    2017-02-16

    Cyanobacteria have the potential to produce bulk and fine chemicals and members belonging to Nostoc sp. have received particular attention due to their relatively fast growth rate and the relative ease with which they can be harvested. Nostoc punctiforme is an aerobic, motile, Gram-negative, filamentous cyanobacterium that has been studied intensively to enhance our understanding of microbial carbon and nitrogen fixation. The genome of the type strain N. punctiforme ATCC 29133 was sequenced in 2001 and the scientific community has used these genome data extensively since then. Advances in bioinformatics tools for sequence annotation and the importance of this organism prompted us to resequence and reanalyze its genome and to make both the initial and improved annotations available to the scientific community. The new draft genome has a total size of 9.1 Mbp and consists of 65 contiguous pieces of DNA with a GC content of 41.38% and 7664 protein-coding genes. Furthermore, the resequenced genome is slightly (5152 bp) larger and contains 987 more genes with functional prediction when compared to the previously published version. We deposited the annotation of both genomes in the Department of Energy’s IMG database to facilitate easy genome exploration by the scientific community without the need of in-depth bioinformatics skills. We expect that facilitated access and ability to search the N. punctiforme ATCC 29133 for genes of interest will significantly facilitate metabolic engineering and genome prospecting efforts and ultimately the synthesis of biofuels and natural products from this keystone organism and closely related cyanobacteria.

  14. Resequencing and annotation of the Nostoc punctiforme ATTC 29133 genome: facilitating biofuel and high-value chemical production

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Moraes, Luis E.; Blow, Matthew J.; Hawley, Erik R.

    Cyanobacteria have the potential to produce bulk and fine chemicals and members belonging to Nostoc sp. have received particular attention due to their relatively fast growth rate and the relative ease with which they can be harvested. Nostoc punctiforme is an aerobic, motile, Gram-negative, filamentous cyanobacterium that has been studied intensively to enhance our understanding of microbial carbon and nitrogen fixation. The genome of the type strain N. punctiforme ATCC 29133 was sequenced in 2001 and the scientific community has used these genome data extensively since then. Advances in bioinformatics tools for sequence annotation and the importance of this organism prompted us to resequence and reanalyze its genome and to make both the initial and improved annotations available to the scientific community. The new draft genome has a total size of 9.1 Mbp and consists of 65 contiguous pieces of DNA with a GC content of 41.38% and 7664 protein-coding genes. Furthermore, the resequenced genome is slightly (5152 bp) larger and contains 987 more genes with functional prediction when compared to the previously published version. We deposited the annotation of both genomes in the Department of Energy’s IMG database to facilitate easy genome exploration by the scientific community without the need of in-depth bioinformatics skills. We expect that facilitated access and ability to search the N. punctiforme ATCC 29133 for genes of interest will significantly facilitate metabolic engineering and genome prospecting efforts and ultimately the synthesis of biofuels and natural products from this keystone organism and closely related cyanobacteria.

  15. TriAnnot: A Versatile and High Performance Pipeline for the Automated Annotation of Plant Genomes

    PubMed Central

    Leroy, Philippe; Guilhot, Nicolas; Sakai, Hiroaki; Bernard, Aurélien; Choulet, Frédéric; Theil, Sébastien; Reboux, Sébastien; Amano, Naoki; Flutre, Timothée; Pelegrin, Céline; Ohyanagi, Hajime; Seidel, Michael; Giacomoni, Franck; Reichstadt, Mathieu; Alaux, Michael; Gicquello, Emmanuelle; Legeai, Fabrice; Cerutti, Lorenzo; Numa, Hisataka; Tanaka, Tsuyoshi; Mayer, Klaus; Itoh, Takeshi; Quesneville, Hadi; Feuillet, Catherine

    2012-01-01

    In support of the international effort to obtain a reference sequence of the bread wheat genome and to provide plant communities dealing with large and complex genomes with a versatile, easy-to-use online automated tool for annotation, we have developed the TriAnnot pipeline. Its modular architecture allows for the annotation and masking of transposable elements, the structural, and functional annotation of protein-coding genes with an evidence-based quality indexing, and the identification of conserved non-coding sequences and molecular markers. The TriAnnot pipeline is parallelized on a 712 CPU computing cluster that can run a 1-Gb sequence annotation in less than 5 days. It is accessible through a web interface for small scale analyses or through a server for large scale annotations. The performance of TriAnnot was evaluated in terms of sensitivity, specificity, and general fitness using curated reference sequence sets from rice and wheat. In less than 8 h, TriAnnot was able to predict more than 83% of the 3,748 CDS from rice chromosome 1 with a fitness of 67.4%. On a set of 12 reference Mb-sized contigs from wheat chromosome 3B, TriAnnot predicted and annotated 93.3% of the genes among which 54% were perfectly identified in accordance with the reference annotation. It also allowed the curation of 12 genes based on new biological evidences, increasing the percentage of perfect gene prediction to 63%. TriAnnot systematically showed a higher fitness than other annotation pipelines that are not improved for wheat. As it is easily adaptable to the annotation of other plant genomes, TriAnnot should become a useful resource for the annotation of large and complex genomes in the future. PMID:22645565

  16. A metagenomic survey of forest soil microbial communities more than a decade after timber harvesting.

    PubMed

    Wilhelm, Roland C; Cardenas, Erick; Leung, Hilary; Maas, Kendra; Hartmann, Martin; Hahn, Aria; Hallam, Steven; Mohn, William W

    2017-01-01

    The scarcity of long-term data on soil microbial communities in the decades following timber harvesting limits current understanding of the ecological problems associated with maintaining the productivity of managed forests. The high complexity of soil communities and the heterogeneity of forest and soil necessitates a comprehensive approach to understand the role of microbial processes in managed forest ecosystems. Here, we describe a curated collection of well replicated, multi-faceted data from eighteen reforested sites in six different North American ecozones within the Long-term Soil Productivity (LTSP) Study, without detailed analysis of results or discussion. The experiments were designed to contrast microbial community composition and function among forest soils from harvested treatment plots with varying intensities of organic matter removal. The collection includes 724 bacterial (16S) and 658 fungal (ITS2) amplicon libraries, 133 shotgun metagenomic libraries as well as stable isotope probing amplicon libraries capturing the effects of harvesting on hemicellulolytic and cellulolytic populations. This collection serves as a foundation for the LTSP Study and other studies of the ecology of forest soil and forest disturbance.

  17. A metagenomic survey of forest soil microbial communities more than a decade after timber harvesting

    PubMed Central

    Wilhelm, Roland C.; Cardenas, Erick; Leung, Hilary; Maas, Kendra; Hartmann, Martin; Hahn, Aria; Hallam, Steven; Mohn, William W.

    2017-01-01

    The scarcity of long-term data on soil microbial communities in the decades following timber harvesting limits current understanding of the ecological problems associated with maintaining the productivity of managed forests. The high complexity of soil communities and the heterogeneity of forest and soil necessitates a comprehensive approach to understand the role of microbial processes in managed forest ecosystems. Here, we describe a curated collection of well replicated, multi-faceted data from eighteen reforested sites in six different North American ecozones within the Long-term Soil Productivity (LTSP) Study, without detailed analysis of results or discussion. The experiments were designed to contrast microbial community composition and function among forest soils from harvested treatment plots with varying intensities of organic matter removal. The collection includes 724 bacterial (16S) and 658 fungal (ITS2) amplicon libraries, 133 shotgun metagenomic libraries as well as stable isotope probing amplicon libraries capturing the effects of harvesting on hemicellulolytic and cellulolytic populations. This collection serves as a foundation for the LTSP Study and other studies of the ecology of forest soil and forest disturbance. PMID:28765786

  18. Breeding Bird Community Continues to Colonize Riparian Buffers Ten Years after Harvest

    PubMed Central

    2015-01-01

    Riparian ecosystems integrate aquatic and terrestrial communities and often contain unique assemblages of flora and fauna. Retention of forested buffers along riparian habitats is a commonly employed practice to reduce potential negative effects of land use on aquatic systems. However, very few studies have examined long-term population and community responses to buffers, leading to considerable uncertainty about effectiveness of this practice for achieving conservation and management outcomes. We examined short- (1–2 years) and long-term (~10 years) avian community responses (occupancy and abundance) to riparian buffer prescriptions to clearcut logging silvicultural practices in the Pacific Northwest USA. We used a Before-After-Control-Impact experimental approach and temporally replicated point counts analyzed within a Bayesian framework. Our experimental design consisted of forested control sites with no harvest, sites with relatively narrow (~13m) forested buffers on each side of the stream, and sites with wider (~30m) and more variable width unharvested buffer. Buffer treatments exhibited a 31–44% increase in mean species richness in the post-harvest years, a pattern most evident 10 years post-harvest. Post-harvest, species turnover was much higher on both treatments (63–74%) relative to the controls (29%). We did not find evidence of local extinction for any species but found strong evidence (no overlap in 95% credible intervals) for an increase in site occupancy on both Narrow (short-term: 7%; long-term 29%) and Wide buffers (short-term: 21%; long-term 93%) relative to controls after harvest. We did not find a treatment effect on total avian abundance. When assessing relationships between buffer width and site level abundance of four riparian specialists, we did not find strong evidence of reduced abundance in Narrow or Wide buffers. Silviculture regulations in this region dictate average buffer widths on small and large permanent streams that range

  19. The relative abundance of predicted genes associated with ammonia-oxidation, nitrate reduction, and biomass decomposition in mineral soil are altered by intensive timber harvest.

    NASA Astrophysics Data System (ADS)

    Mushinski, R. M.; Zhou, Y.; Gentry, T. J.; Boutton, T. W.

    2017-12-01

    Forest ecosystems in the southern United States are substantially altered by anthropogenic disturbances such as timber harvest and land conversion, with effects being observed in carbon and nutrient pools as well as biogeochemical processes. Furthermore, the desire to develop renewable energy sources in the form of biomass extraction from logging residues may result in alterations in soil community structure and function. While the impact of forest management on soil physicochemical properties of the region has been studied, its long-term effect on soil bacterial community composition and metagenomic potential is relatively unknown, especially at deeper soil depths. This study investigates how intensive organic matter removal intensities associated with timber harvest influence decadal-scale alterations in bacterial community structure and functional potential in the upper 1-m of the soil profile, 18 years post-harvest in a Pinus taeda L. forest of eastern Texas. Amplicon sequencing of the 16S rRNA gene was used in conjunction with soil chemical analyses to evaluate treatment-induced differences in community composition and potential environmental drivers of associated change. Furthermore, functional potential was assessed by using amplicon data to make metagenomic predictions. Results indicate that increasing organic matter removal intensity leads to altered community composition and the relative abundance of dominant OTUs annotated to Burkholderia and Aciditerrimonas. The relative abundance of predicted genes associated with dissimilatory nitrate reduction and denitrification were highest in the most intensively harvested treatment while genes involved in nitrification were significantly lower in the most intensively harvested treatment. Furthermore, genes associated with glycosyltransferases were significantly reduced with increasing harvest intensity while polysaccharide lyases increased. These results imply that intensive organic matter removal may create

  20. Considering departures from current timber harvesting policies: case studies of four communities in the Pacific Northwest.

    Treesearch

    Con H Schallau; Paul E. Polzin

    1983-01-01

    U.S. Department of Agriculture regulations permit departures from current National Forest timber harvesting policies when "implementation of base harvest schedules.., would cause a substantial adverse impact upon a community .... " This paper describes the kinds of information needed for forest managers to adequately assess the relevance of the departure...

  1. Promotores de salud and community health workers: an annotated bibliography.

    PubMed

    WestRasmus, Emma K; Pineda-Reyes, Fernando; Tamez, Montelle; Westfall, John M

    2012-01-01

    For underserved and disenfranchised communities in the United States, affordable, effective health care can be nearly inaccessible, which often leads to the exclusion of these communities from relevant medical information and care. Barriers to care are especially salient in minority communities, where language, traditions and customs, socioeconomics, and access to education can serve as additional roadblocks to accessing health care information and services. These factors have contributed to a national health disparity crisis that unnecessarily places some communities in a vulnerable position without adequate prevention and treatment opportunities. One solution to the exclusion some communities face in the health care system may be the promotores de salud (PdS)/community health worker (CHW), an approach to culturally competent health care delivery whose popularity in the mainstream health care system has been steadily growing in recent decades. Known by a wide variety of names and broad in the spectrum of health issues they address, the PdS/CHW serves as cultural brokers between their own community and the formal health care system and can play a crucial role in promoting health and wellness within their community. This annotated bibliography was created to educate the reader about the history, definition, key features, utility, outcomes, and broad potential of the CHW approach in a variety of populations. Intended to serve as a reference point to a vast body of information on the CHW/PdS approach, this document is a resource for those wishing to effect change in the disparities within the health care system, and to improve the access to, quality, and cost of health care for underserved patients and their communities. Promotores de Salud is a Spanish term that translates to Health Promoter. A female health worker may be referred to as a Promotora, a male as a Promotor, and the plural of both is Promotores. 
For the purposes of this bibliography, the terms community

  2. Quality assessment and public health status of harvested rainwater in a peri-urban community in Edo State of Nigeria.

    PubMed

    Igbinosa, Isoken H; Aighewi, Isoken T

    2017-08-01

    Harvested rainwater is an alternative water source in communities where water distribution systems are limited or scarce. However, contamination of roof-harvested rainwater is of immense concern to public health. Therefore, this study was initiated to assess the levels of physicochemical quality and heavy metal concentrations in the harvested rainwater from Oluku communities in Benin City, Edo State, Nigeria. The roof-harvested rainwater samples were collected from 20 independent residential households in Oluku communities, between April 2015 and September 2015. Physicochemical analyses were carried out using standard analytical methods, and heavy metal concentrations were determined using atomic absorption spectrophotometry. The evaluation of the rainwater harvesting shows that 60% (12/20) of the roofs were made of corrugated iron sheets; aluminum sheets, 20% (4/20); asbestos, 10% (2/20); and open space was 10% (2/20). Also, the storage systems used for the storage of harvested rainwater were as follows: PVC tanks, 20% (4/20); drums, 30% (6/20); buckets, 25% (5/20); and wells, 25% (5/20). The physicochemical indicators investigated (temperature, nitrate, chlorine content, electrical conductivity, phosphate, total dissolved solids, and sulfate) were within World Health Organization (WHO) guidelines. However, some pH levels of the roof-harvested rainwater were acidic and below the WHO standard. Furthermore, a high value of turbidity was observed in some locations and exceeded the WHO guidelines. Though some heavy metal indicators (Zn, Na, K, and Ca) in this study were within the WHO guidelines, some locations revealed heavy metal (Cu, Fe, and Cd) concentrations slightly above the WHO guidelines. There is a need for proper rainwater harvesting systems and continuous monitoring of harvested rainwater for potable uses.

  3. CommWalker: correctly evaluating modules in molecular networks in light of annotation bias.

    PubMed

    Luecken, M D; Page, M J T; Crosby, A J; Mason, S; Reinert, G; Deane, C M

    2018-03-15

    Detecting novel functional modules in molecular networks is an important step in biological research. In the absence of gold standard functional modules, functional annotations are often used to verify whether detected modules/communities have biological meaning. However, as we show, the uneven distribution of functional annotations means that such evaluation methods favor communities of well-studied proteins. We propose a novel framework for the evaluation of communities as functional modules. Our proposed framework, CommWalker, takes communities as inputs and evaluates them in their local network environment by performing short random walks. We test CommWalker's ability to overcome annotation bias using input communities from four community detection methods on two protein interaction networks. We find that modules accepted by CommWalker are similarly co-expressed as those accepted by current methods. Crucially, CommWalker performs well not only in well-annotated regions, but also in regions otherwise obscured by poor annotation. CommWalker community prioritization both faithfully captures well-validated communities and identifies functional modules that may correspond to more novel biology. The CommWalker algorithm is freely available at opig.stats.ox.ac.uk/resources or as a docker image on the Docker Hub at hub.docker.com/r/lueckenmd/commwalker/. Contact: deane@stats.ox.ac.uk. Supplementary data are available at Bioinformatics online.

  4. Influences of upland timber harvest on aquatic invertebrate communities in seasonal ponds: efficacy of forested buffers

    Treesearch

    Mark A. Hanson; Brian J. Palik; James O. Church; Anthony T. Miller

    2010-01-01

    We assessed community responses of aquatic invertebrates in 16 small, seasonal ponds in a forested region of north central Minnesota, USA, to evaluate potential influences of timber harvest and efficacy of uncut forested buffers in adjacent uplands. Invertebrate data gathered before (2000) and during the first 4 years following clearcut timber harvest (2001-2004)...

  5. Annotated bibliography on forest practices legislation related to water quality

    Treesearch

    Neil K. Huyler; David McMath; Daphne Hewitt

    1999-01-01

    Includes annotated citations of literature on forest practices regulations related to all aspects of water quality protection. The bibliography is divided into three sections: 1) Water quality protection during timber harvesting; 2) Methods for assessing the costs and benefits of water quality protection; and 3) Effectiveness of regulatory programs in protecting water...

  6. The influence of partial timber harvest in riparian management zones on macroinvertebrate and fish communities on first- and second-order streams in northern Minnesota

    USGS Publications Warehouse

    Chizinski, Christopher J.; Vondracek, Bruce C.; Blinn, Charles R.; Newman, Raymond M.; Atuke, Dickson M.; Fredricks, Keith; Hemstad, Nathaniel A.; Merten, Eric; Schlesser, Nicholas

    2010-01-01

    Relatively few evaluations of aquatic macroinvertebrate and fish communities have been published in peer-reviewed literature detailing the effect of varying residual basal area (RBA) after timber harvesting in riparian buffers. Our analysis investigated the effects of partial harvesting within riparian buffers on aquatic macroinvertebrate and fish communities in small streams from two experiments in northern Minnesota northern hardwood-aspen forests. Each experiment evaluated partial harvesting within riparian buffers. In both experiments, benthic macroinvertebrates and fish were collected 1 year prior to harvest and in each of 3 years after harvest. We observed interannual variation for the macroinvertebrate abundance, diversity and taxon richness in the single-basin study and abundance and diversity in the multiple-basin study, but few effects related to harvest treatments in either study. However, interannual variation was not evident in the fish communities and we detected no significant changes in the stream fish communities associated with partially harvested riparian buffers in either study. This would suggest that timber harvesting in riparian management zones along reaches ≤200 m in length on both sides of the stream that retains RBA ≥ 12.4 ± 1.3 m² ha⁻¹ or on a single side of the stream that retains RBA ≥ 8.7 ± 1.6 m² ha⁻¹ may be adequate to protect macroinvertebrate and fish communities in our Minnesota study systems given these specific timber harvesting techniques.

  7. A guide to best practices for Gene Ontology (GO) manual annotation

    PubMed Central

    Balakrishnan, Rama; Harris, Midori A.; Huntley, Rachael; Van Auken, Kimberly; Cherry, J. Michael

    2013-01-01

    The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374 000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. Database URL: http://www.geneontology.org PMID:23842463

  8. The Community Junior College: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Rarig, Emory W., Jr., Ed.

    This annotated bibliography on the junior college is arranged by topic: research tools, history, functions and purposes, organization and administration, students, programs, personnel, facilities, and research. It covers publications through the fall of 1965 and has an author index. (HH)

  9. FOCIH: Form-Based Ontology Creation and Information Harvesting

    NASA Astrophysics Data System (ADS)

    Tao, Cui; Embley, David W.; Liddle, Stephen W.

    Creating an ontology and populating it with data are both labor-intensive tasks requiring a high degree of expertise. Thus, scaling ontology creation and population to the size of the web in an effort to create a web of data—which some see as Web 3.0—is prohibitive. Can we find ways to streamline these tasks and lower the barrier enough to enable Web 3.0? Toward this end we offer a form-based approach to ontology creation that provides a way to create Web 3.0 ontologies without the need for specialized training. And we offer a way to semi-automatically harvest data from the current web of pages for a Web 3.0 ontology. In addition to harvesting information with respect to an ontology, the approach also annotates web pages and links facts in web pages to ontological concepts, resulting in a web of data superimposed over the web of pages. Experience with our prototype system shows that mappings between conceptual-model-based ontologies and forms are sufficient for creating the kind of ontologies needed for Web 3.0, and experiments with our prototype system show that automatic harvesting, automatic annotation, and automatic superimposition of a web of data over a web of pages work well.

  10. Avian influenza prevalence among hunter-harvested birds in a remote Canadian First Nation community.

    PubMed

    Liberda, Eric N; Meldrum, Richard; Charania, Nadia A; Davey, Robert; Tsuji, Leonard Js

    2017-01-01

    Avian influenza virus (AIV) prevalence has been associated with wild game and other bird species. The contamination of these birds may pose a greater risk to those who regularly hunt and consume infected species. Due to resident concerns communicated by local Band Council, hunter-harvested birds from a remote First Nation community in subArctic Ontario, Canada were assessed for AIV. Hunters, and especially those who live a subsistence lifestyle, are at higher risk of AIV exposure due to their increased contact with wild birds, which represent an important part of their diet. Cloacal swabs from 304 harvested game birds representing several species of wild birds commonly hunted and consumed in this First Nation community were analyzed for AIV using real-time reverse transcription polymerase chain reaction. Subtyping was performed using reverse transcription polymerase chain reaction. Sequences were assembled using Lasergene, and the sequences were compared to Genbank. In total, 16 of the 304 cloacal swab samples were positive for AIV. Of the 16 positive samples, 12 were found in mallard ducks, 3 were found in snow geese (wavies), and 1 positive sample was found in partridge. The AIV samples were subtyped, when possible, and found to be positive for the low pathogenic avian influenza virus subtypes H3 and H4. No samples were positive for subtypes of human concern, namely H5 and H7. This work represents the first AIV monitoring program results of hunter-harvested birds in a remote subsistence First Nation community. Community-level surveillance of AIV in remote subsistence hunting communities may help to identify future risks, while educating those who may have the highest exposure about proper handling of hunted birds. Ultimately, only low pathogenic strains of AIV were found, but monitoring should be continued and expanded to safeguard those with the highest exposure risk to AIV.
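
    The counts in this record can be turned into an overall prevalence with a short calculation. The sketch below uses only the numbers given in the abstract (304 swabs; 12, 3 and 1 positives by species). The percentages it prints are derived here and are not reported in the abstract, and the variable names are illustrative. The per-species figures are shares of the positive samples, not species-level prevalence, because the abstract does not say how many birds of each species were sampled.

    ```python
    from fractions import Fraction

    # Counts reported in the abstract.
    total_swabs = 304
    positive_by_species = {"mallard duck": 12, "snow goose": 3, "partridge": 1}

    # The species counts should add up to the 16 positives reported.
    total_positive = sum(positive_by_species.values())

    # Overall prevalence = positive samples / samples tested.
    prevalence = Fraction(total_positive, total_swabs)

    print(f"positive samples: {total_positive} of {total_swabs}")
    print(f"overall prevalence: {float(prevalence):.1%}")

    # Share of the positive samples contributed by each species.
    for species, count in positive_by_species.items():
        share = count / total_positive
        print(f"{species}: {count} positives ({share:.1%} of positives)")
    ```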

  11. Semantic annotation in biomedicine: the current landscape.

    PubMed

    Jovanović, Jelena; Bagheri, Ebrahim

    2017-09-22

    The abundance and unstructured nature of biomedical texts, be it clinical or research content, impose significant challenges for the effective and efficient use of information and knowledge stored in such texts. Annotation of biomedical documents with machine intelligible semantics facilitates advanced, semantics-based text management, curation, indexing, and search. This paper focuses on annotation of biomedical entity mentions with concepts from relevant biomedical knowledge bases such as UMLS. As a result, the meaning of those mentions is unambiguously and explicitly defined, and thus made readily available for automated processing. This process is widely known as semantic annotation, and the tools that perform it are known as semantic annotators. Over the last dozen years, the biomedical research community has invested significant efforts in the development of biomedical semantic annotation technology. Aiming to establish grounds for further developments in this area, we review a selected set of state of the art biomedical semantic annotators, focusing particularly on general purpose annotators, that is, semantic annotation tools that can be customized to work with texts from any area of biomedicine. We also examine potential directions for further improvements of today's annotators which could make them even more capable of meeting the needs of real-world applications. To motivate and encourage further developments in this area, along the suggested and/or related directions, we review existing and potential practical applications and benefits of semantic annotators.

  12. Collective dynamics of social annotation

    PubMed Central

    Cattuto, Ciro; Barrat, Alain; Baldassarri, Andrea; Schehr, Gregory; Loreto, Vittorio

    2009-01-01

    The enormous increase of popularity and use of the worldwide web has led in the recent years to important changes in the ways people communicate. An interesting example of this fact is provided by the now very popular social annotation systems, through which users annotate resources (such as web pages or digital photographs) with keywords known as “tags.” Understanding the rich emergent structures resulting from the uncoordinated actions of users calls for an interdisciplinary effort. In particular concepts borrowed from statistical physics, such as random walks (RWs), and complex networks theory, can effectively contribute to the mathematical modeling of social annotation systems. Here, we show that the process of social annotation can be seen as a collective but uncoordinated exploration of an underlying semantic space, pictured as a graph, through a series of RWs. This modeling framework reproduces several aspects, thus far unexplained, of social annotation, among which are the peculiar growth of the size of the vocabulary used by the community and its complex network structure that represents an externalization of semantic structures grounded in cognition and that are typically hard to access. PMID:19506244
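
    The idea of annotation as a series of random walks over a semantic graph can be illustrated with a toy simulation. The sketch below is not the authors' model: the ring-shaped graph, the walk length, the number of walks and all function names are assumptions chosen only for illustration. Each walk stands in for one annotation event, the nodes it visits stand in for tags, and the running count of distinct nodes plays the role of vocabulary size.

    ```python
    import random


    def ring_graph(n, k=2):
        """Hypothetical stand-in for a semantic graph: each node links to its
        k nearest neighbours on either side of a ring of n nodes."""
        return {i: [(i + d) % n for d in range(-k, k + 1) if d != 0] for i in range(n)}


    def random_walk(graph, start, length, rng):
        """Return the nodes visited by a walk of `length` steps from `start`."""
        node = start
        path = [node]
        for _ in range(length):
            node = rng.choice(graph[node])
            path.append(node)
        return path


    def vocabulary_growth(graph, n_walks, walk_length, seed=0):
        """Run independent walks from random start nodes and record the number
        of distinct nodes ("tags") seen after each walk."""
        rng = random.Random(seed)
        nodes = list(graph)
        seen = set()
        growth = []
        for _ in range(n_walks):
            start = rng.choice(nodes)
            seen.update(random_walk(graph, start, walk_length, rng))
            growth.append(len(seen))
        return growth


    graph = ring_graph(500)
    growth = vocabulary_growth(graph, n_walks=200, walk_length=5)
    print(growth[::20])  # vocabulary size sampled every 20 walks
    ```

    In this toy setting the count of distinct nodes can never decrease and is capped by the size of the graph; how fast it grows depends entirely on the assumed graph and walk parameters.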

  13. Temporal Annotation in the Clinical Domain

    PubMed Central

    Styler, William F.; Bethard, Steven; Finan, Sean; Palmer, Martha; Pradhan, Sameer; de Groen, Piet C; Erickson, Brad; Miller, Timothy; Lin, Chen; Savova, Guergana; Pustejovsky, James

    2014-01-01

    This article discusses the requirements of a formal specification for the annotation of temporal information in clinical narratives. We discuss the implementation and extension of ISO-TimeML for annotating a corpus of clinical notes, known as the THYME corpus. To reflect the information task and the heavily inference-based reasoning demands in the domain, a new annotation guideline has been developed, “the THYME Guidelines to ISO-TimeML (THYME-TimeML)”. To clarify what relations merit annotation, we distinguish between linguistically-derived and inferentially-derived temporal orderings in the text. We also apply a top performing TempEval 2013 system against this new resource to measure the difficulty of adapting systems to the clinical domain. The corpus is available to the community and has been proposed for use in a SemEval 2015 task. PMID:29082229

  14. ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level

    PubMed Central

    Rocca-Serra, Philippe; Brandizi, Marco; Maguire, Eamonn; Sklyar, Nataliya; Taylor, Chris; Begley, Kimberly; Field, Dawn; Harris, Stephen; Hide, Winston; Hofmann, Oliver; Neumann, Steffen; Sterk, Peter; Tong, Weida; Sansone, Susanna-Assunta

    2010-01-01

    Summary: The first open source software suite for experimentalists and curators that (i) assists in the annotation and local management of experimental metadata from high-throughput studies employing one or a combination of omics and other technologies; (ii) empowers users to uptake community-defined checklists and ontologies; and (iii) facilitates submission to international public repositories. Availability and Implementation: Software, documentation, case studies and implementations at http://www.isa-tools.org Contact: isatools@googlegroups.com PMID:20679334

  15. Preventing eye injuries among citrus harvesters: the community health worker model.

    PubMed

    Monaghan, Paul F; Forst, Linda S; Tovar-Aguilar, Jose Antonio; Bryant, Carol A; Israel, Glenn D; Galindo-Gonzalez, Sebastian; Thompson, Zachary; Zhu, Yiliang; McDermott, Robert J

    2011-12-01

    Although eye injuries are common among citrus harvesters, the proportion of workers using protective eyewear has been negligible. We focused on adoption of worker-tested safety glasses with and without the presence and activities of trained peer-worker role models on harvesting crews. Observation of 13 citrus harvesting crews established baseline use of safety eyewear. Nine crews subsequently were assigned a peer worker to model use of safety glasses, conduct eye safety education, and treat minor eye injuries. Safety eyewear use by crews was monitored up to 15 weeks into the intervention. Intervention crews with peer workers had significantly higher rates of eyewear use than control crews. Intervention exposure time and level of worker use were strongly correlated. Among intervention crews, workers with 1 to 2 years of experience (odds ratio [OR] = 2.89; 95% confidence interval [CI] = 1.11, 7.55) and who received help from their peer worker (OR = 3.73; 95% CI = 1.21, 11.57) were significantly more likely to use glasses than were other intervention crew members. Adaptation of the community health worker model for this setting improved injury prevention practices and may have relevance for similar agricultural settings.

  16. NoGOA: predicting noisy GO annotations using evidences and sparse representation.

    PubMed

    Yu, Guoxian; Lu, Chang; Wang, Jun

    2017-07-21

    Gene Ontology (GO) is a community effort to represent functional features of gene products. GO annotations (GOA) provide functional associations between GO terms and gene products. Due to resource limitations, only a small portion of annotations are manually checked by curators, and the others are electronically inferred. Although quality control techniques have been applied to ensure the quality of annotations, the community consistently reports that there are still considerable noisy (or incorrect) annotations. Given the wide application of annotations, however, how to identify noisy annotations is an important but seldom studied open problem. We introduce a novel approach called NoGOA to predict noisy annotations. First, NoGOA applies sparse representation on the gene-term association matrix to reduce the impact of noisy annotations, and takes advantage of sparse representation coefficients to measure the semantic similarity between genes. Secondly, it preliminarily predicts noisy annotations of a gene based on aggregated votes from semantic neighborhood genes of that gene. Next, NoGOA estimates the ratio of noisy annotations for each evidence code based on direct annotations in GOA files archived on different periods, and then weights entries of the association matrix via estimated ratios and propagates weights to ancestors of direct annotations using GO hierarchy. Finally, it integrates evidence-weighted association matrix and aggregated votes to predict noisy annotations. Experiments on archived GOA files of six model species (H. sapiens, A. thaliana, S. cerevisiae, G. gallus, B. taurus and M. musculus) demonstrate that NoGOA achieves significantly better results than other related methods and removing noisy annotations improves the performance of gene function prediction. The comparative study justifies the effectiveness of integrating evidence codes with sparse representation for predicting noisy GO annotations. Codes and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NoGOA.

  17. Perceptions of environmental changes and lethargic crab disease among crab harvesters in a Brazilian coastal community.

    PubMed

    Firmo, Angélica M S; Tognella, Mônica M P; Có, Walter L O; Barboza, Raynner R D; Alves, Rômulo R N

    2011-11-16

    Lethargic Crab Disease (LCD) has caused significant mortalities in the population of Ucides cordatus crabs in the Mucuri estuary in Bahia State, Brazil, and has brought social and economic problems to many crab-harvesting communities that depend on this natural resource. The present work examined the perceptions of members of a Brazilian crab harvesting community concerning environmental changes and the Lethargic Crab Disease. Field work was undertaken during the period between January and April/2009, with weekly or biweekly field excursions during which open and semi-structured interviews were held with local residents in the municipality of Mucuri, Bahia State, Brazil. A total of 23 individuals were interviewed, all of whom had at least 20 years of crab-collecting experience in the study region. Key-informants (more experienced crab harvesters) were selected among the interviewees using the "native specialist" criterion. According to the collectors, LCD reached the Mucuri mangroves between 2004 and 2005, decimating almost all crab population in the area, and in 2007, 2008 and 2009 high mortalities of U. cordatus were again observed as a result of recurrences of this disease in the region. In addition to LCD, crabs were also suffering great stock reductions due to habitat degradation caused by deforestation, landfills, sewage effluents, domestic and industrial wastes and the introduction of exotic fish in the Mucuri River estuary. The harvesting community was found to have significant ecological knowledge about the functioning of mangrove swamp ecology, the biology of crabs, and the mass mortality that directly affected the economy of this community, and this information was largely in accordance with scientific knowledge. The study of traditional knowledge makes it possible to better understand human interactions with the environment and aids in the elaboration of appropriate strategies for natural resource conservation.

  18. Perceptions of environmental changes and Lethargic crab disease among crab harvesters in a Brazilian coastal community

    PubMed Central

    2011-01-01

    Background: Lethargic Crab Disease (LCD) has caused significant mortalities in the population of Ucides cordatus crabs in the Mucuri estuary in Bahia State, Brazil, and has brought social and economic problems to many crab-harvesting communities that depend on this natural resource. The present work examined the perceptions of members of a Brazilian crab harvesting community concerning environmental changes and the Lethargic Crab Disease. Methods: Field work was undertaken during the period between January and April/2009, with weekly or biweekly field excursions during which open and semi-structured interviews were held with local residents in the municipality of Mucuri, Bahia State, Brazil. A total of 23 individuals were interviewed, all of whom had at least 20 years of crab-collecting experience in the study region. Key-informants (more experienced crab harvesters) were selected among the interviewees using the "native specialist" criterion. Results: According to the collectors, LCD reached the Mucuri mangroves between 2004 and 2005, decimating almost all crab population in the area, and in 2007, 2008 and 2009 high mortalities of U. cordatus were again observed as a result of recurrences of this disease in the region. In addition to LCD, crabs were also suffering great stock reductions due to habitat degradation caused by deforestation, landfills, sewage effluents, domestic and industrial wastes and the introduction of exotic fish in the Mucuri River estuary. The harvesting community was found to have significant ecological knowledge about the functioning of mangrove swamp ecology, the biology of crabs, and the mass mortality that directly affected the economy of this community, and this information was largely in accordance with scientific knowledge. Conclusions: The study of traditional knowledge makes it possible to better understand human interactions with the environment and aids in the elaboration of appropriate strategies for natural resource conservation.

  19. Genome and proteome annotation: organization, interpretation and integration

    PubMed Central

    Reeves, Gabrielle A.; Talavera, David; Thornton, Janet M.

    2008-01-01

    Recent years have seen a huge increase in the generation of genomic and proteomic data. This has been due to improvements in current biological methodologies, the development of new experimental techniques and the use of computers as support tools. All these raw data are useless if they cannot be properly analysed, annotated, stored and displayed. Consequently, a vast number of resources have been created to present the data to the wider community. Annotation tools and databases provide the means to disseminate these data and to comprehend their biological importance. This review examines the various aspects of annotation: type, methodology and availability. Moreover, it puts a special interest on novel annotation fields, such as that of phenotypes, and highlights the recent efforts focused on the integrating annotations. PMID:19019817

  20. Collaborative Movie Annotation

    NASA Astrophysics Data System (ADS)

    Zad, Damon Daylamani; Agius, Harry

    In this paper, we focus on metadata for self-created movies like those found on YouTube and Google Video, the duration of which is increasing in line with falling upload restrictions. While simple tags may have been sufficient for most purposes for traditionally very short video footage that contains a relatively small amount of semantic content, this is not the case for movies of longer duration which embody more intricate semantics. Creating metadata is a time-consuming process that takes a great deal of individual effort; however, this effort can be greatly reduced by harnessing the power of Web 2.0 communities to create, update and maintain it. Consequently, we consider the annotation of movies within Web 2.0 environments, such that users create and share that metadata collaboratively, and propose an architecture for collaborative movie annotation. This architecture arises from the results of an empirical experiment where metadata creation tools, YouTube and an MPEG-7 modelling tool, were used by users to create movie metadata. The next section discusses related work in the areas of collaborative retrieval and tagging. Then, we describe the experiments that were undertaken on a sample of 50 users. Next, the results are presented which provide some insight into how users interact with existing tools and systems for annotating movies. Based on these results, the paper then develops an architecture for collaborative movie annotation.

  1. AphidBase: A centralized bioinformatic resource for annotation of the pea aphid genome

    PubMed Central

    Legeai, Fabrice; Shigenobu, Shuji; Gauthier, Jean-Pierre; Colbourne, John; Rispe, Claude; Collin, Olivier; Richards, Stephen; Wilson, Alex C. C.; Tagu, Denis

    2015-01-01

    AphidBase is a centralized bioinformatic resource that was developed to facilitate community annotation of the pea aphid genome by the International Aphid Genomics Consortium (IAGC). The AphidBase Information System designed to organize and distribute genomic data and annotations for a large international community was constructed using open source software tools from the Generic Model Organism Database (GMOD). The system includes Apollo and GBrowse utilities as well as a wiki, blast search capabilities and a full text search engine. AphidBase strongly supported community cooperation and coordination in the curation of gene models during community annotation of the pea aphid genome. AphidBase can be accessed at http://www.aphidbase.com. PMID:20482635

  2. Forest lepidopteran communities are more resilient to shelterwood harvests compared to more intensive logging regimes.

    PubMed

    Summerville, Keith S

    2013-07-01

    A common measure of ecosystem resilience is the time course to recovery for a system that has been previously disturbed. The goal of this study was to assess whether forest lepidopteran communities displayed three different forms of resilience following experimental timber harvest. Specifically, I examined whether moth species assemblages returned to pre-logging composition (compositional resilience), species richness (structural resilience), and guild diversity (functional resilience) after forest management. Lepidoptera were sampled from 16 forest stands managed with one of four harvest treatments: no logging, clear-cutting, shelterwood harvests, and group selection harvests. Moths were sampled from all forest stands one year prior to harvest in 2007 and immediately postharvest in 2009-2011. Moth community composition only appeared to be resilient to timber harvest in stands managed with shelterwood methods (15% biomass removed) or in the unlogged stands within managed concession units. Both total species richness and species richness of Quercus-feeding moths also appeared to recover to a near original condition three years post-shelterwood logging. In contrast, moth assemblages in clear-cut stands and group selection stands (80% biomass removed) remained impoverished. Tests of functional resilience suggested that richness of species known to be pollinators was largely unaffected by timber management, and the number of moth species known to feed on herbaceous vegetation doubled in stands logged using group selection methods. Dietary specialists were disproportionately abundant in the unlogged stands postharvest, suggesting that species with more narrow dietary niches have the lowest resilience to timber management. These results suggest that most methods of forest management have short-term negative impacts on woody-plant-feeding Lepidoptera, but that the effects are limited to a few years when the harvest method involves shelterwood cuts. Herbaceous

  3. Metagenomic gene annotation by a homology-independent approach

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Froula, Jeff; Zhang, Tao; Salmeen, Annette

    2011-06-02

    Fully understanding the genetic potential of a microbial community requires functional annotation of all the genes it encodes. The recently developed deep metagenome sequencing approach has enabled rapid identification of millions of genes from a complex microbial community without cultivation. Current homology-based gene annotation fails to detect distantly-related or structural homologs. Furthermore, homology searches with millions of genes are very computationally intensive. To overcome these limitations, we developed rhModeller, a homology-independent software pipeline to efficiently annotate genes from metagenomic sequencing projects. Using cellulases and carbonic anhydrases as two independent test cases, we demonstrated that rhModeller is much faster than HMMER but with comparable accuracy, at 94.5% and 99.9% accuracy, respectively. More importantly, rhModeller has the ability to detect novel proteins that do not share significant homology to any known protein families. As ~50% of the 2 million genes derived from the cow rumen metagenome failed to be annotated based on sequence homology, we tested whether rhModeller could be used to annotate these genes. Preliminary results suggest that rhModeller is robust in the presence of missense and frameshift mutations, two common errors in metagenomic genes. Applying the pipeline to the cow rumen genes identified 4,990 novel cellulase candidates and 8,196 novel carbonic anhydrase candidates. In summary, we expect rhModeller to dramatically increase the speed and quality of metagenomic gene annotation.

  4. Harvest-created canopy gaps increase species and functional trait diversity of the forest ground-layer community

    Treesearch

    Christel C. Kern; Rebecca A. Montgomery; Peter B. Reich; Terry F. Strong

    2014-01-01

    Biodiversity conservation within managed forests depends, in part, on management practices that restore or maintain plant community diversity and function. Because many plant communities are adapted to natural disturbances, gap-based management has potential to meet this need by using the historical range of variation in canopy disturbances to guide elements of harvest...

  5. Improved annotation of the insect vector of citrus greening disease: biocuration by a diverse genomics community

    PubMed Central

    Hosmani, Prashant S.; Villalobos-Ayala, Krystal; Miller, Sherry; Shippy, Teresa; Flores, Mirella; Rosendale, Andrew; Cordola, Chris; Bell, Tracey; Mann, Hannah; DeAvila, Gabe; DeAvila, Daniel; Moore, Zachary; Buller, Kyle; Ciolkevich, Kathryn; Nandyal, Samantha; Mahoney, Robert; Van Voorhis, Joshua; Dunlevy, Megan; Farrow, David; Hunter, David; Morgan, Taylar; Shore, Kayla; Guzman, Victoria; Izsak, Allison; Dixon, Danielle E.; Cridge, Andrew; Cano, Liliana; Cao, Xiaolong; Jiang, Haobo; Leng, Nan; Johnson, Shannon; Cantarel, Brandi L.; Richards, Stephen; English, Adam; Shatters, Robert G.; Childers, Chris; Chen, Mei-Ju; Hunter, Wayne; Cilia, Michelle; Mueller, Lukas A.; Munoz-Torres, Monica; Nelson, David; Poelchau, Monica F.; Benoit, Joshua B.; Wiersma-Koch, Helen; D’Elia, Tom; Brown, Susan J.

    2017-01-01

    Abstract The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the pathogen associated with citrus Huanglongbing (HLB, citrus greening). HLB threatens citrus production worldwide. Suppression or reduction of the insect vector using chemical insecticides has been the primary method to inhibit the spread of citrus greening disease. Accurate structural and functional annotation of the Asian citrus psyllid genome, as well as a clear understanding of the interactions between the insect and CLas, are required for development of new molecular-based HLB control methods. A draft assembly of the D. citri genome has been generated and annotated with automated pipelines. However, knowledge transfer from well-curated reference genomes such as that of Drosophila melanogaster to newly sequenced ones is challenging due to the complexity and diversity of insect genomes. To identify and improve gene models as potential targets for pest control, we manually curated several gene families with a focus on genes that have key functional roles in D. citri biology and CLas interactions. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. A comprehensive transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models. Database URL: https://citrusgreening.org/ PMID:29220441

  6. Biogeography and organic matter removal shape long-term effects of timber harvesting on forest soil microbial communities.

    PubMed

    Wilhelm, Roland C; Cardenas, Erick; Maas, Kendra R; Leung, Hilary; McNeil, Larisa; Berch, Shannon; Chapman, William; Hope, Graeme; Kranabetter, J M; Dubé, Stephane; Busse, Matt; Fleming, Robert; Hazlett, Paul; Webster, Kara L; Morris, David; Scott, D Andrew; Mohn, William W

    2017-11-01

    The growing demand for renewable, carbon-neutral materials and energy is leading to intensified forest land-use. The long-term ecological challenges associated with maintaining soil fertility in managed forests are not yet known, in part due to the complexity of soil microbial communities and the heterogeneity of forest soils. This study determined the long-term effects of timber harvesting, accompanied by varied organic matter (OM) removal, on bacterial and fungal soil populations in 11- to 17-year-old reforested coniferous plantations at 18 sites across North America. Analysis of highly replicated 16S rRNA gene and ITS region pyrotag libraries and shotgun metagenomes demonstrated consistent changes in microbial communities in harvested plots that included the expansion of desiccation- and heat-tolerant organisms and decline in diversity of ectomycorrhizal fungi. However, the majority of taxa, including the most abundant and cosmopolitan groups, were unaffected by harvesting. Shifts in microbial populations that corresponded to increased temperature and soil dryness were moderated by OM retention, which also selected for sub-populations of fungal decomposers. Biogeographical differences in the distribution of taxa as well as local edaphic and environmental conditions produced substantial variation in the effects of harvesting. This extensive molecular-based investigation of forest soil advances our understanding of forest disturbance and lays the foundation for monitoring long-term impacts of timber harvesting.

  7. Inclusion: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Moore, Caroline; Carter, Susanne

    This annotated bibliography is a compilation of recently published literature about inclusion of students with disabilities in the mainstream of school and community life. The 279 resources are organized into 19 topical areas and are indexed by more than 200 subject descriptors. Within each section, resources are displayed alphabetically by author…

  8. Concept annotation in the CRAFT corpus.

    PubMed

    Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

    2012-07-09

    Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.

  9. Draft genome of the red harvester ant Pogonomyrmex barbatus.

    PubMed

    Smith, Chris R; Smith, Christopher D; Robertson, Hugh M; Helmkampf, Martin; Zimin, Aleksey; Yandell, Mark; Holt, Carson; Hu, Hao; Abouheif, Ehab; Benton, Richard; Cash, Elizabeth; Croset, Vincent; Currie, Cameron R; Elhaik, Eran; Elsik, Christine G; Favé, Marie-Julie; Fernandes, Vilaiwan; Gibson, Joshua D; Graur, Dan; Gronenberg, Wulfila; Grubbs, Kirk J; Hagen, Darren E; Viniegra, Ana Sofia Ibarraran; Johnson, Brian R; Johnson, Reed M; Khila, Abderrahman; Kim, Jay W; Mathis, Kaitlyn A; Munoz-Torres, Monica C; Murphy, Marguerite C; Mustard, Julie A; Nakamura, Rin; Niehuis, Oliver; Nigam, Surabhi; Overson, Rick P; Placek, Jennifer E; Rajakumar, Rajendhran; Reese, Justin T; Suen, Garret; Tao, Shu; Torres, Candice W; Tsutsui, Neil D; Viljakainen, Lumi; Wolschin, Florian; Gadau, Jürgen

    2011-04-05

    We report the draft genome sequence of the red harvester ant, Pogonomyrmex barbatus. The genome was sequenced using 454 pyrosequencing, and the current assembly and annotation were completed in less than 1 y. Analyses of conserved gene groups (more than 1,200 manually annotated genes to date) suggest a high-quality assembly and annotation comparable to recently sequenced insect genomes using Sanger sequencing. The red harvester ant is a model for studying reproductive division of labor, phenotypic plasticity, and sociogenomics. Although the genome of P. barbatus is similar to other sequenced hymenopterans (Apis mellifera and Nasonia vitripennis) in GC content and compositional organization, and possesses a complete CpG methylation toolkit, its predicted genomic CpG content differs markedly from the other hymenopterans. Gene networks involved in generating key differences between the queen and worker castes (e.g., wings and ovaries) show signatures of increased methylation and suggest that ants and bees may have independently co-opted the same gene regulatory mechanisms for reproductive division of labor. Gene family expansions (e.g., 344 functional odorant receptors) and pseudogene accumulation in chemoreception and P450 genes compared with A. mellifera and N. vitripennis are consistent with major life-history changes during the adaptive radiation of Pogonomyrmex spp., perhaps in parallel with the development of the North American deserts.

  10. Preventing Eye Injuries Among Citrus Harvesters: The Community Health Worker Model

    PubMed Central

    Monaghan, Paul F.; Forst, Linda S.; Tovar-Aguilar, Jose Antonio; Bryant, Carol A.; Israel, Glenn D.; Galindo-Gonzalez, Sebastian; Thompson, Zachary; Zhu, Yiliang

    2011-01-01

    Objectives. Although eye injuries are common among citrus harvesters, the proportion of workers using protective eyewear has been negligible. We focused on adoption of worker-tested safety glasses with and without the presence and activities of trained peer-worker role models on harvesting crews. Methods. Observation of 13 citrus harvesting crews established baseline use of safety eyewear. Nine crews subsequently were assigned a peer worker to model use of safety glasses, conduct eye safety education, and treat minor eye injuries. Safety eyewear use by crews was monitored up to 15 weeks into the intervention. Results. Intervention crews with peer workers had significantly higher rates of eyewear use than control crews. Intervention exposure time and level of worker use were strongly correlated. Among intervention crews, workers with 1 to 2 years of experience (odds ratio [OR] = 2.89; 95% confidence interval [CI] = 1.11, 7.55) and who received help from their peer worker (OR = 3.73; 95% CI = 1.21, 11.57) were significantly more likely to use glasses than were other intervention crew members. Conclusions. Adaptation of the community health worker model for this setting improved injury prevention practices and may have relevance for similar agricultural settings. PMID:22021291

  11. Variation of Bacterial Community Diversity in Rhizosphere Soil of Sole-Cropped versus Intercropped Wheat Field after Harvest.

    PubMed

    Yang, Zhenping; Yang, Wenping; Li, Shengcai; Hao, Jiaomin; Su, Zhifeng; Sun, Min; Gao, Zhiqiang; Zhang, Chunlai

    2016-01-01

    As the major crops in north China, spring crops are usually planted from April through May every spring and harvested in fall. Wheat is also a very common crop traditionally planted in fall or spring and harvested in summer year by year. This continuous cropping system exhibited the disadvantages of reducing the fertility of soil through decreasing microbial diversity. Thus, management of microbial diversity in the rhizosphere plays a vital role in sustainable crop production. In this study, ten common spring crops in north China were chosen sole-cropped and four were chosen intercropped with peanut in wheat fields after harvest. Denaturing gradient gel electrophoresis (DGGE) and DNA sequencing of one 16S rDNA fragment were used to analyze the bacterial diversity and species identification. DGGE profiles showed the bacterial community diversity in rhizosphere soil samples varied among various crops under different cropping systems, more diverse under intercropping system than under sole-cropping. Some intercropping-specific bands in DGGE profiles suggested that several bacterial species were stimulated by intercropping systems specifically. Furthermore, the identification of these dominant and functional bacteria by DNA sequencing indicated that intercropping systems are more beneficial to improve soil fertility. Compared to intercropping systems, we also observed changes in microbial community of rhizosphere soil under sole-crops. The rhizosphere bacterial community structure in spring crops showed a strong crop species-specific pattern. More importantly, Empedobacter brevis, a typical plant pathogen, was only found in the carrot rhizosphere, suggesting carrot should be sown prudently. In conclusion, our study demonstrated that crop species and cropping systems had significant effects on bacterial community diversity in the rhizosphere soils. We strongly suggest sorghum, glutinous millet and buckwheat could be taken into account as intercropping crops with peanut.

  12. Considerations to improve functional annotations in biological databases.

    PubMed

    Benítez-Páez, Alfonso

    2009-12-01

    Despite the great effort to design efficient systems allowing the electronic indexation of information concerning genes, proteins, structures, and interactions published daily in scientific journals, some problems are still observed in specific tasks such as functional annotation. The annotation of function is a critical issue for bioinformatic routines, such as for instance, in functional genomics and the further prediction of unknown protein function, which are highly dependent of the quality of existing annotations. Some information management systems evolve to efficiently incorporate information from large-scale projects, but often, annotation of single records from the literature is difficult and slow. In this short report, functional characterizations of a representative sample of the entire set of uncharacterized proteins from Escherichia coli K12 was compiled from Swiss-Prot, PubMed, and EcoCyc and demonstrate a functional annotation deficit in biological databases. Some issues are postulated as causes of the lack of annotation, and different solutions are evaluated and proposed to avoid them. The hope is that as a consequence of these observations, there will be new impetus to improve the speed and quality of functional annotation and ultimately provide updated, reliable information to the scientific community.

  13. Annotated Videography. Part 3. [Revised].

    ERIC Educational Resources Information Center

    United States Holocaust Memorial Museum, Washington, DC.

    This annotated videography has been designed to identify videotapes addressing Holocaust history that have been used effectively in classrooms and are available readily to most communities. The guide is divided into 15 topical categories, including: life before the Holocaust; perpetrators; propaganda; racism; antisemitism; mosaic of victims;…

  14. An annotated bibliography of completed and in-progress behavioral research for the Office of Buildings and Community Systems. [About 1000 items, usually with abstracts]

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weijo, R.O.; Roberson, B.F.; Eckert, R.

    This report provides an annotated bibliography of completed and in-progress consumer decision research useful for technology transfer and commercialization planning by the US Department of Energy's (DOE) Office of Buildings and Community Systems (OBCS). This report attempts to integrate the consumer research studies conducted across several public and private organizations over the last four to five years. Some of the sources of studies included in this annotated bibliography are DOE National Laboratories, public and private utilities, trade associations, states, and nonprofit organizations. This study divides the articles identified in this annotated bibliography into sections that are consistent with or similar to the system of organization used by OBCS.

  15. Effects of harvesting treatments on the ant community in a Mississippi River bottomland hardwood forest in west-central Mississippi

    Treesearch

    Lynne C. Thompson; David M. General; Brian Roy Lockhart

    2010-01-01

    We assessed effects that harvesting treatments had on the ant community in a Mississippi River bottomland hardwood forest in west-central MS. Ants were collected on Pittman Island using pitfall traps from July to November in 1996, 1997, and 2000. The forest received three replicated harvesting treatments in 1995, including: 1) uncut controls (check), 2) selection...

  16. An Annotated Bibliography: Budgeting in Higher Education.

    ERIC Educational Resources Information Center

    Emery, Rebecca A.

    Though the original focus of this annotated bibliography was upon budgeting in the public community college, its scope was expanded to include articles on budgeting in four-year colleges and in some private two- and four-year institutions when the articles were relevant to community colleges. Due to the inter-relationship between budgeting and…

  17. Concept annotation in the CRAFT corpus

    PubMed Central

    2012-01-01

    Background Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. Results This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. Conclusions As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http

  18. Response of herbaceous plant community diversity and composition to overstorey harvest within riparian management zones in Northern Hardwoods

    Treesearch

    Eric K. Zenner; Michelle A. Martin; Brian J. Palik; Jerilynn E. Peck; Charles R. Blinn

    2013-01-01

    Partial timber harvest within riparian management zones (RMZs) may permit active management of riparian forests while protecting stream ecosystems, but impacts on herbaceous communities are poorly understood. We compared herbaceous plant community abundance, diversity and composition in RMZs along small streams in northern Minnesota, USA, among four treatments before...

  19. Open semantic annotation of scientific publications using DOMEO.

    PubMed

    Ciccarese, Paolo; Ocana, Marco; Clark, Tim

    2012-04-24

    Our group has developed a useful shared software framework for performing, versioning, sharing and viewing Web annotations of a number of kinds, using an open representation model. The Domeo Annotation Tool was developed in tandem with this open model, the Annotation Ontology (AO). Development of both the Annotation Framework and the open model was driven by requirements of several different types of alpha users, including bench scientists and biomedical curators from university research labs, online scientific communities, publishing and pharmaceutical companies. Several use cases were incrementally implemented by the toolkit. These use cases in biomedical communications include personal note-taking, group document annotation, semantic tagging, claim-evidence-context extraction, reagent tagging, and curation of textmining results from entity extraction algorithms. We report on the Domeo user interface here. Domeo has been deployed in beta release as part of the NIH Neuroscience Information Framework (NIF, http://www.neuinfo.org) and is scheduled for production deployment in the NIF's next full release. Future papers will describe other aspects of this work in detail, including Annotation Framework Services and components for integrating with external textmining services, such as the NCBO Annotator web service, and with other textmining applications using the Apache UIMA framework.

  20. Open semantic annotation of scientific publications using DOMEO

    PubMed Central

    2012-01-01

    Background Our group has developed a useful shared software framework for performing, versioning, sharing and viewing Web annotations of a number of kinds, using an open representation model. Methods The Domeo Annotation Tool was developed in tandem with this open model, the Annotation Ontology (AO). Development of both the Annotation Framework and the open model was driven by requirements of several different types of alpha users, including bench scientists and biomedical curators from university research labs, online scientific communities, publishing and pharmaceutical companies. Several use cases were incrementally implemented by the toolkit. These use cases in biomedical communications include personal note-taking, group document annotation, semantic tagging, claim-evidence-context extraction, reagent tagging, and curation of textmining results from entity extraction algorithms. Results We report on the Domeo user interface here. Domeo has been deployed in beta release as part of the NIH Neuroscience Information Framework (NIF, http://www.neuinfo.org) and is scheduled for production deployment in the NIF’s next full release. Future papers will describe other aspects of this work in detail, including Annotation Framework Services and components for integrating with external textmining services, such as the NCBO Annotator web service, and with other textmining applications using the Apache UIMA framework. PMID:22541592

  1. Modeling loosely annotated images using both given and imagined annotations

    NASA Astrophysics Data System (ADS)

    Tang, Hong; Boujemaa, Nozha; Chen, Yunhao; Deng, Lei

    2011-12-01

    In this paper, we present an approach to learn latent semantic analysis models from loosely annotated images for automatic image annotation and indexing. The given annotation in training images is loose due to: 1. ambiguous correspondences between visual features and annotated keywords; 2. incomplete lists of annotated keywords. The second reason motivates us to enrich the incomplete annotation in a simple way before learning a topic model. In particular, some ``imagined'' keywords are poured into the incomplete annotation through measuring similarity between keywords in terms of their co-occurrence. Then, both given and imagined annotations are employed to learn probabilistic topic models for automatically annotating new images. We conduct experiments on two image databases (i.e., Corel and ESP) coupled with their loose annotations, and compare the proposed method with state-of-the-art discrete annotation methods. The proposed method improves word-driven probability latent semantic analysis (PLSA-words) up to a comparable performance with the best discrete annotation method, while a merit of PLSA-words is still kept, i.e., a wider semantic range.
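
    The enrichment step described above (adding "imagined" keywords on the basis of keyword co-occurrence) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which similarity measure or selection rule was used, so the Jaccard score, the `threshold` parameter, the function names, and the toy annotations below are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_similarity(annotations):
    """Jaccard similarity between keyword pairs, based on image co-occurrence.

    Assumption: Jaccard is used here only as a simple stand-in; the
    abstract does not name the similarity measure.
    """
    keyword_count = Counter()  # number of images carrying each keyword
    pair_count = Counter()     # number of images carrying both keywords
    for keywords in annotations:
        unique = sorted(set(keywords))
        keyword_count.update(unique)
        pair_count.update(combinations(unique, 2))

    similarity = {}
    for (a, b), both in pair_count.items():
        score = both / (keyword_count[a] + keyword_count[b] - both)
        similarity[(a, b)] = score
        similarity[(b, a)] = score  # store both directions for easy lookup
    return similarity


def enrich(keywords, similarity, threshold=0.5):
    """Return (given, imagined) keyword lists for one loosely annotated image.

    A keyword is "imagined" if it is not already given and its similarity
    to at least one given keyword meets the (hypothetical) threshold.
    """
    given = set(keywords)
    imagined = {
        b
        for (a, b), score in similarity.items()
        if a in given and b not in given and score >= threshold
    }
    return sorted(given), sorted(imagined)


# Toy training annotations (invented for illustration only).
annotations = [
    ["sky", "sea", "beach"],
    ["sky", "sea"],
    ["sea", "beach"],
    ["sky", "plane"],
]
sim = cooccurrence_similarity(annotations)
given, imagined = enrich(["beach"], sim, threshold=0.5)
print(given, imagined)
```

    In this toy data, "sea" co-occurs with "beach" often enough to be added as an imagined keyword, while "sky" does not. The enriched keyword lists would then feed the topic-model training that the abstract describes, which is outside the scope of this sketch.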

  2. NUTRIENT UPTAKE AND COMMUNITY METABOLISM IN STREAMS DRAINING HARVESTED AND OLD GROWTH WATERSHEDS: A PRELIMINARY ASSESSMENT

    EPA Science Inventory

    The effect of timber harvesting on streams is assessed using two measures of ecosystem function: nutrient uptake and community metabolism. This research is being conducted in streams of the southern Appalachian Mountains of North Carolina, the Ouachita Mountains of Arkansas, the Cascad...

  3. Rural Development Literature 1976-1977: An Updated Annotated Bibliography.

    ERIC Educational Resources Information Center

    Buzzard, Shirley, Comp.

    More than 100 books and articles on rural development published during 1976-77 are annotated in this selective bibliography. Concentrating on social science literature, the bibliography is interdisciplinary in nature, spanning agricultural economics, anthropology, community development, community health, and rural sociology. Types of works…

  4. Solving the Problem: Genome Annotation Standards before the Data Deluge.

    PubMed

    Klimke, William; O'Donovan, Claire; White, Owen; Brister, J Rodney; Clark, Karen; Fedorov, Boris; Mizrachi, Ilene; Pruitt, Kim D; Tatusova, Tatiana

    2011-10-15

    The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries.

  5. The caBIG annotation and image Markup project.

    PubMed

    Channin, David S; Mongkolwat, Pattanasak; Kleper, Vladimir; Sepukar, Kastubh; Rubin, Daniel L

    2010-04-01

    Image annotation and markup are at the core of medical interpretation in both the clinical and the research setting. Digital medical images are managed with the DICOM standard format. While DICOM contains a large amount of meta-data about whom, where, and how the image was acquired, DICOM says little about the content or meaning of the pixel data. An image annotation is the explanatory or descriptive information about the pixel data of an image that is generated by a human or machine observer. An image markup is the graphical symbols placed over the image to depict an annotation. While DICOM is the standard for medical image acquisition, manipulation, transmission, storage, and display, there are no standards for image annotation and markup. Many systems expect annotation to be reported verbally, while markups are stored in graphical overlays or proprietary formats. This makes it difficult to extract and compute with both of them. The goal of the Annotation and Image Markup (AIM) project is to develop a mechanism, for modeling, capturing, and serializing image annotation and markup data that can be adopted as a standard by the medical imaging community. The AIM project produces both human- and machine-readable artifacts. This paper describes the AIM information model, schemas, software libraries, and tools so as to prepare researchers and developers for their use of AIM.

  6. A call for benchmarking transposable element annotation methods.

    PubMed

    Hoen, Douglas R; Hickey, Glenn; Bourque, Guillaume; Casacuberta, Josep; Cordaux, Richard; Feschotte, Cédric; Fiston-Lavier, Anna-Sophie; Hua-Van, Aurélie; Hubley, Robert; Kapusta, Aurélie; Lerat, Emmanuelle; Maumus, Florian; Pollock, David D; Quesneville, Hadi; Smit, Arian; Wheeler, Travis J; Bureau, Thomas E; Blanchette, Mathieu

    2015-01-01

    DNA derived from transposable elements (TEs) constitutes large parts of the genomes of complex eukaryotes, with major impacts not only on genomic research but also on how organisms evolve and function. Although a variety of methods and tools have been developed to detect and annotate TEs, there are as yet no standard benchmarks, that is, no standard way to measure or compare their accuracy. This lack of accuracy assessment calls into question conclusions from a wide range of research that depends explicitly or implicitly on TE annotation. In the absence of standard benchmarks, toolmakers are impeded in improving their tools, annotators cannot properly assess which tools might best suit their needs, and downstream researchers cannot judge how accuracy limitations might impact their studies. We therefore propose that the TE research community create and adopt standard TE annotation benchmarks, and we call for other researchers to join the authors in making this long-overdue effort a success.

  7. Solving the Problem: Genome Annotation Standards before the Data Deluge

    PubMed Central

    Klimke, William; O'Donovan, Claire; White, Owen; Brister, J. Rodney; Clark, Karen; Fedorov, Boris; Mizrachi, Ilene; Pruitt, Kim D.; Tatusova, Tatiana

    2011-01-01

    The promise of genome sequencing was that the vast undiscovered country would be mapped out by comparison of the multitude of sequences available and would aid researchers in deciphering the role of each gene in every organism. Researchers recognize that there is a need for high quality data. However, different annotation procedures, numerous databases, and a diminishing percentage of experimentally determined gene functions have resulted in a spectrum of annotation quality. NCBI in collaboration with sequencing centers, archival databases, and researchers, has developed the first international annotation standards, a fundamental step in ensuring that high quality complete prokaryotic genomes are available as gold standard references. Highlights include the development of annotation assessment tools, community acceptance of protein naming standards, comparison of annotation resources to provide consistent annotation, and improved tracking of the evidence used to generate a particular annotation. The development of a set of minimal standards, including the requirement for annotated complete prokaryotic genomes to contain a full set of ribosomal RNAs, transfer RNAs, and proteins encoding core conserved functions, is an historic milestone. The use of these standards in existing genomes and future submissions will increase the quality of databases, enabling researchers to make accurate biological discoveries. PMID:22180819

  8. WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Putman, Tim E.; Lelong, Sebastien; Burgstaller-Muehlbacher, Sebastian

    With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.

  9. WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata

    DOE PAGES

    Putman, Tim E.; Lelong, Sebastien; Burgstaller-Muehlbacher, Sebastian; ...

    2017-03-06

    With the advancement of genome-sequencing technologies, new genomes are being sequenced daily. Although these sequences are deposited in publicly available data warehouses, their functional and genomic annotations (beyond genes which are predicted automatically) mostly reside in the text of primary publications. Professional curators are hard at work extracting those annotations from the literature for the most studied organisms and depositing them in structured databases. However, the resources don’t exist to fund the comprehensive curation of the thousands of newly sequenced organisms in this manner. Here, we describe WikiGenomes (wikigenomes.org), a web application that facilitates the consumption and curation of genomic data by the entire scientific community. WikiGenomes is based on Wikidata, an openly editable knowledge graph with the goal of aggregating published knowledge into a free and open database. WikiGenomes empowers the individual genomic researcher to contribute their expertise to the curation effort and integrates the knowledge into Wikidata, enabling it to be accessed by anyone without restriction.

  10. Adolescent Fertility: Selected, Annotated Resources for the International Community.

    ERIC Educational Resources Information Center

    Ogden, Celia

    This bibliography on adolescent fertility contains over 300 annotations of articles, audiovisual materials, books, charts, comic books, games, journals, papers, pamphlets, and packets. With a few exceptions entries were published from 1974 through 1978; they are categorized according to geographic section: World, Africa, Asia, Europe, Latin…

  11. The new modern era of yeast genomics: community sequencing and the resulting annotation of multiple Saccharomyces cerevisiae strains at the Saccharomyces Genome Database

    PubMed Central

    Engel, Stacia R.; Cherry, J. Michael

    2013-01-01

    The first completed eukaryotic genome sequence was that of the yeast Saccharomyces cerevisiae, and the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the original model organism database. SGD remains the authoritative community resource for the S. cerevisiae reference genome sequence and its annotation, and continues to provide comprehensive biological information correlated with S. cerevisiae genes and their products. A diverse set of yeast strains have been sequenced to explore commercial and laboratory applications, and a brief history of those strains is provided. The publication of these new genomes has motivated the creation of new tools, and SGD will annotate and provide comparative analyses of these sequences, correlating changes with variations in strain phenotypes and protein function. We are entering a new era at SGD, as we incorporate these new sequences and make them accessible to the scientific community, all in an effort to continue in our mission of educating researchers and facilitating discovery. Database URL: http://www.yeastgenome.org/ PMID:23487186

  12. Semantator: semantic annotator for converting biomedical text to linked data.

    PubMed

    Tao, Cui; Song, Dezhao; Sharma, Deepak; Chute, Christopher G

    2013-10-01

    More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it challenging to easily browse and query the data of interest in them. One approach to facilitate browsing and querying biomedical text is to convert the plain text to a linked web of data, i.e., converting data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be either annotated manually or semi-automatically using plug-in information extraction tools. The annotated results will be stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be directly applied to the annotated data for consistency checking and knowledge inference. Semantator has been released online and was used by the biomedical ontology community, who provided positive feedback. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the annotated results using Semantator can be easily used in Semantic-web-based reasoning tools for further inference. Copyright © 2013 Elsevier Inc. All rights reserved.

  13. dictyBase 2015: Expanding data and annotations in a new software environment.

    PubMed

    Basu, Siddhartha; Fey, Petra; Jimenez-Morales, David; Dodson, Robert J; Chisholm, Rex L

    2015-08-01

    dictyBase is the model organism database for the social amoeba Dictyostelium discoideum and related species. The primary mission of dictyBase is to provide the biomedical research community with well-integrated high quality data, and tools that enable original research. Data presented at dictyBase is obtained from sequencing centers, groups performing high throughput experiments such as large-scale mutagenesis studies, and RNAseq data, as well as a growing number of manually added functional gene annotations from the published literature, including Gene Ontology, strain, and phenotype annotations. Through the Dicty Stock Center we provide the community with an impressive number of annotated strains and plasmids. Recently, dictyBase accomplished a major overhaul to adapt an outdated infrastructure to the current technological advances, thus facilitating the implementation of innovative tools and comparative genomics. It also provides new strategies for high quality annotations that enable bench researchers to benefit from the rapidly increasing volume of available data. dictyBase is highly responsive to its users' needs, building a successful relationship that capitalizes on the vast efforts of the Dictyostelium research community. dictyBase has become the trusted data resource for Dictyostelium investigators, other investigators or organizations seeking information about Dictyostelium, as well as educators who use this model system. © 2015 Wiley Periodicals, Inc.

  14. Bridging the phenotypic and genetic data useful for integrated breeding through a data annotation using the Crop Ontology developed by the crop communities of practice

    PubMed Central

    Shrestha, Rosemary; Matteis, Luca; Skofic, Milko; Portugal, Arllet; McLaren, Graham; Hyman, Glenn; Arnaud, Elizabeth

    2012-01-01

    The Crop Ontology (CO) of the Generation Challenge Program (GCP) (http://cropontology.org/) is developed for the Integrated Breeding Platform (IBP) (http://www.integratedbreeding.net/) by several centers of The Consultative Group on International Agricultural Research (CGIAR): Bioversity, CIMMYT, CIP, ICRISAT, IITA, and IRRI. Integrated breeding necessitates that breeders access genotypic and phenotypic data related to a given trait. The CO provides validated trait names used by the crop communities of practice (CoP) for harmonizing the annotation of phenotypic and genotypic data and thus supporting data accessibility and discovery through web queries. The trait information is completed by the description of the measurement methods and scales, and images. The trait dictionaries used to produce the Integrated Breeding (IB) fieldbooks are synchronized with the CO terms for an automatic annotation of the phenotypic data measured in the field. The IB fieldbook provides breeders with direct access to the CO to get additional descriptive information on the traits. Ontologies and trait dictionaries are online for cassava, chickpea, common bean, groundnut, maize, Musa, potato, rice, sorghum, and wheat. Online curation and annotation tools (http://cropontology.org) facilitate direct maintenance of the trait information and production of trait dictionaries by the crop communities. An important feature is the cross referencing of CO terms with the Crop database trait ID and with their synonyms in Plant Ontology (PO) and Trait Ontology (TO). Web links between cross referenced terms in CO provide online access to data annotated with similar ontological terms, particularly the genetic data in Gramene (University of Cornell) or the evaluation and climatic data in the Global Repository of evaluation trials of the Climate Change, Agriculture and Food Security programme (CCAFS). Cross-referencing and annotation will be further applied in the IBP. PMID:22934074

  15. ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species.

    PubMed

    Zeng, Victor; Extavour, Cassandra G

    2012-01-01

    The increased throughput and decreased cost of next-generation sequencing (NGS) have shifted the bottleneck in genomic research from sequencing to annotation, analysis and accessibility. This is particularly challenging for research communities working on organisms that lack the basic infrastructure of a sequenced genome, or an efficient way to utilize whatever sequence data may be available. Here we present a new database, the Assembled Searchable Giant Arthropod Read Database (ASGARD). This database is a repository and search engine for transcriptomic data from arthropods that are of high interest to multiple research communities but currently lack sequenced genomes. We demonstrate the functionality and utility of ASGARD using de novo assembled transcriptomes from the milkweed bug Oncopeltus fasciatus, the cricket Gryllus bimaculatus and the amphipod crustacean Parhyale hawaiensis. We have annotated these transcriptomes to assign putative orthology, coding region determination, protein domain identification and Gene Ontology (GO) term annotation to all possible assembly products. ASGARD allows users to search all assemblies by orthology annotation, GO term annotation or Basic Local Alignment Search Tool. User-friendly features of ASGARD include search term auto-completion suggestions based on database content, the ability to download assembly product sequences in FASTA format, direct links to NCBI data for predicted orthologs and graphical representation of the location of protein domains and matches to similar sequences from the NCBI non-redundant database. ASGARD will be a useful repository for transcriptome data from future NGS studies on these and other emerging model arthropods, regardless of sequencing platform, assembly or annotation status. This database thus provides easy, one-stop access to multi-species annotated transcriptome information. We anticipate that this database will be useful for members of multiple research communities, including developmental

  16. Ecological effects of the harvest phase of geoduck clam (Panopea generosa Gould, 1850) aquaculture on infaunal communities in southern Puget Sound, Washington USA.

    USGS Publications Warehouse

    VanBlaricom, Glenn R.; Eccles, Jennifer L.; Olden, Julian D.; McDonald, P. Sean

    2015-01-01

    Intertidal aquaculture for geoducks (Panopea generosa Gould, 1850) is expanding in southern Puget Sound, Washington, where gently sloping sandy beaches are used for field culture. Geoduck aquaculture contributes significantly to the regional economy, but has become controversial because of a range of unresolved questions involving potential biological impacts on marine ecosystems. From 2008 through 2012, the authors used a “before-after-control-impact” experimental design, emphasizing spatial scales comparable with those used by geoduck culturists to evaluate the effects of harvesting market-ready geoducks on associated benthic infaunal communities. Infauna were sampled at three different study locations in southern Puget Sound at monthly intervals before, during, and after harvests of clams, and along extralimital transects extending away from the edges of cultured plots to assess the effects of harvest activities in adjacent uncultured habitat. Using multivariate statistical approaches, strong seasonal and spatial signals in patterns of abundance were found, but there was scant evidence of effects on the community structure associated with geoduck harvest disturbances within cultured plots. Likewise, no indications of significant “spillover” effects of harvest on uncultured habitat adjacent to cultured plots were noted. Complementary univariate approaches revealed little evidence of harvest effects on infaunal biodiversity and indications of modest effects on populations of individual infaunal taxa. Of 10 common taxa analyzed, only three showed evidence of reduced densities, although minor, after harvests whereas the remaining seven taxa indicated either neutral responses to harvest disturbances or increased abundance either during or in the months after harvest events. It is suggested that a relatively active natural disturbance regime, including both small-scale and large-scale events that occur with comparable intensity but more frequently than

  17. Response of Vascular Plant Communities to Harvest in Southern Appalachian Mixed-Oak Forests: Two-Year Results

    Treesearch

    Bryan W. Wender; Sharon M. Hood; David W. Smith; Shepard M. Zedaker; David L. Loftis

    1999-01-01

    A long-term study has been established to monitor the effects of seven silvicultural prescriptions on vascular flora community attributes. Treatments include a control, understory vegetation control, group selection, two levels of shelterwoods, leave-tree, and clearcut. Second-growing-season post-treatment results are compared to pre-harvest values for residual...

  18. Semi-automatic semantic annotation of PubMed Queries: a study on quality, efficiency, satisfaction

    PubMed Central

    Névéol, Aurélie; Islamaj-Doğan, Rezarta; Lu, Zhiyong

    2010-01-01

    Information processing algorithms require significant amounts of annotated data for training and testing. The availability of such data is often hindered by the complexity and high cost of production. In this paper, we investigate the benefits of a state-of-the-art tool to help with the semantic annotation of a large set of biomedical information queries. Seven annotators were recruited to annotate a set of 10,000 PubMed® queries with 16 biomedical and bibliographic categories. About half of the queries were annotated from scratch, while the other half were automatically pre-annotated and manually corrected. The impact of the automatic pre-annotations was assessed on several aspects of the task: time, number of actions, annotator satisfaction, inter-annotator agreement, quality and number of the resulting annotations. The analysis of annotation results showed that the number of required hand annotations is 28.9% less when using pre-annotated results from automatic tools. As a result, the overall annotation time was substantially lower when pre-annotations were used, while inter-annotator agreement was significantly higher. In addition, there was no statistically significant difference in the semantic distribution or number of annotations produced when pre-annotations were used. The annotated query corpus is freely available to the research community. This study shows that automatic pre-annotations are found helpful by most annotators. Our experience suggests using an automatic tool to assist large-scale manual annotation projects. This helps speed up annotation and improve annotation consistency while maintaining high quality of the final annotations. PMID:21094696
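
    The abstract above reports inter-annotator agreement but does not say which statistic was used. As a minimal sketch, assuming Cohen's kappa (one common choice for two annotators, not necessarily the authors' measure), agreement on a shared set of queries could be computed as below. The function name and the category labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items.

    Assumption: one categorical label per item and per annotator.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, from each annotator's own label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for four queries (not taken from the study).
annotator_1 = ["gene", "disease", "gene", "author"]
annotator_2 = ["gene", "disease", "author", "author"]
print(round(cohens_kappa(annotator_1, annotator_2), 3))
```

    Kappa corrects raw agreement for the agreement expected by chance, which matters when a few categories dominate the label distribution.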

  19. Nutrient uptake and community metabolism in streams draining harvested and old-growth watersheds: A preliminary assessment

    Treesearch

    Brian H. Hill; Frank H. McCormick

    2004-01-01

    The effect of timber harvesting on streams is assessed using two measures of ecosystem function: nutrient spiraling and community metabolism. This research is being conducted in streams of the southern Appalachian Mountains of North Carolina, the Ouachita Mountains of Arkansas, the Cascade Mountains of Oregon, and the redwood forests of northern California, in order to...

  20. Annotated Bibliography for Adult Educators in Institutional Settings.

    ERIC Educational Resources Information Center

    Elwyn Inst., PA.

    This annotated bibliography of instructional materials for adult educators in institutional settings lists materials available in fourteen areas: basic skills, citizenship education, community services, consumer education, health and safety, mathematics, meal planning, money management, personal information/general life skills, pre-employment…

  1. Gene Ontology annotations at SGD: new data sources and annotation methods

    PubMed Central

    Hong, Eurie L.; Balakrishnan, Rama; Dong, Qing; Christie, Karen R.; Park, Julie; Binkley, Gail; Costanzo, Maria C.; Dwight, Selina S.; Engel, Stacia R.; Fisk, Dianna G.; Hirschman, Jodi E.; Hitz, Benjamin C.; Krieger, Cynthia J.; Livstone, Michael S.; Miyasato, Stuart R.; Nash, Robert S.; Oughtred, Rose; Skrzypek, Marek S.; Weng, Shuai; Wong, Edith D.; Zhu, Kathy K.; Dolinski, Kara; Botstein, David; Cherry, J. Michael

    2008-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) collects and organizes biological information about the chromosomal features and gene products of the budding yeast Saccharomyces cerevisiae. Although published data from traditional experimental methods are the primary sources of evidence supporting Gene Ontology (GO) annotations for a gene product, high-throughput experiments and computational predictions can also provide valuable insights in the absence of an extensive body of literature. Therefore, GO annotations available at SGD now include high-throughput data as well as computational predictions provided by the GO Annotation Project (GOA UniProt; http://www.ebi.ac.uk/GOA/). Because the annotation method used to assign GO annotations varies by data source, GO resources at SGD have been modified to distinguish data sources and annotation methods. In addition to providing information for genes that have not been experimentally characterized, GO annotations from independent sources can be compared to those made by SGD to help keep the literature-based GO annotations current. PMID:17982175

  2. MEGAnnotator: a user-friendly pipeline for microbial genomes assembly and annotation.

    PubMed

    Lugli, Gabriele Andrea; Milani, Christian; Mancabelli, Leonardo; van Sinderen, Douwe; Ventura, Marco

    2016-04-01

    Genome annotation is one of the key actions that must be undertaken in order to decipher the genetic blueprint of organisms. Thus, a correct and reliable annotation is essential in rendering genomic data valuable. Here, we describe a bioinformatics pipeline based on freely available software programs coordinated by a multithreaded script named MEGAnnotator (Multithreaded Enhanced prokaryotic Genome Annotator). This pipeline allows the generation of multiple annotated formats fulfilling the NCBI guidelines for assembled microbial genome submission, based on DNA shotgun sequencing reads, and minimizes manual intervention, while also reducing waiting times between software program executions and improving final quality of both assembly and annotation outputs. MEGAnnotator provides an efficient way to pre-arrange the assembly and annotation work required to process NGS genome sequence data. The script improves the final quality of microbial genome annotation by reducing ambiguous annotations. Moreover, the MEGAnnotator platform allows the user to perform a partial annotation of pre-assembled genomes and includes an option to accomplish metagenomic data set assemblies. The MEGAnnotator platform will be useful for microbiologists interested in genome analyses of bacteria as well as those investigating the complexity of microbial communities who do not possess the necessary skills to prepare their own bioinformatics pipeline. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. EGASP: the human ENCODE Genome Annotation Assessment Project

    PubMed Central

    Guigó, Roderic; Flicek, Paul; Abril, Josep F; Reymond, Alexandre; Lagarde, Julien; Denoeud, France; Antonarakis, Stylianos; Ashburner, Michael; Bajic, Vladimir B; Birney, Ewan; Castelo, Robert; Eyras, Eduardo; Ucla, Catherine; Gingeras, Thomas R; Harrow, Jennifer; Hubbard, Tim; Lewis, Suzanna E; Reese, Martin G

    2006-01-01

    Background: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified. Conclusion: This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence. PMID:16925836
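
    The nucleotide-level sensitivity and specificity mentioned above can be illustrated with a small sketch. It assumes the convention commonly used in gene-prediction evaluation, where sensitivity is the fraction of annotated coding nucleotides that are predicted and "specificity" is the fraction of predicted coding nucleotides that are annotated (what other fields call precision). The abstract does not spell out its definitions, so this is an assumption; the interval format (half-open start/end pairs on one strand) and the function names are likewise hypothetical.

```python
def covered_positions(intervals):
    """Expand half-open (start, end) intervals into a set of positions."""
    positions = set()
    for start, end in intervals:
        positions.update(range(start, end))
    return positions


def nucleotide_accuracy(annotated, predicted):
    """Return (sensitivity, specificity) at the coding-nucleotide level.

    Assumed definitions:
      sensitivity = TP / annotated coding nucleotides
      specificity = TP / predicted coding nucleotides
    where TP counts nucleotides that are coding in both sets.
    """
    ann = covered_positions(annotated)
    pred = covered_positions(predicted)
    true_positives = len(ann & pred)
    sensitivity = true_positives / len(ann) if ann else 0.0
    specificity = true_positives / len(pred) if pred else 0.0
    return sensitivity, specificity


# Hypothetical coordinates, not taken from the ENCODE regions.
reference_exons = [(100, 200), (300, 400)]
predicted_exons = [(150, 200), (300, 400)]
print(nucleotide_accuracy(reference_exons, predicted_exons))
```

    Expanding intervals into position sets is simple but memory-hungry; for real genome-scale data an interval-overlap computation would be the more practical design.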

  4. Liming in the sugarcane burnt system and the green harvest practice affect soil bacterial community in northeastern São Paulo, Brazil.

    PubMed

    Val-Moraes, Silvana Pompeia; de Macedo, Helena Suleiman; Kishi, Luciano Takeshi; Pereira, Rodrigo Matheus; Navarrete, Acacio Aparecido; Mendes, Lucas William; de Figueiredo, Eduardo Barretto; La Scala, Newton; Tsai, Siu Mui; de Macedo Lemos, Eliana Gertrudes; Alves, Lúcia Maria Carareto

    2016-12-01

    Here we show that both liming the burnt sugarcane and the green harvest practice alter bacterial community structure, diversity and composition in sugarcane fields in northeastern São Paulo state, Brazil. Terminal restriction fragment length polymorphism fingerprinting and 16S rRNA gene cloning and sequencing were used to analyze changes in soil bacterial communities. The field experiment consisted of sugarcane-cultivated soils under different regimes: green sugarcane (GS), burnt sugarcane (BS), BS in soil amended with lime applied to increase soil pH (BSL), and native forest (NF) as control soil. The bacterial community structures revealed disparate patterns in sugarcane-cultivated soils and forest soil (R = 0.786, P = 0.002), and overlapping patterns were shown for the bacterial community structure among the different management regimes applied to sugarcane (R = 0.194, P = 0.002). The numbers of operational taxonomic units (OTUs) found in the libraries were 117, 185, 173 and 166 for NF, BS, BSL and GS, respectively. Sugarcane-cultivated soils revealed higher bacterial diversity than NF soil, with BS soil accounting for a higher richness of unique OTUs (101 unique OTUs) than NF soil (23 unique OTUs). Cluster analysis based on OTUs revealed similar bacterial communities in NF and GS soils, while the bacterial community from BS soil was most distinct from the others. Acidobacteria and Alphaproteobacteria were the most abundant bacterial phyla across the different soils with Acidobacteria Gp1 accounting for a higher abundance in NF and GS soils than burnt sugarcane-cultivated soils (BS and BSL). In turn, Acidobacteria Gp4 abundance was higher in BS soils than in other soils. These differential responses in soil bacterial community structure, diversity and composition can be associated with the agricultural management, mainly liming practices, and harvest methods in the sugarcane-cultivated soils, and they can be detected shortly after harvest.

  5. High-performance web services for querying gene and variant annotation.

    PubMed

    Xin, Jiwen; Mark, Adam; Afrasiabi, Cyrus; Tsueng, Ginger; Juchler, Moritz; Gopal, Nikhil; Stupp, Gregory S; Putman, Timothy E; Ainscough, Benjamin J; Griffith, Obi L; Torkamani, Ali; Whetzel, Patricia L; Mungall, Christopher J; Mooney, Sean D; Su, Andrew I; Wu, Chunlei

    2016-05-06

    Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times per month. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info. Both are offered free of charge to the research community.

  6. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database.

    PubMed

    Winsor, Geoffrey L; Griffiths, Emma J; Lo, Raymond; Dhillon, Bhavjinder K; Shay, Julie A; Brinkman, Fiona S L

    2016-01-04

    The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Response of the soil microbial community and soil nutrient bioavailability to biomass harvesting and reserve tree retention in northern Minnesota aspen-dominated forests

    Treesearch

    Tera E. Lewandowski; Jodi A. Forrester; David J. Mladenoff; Anthony W. D'Amato; Brian J. Palik

    2016-01-01

    Intensive forest biomass harvesting, or the removal of harvesting slash (woody debris from tree branches and tops) for use as biofuel, has the potential to negatively affect the soil microbial community (SMC) due to loss of carbon and nutrient inputs from the slash, alteration of the soil microclimate, and increased nutrient leaching. These effects could result in...

  8. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~10k genes. ~12 percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~9 percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also, >90 percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. We conclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  9. AnnotCompute: annotation-based exploration and meta-analysis of genomics experiments

    PubMed Central

    Zheng, Jie; Stoyanovich, Julia; Manduchi, Elisabetta; Liu, Junmin; Stoeckert, Christian J.

    2011-01-01

    The ever-increasing scale of biological data sets, particularly those arising in the context of high-throughput technologies, requires the development of rich data exploration tools. In this article, we present AnnotCompute, an information discovery platform for repositories of functional genomics experiments such as ArrayExpress. Our system leverages semantic annotations of functional genomics experiments with controlled vocabulary and ontology terms, such as those from the MGED Ontology, to compute conceptual dissimilarities between pairs of experiments. These dissimilarities are then used to support two types of exploratory analysis—clustering and query-by-example. We show that our proposed dissimilarity measures correspond to a user's intuition about conceptual dissimilarity, and can be used to support effective query-by-example. We also evaluate the quality of clustering based on these measures. While AnnotCompute can support a richer data exploration experience, its effectiveness is limited in some cases, due to the quality of available annotations. Nonetheless, tools such as AnnotCompute may provide an incentive for richer annotations of experiments. Code is available for download at http://www.cbil.upenn.edu/downloads/AnnotCompute. Database URL: http://www.cbil.upenn.edu/annotCompute/ PMID:22190598

  10. Marky: a tool supporting annotation consistency in multi-user and iterative document annotation projects.

    PubMed

    Pérez-Pérez, Martín; Glez-Peña, Daniel; Fdez-Riverola, Florentino; Lourenço, Anália

    2015-02-01

    Document annotation is a key task in the development of Text Mining methods and applications. High quality annotated corpora are invaluable, but their preparation requires a considerable amount of resources and time. Although the existing annotation tools offer good user interaction interfaces to domain experts, project management and quality control abilities are still limited. Therefore, the current work introduces Marky, a new Web-based document annotation tool equipped to manage multi-user and iterative projects, and to evaluate annotation quality throughout the project life cycle. At the core, Marky is a Web application based on the open source CakePHP framework. User interface relies on HTML5 and CSS3 technologies. Rangy library assists in browser-independent implementation of common DOM range and selection tasks, and Ajax and JQuery technologies are used to enhance user-system interaction. Marky grants solid management of inter- and intra-annotator work. Most notably, its annotation tracking system supports systematic and on-demand agreement analysis and annotation amendment. Each annotator may work over documents as usual, but all the annotations made are saved by the tracking system and may be further compared. So, the project administrator is able to evaluate annotation consistency among annotators and across rounds of annotation, while annotators are able to reject or amend subsets of annotations made in previous rounds. As a side effect, the tracking system minimises resource and time consumption. Marky is a novel environment for managing multi-user and iterative document annotation projects. Compared to other tools, Marky offers a similar visually intuitive annotation experience while providing unique means to minimise annotation effort and enforce annotation quality, and therefore corpus consistency. Marky is freely available for non-commercial use at http://sing.ei.uvigo.es/marky. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  11. Early changes in arbuscular mycorrhiza development in sugarcane under two harvest management systems.

    PubMed

    de Azevedo, Lucas Carvalho Basilio; Stürmer, Sidney Luiz; Lambais, Marcio Rodrigues

    2014-01-01

    Sugarcane (Saccharum spp.) is grown on over 8 million ha in Brazil and is used to produce ethanol and sugar. Some sugarcane fields are burned to facilitate harvesting, which can affect the soil microbial community. However, whether sugarcane pre-harvest burning affects the community of arbuscular mycorrhizal fungi (AMF) and symbioses development is not known. In this study, we investigated the early impacts of harvest management on AMF spore communities and root colonization in three sugarcane varieties, under two harvest management systems (no-burning and pre-harvest burning). Soil and root samples were collected in the field after the first harvest of sugarcane varieties SP813250, SP801842, and RB72454, and AMF species were identified based on spore morphology. Diversity indices were determined based on spore populations and root colonization determined as an indicator of symbioses development. Based on the diversity indices, spore number and species occurrence in soil, no significant differences were observed among the AMF communities, regardless of harvest management type, sugarcane variety or interactions between harvest management type and sugarcane variety. However, mycorrhiza development was stimulated in sugarcane under the no-burning management system. Our data suggest that the sugarcane harvest management system may cause early changes in arbuscular mycorrhiza development.

  12. Early changes in arbuscular mycorrhiza development in sugarcane under two harvest management systems

    PubMed Central

    de Azevedo, Lucas Carvalho Basilio; Stürmer, Sidney Luiz; Lambais, Marcio Rodrigues

    2014-01-01

    Sugarcane (Saccharum spp.) is grown on over 8 million ha in Brazil and is used to produce ethanol and sugar. Some sugarcane fields are burned to facilitate harvesting, which can affect the soil microbial community. However, whether sugarcane pre-harvest burning affects the community of arbuscular mycorrhizal fungi (AMF) and symbioses development is not known. In this study, we investigated the early impacts of harvest management on AMF spore communities and root colonization in three sugarcane varieties, under two harvest management systems (no-burning and pre-harvest burning). Soil and root samples were collected in the field after the first harvest of sugarcane varieties SP813250, SP801842, and RB72454, and AMF species were identified based on spore morphology. Diversity indices were determined based on spore populations and root colonization determined as an indicator of symbioses development. Based on the diversity indices, spore number and species occurrence in soil, no significant differences were observed among the AMF communities, regardless of harvest management type, sugarcane variety or interactions between harvest management type and sugarcane variety. However, mycorrhiza development was stimulated in sugarcane under the no-burning management system. Our data suggest that the sugarcane harvest management system may cause early changes in arbuscular mycorrhiza development. PMID:25477936

  13. EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation.

    PubMed

    Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra; Pereira, Emiliano; Schnetzer, Julia; Arvanitidis, Christos; Jensen, Lars Juhl

    2016-01-01

    The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15-25% and helps curators to detect terms that would otherwise have been missed. Database URL: https://extract.hcmr.gr/. © The Author(s) 2016. Published by Oxford University Press.

  14. EXTRACT: Interactive extraction of environment metadata and term suggestion for metagenomic sample annotation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra

    The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Here the comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed.

  15. EXTRACT: Interactive extraction of environment metadata and term suggestion for metagenomic sample annotation

    DOE PAGES

    Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra; ...

    2016-01-01

    The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Here the comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed.

  16. RysannMD: A biomedical semantic annotator balancing speed and accuracy.

    PubMed

    Cuzzola, John; Jovanović, Jelena; Bagheri, Ebrahim

    2017-07-01

    Recently, both researchers and practitioners have explored the possibility of semantically annotating large and continuously evolving collections of biomedical texts such as research papers, medical reports, and physician notes in order to enable their efficient and effective management and use in clinical practice or research laboratories. Such annotations can be automatically generated by biomedical semantic annotators, tools that are specifically designed for detecting and disambiguating biomedical concepts mentioned in text. The biomedical community has already presented several solid automated semantic annotators. However, the existing tools are either strong in their disambiguation capacity, i.e., the ability to identify the correct biomedical concept for a given piece of text among several candidate concepts, or they excel in their processing time, i.e., work very efficiently, but none of the semantic annotation tools reported in the literature has both of these qualities. In this paper, we present RysannMD (Ryerson Semantic Annotator for Medical Domain), a biomedical semantic annotation tool that strikes a balance between processing time and performance while disambiguating biomedical terms. In other words, RysannMD provides reasonable disambiguation performance when choosing the right sense for a biomedical term in a given context, and does that in a reasonable time. To examine how RysannMD stands with respect to the state of the art biomedical semantic annotators, we have conducted a series of experiments using standard benchmarking corpora, including both gold and silver standards, and four modern biomedical semantic annotators, namely cTAKES, MetaMap, NOBLE Coder, and Neji. The annotators were compared with respect to the quality of the produced annotations measured against gold and silver standards using precision, recall, and F1 measure and speed, i.e., processing time. In the experiments, RysannMD achieved the best median F1 measure across the

  17. Sma3s: a three-step modular annotator for large sequence datasets.

    PubMed

    Muñoz-Mérida, Antonio; Viguera, Enrique; Claros, M Gonzalo; Trelles, Oswaldo; Pérez-Pulido, Antonio J

    2014-08-01

    Automatic sequence annotation is an essential component of modern 'omics' studies, which aim to extract information from large collections of sequence data. Most existing tools use sequence homology to establish evolutionary relationships and assign putative functions to sequences. However, it can be difficult to define a similarity threshold that achieves sufficient coverage without sacrificing annotation quality. Defining the correct configuration is critical and can be challenging for non-specialist users. Thus, the development of robust automatic annotation techniques that generate high-quality annotations without needing expert knowledge would be very valuable for the research community. We present Sma3s, a tool for automatically annotating very large collections of biological sequences from any kind of gene library or genome. Sma3s is composed of three modules that progressively annotate query sequences using either: (i) very similar homologues, (ii) orthologous sequences or (iii) terms enriched in groups of homologous sequences. We trained the system using several random sets of known sequences, demonstrating average sensitivity and specificity values of ~85%. In conclusion, Sma3s is a versatile tool for high-throughput annotation of a wide variety of sequence datasets that outperforms the accuracy of other well-established annotation algorithms, and it can enrich existing database annotations and uncover previously hidden features. Importantly, Sma3s has already been used in the functional annotation of two published transcriptomes. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
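    The sensitivity and specificity figures quoted in the Sma3s abstract above are standard confusion-matrix ratios. The sketch below is a minimal illustration of that arithmetic only; it is not code from Sma3s, the function name is hypothetical, and the counts are invented purely to show the calculation.

```python
def sensitivity_specificity(tp, fp, tn, fn):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    sensitivity = TP / (TP + FN): fraction of true annotations recovered.
    specificity = TN / (TN + FP): fraction of wrong annotations rejected.
    """
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Hypothetical counts, chosen only to illustrate the arithmetic;
# they are not taken from the Sma3s evaluation.
sens, spec = sensitivity_specificity(tp=85, fp=15, tn=85, fn=15)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

    The zero-denominator guards are a design choice for the sketch: an empty class yields 0.0 rather than raising an error.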

  18. Annotating images by mining image search results.

    PubMed

    Wang, Xin-Jing; Zhang, Lei; Li, Xirong; Ma, Wei-Ying

    2008-11-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged: one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and-conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

  19. Protein Information Resource: a community resource for expert annotation of protein data

    PubMed Central

    Barker, Winona C.; Garavelli, John S.; Hou, Zhenglin; Huang, Hongzhan; Ledley, Robert S.; McGarvey, Peter B.; Mewes, Hans-Werner; Orcutt, Bruce C.; Pfeiffer, Friedhelm; Tsugita, Akira; Vinayaka, C. R.; Xiao, Chunlin; Yeh, Lai-Su L.; Wu, Cathy

    2001-01-01

    The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-International databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP. PMID:11125041

  20. Enhanced functionalities for annotating and indexing clinical text with the NCBO Annotator.

    PubMed

    Tchechmedjiev, Andon; Abdaoui, Amine; Emonet, Vincent; Melzi, Soumia; Jonnagaddala, Jitendra; Jonquet, Clement

    2018-06-01

    Second use of clinical data commonly involves annotating biomedical text with terminologies and ontologies. The National Center for Biomedical Ontology Annotator is a frequently used annotation service, originally designed for biomedical data, but not very suitable for clinical text annotation. In order to add new functionalities to the NCBO Annotator without hosting or modifying the original Web service, we have designed a proxy architecture that enables seamless extensions by pre-processing of the input text and parameters, and post-processing of the annotations. We have then implemented enhanced functionalities for annotating and indexing free text such as: scoring, detection of context (negation, experiencer, temporality), new output formats and coarse-grained concept recognition (with UMLS Semantic Groups). In this paper, we present the NCBO Annotator+, a Web service which incorporates these new functionalities as well as a small set of evaluation results for concept recognition and clinical context detection on two standard evaluation tasks (Clef eHealth 2017, SemEval 2014). The Annotator+ has been successfully integrated into the SIFR BioPortal platform (an implementation of NCBO BioPortal for French biomedical terminologies and ontologies) to annotate English text. A Web user interface is available for testing and ontology selection (http://bioportal.lirmm.fr/ncbo_annotatorplus); however the Annotator+ is meant to be used through the Web service application programming interface (http://services.bioportal.lirmm.fr/ncbo_annotatorplus). The code is openly available, and we also provide a Docker packaging to enable easy local deployment to process sensitive (e.g. clinical) data in-house (https://github.com/sifrproject). Contact: andon.tchechmedjiev@lirmm.fr. Supplementary data are available at Bioinformatics online.

  1. An open annotation ontology for science on web 3.0.

    PubMed

    Ciccarese, Paolo; Ocana, Marco; Garcia Castro, Leyla Jael; Das, Sudeshna; Clark, Tim

    2011-05-17

    There is currently a gap between the rich and expressive collection of published biomedical ontologies, and the natural language expression of biomedical papers consumed on a daily basis by scientific researchers. The purpose of this paper is to provide an open, shareable structure for dynamic integration of biomedical domain ontologies with the scientific document, in the form of an Annotation Ontology (AO), thus closing this gap and enabling application of formal biomedical ontologies directly to the literature as it emerges. Initial requirements for AO were elicited by analysis of integration needs between biomedical web communities, and of needs for representing and integrating results of biomedical text mining. Analysis of strengths and weaknesses of previous efforts in this area was also performed. A series of increasingly refined annotation tools were then developed along with a metadata model in OWL, and deployed for feedback and additional requirements the ontology to users at a major pharmaceutical company and a major academic center. Further requirements and critiques of the model were also elicited through discussions with many colleagues and incorporated into the work. This paper presents Annotation Ontology (AO), an open ontology in OWL-DL for annotating scientific documents on the web. AO supports both human and algorithmic content annotation. It enables "stand-off" or independent metadata anchored to specific positions in a web document by any one of several methods. In AO, the document may be annotated but is not required to be under update control of the annotator. AO contains a provenance model to support versioning, and a set model for specifying groups and containers of annotation. AO is freely available under open source license at http://purl.org/ao/, and extensive documentation including screencasts is available on AO's Google Code page: http://code.google.com/p/annotation-ontology/ . The Annotation Ontology meets critical requirements for

  2. The Persistence of Subsistence: Wild Food Harvests in Rural Alaska, 1983-2013

    NASA Astrophysics Data System (ADS)

    Magdanz, J.; Greenberg, J.; Little, J.; Koster, D.

    2016-12-01

    Many Alaskans depend on family-centered harvests of wild fish, wildlife, and plants in what could be considered a home production model. State and federal laws provide priorities for these "subsistence uses," a divisive political issue in Alaska. We explore Alaska's subsistence economies using community-level demographic, economic, and subsistence harvest estimates from more than 18,000 household surveys administered during 354 projects in 179 Alaska communities from 1983 to 2013. Neither mean subsistence harvests nor mean incomes are significantly associated with time alone. But harvests are associated with time in multiple regression models that explain more than 60% of the variation in mean subsistence harvests per person at the community level. Propensity score matching finds that roads have significant, strong, and negative effects on subsistence harvests, but no significant effects on incomes. Results suggest that - given sustainably managed renewable resources and appropriate levels of exclusion - subsistence economies can co-exist with market economies.

  3. Annotating ebony on the fly.

    PubMed

    Kohn, Michael H; Wittkopp, Patricia J

    2007-07-01

    The distinctive black phenotype of ebony mutants has made it one of the most widely used phenotypic markers in Drosophila genetics. Without doubt, ebony showcases the fruits of the fly community's labours to annotate gene function. As of this writing, FlyBase lists 142 references, 1277 fly stocks, 15 phenotypes and 44 alleles. In addition to its namesake pigmentation phenotype, ebony mutants affect other traits, including phototaxis and courtship. With phenotypic consequences of ebony variants readily apparent in the laboratory, does natural selection also see them in the wild? In this issue of Molecular Ecology, Pool & Aquadro investigated this question and found signs of natural selection on the ebony gene that appear to have resulted from selection for darker pigmentation at higher elevations in sub-Saharan populations of Drosophila melanogaster. Such findings from population genomic analysis of wild-derived strains should be included in gene annotations to provide a more holistic view of a gene's function. The evolutionary annotation of ebony added by Pool & Aquadro substantiates that pigmentation can be adaptive and implicates elevation as an important selective factor. This is important progress because the selective factors seem to differ between populations and species. In addition, the study raises issues to consider when extrapolating from selection at the molecular level to selection at the phenotypic level.

  4. Assessment of streamside management zones for conserving benthic macroinvertebrate communities following timber harvest in eastern Kentucky headwater catchments

    Treesearch

    Joshua Adkins; Christopher Barton; Scott Grubbs; Jeffrey Stringer; Randy Kolka

    2016-01-01

    Headwater streams generally comprise the majority of stream area in a watershed and can have a strong influence on downstream food webs. Our objective was to determine the effect of altering streamside management zone (SMZ) configurations on headwater aquatic insect communities. Timber harvests were implemented within six watersheds in eastern Kentucky. The SMZ...

  5. Social-Ecological Soundscapes: Examining Aircraft-Harvester-Caribou Conflict in Arctic Alaska

    NASA Astrophysics Data System (ADS)

    Stinchcomb, Taylor R.

    As human development expands across the Arctic, it is crucial to carefully assess the impacts to remote natural ecosystems and to indigenous communities that rely on wild resources for nutritional and cultural wellbeing. Because indigenous communities and wildlife populations are interdependent, assessing how human activities impact traditional harvest practices can advance our understanding of the human dimensions of wildlife management. Indigenous communities across Arctic Alaska have expressed concern over the last four decades that low-flying aircraft interfere with their traditional harvest practices. For example, communities often have testified that aircraft disturb caribou (Rangifer tarandus) and thereby reduce harvest opportunities. Despite this longstanding concern, little research exists on the extent of aircraft activity in Arctic Alaska and on how aircraft affect the behavior and perceptions of harvesters. Therefore, the overarching goal of my research was to highlight the importance of aircraft-harvester conflict in Arctic Alaska and begin to address the issue using a scientific and community-driven approach. In Chapter 1, I demonstrated that conflict between aircraft and indigenous harvesters in Arctic Alaska is a widespread, understudied, and complex issue. By conducting a meta-analysis of the available literature, I quantified the deficiency of scientific knowledge about the impacts of aircraft on rural communities and traditional harvest practices in the Arctic. My results indicated that no peer-reviewed literature has addressed the conflict between low-flying aircraft and traditional harvesters in Arctic Alaska. I speculated that the scale over which aircraft, rural communities, and wildlife interact limits scientists' ability to determine causal relationships and therefore detracts from their interest in researching the human dimension of this social-ecological system. Innovative research approaches like soundscape ecology could begin to

  6. Evaluating Computational Gene Ontology Annotations.

    PubMed

    Škunca, Nives; Roberts, Richard J; Steffen, Martin

    2017-01-01

    Two avenues to understanding gene function are complementary and often overlapping: experimental work and computational prediction. While experimental annotation generally produces high-quality annotations, it is low throughput. Conversely, computational annotations have broad coverage, but the quality of annotations may be variable, and therefore evaluating the quality of computational annotations is a critical concern. In this chapter, we provide an overview of strategies to evaluate the quality of computational annotations. First, we discuss why evaluating quality in this setting is not trivial. We highlight the various issues that threaten to bias the evaluation of computational annotations, most of which stem from the incompleteness of biological databases. Second, we discuss solutions that address these issues, for example, targeted selection of new experimental annotations and leveraging the existing experimental annotations.

  7. Forest harvesting reduces the soil metagenomic potential for biomass decomposition.

    PubMed

    Cardenas, Erick; Kranabetter, J M; Hope, Graeme; Maas, Kendra R; Hallam, Steven; Mohn, William W

    2015-11-01

    Soil is the key resource that must be managed to ensure sustainable forest productivity. Soil microbial communities mediate numerous essential ecosystem functions, and recent studies show that forest harvesting alters soil community composition. From a long-term soil productivity study site in a temperate coniferous forest in British Columbia, 21 forest soil shotgun metagenomes were generated, totaling 187 Gb. A method to analyze unassembled metagenome reads from the complex community was optimized and validated. The subsequent metagenome analysis revealed that, 12 years after forest harvesting, there were 16% and 8% reductions in relative abundances of biomass decomposition genes in the organic and mineral soil layers, respectively. Organic and mineral soil layers differed markedly in genetic potential for biomass degradation, with the organic layer having greater potential and being more strongly affected by harvesting. Gene families were disproportionately affected, and we identified 41 gene families consistently affected by harvesting, including families involved in lignin, cellulose, hemicellulose and pectin degradation. The results strongly suggest that harvesting profoundly altered below-ground cycling of carbon and other nutrients at this site, with potentially important consequences for forest regeneration. Thus, it is important to determine whether these changes foreshadow long-term changes in forest productivity or resilience and whether these changes are broadly characteristic of harvested forests.

  8. Annotation of UAV surveillance video

    NASA Astrophysics Data System (ADS)

    Howlett, Todd; Robertson, Mark A.; Manthey, Dan; Krol, John

    2004-08-01

    Significant progress toward the development of a video annotation capability is presented in this paper. Research and development of an object tracking algorithm applicable for UAV video is described. Object tracking is necessary for attaching the annotations to the objects of interest. A methodology and format is defined for encoding video annotations using the SMPTE Key-Length-Value encoding standard. This provides the following benefits: a non-destructive annotation, compliance with existing standards, video playback in systems that are not annotation enabled and support for a real-time implementation. A model real-time video annotation system is also presented, at a high level, using the MPEG-2 Transport Stream as the transmission medium. This work was accomplished to meet the Department of Defense's (DoD's) need for a video annotation capability. Current practices for creating annotated products are to capture a still image frame, annotate it using an Electric Light Table application, and then pass the annotated image on as a product. That is not adequate for reporting or downstream cueing. It is too slow and there is a severe loss of information. This paper describes a capability for annotating directly on the video.

  9. Comparison of outcomes of permanently closed and periodically harvested coral reef reserves.

    PubMed

    Bartlett, C Y; Manua, C; Cinner, J; Sutton, S; Jimmy, R; South, R; Nilsson, J; Raina, J

    2009-12-01

    In many areas of the developing world, the establishment of permanent marine reserves is inhibited by cultural norms or socioeconomic pressures. Community conserved areas that are periodically harvested are increasingly being implemented as fisheries management tools, but few researchers have empirically compared them with permanently closed reserves. We used a hierarchical control-impact experimental design to compare the abundance and biomass of reef fishes, invertebrates, and substrate composition in periodically harvested and permanent reserves and in openly fished areas (control sites) of the South Pacific island country of Vanuatu. Fished species had significantly higher biomass in periodically harvested reserves than in adjacent openly fished areas. We did not detect differences in substratum composition between permanent reserves and openly fished areas or between permanent reserves and periodically harvested reserves. Giant clams (tridacnids) and top shells (Trochus niloticus) were vulnerable to periodic harvest, and we suggest that for adequate management of these species, periodically harvested community conservation areas be used in conjunction with other management strategies. Periodic harvest within reserves is an example of adaptive and flexible management that may meet conservation goals and that is suited to the social, economic, and cultural contexts of many coastal communities in the developing world.

  10. An open annotation ontology for science on web 3.0

    PubMed Central

    2011-01-01

    Background: There is currently a gap between the rich and expressive collection of published biomedical ontologies, and the natural language expression of biomedical papers consumed on a daily basis by scientific researchers. The purpose of this paper is to provide an open, shareable structure for dynamic integration of biomedical domain ontologies with the scientific document, in the form of an Annotation Ontology (AO), thus closing this gap and enabling application of formal biomedical ontologies directly to the literature as it emerges. Methods: Initial requirements for AO were elicited by analysis of integration needs between biomedical web communities, and of needs for representing and integrating results of biomedical text mining. Analysis of strengths and weaknesses of previous efforts in this area was also performed. A series of increasingly refined annotation tools were then developed along with a metadata model in OWL, and deployed to users at a major pharmaceutical company and a major academic center for feedback and additional requirements. Further requirements and critiques of the model were also elicited through discussions with many colleagues and incorporated into the work. Results: This paper presents Annotation Ontology (AO), an open ontology in OWL-DL for annotating scientific documents on the web. AO supports both human and algorithmic content annotation. It enables “stand-off” or independent metadata anchored to specific positions in a web document by any one of several methods. In AO, the document may be annotated but is not required to be under update control of the annotator. AO contains a provenance model to support versioning, and a set model for specifying groups and containers of annotation. AO is freely available under open source license at http://purl.org/ao/, and extensive documentation including screencasts is available on AO’s Google Code page: http://code.google.com/p/annotation-ontology/. Conclusions: The

  11. MicroScope: a platform for microbial genome annotation and comparative genomics

    PubMed Central

    Vallenet, D.; Engelen, S.; Mornico, D.; Cruveiller, S.; Fleury, L.; Lajus, A.; Rouy, Z.; Roche, D.; Salvignol, G.; Scarpelli, C.; Médigue, C.

    2009-01-01

    The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope’s rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of

  12. MicroScope: a platform for microbial genome annotation and comparative genomics.

    PubMed

    Vallenet, D; Engelen, S; Mornico, D; Cruveiller, S; Fleury, L; Lajus, A; Rouy, Z; Roche, D; Salvignol, G; Scarpelli, C; Médigue, C

    2009-01-01

    The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope's rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of

  13. Effects of riparian timber harvesting on instream habitat and fish assemblages in northern Minnesota streams

    USGS Publications Warehouse

    Chizinski, Christopher J.; Vondracek, Bruce C.; Blinn, Charles R.; Newman, Raymond M.; Atuke, Dickson M.; Fredricks, Keith; Hemstad, Nathaniel A.; Merten, Eric; Schlesser, Nicholas

    2010-01-01

    Relatively few evaluations of aquatic macroinvertebrate and fish communities have been published in peer-reviewed literature detailing the effect of varying residual basal area (RBA) after timber harvesting in riparian buffers. Our analysis investigated the effects of partial harvesting within riparian buffers on aquatic macroinvertebrate and fish communities in small streams from two experiments in northern Minnesota northern hardwood-aspen forests. Each experiment evaluated partial harvesting within riparian buffers. In both experiments, benthic macroinvertebrates and fish were collected 1 year prior to harvest and in each of 3 years after harvest. We observed interannual variation for the macroinvertebrate abundance, diversity and taxon richness in the single-basin study and abundance and diversity in the multiple-basin study, but few effects related to harvest treatments in either study. However, interannual variation was not evident in the fish communities and we detected no significant changes in the stream fish communities associated with partially harvested riparian buffers in either study. This would suggest that timber harvesting in riparian management zones along reaches ≤200 m in length on both sides of the stream that retains RBA ≥ 12.4 ± 1.3 m² ha⁻¹ or on a single side of the stream that retains RBA ≥ 8.7 ± 1.6 m² ha⁻¹ may be adequate to protect macroinvertebrate and fish communities in our Minnesota study systems given these specific timber harvesting techniques.

  14. Multi-Atlas Segmentation using Partially Annotated Data: Methods and Annotation Strategies.

    PubMed

    Koch, Lisa M; Rajchl, Martin; Bai, Wenjia; Baumgartner, Christian F; Tong, Tong; Passerat-Palmbach, Jonathan; Aljabar, Paul; Rueckert, Daniel

    2017-08-22

    Multi-atlas segmentation is a widely used tool in medical image analysis, providing robust and accurate results by learning from annotated atlas datasets. However, the availability of fully annotated atlas images for training is limited due to the time required for the labelling task. Segmentation methods requiring only a proportion of each atlas image to be labelled could therefore reduce the workload on expert raters tasked with annotating atlas images. To address this issue, we first re-examine the labelling problem common in many existing approaches and formulate its solution in terms of a Markov Random Field energy minimisation problem on a graph connecting atlases and the target image. This provides a unifying framework for multi-atlas segmentation. We then show how modifications in the graph configuration of the proposed framework enable the use of partially annotated atlas images and investigate different partial annotation strategies. The proposed method was evaluated on two Magnetic Resonance Imaging (MRI) datasets for hippocampal and cardiac segmentation. Experiments were performed aimed at (1) recreating existing segmentation techniques with the proposed framework and (2) demonstrating the potential of employing sparsely annotated atlas data for multi-atlas segmentation.

  15. Seed harvesting by a generalist consumer is context-dependent: Interactive effects across multiple spatial scales

    USGS Publications Warehouse

    Ostoja, Steven M.; Schupp, Eugene W.; Klinger, Rob

    2013-01-01

    Granivore foraging decisions affect consumer success and determine the quantity and spatial pattern of seed survival. These decisions are influenced by environmental variation at spatial scales ranging from landscapes to local foraging patches. In a field experiment, the effects of seed patch variation across three spatial scales on seed removal by western harvester ants Pogonomyrmex occidentalis were evaluated. At the largest scale we assessed harvesting in different plant communities, at the intermediate scale we assessed harvesting at different distances from ant mounds, and at the smallest scale we assessed the effects of interactions among seed species in local seed neighborhoods on seed harvesting (i.e. resource–consumer interface). Selected seed species were presented alone (monospecific treatment) and in mixture with Bromus tectorum (cheatgrass; mixture treatment) at four distances from P. occidentalis mounds in adjacent intact sagebrush and non-native cheatgrass-dominated communities in the Great Basin, Utah, USA. Seed species differed in harvest, with B. tectorum being least preferred. Large and intermediate scale variation influenced harvest. More seeds were harvested in sagebrush than in cheatgrass-dominated communities (largest scale), and the quantity of seed harvested varied with distance from mounds (intermediate-scale), although the form of the distance effect differed between plant communities. At the smallest scale, seed neighborhood affected harvest, but the patterns differed among seed species considered. Ants harvested fewer seeds from mixed-seed neighborhoods than from monospecific neighborhoods, suggesting context dependence and potential associational resistance. Further, the effects of plant community and distance from mound on seed harvest in mixtures differed from their effects in monospecific treatments. Beyond the local seed neighborhood, selection of seed resources is better understood by simultaneously evaluating removal at

  16. Breeding bird response to partially harvested riparian management zones

    USGS Publications Warehouse

    Chizinski, Christopher J.; Peterson, Anna; Hanowski, JoAnn; Blinn, Charles R.; Vondracek, Bruce C.; Niemi, Gerald

    2011-01-01

    We compared avian communities among three timber harvesting treatments in 45-m wide even-age riparian management zones (RMZs) placed between upland clearcuts and along one side of first- or second-order streams in northern Minnesota, USA. The RMZs had three treatments: (1) unharvested, (2) intermediate residual basal area (RBA) (targeted goal 11.5 m²/ha, realized 16.0 m²/ha), and (3) low RBA (targeted goal 5.7 m²/ha, realized 8.7 m²/ha). Surveys were conducted one year pre-harvest and three consecutive years post-harvest. There was no change in species richness, diversity, or total abundance associated with harvest but there were shifts in the types of birds within the community. In particular, White-throated Sparrows (Zonotrichia albicollis) and Chestnut-sided Warblers (Dendroica pensylvanica) increased while Ovenbirds (Seiurus aurocapilla) and Red-eyed Vireos (Vireo olivaceus) decreased. The decline of avian species associated with mature forest in the partially harvested treatments relative to controls indicates that maintaining an unharvested RMZ adjacent to an upland harvest may aid in maintaining avian species associated with mature forest in Minnesota for at least three years post-harvest. However, our observations do not reflect reproductive success, which is an area for future research.

  17. Job Creation in Rural Areas: A Select Annotated Bibliography.

    ERIC Educational Resources Information Center

    Pankratz, John

    1989-01-01

    This annotated bibliography is designed to assist rural leaders seeking ways to effectively structure successful job development projects in their communities. The 120 entries are listed in the main body alphabetically by author, and are grouped in the index into categories reflecting Thomas's "seven hallmarks of successful rural development": (1)…

  18. Mitochondrial Disease Sequence Data Resource (MSeqDR): a global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities.

    PubMed

    Falk, Marni J; Shen, Lishuang; Gonzalez, Michael; Leipzig, Jeremy; Lott, Marie T; Stassen, Alphons P M; Diroma, Maria Angela; Navarro-Gomez, Daniel; Yeske, Philip; Bai, Renkui; Boles, Richard G; Brilhante, Virginia; Ralph, David; DaRe, Jeana T; Shelton, Robert; Terry, Sharon F; Zhang, Zhe; Copeland, William C; van Oven, Mannis; Prokisch, Holger; Wallace, Douglas C; Attimonelli, Marcella; Krotoski, Danuta; Zuchner, Stephan; Gai, Xiaowu

    2015-03-01

    Success rates for genomic analyses of highly heterogeneous disorders can be greatly improved if a large cohort of patient data is assembled to enhance collective capabilities for accurate sequence variant annotation, analysis, and interpretation. Indeed, molecular diagnostics requires the establishment of robust data resources to enable data sharing that informs accurate understanding of genes, variants, and phenotypes. The "Mitochondrial Disease Sequence Data Resource (MSeqDR) Consortium" is a grass-roots effort facilitated by the United Mitochondrial Disease Foundation to identify and prioritize specific genomic data analysis needs of the global mitochondrial disease clinical and research community. A central Web portal (https://mseqdr.org) facilitates the coherent compilation, organization, annotation, and analysis of sequence data from both nuclear and mitochondrial genomes of individuals and families with suspected mitochondrial disease. This Web portal provides users with a flexible and expandable suite of resources to enable variant-, gene-, and exome-level sequence analysis in a secure, Web-based, and user-friendly fashion. Users can also elect to share data with other MSeqDR Consortium members, or even the general public, either by custom annotation tracks or through the use of a convenient distributed annotation system (DAS) mechanism. A range of data visualization and analysis tools are provided to facilitate user interrogation and understanding of genomic, and ultimately phenotypic, data of relevance to mitochondrial biology and disease. Currently available tools for nuclear and mitochondrial gene analyses include an MSeqDR GBrowse instance that hosts optimized mitochondrial disease and mitochondrial DNA (mtDNA) specific annotation tracks, as well as an MSeqDR locus-specific database (LSDB) that curates variant data on more than 1300 genes that have been implicated in mitochondrial disease and/or encode mitochondria-localized proteins. MSeqDR is

  19. Mitochondrial Disease Sequence Data Resource (MSeqDR): A global grass-roots consortium to facilitate deposition, curation, annotation, and integrated analysis of genomic data for the mitochondrial disease clinical and research communities

    PubMed Central

    Falk, Marni J.; Shen, Lishuang; Gonzalez, Michael; Leipzig, Jeremy; Lott, Marie T.; Stassen, Alphons P.M.; Diroma, Maria Angela; Navarro-Gomez, Daniel; Yeske, Philip; Bai, Renkui; Boles, Richard G.; Brilhante, Virginia; Ralph, David; DaRe, Jeana T.; Shelton, Robert; Terry, Sharon; Zhang, Zhe; Copeland, William C.; van Oven, Mannis; Prokisch, Holger; Wallace, Douglas C.; Attimonelli, Marcella; Krotoski, Danuta; Zuchner, Stephan; Gai, Xiaowu

    2014-01-01

    Success rates for genomic analyses of highly heterogeneous disorders can be greatly improved if a large cohort of patient data is assembled to enhance collective capabilities for accurate sequence variant annotation, analysis, and interpretation. Indeed, molecular diagnostics requires the establishment of robust data resources to enable data sharing that informs accurate understanding of genes, variants, and phenotypes. The “Mitochondrial Disease Sequence Data Resource (MSeqDR) Consortium” is a grass-roots effort facilitated by the United Mitochondrial Disease Foundation to identify and prioritize specific genomic data analysis needs of the global mitochondrial disease clinical and research community. A central Web portal (https://mseqdr.org) facilitates the coherent compilation, organization, annotation, and analysis of sequence data from both nuclear and mitochondrial genomes of individuals and families with suspected mitochondrial disease. This Web portal provides users with a flexible and expandable suite of resources to enable variant-, gene-, and exome-level sequence analysis in a secure, Web-based, and user-friendly fashion. Users can also elect to share data with other MSeqDR Consortium members, or even the general public, either by custom annotation tracks or through use of a convenient distributed annotation system (DAS) mechanism. A range of data visualization and analysis tools are provided to facilitate user interrogation and understanding of genomic, and ultimately phenotypic, data of relevance to mitochondrial biology and disease. Currently available tools for nuclear and mitochondrial gene analyses include an MSeqDR GBrowse instance that hosts optimized mitochondrial disease and mitochondrial DNA (mtDNA) specific annotation tracks, as well as an MSeqDR locus-specific database (LSDB) that curates variant data on more than 1,300 genes that have been implicated in mitochondrial disease and/or encode mitochondria-localized proteins. MSeqDR is

  20. Early response of ground layer plant communities to wildfire and harvesting disturbance in forested peatland ecosystems in northern Minnesota, USA

    Treesearch

    Erika R. Rowe; Anthony W. D' Amato; Brian J. Palik; John C. Almendinger

    2017-01-01

    A rare, stand-replacing fire in northern Minnesota, USA provided the opportunity to compare the effects of wildfire and timber harvesting in two peatland forest communities, nutrient-poor black spruce (Picea mariana) bogs (BSB) and nutrient-rich tamarack (Larix laricina) swamps (RTS). We found the response between the two...

  1. Challenges and Insights in Using HIPAA Privacy Rule for Clinical Text Annotation.

    PubMed

    Kayaalp, Mehmet; Browne, Allen C; Sagan, Pamela; McGee, Tyne; McDonald, Clement J

    2015-01-01

    The Privacy Rule of the Health Insurance Portability and Accountability Act (HIPAA) requires that clinical documents be stripped of personally identifying information before they can be released to researchers and others. We have been manually annotating clinical text since 2008 in order to test and evaluate an algorithmic clinical text de-identification tool, NLM Scrubber, which we have been developing in parallel. Although HIPAA provides some guidance about what must be de-identified, translating those guidelines into practice is not straightforward, especially when one deals with free text. As a result, we have changed our manual annotation labels and methods six times. This paper explains why we made those annotation choices, which have evolved over seven years of practice in this field. The aim of this paper is to start a community discussion towards developing standards for clinical text annotation with the end goal of studying and comparing clinical text de-identification systems more accurately.

  2. Student Learning and the College Library: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Shklanka, Olga

    The purpose of this annotated bibliography is twofold: (1) to identify which educational and library science literature deals with the learning needs of college students in libraries, and (2) to identify the extent to which library services have been integrated into the educational objectives and learning practices of Canadian community colleges.…

  3. Annotation Graphs: A Graph-Based Visualization for Meta-Analysis of Data Based on User-Authored Annotations.

    PubMed

    Zhao, Jian; Glueck, Michael; Breslav, Simon; Chevalier, Fanny; Khan, Azam

    2017-01-01

    User-authored annotations of data can support analysts in the activity of hypothesis generation and sensemaking, where it is not only critical to document key observations, but also to communicate insights between analysts. We present annotation graphs, a dynamic graph visualization that enables meta-analysis of data based on user-authored annotations. The annotation graph topology encodes annotation semantics, which describe the content of and relations between data selections, comments, and tags. We present a mixed-initiative approach to graph layout that integrates an analyst's manual manipulations with an automatic method based on similarity inferred from the annotation semantics. Various visual graph layout styles reveal different perspectives on the annotation semantics. Annotation graphs are implemented within C8, a system that supports authoring annotations during exploratory analysis of a dataset. We apply principles of Exploratory Sequential Data Analysis (ESDA) in designing C8, and further link these to an existing task typology in the visualization literature. We develop and evaluate the system through an iterative user-centered design process with three experts, situated in the domain of analyzing HCI experiment data. The results suggest that annotation graphs are effective as a method of visually extending user-authored annotations to data meta-analysis for discovery and organization of ideas.

  4. The Accuracy and Reliability of Crowdsource Annotations of Digital Retinal Images.

    PubMed

    Mitry, Danny; Zutis, Kris; Dhillon, Baljean; Peto, Tunde; Hayat, Shabina; Khaw, Kay-Tee; Morgan, James E; Moncur, Wendy; Trucco, Emanuele; Foster, Paul J

    2016-09-01

    Crowdsourcing is based on outsourcing computationally intensive tasks to numerous individuals in the online community who have no formal training. Our aim was to develop a novel online tool designed to facilitate large-scale annotation of digital retinal images, and to assess the accuracy of crowdsource grading using this tool, comparing it to expert classification. We used 100 retinal fundus photograph images with predetermined disease criteria selected by two experts from a large cohort study. The Amazon Mechanical Turk Web platform was used to drive traffic to our site so anonymous workers could perform a classification and annotation task of the fundus photographs in our dataset after a short training exercise. Three groups were assessed: masters only, nonmasters only and nonmasters with compulsory training. We calculated the sensitivity, specificity, and area under the curve (AUC) of receiver operating characteristic (ROC) plots for all classifications compared to expert grading, and used the Dice coefficient and consensus threshold to assess annotation accuracy. In total, we received 5389 annotations for 84 images (excluding 16 training images) in 2 weeks. A specificity and sensitivity of 71% (95% confidence interval [CI], 69%-74%) and 87% (95% CI, 86%-88%) was achieved for all classifications. The AUC in this study for all classifications combined was 0.93 (95% CI, 0.91-0.96). For image annotation, a maximal Dice coefficient (∼0.6) was achieved with a consensus threshold of 0.25. This study supports the hypothesis that annotation of abnormalities in retinal images by ophthalmologically naive individuals is comparable to expert annotation. The highest AUC and agreement with expert annotation was achieved in the nonmasters with compulsory training group. The use of crowdsourcing as a technique for retinal image analysis may be comparable to expert graders and has the potential to deliver timely, accurate, and cost-effective image analysis.
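
    The consensus threshold and Dice coefficient used in this abstract to score crowd annotations can be sketched as follows. This is a minimal illustration under stated assumptions: each annotation is modelled as a set of pixel coordinates, and a pixel joins the consensus region when the fraction of workers marking it is at least the threshold. The study's actual data format and tie-breaking rule are not given in the abstract, and all names and data below are hypothetical.

```python
from collections import Counter
from typing import Iterable, Set, Tuple

Pixel = Tuple[int, int]


def consensus_region(annotations: Iterable[Set[Pixel]], threshold: float) -> Set[Pixel]:
    """Pixels marked by at least `threshold` (a fraction) of the annotators."""
    annotations = list(annotations)
    counts = Counter(p for marked in annotations for p in marked)
    n = len(annotations)
    return {p for p, c in counts.items() if c / n >= threshold}


def dice(a: Set[Pixel], b: Set[Pixel]) -> float:
    """Dice coefficient 2|A ∩ B| / (|A| + |B|); defined as 1.0 for two empty sets."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical toy data: one expert region and four crowd annotations.
expert = {(0, 0), (0, 1), (1, 0), (1, 1)}
crowd = [
    {(0, 0), (0, 1)},
    {(0, 0), (1, 1), (2, 2)},
    {(0, 0), (0, 1), (1, 0)},
    {(0, 0)},
]

for t in (0.25, 0.5, 1.0):
    print(t, round(dice(consensus_region(crowd, t), expert), 3))
```

    In this toy data a low threshold recovers more of the expert region at the cost of one stray pixel, while a strict threshold keeps only the pixel every worker agreed on, which mirrors the trade-off that a consensus threshold controls.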

  5. Economic valuation of subsistence harvest of wildlife in Madagascar.

    PubMed

    Golden, Christopher D; Bonds, Matthew H; Brashares, Justin S; Rasolofoniaina, B J Rodolph; Kremen, Claire

    2014-02-01

    Wildlife consumption can be viewed as an ecosystem provisioning service (the production of a material good through ecological functioning) because of wildlife's ability to persist under sustainable levels of harvest. We used the case of wildlife harvest and consumption in northeastern Madagascar to identify the distribution of these services to local households and communities to further our understanding of local reliance on natural resources. We inferred these benefits from demand curves built with data on wildlife sales transactions. On average, the value of wildlife provisioning represented 57% of annual household cash income in local communities from the Makira Natural Park and Masoala National Park, and harvested areas produced an economic return of US$0.42 ha⁻¹ year⁻¹. Variability in value of harvested wildlife was high among communities and households with an approximate 2 orders of magnitude difference in the proportional value of wildlife to household income. The imputed price of harvested wildlife and its consumption were strongly associated (p < 0.001), and increases in price led to reduced harvest for consumption. Heightened monitoring and enforcement of hunting could increase the costs of harvesting and thus elevate the price and reduce consumption of wildlife. Increased enforcement would therefore be beneficial to biodiversity conservation but could limit local people's food supply. Specifically, our results provide an estimate of the cost of offsetting economic losses to local populations from the enforcement of conservation policies. By explicitly estimating the welfare effects of consumed wildlife, our results may inform targeted interventions by public health and development specialists as they allocate sparse funds to support regions, households, or individuals most vulnerable to changes in access to wildlife. © 2013 Society for Conservation Biology.

  6. Apollo: a sequence annotation editor

    PubMed Central

    Lewis, SE; Searle, SMJ; Harris, N; Gibson, M; Iyer, V; Richter, J; Wiel, C; Bayraktaroglu, L; Birney, E; Crosby, MA; Kaminker, JS; Matthews, BB; Prochnik, SE; Smith, CD; Tupy, JL; Rubin, GM; Misra, S; Mungall, CJ; Clamp, ME

    2002-01-01

    The well-established inaccuracy of purely computational methods for annotating genome sequences necessitates an interactive tool to allow biological experts to refine these approximations by viewing and independently evaluating the data supporting each annotation. Apollo was developed to meet this need, enabling curators to inspect genome annotations closely and edit them. FlyBase biologists successfully used Apollo to annotate the Drosophila melanogaster genome and it is increasingly being used as a starting point for the development of customized annotation editing tools for other genome projects. PMID:12537571

  7. An approach to describing and analysing bulk biological annotation quality: a case study using UniProtKB.

    PubMed

    Bell, Michael J; Gillespie, Colin S; Swan, Daniel; Lord, Phillip

    2012-09-15

    Annotations are a key feature of many biological databases, used to convey our knowledge of a sequence to the reader. Ideally, annotations are curated manually; however, manual curation is costly, time consuming and requires expert knowledge and training. Given these issues and the exponential increase of data, many databases implement automated annotation pipelines in an attempt to avoid un-annotated entries. Both manual and automated annotations vary in quality between databases and annotators, making assessment of annotation reliability problematic for users. The community lacks a generic measure for determining annotation quality and correctness, which we look at addressing within this article. Specifically, we investigate word reuse within bulk textual annotations and relate this to Zipf's Principle of Least Effort. We use the UniProt Knowledgebase (UniProtKB) as a case study to demonstrate this approach since it allows us to compare annotation change, both over time and between automated and manually curated annotations. By applying power-law distributions to word reuse in annotation, we show clear trends in UniProtKB over time, which are consistent with existing studies of quality on free text English. Further, we show a clear distinction between manual and automated analysis and investigate cohorts of protein records as they mature. These results suggest that this approach holds distinct promise as a mechanism for judging annotation quality. Source code is available at the authors' website: http://homepages.cs.ncl.ac.uk/m.j.bell1/annotation.
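
    The idea of relating word reuse to a power law can be illustrated with a small sketch. It is not the authors' method: it swaps in a plain least-squares fit to the log-log rank-frequency curve, which is cruder than a maximum-likelihood power-law fit, and the function name and sample text are hypothetical.

```python
import math
from collections import Counter


def zipf_slope(text: str) -> float:
    """Least-squares slope of log(frequency) against log(rank) for the words in `text`.

    A slope near -1 corresponds to the classic Zipf rank-frequency relationship.
    Requires at least two distinct words.
    """
    freqs = sorted(Counter(text.lower().split()).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct words")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Hypothetical annotation text whose word counts (12, 6, 4, 3) follow 12 / rank.
sample = " ".join(["protein"] * 12 + ["kinase"] * 6 + ["binding"] * 4 + ["domain"] * 3)
slope = zipf_slope(sample)
```

    Tracking how such a fitted exponent changes between database releases, or between manually and automatically curated entries, is one simple way to picture the comparison the abstract describes.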

  8. Towards Automated Annotation of Benthic Survey Images: Variability of Human Experts and Operational Modes of Automation

    PubMed Central

    Beijbom, Oscar; Edmunds, Peter J.; Roelfsema, Chris; Smith, Jennifer; Kline, David I.; Neal, Benjamin P.; Dunlap, Matthew J.; Moriarty, Vincent; Fan, Tung-Yung; Tan, Chih-Jui; Chan, Stephen; Treibitz, Tali; Gamst, Anthony; Mitchell, B. Greg; Kriegman, David

    2015-01-01

    Global climate change and other anthropogenic stressors have heightened the need to rapidly characterize ecological changes in marine benthic communities across large scales. Digital photography enables rapid collection of survey images to meet this need, but the subsequent image annotation is typically a time-consuming, manual task. We investigated the feasibility of using automated point-annotation to expedite cover estimation of the 17 dominant benthic categories from survey images captured at four Pacific coral reefs. Inter- and intra-annotator variability among six human experts was quantified and compared to semi- and fully automated annotation methods, which are made available at coralnet.ucsd.edu. Our results indicate high expert agreement for identification of coral genera, but lower agreement for algal functional groups, in particular between turf algae and crustose coralline algae. This indicates the need for unequivocal definitions of algal groups, careful training of multiple annotators, and enhanced imaging technology. Semi-automated annotation, where 50% of the annotation decisions were performed automatically, yielded cover estimate errors comparable to those of the human experts. Furthermore, fully automated annotation yielded rapid, unbiased cover estimates but with increased variance. These results show that automated annotation can increase spatial coverage and decrease time and financial outlay for image-based reef surveys. PMID:26154157

  9. The annotation-enriched non-redundant patent sequence databases.

    PubMed

    Li, Weizhong; Kondratowicz, Bartosz; McWilliam, Hamish; Nauche, Stephane; Lopez, Rodrigo

    2013-01-01

    The EMBL-European Bioinformatics Institute (EMBL-EBI) offers public access to patent sequence data, providing a valuable service to the intellectual property and scientific communities. The non-redundant (NR) patent sequence databases comprise two-level nucleotide and protein sequence clusters (NRNL1, NRNL2, NRPL1 and NRPL2) based on sequence identity (level-1) and patent family (level-2). Annotation from the source entries in these databases is merged and enhanced with additional information from the patent literature and biological context. Corrections in patent publication numbers, kind-codes and patent equivalents significantly improve the data quality. Data are available through various user interfaces including web browser, downloads via FTP, SRS, Dbfetch and EBI-Search. Sequence similarity/homology searches against the databases are available using BLAST, FASTA and PSI-Search. In this article, we describe the data collection and annotation and also outline major changes and improvements introduced since 2009. Apart from data growth, these changes include additional annotation for singleton clusters, the identifier versioning for tracking entry change and the entry mappings between the two-level databases. Database URL: http://www.ebi.ac.uk/patentdata/nr/

  10. The Annotation-enriched non-redundant patent sequence databases

    PubMed Central

    Li, Weizhong; Kondratowicz, Bartosz; McWilliam, Hamish; Nauche, Stephane; Lopez, Rodrigo

    2013-01-01

    The EMBL-European Bioinformatics Institute (EMBL-EBI) offers public access to patent sequence data, providing a valuable service to the intellectual property and scientific communities. The non-redundant (NR) patent sequence databases comprise two-level nucleotide and protein sequence clusters (NRNL1, NRNL2, NRPL1 and NRPL2) based on sequence identity (level-1) and patent family (level-2). Annotation from the source entries in these databases is merged and enhanced with additional information from the patent literature and biological context. Corrections in patent publication numbers, kind-codes and patent equivalents significantly improve the data quality. Data are available through various user interfaces including web browser, downloads via FTP, SRS, Dbfetch and EBI-Search. Sequence similarity/homology searches against the databases are available using BLAST, FASTA and PSI-Search. In this article, we describe the data collection and annotation and also outline major changes and improvements introduced since 2009. Apart from data growth, these changes include additional annotation for singleton clusters, the identifier versioning for tracking entry change and the entry mappings between the two-level databases. Database URL: http://www.ebi.ac.uk/patentdata/nr/ PMID:23396323

  11. The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations.

    PubMed

    Chibucos, Marcus C; Siegele, Deborah A; Hu, James C; Giglio, Michelle

    2017-01-01

    The Evidence and Conclusion Ontology (ECO) is a community resource for describing the various types of evidence that are generated during the course of a scientific study and which are typically used to support assertions made by researchers. ECO describes multiple evidence types, including evidence resulting from experimental (i.e., wet lab) techniques, evidence arising from computational methods, statements made by authors (whether or not supported by evidence), and inferences drawn by researchers curating the literature. In addition to summarizing the evidence that supports a particular assertion, ECO also offers a means to document whether a computer or a human performed the process of making the annotation. Incorporating ECO into an annotation system makes it possible to leverage the structure of the ontology such that associated data can be grouped hierarchically, users can select data associated with particular evidence types, and quality control pipelines can be optimized. Today, over 30 resources, including the Gene Ontology, use the Evidence and Conclusion Ontology to represent both evidence and how annotations are made.

  12. Breeding, Early-Successional Bird Response to Forest Harvests for Bioenergy.

    PubMed

    Grodsky, Steven M; Moorman, Christopher E; Fritts, Sarah R; Castleberry, Steven B; Wigley, T Bently

    2016-01-01

    Forest regeneration following timber harvest is a principal source of habitat for early-successional birds and characterized by influxes of early-successional vegetation and residual downed woody material. Early-successional birds may use harvest residues for communication, cover, foraging, and nesting. Yet, increased market viability of woody biomass as bioenergy feedstock may intensify harvest residue removal. Our objectives were to: (1) evaluate effects of varying intensities of woody biomass harvest on the early-successional bird community; and (2) document early-successional bird use of harvest residues in regenerating stands. We spot-mapped birds from 15 April-15 July, 2012-2014, in six woody biomass removal treatments within regenerating stands in North Carolina (n = 4) and Georgia (n = 4), USA. Treatments included clearcut harvest followed by: (1) traditional woody biomass harvest with no specific retention target; (2) 15% retention with harvest residues dispersed; (3) 15% retention with harvest residues clustered; (4) 30% retention with harvest residues dispersed; (5) 30% retention with harvest residues clustered; and (6) no woody biomass harvest (i.e., reference site). We tested for treatment-level effects on breeding bird species diversity and richness, early-successional focal species territory density (combined and individual species), counts of breeding birds detected near, in, or on branches of harvest piles/windrows, counts of breeding bird behaviors, and vegetation composition and structure. Pooled across three breeding seasons, we delineated 536 and 654 territories and detected 2,489 and 4,204 birds in the North Carolina and Georgia treatments, respectively. Woody biomass harvest had limited or short-lived effects on the early-successional, breeding bird community. The successional trajectory of vegetation structure, rather than availability of harvest residues, primarily drove avian use of regenerating stands. However, many breeding bird species…

  13. Breeding, Early-Successional Bird Response to Forest Harvests for Bioenergy

    PubMed Central

    Grodsky, Steven M.; Moorman, Christopher E.; Fritts, Sarah R.; Castleberry, Steven B.; Wigley, T. Bently

    2016-01-01

    Forest regeneration following timber harvest is a principal source of habitat for early-successional birds and characterized by influxes of early-successional vegetation and residual downed woody material. Early-successional birds may use harvest residues for communication, cover, foraging, and nesting. Yet, increased market viability of woody biomass as bioenergy feedstock may intensify harvest residue removal. Our objectives were to: (1) evaluate effects of varying intensities of woody biomass harvest on the early-successional bird community; and (2) document early-successional bird use of harvest residues in regenerating stands. We spot-mapped birds from 15 April–15 July, 2012–2014, in six woody biomass removal treatments within regenerating stands in North Carolina (n = 4) and Georgia (n = 4), USA. Treatments included clearcut harvest followed by: (1) traditional woody biomass harvest with no specific retention target; (2) 15% retention with harvest residues dispersed; (3) 15% retention with harvest residues clustered; (4) 30% retention with harvest residues dispersed; (5) 30% retention with harvest residues clustered; and (6) no woody biomass harvest (i.e., reference site). We tested for treatment-level effects on breeding bird species diversity and richness, early-successional focal species territory density (combined and individual species), counts of breeding birds detected near, in, or on branches of harvest piles/windrows, counts of breeding bird behaviors, and vegetation composition and structure. Pooled across three breeding seasons, we delineated 536 and 654 territories and detected 2,489 and 4,204 birds in the North Carolina and Georgia treatments, respectively. Woody biomass harvest had limited or short-lived effects on the early-successional, breeding bird community. The successional trajectory of vegetation structure, rather than availability of harvest residues, primarily drove avian use of regenerating stands. However, many breeding bird…

  14. "Straight from the heavens into your bucket": domestic rainwater harvesting as a measure to improve water security in a subarctic indigenous community.

    PubMed

    Mercer, Nicholas; Hanrahan, Maura

    2017-01-01

    Black Tickle-Domino is an extremely water-insecure remote Inuit community in the Canadian subarctic that lacks piped water. Drinking water consumption in the community is less than a third of the Canadian national average. Water insecurity in the community contributes to adverse health, economic, and social effects and requires urgent action. The objective was to test domestic rainwater harvesting (DRWH) for the first time in the subarctic, with the goal of improving water access and use in the community. This project utilised quantitative weekly reporting of water collection and use, as well as focus group discussions. DRWH units were installed at seven water-insecure households chosen by the local government. Results were measured over a 6-week period in 2016. Participants harvested 19.07 gallons of rainwater per week. General purpose water consumption increased by 17% and water retrieval efforts declined by 40.92%. Households saved $12.70 CDN per week. Participants reported perceived improvements to psychological health. Because no potable water was collected, drinking water consumption did not increase. The study identified additional water-insecurity impacts. DRWH cannot supply drinking water without proper treatment and filtration; however, it can be a partial remedy to water insecurity in the subarctic. DRWH is appropriately scaled and inexpensive, and participants identified several significant benefits.

  15. An Introduction to Youth with Disabilities: Annotated Bibliography. CYDLINE Reviews.

    ERIC Educational Resources Information Center

    Minnesota Univ., Minneapolis. National Center for Youth with Disabilities.

    The annotated bibliography describes resources covering a wide range of issues related to disabled youth and their families. The 38 bibliographic citations date from 1980 to 1989 and are grouped into the following categories: psychosocial issues, health issues, educational issues, and community living. Information is also provided on services of…

  16. Genes controlling seed dormancy and pre-harvest sprouting in a rice-wheat-barley comparison.

    PubMed

    Li, Chengdao; Ni, Peixiang; Francki, Michael; Hunter, Adam; Zhang, Yong; Schibeci, David; Li, Heng; Tarr, Allen; Wang, Jun; Cakir, Mehmet; Yu, Jun; Bellgard, Matthew; Lance, Reg; Appels, Rudi

    2004-05-01

    Pre-harvest sprouting results in significant economic loss for the grain industry around the world. Lack of adequate seed dormancy is the major reason for pre-harvest sprouting in the field under wet weather conditions. Although this trait is governed by multiple genes it is also highly heritable. A major QTL controlling both pre-harvest sprouting and seed dormancy has been identified on the long arm of barley chromosome 5H, and it explains over 70% of the phenotypic variation. Comparative genomics approaches among barley, wheat and rice were used to identify candidate gene(s) controlling seed dormancy and hence one aspect of pre-harvest sprouting. The barley seed dormancy/pre-harvest sprouting QTL was located in a region that showed good synteny with the terminal end of the long arm of rice chromosome 3. The rice DNA sequences were annotated and a gene encoding GA20-oxidase was identified as a candidate gene controlling the seed dormancy/pre-harvest sprouting QTL on 5HL. This chromosomal region also shared synteny with the telomere region of wheat chromosome 4AL, but was located outside of the QTL reported for seed dormancy in wheat. The wheat chromosome 4AL QTL region for seed dormancy was syntenic to both rice chromosome 3 and 11. In both cases, corresponding QTLs for seed dormancy have been mapped in rice.

  17. Dry creek long-term watershed study: the effects of harvesting in streamside management zones and adjacent uplands of riparian corridors on avian communities in the Coastal Plain of Georgia

    Treesearch

    Merideth P. Grooms; J. Drew Lanham; T. Bently Wigley

    2006-01-01

    We evaluated the effects of Best Management Practices (BMPs) harvesting on avian communities associated with headwater streams in the Georgia Coastal Plain. Two watersheds served as references, with no timber harvesting, and two treatment watersheds were clearcut with retention of Streamside Management Zones (SMZs) according to Georgia BMPs for forestry. Bird...

  18. Phenex: ontological annotation of phenotypic diversity.

    PubMed

    Balhoff, James P; Dahdul, Wasila M; Kothari, Cartik R; Lapp, Hilmar; Lundberg, John G; Mabee, Paula; Midford, Peter E; Westerfield, Monte; Vision, Todd J

    2010-05-05

    Phenotypic differences among species have long been systematically itemized and described by biologists in the process of investigating phylogenetic relationships and trait evolution. Traditionally, these descriptions have been expressed in natural language within the context of individual journal publications or monographs. As such, this rich store of phenotype data has been largely unavailable for statistical and computational comparisons across studies or integration with other biological knowledge. Here we describe Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using Entity-Quality syntax, drawing on terms from community ontologies for anatomical entities, phenotypic qualities, and taxonomic names. Phenex can be configured to load only those ontologies pertinent to a taxonomic group of interest. The graphical user interface was optimized for evolutionary biologists accustomed to working with lists of taxa, characters, character states, and character-by-taxon matrices. Annotation of phenotypic data using ontologies and globally unique taxonomic identifiers will allow biologists to integrate phenotypic data from different organisms and studies, leveraging decades of work in systematics and comparative morphology.

  19. RATT: Rapid Annotation Transfer Tool

    PubMed Central

    Otto, Thomas D.; Dillon, Gary P.; Degrave, Wim S.; Berriman, Matthew

    2011-01-01

    Second-generation sequencing technologies have made large-scale sequencing projects commonplace. However, making use of these datasets often requires gene function to be ascribed genome wide. Although tool development has kept pace with the changes in sequence production, for tasks such as mapping, de novo assembly or visualization, genome annotation remains a challenge. We have developed a method to rapidly provide accurate annotation for new genomes using previously annotated genomes as a reference. The method, implemented in a tool called RATT (Rapid Annotation Transfer Tool), transfers annotations from a high-quality reference to a new genome on the basis of conserved synteny. We demonstrate that a Mycobacterium tuberculosis genome or a single 2.5 Mb chromosome from a malaria parasite can be annotated in less than five minutes with only modest computational resources. RATT is available at http://ratt.sourceforge.net. PMID:21306991

  20. AnnotateGenomicRegions: a web application.

    PubMed

    Zammataro, Luca; DeMolfetta, Rita; Bucci, Gabriele; Ceol, Arnaud; Muller, Heiko

    2014-01-01

    Modern genomic technologies produce large amounts of data that can be mapped to specific regions in the genome. Among the first steps in interpreting the results is annotation of genomic regions with known features such as genes, promoters, CpG islands etc. Several tools have been published to perform this task. However, using these tools often requires a significant amount of bioinformatics skills and/or downloading and installing dedicated software. Here we present AnnotateGenomicRegions, a web application that accepts genomic regions as input and outputs a selection of overlapping and/or neighboring genome annotations. Supported organisms include human (hg18, hg19), mouse (mm8, mm9, mm10), zebrafish (danRer7), and Saccharomyces cerevisiae (sacCer2, sacCer3). AnnotateGenomicRegions is accessible online on a public server or can be installed locally. Some frequently used annotations and genomes are embedded in the application while custom annotations may be added by the user. The increasing spread of genomic technologies generates the need for a simple-to-use annotation tool for genomic regions that can be used by biologists and bioinformaticians alike. AnnotateGenomicRegions meets this demand. AnnotateGenomicRegions is an open-source web application that can be installed on any personal computer or institute server. AnnotateGenomicRegions is available at: http://cru.genomics.iit.it/AnnotateGenomicRegions.
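
    The core operation described here, reporting the known features that overlap each input region, can be sketched as follows. This is an illustrative example only and not the application's actual code; it assumes 0-based, half-open coordinates as in BED files, and the function name annotate_regions and the tuple layouts are hypothetical.

```python
from collections import defaultdict


def annotate_regions(regions, features):
    """Return, for each region, the names of the features that overlap it.

    regions:  list of (chrom, start, end)
    features: list of (chrom, start, end, name)
    Coordinates are assumed to be 0-based and half-open (BED-style).
    """
    # Group features by chromosome so each region is only compared
    # against features on the same sequence.
    by_chrom = defaultdict(list)
    for chrom, start, end, name in features:
        by_chrom[chrom].append((start, end, name))

    annotated = []
    for chrom, start, end in regions:
        hits = [
            name
            for f_start, f_end, name in by_chrom.get(chrom, [])
            if f_start < end and start < f_end  # share at least one base
        ]
        annotated.append(((chrom, start, end), hits))
    return annotated
```

    The linear scan per region keeps the sketch short; for large annotation sets an interval tree or a sorted sweep would be the more usual choice.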

  1. Evaluating Hierarchical Structure in Music Annotations

    PubMed Central

    McFee, Brian; Nieto, Oriol; Farbood, Morwaread M.; Bello, Juan Pablo

    2017-01-01

    Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR), it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for “flat” descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. Using this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement. PMID:28824514

  2. MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads.

    PubMed

    Petersen, Thomas Nordahl; Lukjancenko, Oksana; Thomsen, Martin Christen Frølund; Maddalena Sperotto, Maria; Lund, Ole; Møller Aarestrup, Frank; Sicheritz-Pontén, Thomas

    2017-01-01

    An increasing number of species and gene identification studies rely on the use of next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprised of 8 genera, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. A sensitivity and precision at 75% was obtained for strain level annotations. A comparison between MGmapper and Kraken at species level shows MGmapper assigns taxonomy at species level using 84.8% of the sequence reads, compared to 70.5% for Kraken and both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and Excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a Bitbucket package (https://bitbucket.org/genomicepidemiology/mgmapper). A web-version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets.
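
    As a reminder of how the sensitivity and precision figures quoted above are defined, a minimal sketch is given below. It assumes that a benchmark is scored by comparing the set of predicted taxon labels with the set of labels known to be in the mock community; the function name and any labels passed to it are hypothetical and are not taken from MGmapper.

```python
def precision_and_sensitivity(predicted, expected):
    """Score predicted taxon labels against the known community members.

    precision   = true positives / all predicted labels
    sensitivity = true positives / all expected labels
    Returns 0.0 for a measure whose denominator would be zero.
    """
    predicted, expected = set(predicted), set(expected)
    true_positives = len(predicted & expected)
    precision = true_positives / len(predicted) if predicted else 0.0
    sensitivity = true_positives / len(expected) if expected else 0.0
    return precision, sensitivity
```

    With this definition, a prediction that contains every expected label and nothing else scores 1.0 on both measures, while false positives lower precision and missed labels lower sensitivity.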

  3. Alignment-Annotator web server: rendering and annotating sequence alignments

    PubMed Central

    Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

    2014-01-01

    Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, the UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, neither plugins nor Java are required and therefore Alignment-Annotator represents the first interactive browser-based alignment visualization. Availability: http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. PMID:24813445

  4. Light harvesting control in plants.

    PubMed

    Ruban, Alexander V

    2018-05-23

    In 1991, my colleagues and I published a hypothesis article that proposed a mechanism that controls light harvesting in plants and protects them against photodamage. The major light harvesting complex, LHCII, was suggested to undergo aggregation upon exposure of the plant to damaging levels of light. Aggregated LHCII was found to be much less efficient in light harvesting, as it promptly dissipated absorbed energy into heat, possessing a very low chlorophyll fluorescence yield. Non-photochemical quenching (NPQ) is a term coined to describe this reduction in chlorophyll fluorescence yield. This article is a story of how the hypothesis that LHCII aggregation is involved in NPQ is developed into a model that is now becoming broadly accepted by the research community. This article is protected by copyright. All rights reserved.

  5. Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Leung, Elo; Huang, Amy; Cadag, Eithon

    In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resulting functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequence-based genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.

  6. Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

    DOE PAGES

    Leung, Elo; Huang, Amy; Cadag, Eithon; ...

    2016-01-20

    In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resulting functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequence-based genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.

  7. Approaches to Fungal Genome Annotation

    PubMed Central

    Haas, Brian J.; Zeng, Qiandong; Pearson, Matthew D.; Cuomo, Christina A.; Wortman, Jennifer R.

    2011-01-01

    Fungal genome annotation is the starting point for analysis of genome content. This generally involves the application of diverse methods to identify features on a genome assembly such as protein-coding and non-coding genes, repeats and transposable elements, and pseudogenes. Here we describe tools and methods leveraged for eukaryotic genome annotation with a focus on the annotation of fungal nuclear and mitochondrial genomes. We highlight the application of the latest technologies and tools to improve the quality of predicted gene sets. The Broad Institute eukaryotic genome annotation pipeline is described as one example of how such methods and tools are integrated into a sequencing center’s production genome annotation environment. PMID:22059117

  8. AnnotateGenomicRegions: a web application

    PubMed Central

    2014-01-01

    Background: Modern genomic technologies produce large amounts of data that can be mapped to specific regions in the genome. Among the first steps in interpreting the results is annotation of genomic regions with known features such as genes, promoters, CpG islands etc. Several tools have been published to perform this task. However, using these tools often requires a significant amount of bioinformatics skills and/or downloading and installing dedicated software. Results: Here we present AnnotateGenomicRegions, a web application that accepts genomic regions as input and outputs a selection of overlapping and/or neighboring genome annotations. Supported organisms include human (hg18, hg19), mouse (mm8, mm9, mm10), zebrafish (danRer7), and Saccharomyces cerevisiae (sacCer2, sacCer3). AnnotateGenomicRegions is accessible online on a public server or can be installed locally. Some frequently used annotations and genomes are embedded in the application while custom annotations may be added by the user. Conclusions: The increasing spread of genomic technologies generates the need for a simple-to-use annotation tool for genomic regions that can be used by biologists and bioinformaticians alike. AnnotateGenomicRegions meets this demand. AnnotateGenomicRegions is an open-source web application that can be installed on any personal computer or institute server. AnnotateGenomicRegions is available at: http://cru.genomics.iit.it/AnnotateGenomicRegions. PMID:24564446

  9. The Accuracy and Reliability of Crowdsource Annotations of Digital Retinal Images

    PubMed Central

    Mitry, Danny; Zutis, Kris; Dhillon, Baljean; Peto, Tunde; Hayat, Shabina; Khaw, Kay-Tee; Morgan, James E.; Moncur, Wendy; Trucco, Emanuele; Foster, Paul J.

    2016-01-01

    Purpose Crowdsourcing is based on outsourcing computationally intensive tasks to numerous individuals in the online community who have no formal training. Our aim was to develop a novel online tool designed to facilitate large-scale annotation of digital retinal images, and to assess the accuracy of crowdsource grading using this tool, comparing it to expert classification. Methods We used 100 retinal fundus photograph images with predetermined disease criteria selected by two experts from a large cohort study. The Amazon Mechanical Turk Web platform was used to drive traffic to our site so anonymous workers could perform a classification and annotation task of the fundus photographs in our dataset after a short training exercise. Three groups were assessed: masters only, nonmasters only and nonmasters with compulsory training. We calculated the sensitivity, specificity, and area under the curve (AUC) of receiver operating characteristic (ROC) plots for all classifications compared to expert grading, and used the Dice coefficient and consensus threshold to assess annotation accuracy. Results In total, we received 5389 annotations for 84 images (excluding 16 training images) in 2 weeks. A specificity of 71% (95% confidence interval [CI], 69%–74%) and a sensitivity of 87% (95% CI, 86%–88%) were achieved for all classifications. The AUC in this study for all classifications combined was 0.93 (95% CI, 0.91–0.96). For image annotation, a maximal Dice coefficient (∼0.6) was achieved with a consensus threshold of 0.25. Conclusions This study supports the hypothesis that annotation of abnormalities in retinal images by ophthalmologically naive individuals is comparable to expert annotation. The highest AUC and agreement with expert annotation were achieved in the nonmasters with compulsory training group. Translational Relevance The use of crowdsourcing as a technique for retinal image analysis may be comparable to expert graders and has the potential to deliver
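
    To make the Dice coefficient and consensus threshold concrete, the sketch below builds a consensus region from several crowd annotations and scores it against an expert region. The record does not define the consensus threshold, so the definition used here (the fraction of workers who must mark a pixel) is an assumption, as are the function names and the toy data.

```python
from collections import Counter


def consensus_mask(annotations, threshold):
    """Pixels marked by at least `threshold` (a fraction) of the annotators."""
    counts = Counter(pixel for mask in annotations for pixel in set(mask))
    needed = threshold * len(annotations)
    return {pixel for pixel, n in counts.items() if n >= needed}


def dice(a, b):
    """Dice coefficient 2|A & B| / (|A| + |B|) of two pixel sets."""
    if not a and not b:
        return 1.0  # two empty annotations agree perfectly
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical 1-D "pixels" marked by four crowd workers and one expert.
crowd = [{1, 2, 3}, {2, 3, 4}, {3, 4, 5}, {3}]
expert = {2, 3, 4}

loose = consensus_mask(crowd, 0.25)   # one of four workers is enough
strict = consensus_mask(crowd, 0.75)  # at least three of four workers
```

    In this toy case the loose threshold scores 0.75 against the expert region and the strict threshold scores 0.5, which shows why the threshold has to be tuned rather than simply maximized.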

  10. Towards comprehensive syntactic and semantic annotations of the clinical narrative

    PubMed Central

    Albright, Daniel; Lanfranchi, Arrick; Fredriksen, Anwen; Styler, William F; Warner, Colin; Hwang, Jena D; Choi, Jinho D; Dligach, Dmitriy; Nielsen, Rodney D; Martin, James; Ward, Wayne; Palmer, Martha; Savova, Guergana K

    2013-01-01

    Objective To create annotated clinical narratives with layers of syntactic and semantic labels to facilitate advances in clinical natural language processing (NLP). To develop NLP algorithms and open source components. Methods Manual annotation of a clinical narrative corpus of 127 606 tokens following the Treebank schema for syntactic information, PropBank schema for predicate-argument structures, and the Unified Medical Language System (UMLS) schema for semantic information. NLP components were developed. Results The final corpus consists of 13 091 sentences containing 1772 distinct predicate lemmas. Of the 766 newly created PropBank frames, 74 are verbs. There are 28 539 named entity (NE) annotations spread over 15 UMLS semantic groups, one UMLS semantic type, and the Person semantic category. The most frequent annotations belong to the UMLS semantic groups of Procedures (15.71%), Disorders (14.74%), Concepts and Ideas (15.10%), Anatomy (12.80%), Chemicals and Drugs (7.49%), and the UMLS semantic type of Sign or Symptom (12.46%). Inter-annotator agreement results: Treebank (0.926), PropBank (0.891–0.931), NE (0.697–0.750). The part-of-speech tagger, constituency parser, dependency parser, and semantic role labeler are built from the corpus and released open source. A significant limitation uncovered by this project is the need for the NLP community to develop a widely agreed-upon schema for the annotation of clinical concepts and their relations. Conclusions This project takes a foundational step towards bringing the field of clinical NLP up to par with NLP in the general domain. The corpus creation and NLP components provide a resource for research and application development that would have been previously impossible. PMID:23355458

  11. Escherichia coli K-12: a cooperatively developed annotation snapshot—2005

    PubMed Central

    Riley, Monica; Abe, Takashi; Arnaud, Martha B.; Berlyn, Mary K.B.; Blattner, Frederick R.; Chaudhuri, Roy R.; Glasner, Jeremy D.; Horiuchi, Takashi; Keseler, Ingrid M.; Kosuge, Takehide; Mori, Hirotada; Perna, Nicole T.; Plunkett, Guy; Rudd, Kenneth E.; Serres, Margrethe H.; Thomas, Gavin H.; Thomson, Nicholas R.; Wishart, David; Wanner, Barry L.

    2006-01-01

    The goal of this group project has been to coordinate and bring up-to-date information on all genes of Escherichia coli K-12. Annotation of the genome of an organism entails identification of genes, the boundaries of genes in terms of precise start and end sites, and description of the gene products. Known and predicted functions were assigned to each gene product on the basis of experimental evidence or sequence analysis. Since both kinds of evidence are constantly expanding, no annotation is complete at any moment in time. This is a snapshot analysis based on the most recent genome sequences of two E.coli K-12 bacteria. An accurate and up-to-date description of E.coli K-12 genes is of particular importance to the scientific community because experimentally determined properties of its gene products provide fundamental information for annotation of innumerable genes of other organisms. Availability of the complete genome sequence of two K-12 strains allows comparison of their genotypes and mutant status of alleles. PMID:16397293

  12. Quality of Computationally Inferred Gene Ontology Annotations

    PubMed Central

    Škunca, Nives; Altenhoff, Adrian; Dessimoz, Christophe

    2012-01-01

    Gene Ontology (GO) has established itself as the undisputed standard for protein function annotation. Most annotations are inferred electronically, i.e. without individual curator supervision, but they are widely considered unreliable. At the same time, we crucially depend on those automated annotations, as most newly sequenced genomes are non-model organisms. Here, we introduce a methodology to systematically and quantitatively evaluate electronic annotations. By exploiting changes in successive releases of the UniProt Gene Ontology Annotation database, we assessed the quality of electronic annotations in terms of specificity, reliability, and coverage. Overall, we not only found that electronic annotations have significantly improved in recent years, but also that their reliability now rivals that of annotations inferred by curators when they use evidence other than experiments from primary literature. This work provides the means to identify the subset of electronic annotations that can be relied upon—an important outcome given that >98% of all annotations are inferred without direct curation. PMID:22693439

  13. SEED Software Annotations.

    ERIC Educational Resources Information Center

    Bethke, Dee; And Others

    This document provides a composite index of the first five sets of software annotations produced by Project SEED. The software has been indexed by title, subject area, and grade level, and it covers sets of annotations distributed in September 1986, April 1987, September 1987, November 1987, and February 1988. The date column in the index…

  14. Semantic annotation of consumer health questions.

    PubMed

    Kilicoglu, Halil; Ben Abacha, Asma; Mrabet, Yassine; Shooshan, Sonya E; Rodriguez, Laritza; Masterton, Kate; Demner-Fushman, Dina

    2018-02-06

    Consumers increasingly use online resources for their health information needs. While current search engines can address these needs to some extent, they generally do not take into account that most health information needs are complex and can only fully be expressed in natural language. Consumer health question answering (QA) systems aim to fill this gap. A major challenge in developing consumer health QA systems is extracting relevant semantic content from the natural language questions (question understanding). To develop effective question understanding tools, question corpora semantically annotated for relevant question elements are needed. In this paper, we present a two-part consumer health question corpus annotated with several semantic categories: named entities, question triggers/types, question frames, and question topic. The first part (CHQA-email) consists of relatively long email requests received by the U.S. National Library of Medicine (NLM) customer service, while the second part (CHQA-web) consists of shorter questions posed to MedlinePlus search engine as queries. Each question has been annotated by two annotators. The annotation methodology is largely the same between the two parts of the corpus; however, we also explain and justify the differences between them. Additionally, we provide information about corpus characteristics, inter-annotator agreement, and our attempts to measure annotation confidence in the absence of adjudication of annotations. The resulting corpus consists of 2614 questions (CHQA-email: 1740, CHQA-web: 874). Problems are the most frequent named entities, while treatment and general information questions are the most common question types. Inter-annotator agreement was generally modest: question types and topics yielded highest agreement, while the agreement for more complex frame annotations was lower. Agreement in CHQA-web was consistently higher than that in CHQA-email. Pairwise inter-annotator agreement proved most

  15. Crowdsourcing lung nodules detection and annotation

    NASA Astrophysics Data System (ADS)

    Boorboor, Saeed; Nadeem, Saad; Park, Ji Hwan; Baker, Kevin; Kaufman, Arie

    2018-03-01

    We present crowdsourcing as an additional modality to aid radiologists in the diagnosis of lung cancer from clinical chest computed tomography (CT) scans. More specifically, a complete work flow is introduced which can help maximize the sensitivity of lung nodule detection by utilizing the collective intelligence of the crowd. We combine the concept of overlapping thin-slab maximum intensity projections (TS-MIPs) and cine viewing to render short videos that can be outsourced as an annotation task to the crowd. These videos are generated by linearly interpolating overlapping TS-MIPs of CT slices through the depth of each quadrant of a patient's lung. The resultant videos are outsourced to an online community of non-expert users who, after a brief tutorial, annotate suspected nodules in these video segments. Using our crowdsourcing work flow, we achieved a lung nodule detection sensitivity of over 90% for 20 patient CT datasets (containing 178 lung nodules with sizes between 1-30mm), and only 47 false positives from a total of 1021 annotations on nodules of all sizes (96% sensitivity for nodules>4mm). These results show that crowdsourcing can be a robust and scalable modality to aid radiologists in screening for lung cancer, directly or in combination with computer-aided detection (CAD) algorithms. For CAD algorithms, the presented work flow can provide highly accurate training data to overcome the high false-positive rate (per scan) problem. We also provide, for the first time, analysis on nodule size and position which can help improve CAD algorithms.

  16. MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads

    PubMed Central

    Lukjancenko, Oksana; Thomsen, Martin Christen Frølund; Maddalena Sperotto, Maria; Lund, Ole; Møller Aarestrup, Frank; Sicheritz-Pontén, Thomas

    2017-01-01

    An increasing number of species and gene identification studies rely on the use of next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprised of 8 genera, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. Sensitivity and precision of 75% were obtained for strain level annotations. A comparison between MGmapper and Kraken at species level shows that MGmapper assigns taxonomy at species level using 84.8% of the sequence reads, compared to 70.5% for Kraken, and both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and Excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a Bitbucket package (https://bitbucket.org/genomicepidemiology/mgmapper). A web-version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets. PMID:28467460
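
    Sensitivity and precision for taxonomy assignments can be computed by comparing the reported taxa with the known composition of a mock community. The sketch below assumes exact name matching; the strain names are hypothetical placeholders and are not the benchmark data from this record.

```python
def sensitivity_precision(reported, expected):
    """Return (sensitivity, precision) for reported taxa against the known truth."""
    reported, expected = set(reported), set(expected)
    true_positives = len(reported & expected)
    sensitivity = true_positives / len(expected) if expected else 0.0
    precision = true_positives / len(reported) if reported else 0.0
    return sensitivity, precision


# Hypothetical mock community: four known strains, four reported strains,
# one of which (strain_X) is a false positive.
expected = {"strain_A", "strain_B", "strain_C", "strain_D"}
reported = {"strain_A", "strain_B", "strain_C", "strain_X"}
sens, prec = sensitivity_precision(reported, expected)
```

    Here three of the four known strains are recovered and three of the four reported strains are correct, so both values are 0.75.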

  17. Alignment-Annotator web server: rendering and annotating sequence alignments.

    PubMed

    Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

    2014-07-01

    Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, neither plugins nor Java are required and therefore Alignment-Annotator represents the first interactive browser-based alignment visualization. http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Effects of forest harvesting, logging debris, and herbicides on the composition, diversity and assembly of a western Washington, USA plant community

    Treesearch

    David H. Peter; Timothy B. Harrington

    2018-01-01

    We examined plant community organization over the first five growing seasons after clearcut harvesting with retention of two levels of logging debris (light and heavy) and application of four vegetation control treatments (non-sprayed control, aminopyralid (A), triclopyr (T), and A+T). Our study site was 44 km northwest of Olympia, WA, USA, and before forest...

  19. A Software Tool for the Annotation of Embolic Events in Echo Doppler Audio Signals

    PubMed Central

    Pierleoni, Paola; Maurizi, Lorenzo; Palma, Lorenzo; Belli, Alberto; Valenti, Simone; Marroni, Alessandro

    2017-01-01

    The use of precordial Doppler monitoring to prevent decompression sickness (DS) is well known by the scientific community as an important instrument for early diagnosis of DS. However, the timely and correct diagnosis of DS without assistance from diving medical specialists is unreliable. Thus, a common protocol for the manual annotation of echo Doppler signals and a tool for their automated recording and annotation are necessary. We have implemented original software for efficient bubble appearance annotation and proposed a unified annotation protocol. The tool auto-sets the response time of human “bubble examiners,” performs playback of the Doppler file by rendering it independent of the specific audio player, and enables the annotation of individual bubbles or multiple bubbles known as “showers.” The tool provides a report with an optimized data structure and estimates the embolic risk level according to the Extended Spencer Scale. The tool is built in accordance with ISO/IEC 9126 on software quality and has been projected and tested with assistance from the Divers Alert Network (DAN) Europe Foundation, which employs this tool for its diving data acquisition campaigns. PMID:29242701

  20. Beginning Science Teachers' Use of a Digital Video Annotation Tool to Promote Reflective Practices

    NASA Astrophysics Data System (ADS)

    McFadden, Justin; Ellis, Joshua; Anwar, Tasneem; Roehrig, Gillian

    2014-06-01

    The development of teachers as reflective practitioners is a central concept in national guidelines for teacher preparation and induction (National Council for Accreditation of Teacher Education 2008). The Teacher Induction Network (TIN) supports the development of reflective practice for beginning secondary science teachers through the creation of online "communities of practice" (Barab et al. in Inf Soc, 237-256, 2003), which have been shown to have positive impacts on teacher collaboration, communication, and reflection. Specifically, TIN integrated the use of asynchronous, video annotation as an affordance to directly facilitate teachers' reflection on their classroom practices (Tripp and Rich in Teach Teach Educ 28(5):728-739, 2013). This study examines the use of video annotation as a tool for developing reflective practices for beginning secondary science teachers. Teachers were enrolled in an online teacher induction course designed to promote reflective practice and inquiry-based instruction. A modified version of the Learning to Notice Framework (Sherin and van Es in J Teach Educ 60(1):20-37, 2009) was used to classify teachers' annotations on video of their teaching. Findings from the study include the tendency of teachers to focus on themselves in their annotations, as well as a preponderance of annotations focused on lower-level reflective practices of description and explanation. Suggestions for utilizing video annotation tools are discussed, as well as design features, which could be improved to further the development of richer annotations and deeper reflective practices.

  1. Exogean: a framework for annotating protein-coding genes in eukaryotic genomic DNA

    PubMed Central

    Djebali, Sarah; Delaplace, Franck; Crollius, Hugues Roest

    2006-01-01

    Background Accurate and automatic gene identification in eukaryotic genomic DNA is more than ever of crucial importance to efficiently exploit the large volume of assembled genome sequences available to the community. Automatic methods have always been considered less reliable than human expertise. This is illustrated in the EGASP project, where reference annotations against which all automatic methods are measured are generated by human annotators and experimentally verified. We hypothesized that replicating the accuracy of human annotators in an automatic method could be achieved by formalizing the rules and decisions that they use, in a mathematical formalism. Results We have developed Exogean, a flexible framework based on directed acyclic colored multigraphs (DACMs) that can represent biological objects (for example, mRNA, ESTs, protein alignments, exons) and relationships between them. Graphs are analyzed to process the information according to rules that replicate those used by human annotators. Simple individual starting objects given as input to Exogean are thus combined and synthesized into complex objects such as protein coding transcripts. Conclusion We show here, in the context of the EGASP project, that Exogean is currently the method that best reproduces protein coding gene annotations from human experts, in terms of identifying at least one exact coding sequence per gene. We discuss current limitations of the method and several avenues for improvement. PMID:16925841

  2. Measuring the Measurements: A Study of Evaluation of Writing: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Scherer, Darlene Lienau

    Intended to make the educational community aware of how research has defined acceptable practice in writing assessment, this annotated bibliography examines research about writing evaluation. Divided into five sections, the first section of the bibliography surveys some psychological and linguistic studies of the development of students' writing…

  3. Annotation and visualization of endogenous retroviral sequences using the Distributed Annotation System (DAS) and eBioX

    PubMed Central

    Martínez Barrio, Álvaro; Lagercrantz, Erik; Sperber, Göran O; Blomberg, Jonas; Bongcam-Rudloff, Erik

    2009-01-01

    Background The Distributed Annotation System (DAS) is a widely used network protocol for sharing biological information. The distributed aspects of the protocol enable the use of various reference and annotation servers for connecting biological sequence data to pertinent annotations in order to depict an integrated view of the data for the final user. Results An annotation server has been devised to provide information about the endogenous retroviruses detected and annotated by a specialized in silico tool called RetroTector. We describe the procedure to implement the DAS 1.5 protocol commands necessary for constructing the DAS annotation server. We use our server to exemplify those steps. Data distribution is kept separate from visualization, which is carried out by eBioX, an easy to use open source program incorporating multiple bioinformatics utilities. Some well characterized endogenous retroviruses are shown in two different DAS clients. A rapid analysis of areas free from retroviral insertions could be facilitated by our annotations. Conclusion The DAS protocol has been shown to be advantageous in the distribution of endogenous retrovirus data. The distributed nature of the protocol is also found to aid in combining annotation and visualization along a genome in order to enhance the understanding of ERV contribution to its evolution. Reference and annotation servers are conjointly used by eBioX to provide visualization of ERV annotations as well as other data sources. Our DAS data source can be found in the central public DAS service repository, , or at . PMID:19534743

  4. Corpus annotation for mining biomedical events from literature

    PubMed Central

    Kim, Jin-Dong; Ohta, Tomoko; Tsujii, Jun'ichi

    2008-01-01

    Background Advanced Text Mining (TM) such as semantic enrichment of papers, event or relation extraction, and intelligent Question Answering have increasingly attracted attention in the bio-medical domain. For such attempts to succeed, text annotation from the biological point of view is indispensable. However, due to the complexity of the task, semantic annotation has never been tried on a large scale, apart from relatively simple term annotation. Results We have completed a new type of semantic annotation, event annotation, which is an addition to the existing annotations in the GENIA corpus. The corpus has already been annotated with POS (Parts of Speech), syntactic trees, terms, etc. The new annotation was made on half of the GENIA corpus, consisting of 1,000 Medline abstracts. It contains 9,372 sentences in which 36,114 events are identified. The major challenges during event annotation were (1) to design a scheme of annotation which meets specific requirements of text annotation, (2) to achieve biology-oriented annotation which reflects biologists' interpretation of text, and (3) to ensure the homogeneity of annotation quality across annotators. To meet these challenges, we introduced new concepts such as Single-facet Annotation and Semantic Typing, which have collectively contributed to successful completion of a large scale annotation. Conclusion The resulting event-annotated corpus is the largest and one of the best in quality among similar annotation efforts. We expect it to become a valuable resource for NLP (Natural Language Processing)-based TM in the bio-medical domain. PMID:18182099

  5. Genome-wide Annotation, Identification, and Global Transcriptomic Analysis of Regulatory or Small RNA Gene Expression in Staphylococcus aureus

    PubMed Central

    Weiss, Andy; Broach, William H.; Wiemels, Richard E.; Mogen, Austin B.; Rice, Kelly C.

    2016-01-01

    In Staphylococcus aureus, hundreds of small regulatory or small RNAs (sRNAs) have been identified, yet this class of molecule remains poorly understood and severely understudied. sRNA genes are typically absent from genome annotation files, and as a consequence, their existence is often overlooked, particularly in global transcriptomic studies. To facilitate improved detection and analysis of sRNAs in S. aureus, we generated updated GenBank files for three commonly used S. aureus strains (MRSA252, NCTC 8325, and USA300), in which we added annotations for >260 previously identified sRNAs. These files, the first to include genome-wide annotation of sRNAs in S. aureus, were then used as a foundation to identify novel sRNAs in the community-associated methicillin-resistant strain USA300. This analysis led to the discovery of 39 previously unidentified sRNAs. Investigating the genomic loci of the newly identified sRNAs revealed a surprising degree of inconsistency in genome annotation in S. aureus, which may be hindering the analysis and functional exploration of these elements. Finally, using our newly created annotation files as a reference, we perform a global analysis of sRNA gene expression in S. aureus and demonstrate that the newly identified tsr25 is the most highly upregulated sRNA in human serum. This study provides an invaluable resource to the S. aureus research community in the form of our newly generated annotation files, while at the same time presenting the first examination of differential sRNA expression in pathophysiologically relevant conditions. PMID:26861020

  6. Family Support Services. A Review of the Literature and Selected Annotated Bibliography.

    ERIC Educational Resources Information Center

    Wolcott, Ilene

    This document contains a literature review and annotated bibliography on family support services in Australia and overseas. Literature relating to services for families with dependent adolescent children as well as young children is included. The review and bibliography concentrate primarily on community-based services defined in the literature as…

  7. Annotated Bibliography of Alcohol, Other Drug, and Violence Prevention Resources, 2006-2008

    ERIC Educational Resources Information Center

    Segars, Lance, Ed.; Akinola, Olayinka, Ed.

    2009-01-01

    The U.S. Department of Education's Higher Education Center for Alcohol and Other Drug Abuse and Violence Prevention has developed this annotated bibliography to provide those interested in prevention at colleges and universities--and in surrounding communities--with a ready reference of current, important, and available information resources.…

  8. Selected Annotated Bibliography of Recent Research on Rural Life on Prince Edward Island. Community Studies, Report No. 1.

    ERIC Educational Resources Information Center

    MacDonald, Allan F.; O'Connell, Harold J.

    A review of research literature was the first step in a program of rural development and planning on Prince Edward Island. This bibliography containing 80 annotations of extended research reports from 1960-71 is the result of that search. The bibliography is divided into 4 main subject areas within which the annotations appear in alphabetical…

  9. “Straight from the heavens into your bucket”: domestic rainwater harvesting as a measure to improve water security in a subarctic indigenous community

    PubMed Central

    Mercer, Nicholas; Hanrahan, Maura

    2017-01-01

    Background: Black Tickle-Domino is an extremely water-insecure remote Inuit community in the Canadian subarctic that lacks piped water. Drinking water consumption in the community is less than a third of the Canadian national average. Water insecurity in the community contributes to adverse health, economic, and social effects and requires urgent action. Objectives: To test the ability of domestic rainwater harvesting (DRWH) for the first time in the subarctic with the goal of improving water access and use in the community. Design: This project utilised quantitative weekly reporting of water collection and use, as well as focus group discussions. DRWH units were installed at seven water-insecure households chosen by the local government. Results were measured over a 6-week period in 2016. Results: Participants harvested 19.07 gallons of rainwater per week. General purpose water consumption increased by 17% and water retrieval efforts declined by 40.92%. Households saved $12.70 CDN per week. Participants reported perceived improvements to psychological health. Because no potable water was collected, drinking water consumption did not increase. The study identified additional water-insecurity impacts. Conclusion: DRWH cannot supply drinking water without proper treatment and filtration; however, it can be a partial remedy to water insecurity in the subarctic. DRWH is appropriately scaled, inexpensive, and participants identified several significant benefits. PMID:28422581

  10. SigReannot-mart: a query environment for expression microarray probe re-annotations.

    PubMed

    Moreews, François; Rauffet, Gaelle; Dehais, Patrice; Klopp, Christophe

    2011-01-01

    Expression microarrays are commonly used to study transcriptomes. Most of the arrays are now based on oligo-nucleotide probes. Probe design being a tedious task, it often takes place once at the beginning of the project. The oligo set is then used for several years. During this time period, the knowledge gathered by the community on the genome and the transcriptome increases and gets more precise. Therefore re-annotating the set is essential to supply the biologists with up-to-date annotations. SigReannot-mart is a query environment populated with regularly updated annotations for different oligo sets. It stores the results of the SigReannot pipeline that has mainly been used on farm and aquaculture species. It permits easy extraction in different formats using filters. It is used to compare probe sets on different criteria, to choose the set for a given experiment, and to mix probe sets in order to create a new one.

  11. Morphosyntactic annotation of CHILDES transcripts

    PubMed Central

    Sagae, Kenji; Davis, Eric; Lavie, Alon; MacWhinney, Brian; Wintner, Shuly

    2014-01-01

    Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database with grammatical relations in the form of labeled dependency structures. We have produced a corpus of over 18,800 utterances (approximately 65,000 words) with manually curated gold-standard grammatical relation annotations. Using this corpus, we have developed a highly accurate data-driven parser for the English CHILDES data, which we used to automatically annotate the remainder of the English section of CHILDES. We have also extended the parser to Spanish, and are currently working on supporting more languages. The parser and the manually and automatically annotated data are freely available for research purposes. PMID:20334720

  12. SABER: The Searchable Annotated Bibliography of Education Research in Astronomy

    NASA Astrophysics Data System (ADS)

    Bruning, David; Bailey, Janelle M.; Brissenden, Gina

    Starting a new research project can be a challenge, but especially so in education research because the literature is scattered throughout many journals. Relevant astronomy education research may be in psychology journals, science education journals, physics education journals, or even in science journals. Tracking the vast realm of literature is difficult, especially because libraries frequently do not subscribe to many of the relevant journals and abstracting services. The Searchable Annotated Bibliography of Education Research (SABER) is an online resource that was started to service the needs of the astronomy education community, specifically to reduce this "scatter" by compiling an annotated bibliography of education research articles in one electronic location. Although SABER started in 2001, the database has a new URL (http://astronomy.uwp.edu/saber/) and has recently undergone a major update.

  13. Assisted annotation of medical free text using RapTAT

    PubMed Central

    Gobbel, Glenn T; Garvin, Jennifer; Reeves, Ruth; Cronin, Robert M; Heavirland, Julia; Williams, Jenifer; Weaver, Allison; Jayaramaraja, Shrimalini; Giuse, Dario; Speroff, Theodore; Brown, Steven H; Xu, Hua; Matheny, Michael E

    2014-01-01

    Objective: To determine whether assisted annotation using interactive training can reduce the time required to annotate a clinical document corpus without introducing bias. Materials and methods: A tool, RapTAT, was designed to assist annotation by iteratively pre-annotating probable phrases of interest within a document, presenting the annotations to a reviewer for correction, and then using the corrected annotations for further machine learning-based training before pre-annotating subsequent documents. Annotators reviewed 404 clinical notes either manually or using RapTAT assistance for concepts related to quality of care during heart failure treatment. Notes were divided into 20 batches of 19–21 documents for iterative annotation and training. Results: The number of correct RapTAT pre-annotations increased significantly and annotation time per batch decreased by ∼50% over the course of annotation. Annotation rate increased from batch to batch for assisted but not manual reviewers. Pre-annotation F-measure increased from 0.5–0.6 to >0.80 (relative to both assisted reviewer and reference annotations) over the first three batches and more slowly thereafter. Overall inter-annotator agreement was significantly higher between RapTAT-assisted reviewers (0.89) than between manual reviewers (0.85). Discussion: The tool reduced workload by decreasing the number of annotations needing to be added and helping reviewers to annotate at an increased rate. Agreement between the pre-annotations and reference standard, and agreement between the pre-annotations and assisted annotations, were similar throughout the annotation process, which suggests that pre-annotation did not introduce bias. Conclusions: Pre-annotations generated by a tool capable of interactive training can reduce the time required to create an annotated document corpus by up to 50%. PMID:24431336
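
    The F-measure reported above compares a set of pre-annotations against a reference set. The sketch below shows one common way to compute it, assuming exact matching on span offsets and concept label; the abstract does not say which matching rule RapTAT's evaluation used, and the spans here are hypothetical, not data from the study.

```python
from typing import Set, Tuple

# (start offset, end offset, concept label); the labels are hypothetical.
Span = Tuple[int, int, str]


def f_measure(predicted: Set[Span], reference: Set[Span]) -> float:
    """Balanced F-measure (F1) under exact span-and-label matching."""
    true_positives = len(predicted & reference)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(reference)
    return 2 * precision * recall / (precision + recall)


# Hypothetical annotations for one note.
reference = {
    (0, 13, "EF"),
    (20, 31, "medication"),
    (40, 52, "medication"),
    (60, 70, "EF"),
}
pre_annotated = {
    (0, 13, "EF"),
    (20, 31, "medication"),
    (41, 52, "medication"),  # boundary differs by one character: no match
}

score = f_measure(pre_annotated, reference)
print(round(score, 3))
```

    Exact matching is the strictest choice; a looser overlap-based rule would credit the third pre-annotation and give a higher score.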

  14. Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator

    NASA Astrophysics Data System (ADS)

    Seyed, P.; Chastain, K.; McGuinness, D. L.

    2013-12-01

    Use of Semantic Web technologies for data management in the Earth sciences (and beyond) has great potential but is still in its early stages, since the challenges of translating data into a more explicit or semantic form for immediate use within applications have not been fully addressed. In this abstract we help address this challenge by introducing the SemantEco Annotator, which enables anyone, regardless of expertise, to semantically annotate tabular Earth Science data and translate it into linked data format, while applying the logic inherent in community-standard vocabularies to guide the process. The Annotator was conceived out of a desire to unify dataset content from a variety of sources under common vocabularies, for use in semantically-enabled web applications. Our current use case employs linked data generated by the Annotator for use in the SemantEco environment, which utilizes semantics to help users explore, search, and visualize water or air quality measurement and species occurrence data through a map-based interface. The generated data can also be used immediately to facilitate discovery and search capabilities within 'big data' environments. The Annotator provides a method for taking information about a dataset that may only be known to its maintainers and making it explicit, in a uniform and machine-readable fashion, such that a person or information system can more easily interpret the underlying structure and meaning. Its primary mechanism is to enable a user to formally describe how columns of a tabular dataset relate and/or describe entities. For example, if a user identifies columns for latitude and longitude coordinates, we can infer the data refers to a point that can be plotted on a map. Further, it can be made explicit that measurements of 'nitrate' and 'NO3-' are of the same entity through vocabulary assignments, thus more easily utilizing data sets that use different nomenclatures. The Annotator provides an extensive and searchable
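
    The two inferences described in this abstract (latitude plus longitude columns imply a plottable point; 'nitrate' and 'NO3-' name the same entity) can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the `ex:` terms, the `VOCABULARY` table and both function names are hypothetical and do not come from the SemantEco Annotator.

```python
# Hypothetical vocabulary: maps source-specific column names to shared terms.
VOCABULARY = {
    "nitrate": "ex:Nitrate",
    "NO3-": "ex:Nitrate",
    "lat": "ex:latitude",
    "latitude": "ex:latitude",
    "lon": "ex:longitude",
    "longitude": "ex:longitude",
}


def annotate_columns(header):
    """Return {column name: vocabulary term} for columns the vocabulary knows."""
    return {col: VOCABULARY[col] for col in header if col in VOCABULARY}


def describes_point(annotations):
    """A table with both latitude and longitude columns can be mapped as points."""
    terms = set(annotations.values())
    return {"ex:latitude", "ex:longitude"} <= terms


# Two hypothetical datasets that use different nomenclatures.
table_a = annotate_columns(["site", "lat", "lon", "nitrate"])
table_b = annotate_columns(["station", "latitude", "longitude", "NO3-"])

same_measurement = table_a["nitrate"] == table_b["NO3-"]
print(describes_point(table_a), same_measurement)
```

    Once both tables carry the same term for the measured entity, a query for that term finds rows in either dataset regardless of the original column name.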

  15. Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission.

    PubMed

    Geib, Scott M; Hall, Brian; Derego, Theodore; Bremer, Forest T; Cannoles, Kyle; Sim, Sheina B

    2018-04-01

    One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI's annotation table format, these tools typically require back-end setup and connection to a Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline. The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI.
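
    For readers unfamiliar with the .tbl format mentioned above, the sketch below writes a minimal NCBI five-column feature table: a `>Feature` header line, tab-separated start/stop/feature-key lines, and qualifier lines indented by three tabs. It is not the Genome Annotation Generator's code; the function name, sequence ID, locus tag and coordinates are hypothetical, and it ignores multi-interval features, partial features and the other requirements of a real submission.

```python
def feature_table(seq_id, features):
    """Render features as NCBI five-column feature table text.

    features: list of (start, stop, feature_key, {qualifier: value}).
    Only single-interval features are handled in this sketch.
    """
    lines = [f">Feature {seq_id}"]
    for start, stop, key, qualifiers in features:
        # Columns 1-3: start, stop, feature key.
        lines.append(f"{start}\t{stop}\t{key}")
        for name, value in qualifiers.items():
            # Columns 4-5: qualifier name and value, after three empty columns.
            lines.append(f"\t\t\t{name}\t{value}")
    return "\n".join(lines) + "\n"


# Hypothetical gene and CDS on a hypothetical contig.
tbl = feature_table(
    "contig_1",
    [
        (100, 1200, "gene", {"locus_tag": "EXAMPLE_0001"}),
        (100, 1200, "CDS", {"product": "hypothetical protein"}),
    ],
)
print(tbl)
```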

  16. Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission

    PubMed Central

    Hall, Brian; Derego, Theodore; Bremer, Forest T; Cannoles, Kyle

    2018-01-01

    Background: One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI’s annotation table format, these tools typically require back-end setup and connection to a Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. Findings: The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline. Conclusions: The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI. PMID:29635297

  17. Rainwater harvesting in the United States: a survey of common system practices

    EPA Science Inventory

    Rainwater harvesting (RWH) systems in the United States vary in terms of design and operation. To better understand common practices in the RWH community and motivation for collecting harvested rainwater, an electronic survey was used to poll members of the American Rainwater Cat...

  18. MitoFish and MitoAnnotator: A Mitochondrial Genome Database of Fish with an Accurate and Automatic Annotation Pipeline

    PubMed Central

    Iwasaki, Wataru; Fukunaga, Tsukasa; Isagozawa, Ryota; Yamada, Koichiro; Maeda, Yasunobu; Satoh, Takashi P.; Sado, Tetsuya; Mabuchi, Kohji; Takeshima, Hirohiko; Miya, Masaki; Nishida, Mutsumi

    2013-01-01

    MitoFish is a database of fish mitochondrial genomes (mitogenomes) that includes powerful and precise de novo annotations for mitogenome sequences. Fish occupy an important position in the evolution of vertebrates and the ecology of the hydrosphere, and mitogenomic sequence data have served as a rich source of information for resolving fish phylogenies and identifying new fish species. The importance of a mitogenomic database continues to grow at a rapid pace as massive amounts of mitogenomic data are generated with the advent of new sequencing technologies. A severe bottleneck seems likely to occur with regard to mitogenome annotation because of the overwhelming pace of data accumulation and the intrinsic difficulties in annotating sequences with degenerating transfer RNA structures, divergent start/stop codons of the coding elements, and the overlapping of adjacent elements. To ease this data backlog, we developed an annotation pipeline named MitoAnnotator. MitoAnnotator automatically annotates a fish mitogenome with a high degree of accuracy in approximately 5 min; thus, it is readily applicable to data sets of dozens of sequences. MitoFish also contains re-annotations of previously sequenced fish mitogenomes, enabling researchers to refer to them when they find annotations that are likely to be erroneous or while conducting comparative mitogenomic analyses. For users who need more information on the taxonomy, habitats, phenotypes, or life cycles of fish, MitoFish provides links to related databases. MitoFish and MitoAnnotator are freely available at http://mitofish.aori.u-tokyo.ac.jp/ (last accessed August 28, 2013); all of the data can be batch downloaded, and the annotation pipeline can be used via a web interface. PMID:23955518

  19. Rainwater Harvesting-based Safe Water Access in Diarrhea-endemic Coastal Communities of Bangladesh under Threats of Climate Change

    NASA Astrophysics Data System (ADS)

    Akanda, A. S.; Redwan, A. M.; Ali, M. A.; Alam, M.; Jutla, A.; Colwell, R. R.

    2014-12-01

    The highly populated coastal floodplains of the Bengal Delta have a long history of water-related natural calamities such as droughts, floods, and cyclones. Population centers along the floodplain corridors of the GBM (Ganges-Brahmaputra-Meghna) river system remain vulnerable to such natural hazards and waterborne epidemic outbreaks due to increasing intensity and changing frequency of extreme events over many areas in the delta region. Such changes in hydrologic extremes and resulting environmental conditions would likely lengthen the transmission seasons of prevalent waterborne diseases and alter their geographic range as well as seasonality. In addition, the combination of changing upstream precipitation and temperature and coastal sea-level rise is exposing a vast area in Southwestern Bangladesh to increased diarrheal disease outbreaks due to higher salinity and water scarcity in the dry season as well as coastal flooding and water resources contamination in the wet season. It is thus essential to establish sustainable safe water access practices in these regions for the rural communities of low-income people. The impact of climate change in the recent past on the people of coastal rural areas of Bangladesh has been severe, and the water sector is one of its biggest victims. Previously, pond and groundwater sources were considered dependable, but salinity intrusion in both water resources has left the vulnerable people with only a few scarce ponds and forced them to depend more on rainwater than before. The poorest group suffers most from this crisis, even though they pay a larger percentage of their income, especially in the dry season (December-March). As rainwater is their most preferred and dependable option during this part of the year, outbreaks of waterborne diseases can be minimized by installing rainwater harvesting systems with effective disinfection systems at both household and community levels. In this study, we explore the technical

  20. Prokaryotic Contig Annotation Pipeline Server: Web Application for a Prokaryotic Genome Annotation Pipeline Based on the Shiny App Package.

    PubMed

    Park, Byeonghyeok; Baek, Min-Jeong; Min, Byoungnam; Choi, In-Geol

    2017-09-01

    Genome annotation is a primary step in genomic research. To establish a light and portable prokaryotic genome annotation pipeline for use in individual laboratories, we developed a Shiny app package designated as "P-CAPS" (Prokaryotic Contig Annotation Pipeline Server). The package is composed of R and Python scripts that integrate publicly available annotation programs into a server application. P-CAPS is not only a browser-based interactive application but also a distributable Shiny app package that can be installed on any personal computer. The final annotation is provided in various standard formats and is summarized in an R markdown document. Annotation can be visualized and examined with a public genome browser. A benchmark test showed that the annotation quality and completeness of P-CAPS were reliable and compatible with those of currently available public pipelines.

  1. FIGENIX: Intelligent automation of genomic annotation: expertise integration in a new software platform

    PubMed Central

    Gouret, Philippe; Vitiello, Vérane; Balandraud, Nathalie; Gilles, André; Pontarotti, Pierre; Danchin, Etienne GJ

    2005-01-01

    Background: Two of the main objectives of the genomic and post-genomic era are to structurally and functionally annotate genomes, which consists of detecting the position and structure of genes and inferring their function (as well as that of other genomic features). Structural and functional annotation both require the complex chaining of numerous different software tools, algorithms and methods under the supervision of a biologist. The automation of these pipelines is necessary to manage the huge amounts of data released by sequencing projects. Several pipelines already automate some of this complex chaining but still require substantial input from biologists to supervise and control the results at various steps. Results: Here we propose an innovative automated platform, FIGENIX, which includes an expert system capable of substituting for human expertise at several key steps. FIGENIX currently automates complex pipelines of structural and functional annotation under the supervision of the expert system (which can, for example, make key decisions, check intermediate results or refine the dataset). The quality of the results produced by FIGENIX is comparable to that obtained by expert biologists, with a drastic gain in terms of time costs and avoidance of errors due to the human manipulation of data. Conclusion: The core engine and expert system of the FIGENIX platform currently handle complex annotation processes of broad interest for the genomic community. They could be easily adapted to new or more specialized pipelines, such as the annotation of miRNAs, the classification of complex multigenic families, and the annotation of regulatory elements and other genomic features of interest. PMID:16083500

  2. Assessment of the performance of water harvesting systems in semi-arid regions

    NASA Astrophysics Data System (ADS)

    Lasage, Ralph

    2016-04-01

    Water harvesting is widely practiced and has the potential to improve water availability for domestic and agricultural use in semi-arid regions. New funds are becoming available to stimulate the implementation of water harvesting projects, for meeting the Sustainable Development Goals and to help communities to adapt to climate change. For this, it is important to understand which factors determine the success of water harvesting techniques under different conditions. To this end, we review the literature, including information on the crop yield impacts of water harvesting projects in semi-arid Africa and Asia. Results show that large water harvesting structures (> 500 m³) are less expensive than small structures, when taking into account investment costs, storage capacity and lifetimes. We also find that water harvesting improves crop yields significantly, and that the relative impact of water harvesting on crop yields is largest in low rainfall years. We also see that the governance, technical knowledge and initial investment are more demanding for the larger structures than for smaller structures, which may affect their spontaneous adoption and long term sustainability when managed by local communities. To support the selection of appropriate techniques, we present a decision framework based on case specific characteristics. This framework can also be used when reporting and evaluating the performance of water harvesting techniques, which is up to now quite limited in peer reviewed literature. Based on Bouma, J., Hegde, S.E., Lasage, R., (2016). Assessing the returns to water harvesting: A meta-analysis. Agricultural Water Management 163, 100-109. Lasage, R., Verburg P.H., (2015). Evaluation of small scale water harvesting techniques for semi-arid environments. Journal of Arid Environments 118, 48-57.

  3. Harvests from bone marrow donors who weigh less than their recipients are associated with a significantly increased probability of a suboptimal harvest yield.

    PubMed

    Anthias, Chloe; Billen, Annelies; Arkwright, Rebecca; Szydlo, Richard M; Madrigal, J Alejandro; Shaw, Bronwen E

    2016-05-01

    Previous studies have demonstrated the importance of bone marrow (BM) harvest yield in determining transplant outcomes, but little is known regarding donor and procedure variables associated with achievement of an optimal yield. We hypothesized that donor demographics and variables relating to the procedure were likely to impact the yield (total nucleated cells [TNCs]/kg recipient weight) and quality (TNCs/mL) of the harvest. To test our hypothesis, BM harvests of 110 consecutive unrelated donors were evaluated. The relationship between donor or procedure characteristics and the BM harvest yield was examined. The relationship between donor and recipient weight significantly influenced the harvest yield; only 14% of BM harvests from donors who weighed less than their recipient achieved a TNC count of more than 4 × 10^8/kg compared to 56% of harvests from donors heavier than their recipient (p = 0.001). Higher-volume harvests were significantly less likely to achieve an optimal yield than lower-volume harvests (32% vs. 78%; p = 0.007), and higher-volume harvests contained significantly fewer TNCs/mL, indicating peripheral blood contamination. BM harvest quality also varied significantly between collection centers, adding to recent concerns regarding maintenance of BM harvest expertise within the transplant community. Since the relationship between donor and recipient weight has a critical influence on yield, we recommend prioritizing this secondary donor characteristic when selecting from multiple well-matched donors. Given the declining number of requests for BM harvests, it is crucial that systems are developed to train operators and ensure expertise in this procedure is retained. © 2016 AABB.

  4. Active learning reduces annotation time for clinical concept extraction.

    PubMed

    Kholghi, Mahnoosh; Sitbon, Laurianne; Zuccon, Guido; Nguyen, Anthony

    2017-10-01

    To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation. There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models. When annotating from scratch, active learning reduced the annotation time by up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in a further 20% reduction of the annotation time when compared to de novo annotation. The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches, as demonstrated by the high correlation between time rate and concept annotation rate. Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations. Copyright © 2017 Elsevier B.V. All rights reserved.
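
    A query strategy decides which unlabeled sequences are sent to the annotator next. The abstract does not name the strategies it compared, so the sketch below shows least-confidence sampling, one widely used strategy, purely as an illustration; the function name and the model outputs are hypothetical.

```python
def least_confident(probabilities, batch_size):
    """Return indices of the items whose top predicted label is least certain.

    probabilities: one list of class probabilities per unlabeled item.
    """
    confidence = [max(p) for p in probabilities]
    ranked = sorted(range(len(confidence)), key=lambda i: confidence[i])
    return ranked[:batch_size]


# Hypothetical model outputs for five unlabeled sequences.
model_output = [
    [0.98, 0.01, 0.01],
    [0.40, 0.35, 0.25],
    [0.70, 0.20, 0.10],
    [0.34, 0.33, 0.33],
    [0.90, 0.05, 0.05],
]

to_annotate = least_confident(model_output, batch_size=2)
print(to_annotate)
```

    A random sampling baseline, by contrast, would pick indices without looking at the model's output at all.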

  5. Monsoon Harvests: Assessing the Impact of Rainwater Harvesting Ponds on Subsistence-Level Agriculture in the Gundar Basin, Tamil Nadu, India

    NASA Astrophysics Data System (ADS)

    Steiff, M.; Van Meter, K. J.; Basu, N. B.

    2013-12-01

    Lack of consistent water availability for irrigated agriculture is recognized as one of the primary constraints to meeting the UN Millennium Development Goals to alleviate hunger, and in semi-arid landscapes such as those of southern India, which are characterized by high intra-annual variability in rainfall, provision of capabilities for seasonal storage is recognized to be one of the key strategies towards alleviating water scarcity problems and ensuring food security. Although the issue of increased storage can be addressed by centralized infrastructure projects such as large-scale irrigation systems and dams, an alternative is the "soft path" approach, in which existing large-scale projects are complemented by small-scale, decentralized solutions. Such a decentralized approach has been utilized in southern India for thousands of years in the form of village rainwater harvesting tanks or ponds, providing a local and inherently sustainable approach to providing sufficient water for rice cultivation. Over the last century, however, large-scale canal projects and groundwater pumping have replaced rainwater harvesting as the primary source of irrigation water. But with groundwater withdrawals now exceeding recharge in many areas and water tables continuing to drop, many NGOs and government agencies are advocating for a revival of the older rainwater harvesting systems. Questions remain, however, regarding the limits to which rainwater harvesting can provide a solution to decades of water overexploitation. In the present work, we have utilized secondary data sources to analyze the linkages between the tank irrigation systems and the village communities that depend on them within the Gundar Basin of southern Tamil Nadu. Combining socioeconomic data with information regarding climate, land use, groundwater depletion, and tank density, we have developed indicators of sustainability for these systems. Using these indicators, we have attempted to unravel the close

  6. Retention of seed trees fails to lifeboat ectomycorrhizal fungal diversity in harvested Scots pine forests.

    PubMed

    Varenius, Kerstin; Lindahl, Björn D; Dahlberg, Anders

    2017-09-01

    Fennoscandian forestry has in the past decades changed from natural regeneration of forests towards replantation of clear-cuts, which negatively impacts ectomycorrhizal fungal (EMF) diversity. Retention of trees during harvesting enables EMF survival, and we therefore expected EMF communities to be more similar to those in old natural stands after forest regeneration using seed trees compared to full clear-cutting and replanting. We sequenced fungal internal transcribed spacer 2 (ITS2) amplicons to assess EMF communities in 10- to 60-year-old Scots pine stands regenerated either using seed trees or through replanting of clear-cuts with old natural stands as reference. We also investigated local EMF communities around retained old trees. We found that retention of seed trees failed to mitigate the impact of harvesting on EMF community composition and diversity. With increasing stand age, EMF communities became increasingly similar to those in old natural stands and permanently retained trees maintained EMF locally. From our observations, we conclude that EMF communities, at least common species, post-harvest are more influenced by environmental filtering, resulting from environmental changes induced by harvest, than by the continuity of trees. These results suggest that retention of intact forest patches is a more efficient way to conserve EMF diversity than retaining dispersed single trees. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Harvest-associated disturbance in upland Ozark forests of the Missouri Ozark Forest Ecosystem Project

    Treesearch

    Johann N. Bruhn; James J. Wetteroff; Jeanne D. Mihail; Randy G. Jensen; James B. Pickens

    2002-01-01

    The Missouri Ozark Forest Ecosystem Project (MOFEP) is a long-term, multidisciplinary, landscape-based research program studying effects of even-aged (EAM), uneven-aged (UAM), and no-harvest (NHM) management on forest communities. The first MOFEP timber harvests occurred from May through November 1996. Harvest-related disturbance occurred on 69 of 180 permanent 0.2-ha...

  8. Benthic meiofauna responses to five forest harvest methods

    Treesearch

    Freese Smith; Arthur V. Brown; Misty Pope; Jerry L. Michael

    2001-01-01

    Benthic meiofauna were collected from the pools of minute (0 order) streams in the Ouachita National Forest, Arkansas during March 21-23, 1996 to see if benthic communities responded to forest harvest methods in a similar manner as plankton communities collected two years prior. The study streams and their watersheds (2-6 ha) were located in 14-16 ha forest stands that...

  9. Genome-wide Annotation, Identification, and Global Transcriptomic Analysis of Regulatory or Small RNA Gene Expression in Staphylococcus aureus.

    PubMed

    Carroll, Ronan K; Weiss, Andy; Broach, William H; Wiemels, Richard E; Mogen, Austin B; Rice, Kelly C; Shaw, Lindsey N

    2016-02-09

    In Staphylococcus aureus, hundreds of small regulatory or small RNAs (sRNAs) have been identified, yet this class of molecule remains poorly understood and severely understudied. sRNA genes are typically absent from genome annotation files, and as a consequence, their existence is often overlooked, particularly in global transcriptomic studies. To facilitate improved detection and analysis of sRNAs in S. aureus, we generated updated GenBank files for three commonly used S. aureus strains (MRSA252, NCTC 8325, and USA300), in which we added annotations for >260 previously identified sRNAs. These files, the first to include genome-wide annotation of sRNAs in S. aureus, were then used as a foundation to identify novel sRNAs in the community-associated methicillin-resistant strain USA300. This analysis led to the discovery of 39 previously unidentified sRNAs. Investigating the genomic loci of the newly identified sRNAs revealed a surprising degree of inconsistency in genome annotation in S. aureus, which may be hindering the analysis and functional exploration of these elements. Finally, using our newly created annotation files as a reference, we perform a global analysis of sRNA gene expression in S. aureus and demonstrate that the newly identified tsr25 is the most highly upregulated sRNA in human serum. This study provides an invaluable resource to the S. aureus research community in the form of our newly generated annotation files, while at the same time presenting the first examination of differential sRNA expression in pathophysiologically relevant conditions. Despite a large number of studies identifying regulatory or small RNA (sRNA) genes in Staphylococcus aureus, their annotation is notably lacking in available genome files. In addition to this, there has been a considerable lack of cross-referencing in the wealth of studies identifying these elements, often leading to the same sRNA being identified multiple times and bearing multiple names. In this work

  10. Towards the VWO Annotation Service: a Success Story of the IMAGE RPI Expert Rating System

    NASA Astrophysics Data System (ADS)

    Reinisch, B. W.; Galkin, I. A.; Fung, S. F.; Benson, R. F.; Kozlov, A. V.; Khmyrov, G. M.; Garcia, L. N.

    2010-12-01

    Interpretation of Heliophysics wave data requires specialized knowledge of wave phenomena. Users of the virtual wave observatory (VWO) will greatly benefit from a data annotation service that will allow querying of data by phenomenon type, thus helping accomplish the VWO goal to make Heliophysics wave data searchable, understandable, and usable by the scientific community. Individual annotations can be sorted by phenomenon type and reduced into event lists (catalogs). However, in contrast to the event lists, annotation records allow a greater flexibility of collaborative management by more easily admitting operations of addition, revision, or deletion. They can therefore become the building blocks for an interactive Annotation Service with a suitable graphic user interface to the VWO middleware. The VWO Annotation Service vision is an interactive, collaborative sharing of domain expert knowledge with fellow scientists and students alike. An effective prototype of the VWO Annotation Service has been in operation at the University of Massachusetts Lowell since 2001. An expert rating system (ERS) was developed for annotating the IMAGE radio plasma imager (RPI) active sounding data containing 1.2 million plasmagrams. The RPI data analysts can use ERS to submit expert ratings of plasmagram features, such as the presence of echo traces resulting from RPI signals reflected from distant plasma structures. Since its inception in 2001, the RPI ERS has accumulated 7351 expert plasmagram ratings in 16 phenomenon categories, together with free-text descriptions and other metadata. In addition to human expert ratings, the system holds 225,125 ratings submitted by the CORPRAL data prospecting software that employs a model of the human pre-attentive vision to select images potentially containing interesting features. The annotation records proved to be instrumental in a number of investigations where manual data exploration would have been prohibitively tedious and expensive

  11. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    DOE PAGES

    Brettin, Thomas; Davis, James J.; Disz, Terry; ...

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  12. Sonic-boom research: Selected bibliography with annotation

    NASA Technical Reports Server (NTRS)

    Hubbard, H. H.; Maglieri, D. J.; Stephens, D. G.

    1986-01-01

    Citations of selected documents are included which represent the state of the art of technology in each of the following subject areas: prediction, measurement, and minimization of steady-flight sonic booms; prediction and measurement of accelerating-flight sonic booms; sonic-boom propagation; the effects of sonic booms on people, communities, structures, animals, birds, and terrain; and sonic-boom simulator technology. Documents are listed in chronological order in each section of the paper, with key documents and associated annotation listed first. The sources are given along with acquisition numbers, when available, to expedite the acquisition of copies of the documents.

  13. The center for expanded data annotation and retrieval

    PubMed Central

    Bean, Carol A; Cheung, Kei-Hoi; Dumontier, Michel; Durante, Kim A; Gevaert, Olivier; Gonzalez-Beltran, Alejandra; Khatri, Purvesh; Kleinstein, Steven H; O’Connor, Martin J; Pouliot, Yannick; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Wiser, Jeffrey A

    2015-01-01

    The Center for Expanded Data Annotation and Retrieval is studying the creation of comprehensive and expressive metadata for biomedical datasets to facilitate data discovery, data interpretation, and data reuse. We take advantage of emerging community-based standard templates for describing different kinds of biomedical datasets, and we investigate the use of computational techniques to help investigators to assemble templates and to fill in their values. We are creating a repository of metadata from which we plan to identify metadata patterns that will drive predictive data entry when filling in metadata templates. The metadata repository not only will capture annotations specified when experimental datasets are initially created, but also will incorporate links to the published literature, including secondary analyses and possible refinements or retractions of experimental interpretations. By working initially with the Human Immunology Project Consortium and the developers of the ImmPort data repository, we are developing and evaluating an end-to-end solution to the problems of metadata authoring and management that will generalize to other data-management environments. PMID:26112029

  14. Representing annotation compositionality and provenance for the Semantic Web

    PubMed Central

    2013-01-01

    Background Though the annotation of digital artifacts with metadata has a long history, the bulk of that work focuses on the association of single terms or concepts to single targets. As annotation efforts expand to capture more complex information, annotations will need to be able to refer to knowledge structures formally defined in terms of more atomic knowledge structures. Existing provenance efforts in the Semantic Web domain primarily focus on tracking provenance at the level of whole triples and do not provide enough detail to track how individual triple elements of annotations were derived from triple elements of other annotations. Results We present a task- and domain-independent ontological model for capturing annotations and their linkage to their denoted knowledge representations, which can be singular concepts or more complex sets of assertions. We have implemented this model as an extension of the Information Artifact Ontology in OWL and made it freely available, and we show how it can be integrated with several prominent annotation and provenance models. We present several application areas for the model, ranging from linguistic annotation of text to the annotation of disease-associations in genome sequences. Conclusions With this model, progressively more complex annotations can be composed from other annotations, and the provenance of compositional annotations can be represented at the annotation level or at the level of individual elements of the RDF triples composing the annotations. This in turn allows for progressively richer annotations to be constructed from previous annotation efforts, the precise provenance recording of which facilitates evidence-based inference and error tracking. PMID:24268021

  15. Annotations and the Collaborative Digital Library: Effects of an Aligned Annotation Interface on Student Argumentation and Reading Strategies

    ERIC Educational Resources Information Center

    Wolfe, Joanna

    2008-01-01

    Recent research on annotation interfaces provides provocative evidence that anchored, annotation-based discussion environments may lead to better conversations about a text. However, annotation interfaces raise complicated tradeoffs regarding screen real estate and positioning. It is argued that solving this screen real estate problem requires…

  16. Traditional uses of plants in a rural community of Mozambique and possible links with Miombo degradation and harvesting sustainability.

    PubMed

    Bruschi, Piero; Mancini, Matteo; Mattioli, Elisabetta; Morganti, Michela; Signorini, Maria Adele

    2014-07-23

    Miombo woodlands play an important role in the livelihood of people living in sub-equatorial African countries, contributing to satisfy basic human needs such as food, medicine, fuelwood and building materials. However, over-exploitation of plant resources and unsustainable harvest practices can potentially degrade forests. The aim of this study was to document the use of Miombo plant products, other than medicinal plants, in local communities, within a wider framework in which we discussed possible links between traditional uses and conservation status of the used species and of the whole Miombo environment. Fieldwork took place in four communities of Muda-Serração, central Mozambique. We conducted semi-structured interviews with 52 informants about their knowledge, use and harvesting practices of useful plants. A survey on local Miombo vegetation was also carried out in order to assess abundance and distribution of useful woody plants cited in the interviews in areas exposed to different exploitation rates. A Conservation Priority index was also applied to rank conservation values of each used woody species. Ninety-eight plants cited by the informants were botanically identified. The most relevant general category was represented by food plants (45 species), followed by handicraft plants (38 species) and domestic plants (37 species). Among the 54 woody species observed in vegetation plots, 52% were cited as useful in the interviews. Twenty-six woody species found in 'natural' Miombo areas were not found in 'degraded' ones: of these, 46% were cited in the interviews (58% in the food category, 50% in the handicraft category, 25% in the domestic category and 8% in the fishing category). Results of conservation ranking showed that 7 woody species deserve conservation priority in the investigated area. This study shows that the communities investigated rely heavily on local forest products for their daily subsistence requirements in food, firewood/charcoal and

  17. Assembly, Annotation, and Analysis of Multiple Mycorrhizal Fungal Genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Initiative Consortium, Mycorrhizal Genomics; Kuo, Alan; Grigoriev, Igor

    Mycorrhizal fungi play critical roles in host plant health, soil community structure and chemistry, and carbon and nutrient cycling, all areas of intense interest to the US Dept. of Energy (DOE) Joint Genome Institute (JGI). To this end we are building on our earlier sequencing of the Laccaria bicolor genome by partnering with INRA-Nancy and the mycorrhizal research community in the MGI to sequence and analyze dozens of mycorrhizal genomes of all Basidiomycota and Ascomycota orders and multiple ecological types (ericoid, orchid, and ectomycorrhizal). JGI has developed and deployed high-throughput sequencing techniques, and Assembly, RNASeq, and Annotation Pipelines. In 2012 alone we sequenced, assembled, and annotated 12 draft or improved genomes of mycorrhizae, and predicted ~232831 genes and ~15011 multigene families. All of this data is publicly available on JGI MycoCosm (http://jgi.doe.gov/fungi/), which provides access to both the genome data and tools with which to analyze the data. Preliminary comparisons of the current total of 14 public mycorrhizal genomes suggest that 1) short secreted proteins potentially involved in symbiosis are more enriched in some orders than in others amongst the mycorrhizal Agaricomycetes, 2) there are wide ranges of numbers of genes involved in certain functional categories, such as signal transduction and post-translational modification, and 3) novel gene families are specific to some ecological types.

  18. Effects of forest harvest on biogeochemical processes in the Caspar Creek watershed

    Treesearch

    Randy A. Dahlgren

    1998-01-01

    Water quality and long-term sustainability are major components addressed within the ecosystem approach to forest management. Forest harvest practices are often implicated as having adverse impacts on sensitive aquatic communities and on the long-term sustainability of forest ecosystems. While careless harvest practices can certainly cause adverse impacts, proper...

  19. Computer systems for annotation of single molecule fragments

    DOEpatents

    Schwartz, David Charles; Severin, Jessica

    2016-07-19

    There are provided computer systems for visualizing and annotating single molecule images. Annotation systems in accordance with this disclosure allow a user to mark and annotate single molecules of interest and their restriction enzyme cut sites thereby determining the restriction fragments of single nucleic acid molecules. The markings and annotations may be automatically generated by the system in certain embodiments and they may be overlaid translucently onto the single molecule images. An image caching system may be implemented in the computer annotation systems to reduce image processing time. The annotation systems include one or more connectors connecting to one or more databases capable of storing single molecule data as well as other biomedical data. Such diverse array of data can be retrieved and used to validate the markings and annotations. The annotation systems may be implemented and deployed over a computer network. They may be ergonomically optimized to facilitate user interactions.

  20. Genome re-annotation: a wiki solution?

    PubMed Central

    Salzberg, Steven L

    2007-01-01

    The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowledge of genomes and in part to improvements in bioinformatics software. Unfortunately, annotation is rarely if ever updated and resources to support routine reannotation are scarce. Wiki software, which would allow many scientists to edit each genome's annotation, offers one possible solution. PMID:17274839

  1. Wind turbine acoustics research bibliography with selected annotation

    NASA Technical Reports Server (NTRS)

    Hubbard, Harvey H.; Shepherd, Kevin P.

    1988-01-01

    Citations of documents are included, which represent the state-of-the-art of technology in each of the following acoustics subject areas: Prediction of Wind Turbine Noise; Acoustic Measurements for Wind Tunnels; Effect of Wind Turbine Noise on Building Structures, People and Communities; Atmospheric Propagation; and Measurement Technology Including Wind Screens. Documents are listed in chronological order in each section of the paper, with key documents and associated annotation listed first. The sources are given along with acquisition numbers, when available, to expedite the acquisition of copies of the documents.

  2. xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

    PubMed

    Duvick, Jon; Standage, Daniel S; Merchant, Nirav; Brendel, Volker P

    2016-04-01

    Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today's pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant's Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. © 2016 American Society of Plant Biologists. All rights reserved.

  3. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database.

    PubMed

    Carver, Tim; Berriman, Matthew; Tivey, Adrian; Patel, Chinmay; Böhme, Ulrike; Barrell, Barclay G; Parkhill, Julian; Rajandream, Marie-Adèle

    2008-12-01

    Artemis and Artemis Comparison Tool (ACT) have become mainstream tools for viewing and annotating sequence data, particularly for microbial genomes. Since its first release, Artemis has been continuously developed and supported with additional functionality for editing and analysing sequences based on feedback from an active user community of laboratory biologists and professional annotators. Nevertheless, its utility has been somewhat restricted by its limitation to reading and writing from flat files. Therefore, a new version of Artemis has been developed, which reads from and writes to a relational database schema, and allows users to annotate more complex, often large and fragmented, genome sequences. Artemis and ACT have now been extended to read and write directly to the Generic Model Organism Database (GMOD, http://www.gmod.org) Chado relational database schema. In addition, a Gene Builder tool has been developed to provide structured forms and tables to edit coordinates of gene models and edit functional annotation, based on standard ontologies, controlled vocabularies and free text. Artemis and ACT are freely available (under a GPL licence) for download (for MacOSX, UNIX and Windows) at the Wellcome Trust Sanger Institute web sites: http://www.sanger.ac.uk/Software/Artemis/ http://www.sanger.ac.uk/Software/ACT/

  4. BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

    PubMed

    Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

    2015-08-18

    Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  5. JGI Plant Genomics Gene Annotation Pipeline

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shu, Shengqiang; Rokhsar, Dan; Goodstein, David

    2014-07-14

    Plant genomes vary in size and are highly complex, with a high proportion of repeats, genome duplication and tandem duplication. Genes encode a wealth of information useful in studying organisms, and it is critical to have high-quality and stable gene annotation. Thanks to advances in sequencing technology, the genomes and transcriptomes of many plant species have been sequenced. To use these vast amounts of sequence data for gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. The JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim, with the aid of an RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.

  6. Resources for Community Organizing.

    ERIC Educational Resources Information Center

    Valadez, Cristina, Comp.

    This document is composed of two parts: a bibliography of community organizing and support materials and a directory of community organizing resource centers. The 25 bibliographic entries are grouped according to subject, and include author, title, publication date, publisher, number of pages, annotation, and ordering information. Subjects…

  7. NCBI prokaryotic genome annotation pipeline.

    PubMed

    Tatusova, Tatiana; DiCuccio, Michael; Badretdin, Azat; Chetvernin, Vyacheslav; Nawrocki, Eric P; Zaslavsky, Leonid; Lomsadze, Alexandre; Pruitt, Kim D; Borodovsky, Mark; Ostell, James

    2016-08-19

    Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  8. Dictionary-driven protein annotation

    PubMed Central

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-01-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  9. Dictionary-driven protein annotation.

    PubMed

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-09-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  10. Annotated Bibliography on Community Integration. Revised.

    ERIC Educational Resources Information Center

    Shoultz, Bonnie, Ed.

    This abstract bibliography lists approximately 365 selected resources (published from 1972 through 1990) for promoting the participation of people with developmental and other disabilities in all aspects of community life. The bibliography concentrates more heavily on books, monographs, and unpublished and publicly available documents than on…

  11. A Guide to Orientation Materials for Indochinese Refugees and Their Sponsors. A Selected, Annotated Bibliography.

    ERIC Educational Resources Information Center

    Center for Applied Linguistics, Washington, DC. Language and Orientation Resource Center.

    This is an annotated bibliography of orientation materials for Indochinese refugees and their sponsors. The materials have been grouped under fourteen headings: community services, consumer education, culture, education, employment, family planning and child care, finances, health, housing, legal problems, nutrition, sponsorship and resettlement,…

  12. Functional annotation of regulatory pathways.

    PubMed

    Pandey, Jayesh; Koyutürk, Mehmet; Kim, Yohan; Szpankowski, Wojciech; Subramaniam, Shankar; Grama, Ananth

    2007-07-01

    Standardized annotations of biomolecules in interaction networks (e.g. Gene Ontology) provide comprehensive understanding of the function of individual molecules. Extending such annotations to pathways is a critical component of functional characterization of cellular signaling at the systems level. We propose a framework for projecting gene regulatory networks onto the space of functional attributes using multigraph models, with the objective of deriving statistically significant pathway annotations. We first demonstrate that annotations of pairwise interactions do not generalize to indirect relationships between processes. Motivated by this result, we formalize the problem of identifying statistically overrepresented pathways of functional attributes. We establish the hardness of this problem by demonstrating the non-monotonicity of common statistical significance measures. We propose a statistical model that emphasizes the modularity of a pathway, evaluating its significance based on the coupling of its building blocks. We complement the statistical model by an efficient algorithm and software, Narada, for computing significant pathways in large regulatory networks. Comprehensive results from our methods applied to the Escherichia coli transcription network demonstrate that our approach is effective in identifying known, as well as novel biological pathway annotations. Narada is implemented in Java and is available at http://www.cs.purdue.edu/homes/jpandey/narada/.

  13. Energy Harvesting Research: The Road from Single Source to Multisource.

    PubMed

    Bai, Yang; Jantunen, Heli; Juuti, Jari

    2018-06-07

    Energy harvesting technology may be considered an ultimate solution to replace batteries and provide a long-term power supply for wireless sensor networks. Looking back into its research history, individual energy harvesters for the conversion of single energy sources into electricity are developed first, followed by hybrid counterparts designed for use with multiple energy sources. Very recently, the concept of a truly multisource energy harvester built from only a single piece of material as the energy conversion component is proposed. This review, from the aspect of materials and device configurations, explains in detail a wide scope to give an overview of energy harvesting research. It covers single-source devices including solar, thermal, kinetic and other types of energy harvesters, hybrid energy harvesting configurations for both single and multiple energy sources and single material, and multisource energy harvesters. It also includes the energy conversion principles of photovoltaic, electromagnetic, piezoelectric, triboelectric, electrostatic, electrostrictive, thermoelectric, pyroelectric, magnetostrictive, and dielectric devices. This is one of the most comprehensive reviews conducted to date, focusing on the entire energy harvesting research scene and providing a guide to seeking deeper and more specific research references and resources from every corner of the scientific community. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Qcorp: an annotated classification corpus of Chinese health questions.

    PubMed

    Guo, Haihong; Na, Xu; Li, Jiao

    2018-03-22

    Health question-answering (QA) systems have become a typical application scenario of Artificial Intelligence (AI). An annotated question corpus is a prerequisite for training machines to understand the health information needs of users. Thus, we aimed to develop an annotated classification corpus of Chinese health questions (Qcorp) and make it openly accessible. We developed a two-layered classification schema and corresponding annotation rules on the basis of our previous work. Using the schema, we annotated 5000 questions that were randomly selected from 5 Chinese health websites within 6 broad sections. 8 annotators participated in the annotation task, and the inter-annotator agreement was evaluated to ensure the corpus quality. Furthermore, the distribution and relationship of the annotated tags were measured by descriptive statistics and social network map. The questions were annotated using 7101 tags that cover 29 topic categories in the two-layered schema. In our released corpus, the distribution of questions across the top-layer categories was: treatment, 64.22%; diagnosis, 37.14%; epidemiology, 14.96%; healthy lifestyle, 10.38%; and health provider choice, 4.54%. Both the annotated health questions and the annotation schema are openly accessible on the Qcorp website. Users can download the annotated Chinese questions in CSV, XML, and HTML formats. We developed a Chinese health question corpus including 5000 manually annotated questions. It is openly accessible and would contribute to the intelligent health QA system development.
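
    A note on the Qcorp figures above: the five top-layer percentages add up to more than 100%, which would be consistent with individual questions carrying more than one tag (7101 tags across 5000 questions). The sketch below only reproduces that arithmetic from the numbers quoted in the abstract; the variable names are illustrative, and the multi-label reading is an inference rather than a statement from the authors.

```python
# Figures quoted in the Qcorp abstract; nothing here comes from the corpus itself.
questions = 5000
tags = 7101
top_layer_pct = {
    "treatment": 64.22,
    "diagnosis": 37.14,
    "epidemiology": 14.96,
    "healthy lifestyle": 10.38,
    "health provider choice": 4.54,
}

# Sum of the reported top-layer shares, rounded to two decimals.
total_pct = round(sum(top_layer_pct.values()), 2)

# Average number of tags attached to each question.
tags_per_question = round(tags / questions, 2)

print(f"sum of top-layer shares: {total_pct}%")
print(f"average tags per question: {tags_per_question}")
```

    A sum above 100% together with an average of more than one tag per question suggests the categories are not mutually exclusive, so the shares should not be read as a partition of the 5000 questions.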

  15. The effectiveness of annotated (vs. non-annotated) digital pathology slides as a teaching tool during dermatology and pathology residencies.

    PubMed

    Marsch, Amanda F; Espiritu, Baltazar; Groth, John; Hutchens, Kelli A

    2014-06-01

    With today's technology, paraffin-embedded, hematoxylin & eosin-stained pathology slides can be scanned to generate high quality virtual slides. Using proprietary software, digital images can also be annotated with arrows, circles and boxes to highlight certain diagnostic features. Previous studies assessing digital microscopy as a teaching tool did not involve the annotation of digital images. The objective of this study was to compare the effectiveness of annotated digital pathology slides versus non-annotated digital pathology slides as a teaching tool during dermatology and pathology residencies. A study group composed of 31 dermatology and pathology residents was asked to complete an online pre-quiz consisting of 20 multiple choice style questions, each associated with a static digital pathology image. After completion, participants were given access to an online tutorial composed of digitally annotated pathology slides and subsequently asked to complete a post-quiz. A control group of 12 residents completed a non-annotated version of the tutorial. Nearly all participants in the study group improved their quiz score, with an average improvement of 17%, versus only 3% (P = 0.005) in the control group. These results support the notion that annotated digital pathology slides are superior to non-annotated slides for the purpose of resident education. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  16. Ecosystem engineering of harvester ants: effects on vegetation in a sagebrush-steppe ecosystem

    USGS Publications Warehouse

    Gosselin, Elyce N; Holbrook, Joseph D.; Huggler, Katey; Brown, Emily; Vierling, Kerri T.; Arkle, Robert; Pilliod, David S.

    2016-01-01

    Harvester ants are influential in many ecosystems because they distribute and consume seeds, remove vegetation, and redistribute soil particles and nutrients. Understanding the interaction between harvester ants and plant communities is important for management and restoration efforts, particularly in systems altered by fire and invasive species such as the sagebrush-steppe. Our objective was to evaluate how vegetation cover changed as a function of distance from Owyhee harvester ant (Pogonomyrmex salinus) nests within a sagebrush-steppe ecosystem. We sampled 105 harvester ant nests within southern Idaho, USA, that occurred in different habitats: annual grassland, perennial grassland, and native shrubland. The influence of Owyhee harvester ants on vegetation was larger at the edge of ant nests, but the relationship was inconsistent among plant species. Percent cover was positively associated with distance from harvester ant nests for plant species that were considered undesirable food sources and were densely distributed. However, percent cover was negatively associated with distance-from-nests for patchily distributed and desirable plant species. For some plant species, there was no change in cover associated with distance-from-nests. Total vegetation cover was associated with distance-from-nests in the shrubland habitat but not in the 2 grasslands. The dominant plant species in the shrubland habitat was a densely distributed shrub (winterfat, Krascheninnikovia lanata) that was defoliated by harvester ants. Our results suggest that Owyhee harvester ants increase spatial heterogeneity in plant communities through plant clearing, but the direction and magnitude of effect will likely be contingent on the dominant vegetation groups. This information may inform future management and plant restoration efforts in sagebrush-steppe by directly considering the islands of influence associated with harvester ant engineering.

  17. Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

    PubMed

    Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

    2016-01-01

    We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations. Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.

  18. Do biomass harvesting guidelines influence herpetofauna following harvests of logging residues for renewable energy?

    PubMed

    Fritts, Sarah; Moorman, Christopher; Grodsky, Steven; Hazel, Dennis; Homyack, Jessica; Farrell, Chris; Castleberry, Steven

    2016-04-01

    were weak or absent. The lack of consistent community or population responses suggests the addition of a woody biomass harvest to a clearcut in pine plantations does not impact herpetofauna use of Coastal Plain loblolly plantations in the southeastern United States. We recommend additional research to examine relationships between woody biomass harvesting and rarer species or amphibians with high desiccation risk, particularly in other regions and harvesting systems.

  19. The National Cancer Informatics Program (NCIP) Annotation and Image Markup (AIM) Foundation model.

    PubMed

    Mongkolwat, Pattanasak; Kleper, Vladimir; Talbot, Skip; Rubin, Daniel

    2014-12-01

    Knowledge contained within in vivo imaging annotated by human experts or computer programs is typically stored as unstructured text and separated from other associated information. The National Cancer Informatics Program (NCIP) Annotation and Image Markup (AIM) Foundation information model is an evolution of the National Institutes of Health's (NIH) National Cancer Institute's (NCI) Cancer Bioinformatics Grid (caBIG®) AIM model. The model applies to various image types created by various techniques and disciplines. It has evolved in response to the feedback and changing demands from the imaging community at NCI. The foundation model serves as a base for other imaging disciplines that want to extend the type of information the model collects. The model captures physical entities and their characteristics, imaging observation entities and their characteristics, markups (two- and three-dimensional), AIM statements, calculations, image source, inferences, annotation role, task context or workflow, audit trail, AIM creator details, equipment used to create AIM instances, subject demographics, and adjudication observations. An AIM instance can be stored as a Digital Imaging and Communications in Medicine (DICOM) structured reporting (SR) object or Extensible Markup Language (XML) document for further processing and analysis. An AIM instance consists of one or more annotations and associated markups of a single finding along with other ancillary information in the AIM model. An annotation describes information about the meaning of pixel data in an image. A markup is a graphical drawing placed on the image that depicts a region of interest. This paper describes fundamental AIM concepts and how to use and extend AIM for various imaging disciplines.

  20. Pooled assembly of marine metagenomic datasets: enriching annotation through chimerism.

    PubMed

    Magasin, Jonathan D; Gerloff, Dietlind L

    2015-02-01

    Despite advances in high-throughput sequencing, marine metagenomic samples remain largely opaque. A typical sample contains billions of microbial organisms from thousands of genomes and quadrillions of DNA base pairs. Its derived metagenomic dataset underrepresents this complexity by orders of magnitude because of the sparseness and shortness of sequencing reads. Read shortness and sequencing errors pose a major challenge to accurate species and functional annotation. This includes distinguishing known from novel species. Often the majority of reads cannot be annotated and thus cannot help our interpretation of the sample. Here, we demonstrate quantitatively how careful assembly of marine metagenomic reads within, but also across, datasets can alleviate this problem. For 10 simulated datasets, each with species complexity modeled on a real counterpart, chimerism remained within the same species for most contigs (97%). For 42 real pyrosequencing ('454') datasets, assembly increased the proportion of annotated reads, and even more so when datasets were pooled, by on average 1.6% (max 6.6%) for species, 9.0% (max 28.7%) for Pfam protein domains and 9.4% (max 22.9%) for PANTHER gene families. Our results outline exciting prospects for data sharing in the metagenomics community. While chimeric sequences should be avoided in other areas of metagenomics (e.g. biodiversity analyses), conservative pooled assembly is advantageous for annotation specificity and sensitivity. Intriguingly, our experiment also found potential prospects for (low-cost) discovery of new species in 'old' data. Contact: dgerloff@ffame.org. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Competency Testing. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Jackson, Michael; Battiste, Barbara

    Competency testing for either graduation from high school, or as a method for assessing whether a student should advance to a higher grade level, is the focus of this annotated bibliography. Included are annotations that relate to accountability, competency testing, program descriptions where competency testing is utilized, general testing…

  2. Protannotator: a semiautomated pipeline for chromosome-wise functional annotation of the "missing" human proteome.

    PubMed

    Islam, Mohammad T; Garg, Gagan; Hancock, William S; Risk, Brian A; Baker, Mark S; Ranganathan, Shoba

    2014-01-03

    The chromosome-centric human proteome project (C-HPP) aims to define the complete set of proteins encoded in each human chromosome. The neXtProt database (September 2013) lists 20,128 proteins for the human proteome, of which 3831 human proteins (∼19%) are considered "missing" according to the standard metrics table (released September 27, 2013). In support of the C-HPP initiative, we have extended the annotation strategy developed for human chromosome 7 "missing" proteins into a semiautomated pipeline to functionally annotate the "missing" human proteome. This pipeline integrates a suite of bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. From sequential BLAST searches, we have primarily identified homologues from reviewed nonhuman mammalian proteins with protein evidence for 1271 (33.2%) "missing" proteins, followed by 703 (18.4%) homologues from reviewed nonhuman mammalian proteins and subsequently 564 (14.7%) homologues from reviewed human proteins. Functional annotations for 1945 (50.8%) "missing" proteins were also determined. To accelerate the identification of "missing" proteins from proteomics studies, we generated proteotypic peptides in silico. Matching these proteotypic peptides to ENCODE proteogenomic data resulted in proteomic evidence for 107 (2.8%) of the 3831 "missing" proteins, while evidence from a recent membrane proteomic study supported the existence of another 15 "missing" proteins. The chromosome-wise functional annotation of all "missing" proteins is freely available to the scientific community through our web server (http://biolinfo.org/protannotator).

  3. xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud

    PubMed Central

    Merchant, Nirav

    2016-01-01

    Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today’s pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant’s Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. PMID:27020957

  4. Annotation and Classification of Argumentative Writing Revisions

    ERIC Educational Resources Information Center

    Zhang, Fan; Litman, Diane

    2015-01-01

    This paper explores the annotation and classification of students' revision behaviors in argumentative writing. A sentence-level revision schema is proposed to capture why and how students make revisions. Based on the proposed schema, a small corpus of student essays and revisions was annotated. Studies show that manual annotation is reliable with…

  5. Discovering gene annotations in biomedical text databases.

    PubMed

    Cakmak, Ali; Ozsoyoglu, Gultekin

    2008-03-06

    Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need for automated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discovered knowledge into Gene Ontology (GO) concepts, a widely-used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns", and a semantic matching framework to locate phrases matching a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN has reached a precision level of 78% at a recall level of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. GEANN is useful for two distinct purposes: (i) automating the annotation of genomic entities with Gene Ontology concepts, and (ii) providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieves high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exact matching" with the advantage of locating approximate pattern occurrences with similar semantics.
Relatively

  6. Displaying Annotations for Digitised Globes

    NASA Astrophysics Data System (ADS)

    Gede, Mátyás; Farbinger, Anna

    2018-05-01

    Thanks to the efforts of the various globe digitising projects, nowadays there are plenty of old globes that can be examined as 3D models on the computer screen. These globes usually contain a lot of interesting details that an average observer would not fully discover at first viewing. The authors developed a website that can display annotations for such digitised globes. These annotations help observers of the globe to discover all the important, interesting details. Annotations consist of a plain text title, an HTML-formatted descriptive text and a corresponding polygon, and are stored in KML format. The website is powered by the Cesium virtual globe engine.
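
    As an illustration of the annotation structure described above (a plain text title, an HTML description and a polygon, stored as KML), the following sketch builds one KML Placemark with the Python standard library. It is not the authors' code: the function name `annotation_to_kml`, the sample title, text and coordinates are hypothetical, and the actual file layout used by the website is not given in the abstract. Only standard KML 2.2 elements are used.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def annotation_to_kml(title, html_description, ring):
    """Serialise one annotation as a KML document string.

    ring: list of (longitude, latitude) tuples outlining the annotated area.
    KML expects "lon,lat,alt" tuples and a closed ring, so the first point is
    repeated at the end if needed.
    """
    if len(ring) < 3:
        raise ValueError("a polygon needs at least three points")
    closed = list(ring)
    if closed[0] != closed[-1]:
        closed.append(closed[0])

    def tag(name):
        return f"{{{KML_NS}}}{name}"

    ET.register_namespace("", KML_NS)  # emit KML as the default namespace
    kml = ET.Element(tag("kml"))
    document = ET.SubElement(kml, tag("Document"))
    placemark = ET.SubElement(document, tag("Placemark"))
    ET.SubElement(placemark, tag("name")).text = title
    # The HTML is stored as escaped text; a KML reader unescapes it on parse.
    ET.SubElement(placemark, tag("description")).text = html_description
    polygon = ET.SubElement(placemark, tag("Polygon"))
    boundary = ET.SubElement(polygon, tag("outerBoundaryIs"))
    linear_ring = ET.SubElement(boundary, tag("LinearRing"))
    ET.SubElement(linear_ring, tag("coordinates")).text = " ".join(
        f"{lon},{lat},0" for lon, lat in closed
    )
    return ET.tostring(kml, encoding="unicode")


# Hypothetical annotation, invented for illustration only.
kml_text = annotation_to_kml(
    "Cartouche",
    "<p>Maker's <b>dedication</b></p>",
    [(10.0, 40.0), (12.0, 40.0), (12.0, 42.0), (10.0, 42.0)],
)
```

    Storing the description as escaped text rather than a CDATA section is a simplification chosen here because `xml.etree.ElementTree` does not write CDATA; both forms parse back to the same HTML string.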

  7. THE DIMENSIONS OF COMPOSITION ANNOTATION.

    ERIC Educational Resources Information Center

    MCCOLLY, WILLIAM

    ENGLISH TEACHER ANNOTATIONS WERE STUDIED TO DETERMINE THE DIMENSIONS AND PROPERTIES OF THE ENTIRE SYSTEM FOR WRITING CORRECTIONS AND CRITICISMS ON COMPOSITIONS. FOUR SETS OF COMPOSITIONS WERE WRITTEN BY STUDENTS IN GRADES 9 THROUGH 13. TYPESCRIPTS OF THE COMPOSITIONS WERE ANNOTATED BY CLASSROOM ENGLISH TEACHERS. THEN, 32 ENGLISH TEACHERS JUDGED…

  8. MimoSA: a system for minimotif annotation

    PubMed Central

    2010-01-01

    Background: Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. Results: We have built the MimoSA application for minimotif annotation. The application supports management of the Minimotif Miner database, literature tracking, and annotation of new minimotifs. MimoSA enables the visualization, organization, selection and editing functions of minimotifs and their attributes in the MnM database. For the literature components, MimoSA provides paper status tracking and scoring of papers for annotation through a freely available machine learning approach, which is based on word correlation. The paper scoring algorithm is also available as a separate program, TextMine. Form-driven annotation of minimotif attributes enables entry of new minimotifs into the MnM database. Several supporting features increase the efficiency of annotation. The layered architecture of MimoSA allows for extensibility by separating the functions of paper scoring, minimotif visualization, and database management. MimoSA is readily adaptable to other annotation efforts that manually curate literature into a MySQL database. Conclusions: MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database. We have built MimoSA as an application that integrates dynamic abstract scoring with a high performance relational model of minimotif syntax. MimoSA's TextMine, an efficient paper-scoring algorithm, can be used to dynamically rank papers with

  9. Automated clinical annotation of tissue bank specimens.

    PubMed

    Gilbertson, John R; Gupta, Rajnish; Nie, Yimin; Patel, Ashokkumar A; Becich, Michael J

    2004-01-01

    Modern, molecular bio-medicine is driving a growing demand for extensively annotated tissue bank specimens. With careful clinical, pathologic and outcomes annotation, samples can be better matched to the research question at hand and experimental results better understood and verified. However, the difficulty and expense of detailed specimen annotation is well beyond the capability of most banks and has made access to well documented tissue a major limitation in medical research. In this context, we have implemented automated annotation of banked tissue by integrating data from three clinical systems--the cancer registry, the pathology LIS and the tissue bank inventory system--through a classical data warehouse environment. The project required modification of clinical systems, development of methods to identify patients between and map data elements across systems and the creation of de-identified data in data marts for use by researchers. The result has been much more extensive and accurate initial tissue annotation with less effort in the tissue bank, as well as dynamic ongoing annotation as the cancer registry follows patients over time.

  10. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease

    PubMed Central

    2012-01-01

    The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org. PMID:23013645

  11. Collaborative web-based annotation of video footage of deep-sea life, ecosystems and geological processes

    NASA Astrophysics Data System (ADS)

    Kottmann, R.; Ratmeyer, V.; Pop Ristov, A.; Boetius, A.

    2012-04-01

    More and more seagoing scientific expeditions use video-controlled research platforms such as Remotely Operated Vehicles (ROV), Autonomous Underwater Vehicles (AUV), and towed camera systems. These produce many hours of video material which contains detailed and scientifically highly valuable footage of the biological, chemical, geological, and physical aspects of the oceans. Many of the videos contain unique observations of unknown life-forms which are rare, and which cannot be sampled and studied otherwise. To make such video material accessible online and to create a collaborative annotation environment, the "Video Annotation and processing platform" (V-App) was developed. A first solely web-based installation for ROV videos is set up at the German Center for Marine Environmental Sciences (available at http://videolib.marum.de). It allows users to search and watch videos with a standard web browser based on the HTML5 standard. Moreover, V-App implements social web technologies allowing a distributed world-wide scientific community to collaboratively annotate videos anywhere at any time. It has several features fully implemented, among which are:

    • User login system for fine-grained permission and access control
    • Video watching
    • Video search using keywords, geographic position, depth and time range and any combination thereof
    • Video annotation organised in themes (tracks) such as biology and geology among others in standard or full screen mode
    • Annotation keyword management: administrative users can add, delete, and update single keywords for annotation or upload sets of keywords from Excel sheets
    • Download of products for scientific use

    This unique web application system helps make costly ROV videos available online (estimated cost range between 5,000 and 10,000 Euros per hour depending on the combination of ship and ROV). Moreover, with this system each expert annotation adds instantaneously available and valuable knowledge to otherwise uncharted

  12. Impact of timber harvest on species accumulation curves for oak herbivore communities of the Missouri Ozarks

    Treesearch

    Robert J. Marquis; Rebecca Forkner; John T. Lill; Josiane Le Corff

    2002-01-01

    We report the effects of two timber harvest methods, even-aged and uneven-aged harvest, versus no harvest on species accumulation curves for leaf-chewing herbivores of Quercus alba and Q. velutina in the Missouri Ozarks. The study was part of a larger project, the Missouri Ozark Forest Ecosystem Project (MOFEP). Herbivores were...

  13. Annotated chemical patent corpus: a gold standard for text mining.

    PubMed

    Akhondi, Saber A; Klenner, Alexander G; Tyrchan, Christian; Manchala, Anil K; Boppana, Kiran; Lowe, Daniel; Zimmermann, Marc; Jagarlapudi, Sarma A R P; Sayle, Roger; Kors, Jan A; Muresan, Sorel

    2014-01-01

    Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take a substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line breaks due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

  14. Annotated Chemical Patent Corpus: A Gold Standard for Text Mining

    PubMed Central

    Akhondi, Saber A.; Klenner, Alexander G.; Tyrchan, Christian; Manchala, Anil K.; Boppana, Kiran; Lowe, Daniel; Zimmermann, Marc; Jagarlapudi, Sarma A. R. P.; Sayle, Roger; Kors, Jan A.; Muresan, Sorel

    2014-01-01

    Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take a substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line breaks due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org. PMID:25268232

  15. GARNET--gene set analysis with exploration of annotation relations.

    PubMed

    Rho, Kyoohyoung; Kim, Bumjin; Jang, Youngjun; Lee, Sanghyun; Bae, Taejeong; Seo, Jihae; Seo, Chaehwa; Lee, Jihyun; Kang, Hyunjung; Yu, Ungsik; Kim, Sunghoon; Lee, Sanghyuk; Kim, Wan Kyu

    2011-02-15

    Gene set analysis is a powerful method of deducing biological meaning for an a priori defined set of genes. Numerous tools have been developed to test statistical enrichment or depletion in specific pathways or gene ontology (GO) terms. Major difficulties towards biological interpretation are integrating diverse types of annotation categories and exploring the relationships between annotation terms of similar information. GARNET (Gene Annotation Relationship NEtwork Tools) is an integrative platform for gene set analysis with many novel features. It includes tools for retrieval of genes from annotation database, statistical analysis & visualization of annotation relationships, and managing gene sets. In an effort to allow access to a full spectrum of amassed biological knowledge, we have integrated a variety of annotation data that include the GO, domain, disease, drug, chromosomal location, and custom-defined annotations. Diverse types of molecular networks (pathways, transcription and microRNA regulations, protein-protein interaction) are also included. The pair-wise relationship between annotation gene sets was calculated using kappa statistics. GARNET consists of three modules--gene set manager, gene set analysis and gene set retrieval, which are tightly integrated to provide virtually automatic analysis for gene sets. A dedicated viewer for annotation network has been developed to facilitate exploration of the related annotations. GARNET (gene annotation relationship network tools) is an integrative platform for diverse types of gene set analysis, where complex relationships among gene annotations can be easily explored with an intuitive network visualization tool (http://garnet.isysbio.org/ or http://ercsb.ewha.ac.kr/garnet/).
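
    The abstract above states that pair-wise relationships between annotation gene sets were calculated using kappa statistics. As a rough sketch of what such a calculation can look like, the code below computes the standard Cohen's kappa for two gene sets over a shared gene universe, treating membership of each gene in each set as a yes/no rating. GARNET's exact formulation is not given in the abstract, so this is an assumption; the function name `gene_set_kappa` and the gene identifiers are hypothetical.

```python
def gene_set_kappa(set_a, set_b, universe):
    """Cohen's kappa for the membership agreement of two gene sets.

    Every gene in `universe` is classified as in/out of each set, giving a
    2x2 table; kappa compares observed agreement with chance agreement.
    """
    universe = set(universe)
    a_members = set(set_a) & universe
    b_members = set(set_b) & universe
    n = len(universe)
    if n == 0:
        raise ValueError("universe must not be empty")

    both = len(a_members & b_members)
    only_a = len(a_members - b_members)
    only_b = len(b_members - a_members)
    neither = n - both - only_a - only_b

    observed = (both + neither) / n
    expected = (
        (both + only_a) * (both + only_b)
        + (only_b + neither) * (only_a + neither)
    ) / (n * n)
    if expected == 1.0:
        # Both sets are empty or both cover the whole universe.
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Toy example with invented gene identifiers.
genes = [f"g{i}" for i in range(1, 11)]
kappa = gene_set_kappa({"g1", "g2", "g3", "g4"}, {"g3", "g4", "g5", "g6"}, genes)
```

    In the toy example the two sets share two of their four genes in a universe of ten, giving an observed agreement of 0.6 against a chance agreement of 0.52. Kappa therefore depends on the chosen universe, which is a design decision any real implementation has to make explicit.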

  16. The contribution of lakes to global inland fisheries harvest

    USGS Publications Warehouse

    Deines, Andrew M.; Bunnell, David B.; Rogers, Mark W.; Bennion, David; Woelmer, Whitney; Sayers, Michael J.; Grimm, Amanda G.; Shuchman, Robert A.; Raymer, Zachary B.; Brooks, Colin N.; Mychek-Londer, Justin G.; Taylor, William W.; Beard, Douglas

    2017-01-01

    Freshwater ecosystems provide numerous services for communities worldwide, including irrigation, hydropower, and municipal water; however, the services provided by inland fisheries – nourishment, employment, and recreational opportunities – are often comparatively undervalued. We provide an independent estimate of global lake harvest to improve biological and socioeconomic assessments of inland fisheries. On the basis of satellite-derived estimates of chlorophyll concentration from 80,012 globally distributed lakes, lake-specific fishing effort based on human population, and output from a Bayesian hierarchical model, we estimated that the global lake fishery harvest in the year 2011 was 8.4 million tons (mt). Our calculations excluded harvests from highly productive rivers, wetlands, and very small lakes; therefore, the true cumulative global fishery harvest from all freshwater sources likely exceeded 11 mt as reported by the Food and Agriculture Organization of the United Nations (FAO). This putative underestimate by the FAO could diminish the perceived importance of inland fisheries and perpetuate decisions that adversely affect these fisheries and millions of people.

  17. Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

    PubMed

    Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

    1997-01-01

    SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.

  18. Traditional uses of plants in a rural community of Mozambique and possible links with Miombo degradation and harvesting sustainability

    PubMed Central

    2014-01-01

    Background Miombo woodlands play an important role in the livelihood of people living in sub-equatorial African countries, contributing to satisfy basic human needs such as food, medicine, fuelwood and building materials. However, over-exploitation of plant resources and unsustainable harvest practices can potentially degrade forests. The aim of this study was to document the use of Miombo plant products, other than medicinal plants, in local communities, within a wider framework in which we discussed possible links between traditional uses and conservation status of the used species and of the whole Miombo environment. Methods Fieldwork took place in four communities of Muda-Serração, central Mozambique. We conducted semi-structured interviews with 52 informants about their knowledge, use and harvesting practices of useful plants. A survey on local Miombo vegetation was also carried out in order to assess abundance and distribution of useful woody plants cited in the interviews in areas exposed to different exploitation rates. A Conservation Priority index was also applied to rank conservation values of each used woody species. Results Ninety-eight plants cited by the informants were botanically identified. The most relevant general category was represented by food plants (45 species), followed by handicraft plants (38 species) and domestic plants (37 species). Among the 54 woody species observed in vegetation plots, 52% were cited as useful in the interviews. Twenty-six woody species found in ‘natural’ Miombo areas were not found in ‘degraded’ ones: of these, 46% were cited in the interviews (58% in the food category, 50% in the handicraft category, 25% in the domestic category and 8% in the fishing category). Results of conservation ranking showed that 7 woody species deserve conservation priority in the investigated area. Conclusions This study shows that the communities investigated rely heavily on local forest products for their daily subsistence

  19. "La Cosecha"/The Harvest: Sustainable Models of School-Community Engagement at a Bilingual Program

    ERIC Educational Resources Information Center

    Mangual Figueroa, Ariana; Baquedano-López, Patricia; Leyva-Cutler, Beatriz

    2014-01-01

    This article examines the culminating activity--"la cosecha" or the harvest--in a yearlong project in which teachers at a bilingual afterschool program and staff from a citywide environmental advocacy group taught students to plant, harvest, and sell produce grown at the school site. The authors show how students are socialized to become…

  20. Effects of timber harvesting on birds in the Black Hills of South Dakota and Wyoming, USA

    Treesearch

    Brian L. Dykstra; Mark A. Rumble; Lester D. Flake

    1997-01-01

    Timber harvest alters structural characteristics in ponderosa pine forests. In the Black Hills, harvested stands with 40-70% overstory canopy cover are managed as sapling/pole (3.0 - 22.9 cm dbh) or mature (> 22.9 cm dbh) stands. Changing the forest structure to two size classes has unknown effects on bird communities in this region. We counted birds in 20 harvested...

  1. Solar Tutorial and Annotation Resource (STAR)

    NASA Astrophysics Data System (ADS)

    Showalter, C.; Rex, R.; Hurlburt, N. E.; Zita, E. J.

    2009-12-01

    We have written a software suite designed to facilitate solar data analysis by scientists, students, and the public, anticipating enormous datasets from future instruments. Our “STAR" suite includes an interactive learning section explaining 15 classes of solar events. Users learn software tools that exploit humans’ superior ability (over computers) to identify many events. Annotation tools include time slice generation to quantify loop oscillations, the interpolation of event shapes using natural cubic splines (for loops, sigmoids, and filaments) and closed cubic splines (for coronal holes). Learning these tools in an environment where examples are provided prepares new users to comfortably utilize annotation software with new data. Upon completion of our tutorial, users are presented with media of various solar events and asked to identify and annotate the images, to test their mastery of the system. Goals of the project include public input into the data analysis of very large datasets from future solar satellites, and increased public interest and knowledge about the Sun. In 2010, the Solar Dynamics Observatory (SDO) will be launched into orbit. SDO’s advancements in solar telescope technology will generate a terabyte per day of high-quality data, requiring innovation in data management. While major projects develop automated feature recognition software, so that computers can complete much of the initial event tagging and analysis, still, that software cannot annotate features such as sigmoids, coronal magnetic loops, coronal dimming, etc., due to large amounts of data concentrated in relatively small areas. Previously, solar physicists manually annotated these features, but with the imminent influx of data it is unrealistic to expect specialized researchers to examine every image that computers cannot fully process. A new approach is needed to efficiently process these data. Providing analysis tools and data access to students and the public have proven

  2. Discovering gene annotations in biomedical text databases

    PubMed Central

    Cakmak, Ali; Ozsoyoglu, Gultekin

    2008-01-01

    Background Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need for automated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. Results In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discovered knowledge into Gene Ontology (GO) concepts, a widely-used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns", and a semantic matching framework to locate phrases matching a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN has reached a precision level of 78% at the recall level of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. Conclusion GEANN is useful for two distinct purposes: (i) automating the annotation of genomic entities with Gene Ontology concepts, and (ii) providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieves high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exact matching" with the advantage of locating approximate pattern occurrences with

  3. Community College Faculty Retention: Examining Burnout, Stress, and Job Satisfaction. UCLA Community College Bibliography

    ERIC Educational Resources Information Center

    McJunkin, Kyle Stewart

    2005-01-01

    Recent literature on faculty departure from community colleges has focused primarily on faculty retirement. Less research has been conducted on turnover related to stress and faculty burnout, particularly at the community college level. In order to shed some light on this subject, the citations in this annotated bibliography focus on the…

  4. MIPS bacterial genomes functional annotation benchmark dataset.

    PubMed

    Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

    2005-05-15

    Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab

  5. Quantification of the impact of PSI:Biology according to the annotations of the determined structures.

    PubMed

    DePietro, Paul J; Julfayev, Elchin S; McLaughlin, William A

    2013-10-21

    Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure. One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure. We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in conjunction with established metrics, i

  6. Quantification of the impact of PSI:Biology according to the annotations of the determined structures

    PubMed Central

    2013-01-01

    Background Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure. Results One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure. Conclusions We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in

  7. The Viking viewer for connectomics: scalable multi-user annotation and summarization of large volume data sets

    PubMed Central

    ANDERSON, JR; MOHAMMED, S; GRIMM, B; JONES, BW; KOSHEVOY, P; TASDIZEN, T; WHITAKER, R; MARC, RE

    2011-01-01

    Modern microscope automation permits the collection of vast amounts of continuous anatomical imagery in both two and three dimensions. These large data sets present significant challenges for data storage, access, viewing, annotation and analysis. The cost and overhead of collecting and storing the data can be extremely high. Large data sets quickly exceed an individual's capability for timely analysis and present challenges in efficiently applying transforms, if needed. Finally annotated anatomical data sets can represent a significant investment of resources and should be easily accessible to the scientific community. The Viking application was our solution created to view and annotate a 16.5 TB ultrastructural retinal connectome volume and we demonstrate its utility in reconstructing neural networks for a distinctive retinal amacrine cell class. Viking has several key features. (1) It works over the internet using HTTP and supports many concurrent users limited only by hardware. (2) It supports a multi-user, collaborative annotation strategy. (3) It cleanly demarcates viewing and analysis from data collection and hosting. (4) It is capable of applying transformations in real-time. (5) It has an easily extensible user interface, allowing addition of specialized modules without rewriting the viewer. PMID:21118201

  8. Systems Theory and Communication. Annotated Bibliography.

    ERIC Educational Resources Information Center

    Covington, William G., Jr.

    This annotated bibliography presents annotations of 31 books and journal articles dealing with systems theory and its relation to organizational communication, marketing, information theory, and cybernetics. Materials were published between 1963 and 1992 and are listed alphabetically by author. (RS)

  9. Current and future trends in marine image annotation software

    NASA Astrophysics Data System (ADS)

    Gomes-Pereira, Jose Nuno; Auger, Vincent; Beisiegel, Kolja; Benjamin, Robert; Bergmann, Melanie; Bowden, David; Buhl-Mortensen, Pal; De Leo, Fabio C.; Dionísio, Gisela; Durden, Jennifer M.; Edwards, Luke; Friedman, Ariell; Greinert, Jens; Jacobsen-Stout, Nancy; Lerner, Steve; Leslie, Murray; Nattkemper, Tim W.; Sameoto, Jessica A.; Schoening, Timm; Schouten, Ronald; Seager, James; Singh, Hanumant; Soubigou, Olivier; Tojeira, Inês; van den Beld, Inge; Dias, Frederico; Tempera, Fernando; Santos, Ricardo S.

    2016-12-01

    Given the need to describe, analyze and index large quantities of marine imagery data for exploration and monitoring activities, a range of specialized image annotation tools have been developed worldwide. Image annotation - the process of transposing objects or events represented in a video or still image to the semantic level, may involve human interactions and computer-assisted solutions. Marine image annotation software (MIAS) have enabled over 500 publications to date. We review the functioning, application trends and developments, by comparing general and advanced features of 23 different tools utilized in underwater image analysis. MIAS requiring human input are basically a graphical user interface, with a video player or image browser that recognizes a specific time code or image code, allowing to log events in a time-stamped (and/or geo-referenced) manner. MIAS differ from similar software by the capability of integrating data associated to video collection, the most simple being the position coordinates of the video recording platform. MIAS have three main characteristics: annotating events in real time, posteriorly to annotation and interact with a database. These range from simple annotation interfaces, to full onboard data management systems, with a variety of toolboxes. Advanced packages allow to input and display data from multiple sensors or multiple annotators via intranet or internet. Posterior human-mediated annotation often include tools for data display and image analysis, e.g. length, area, image segmentation, point count; and in a few cases the possibility of browsing and editing previous dive logs or to analyze the annotations. The interaction with a database allows the automatic integration of annotations from different surveys, repeated annotation and collaborative annotation of shared datasets, browsing and querying of data. Progress in the field of automated annotation is mostly in post processing, for stable platforms or still images

  10. Improving eye safety in citrus harvest crews through the acceptance of personal protective equipment, community-based participatory research, social marketing, and community health workers.

    PubMed

    Tovar-Aguilar, J Antonio; Monaghan, Paul F; Bryant, Carol A; Esposito, Andrew; Wade, Mark; Ruiz, Omar; McDermott, Robert J

    2014-01-01

    For the last 10 years, the Partnership for Citrus Workers Health (PCWH) has been an evidence-based intervention program that promotes the adoption of protective eye safety equipment among Spanish-speaking farmworkers of Florida. At the root of this program is the systematic use of community-based preventive marketing (CBPM) and the training of community health workers (CHWs) among citrus harvester using popular education. CBPM is a model that combines the organizational system of community-based participatory research (CBPR) and the strategies of social marketing. This particular program relied on formative research data using a mixed-methods approach and a multilevel stakeholder analysis that allowed for rapid dissemination, effective increase of personal protective equipment (PPE) usage, and a subsequent impact on adoptive workers and companies. Focus groups, face-to-face interviews, surveys, participant observation, Greco-Latin square, and quasi-experimental tests were implemented. A 20-hour popular education training produced CHWs that translated results of the formative research to potential adopters and also provided first aid skills for eye injuries. Reduction of injuries is not limited to the use of safety glasses, but also to the adoption of timely intervention and regular eye hygiene. Limitations include adoption in only large companies, rapid decline of eye safety glasses without consistent intervention, technological limitations of glasses, and thorough cost-benefit analysis.

  11. PANNZER2: a rapid functional annotation web server.

    PubMed

    Törönen, Petri; Medlar, Alan; Holm, Liisa

    2018-05-08

    The unprecedented growth of high-throughput sequencing has led to an ever-widening annotation gap in protein databases. While computational prediction methods are available to make up the shortfall, a majority of public web servers are hindered by practical limitations and poor performance. Here, we introduce PANNZER2 (Protein ANNotation with Z-scoRE), a fast functional annotation web server that provides both Gene Ontology (GO) annotations and free text description predictions. PANNZER2 uses SANSparallel to perform high-performance homology searches, making bulk annotation based on sequence similarity practical. PANNZER2 can output GO annotations from multiple scoring functions, enabling users to see which predictions are robust across predictors. Finally, PANNZER2 predictions scored within the top 10 methods for molecular function and biological process in the CAFA2 NK-full benchmark. The PANNZER2 web server is updated on a monthly schedule and is accessible at http://ekhidna2.biocenter.helsinki.fi/sanspanz/. The source code is available under the GNU Public Licence v3.

  12. The History Harvest: An Experiment in Democratizing the Past through Experiential Learning

    ERIC Educational Resources Information Center

    Thomas, William G.; Jones, Patrick D.

    2013-01-01

    The History Harvest project (http://historyharvest.unl.edu) is an open, digital archive of historical artifacts gathered from communities across the United States. Each year, The University of Nebraska-Lincoln Department of History partners with local institutions and community members within a highlighted area to collect, preserve, and share…

  13. Structural and functional annotation of the porcine immunome

    PubMed Central

    2013-01-01

    Background The domestic pig is known as an excellent model for human immunology and the two species share many pathogens. Susceptibility to infectious disease is one of the major constraints on swine performance, yet the structure and function of genes comprising the pig immunome are not well-characterized. The completion of the pig genome provides the opportunity to annotate the pig immunome, and compare and contrast pig and human immune systems. Results The Immune Response Annotation Group (IRAG) used computational curation and manual annotation of the swine genome assembly 10.2 (Sscrofa10.2) to refine the currently available automated annotation of 1,369 immunity-related genes through sequence-based comparison to genes in other species. Within these genes, we annotated 3,472 transcripts. Annotation provided evidence for gene expansions in several immune response families, and identified artiodactyl-specific expansions in the cathelicidin and type 1 Interferon families. We found gene duplications for 18 genes, including 13 immune response genes and five non-immune response genes discovered in the annotation process. Manual annotation provided evidence for many new alternative splice variants and 8 gene duplications. Over 1,100 transcripts without porcine sequence evidence were detected using cross-species annotation. We used a functional approach to discover and accurately annotate porcine immune response genes. A co-expression clustering analysis of transcriptomic data from selected experimental infections or immune stimulations of blood, macrophages or lymph nodes identified a large cluster of genes that exhibited a correlated positive response upon infection across multiple pathogens or immune stimuli. Interestingly, this gene cluster (cluster 4) is enriched for known general human immune response genes, yet contains many un-annotated porcine genes. A phylogenetic analysis of the encoded proteins of cluster 4 genes showed that 15% exhibited an accelerated

  14. Adding Value to Large Multimedia Collections through Annotation Technologies and Tools: Serving Communities of Interest.

    ERIC Educational Resources Information Center

    Shabajee, Paul; Miller, Libby; Dingley, Andy

    A group of research projects based at HP-Labs Bristol, the University of Bristol (England) and ARKive (a new large multimedia database project focused on the worlds biodiversity based in the United Kingdom) are working to develop a flexible model for the indexing of multimedia collections that allows users to annotate content utilizing extensible…

  15. Non-redundant patent sequence databases with value-added annotations at two levels

    PubMed Central

    Li, Weizhong; McWilliam, Hamish; de la Torre, Ana Richart; Grodowski, Adam; Benediktovich, Irina; Goujon, Mickael; Nauche, Stephane; Lopez, Rodrigo

    2010-01-01

    The European Bioinformatics Institute (EMBL-EBI) provides public access to patent data, including abstracts, chemical compounds and sequences. Sequences can appear multiple times due to the filing of the same invention with multiple patent offices, or the use of the same sequence by different inventors in different contexts. Information relating to the source invention may be incomplete, and biological information available in patent documents elsewhere may not be reflected in the annotation of the sequence. Search and analysis of these data have become increasingly challenging for both the scientific and intellectual-property communities. Here, we report a collection of non-redundant patent sequence databases, which cover the EMBL-Bank nucleotides patent class and the patent protein databases and contain value-added annotations from patent documents. The databases were created at two levels by the use of sequence MD5 checksums. Sequences within a level-1 cluster are 100% identical over their whole length. Level-2 clusters were defined by sub-grouping level-1 clusters based on patent family information. Value-added annotations, such as publication number corrections, earliest publication dates and feature collations, significantly enhance the quality of the data, allowing for better tracking and cross-referencing. The databases are available at: http://www.ebi.ac.uk/patentdata/nr/. PMID:19884134
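
    The two-level clustering described above can be illustrated with a short sketch. It assumes that each entry is a tuple of a hypothetical entry identifier, a patent family identifier and a sequence, and that sequences are normalized by removing whitespace and upper-casing before hashing. Neither the record layout nor the normalization is stated in the abstract.

```python
import hashlib
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# (entry_id, patent_family, sequence): a hypothetical record layout.
Entry = Tuple[str, str, str]


def sequence_checksum(sequence: str) -> str:
    """MD5 hex digest of a sequence after whitespace removal and upper-casing."""
    normalized = "".join(sequence.split()).upper()
    return hashlib.md5(normalized.encode("ascii")).hexdigest()


def cluster_entries(
    entries: Iterable[Entry],
) -> Tuple[Dict[str, List[str]], Dict[Tuple[str, str], List[str]]]:
    """Group entries at two levels.

    Level 1: entries whose sequences share a checksum, i.e. identical
    sequences over their whole length.
    Level 2: each level-1 cluster sub-grouped by patent family.
    """
    level1: Dict[str, List[str]] = defaultdict(list)
    level2: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for entry_id, family, sequence in entries:
        checksum = sequence_checksum(sequence)
        level1[checksum].append(entry_id)
        level2[(checksum, family)].append(entry_id)
    return dict(level1), dict(level2)


if __name__ == "__main__":
    demo = [
        ("E1", "FAM1", "ACGT"),
        ("E2", "FAM1", "acgt"),   # same sequence, same family as E1
        ("E3", "FAM2", "ACGT"),   # same sequence, different family
        ("E4", "FAM2", "ACGA"),   # different sequence
    ]
    first, second = cluster_entries(demo)
    print(len(first), "level-1 clusters;", len(second), "level-2 clusters")
```

    Hashing lets identical sequences be grouped in one pass without pairwise comparison. MD5 serves here only as a fingerprint for exact duplicates, not as a security measure.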

  16. Non-redundant patent sequence databases with value-added annotations at two levels.

    PubMed

    Li, Weizhong; McWilliam, Hamish; de la Torre, Ana Richart; Grodowski, Adam; Benediktovich, Irina; Goujon, Mickael; Nauche, Stephane; Lopez, Rodrigo

    2010-01-01

    The European Bioinformatics Institute (EMBL-EBI) provides public access to patent data, including abstracts, chemical compounds and sequences. Sequences can appear multiple times due to the filing of the same invention with multiple patent offices, or the use of the same sequence by different inventors in different contexts. Information relating to the source invention may be incomplete, and biological information available in patent documents elsewhere may not be reflected in the annotation of the sequence. Search and analysis of these data have become increasingly challenging for both the scientific and intellectual-property communities. Here, we report a collection of non-redundant patent sequence databases, which cover the EMBL-Bank nucleotides patent class and the patent protein databases and contain value-added annotations from patent documents. The databases were created at two levels by the use of sequence MD5 checksums. Sequences within a level-1 cluster are 100% identical over their whole length. Level-2 clusters were defined by sub-grouping level-1 clusters based on patent family information. Value-added annotations, such as publication number corrections, earliest publication dates and feature collations, significantly enhance the quality of the data, allowing for better tracking and cross-referencing. The databases are available at: http://www.ebi.ac.uk/patentdata/nr/.

  17. A Factor Graph Approach to Automated GO Annotation.

    PubMed

    Spetale, Flavio E; Tapia, Elizabeth; Krsticevic, Flavia; Roda, Fernando; Bulacio, Pilar

    2016-01-01

    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum.
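
    To make the message-passing idea concrete, the sketch below runs sum-product on a deliberately tiny case: a single chain of GO terms in which each term is the parent of the next. It assumes that raw classifier scores act as independent noisy evidence, and that the only GO relationship enforced is that a child term cannot apply when its parent does not. This is a simplified illustration with hypothetical names, not the authors' factor graph.

```python
from typing import List, Tuple


def constraint(parent: int, child: int) -> float:
    """Hierarchy factor: forbid child = 1 while parent = 0."""
    return 0.0 if (child == 1 and parent == 0) else 1.0


def chain_marginals(raw_scores: List[float]) -> List[float]:
    """Sum-product marginals P(term = 1) on a parent -> child chain.

    raw_scores[i] is the raw probability from a base classifier that
    term i applies; term i is assumed to be the parent of term i + 1.
    Raises ZeroDivisionError if the scores contradict the hierarchy
    with certainty (for example [0.0, 1.0]).
    """
    n = len(raw_scores)
    unary: List[Tuple[float, float]] = [(1.0 - s, s) for s in raw_scores]

    # Messages flowing from ancestors down to each term.
    down: List[Tuple[float, float]] = [(1.0, 1.0)] * n
    for i in range(1, n):
        msg = [0.0, 0.0]
        for c in (0, 1):
            msg[c] = sum(unary[i - 1][p] * down[i - 1][p] * constraint(p, c)
                         for p in (0, 1))
        down[i] = (msg[0], msg[1])

    # Messages flowing from descendants up to each term.
    up: List[Tuple[float, float]] = [(1.0, 1.0)] * n
    for i in range(n - 2, -1, -1):
        msg = [0.0, 0.0]
        for p in (0, 1):
            msg[p] = sum(unary[i + 1][c] * up[i + 1][c] * constraint(p, c)
                         for c in (0, 1))
        up[i] = (msg[0], msg[1])

    marginals = []
    for i in range(n):
        off = unary[i][0] * down[i][0] * up[i][0]
        on = unary[i][1] * down[i][1] * up[i][1]
        marginals.append(on / (off + on))
    return marginals


if __name__ == "__main__":
    # A weakly predicted parent with a strongly predicted child.
    print([round(m, 4) for m in chain_marginals([0.3, 0.9])])
```

    In this toy setting the strong child prediction raises the marginal of the weakly predicted parent, and the child's marginal never exceeds the parent's. A real GO graph is a directed acyclic graph rather than a chain, so a practical implementation would need an iterative schedule such as the one the abstract describes.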

  18. Leveraging annotation-based modeling with Jump.

    PubMed

    Bergmayr, Alexander; Grossniklaus, Michael; Wimmer, Manuel; Kappel, Gerti

    2018-01-01

    The capability of UML profiles to serve as annotation mechanism has been recognized in both research and industry. Today's modeling tools offer profiles specific to platforms, such as Java, as they facilitate model-based engineering approaches. However, considering the large number of possible annotations in Java, manually developing the corresponding profiles would only be achievable by huge development and maintenance efforts. Thus, leveraging annotation-based modeling requires an automated approach capable of generating platform-specific profiles from Java libraries. To address this challenge, we present the fully automated transformation chain realized by Jump, thereby continuing existing mapping efforts between Java and UML by emphasizing on annotations and profiles. The evaluation of Jump shows that it scales for large Java libraries and generates profiles of equal or even improved quality compared to profiles currently used in practice. Furthermore, we demonstrate the practical value of Jump by contributing profiles that facilitate reverse engineering and forward engineering processes for the Java platform by applying it to a modernization scenario.

  19. Drivers of Tree Growth, Mortality and Harvest Preferences in Species-Rich Plantations for Smallholders and Communities in the Tropics

    PubMed Central

    Nguyen, Huong; Vanclay, Jerome; Herbohn, John; Firn, Jennifer

    2016-01-01

    There is growing interest in multi-species tropical plantations but little information exists to guide their design and silviculture. The Rainforestation Farming system is the oldest tropical polyculture planting system in the Philippines and provides a unique opportunity to understand the underlying processes affecting tree performance within diverse plantings. Data collected from 85 plots distributed across the 18 mixed-species plantations in the Philippines was used to identify the factors influencing growth, probability of harvest, and death of trees in these complex plantings. The 18 sites (aged from 6 to 11 years at time of first measurement) were measured on three occasions over a 6-year period. We used data from the first period of data collection to develop models predicting harvesting probability and growth of trees in the second period. We found little evidence that tree species diversity had an effect on tree growth and tree loss at the community level, although a negative effect was found on tree growth of specific species such as Parashorea plicata and Swietenia macrophylla. While tree density of stands at age 10+ years (more than 1000 trees/ha with diameter > 5cm) did not have an impact on growth, growth rates were decreasing in stands with a high basal area. Tree size in the first period of measure was a good predictor for both tree growth and tree status in the next period, with larger trees tending to grow faster and having a greater chance of being harvested, and a lower possibility of mortality than smaller trees. Shade-intolerant trees were both more likely to be harvested, and had a higher probability of death, than shade-tolerant individuals. Native species and exotic species were equally likely to have been lost from the plots between measurement periods. However, shade-tolerant native trees were likely to grow faster than the others at age 10+ years. Our findings suggest that species traits (e.g. shade tolerance) could play an important

  20. Drivers of Tree Growth, Mortality and Harvest Preferences in Species-Rich Plantations for Smallholders and Communities in the Tropics.

    PubMed

    Nguyen, Huong; Vanclay, Jerome; Herbohn, John; Firn, Jennifer

    2016-01-01

    There is growing interest in multi-species tropical plantations but little information exists to guide their design and silviculture. The Rainforestation Farming system is the oldest tropical polyculture planting system in the Philippines and provides a unique opportunity to understand the underlying processes affecting tree performance within diverse plantings. Data collected from 85 plots distributed across the 18 mixed-species plantations in the Philippines was used to identify the factors influencing growth, probability of harvest, and death of trees in these complex plantings. The 18 sites (aged from 6 to 11 years at time of first measurement) were measured on three occasions over a 6-year period. We used data from the first period of data collection to develop models predicting harvesting probability and growth of trees in the second period. We found little evidence that tree species diversity had an effect on tree growth and tree loss at the community level, although a negative effect was found on tree growth of specific species such as Parashorea plicata and Swietenia macrophylla. While tree density of stands at age 10+ years (more than 1000 trees/ha with diameter > 5cm) did not have an impact on growth, growth rates were decreasing in stands with a high basal area. Tree size in the first period of measure was a good predictor for both tree growth and tree status in the next period, with larger trees tending to grow faster and having a greater chance of being harvested, and a lower possibility of mortality than smaller trees. Shade-intolerant trees were both more likely to be harvested, and had a higher probability of death, than shade-tolerant individuals. Native species and exotic species were equally likely to have been lost from the plots between measurement periods. However, shade-tolerant native trees were likely to grow faster than the others at age 10+ years. Our findings suggest that species traits (e.g. shade tolerance) could play an important

  1. Terra Harvest software architecture

    NASA Astrophysics Data System (ADS)

    Humeniuk, Dave; Klawon, Kevin

    2012-06-01

    Under the Terra Harvest Program, the DIA has the objective of developing a universal Controller for the Unattended Ground Sensor (UGS) community. The mission is to define, implement, and thoroughly document an open architecture that universally supports UGS missions, integrating disparate systems, peripherals, etc. The Controller's inherent interoperability with numerous systems enables the integration of both legacy and future UGS System (UGSS) components, while the design's open architecture supports rapid third-party development to ensure operational readiness. The successful accomplishment of these objectives by the program's Phase 3b contractors is demonstrated via integration of the companies' respective plug-'n'-play contributions that include controllers, various peripherals, such as sensors, cameras, etc., and their associated software drivers. In order to independently validate the Terra Harvest architecture, L-3 Nova Engineering, along with its partner, the University of Dayton Research Institute, is developing the Terra Harvest Open Source Environment (THOSE), a Java Virtual Machine (JVM) running on an embedded Linux Operating System. The Use Cases on which the software is developed support the full range of UGS operational scenarios such as remote sensor triggering, image capture, and data exfiltration. The Team is additionally developing an ARM microprocessor-based evaluation platform that is both energy-efficient and operationally flexible. The paper describes the overall THOSE architecture, as well as the design decisions for some of the key software components. Development process for THOSE is discussed as well.

  2. Elementary Health: Authorized Resources Annotated List.

    ERIC Educational Resources Information Center

    Alberta Dept. of Education, Edmonton. Curriculum Standards Branch.

    This comprehensive, annotated resource list is designed to assist in selecting resources authorized by the Alberta (Canada) Education Department for the elementary health classroom (Grades 1-6). Within each grade and topic, annotated entries for basic learning resources are listed, followed by support learning resources and authorized teaching…

  3. A semi-automatic annotation tool for cooking video

    NASA Astrophysics Data System (ADS)

    Bianco, Simone; Ciocca, Gianluigi; Napoletano, Paolo; Schettini, Raimondo; Margherita, Roberto; Marini, Gianluca; Gianforme, Giorgio; Pantaleo, Giuseppe

    2013-03-01

    In order to create a cooking assistant application to guide the users in the preparation of the dishes relevant to their profile diets and food preferences, it is necessary to accurately annotate the video recipes, identifying and tracking the foods of the cook. These videos present particular annotation challenges such as frequent occlusions, food appearance changes, etc. Manually annotating the videos is a time-consuming, tedious and error-prone task. Fully automatic tools that integrate computer vision algorithms to extract and identify the elements of interest are not error free, and false positive and false negative detections need to be corrected in a post-processing stage. We present an interactive, semi-automatic tool for the annotation of cooking videos that integrates computer vision techniques under the supervision of the user. The annotation accuracy is increased with respect to completely automatic tools and the human effort is reduced with respect to completely manual ones. The performance and usability of the proposed tool are evaluated on the basis of the time and effort required to annotate the same video sequences.

  4. MPEG-7 based video annotation and browsing

    NASA Astrophysics Data System (ADS)

    Hoeynck, Michael; Auweiler, Thorsten; Wellhausen, Jens

    2003-11-01

    The huge amount of multimedia data produced worldwide requires annotation in order to enable universal content access and to provide content-based search-and-retrieval functionalities. Since manual video annotation can be time consuming, automatic annotation systems are required. We review recent approaches to content-based indexing and annotation of videos for different kinds of sports and describe our approach to automatic annotation of equestrian sports videos. We especially concentrate on MPEG-7 based feature extraction and content description, where we apply different visual descriptors for cut detection. Further, we extract the temporal positions of single obstacles on the course by analyzing MPEG-7 edge information. Having determined single shot positions as well as the visual highlights, the information is jointly stored with meta-textual information in an MPEG-7 description scheme. Based on this information, we generate content summaries which can be utilized in a user-interface in order to provide content-based access to the video stream, but further for media browsing on a streaming server.

  5. Articles on Mass Communication in U.S. and Foreign Journals: A Selected Annotated Bibliography--January, February, March 1980.

    ERIC Educational Resources Information Center

    McKerns, Joseph P.; Delahaye, Alfred N.

    1980-01-01

    Lists and annotates more than 250 articles on mass communication, grouped according to topic. Topics include advertising, audience and communicator analysis, broadcasting, community journalism, courts and law, criticism and defense of media, education for journalism, history and biography, international, management, public relations, visual…

  6. An Annotated and Federated Digital Library of Marine Animal Sounds

    DTIC Science & Technology

    2005-01-01

    of the annotations and the relevant segment delimitation points and linkages to other relevant metadata fields; e) search engines that support the...annotators to add information to the same recording, and search engines that permit either all-annotator or specific-annotator searches. To our knowledge

  7. Harvesting NASA's Common Metadata Repository

    NASA Astrophysics Data System (ADS)

    Shum, D.; Mitchell, A. E.; Durbin, C.; Norton, J.

    2017-12-01

    As part of NASA's Earth Observing System Data and Information System (EOSDIS), the Common Metadata Repository (CMR) stores metadata for over 30,000 datasets from both NASA and international providers along with over 300M granules. This metadata enables sub-second discovery and facilitates data access. While the CMR offers a robust temporal, spatial and keyword search functionality to the general public and international community, it is sometimes more desirable for international partners to harvest the CMR metadata and merge the CMR metadata into a partner's existing metadata repository. This poster will focus on best practices to follow when harvesting CMR metadata to ensure that any changes made to the CMR can also be updated in a partner's own repository. Additionally, since each partner has distinct metadata formats they are able to consume, the best practices will also include guidance on retrieving the metadata in the desired metadata format using CMR's Unified Metadata Model translation software.

  8. A Factor Graph Approach to Automated GO Annotation

    PubMed Central

    Spetale, Flavio E.; Tapia, Elizabeth; Krsticevic, Flavia; Roda, Fernando; Bulacio, Pilar

    2016-01-01

    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum. PMID:26771463

  9. How Does the Scientific Community Contribute to Gene Ontology?

    PubMed

    Lovering, Ruth C

    2017-01-01

    Collaborations between the scientific community and members of the Gene Ontology (GO) Consortium have led to an increase in the number and specificity of GO terms, as well as increasing the number of GO annotations. A variety of approaches have been taken to encourage research scientists to contribute to the GO, but the success of these approaches has been variable. This chapter reviews both the successes and failures of engaging the scientific community in GO development and annotation, as well as providing motivation and advice to encourage individual researchers to contribute to GO.

  10. Strength analysis of piezoceramic materials for structural considerations in energy harvesting for UAVs

    NASA Astrophysics Data System (ADS)

    Anton, S. R.; Erturk, A.; Inman, D. J.

    2010-04-01

    Vibration energy harvesting has received considerable attention in the research community over the past decade. Typical vibration harvesting systems are designed to be added on to existing host structures and capture ambient vibration energy. An interesting application of vibration energy harvesting exists in unmanned aerial vehicles (UAVs), where a multifunctional approach, as opposed to the traditional method, is needed due to weight and aerodynamic considerations. The authors propose a multifunctional design for energy harvesting in UAVs where the piezoelectric harvesting device is integrated into the wing of a UAV and provides energy harvesting, energy storage, and load bearing capability. The brittle piezoceramic layer of the harvester is a critical member in load bearing applications; therefore, it is the goal of this research to investigate the bending strength of various common piezoceramic materials. Three-point bend tests are carried out on several piezoelectric ceramics including monolithic piezoceramics PZT-5A and PZT-5H, single crystal piezoelectric PMN-PZT, and commercially packaged QuickPack devices. Bending strength results are reported and can be used as a design tool in the development of piezoelectric vibration energy harvesting systems in which the active device is subjected to bending loads.

  11. Evaluation of modern cotton harvest systems on irrigated cotton: harvester performance

    USDA-ARS's Scientific Manuscript database

    Picker and stripper harvest systems were evaluated on production-scale irrigated cotton on the High Plains of Texas over three harvest seasons. Observations on harvester performance, including time-in-motion, harvest loss, seed cotton composition, and turnout, were conducted at seven locations with...

  12. Propagating annotations of molecular networks using in silico fragmentation

    PubMed Central

    da Silva, Ricardo R.; Wang, Mingxun; Fox, Evan; Balunas, Marcy J.; Klassen, Jonathan L.; Dorrestein, Pieter C.

    2018-01-01

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp. PMID:29668671

  13. Propagating annotations of molecular networks using in silico fragmentation.

    PubMed

    da Silva, Ricardo R; Wang, Mingxun; Nothias, Louis-Félix; van der Hooft, Justin J J; Caraballo-Rodríguez, Andrés Mauricio; Fox, Evan; Balunas, Marcy J; Klassen, Jonathan L; Lopes, Norberto Peporine; Dorrestein, Pieter C

    2018-04-01

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp.

  14. Gene calling and bacterial genome annotation with BG7.

    PubMed

    Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

    2015-01-01

    New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is tolerant of sequence errors and maintains its capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).

  15. The Effects of Visual and Textual Annotations on Spanish Listening Comprehension, Vocabulary Acquisition and Cognitive Load

    ERIC Educational Resources Information Center

    Cottam, Michael Evan

    2010-01-01

    The purpose of this experimental study was to investigate the effects of textual and visual annotations on Spanish listening comprehension and vocabulary acquisition in the context of an online multimedia listening activity. 95 students who were enrolled in different sections of first year Spanish classes at a community college and a large…

  16. The Viking viewer for connectomics: scalable multi-user annotation and summarization of large volume data sets.

    PubMed

    Anderson, J R; Mohammed, S; Grimm, B; Jones, B W; Koshevoy, P; Tasdizen, T; Whitaker, R; Marc, R E

    2011-01-01

    Modern microscope automation permits the collection of vast amounts of continuous anatomical imagery in both two and three dimensions. These large data sets present significant challenges for data storage, access, viewing, annotation and analysis. The cost and overhead of collecting and storing the data can be extremely high. Large data sets quickly exceed an individual's capability for timely analysis and present challenges in efficiently applying transforms, if needed. Finally annotated anatomical data sets can represent a significant investment of resources and should be easily accessible to the scientific community. The Viking application was our solution created to view and annotate a 16.5 TB ultrastructural retinal connectome volume and we demonstrate its utility in reconstructing neural networks for a distinctive retinal amacrine cell class. Viking has several key features. (1) It works over the internet using HTTP and supports many concurrent users limited only by hardware. (2) It supports a multi-user, collaborative annotation strategy. (3) It cleanly demarcates viewing and analysis from data collection and hosting. (4) It is capable of applying transformations in real-time. (5) It has an easily extensible user interface, allowing addition of specialized modules without rewriting the viewer. © 2010 The Authors Journal of Microscopy © 2010 The Royal Microscopical Society.

  17. Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

    PubMed

    Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

    2017-08-17

    Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not

  18. A Novel Approach to Semantic and Coreference Annotation at LLNL

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Firpo, M

    A case is made for the importance of high quality semantic and coreference annotation. The challenges of providing such annotation are described. Asperger's Syndrome is introduced, and the connections are drawn between the needs of text annotation and the abilities of persons with Asperger's Syndrome to meet those needs. Finally, a pilot program is recommended wherein semantic annotation is performed by people with Asperger's Syndrome. The primary points embodied in this paper are as follows: (1) Document annotation is essential to the Natural Language Processing (NLP) projects at Lawrence Livermore National Laboratory (LLNL); (2) LLNL does not currently have a system in place to meet its need for text annotation; (3) Text annotation is challenging for a variety of reasons, many related to its very rote nature; (4) Persons with Asperger's Syndrome are particularly skilled at rote verbal tasks, and behavioral experts agree that they would excel at text annotation; and (6) A pilot study is recommended in which two to three people with Asperger's Syndrome annotate documents and then the quality and throughput of their work is evaluated relative to that of their neuro-typical peers.

  19. Annotated Videography.

    ERIC Educational Resources Information Center

    United States Holocaust Memorial Museum, Washington, DC.

    This annotated list of 43 videotapes recommended for classroom use addresses various themes for teaching about the Holocaust, including: (1) overviews of the Holocaust; (2) life before the Holocaust; (3) propaganda; (4) racism, anti-Semitism; (5) "enemies of the state"; (6) ghettos; (7) camps; (8) genocide; (9) rescue; (10) resistance;…

  20. Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana

    2012-03-27

    Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent

  1. Dynamics of a delayed intraguild predation model with harvesting

    NASA Astrophysics Data System (ADS)

    Collera, Juancho A.; Balilo, Aldrin T.

    2018-03-01

    In [1], a delayed three-species intraguild predation (IGP) model was considered. This particular tri-trophic community module includes a predator and its prey which share a common basal resource for their sustenance [3]. Here, it is assumed that in the absence of predation, the growth of the basal resource follows the delayed logistic equation. Without delay time, the IGP model in [1] reduces to the system considered in [7] where it was shown that IGP may induce chaos even if the functional responses are linear. Meanwhile, in [2] the delayed IGP model in [1] was generalized to include harvesting. Under the assumption that the basal resource has some economic value, a constant harvesting term on the basal resource was incorporated. However, both models in [1] and [2] use the delay time as the main parameter. In this research, we studied the delayed IGP model in [1] with the addition of a linear harvesting term on each of the three species. The dynamical behavior of this system is examined using the harvesting rates as the main parameters. In particular, we give conditions on the existence, stability, and bifurcations of equilibrium solutions of this system. This allows us to better understand the effects of harvesting in terms of the survival or extinction of one or more species in our system. Numerical simulations are carried out to illustrate our results. In fact, we show that the chaotic behavior in [7] unfolds when the harvesting rate parameter is varied.
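
    As a hedged illustration of the building block named in this abstract, the delayed logistic equation with a linear harvesting term can be written for the basal resource alone, in the absence of predation. The symbols are assumptions, since the abstract gives none: x(t) is the resource density, r the intrinsic growth rate, K the carrying capacity, τ the delay and h the harvesting rate. The full three-species system is given in the cited papers and is not reproduced here.

```latex
\frac{dx}{dt} = r\,x(t)\left(1 - \frac{x(t-\tau)}{K}\right) - h\,x(t)
```

    Setting the right-hand side to zero with x(t) = x(t-τ) = x* gives the equilibria x* = 0 and x* = K(1 - h/r). The second is positive only when h < r, which shows in the simplest case how the harvesting rate alone can decide between survival and extinction.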

  2. Pine straw harvesting, fire, and fertilization affect understory vegetation within a Louisiana longleaf pine stand

    Treesearch

    James D. Haywood

    2012-01-01

    Pine straw harvesting can provide an economic benefit to landowners, but the practice may also change the composition of plant communities. This research was initiated in a 34-year-old stand of longleaf pine (Pinus palustris Mill.) established in 1956 to study how pine straw management practices (fertilization, prescribed fire, and straw harvesting) affected plant...

  3. Managing harvest and habitat as integrated components

    USGS Publications Warehouse

    Osnas, Erik; Runge, Michael C.; Mattsson, Brady J.; Austin, Jane E.; Boomer, G. S.; Clark, R. G.; Devers, P.; Eadie, J. M.; Lonsdorf, E. V.; Tavernia, Brian G.

    2014-01-01

    In 2007, several important initiatives in the North American waterfowl management community called for an integrated approach to habitat and harvest management. The essence of the call for integration is that harvest and habitat management affect the same resources, yet exist as separate endeavours with very different regulatory contexts. A common modelling framework could help these management streams to better understand their mutual effects. Particularly, how does successful habitat management increase harvest potential? Also, how do regional habitat programmes and large-scale harvest strategies affect continental population sizes (a metric used to express habitat goals)? In the ensuing five years, several projects took on different aspects of these challenges. While all of these projects are still on-going, and are not yet sufficiently developed to produce guidance for management decisions, they have been influential in expanding the dialogue and producing some important emerging lessons. The first lesson has been that one of the more difficult aspects of integration is not the integration across decision contexts, but the integration across spatial and temporal scales. Habitat management occurs at local and regional scales. Harvest management decisions are made at a continental scale. How do these actions, taken at different scales, combine to influence waterfowl population dynamics at all scales? The second lesson has been that consideration of the interface of habitat and harvest management can generate important insights into the objectives underlying the decision context. Often the objectives are very complex and trade-off against one another. The third lesson follows from the second – if an understanding of the fundamental objectives is paramount, there is no escaping the need for a better understanding of human dimensions, specifically the desires of hunters and nonhunters and the role they play in conservation. In the end, the compelling question is

  4. Annotations of Early Childhood Assessment Instruments.

    ERIC Educational Resources Information Center

    Texas Education Agency, Austin.

    An annotated listing of selected instruments which may be appropriate for the young child who appears to be handicapped and who may be placed in an early childhood unit for the handicapped is provided. The list is not comprehensive nor does it contain annotations from all companies which produce this type of material. It is offered to apprise…

  5. MEETING: Chlamydomonas Annotation Jamboree - October 2003

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Grossman, Arthur R

    2007-04-13

    Shotgun sequencing of the nuclear genome of Chlamydomonas reinhardtii (Chlamydomonas throughout) was performed at an approximate 10X coverage by JGI. Roughly half of the genome is now contained on 26 scaffolds, all of which are at least 1.6 Mb, and the coverage of the genome is ~95%. There are now over 200,000 cDNA sequence reads that we have generated as part of the Chlamydomonas genome project (Grossman, 2003; Shrager et al., 2003; Grossman et al. 2007; Merchant et al., 2007); other sequences have also been generated by the Kazusa sequence group (Asamizu et al., 1999; Asamizu et al., 2000) or individual laboratories that have focused on specific genes. Shrager et al. (2003) placed the reads into distinct contigs (an assemblage of reads with overlapping nucleotide sequences), and contigs that group together as part of the same genes have been designated ACEs (assembly of contigs generated from EST information). All of the reads have also been mapped to the Chlamydomonas nuclear genome and the cDNAs and their corresponding genomic sequences have been reassembled, and the resulting assemblage is called an ACEG (an Assembly of contiguous EST sequences supported by genomic sequence) (Jain et al., 2007). Most of the unique genes or ACEGs are also represented by gene models that have been generated by the Joint Genome Institute (JGI, Walnut Creek, CA). These gene models have been placed onto the DNA scaffolds and are presented as a track on the Chlamydomonas genome browser associated with the genome portal (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). Ultimately, the meeting grant awarded by DOE has helped enormously in the development of an annotation pipeline (a set of guidelines used in the annotation of genes) and resulted in high quality annotation of over 4,000 genes; the annotators were from both Europe and the USA. Some of the people who led the annotation initiative were Arthur Grossman, Olivier Vallon, and Sabeeha Merchant (with many individual

  6. Harvesting

    USDA-ARS's Scientific Manuscript database

    The spindle picker and brush-roll stripper are the two machines used to harvest cotton produced in the United States. Adoption of each harvester type is dictated by regional differences in regard to production environment, production practices, cultivar, and yield. The spindle picker is a selectiv...

  7. Grassland bird response to harvesting switchgrass as a biomass energy crop

    USGS Publications Warehouse

    Roth, A.M.; Sample, D.W.; Ribic, C.A.; Paine, L.; Undersander, D.J.; Bartelt, G.A.

    2005-01-01

    The combustion of perennial grass biomass to generate electricity may be a promising renewable energy option. Switchgrass (Panicum virgatum) grown as a biofuel has the potential to provide a cash crop for farmers and quality nesting cover for grassland birds. In southwestern Wisconsin (near lat. 42°52′, long. 90°08′), we investigated the impact of an August harvest of switchgrass for bioenergy on community composition and abundance of Wisconsin grassland bird species of management concern. Harvesting the switchgrass in August resulted in changes in vegetation structure and bird species composition the following nesting season. In harvested transects, residual vegetation was shorter and the litter layer was reduced in the year following harvest. Grassland bird species that preferred vegetation of short to moderate height and low to moderate density were found in harvested areas. Unharvested areas provided tall, dense vegetation structure that was especially attractive to tall-grass bird species, such as sedge wren (Cistothorus platensis) and Henslow's sparrow (Ammodramus henslowii). When considering wildlife habitat value in harvest management of switchgrass for biofuel, leaving some fields unharvested each year would be a good compromise, providing some habitat for a larger number of grassland bird species of management concern than if all fields were harvested annually. In areas where most idle grassland habitat present on the landscape is tallgrass, harvest of switchgrass for biofuel has the potential to increase the local diversity of grassland birds.

  8. Ontology modularization to improve semantic medical image annotation.

    PubMed

    Wennerberg, Pinar; Schulz, Klaus; Buitelaar, Paul

    2011-02-01

    Searching for medical images and patient reports is a significant challenge in a clinical setting. The contents of such documents are often not described in sufficient detail, thus making it difficult to utilize the inherent wealth of information contained within them. Semantic image annotation addresses this problem by describing the contents of images and reports using medical ontologies. Medical images and patient reports are then linked to each other through common annotations. Subsequently, search algorithms can more effectively find related sets of documents on the basis of these semantic descriptions. A prerequisite to realizing such a semantic search engine is that the data contained within should have been previously annotated with concepts from medical ontologies. One major challenge in this regard is the size and complexity of medical ontologies as annotation sources. Manual annotation is particularly time consuming and labor intensive in a clinical environment. In this article we propose an approach to reducing the size of clinical ontologies for more efficient manual image and text annotation. More precisely, our goal is to identify smaller fragments of a large anatomy ontology that are relevant for annotating medical images from patients suffering from lymphoma. Our work is in the area of ontology modularization, which is a recent and active field of research. We describe our approach, methods and data set in detail and we discuss our results. Copyright © 2010 Elsevier Inc. All rights reserved.

  9. Gene Ontology annotation of the rice blast fungus, Magnaporthe oryzae

    PubMed Central

    Meng, Shaowu; Brown, Douglas E; Ebbole, Daniel J; Torto-Alalibo, Trudy; Oh, Yeon Yee; Deng, Jixin; Mitchell, Thomas K; Dean, Ralph A

    2009-01-01

    Background Magnaporthe oryzae, the causal agent of blast disease of rice, is the most destructive disease of rice worldwide. The genome of this fungal pathogen has been sequenced and an automated annotation has recently been updated to Version 6. However, a comprehensive manual curation remains to be performed. Gene Ontology (GO) annotation is a valuable means of assigning functional information using standardized vocabulary. We report an overview of the GO annotation for Version 5 of M. oryzae genome assembly. Methods A similarity-based (i.e., computational) GO annotation with manual review was conducted, which was then integrated with a literature-based GO annotation with computational assistance. For similarity-based GO annotation a stringent reciprocal best hits method was used to identify similarity between predicted proteins of M. oryzae and GO proteins from multiple organisms with published associations to GO terms. Significant alignment pairs were manually reviewed. Functional assignments were further cross-validated with manually reviewed data, conserved domains, or data determined by wet lab experiments. Additionally, biological appropriateness of the functional assignments was manually checked. Results In total, 6,286 proteins received GO term assignment via the homology-based annotation, including 2,870 hypothetical proteins. Literature-based experimental evidence, such as microarray, MPSS, T-DNA insertion mutation, or gene knockout mutation, resulted in 2,810 proteins being annotated with GO terms. Of these, 1,673 proteins were annotated with new terms developed for Plant-Associated Microbe Gene Ontology (PAMGO). In addition, 67 experiment-determined secreted proteins were annotated with PAMGO terms. Integration of the two data sets resulted in 7,412 proteins (57%) being annotated with 1,957 distinct and specific GO terms. Unannotated proteins were assigned to the 3 root terms. The Version 5 GO annotation is publicly queryable via the GO site
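
    The reciprocal best hits step named in the Methods above can be illustrated with a minimal sketch. It assumes the similarity search results are already parsed into per-query hit lists; the function name, the identifiers, and the scores are hypothetical and are not taken from the study, whose stringency criteria and manual review are not modeled here.

```python
def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return (a, b) pairs where each protein is the other's top-scoring hit.

    a_vs_b / b_vs_a: dicts mapping a query ID to a list of (subject_id, score)
    tuples, e.g. parsed from two similarity searches run in opposite directions.
    """
    def best(hits):
        # Highest-scoring subject for each query; queries with no hits are skipped.
        return {q: max(subs, key=lambda s: s[1])[0]
                for q, subs in hits.items() if subs}

    best_ab = best(a_vs_b)
    best_ba = best(b_vs_a)
    # Keep a pair only when the best hit is mutual.
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Hypothetical toy scores (illustrative only, not data from the study).
mo_vs_go = {"MGG_1": [("GO_P1", 250.0), ("GO_P2", 90.0)],
            "MGG_2": [("GO_P1", 120.0)]}
go_vs_mo = {"GO_P1": [("MGG_1", 248.0), ("MGG_2", 118.0)],
            "GO_P2": [("MGG_2", 60.0)]}

pairs = reciprocal_best_hits(mo_vs_go, go_vs_mo)
print(pairs)
```

    In this toy input only MGG_1 and GO_P1 select each other, so MGG_2 would receive no similarity-based assignment from this step.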

  10. Adoption of safety eyewear among citrus harvesters in rural Florida.

    PubMed

    Monaghan, Paul F; Bryant, Carol A; McDermott, Robert J; Forst, Linda S; Luque, John S; Contreras, Ricardo B

    2012-06-01

    The community-based prevention marketing program planning framework was used to adapt an evidence-based intervention to address eye injuries among Florida's migrant citrus harvesters. Participant-observer techniques, other direct observations, and individual and focus group interviews provided data that guided refinement of a safety eyewear intervention. Workers were attracted to the eyewear's ability to minimize irritation, offer protection from trauma, and enable work without declines in productivity or comfort. Access to safety glasses equipped with worker-designed features reduced the perceived barriers to using them; deployment of trained peer-leaders helped promote adoption. Workers' use of safety glasses increased from less than 2% to between 28% and 37% in less than two full harvesting seasons. The combination of formative research and program implementation data provided insights for tailoring an existing evidence-based program for this occupational community and for increasing the potential for future dissemination and worker protection.

  11. [Prescription annotations in Welfare Pharmacy].

    PubMed

    Han, Yi

    2018-03-01

    Welfare Pharmacy contains medical formulas documented by the government and official prescriptions used by the official pharmacy in the pharmaceutical process. In the last years of the Southern Song Dynasty, anonymous annotators added many prescription annotations, carried out textual research on the name, source, composition and origin of the prescriptions, and supplemented important historical data on medical cases and researched historical facts. The annotations of Welfare Pharmacy gathered the essence of medical theory, and can be used as precious materials to correctly understand the syndrome differentiation, compatibility regularity and clinical application of prescriptions. This article investigated in depth the style and form of the prescription annotations in Welfare Pharmacy; the names of prescriptions and the evolution of terminology; the major functions of the prescriptions; processing methods; instructions for taking medicine and taboos of prescriptions; the medical cases and clinical efficacy of prescriptions; and the backgrounds, sources, composition and cultural meanings of prescriptions. It proposed that the prescription annotations played an active role in the textual dissemination, patent medicine production and clinical diagnosis and treatment of Welfare Pharmacy. This not only helps understand the changes in the names and terms of traditional Chinese medicines in Welfare Pharmacy, but also provides the basis for understanding the knowledge sources, compatibility regularity, important drug innovations and clinical medications of prescriptions in Welfare Pharmacy. Copyright© by the Chinese Pharmaceutical Association.

  12. APPRIS: annotation of principal and alternative splice isoforms

    PubMed Central

    Rodriguez, Jose Manuel; Maietta, Paolo; Ezkurdia, Iakes; Pietrelli, Alessandro; Wesselink, Jan-Jaap; Lopez, Gonzalo; Valencia, Alfonso; Tress, Michael L.

    2013-01-01

    Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform. PMID:23161672

  13. Harvesting in delayed food web model with omnivory

    NASA Astrophysics Data System (ADS)

    Collera, Juancho A.

    2016-02-01

    We consider a tri-trophic community module called intraguild predation (IGP) that includes a prey and its predator which share a common basal resource for their sustenance. The growth of the basal resource in the absence of predation follows the Hutchinson's equation where the delay parameter arises, while functional responses in our model are of Lotka-Volterra type. Moreover, the basal resource is harvested for its economic value with a constant harvesting rate. This work generalizes the previous works on the same model with no harvesting and no time delay. We show that the harvesting rate has to be small enough in order for the equilibria to exist. Moreover, we show that by increasing the delay parameter the stability of the equilibrium solutions may change, and periodic solutions may emerge through Hopf bifurcations. In the case of the positive equilibrium solution, multiple stability switches are obtained, and numerical continuation shows that a stable branch of periodic solutions emerges once the positive equilibrium loses its stability at the first Hopf bifurcation point. This result is important because it gives an alternative for the coexistence of all three species, avoiding extinction of one or more species when the positive equilibrium becomes unstable.
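
    The statement that the harvesting rate must be small enough for equilibria to exist can be illustrated on a simplified special case. The sketch below assumes the predator-free basal resource obeys Hutchinson's equation with constant harvesting, dx/dt = r x(t) (1 - x(t - tau)/K) - h; the symbols r, K and h are assumed notation, and the full three-species model is not given in the abstract. At equilibrium the delay drops out, so steady states solve r x (1 - x/K) = h, which has real roots only when h <= r K / 4.

```python
import math


def basal_equilibria(r, K, h):
    """Equilibria of the harvested basal resource with no predators present.

    Solves r*x*(1 - x/K) = h, i.e. x = (K/2) * (1 +/- sqrt(1 - 4h/(r*K))).
    Returns an empty list when the harvesting rate h is too large.
    """
    disc = 1.0 - 4.0 * h / (r * K)
    if disc < 0:
        return []  # harvesting exceeds the maximum growth r*K/4
    root = math.sqrt(disc)
    # A set removes the duplicate when the two roots coincide (disc == 0).
    return sorted({K / 2 * (1 - root), K / 2 * (1 + root)})


# Illustrative parameter values (assumed, not taken from the paper).
low_harvest = basal_equilibria(r=1.0, K=1.0, h=0.1875)
high_harvest = basal_equilibria(r=1.0, K=1.0, h=0.3)
print(low_harvest, high_harvest)
```

    This only covers the resource-only boundary case; the conditions for the positive three-species equilibrium in the paper involve the predation terms as well.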

  14. Leveraging the crowd for annotation of retinal images.

    PubMed

    Leifman, George; Swedish, Tristan; Roesch, Karin; Raskar, Ramesh

    2015-01-01

    Medical data presents a number of challenges. It tends to be unstructured, noisy and protected. To train algorithms to understand medical images, doctors can label the condition associated with a particular image, but obtaining enough labels can be difficult. We propose an annotation approach which starts with a small pool of expertly annotated images and uses their expertise to rate the performance of crowd-sourced annotations. In this paper we demonstrate how to apply our approach for annotation of large-scale datasets of retinal images. We introduce a novel data validation procedure which is designed to cope with noisy ground-truth data and with non-consistent input from both experts and crowd-workers.
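
    One simple way to use a small expert-labeled pool to rate crowd workers is sketched below. It is an assumed illustration, not the authors' procedure: each worker is scored by accuracy on the expert-labeled images, and labels for the remaining images are chosen by accuracy-weighted voting. All names and labels are hypothetical.

```python
from collections import defaultdict


def worker_accuracy(gold, crowd):
    """gold: {image: expert_label}; crowd: list of (worker, image, label)."""
    hits, seen = defaultdict(int), defaultdict(int)
    for worker, image, label in crowd:
        if image in gold:
            seen[worker] += 1
            hits[worker] += int(label == gold[image])
    return {w: hits[w] / seen[w] for w in seen}


def weighted_labels(gold, crowd):
    """Label non-gold images by votes weighted with each worker's accuracy."""
    acc = worker_accuracy(gold, crowd)
    votes = defaultdict(lambda: defaultdict(float))
    for worker, image, label in crowd:
        if image not in gold:
            # Workers never seen on gold images contribute zero weight.
            votes[image][label] += acc.get(worker, 0.0)
    return {img: max(tally, key=tally.get) for img, tally in votes.items()}


# Hypothetical toy data.
gold = {"img1": "healthy", "img2": "diseased"}
crowd = [
    ("w1", "img1", "healthy"), ("w1", "img2", "diseased"), ("w1", "img3", "diseased"),
    ("w2", "img1", "diseased"), ("w2", "img2", "diseased"), ("w2", "img3", "healthy"),
    ("w3", "img1", "diseased"), ("w3", "img2", "healthy"), ("w3", "img3", "healthy"),
]
accuracy = worker_accuracy(gold, crowd)
labels = weighted_labels(gold, crowd)
print(accuracy, labels)
```

    In the toy data an unweighted majority would label img3 "healthy", while the weighted vote follows the one worker who agreed with the experts on both gold images. The sketch does not address the noisy ground truth that the abstract's validation procedure is designed for.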

  15. A database of annotated tentative orthologs from crop abiotic stress transcripts.

    PubMed

    Balaji, Jayashree; Crouch, Jonathan H; Petite, Prasad V N S; Hoisington, David A

    2006-10-07

    A minimal requirement to initiate a comparative genomics study on plant responses to abiotic stresses is a dataset of orthologous sequences. The availability of a large amount of sequence information, including those derived from stress cDNA libraries, allows for the identification of stress related genes and orthologs associated with the stress response. Orthologous sequences serve as tools to explore genes and their relationships across species. For this purpose, ESTs from stress cDNA libraries across 16 crop species including 6 important cereal crops and 10 dicots were systematically collated and subjected to bioinformatics analysis such as clustering, grouping of tentative orthologous sets, identification of protein motifs/patterns in the predicted protein sequence, and annotation with stress conditions, tissue/library source and putative function. All data are available to the scientific community at http://intranet.icrisat.org/gt1/tog/homepage.htm. We believe that the availability of annotated plant abiotic stress ortholog sets will be a valuable resource for researchers studying the biology of environmental stresses in plant systems, molecular evolution and genomics.

  16. The Harvest and Management of Migratory Bird Eggs by Inuit in Nunatsiavut, Labrador

    NASA Astrophysics Data System (ADS)

    Natcher, David; Felt, Larry; Chaulk, Keith; Procter, Andrea

    2012-12-01

    This paper presents the results of collaborative research conducted in 2007 on the harvest of migratory bird eggs by Inuit households of Nunatsiavut, Labrador. Harvest variability between communities and species is examined, as are the social and ecological factors affecting the 2007 Inuit egg harvest. Representing the first comprehensive account of Inuit egg use in Labrador, this information should be valuable to agencies responsible for managing migratory bird populations in North America and will contribute to a more informed understanding of the complexity and temporal variability in subsistence harvesting among Labrador Inuit. It is argued that the recognition of this complexity will be critical as the Nunatsiavut Government and other wildlife management agencies formulate management policies that are supportive of, rather than constraining to, Inuit resource use in the future.

  17. Digital Ink: In-Class Annotation of PowerPoint Lectures

    ERIC Educational Resources Information Center

    Johnson, Anne E.

    2008-01-01

    Digital ink is a tool that, in conjunction with Microsoft PowerPoint software, allows real-time freehand annotation of presentations. Annotation of slides during class encourages student engagement with the material and problems under discussion. Digital ink annotation is a technique suitable for teaching across many disciplines, but is especially…

  18. 1971 Oregon timber harvest.

    Treesearch

    Brian R. Wall

    1972-01-01

    The 1971 Oregon timber harvest of 9.03 billion board feet was the highest since 1969 when 9.15 billion board feet was harvested. The 1971 total harvest was 13.1 percent above the 1970 figure. Western Oregon's harvest rose 11.5 percent, and eastern Oregon's harvest rose 18.6 percent.

  19. Annotated Bibliography of the Air Force Human Resources Laboratory Technical Reports - 1979.

    DTIC Science & Technology

    1981-05-01

    Force Human Resources Laboratory, March 1980. (Covers all AFHRL projects.) NTIS. This document provides the academic and industrial R&D community with... Air Force Human Resources Lab, Brooks AF TX. F/G 5/2. Annotated Bibliography of the Air Force Human Resources Laboratory Technical Reports - 1979. May 81. By Esther M. Barlow, Technical Services Division, Brooks Air Force Base

  20. Functional Annotation of the Arabidopsis Genome Using Controlled Vocabularies

    PubMed Central

    Berardini, Tanya Z.; Mundodi, Suparna; Reiser, Leonore; Huala, Eva; Garcia-Hernandez, Margarita; Zhang, Peifen; Mueller, Lukas A.; Yoon, Jungwoon; Doyle, Aisling; Lander, Gabriel; Moseyko, Nick; Yoo, Danny; Xu, Iris; Zoeckler, Brandon; Montoya, Mary; Miller, Neil; Weems, Dan; Rhee, Seung Y.

    2004-01-01

    Controlled vocabularies are increasingly used by databases to describe genes and gene products because they facilitate identification of similar genes within an organism or among different organisms. One of The Arabidopsis Information Resource's goals is to associate all Arabidopsis genes with terms developed by the Gene Ontology Consortium that describe the molecular function, biological process, and subcellular location of a gene product. We have also developed terms describing Arabidopsis anatomy and developmental stages and use these to annotate published gene expression data. As of March 2004, we used computational and manual annotation methods to make 85,666 annotations representing 26,624 unique loci. We focus on associating genes to controlled vocabulary terms based on experimental data from the literature and use The Arabidopsis Information Resource-developed PubSearch software to facilitate this process. Each annotation is tagged with a combination of evidence codes, evidence descriptions, and references that provide a robust means to assess data quality. Annotation of all Arabidopsis genes will allow quantitative comparisons between sets of genes derived from sources such as microarray experiments. The Arabidopsis annotation data will also facilitate annotation of newly sequenced plant genomes by using sequence similarity to transfer annotations to homologous genes. In addition, complete and up-to-date annotations will make unknown genes easy to identify and target for experimentation. Here, we describe the process of Arabidopsis functional annotation using a variety of data sources and illustrate several ways in which this information can be accessed and used to infer knowledge about Arabidopsis and other plant species. PMID:15173566

  1. Harnessing Collaborative Annotations on Online Formative Assessments

    ERIC Educational Resources Information Center

    Lin, Jian-Wei; Lai, Yuan-Cheng

    2013-01-01

    This paper harnesses collaborative annotations by students as learning feedback on online formative assessments to improve the learning achievements of students. Through the developed Web platform, students can conduct formative assessments, collaboratively annotate, and review historical records in a convenient way, while teachers can generate…

  2. VideoANT: Extending Online Video Annotation beyond Content Delivery

    ERIC Educational Resources Information Center

    Hosack, Bradford

    2010-01-01

    This paper expands the boundaries of video annotation in education by outlining the need for extended interaction in online video use, identifying the challenges faced by existing video annotation tools, and introducing VideoANT, a tool designed to create text-based annotations integrated within the time line of a video hosted online. Several…

  3. SNAD: Sequence Name Annotation-based Designer.

    PubMed

    Sidorov, Igor A; Reshetov, Denis A; Gorbalenya, Alexander E

    2009-08-14

    A growing diversity of biological data is tagged with unique identifiers (UIDs) associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually, which may be a tedious exercise prone to mistakes and omissions. Here we introduce SNAD (Sequence Name Annotation-based Designer), which mediates automatic conversion of sequence UIDs (associated with a multiple alignment or phylogenetic tree, or supplied as a plain text list) into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit the wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  4. Use of Annotations for Component and Framework Interoperability

    NASA Astrophysics Data System (ADS)

    David, O.; Lloyd, W.; Carlson, J.; Leavesley, G. H.; Geter, F.

    2009-12-01

    The popular programming languages Java and C# provide annotations, a form of meta-data construct. Software frameworks for web integration, web services, database access, and unit testing now take advantage of annotations to reduce the complexity of APIs and the quantity of integration code between the application and framework infrastructure. Adopting annotation features in frameworks has been observed to lead to cleaner and leaner application code. The USDA Object Modeling System (OMS) version 3.0 fully embraces the annotation approach and additionally defines a meta-data standard for components and models. In version 3.0 framework/model integration previously accomplished using API calls is now achieved using descriptive annotations. This enables the framework to provide additional functionality non-invasively such as implicit multithreading, and auto-documenting capabilities while achieving a significant reduction in the size of the model source code. Using a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework. Since models and modeling components are not directly bound to framework by the use of specific APIs and/or data types they can more easily be reused both within the framework as well as outside of it. To study the effectiveness of an annotation based framework approach with other modeling frameworks, a framework-invasiveness study was conducted to evaluate the effects of framework design on model code quality. A monthly water balance model was implemented across several modeling frameworks and several software metrics were collected. The metrics selected were measures of non-invasive design methods for modeling frameworks from a software engineering perspective. It appears that the use of annotations positively impacts several software quality measures. In a next step, the PRMS model was implemented in OMS 3.0 and is currently being implemented for water supply forecasting in the

  5. Harvesting

    Treesearch

    John R. Jones; Wayne D. Shepperd

    1985-01-01

    Harvesting is the removal of produce from the forest for utilization. It includes cutting, any further initial processing, such as topping and trimming, and extraction (Ford-Robertson 1971). Commercial intermediate cutting, such as commercial thinning, as well as regeneration cutting are included. Harvesting and the income that it produces sometimes is regarded as an...

  6. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation

    PubMed Central

    Casadio, Rita

    2017-01-01

    BAR 3.0 updates our server BAR (Bologna Annotation Resource) for predicting protein structural and functional features from sequence. We increase data volume, query capabilities and information conveyed to the user. The core of BAR 3.0 is a graph-based clustering procedure of UniProtKB sequences, following strict pairwise similarity criteria (sequence identity ≥40% with alignment coverage ≥90%). Each cluster contains the available annotation downloaded from UniProtKB, GO, PFAM and PDB. After statistical validation, GO terms and PFAM domains are cluster-specific and annotate new sequences entering the cluster after satisfying similarity constraints. BAR 3.0 includes 28 869 663 sequences in 1 361 773 clusters, of which 22.2% (22 241 661 sequences) and 47.4% (24 555 055 sequences) have at least one validated GO term and one PFAM domain, respectively. 1.4% of the clusters (36% of all sequences) include PDB structures and the cluster is associated to a hidden Markov model that allows building template-target alignment suitable for structural modeling. Some other 3 399 026 sequences are singletons. BAR 3.0 offers an improved search interface, allowing queries by UniProtKB-accession, Fasta sequence, GO-term, PFAM-domain, organism, PDB and ligand/s. When evaluated on the CAFA2 targets, BAR 3.0 largely outperforms our previous version and scores among state-of-the-art methods. BAR 3.0 is publicly available and accessible at http://bar.biocomp.unibo.it/bar3. PMID:28453653
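
    The pairwise criteria quoted above (sequence identity ≥40% with alignment coverage ≥90%) can be turned into a small graph-clustering sketch. It assumes that clusters are the connected components of the graph whose edges are the pairs passing both thresholds; the abstract does not state this, and the function name and toy alignments are hypothetical rather than BAR 3.0's actual implementation.

```python
def cluster_sequences(ids, pairs, min_identity=0.40, min_coverage=0.90):
    """Group sequences into connected components of the similarity graph.

    ids: iterable of sequence IDs.
    pairs: iterable of (id_a, id_b, identity, coverage), both as fractions.
    """
    ids = list(ids)
    parent = {s: s for s in ids}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b, identity, coverage in pairs:
        # An edge exists only if BOTH thresholds are met.
        if identity >= min_identity and coverage >= min_coverage:
            parent[find(a)] = find(b)

    clusters = {}
    for s in ids:
        clusters.setdefault(find(s), []).append(s)
    return sorted(sorted(c) for c in clusters.values())


# Hypothetical toy alignments (illustrative only).
toy_ids = ["P1", "P2", "P3", "P4"]
toy_pairs = [("P1", "P2", 0.62, 0.95),
             ("P2", "P3", 0.45, 0.91),
             ("P3", "P4", 0.80, 0.50)]  # high identity but low coverage
clusters = cluster_sequences(toy_ids, toy_pairs)
print(clusters)
```

    Here P4 fails the coverage criterion and remains a singleton, analogous to the singleton sequences reported in the abstract.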

  7. Traditional and formal ecological knowledge to assess harvesting and conservation of a Mexican Tropical Dry Forest.

    PubMed

    Monroy-Ortiz, Columba; García-Moya, Edmundo; Romero-Manzanares, Angélica; Luna-Cavazos, Mario; Monroy, Rafael

    2018-05-15

    This research integrates Traditional and Formal Ecological Knowledge (TEK / FEK) of a Tropical Dry Forest in central Mexico, in order to assess harvesting and conservation of the non-timber forest species. We were interested in: knowing the structure and diversity of the forest community; identifying which are the tree resources of common interest to the users through participatory workshops. A further interest was to identify those resources which are important to local people in terms of preservation; explaining the relationship of the species with some environmental factors; and visualizing which management practices endanger or facilitate the conservation of species. Studied areas were defined and labelled on a map drawn by local informants, where they indicated those plant species of common interest for preservation. Ethnobotanical techniques were used to reveal the TEK and assess harvesting and conservation of the species. With the FEK through community and population ecology, we detected the importance of five environmental factors, obtained various ecological indicators of the vegetation, and studied the population structure of the relevant species. The FEK was analyzed using descriptive and multivariate statistics. As a result, low density and small basal area of trees were registered. Species richness and diversity index were similar to other natural protected areas in Mexico. Harvested tree species showed an asymmetric distribution of diameters. Harvesting, elevation, and accessibility were the most influential factors on tree density. FEK demonstrated that TEK is helpful for the assessment of forest harvesting. Ecological analysis complemented the local knowledge by detecting that Lysiloma tergemina is a species that local people did not identify as being of interest, although we found that it is threatened by over-harvesting. Haematoxylum brasiletto was identified as important for conservation due to its scarcity and medicinal use. Our results advanced

  8. 1972 Oregon timber harvest.

    Treesearch

    J.D. Jr. Lloyd

    1973-01-01

    The 1972 Oregon timber harvest of 9.6 billion board feet was 602 million board feet (6.7 percent) above the 1971 harvest. Western Oregon's harvest rose 8 percent and eastern Oregon's harvest rose 2 percent.
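
    The percentage quoted above can be checked against the two absolute figures. The short sketch below derives the implied 1971 harvest and the relative increase; the variable names are illustrative, and the 9.6 billion figure is taken as stated even though it is rounded.

```python
# Figures quoted in the 1972 Oregon abstract, in board feet.
harvest_1972 = 9.6e9   # rounded to one decimal place in the abstract
increase = 602e6       # stated increase over the 1971 harvest

# Implied 1971 harvest, and the increase expressed relative to that base.
harvest_1971 = harvest_1972 - increase
percent_change = 100 * increase / harvest_1971

print(f"implied 1971 harvest: {harvest_1971 / 1e9:.3f} billion board feet")
print(f"increase over 1971: {percent_change:.1f} percent")
```

    The change is measured against the earlier (1971) harvest, which is why the base is the difference of the two quoted figures rather than the 1972 total.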

  9. Metadata and annotations for multi-scale electrophysiological data.

    PubMed

    Bower, Mark R; Stead, Matt; Brinkmann, Benjamin H; Dufendach, Kevin; Worrell, Gregory A

    2009-01-01

    The increasing use of high-frequency (kHz), long-duration (days) intracranial monitoring from multiple electrodes during pre-surgical evaluation for epilepsy produces large amounts of data that are challenging to store and maintain. Descriptive metadata and clinical annotations of these large data sets also pose challenges to simple, often manual, methods of data analysis. The problems of reliable communication of metadata and annotations between programs, the maintenance of the meanings within that information over long time periods, and the flexibility to re-sort data for analysis place differing demands on data structures and algorithms. Solutions to these individual problem domains (communication, storage and analysis) can be configured to provide easy translation and clarity across the domains. The Multi-scale Annotation Format (MAF) provides an integrated metadata and annotation environment that maximizes code reuse, minimizes error probability and encourages future changes by reducing the tendency to over-fit information technology solutions to current problems. An example of a graphical utility for generating and evaluating metadata and annotations for "big data" files is presented.

  10. Real-time image annotation by manifold-based biased Fisher discriminant analysis

    NASA Astrophysics Data System (ADS)

    Ji, Rongrong; Yao, Hongxun; Wang, Jicheng; Sun, Xiaoshuai; Liu, Xianming

    2008-01-01

    Automatic Linguistic Annotation is a promising solution to bridge the semantic gap in content-based image retrieval. However, two crucial issues are not well addressed in state-of-the-art annotation algorithms: 1. the Small Sample Size (3S) problem in keyword classifier/model learning; 2. most annotation algorithms cannot extend to real-time online usage due to their low computational efficiency. This paper presents a novel Manifold-based Biased Fisher Discriminant Analysis (MBFDA) algorithm to address these two issues by transductive semantic learning and keyword filtering. To address the 3S problem, Co-Training based Manifold learning is adopted for keyword model construction. To achieve real-time annotation, a Bias Fisher Discriminant Analysis (BFDA) based semantic feature reduction algorithm is presented for keyword confidence discrimination and semantic feature reduction. Different from all existing annotation methods, MBFDA views image annotation from a novel Eigen semantic feature (which corresponds to keywords) selection aspect. As demonstrated in experiments, our manifold-based biased Fisher discriminant analysis annotation algorithm outperforms classical and state-of-the-art annotation methods (1. K-NN Expansion; 2. One-to-All SVM; 3. PWC-SVM) in both computational time and annotation accuracy by a large margin.

  11. Annotated Catalog of Bilingual Vocational Training Materials.

    ERIC Educational Resources Information Center

    Miranda (L.) and Associates, Bethesda, MD.

    This catalog contains annotations for 170 bilingual vocational training materials. Most of the materials are written in English, but materials written in 13 source languages and directed toward speakers of 17 target languages are provided. Annotations are provided for the following different types of documents: administrative, assessment and…

  12. Harvesting NASA's Common Metadata Repository (CMR)

    NASA Technical Reports Server (NTRS)

    Shum, Dana; Durbin, Chris; Norton, James; Mitchell, Andrew

    2017-01-01

    As part of NASA's Earth Observing System Data and Information System (EOSDIS), the Common Metadata Repository (CMR) stores metadata for over 30,000 datasets from both NASA and international providers along with over 300M granules. This metadata enables sub-second discovery and facilitates data access. While the CMR offers a robust temporal, spatial and keyword search functionality to the general public and international community, it is sometimes more desirable for international partners to harvest the CMR metadata and merge the CMR metadata into a partner's existing metadata repository. This poster will focus on best practices to follow when harvesting CMR metadata to ensure that any changes made to the CMR can also be updated in a partner's own repository. Additionally, since each partner has distinct metadata formats they are able to consume, the best practices will also include guidance on retrieving the metadata in the desired metadata format using CMR's Unified Metadata Model translation software.
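
    The abstract does not spell out the best practices themselves. As a minimal sketch of the general idea of keeping a partner repository in step with a harvested source, the following merges harvested records into a local store and keeps a checkpoint for the next run. The record fields (`id`, `revised`) and both function names are hypothetical and are not taken from the CMR API.

```python
from datetime import datetime, timezone


def merge_harvest(local, harvested):
    """Upsert harvested records into a local store keyed by record id.

    A harvested record replaces the local copy only when its revision
    timestamp is newer, so repeating a harvest leaves the store unchanged.
    Returns the ids that were added or updated.
    """
    changed = []
    for record in harvested:
        rid = record["id"]            # hypothetical field name
        current = local.get(rid)
        if current is None or record["revised"] > current["revised"]:
            local[rid] = record
            changed.append(rid)
    return changed


def next_checkpoint(local, default):
    """Latest revision time in the store, used as the lower bound next run."""
    return max((r["revised"] for r in local.values()), default=default)


t0 = datetime(2017, 1, 1, tzinfo=timezone.utc)
t1 = datetime(2017, 2, 1, tzinfo=timezone.utc)
store = {"A": {"id": "A", "revised": t1, "title": "current"}}
updates = [
    {"id": "A", "revised": t0, "title": "stale copy"},
    {"id": "B", "revised": t1, "title": "new dataset"},
]
print(merge_harvest(store, updates))
print(next_checkpoint(store, t0))
```

    This sketch handles additions and updates only; records deleted at the source would need a separate mechanism.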

  13. 1975 Oregon timber harvest.

    Treesearch

    J.D. Jr. Lloyd

    1976-01-01

    The 1975 Oregon timber harvest declined to its lowest level since 1961 with a harvest of 7.37 billion board feet, 991 million board feet (11.9 percent) below the 1974 harvest. The harvest was down in both western Oregon (823 million board feet, 13.2 percent) and eastern Oregon (168 million board feet, 7.7 percent). For the first time since 1961, the harvest on private...

  14. Effects of Forest Harvesting on Ecosystem Health in the Headwaters of the New York City Water Supply, Catskill Mountains, New York

    USGS Publications Warehouse

    McHale, Michael R.; Murdoch, Peter S.; Burns, Douglas A.; Baldigo, Barry P.

    2008-01-01

    The effects of forest clearcutting and selective harvesting on forest soils, soil and stream water chemistry, forest regrowth, and aquatic communities were studied in four small headwater catchments. This research was conducted to identify the sensitivity of forested ecosystems to forest disturbance in the northeastern United States. The study area was in the headwaters of the Neversink Reservoir watershed, part of the New York City water supply system, in the Catskill Mountains of southeastern New York. Two sub-catchments of the Shelter Creek watershed were selectively harvested, one in its northern half and one more heavily in its southern half in 1995–96, the Dry Creek watershed was clearcut in the winter of 1996–97, and the Clear Creek watershed was left undisturbed and monitored as a control site. Monitoring was conducted from 4 years before the harvests until 4 years after the harvests. Clearcutting caused a large release of nitrate (NO3-) from watershed soils and a concurrent release of inorganic monomeric aluminum (Alim), which is toxic to some aquatic biota. The increased soil NO3- concentrations measured after the harvest could be completely accounted for by the decrease in nitrogen (N) uptake by watershed trees, rather than an increase in N mineralization and nitrification. The large increase in stream water NO3- and Alim concentrations caused 100-percent mortality of caged brook trout (Salvelinus fontinalis) during the first year after the clearcut and adversely affected macroinvertebrate communities for 2 years after the harvest. Nutrient uptake and biomass accumulation increased in uncut mature trees after the two selective harvests. There was no increase in stream-water NO3- or Alim concentrations, and so there were no adverse effects on macroinvertebrate or trout communities. The amount of tree biomass that can be removed without causing a sharp increase in stream-water NO3- and Alim concentrations is unknown, but probably depends on

  15. Post-harvest physiology

    USDA-ARS's Scientific Manuscript database

    Weather and management constraints, as well as the intended use of the harvested forage, all influence the forage harvest system selected by the producer. Generally, maximum retention of dry matter from harvested forage crops is achieved at moistures intermediate between the standing fresh crop and ...

  16. Sources and Information: Media Relations in Community Colleges.

    ERIC Educational Resources Information Center

    Tobolowsky, Barbara

    2000-01-01

    Presents an annotated bibliography of recent ERIC documents that provide insight into the public perception of community colleges, the potential influence of the media on public opinion regarding community colleges, institutional relations with the media, and the role that Web pages play in strategic marketing. (VWC)

  17. Host Genotype and Harvest Practices Shape the Leaf and Root Microbiomes of the Biofuel Crop Switchgrass

    NASA Astrophysics Data System (ADS)

    Singer, E.; Gonzalez, J.; Juenger, T. E.; Woyke, T.

    2016-12-01

    Growing energy demands and concerns for climate change have urgently pushed forward the timeline for the implementation of biofuel energies. Switchgrass (Panicum virgatum) is a leading biofuel crop in the United States. Bacteria living on and inside leaves and roots affect plant health; hence a plant's genetic control over its microbiota is of great interest to crop breeders and evolutionary biologists. We present a large-scale field experiment to untangle the effects of genotype, environment, soil horizon and harvest treatment practices on prokaryotic and fungal communities associated with leaves and roots of switchgrass. Using V4 16S rRNA and ITS gene as well as metagenome sequencing, we show that the effect of host genotype is significant in both leaves and roots, and varies among sites. Microbiome composition along the rhizosphere also shifts with soil depth. Furthermore, plant harvest significantly changes both leaf surface and rhizosphere communities, which can be seen a year after the harvest event. Gene function analysis shows that rhizosphere communities are enriched in genes encoding nitrate reduction, carbohydrate transport and metabolism, motility, and sensory and signal transduction proteins relative to leaf surface communities. Our results demonstrate how genotype-environment interactions contribute to the complexity of microbiome assembly in natural environments.

  18. Aggregating and Predicting Sequence Labels from Crowd Annotations

    PubMed Central

    Nguyen, An T.; Wallace, Byron C.; Li, Junyi Jessy; Nenkova, Ani; Lease, Matthew

    2017-01-01

    Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text. Given such annotations, we consider two complementary tasks: (1) aggregating sequential crowd labels to infer a best single set of consensus annotations; and (2) using crowd annotations as training data for a model that can predict sequences in unannotated text. For aggregation, we propose a novel Hidden Markov Model variant. To predict sequences in unannotated text, we propose a neural approach using Long Short Term Memory. We evaluate a suite of methods across two different applications and text genres: Named-Entity Recognition in news articles and Information Extraction from biomedical abstracts. Results show improvement over strong baselines. Our source code and data are available online. PMID:29093611
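
    To make the aggregation task concrete, the sketch below implements per-token majority voting, a simple baseline that is not the Hidden Markov Model variant proposed in the paper. It treats each token independently and so ignores the sequential structure the authors set out to model. The function name and the example tags are illustrative assumptions.

```python
from collections import Counter


def majority_vote(annotations):
    """Aggregate per-token labels from several annotators by plurality.

    `annotations` is a list of label sequences, one per annotator, all
    covering the same tokens in the same order.
    """
    if not annotations:
        return []
    length = len(annotations[0])
    if any(len(seq) != length for seq in annotations):
        raise ValueError("all annotators must label the same tokens")
    consensus = []
    for token_labels in zip(*annotations):
        # Pick the most frequent label at this token position.
        label, _count = Counter(token_labels).most_common(1)[0]
        consensus.append(label)
    return consensus


# Three hypothetical annotators tagging the same four tokens.
crowd = [
    ["B-PER", "I-PER", "O", "B-LOC"],
    ["B-PER", "O",     "O", "B-LOC"],
    ["B-PER", "I-PER", "O", "O"],
]
print(majority_vote(crowd))
```

    Because each position is decided in isolation, this baseline can emit sequences that no single annotator would produce, such as an inside tag with no preceding begin tag; sequence-aware aggregation is meant to address that weakness.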

  19. Metatranscriptomes reveal functional variation in diatom communities from the Antarctic Peninsula.

    PubMed

    Pearson, Gareth A; Lago-Leston, Asuncion; Cánovas, Fernando; Cox, Cymon J; Verret, Frederic; Lasternas, Sebastian; Duarte, Carlos M; Agusti, Susana; Serrão, Ester A

    2015-10-01

    Functional genomics of diatom-dominated communities from the Antarctic Peninsula was studied using comparative metatranscriptomics. Samples obtained from diatom-rich communities in the Bransfield Strait, the western Weddell Sea and sea ice in the Bellingshausen Sea/Wilkins Ice Shelf yielded more than 500K pyrosequencing reads that were combined to produce a global metatranscriptome assembly. Multi-gene phylogenies recovered three distinct communities, and diatom-assigned contigs further indicated little read-sharing between communities, validating an assembly-based annotation and analysis approach. Although functional analysis recovered a core of abundant shared annotations that were expressed across the three diatom communities, over 40% of annotations (but accounting for <10% of sequences) were community-specific. The two pelagic communities differed in their expression of N-metabolism and acquisition genes, which was almost absent in post-bloom conditions in the Weddell Sea community, while enrichment of transporters for ammonia and urea in Bransfield Strait diatoms suggests a physiological stance towards acquisition of reduced N-sources. The depletion of carbohydrate and energy metabolism pathways in sea ice relative to pelagic communities, together with increased light energy dissipation (via LHCSR proteins), photorespiration, and NO3(-) uptake and utilization all pointed to irradiance stress and/or inorganic carbon limitation within sea ice. Ice-binding proteins and cold-shock transcription factors were also enriched in sea ice diatoms. Surprisingly, the abundance of gene transcripts for the translational machinery tracked decreasing environmental temperature across only a 4 °C range, possibly reflecting constraints on translational efficiency and protein production in cold environments.

  20. Model and Interoperability using Meta Data Annotations

    NASA Astrophysics Data System (ADS)

    David, O.

    2011-12-01

    Software frameworks and architectures are in need of meta data to efficiently support model integration. Modelers have to know the context of a model, often stepping into modeling semantics and auxiliary information usually not provided in a concise structure and universal format, consumable by a range of (modeling) tools. XML often seems the obvious solution for capturing meta data, but its wide adoption to facilitate model interoperability is limited by XML schema fragmentation, complexity, and verbosity outside of a data-automation process. Ontologies seem to overcome those shortcomings; however, the practical significance of their use remains to be demonstrated. OMS version 3 took a different approach for meta data representation. The fundamental building block of a modular model in OMS is a software component representing a single physical process, calibration method, or data access approach. Here, programming language features known as Annotations or Attributes were adopted. Within other (non-modeling) frameworks it has been observed that annotations lead to cleaner and leaner application code. Framework-supported model integration, traditionally accomplished using Application Programming Interface (API) calls, is now achieved using descriptive code annotations. Fully annotated components for various hydrological and Ag-system models now provide information directly for (i) model assembly and building, (ii) data flow analysis for implicit multi-threading or visualization, (iii) automated and comprehensive model documentation of component dependencies, physical data properties, (iv) automated model and component testing, calibration, and optimization, and (v) automated audit-traceability to account for all model resources leading to a particular simulation result. Such a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework but a strong reference to its originating code. Since models and

  1. GAMOLA2, a Comprehensive Software Package for the Annotation and Curation of Draft and Complete Microbial Genomes

    PubMed Central

    Altermann, Eric; Lu, Jingli; McCulloch, Alan

    2017-01-01

    Expert curated annotation remains one of the critical steps in achieving a reliable, biologically relevant annotation. Here we announce the release of GAMOLA2, a user-friendly and comprehensive software package to process, annotate and curate draft and complete bacterial, archaeal, and viral genomes. GAMOLA2 represents a wrapping tool to combine gene model determination, functional Blast, COG, Pfam, and TIGRfam analyses with structural predictions including detection of tRNAs, rRNA genes, non-coding RNAs, signal protein cleavage sites, transmembrane helices, CRISPR repeats and vector sequence contaminations. GAMOLA2 has already been validated in a wide range of bacterial and archaeal genomes, and its modular concept allows easy addition of further functionality in future releases. A modified and adapted version of the Artemis Genome Viewer (Sanger Institute) has been developed to leverage the additional features and underlying information provided by the GAMOLA2 analysis, and is part of the software distribution. In addition to genome annotations, GAMOLA2 features, among others, supplemental modules that assist in the creation of custom Blast databases, annotation transfers between genome versions, and the preparation of Genbank files for submission via the NCBI Sequin tool. GAMOLA2 is intended to be run under a Linux environment, whereas the subsequent visualization and manual curation in Artemis is mobile and platform independent. The development of GAMOLA2 is ongoing and community driven. New functionality can easily be added upon user requests, ensuring that GAMOLA2 provides information relevant to microbiologists. The software is available free of charge for academic use. PMID:28386247

  2. GAMOLA2, a Comprehensive Software Package for the Annotation and Curation of Draft and Complete Microbial Genomes.

    PubMed

    Altermann, Eric; Lu, Jingli; McCulloch, Alan

    2017-01-01

    Expert curated annotation remains one of the critical steps in achieving a reliable, biologically relevant annotation. Here we announce the release of GAMOLA2, a user-friendly and comprehensive software package to process, annotate and curate draft and complete bacterial, archaeal, and viral genomes. GAMOLA2 represents a wrapping tool to combine gene model determination, functional Blast, COG, Pfam, and TIGRfam analyses with structural predictions including detection of tRNAs, rRNA genes, non-coding RNAs, signal protein cleavage sites, transmembrane helices, CRISPR repeats and vector sequence contaminations. GAMOLA2 has already been validated in a wide range of bacterial and archaeal genomes, and its modular concept allows easy addition of further functionality in future releases. A modified and adapted version of the Artemis Genome Viewer (Sanger Institute) has been developed to leverage the additional features and underlying information provided by the GAMOLA2 analysis, and is part of the software distribution. In addition to genome annotations, GAMOLA2 features, among others, supplemental modules that assist in the creation of custom Blast databases, annotation transfers between genome versions, and the preparation of Genbank files for submission via the NCBI Sequin tool. GAMOLA2 is intended to be run under a Linux environment, whereas the subsequent visualization and manual curation in Artemis is mobile and platform independent. The development of GAMOLA2 is ongoing and community driven. New functionality can easily be added upon user requests, ensuring that GAMOLA2 provides information relevant to microbiologists. The software is available free of charge for academic use.

  3. New directions in biomedical text annotation: definitions, guidelines and corpus construction

    PubMed Central

    Wilbur, W John; Rzhetsky, Andrey; Shatkay, Hagit

    2006-01-01

    Background: While biomedical text mining is emerging as an important research area, practical results have proven difficult to achieve. We believe that an important first step towards more accurate text-mining lies in the ability to identify and characterize text that satisfies various types of information needs. We report here the results of our inquiry into properties of scientific text that have sufficient generality to transcend the confines of a narrow subject area, while supporting practical mining of text for factual information. Our ultimate goal is to annotate a significant corpus of biomedical text and train machine learning methods to automatically categorize such text along certain dimensions that we have defined. Results: We have identified five qualitative dimensions that we believe characterize a broad range of scientific sentences, and are therefore useful for supporting a general approach to text-mining: focus, polarity, certainty, evidence, and directionality. We define these dimensions and describe the guidelines we have developed for annotating text with regard to them. To examine the effectiveness of the guidelines, twelve annotators independently annotated the same set of 101 sentences that were randomly selected from current biomedical periodicals. Analysis of these annotations shows 70–80% inter-annotator agreement, suggesting that our guidelines indeed present a well-defined, executable and reproducible task. Conclusion: We present our guidelines defining a text annotation task, along with annotation results from multiple independently produced annotations, demonstrating the feasibility of the task. The annotation of a very large corpus of documents along these guidelines is currently ongoing. These annotations form the basis for the categorization of text along multiple dimensions, to support viable text mining for experimental results, methodology statements, and other forms of information. We are currently developing machine learning

  4. MEGANTE: A Web-Based System for Integrated Plant Genome Annotation

    PubMed Central

    Numa, Hisataka; Itoh, Takeshi

    2014-01-01

    The recent advancement of high-throughput genome sequencing technologies has resulted in a considerable increase in demands for large-scale genome annotation. While annotation is a crucial step for downstream data analyses and experimental studies, this process requires substantial expertise and knowledge of bioinformatics. Here we present MEGANTE, a web-based annotation system that makes plant genome annotation easy for researchers unfamiliar with bioinformatics. Without any complicated configuration, users can perform genomic sequence annotations simply by uploading a sequence and selecting the species to query. MEGANTE automatically runs several analysis programs and integrates the results to select the appropriate consensus exon–intron structures and to predict open reading frames (ORFs) at each locus. Functional annotation, including a similarity search against known proteins and a functional domain search, is also performed for the predicted ORFs. The resultant annotation information is visualized with a widely used genome browser, GBrowse. For ease of analysis, the results can be downloaded in Microsoft Excel format. All of the query sequences and annotation results are stored on the server side so that users can access their own data from virtually anywhere on the web. The current release of MEGANTE targets 24 plant species from the Brassicaceae, Fabaceae, Musaceae, Poaceae, Salicaceae, Solanaceae, Rosaceae and Vitaceae families, and it allows users to submit a sequence up to 10 Mb in length and to save up to 100 sequences with the annotation information on the server. The MEGANTE web service is available at https://megante.dna.affrc.go.jp/. PMID:24253915

  5. Broadband pendulum energy harvester

    NASA Astrophysics Data System (ADS)

    Liang, Changwei; Wu, You; Zuo, Lei

    2016-09-01

    A novel electromagnetic pendulum energy harvester with a mechanical motion rectifier (MMR) is proposed and investigated in this paper. The MMR is a mechanism that rectifies the bidirectional swing motion of the pendulum into unidirectional rotation of the generator by using two one-way clutches in the gear system. Two prototypes of the pendulum energy harvester, with and without MMR, are designed and fabricated. The dynamic model of the proposed MMR pendulum energy harvester is established by considering the engagement and disengagement of the one-way clutches. The simulation results show that the proposed MMR pendulum energy harvester has a larger output power at high frequencies than the non-MMR pendulum energy harvester, a benefit of the disengagement of the one-way clutches during pendulum vibration. Moreover, the proposed MMR pendulum energy harvester is broadband compared with the non-MMR pendulum energy harvester, especially when the equivalent inertia is large. An experiment is also conducted to compare the energy harvesting performance of these two prototypes. A flywheel is attached at the end of the generator to make the disengagement more significant. The experimental results also verify that the MMR pendulum energy harvester is broadband and has a larger output power at high frequency than the non-MMR pendulum energy harvester.

  6. Special Issue: Annotated Bibliography for Volumes XIX-XXXII.

    ERIC Educational Resources Information Center

    Pullin, Richard A.

    1998-01-01

    This annotated bibliography lists 310 articles from the "Journal of Cooperative Education" from Volumes XIX-XXXII, 1983-1997. Annotations are presented in the order they appear in the journal; author and subject indexes are provided. (JOW)

  7. MIPS: analysis and annotation of genome information in 2007

    PubMed Central

    Mewes, H. W.; Dietmann, S.; Frishman, D.; Gregory, R.; Mannhaupt, G.; Mayer, K. F. X.; Münsterkötter, M.; Ruepp, A.; Spannagl, M.; Stümpflen, V.; Rattei, T.

    2008-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. Coping with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de). PMID:18158298

  8. MIPS: analysis and annotation of genome information in 2007.

    PubMed

    Mewes, H W; Dietmann, S; Frishman, D; Gregory, R; Mannhaupt, G; Mayer, K F X; Münsterkötter, M; Ruepp, A; Spannagl, M; Stümpflen, V; Rattei, T

    2008-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. Coping with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

  9. MetaStorm: A Public Resource for Customizable Metagenomics Annotation

    PubMed Central

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S.; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  10. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    PubMed

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  11. Improving Microbial Genome Annotations in an Integrated Database Context

    PubMed Central

    Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken; Anderson, Iain; Mavromatis, Konstantinos; Kyrpides, Nikos C.; Ivanova, Natalia N.

    2013-01-01

    Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule-based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems is available at http://img.jgi.doe.gov/. PMID:23424620
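
    The validation idea described above, inferring phenotypes from annotated pathways and comparing them with observations, can be sketched as follows. This is an illustration only: the rule table, pathway identifiers and function names are hypothetical placeholders and do not reproduce IMG's actual rules or interfaces.

```python
def predict_phenotypes(annotated_pathways, rules):
    """Return the phenotypes whose required pathways are all annotated."""
    present = set(annotated_pathways)
    return {name for name, required in rules.items() if set(required) <= present}


def find_discrepancies(predicted, observed):
    """List disagreements between predicted and observed phenotypes."""
    return {
        "predicted_not_observed": sorted(predicted - observed),
        "observed_not_predicted": sorted(observed - predicted),
    }


# Hypothetical rule table: phenotype -> pathways that must all be present.
rules = {
    "phenotype_X": ["pathway_1", "pathway_2"],
    "phenotype_Y": ["pathway_3"],
    "phenotype_Z": ["pathway_2", "pathway_4"],
}
genome_pathways = ["pathway_1", "pathway_2", "pathway_3"]
observed = {"phenotype_X", "phenotype_Z"}

predicted = predict_phenotypes(genome_pathways, rules)
print(find_discrepancies(predicted, observed))
```

    In this sketch an "observed but not predicted" phenotype points to a pathway that may be missing from the annotation, while a "predicted but not observed" phenotype suggests an annotation or a rule worth re-examining.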

  12. Presence of pathogenic Escherichia coli is correlated with bacterial community diversity and composition on pre-harvest cattle hides.

    PubMed

    Chopyk, Jessica; Moore, Ryan M; DiSpirito, Zachary; Stromberg, Zachary R; Lewis, Gentry L; Renter, David G; Cernicchiaro, Natalia; Moxley, Rodney A; Wommack, K Eric

    2016-03-22

    Since 1982, specific serotypes of Shiga toxin-producing Escherichia coli (STEC) have been recognized as significant foodborne pathogens acquired from contaminated beef and, more recently, other food products. Cattle are the major reservoir hosts of these organisms, and while there have been advancements in food safety practices and industry standards, STEC remains prevalent within beef cattle operations, with cattle hides implicated as major sources of carcass contamination. To investigate whether the composition of hide-specific microbial communities is associated with STEC prevalence, 16S ribosomal RNA (rRNA) bacterial community profiles were obtained from hide and fecal samples collected from a large commercial feedlot over a 3-month period. These community data were examined amidst an extensive collection of prevalence data on a subgroup of STEC that cause illness in humans, referred to as enterohemorrhagic E. coli (EHEC). Fecal 16S rRNA gene OTUs (operational taxonomic units) were subtracted from the OTUs found within each hide 16S rRNA amplicon library to identify hide-specific bacterial populations. Comparative analysis of alpha diversity revealed a significant correlation between low bacterial diversity and samples positive for the presence of E. coli O157:H7 and/or the non-O157 groups: O26, O111, O103, O121, O45, and O145. This trend occurred regardless of diversity metric or fecal OTU presence. The number of EHEC serogroups present in the samples had a compounding effect on the inverse relationship between pathogen presence and bacterial diversity. Beta diversity data showed differences in bacterial community composition between samples containing O157 and non-O157 populations, with certain OTUs demonstrating significant changes in relative abundance. The cumulative prevalence of the targeted EHEC serogroups was correlated with low bacterial community diversity on pre-harvest cattle hides. Understanding the relationship between indigenous hide

  13. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation.

    PubMed

    Profiti, Giuseppe; Martelli, Pier Luigi; Casadio, Rita

    2017-07-03

    BAR 3.0 updates our server BAR (Bologna Annotation Resource) for predicting protein structural and functional features from sequence. We increase data volume, query capabilities and information conveyed to the user. The core of BAR 3.0 is a graph-based clustering procedure of UniProtKB sequences, following strict pairwise similarity criteria (sequence identity ≥40% with alignment coverage ≥90%). Each cluster contains the available annotation downloaded from UniProtKB, GO, PFAM and PDB. After statistical validation, GO terms and PFAM domains are cluster-specific and annotate new sequences entering the cluster after satisfying similarity constraints. BAR 3.0 includes 28 869 663 sequences in 1 361 773 clusters, of which 22.2% (22 241 661 sequences) and 47.4% (24 555 055 sequences) have at least one validated GO term and one PFAM domain, respectively. 1.4% of the clusters (36% of all sequences) include PDB structures and the cluster is associated to a hidden Markov model that allows building template-target alignment suitable for structural modeling. Some other 3 399 026 sequences are singletons. BAR 3.0 offers an improved search interface, allowing queries by UniProtKB-accession, Fasta sequence, GO-term, PFAM-domain, organism, PDB and ligand/s. When evaluated on the CAFA2 targets, BAR 3.0 largely outperforms our previous version and scores among state-of-the-art methods. BAR 3.0 is publicly available and accessible at http://bar.biocomp.unibo.it/bar3. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
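
    The clustering step described above can be illustrated with a minimal sketch. It assumes that "graph-based clustering" means taking connected components of a graph whose edges are pairwise alignments passing both thresholds; the abstract gives only the thresholds, so the component rule, the function name `cluster`, and the tuple input format are hypothetical.

```python
from collections import defaultdict

IDENTITY_MIN = 0.40  # sequence identity threshold stated in the abstract
COVERAGE_MIN = 0.90  # alignment coverage threshold stated in the abstract


def cluster(sequence_ids, alignments):
    """Group sequences into connected components (assumed clustering rule).

    alignments: iterable of (id_a, id_b, identity, coverage) tuples.
    This input format is hypothetical, chosen only for illustration.
    """
    parent = {s: s for s in sequence_ids}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, identity, coverage in alignments:
        # Only pairs that satisfy both constraints become edges.
        if identity >= IDENTITY_MIN and coverage >= COVERAGE_MIN:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for s in sequence_ids:
        groups[find(s)].add(s)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


# Made-up example: P3-P4 aligns well but fails the coverage threshold.
pairs = [
    ("P1", "P2", 0.55, 0.95),
    ("P2", "P3", 0.42, 0.91),
    ("P3", "P4", 0.80, 0.50),
]
clusters = cluster(["P1", "P2", "P3", "P4"], pairs)
```

    In this toy input P4 ends up alone, which corresponds to what the abstract calls a singleton.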

  14. Dual mechanisms regulate ecosystem stability under decade-long warming and hay harvest

    PubMed Central

    Shi, Zheng; Xu, Xia; Souza, Lara; Wilcox, Kevin; Jiang, Lifen; Liang, Junyi; Xia, Jianyang; García-Palacios, Pablo; Luo, Yiqi

    2016-01-01

    Past global change studies have identified changes in species diversity as a major mechanism regulating temporal stability of production, measured as the ratio of the mean to the standard deviation of community biomass. However, the dominant plant functional group can also strongly determine the temporal stability. Here, in a grassland ecosystem subject to 15 years of experimental warming and hay harvest, we reveal that warming increases while hay harvest decreases temporal stability. This corresponds with the biomass of the dominant C4 functional group being higher under warming and lower under hay harvest. As a secondary mechanism, biodiversity also explains part of the variation in temporal stability of production. Structural equation modelling further shows that warming and hay harvest regulate temporal stability through influencing both temporal mean and variation of production. Our findings demonstrate the joint roles that dominant plant functional group and biodiversity play in regulating the temporal stability of an ecosystem under global change. PMID:27302085
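
    The stability measure defined above (mean divided by standard deviation of community biomass) is simple to compute. The sketch below is only an illustration: the biomass values are invented, and the use of the sample standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
from statistics import mean, stdev


def temporal_stability(biomass_by_year):
    """Ratio of the temporal mean to the temporal standard deviation.

    Uses the sample standard deviation (n - 1 denominator); that choice
    is an assumption, not something stated in the abstract.
    """
    return mean(biomass_by_year) / stdev(biomass_by_year)


# Invented yearly community biomass values for one hypothetical plot.
plot_biomass = [100.0, 120.0, 80.0, 110.0, 90.0]
stability = temporal_stability(plot_biomass)
```

    A treatment can raise this ratio either by raising the temporal mean or by lowering the temporal variation, which is why the abstract considers both pathways.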

  15. Annotating Socio-Cultural Structures in Text

    DTIC Science & Technology

    2012-10-31

    parts of speech (POS) within text, using the Stanford Part of Speech Tagger (Stanford Log-Linear, 2011). The ERDC-CERL taxonomy is then used to...annotated NP/VP Pane: Shows the sentence parsed using the Parts of Speech tagger Document View Pane: Specifies the document (being annotated) in three...first parsed using the Stanford Parts of Speech tagger and converted to an XML document both components which are done through the Import function

  16. Surveys of Librarians' Benefits: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Jennerich, Elaine Zaremba; And Others

    This annotated bibliography cites 39 titles of reports on academic, research, and public library conditions, which were compiled over a 2-year period by the LAMA/PAS Committee on Economic Status, Welfare and Fringe Benefits. Each annotated item was personally examined by a committee member; the six items in the addendum were not examined because…

  17. Annotations of Mexican bullfighting videos for semantic index

    NASA Astrophysics Data System (ADS)

    Montoya Obeso, Abraham; Oropesa Morales, Lester Arturo; Fernando Vázquez, Luis; Cocolán Almeda, Sara Ivonne; Stoian, Andrei; García Vázquez, Mireya Saraí; Zamudio Fuentes, Luis Miguel; Montiel Perez, Jesús Yalja; de la O Torres, Saul; Ramírez Acosta, Alejandro Alvaro

    2015-09-01

    Video annotation is important for web indexing and browsing systems. Indeed, in order to evaluate the performance of video query and mining techniques, databases with concept annotations are required. Therefore, it is necessary to generate a database with a semantic indexing that represents the digital content of the Mexican bullfighting atmosphere. This paper proposes a scheme for making complex annotations in a video within the framework of a multimedia search engine project. Each video is partitioned using our segmentation algorithm, which creates shots of different lengths and different numbers of frames. In order to make complex annotations about the video, we use ELAN software. The annotations are done in two steps: First, we take note of the whole content in each shot. Second, we describe the actions as parameters of the camera, such as direction, position and depth. As a consequence, we obtain a more complete descriptor of every action. In both cases we use the concepts of the TRECVid 2014 dataset. We also propose new concepts. This methodology makes it possible to generate a database with the information necessary to create descriptors and algorithms capable of detecting actions, in order to automatically index and classify new bullfighting multimedia content.

  18. An annotated corpus with nanomedicine and pharmacokinetic parameters

    PubMed Central

    Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

    2017-01-01

    A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. PMID:29066897

  19. Mapping annotations with textual evidence using an scLDA model.

    PubMed

    Jin, Bo; Chen, Vicky; Chen, Lujia; Lu, Xinghua

    2011-01-01

    Most of the knowledge regarding genes and proteins is stored in biomedical literature as free text. Extracting information from complex biomedical texts demands techniques capable of inferring biological concepts from local text regions and mapping them to controlled vocabularies. To this end, we present a sentence-based correspondence latent Dirichlet allocation (scLDA) model which, when trained with a corpus of PubMed documents with known GO annotations, performs the following tasks: 1) learning major biological concepts from the corpus, 2) inferring the biological concepts existing within text regions (sentences), and 3) identifying the text regions in a document that provides evidence for the observed annotations. When applied to new gene-related documents, a trained scLDA model is capable of predicting GO annotations and identifying text regions as textual evidence supporting the predicted annotations. This study uses GO annotation data as a testbed; the approach can be generalized to other annotated data, such as MeSH and MEDLINE documents.

  20. Evaluation of web-based annotation of ophthalmic images for multicentric clinical trials.

    PubMed

    Chalam, K V; Jain, P; Shah, V A; Shah, Gaurav Y

    2006-06-01

    An Internet browser-based annotation system can be used to identify and describe features in digitalized retinal images, in multicentric clinical trials, in real time. In this web-based annotation system, the user employs a mouse to draw and create annotations on a transparent layer, that encapsulates the observations and interpretations of a specific image. Multiple annotation layers may be overlaid on a single image. These layers may correspond to annotations by different users on the same image or annotations of a temporal sequence of images of a disease process, over a period of time. In addition, geometrical properties of annotated figures may be computed and measured. The annotations are stored in a central repository database on a server, which can be retrieved by multiple users in real time. This system facilitates objective evaluation of digital images and comparison of double-blind readings of digital photographs, with an identifiable audit trail. Annotation of ophthalmic images allowed clinically feasible and useful interpretation to track properties of an area of fundus pathology. This provided an objective method to monitor properties of pathologies over time, an essential component of multicentric clinical trials. The annotation system also allowed users to view stereoscopic images that are stereo pairs. This web-based annotation system is useful and valuable in monitoring patient care, in multicentric clinical trials, telemedicine, teaching and routine clinical settings.

  1. Essential Annotation Schema for Ecology (EASE)-A framework supporting the efficient data annotation and faceted navigation in ecology.

    PubMed

    Pfaff, Claas-Thido; Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Over the last decades, ecology has become a data-intensive science that often relies on the reuse of data in cross-experimental analyses. However, finding data that qualify for reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation provides a filter mechanism based on fine-grained metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters that allows users to refine the results toward greater relevance. We developed a framework for efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines.

  2. Recharge the Rain: Community Resilience Through STEM Education

    NASA Astrophysics Data System (ADS)

    Wilkening, B.; Shipek, C.

    2017-12-01

    Starting in January 2017, Recharge the Rain moves sixth through twelfth grade teachers, students and the public through a continuum from awareness, to knowledge gain, to conceptual understanding, to action; building community resiliency to hazards associated with increased temperatures, drought and flooding in Arizona. Watershed Management Group with Arizona Project WET are utilizing NOAA assets, experts from the National Weather Service and Climate Assessment for the Southwest (CLIMAS), and Pima County hazard mitigation plan and planning tools to inform citizens and galvanize their commitment to building a community, resilient to the effects of a warming climate. In the first of four years, the project is 1) developing climate-literacy curriculum with 16 Tucson-area teachers that incorporates systems-thinking and increases understanding of earth systems, weather and climate, 2) training teachers and community docents in water harvesting practices and citizen-science data collection, 3) laying the framework for the development of rainwater harvesting engineering design curriculum, 4) involving Tucson community members in water harvesting principles through project implementation workshops, special events, and tours. In years two through four, the project will build resiliency to the effects of climate threats by 1) installing student-designed rainwater harvesting systems, 2) providing community tours of schoolyard systems to educate the public, 3) expanding the program to incorporate curriculum use in Phoenix-area teachers' classrooms and 4) finalizing a replicable model for other communities facing similar threats. What are the lessons learned after one year of Recharge the Rain? How can these lessons be used to inform this project and other projects in building resilient communities?

  3. Plant genome and transcriptome annotations: from misconceptions to simple solutions

    PubMed Central

    Bolger, Marie E; Arsova, Borjana; Usadel, Björn

    2018-01-01

    Next-generation sequencing has triggered an explosion of available genomic and transcriptomic resources in the plant sciences. Although genome and transcriptome sequencing has become orders of magnitude cheaper and more efficient, often the functional annotation process is lagging behind. This might be hampered by the lack of a comprehensive enumeration of simple-to-use tools available to the plant researcher. In this comprehensive review, we present (i) typical ontologies to be used in the plant sciences, (ii) useful databases and resources used for functional annotation, (iii) what to expect from an annotated plant genome, (iv) an automated annotation pipeline and (v) a recipe and reference chart outlining typical steps used to annotate plant genomes/transcriptomes using publicly available resources. PMID:28062412

  4. IMG ER: a system for microbial genome annotation expert review and curation.

    PubMed

    Markowitz, Victor M; Mavromatis, Konstantinos; Ivanova, Natalia N; Chen, I-Min A; Chu, Ken; Kyrpides, Nikos C

    2009-09-01

    A rapidly increasing number of microbial genomes are sequenced by organizations worldwide and are eventually included into various public genome data resources. The quality of the annotations depends largely on the original dataset providers, with erroneous or incomplete annotations often carried over into the public resources and difficult to correct. We have developed an Expert Review (ER) version of the Integrated Microbial Genomes (IMG) system, with the goal of supporting systematic and efficient revision of microbial genome annotations. IMG ER provides tools for the review and curation of annotations of both new and publicly available microbial genomes within IMG's rich integrated genome framework. New genome datasets are included into IMG ER prior to their public release either with their native annotations or with annotations generated by IMG ER's annotation pipeline. IMG ER tools allow addressing annotation problems detected with IMG's comparative analysis tools, such as genes missed by gene prediction pipelines or genes without an associated function. Over the past year, IMG ER was used for improving the annotations of about 150 microbial genomes.

  5. Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'

    PubMed Central

    Draper, John; Enot, David P; Parker, David; Beckmann, Manfred; Snowdon, Stuart; Lin, Wanchang; Zubair, Hassan

    2009-01-01

    Background Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of < 5 ppm (parts per million) thus providing potentially a direct method for signal putative annotation using databases containing metabolite mass information. Most database interfaces support only simple queries with the default assumption that molecules either gain or lose a single proton when ionised. In reality the annotation process is confounded by the fact that many ionisation products will be not only molecular isotopes but also salt/solvent adducts and neutral loss fragments of original metabolites. This report describes an annotation strategy that will allow searching based on all potential ionisation products predicted to form during electrospray ionisation (ESI). Results Metabolite 'structures' harvested from publicly accessible databases were converted into a common format to generate a comprehensive archive in MZedDB. 'Rules' were derived from chemical information that allowed MZedDB to generate a list of adducts and neutral loss fragments putatively able to form for each structure and calculate, on the fly, the exact molecular weight of every potential ionisation product to provide targets for annotation searches based on accurate mass. We demonstrate that data matrices representing populations of ionisation products generated from different biological matrices contain a large proportion (sometimes > 50%) of molecular isotopes, salt adducts and neutral loss fragments. Correlation analysis of ESI-MS data features confirmed the predicted relationships of m/z signals. An integrated isotope enumerator in MZedDB allowed verification of exact isotopic pattern distributions to corroborate experimental data
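
    The idea of generating candidate ionisation products for a metabolite and matching them to an observed m/z within a ppm tolerance can be sketched as follows. This is not MZedDB's rule set: the three adducts, the function names, and the observed value are illustrative assumptions, and only singly charged species are handled. The mass shifts are standard values (proton 1.007276 Da; sodium cation approximately 22.989218 Da).

```python
# Mass shifts for a few singly charged ionisation products.
# This small table is an illustrative assumption, not MZedDB's rules.
ADDUCT_SHIFTS = {
    "[M+H]+": 1.007276,
    "[M+Na]+": 22.989218,
    "[M-H]-": -1.007276,
}


def candidate_mz(monoisotopic_mass):
    """Predicted m/z for each adduct in the table (charge magnitude 1)."""
    return {name: monoisotopic_mass + shift for name, shift in ADDUCT_SHIFTS.items()}


def ppm_error(observed_mz, theoretical_mz):
    """Signed mass error in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def annotate(observed_mz, monoisotopic_mass, tolerance_ppm=5.0):
    """Return the adducts whose predicted m/z lies within the tolerance."""
    return [
        name
        for name, mz in candidate_mz(monoisotopic_mass).items()
        if abs(ppm_error(observed_mz, mz)) <= tolerance_ppm
    ]


GLUCOSE_MASS = 180.063388  # monoisotopic mass of C6H12O6
hits = annotate(203.0530, GLUCOSE_MASS)  # hypothetical observed signal
```

    A query that assumed only protonation would miss this hypothetical signal, which is the limitation of simple database interfaces that the abstract points out.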

  6. Using comparative genome analysis to identify problems in annotated microbial genomes.

    PubMed

    Poptsova, Maria S; Gogarten, J Peter

    2010-07-01

    Genome annotation is a tedious task that is mostly done by automated methods; however, the accuracy of these approaches has been questioned since the beginning of the sequencing era. Genome annotation is a multilevel process, and errors can emerge at different stages: during sequencing, as a result of gene-calling procedures, and in the process of assigning gene functions. Missed or wrongly annotated genes differentially impact different types of analyses. Here we discuss and demonstrate how the methods of comparative genome analysis can refine annotations by locating missing orthologues. We also discuss possible reasons for errors and show that the second-generation annotation systems, which combine multiple gene-calling programs with similarity-based methods, perform much better than the first annotation tools. Since old errors may propagate to the newly sequenced genomes, we emphasize that the problem of continuously updating popular public databases is an urgent and unresolved one. Due to the progress in genome-sequencing technologies, automated annotation techniques will remain the main approach in the future. Researchers need to be aware of the existing errors in the annotation of even well-studied genomes, such as Escherichia coli, and consider additional quality control for their results.

  7. Literacy and Basic Education: A Selected, Annotated Bibliography. Annotated Bibliography #3.

    ERIC Educational Resources Information Center

    Michigan State Univ., East Lansing. Non-Formal Education Information Center.

    A selected annotated bibliography on literacy and basic education, including contributions from practitioners in the worldwide non-formal education network and compiled for them, has three interrelated themes: integration of literacy programs with broader development efforts; the learner-centered or "psycho-social" approach to literacy,…

  8. Managing and Querying Image Annotation and Markup in XML.

    PubMed

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid.

  9. Managing and Querying Image Annotation and Markup in XML

    PubMed Central

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid. PMID:21218167

  10. AutoFACT: An Automatic Functional Annotation and Classification Tool

    PubMed Central

    Koski, Liisa B; Gray, Michael W; Lang, B Franz; Burger, Gertraud

    2005-01-01

    Background Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets. Results We present AutoFACT, a fully automated and customizable annotation tool that assigns biologically informative functions to a sequence. Key features of this tool are that it (1) analyzes nucleotide and protein sequence data; (2) determines the most informative functional description by combining multiple BLAST reports from several user-selected databases; (3) assigns putative metabolic pathways, functional classes, enzyme classes, GeneOntology terms and locus names; and (4) generates output in HTML, text and GFF formats for the user's convenience. We have compared AutoFACT to four well-established annotation pipelines. The error rate of functional annotation is estimated to be only between 1–2%. Comparison of AutoFACT to the traditional top-BLAST-hit annotation method shows that our procedure increases the number of functionally informative annotations by approximately 50%. Conclusion AutoFACT will serve as a useful annotation tool for smaller sequencing groups lacking dedicated bioinformatics staff. It is implemented in PERL and runs on LINUX/UNIX platforms. AutoFACT is available at . PMID:15960857

  11. Harvesting implementation for the GI-cat distributed catalog

    NASA Astrophysics Data System (ADS)

    Boldrini, Enrico; Papeschi, Fabrizio; Bigagli, Lorenzo; Mazzetti, Paolo

    2010-05-01

    The GI-cat framework implements a distributed catalog service supporting different international standards and interoperability arrangements in use by the geoscientific community. The distribution functionality, in conjunction with the mediation functionality, makes it possible to seamlessly query remote heterogeneous data sources, including OGC Web Services (e.g. OGC CSW, WCS, WFS and WMS), community standards such as UNIDATA THREDDS/OPeNDAP, SeaDataNet CDI (Common Data Index), GBIF (Global Biodiversity Information Facility) services and OpenSearch engines. In the GI-cat modular architecture a distributor component carries out the distribution functionality by delegating queries to the mediator components (one for each different data source). Each of these mediator components is able to query a specific data source and convert the results back by mapping the foreign data model to the GI-cat internal one, based on ISO 19139. In order to cope with deployment scenarios in which local data is expected, a harvesting approach has been tested. The new strategy comes in addition to the consolidated distributed approach, allowing the user to switch between a remote and a local search at will for each federated resource; this extends GI-cat configuration possibilities. The harvesting strategy is built around a local cache component, implemented as a native XML database based on eXist. The different heterogeneous sources are queried for the bulk of available data; this data is then injected into the cache component after being converted to the GI-cat data model. The query and conversion steps are performed by the mediator components that are part of the GI-cat framework. Afterward, each new query can be run against the local data stored in the cache component. Considering both the advantages and shortcomings of the harvesting and query distribution approaches, it turns out that user-driven tuning is required to take the best

  12. Fuzzy Emotional Semantic Analysis and Automated Annotation of Scene Images

    PubMed Central

    Cao, Jianfang; Chen, Lichao

    2015-01-01

    With the advances in electronic and imaging techniques, the production of digital images has rapidly increased, and the extraction and automated annotation of emotional semantics implied by images have become issues that must be urgently addressed. To better simulate human subjectivity and ambiguity for understanding scene images, the current study proposes an emotional semantic annotation method for scene images based on fuzzy set theory. A fuzzy membership degree was calculated to describe the emotional degree of a scene image and was implemented using the Adaboost algorithm and a back-propagation (BP) neural network. The automated annotation method was trained and tested using scene images from the SUN Database. The annotation results were then compared with those based on artificial annotation. Our method showed an annotation accuracy rate of 91.2% for basic emotional values and 82.4% after extended emotional values were added, which correspond to increases of 5.5% and 8.9%, respectively, compared with the results from using a single BP neural network algorithm. Furthermore, the retrieval accuracy rate based on our method reached approximately 89%. This study attempts to lay a solid foundation for the automated emotional semantic annotation of more types of images and therefore is of practical significance. PMID:25838818

  13. Unbiased Taxonomic Annotation of Metagenomic Samples

    PubMed Central

    Fosso, Bruno; Pesole, Graziano; Rosselló, Francesc

    2018-01-01

    The classification of reads from a metagenomic sample using a reference taxonomy is usually based on first mapping the reads to the reference sequences and then classifying each read at a node under the lowest common ancestor of the candidate sequences in the reference taxonomy with the least classification error. However, this taxonomic annotation can be biased by an imbalanced taxonomy and also by the presence of multiple nodes in the taxonomy with the least classification error for a given read. In this article, we show that the Rand index is a better indicator of classification error than the often used area under the receiver operating characteristic (ROC) curve and F-measure for both balanced and imbalanced reference taxonomies, and we also address the second source of bias by reducing the taxonomic annotation problem for a whole metagenomic sample to a set cover problem, for which a logarithmic approximation can be obtained in linear time and an exact solution can be obtained by integer linear programming. Experimental results with a proof-of-concept implementation of the set cover approach to taxonomic annotation in a next release of the TANGO software show that the set cover approach further reduces ambiguity in the taxonomic annotation obtained with TANGO without distorting the relative abundance profile of the metagenomic sample. PMID:29028181
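
    The logarithmic approximation mentioned above is the kind of guarantee given by the classic greedy heuristic for set cover, sketched below. This is a generic textbook version, not the TANGO implementation: the read and taxon names are invented, and this naive loop does not attain the linear running time the abstract refers to.

```python
def greedy_set_cover(universe, candidates):
    """Greedy set cover: repeatedly pick the set covering most uncovered items.

    universe:   iterable of items to cover (here, read identifiers).
    candidates: dict mapping a label (here, a taxon) to the set of items
                it can explain. Both structures are hypothetical.
    """
    uncovered = set(universe)
    chosen = []
    while uncovered:
        # Sorting the labels first makes tie-breaking deterministic.
        best = max(sorted(candidates), key=lambda name: len(candidates[name] & uncovered))
        gained = candidates[best] & uncovered
        if not gained:
            raise ValueError("some items cannot be covered by any candidate")
        chosen.append(best)
        uncovered -= gained
    return chosen


# Invented example: five reads, four candidate taxa.
reads = ["r1", "r2", "r3", "r4", "r5"]
taxa = {
    "taxon_A": {"r1", "r2", "r3"},
    "taxon_B": {"r3", "r4"},
    "taxon_C": {"r4", "r5"},
    "taxon_D": {"r5"},
}
selected = greedy_set_cover(reads, taxa)
```

    Here two taxa suffice to explain all five reads, so the ambiguous reads r3, r4 and r5 are each resolved to a single selected taxon rather than being spread over several candidates.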

  14. Energy Harvesting & Recapture from Human Subjects: Dual-Stage MEMS Cantilever Energy Harvester

    DTIC Science & Technology

    2015-03-01

    15 Figure 5. (a) In-plane overlap-varying capacitive harvester, (b) In-plane gap-closing capacitive harvester, (c) Out -of-plane gap-closing...capacitive harvester, (c) Out -of-plane gap-closing capacitive harvester [1] The two-way arrows in each subpart of Figure 5 indicate the shuttle’s direction...are compatible with other wafer -based technologies. Bismuth Telluride (Bi2Te3), a common Seebeck thermoelectric material, is able to be processed

  15. Ten steps to get started in Genome Assembly and Annotation

    PubMed Central

    Dominguez Del Angel, Victoria; Hjerde, Erik; Sterck, Lieven; Capella-Gutierrez, Salvadors; Notredame, Cederic; Vinnere Pettersson, Olga; Amselem, Joelle; Bouri, Laurent; Bocs, Stephanie; Klopp, Christophe; Gibrat, Jean-Francois; Vlasova, Anna; Leskosek, Brane L.; Soler, Lucile; Binzer-Panchal, Mahesh; Lantz, Henrik

    2018-01-01

    As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR). PMID:29568489

  16. Determining similarity of scientific entities in annotation datasets

    PubMed Central

    Palma, Guillermo; Vidal, Maria-Esther; Haag, Eric; Raschid, Louiqa; Thor, Andreas

    2015-01-01

    Linked Open Data initiatives have made available a diversity of scientific collections where scientists have annotated entities in the datasets with controlled vocabulary terms from ontologies. Annotations encode scientific knowledge, which is captured in annotation datasets. Determining relatedness between annotated entities becomes a building block for pattern mining, e.g. identifying drug–drug relationships may depend on the similarity of the targets that interact with each drug. A diversity of similarity measures has been proposed in the literature to compute relatedness between a pair of entities. Each measure exploits some knowledge including the name, function, relationships with other entities, taxonomic neighborhood and semantic knowledge. We propose a novel general-purpose annotation similarity measure called ‘AnnSim’ that measures the relatedness between two entities based on the similarity of their annotations. We model AnnSim as a 1–1 maximum weight bipartite match and exploit properties of existing solvers to provide an efficient solution. We empirically study the performance of AnnSim on real-world datasets of drugs and disease associations from clinical trials and relationships between drugs and (genomic) targets. Using baselines that include a variety of measures, we identify where AnnSim can provide a deeper understanding of the semantics underlying the relatedness of a pair of entities or where it could lead to predicting new links or identifying potential novel patterns. Although AnnSim does not exploit knowledge or properties of a particular domain, its performance compares well with a variety of state-of-the-art domain-specific measures. Database URL: http://www.yeastgenome.org/ PMID:25725057
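
    The 1–1 maximum weight bipartite match at the core of AnnSim can be sketched for very small annotation sets by brute force. This is an illustrative sketch only: the term names and the pairwise similarity values are hypothetical, the exhaustive search stands in for the efficient solvers the abstract mentions, and any normalization that AnnSim applies to the matching weight is not shown.

```python
from itertools import permutations


def best_one_to_one_match(left, right, sim):
    """Return (total_weight, pairs) of a maximum weight 1-1 matching.

    Exhaustive search; only suitable for a handful of annotation terms.
    """
    if len(left) > len(right):
        # Always enumerate assignments of the smaller side into the larger one.
        weight, pairs = best_one_to_one_match(right, left, lambda a, b: sim(b, a))
        return weight, [(b, a) for a, b in pairs]
    best_weight, best_pairs = float("-inf"), []
    for perm in permutations(right, len(left)):
        pairs = list(zip(left, perm))
        weight = sum(sim(a, b) for a, b in pairs)
        if weight > best_weight:
            best_weight, best_pairs = weight, pairs
    return best_weight, best_pairs


# Hypothetical annotation terms of two entities and their pairwise similarities.
SIM = {
    ("t1", "u1"): 0.90, ("t1", "u2"): 0.80, ("t1", "u3"): 0.10,
    ("t2", "u1"): 0.85, ("t2", "u2"): 0.20, ("t2", "u3"): 0.30,
}
weight, pairs = best_one_to_one_match(["t1", "t2"], ["u1", "u2", "u3"], lambda a, b: SIM[(a, b)])
print(weight, pairs)
```

    Note that pairing each term with its individually best partner would match t1 with u1 and leave t2 with a weak partner; the matching formulation instead optimizes the total weight over all pairs.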

  17. Orienteering: An Annotated Bibliography = Orientierungslauf: Eine kommentierte Bibliographie.

    ERIC Educational Resources Information Center

    Seiler, Roland, Ed.; Hartmann, Wolfgang, Ed.

    1994-01-01

    Annotated bibliography of 220 books, monographs, and journal articles on orienteering published 1984-94, from SPOLIT database of the Federal Institute of Sport Science (Cologne, Germany). Annotations in English or German. Ten sections including psychological, physiological, health, sociological, and environmental aspects; training and coaching;…

  18. 1976 Oregon timber harvest.

    Treesearch

    J.D. Jr. Lloyd

    1978-01-01

    The 1976 Oregon timber harvest of 8.15 billion board feet ended a 3-year decline. The cut was 783 million board feet (10.6 percent) above the 1975 harvest. The western Oregon harvest rose 812 million board feet (15 percent) while eastern Oregon declined 29 million board feet (1.5 percent). The proportion of total harvest which comes from eastern Oregon has gradually...

  19. Annotation: a computational solution for streamlining metabolomics analysis

    PubMed Central

    Domingo-Almenara, Xavier; Montenegro-Burke, J. Rafael; Benton, H. Paul; Siuzdak, Gary

    2017-01-01

    Metabolite identification is still considered an imposing bottleneck in liquid chromatography mass spectrometry (LC/MS) untargeted metabolomics. The identification workflow usually begins with detecting relevant LC/MS peaks via peak-picking algorithms and retrieving putative identities based on accurate mass searching. However, accurate mass search alone provides poor evidence for metabolite identification. For this reason, computational annotation is used to reveal the underlying metabolites monoisotopic masses, improving putative identification in addition to confirmation with tandem mass spectrometry. This review examines LC/MS data from a computational and analytical perspective, focusing on the occurrence of neutral losses and in-source fragments, to understand the challenges in computational annotation methodologies. Herein, we examine the state-of-the-art strategies for computational annotation including: (i) peak grouping or full scan (MS1) pseudo-spectra extraction, i.e., clustering all mass spectral signals stemming from each metabolite; (ii) annotation using ion adduction and mass distance among ion peaks; (iii) incorporation of biological knowledge such as biotransformations or pathways; (iv) tandem MS data; and (v) metabolite retention time calibration, usually achieved by prediction from molecular descriptors. Advantages and pitfalls of each of these strategies are discussed, as well as expected future trends in computational annotation. PMID:29039932
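
    Strategy (ii), annotation using ion adduction and mass distance among ion peaks, can be sketched as a search for pairs of co-eluting peaks whose m/z difference matches the difference between two adduct masses. This is a minimal sketch assuming singly charged positive-mode ions; the peak list and the tolerance are hypothetical, and the function name is not taken from any tool discussed in the review.

```python
from itertools import combinations

# Mass added to the neutral monoisotopic mass M by common positive-mode adducts (Da).
ADDUCT_SHIFT = {
    "[M+H]+": 1.007276,
    "[M+NH4]+": 18.033823,
    "[M+Na]+": 22.989218,
    "[M+K]+": 38.963158,
}


def annotate_adduct_pairs(mz_values, tol=0.005):
    """Find peak pairs explained by two adducts of the same neutral mass."""
    hits = []
    for mz_low, mz_high in combinations(sorted(mz_values), 2):
        for name_low, shift_low in ADDUCT_SHIFT.items():
            for name_high, shift_high in ADDUCT_SHIFT.items():
                if shift_high <= shift_low:
                    continue  # the heavier adduct must belong to the higher m/z peak
                if abs((mz_high - mz_low) - (shift_high - shift_low)) <= tol:
                    neutral_mass = mz_low - shift_low
                    hits.append((mz_low, name_low, mz_high, name_high, round(neutral_mass, 4)))
    return hits


# Hypothetical co-eluting peaks: two adducts of one compound plus an unrelated signal.
peaks = [181.0707, 203.0526, 250.1000]
hits = annotate_adduct_pairs(peaks)
print(hits)
```

    A matching pair yields a putative monoisotopic mass for the underlying metabolite, which is generally stronger evidence than an accurate mass search on a single peak.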

  20. Annotated Bibliography on Apartheid.

    ERIC Educational Resources Information Center

    Totten, Sam, ed.

    1985-01-01

    This annotated listing on apartheid in South Africa cites general resources, classroom materials, fiction, poetry, audio visuals, and organizations and associations. Also included are a glossary and a brief chronology of South Africa's apartheid system. (RM)

  1. 1974 Oregon timber harvest.

    Treesearch

    J.D. Jr. Lloyd

    1976-01-01

    The 1974 Oregon timber harvest of 8.36 billion board feet was 9.2 percent, or 0.84 billion board feet, below the 1973 harvest. (The data for 1973 were adjusted to reflect the change in reporting of harvest on Bureau of Land Management lands; see footnote 3 of table.) While the harvest in western Oregon decreased 14.7 percent, eastern Oregon cut increased 11.5 percent...

  2. Non-Formal Education and Agriculture: A Selected Annotated Bibliography. Annotated Bibliography #10.

    ERIC Educational Resources Information Center

    Sullivan, Karen Collamore; And Others

    Intended for those actively engaged in nonformal education for development, this annotated bibliography contains approximately 300 references to documents that highlight issues concerning food production, distribution, and consumption. It also demonstrates education's role in enhancing developmental efforts to alleviate world hunger. Materials are…

  3. Analog self-powered harvester achieving switching pause control to increase harvested energy

    NASA Astrophysics Data System (ADS)

    Makihara, Kanjuro; Asahina, Kei

    2017-05-01

    In this paper, we propose a self-powered analog controller circuit to increase the efficiency of electrical energy harvesting from vibrational energy using piezoelectric materials. Although the existing synchronized switch harvesting on inductor (SSHI) method is designed to produce efficient harvesting, its switching operation generates a vibration-suppression effect that reduces the harvested levels of electrical energy. To solve this problem, the authors proposed—in a previous paper—a switching method that takes this vibration-suppression effect into account. This method temporarily pauses the switching operation, allowing the recovery of the mechanical displacement and, therefore, of the piezoelectric voltage. In this paper, we propose a self-powered analog circuit to implement this switching control method. Self-powered vibration harvesting is achieved in this study by attaching a newly designed circuit to an existing analog controller for SSHI. This circuit aims to effectively implement the aforementioned new switching control strategy, where switching is paused in some vibration peaks, in order to allow motion recovery and a consequent increase in the harvested energy. Harvesting experiments performed using the proposed circuit reveal that the proposed method can increase the energy stored in the storage capacitor by a factor of 8.5 relative to the conventional SSHI circuit. This proposed technique is useful to increase the harvested energy especially for piezoelectric systems having large coupling factor.

  4. Evaluating Functional Annotations of Enzymes Using the Gene Ontology.

    PubMed

    Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal; Babbitt, Patricia C

    2017-01-01

    The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. Every step, from identifying the nearest well-annotated homologue of a protein of interest, to predicting where misannotation has occurred, to knowing how confident you can be in the annotations assigned to those proteins, is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene product, we focus here on describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.

  5. GFam: a platform for automatic annotation of gene families.

    PubMed

    Sasidharan, Rajkumar; Nepusz, Tamás; Swarbreck, David; Huala, Eva; Paccanaro, Alberto

    2012-10-01

    We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families and derive meaningful functional labels, and it offers a seamless approach to propagate functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources, followed by a sequence-based connected component analysis of un-annotated sequence regions, to derive a consensus domain architecture for each sequence and subsequently generate families based on common architectures. For the proteome of Arabidopsis, our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points relative to the best single-constituent database within InterPro. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam's capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/.

  6. UniProtKB/Swiss-Prot, the Manually Annotated Section of the UniProt KnowledgeBase: How to Use the Entry View.

    PubMed

    Boutet, Emmanuel; Lieberherr, Damien; Tognolli, Michael; Schneider, Michel; Bansal, Parit; Bridge, Alan J; Poux, Sylvain; Bougueleret, Lydie; Xenarios, Ioannis

    2016-01-01

    The Universal Protein Resource (UniProt, http://www.uniprot.org) consortium is an initiative of the SIB Swiss Institute of Bioinformatics (SIB), the European Bioinformatics Institute (EBI) and the Protein Information Resource (PIR) to provide the scientific community with a central resource for protein sequences and functional information. The UniProt consortium maintains the UniProt KnowledgeBase (UniProtKB), updated every 4 weeks, and several supplementary databases including the UniProt Reference Clusters (UniRef) and the UniProt Archive (UniParc). The Swiss-Prot section of the UniProt KnowledgeBase (UniProtKB/Swiss-Prot) contains publicly available expertly manually annotated protein sequences obtained from a broad spectrum of organisms. Plant protein entries are produced in the frame of the Plant Proteome Annotation Program (PPAP), with an emphasis on characterized proteins of Arabidopsis thaliana and Oryza sativa. High level annotations provided by UniProtKB/Swiss-Prot are widely used to predict annotation of newly available proteins through automatic pipelines. The purpose of this chapter is to present a guided tour of a UniProtKB/Swiss-Prot entry. We will also present some of the tools and databases that are linked to each entry.

  7. ExpTreeDB: web-based query and visualization of manually annotated gene expression profiling experiments of human and mouse from GEO.

    PubMed

    Ni, Ming; Ye, Fuqiang; Zhu, Juanjuan; Li, Zongwei; Yang, Shuai; Yang, Bite; Han, Lu; Wu, Yongge; Chen, Ying; Li, Fei; Wang, Shengqi; Bo, Xiaochen

    2014-12-01

    Numerous public microarray datasets are valuable resources for the scientific communities. Several online tools have made great steps to use these data by querying related datasets with users' own gene signatures or expression profiles. However, dataset annotation and result exhibition still need to be improved. ExpTreeDB is a database that allows for queries on human and mouse microarray experiments from Gene Expression Omnibus with gene signatures or profiles. Compared with similar applications, ExpTreeDB pays more attention to dataset annotations and result visualization. We introduced a multiple-level annotation system to depict and organize original experiments. For example, a tamoxifen-treated cell line experiment is hierarchically annotated as 'agent→drug→estrogen receptor antagonist→tamoxifen'. Consequently, retrieved results are exhibited in an interactive tree-structured graphic, which provides an overview of related experiments and might enlighten users on key items of interest. The database is freely available at http://biotech.bmi.ac.cn/ExpTreeDB. The web site is implemented in Perl, PHP, R, MySQL and Apache. © The Author 2014. Published by Oxford University Press.
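
    The multiple-level annotation described above can be pictured as a tree keyed by the labels on each annotation path. The following is a minimal sketch, not ExpTreeDB's actual data model: the experiment identifiers and every path except the tamoxifen example are hypothetical.

```python
def build_annotation_tree(experiments):
    """Nest experiments under their hierarchical annotation paths."""
    tree = {}
    for exp_id, path in experiments.items():
        node = tree
        for label in path.split("→"):
            node = node.setdefault(label, {})
        # Experiments are stored at the deepest level of their path.
        node.setdefault("_experiments", []).append(exp_id)
    return tree


def experiments_under(tree, path):
    """Collect every experiment annotated at or below the given path."""
    node = tree
    for label in path.split("→"):
        node = node[label]
    found, stack = [], [node]
    while stack:
        current = stack.pop()
        for key, value in current.items():
            if key == "_experiments":
                found.extend(value)
            else:
                stack.append(value)
    return sorted(found)


# Hypothetical experiment identifiers; only the first path appears in the abstract.
experiments = {
    "EXP001": "agent→drug→estrogen receptor antagonist→tamoxifen",
    "EXP002": "agent→drug→estrogen receptor antagonist→fulvestrant",
    "EXP003": "agent→hormone→estradiol",
}
tree = build_annotation_tree(experiments)
drug_experiments = experiments_under(tree, "agent→drug")
print(drug_experiments)
```

    Querying an inner node such as 'agent→drug' returns all experiments below it, which is the kind of overview a tree-structured display gives.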

  8. Essential Annotation Schema for Ecology (EASE)—A framework supporting the efficient data annotation and faceted navigation in ecology

    PubMed Central

    Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines. PMID:29023519
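
    Faceted navigation as described above combines a full text search with filters on categorical and numeric parameters. The sketch below illustrates the idea on plain dictionaries; it does not use the EASE XML schema, and the record fields, facet names and values are hypothetical.

```python
def faceted_search(records, text=None, categorical=None, numeric=None):
    """Return ids of records matching a text query and all selected facets.

    `categorical` maps facet name -> required value;
    `numeric` maps facet name -> (low, high) inclusive range.
    """
    results = []
    for rec in records:
        if text and text.lower() not in rec["description"].lower():
            continue
        if any(rec.get(facet) != value for facet, value in (categorical or {}).items()):
            continue
        # A record lacking a numeric facet compares as NaN and is filtered out.
        if any(not (low <= rec.get(facet, float("nan")) <= high)
               for facet, (low, high) in (numeric or {}).items()):
            continue
        results.append(rec["id"])
    return results


# Hypothetical annotated datasets.
records = [
    {"id": "ds1", "description": "Leaf traits of temperate forest trees", "biome": "forest", "elevation_m": 450},
    {"id": "ds2", "description": "Grassland biomass measurements", "biome": "grassland", "elevation_m": 300},
    {"id": "ds3", "description": "Forest soil carbon along an elevation gradient", "biome": "forest", "elevation_m": 1200},
]
text_only = faceted_search(records, text="forest")
refined = faceted_search(records, text="forest",
                         categorical={"biome": "forest"},
                         numeric={"elevation_m": (0, 1000)})
print(text_only, refined)
```

    Each added facet narrows the text-only result set, which is how selecting parameters during a search refines the results towards more relevance.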

  9. Next Generation Models for Storage and Representation of Microbial Biological Annotation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Quest, Daniel J; Land, Miriam L; Brettin, Thomas S

    2010-01-01

    Background: Traditional genome annotation systems were developed in a very different computing era, one where the World Wide Web was just emerging. Consequently, these systems are built as centralized black boxes focused on generating high quality annotation submissions to GenBank/EMBL supported by expert manual curation. The exponential growth of sequence data drives a growing need for increasingly higher quality and automatically generated annotation. Typical annotation pipelines utilize traditional database technologies, clustered computing resources, Perl, C, and UNIX file systems to process raw sequence data, identify genes, and predict and categorize gene function. These technologies tightly couple the annotation software system to hardware and third party software (e.g. relational database systems and schemas). This makes annotation systems hard to reproduce, inflexible to modification over time, difficult to assess, difficult to partition across multiple geographic sites, and difficult to understand for those who are not domain experts. These systems are not readily open to scrutiny and therefore not scientifically tractable. The advent of Semantic Web standards such as Resource Description Framework (RDF) and OWL Web Ontology Language (OWL) enables us to construct systems that address these challenges in a new comprehensive way. Results: Here, we develop a framework for linking traditional data to OWL-based ontologies in genome annotation. We show how data standards can decouple hardware and third party software tools from annotation pipelines, thereby making annotation pipelines easier to reproduce and assess. An illustrative example shows how TURTLE (Terse RDF Triple Language) can be used as a human readable, but also semantically-aware, equivalent to GenBank/EMBL files. Conclusions: The power of this approach lies in its ability to assemble annotation data from multiple databases across multiple locations into a representation that is

  10. Toward a semi-mechanical harvesting platform system for harvesting blueberries with fresh-market quality

    USDA-ARS's Scientific Manuscript database

    Major concerns related to harvesting blueberries for fresh market with over-the-row (OTR) harvesters are that the quality of the fruit harvested with OTR machines is generally low and ground loss is excessive. Machine-harvested blueberries have more internal bruise and usually soften rapidly in col...

  11. Determining similarity of scientific entities in annotation datasets.

    PubMed

    Palma, Guillermo; Vidal, Maria-Esther; Haag, Eric; Raschid, Louiqa; Thor, Andreas

    2015-01-01

    Linked Open Data initiatives have made available a diversity of scientific collections where scientists have annotated entities in the datasets with controlled vocabulary terms from ontologies. Annotations encode scientific knowledge, which is captured in annotation datasets. Determining relatedness between annotated entities becomes a building block for pattern mining, e.g. identifying drug-drug relationships may depend on the similarity of the targets that interact with each drug. A diversity of similarity measures has been proposed in the literature to compute relatedness between a pair of entities. Each measure exploits some knowledge including the name, function, relationships with other entities, taxonomic neighborhood and semantic knowledge. We propose a novel general-purpose annotation similarity measure called 'AnnSim' that measures the relatedness between two entities based on the similarity of their annotations. We model AnnSim as a 1-1 maximum weight bipartite match and exploit properties of existing solvers to provide an efficient solution. We empirically study the performance of AnnSim on real-world datasets of drugs and disease associations from clinical trials and relationships between drugs and (genomic) targets. Using baselines that include a variety of measures, we identify where AnnSim can provide a deeper understanding of the semantics underlying the relatedness of a pair of entities or where it could lead to predicting new links or identifying potential novel patterns. Although AnnSim does not exploit knowledge or properties of a particular domain, its performance compares well with a variety of state-of-the-art domain-specific measures. Database URL: http://www.yeastgenome.org/ © The Author(s) 2015. Published by Oxford University Press.

  12. Gathering in Thoreau's backyard: nontimber forest product harvesting as practice

    Treesearch

    Paul Robbins; Marla Emery; Jennifer L. Rice

    2008-01-01

    Understanding of the gathering of nontimber forest products (NTFPs) in woodlands has focused heavily on politics surrounding public lands and harvester communities. Yet forest gathering may be far more universal. This paper reports the results of a survey of residents in New England, querying whether people gather wild things and for what purposes. The results suggest...

  13. Stump Harvesting

    Treesearch

    Dana Mitchell

    2009-01-01

    Increased use of forest fuel requires larger and larger procurement areas. Inclusion of stump material within the shorter distances could make this unusual source of biomass more economical to harvest. Land clearing activities are also helping to raise interest in stump harvesting. Processing stump material for biomass is an alternative...

  14. GeneTools--application for functional annotation and statistical hypothesis testing.

    PubMed

    Beisvag, Vidar; Jünge, Frode K R; Bergum, Hallgeir; Jølsum, Lars; Lydersen, Stian; Günther, Clara-Cecilie; Ramampiaro, Heri; Langaas, Mette; Sandvik, Arne K; Laegreid, Astrid

    2006-10-24

    Modern biology has shifted from "one gene" approaches to methods for genomic-scale analysis like microarray technology, which allow simultaneous measurement of thousands of genes. This has created a need for tools facilitating interpretation of biological data in "batch" mode. However, such tools often leave the investigator with large volumes of apparently unorganized information. To meet this interpretation challenge, gene-set, or cluster testing has become a popular analytical tool. Many gene-set testing methods and software packages are now available, most of which use a variety of statistical tests to assess the genes in a set for biological information. However, the field is still evolving, and there is a great need for "integrated" solutions. GeneTools is a web-service providing access to a database that brings together information from a broad range of resources. The annotation data are updated weekly, guaranteeing that users get the most recently available data. Data submitted by the user are stored in the database, where they can easily be updated, shared between users and exported in various formats. GeneTools provides three different tools: i) NMC Annotation Tool, which offers annotations from several databases like UniGene, Entrez Gene, SwissProt and GeneOntology, in both single- and batch search mode. ii) GO Annotator Tool, where users can add new gene ontology (GO) annotations to genes of interest. These user defined GO annotations can be used in further analysis or exported for public distribution. iii) eGOn, a tool for visualization and statistical hypothesis testing of GO category representation. As the first GO tool, eGOn supports hypothesis testing for three different situations (master-target situation, mutually exclusive target-target situation and intersecting target-target situation). An important additional function is an evidence-code filter that allows users to select the GO annotations for the analysis. GeneTools is the first "all in one

  15. A Selected Annotated Bibliography on Work Time Options.

    ERIC Educational Resources Information Center

    Ivantcho, Barbara

    This annotated bibliography is divided into three sections. Section I contains annotations of general publications on work time options. Section II presents resources on flexitime and the compressed work week. In Section III are found resources related to these reduced work time options: permanent part-time employment, job sharing, voluntary…

  16. A User-Driven Annotation Framework for Scientific Data

    ERIC Educational Resources Information Center

    Li, Qinglan

    2013-01-01

    Annotations play an increasingly crucial role in scientific exploration and discovery, as the amount of data and the level of collaboration among scientists increase. There are many systems today focusing on annotation management, querying, and propagation. Although all such systems are implemented to take user input (i.e., the annotations…

  17. De Novo Assembly and Functional Annotation of the Olive (Olea europaea) Transcriptome

    PubMed Central

    Muñoz-Mérida, Antonio; González-Plaza, Juan José; Cañada, Andrés; Blanco, Ana María; García-López, Maria del Carmen; Rodríguez, José Manuel; Pedrola, Laia; Sicardo, M. Dolores; Hernández, M. Luisa; De la Rosa, Raúl; Belaj, Angjelina; Gil-Borja, Mayte; Luque, Francisco; Martínez-Rivas, José Manuel; Pisano, David G.; Trelles, Oswaldo; Valpuesta, Victoriano; Beuzón, Carmen R.

    2013-01-01

    Olive breeding programmes are focused on selecting for traits such as a short juvenile period, plant architecture suited for mechanical harvest, or oil characteristics, including fatty acid composition, phenolic, and volatile compounds to suit new markets. Understanding the molecular basis of these characteristics and improving the efficiency of such breeding programmes require the development of genomic information and tools. However, despite its economic relevance, genomic information on olive or closely related species is still scarce. We have applied Sanger and 454 pyrosequencing technologies to generate close to 2 million reads from 12 cDNA libraries obtained from the Picual, Arbequina, and Lechin de Sevilla cultivars and seedlings from a segregating progeny of a Picual × Arbequina cross. The libraries include fruit mesocarp and seeds at three relevant developmental stages, young stems and leaves, active juvenile and adult buds as well as dormant buds, and juvenile and adult roots. The reads were assembled by library or tissue and then assembled together into 81 020 unigenes with an average size of 496 bases. Here, we report their assembly and their functional annotation. PMID:23297299

  18. Composite Sickles and Cereal Harvesting Methods at 23,000-Years-Old Ohalo II, Israel

    PubMed Central

    Weiss, Ehud; Nadel, Dani

    2016-01-01

    Use-wear analysis of five glossed flint blades found at Ohalo II, a 23,000-years-old fisher-hunter-gatherers’ camp on the shore of the Sea of Galilee, Northern Israel, provides the earliest evidence for the use of composite cereal harvesting tools. The wear traces indicate that tools were used for harvesting near-ripe semi-green wild cereals, shortly before grains are ripe and disperse naturally. The studied tools were not used intensively, and they reflect two harvesting modes: flint knives held by hand and inserts hafted in a handle. The finds shed new light on cereal harvesting techniques some 8,000 years before the Natufian and 12,000 years before the establishment of sedentary farming communities in the Near East. Furthermore, the new finds accord well with evidence for the earliest ever cereal cultivation at the site and the use of stone-made grinding implements. PMID:27880839

  19. Composite Sickles and Cereal Harvesting Methods at 23,000-Years-Old Ohalo II, Israel.

    PubMed

    Groman-Yaroslavski, Iris; Weiss, Ehud; Nadel, Dani

    2016-01-01

    Use-wear analysis of five glossed flint blades found at Ohalo II, a 23,000-years-old fisher-hunter-gatherers' camp on the shore of the Sea of Galilee, Northern Israel, provides the earliest evidence for the use of composite cereal harvesting tools. The wear traces indicate that tools were used for harvesting near-ripe semi-green wild cereals, shortly before grains are ripe and disperse naturally. The studied tools were not used intensively, and they reflect two harvesting modes: flint knives held by hand and inserts hafted in a handle. The finds shed new light on cereal harvesting techniques some 8,000 years before the Natufian and 12,000 years before the establishment of sedentary farming communities in the Near East. Furthermore, the new finds accord well with evidence for the earliest ever cereal cultivation at the site and the use of stone-made grinding implements.

  20. A Systematic Bioinformatics Approach to Identify High Quality Mass Spectrometry Data and Functionally Annotate Proteins and Proteomes.

    PubMed

    Islam, Mohammad Tawhidul; Mohamedali, Abidali; Ahn, Seong Beom; Nawar, Ishmam; Baker, Mark S; Ranganathan, Shoba

    2017-01-01

    In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology resulting in generation and conglomeration of vast amounts of data. Though this has led to tremendous advancements in biology, the interpretation of the data poses serious challenges for many practitioners due to the immense size and complexity of the data. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hiding within this data. We present here a simple and intuitive workflow for the research community to investigate and mine this data, not only to extract relevant data but also to segregate usable, quality data to develop hypotheses for investigation and validation. We apply an MS evidence workflow for verifying peptides of proteins from one's own data as well as publicly available databases. We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.

  1. HAMAP in 2013, new developments in the protein family classification and annotation system

    PubMed Central

    Pedruzzi, Ivo; Rivoire, Catherine; Auchincloss, Andrea H.; Coudert, Elisabeth; Keller, Guillaume; de Castro, Edouard; Baratin, Delphine; Cuche, Béatrice A.; Bougueleret, Lydie; Poux, Sylvain; Redaschi, Nicole; Xenarios, Ioannis; Bridge, Alan

    2013-01-01

    HAMAP (High-quality Automated and Manual Annotation of Proteins—available at http://hamap.expasy.org/) is a system for the classification and annotation of protein sequences. It consists of a collection of manually curated family profiles for protein classification, and associated annotation rules that specify annotations that apply to family members. HAMAP was originally developed to support the manual curation of UniProtKB/Swiss-Prot records describing microbial proteins. Here we describe new developments in HAMAP, including the extension of HAMAP to eukaryotic proteins, the use of HAMAP in the automated annotation of UniProtKB/TrEMBL, providing high-quality annotation for millions of protein sequences, and the future integration of HAMAP into a unified system for UniProtKB annotation, UniRule. HAMAP is continuously updated by expert curators with new family profiles and annotation rules as new protein families are characterized. The collection of HAMAP family classification profiles and annotation rules can be browsed and viewed on the HAMAP website, which also provides an interface to scan user sequences against HAMAP profiles. PMID:23193261

  2. TSSAR: TSS annotation regime for dRNA-seq data.

    PubMed

    Amman, Fabian; Wolfinger, Michael T; Lorenz, Ronny; Hofacker, Ivo L; Stadler, Peter F; Findeiß, Sven

    2014-03-27

    Differential RNA sequencing (dRNA-seq) is a high-throughput screening technique designed to examine the architecture of bacterial operons in general and the precise position of transcription start sites (TSS) in particular. Hitherto, dRNA-seq data were analyzed by visualizing the sequencing reads mapped to the reference genome and manually annotating reliable positions. This is very labor intensive and, due to the subjectivity, biased. Here, we present TSSAR, a tool for automated de novo TSS annotation from dRNA-seq data that respects the statistics of dRNA-seq libraries. TSSAR uses the premise that the number of sequencing reads starting at a certain genomic position within a transcriptionally active region follows a Poisson distribution with a parameter that depends on the local strength of expression. The differences of two dRNA-seq library counts thus follow a Skellam distribution. This provides a statistical basis to identify significantly enriched primary transcripts. We assessed the performance by analyzing a publicly available dRNA-seq data set using TSSAR and two simple approaches that utilize user-defined score cutoffs. We evaluated the power of reproducing the manual TSS annotation. Furthermore, the same data set was used to reproduce 74 experimentally validated TSS in H. pylori from reliable techniques such as RACE or primer extension. Both analyses showed that TSSAR outperforms the static cutoff-dependent approaches. Having an automated and efficient tool for analyzing dRNA-seq data facilitates the use of the dRNA-seq technique and promotes its application to more sophisticated analysis. For instance, monitoring the plasticity and dynamics of the transcriptomal architecture triggered by different stimuli and growth conditions becomes possible. The main asset of a novel tool for dRNA-seq analysis that reaches out to a broad user community is usability. As such, we provide TSSAR both as intuitive RESTful Web service ( http
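
    The statistical premise in the TSSAR abstract (Poisson-distributed read starts in each library, hence a Skellam-distributed difference) can be illustrated with a minimal Python sketch. This is not TSSAR's implementation: the function names, the truncation limit `n_max`, and the example counts and rates are assumptions made for illustration, and the way TSSAR estimates local expression rates is not reproduced here.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """Poisson probability P(X = k) for a rate lam > 0, computed in log space."""
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))


def skellam_upper_tail(d: int, lam_plus: float, lam_minus: float,
                       n_max: int = 200) -> float:
    """P(X_plus - X_minus >= d) for two independent Poisson counts.

    The infinite sum over X_minus is truncated at n_max (an assumption of
    this sketch), which is adequate when both rates are far below n_max.
    """
    total = 0.0
    for m in range(n_max):
        # X_plus must reach at least d + m, and a count is never negative.
        lower = max(d + m, 0)
        tail = 1.0 - sum(poisson_pmf(j, lam_plus) for j in range(lower))
        total += poisson_pmf(m, lam_minus) * max(tail, 0.0)
    return total


# Hypothetical read-start counts at one genomic position: 12 in the
# enriched library and 3 in the untreated library, tested against an
# assumed common background rate of 4 read starts per position.
p_value = skellam_upper_tail(d=12 - 3, lam_plus=4.0, lam_minus=4.0)
print(f"P(difference >= 9 under the null) = {p_value:.4g}")
```

    A small tail probability indicates that the enriched library has more read starts at that position than equal underlying rates would explain, which is the sense in which a position would count as significantly enriched in this sketch.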

  3. Developing Annotation Solutions for Online Data Driven Learning

    ERIC Educational Resources Information Center

    Perez-Paredes, Pascual; Alcaraz-Calero, Jose M.

    2009-01-01

    Although "annotation" is a widely-researched topic in Corpus Linguistics (CL), its potential role in Data Driven Learning (DDL) has not been addressed in depth by Foreign Language Teaching (FLT) practitioners. Furthermore, most of the research in the use of DDL methods pays little attention to annotation in the design and implementation…

  4. Prepare-Participate-Connect: Active Learning with Video Annotation

    ERIC Educational Resources Information Center

    Colasante, Meg; Douglas, Kathy

    2016-01-01

    Annotation of video provides students with the opportunity to view and engage with audiovisual content in an interactive and participatory way rather than in passive-receptive mode. This article discusses research into the use of video annotation in four vocational programs at RMIT University in Melbourne, which allowed students to interact with…

  5. Collaborative Annotation System Environment (CASE) for Online Learning

    ERIC Educational Resources Information Center

    Glover, Ian; Hardaker, Glenn; Xu, Zhijie

    2004-01-01

    This paper outlines the design and development process of an online annotation system and how it is applied to the sphere of collaborative online learning. The architecture and design of the annotation system, illustrated in this paper, have been developed to enrich collaborative learning content through adding a layer of information in online…

  6. Making adjustments to event annotations for improved biological event extraction.

    PubMed

    Baek, Seung-Cheol; Park, Jong C

    2016-09-16

    Current state-of-the-art approaches to biological event extraction train statistical models in a supervised manner on corpora annotated with event triggers and event-argument relations. Inspecting such corpora, we observe that there is ambiguity in the span of event triggers (e.g., "transcriptional activity" vs. "transcriptional"), leading to inconsistencies across event trigger annotations. Such inconsistencies make it quite likely that similar phrases are annotated with different spans of event triggers, suggesting the possibility that a statistical learning algorithm misses an opportunity for generalizing from such event triggers. We anticipate that adjustments to the span of event triggers to reduce these inconsistencies would meaningfully improve the present performance of event extraction systems. In this study, we look into this possibility with the corpora provided by the 2009 BioNLP shared task as a proof of concept. We propose an Informed Expectation-Maximization (EM) algorithm, which trains models using the EM algorithm with a posterior regularization technique, which consults the gold-standard event trigger annotations in a form of constraints. We further propose four constraints on the possible event trigger annotations to be explored by the EM algorithm. The algorithm is shown to outperform the state-of-the-art algorithm on the development corpus in a statistically significant manner and on the test corpus by a narrow margin. The analysis of the annotations generated by the algorithm shows that there are various types of ambiguity in event annotations, even though they could be small in number.

  7. Stand, Harvest, and Equipment Interactions in Simulated Harvesting Prescriptions

    Treesearch

    Jingxin Wang; W. Dale Greene; Bryce J. Stokes

    1998-01-01

    We evaluated potential interactions of stand type, harvesting method, and equipment in an experiment using interactive simulation. We examined three felling methods (chain saw, feller-buncher, harvester) and two extraction methods (grapple skidder and forwarder) performing clearcuts, shelterwood cuts, and single-tree selection cuts in both an uneven-aged natural stand...

  8. A Collaborative Multimedia Annotation Tool for Enhancing Knowledge Sharing in CSCL

    ERIC Educational Resources Information Center

    Yang, Stephen J. H.; Zhang, Jia; Su, Addison Y. S.; Tsai, Jeffrey J. P.

    2011-01-01

    Knowledge sharing in computer supported collaborative learning (CSCL) requires intensive social interactions among participants, typically in the form of annotations. An annotation refers to an explicit expression of knowledge that is attached to a document to reveal the conceptual meanings of an annotator's implicit thoughts. In this research, we…

  9. A survey on annotation tools for the biomedical literature.

    PubMed

    Neves, Mariana; Leser, Ulf

    2014-03-01

    New approaches to biomedical text mining crucially depend on the existence of comprehensive annotated corpora. Such corpora, commonly called gold standards, are important for learning patterns or models during the training phase, for evaluating and comparing the performance of algorithms and also for better understanding the information sought for by means of examples. Gold standards depend on human understanding and manual annotation of natural language text. This process is very time-consuming and expensive because it requires high intellectual effort from domain experts. Accordingly, the lack of gold standards is considered one of the main bottlenecks for developing novel text mining methods. This situation led to the development of tools that support humans in annotating texts. Such tools should be intuitive to use, should support a range of different input formats, should include visualization of annotated texts and should generate an easy-to-parse output format. Today, a range of tools which implement some of these functionalities are available. Here, we present a comprehensive survey of tools for supporting annotation of biomedical texts. Altogether, we considered almost 30 tools, 13 of which were selected for an in-depth comparison. The comparison was performed using predefined criteria and was accompanied by hands-on experiences whenever possible. Our survey shows that current tools can support many of the tasks in biomedical text annotation in a satisfying manner, but also that no tool can be considered a truly comprehensive solution.

  10. Dizeez: An Online Game for Human Gene-Disease Annotation

    PubMed Central

    Loguercio, Salvatore; Good, Benjamin M.; Su, Andrew I.

    2013-01-01

    Structured gene annotations are a foundation upon which many bioinformatics and statistical analyses are built. However, the structured annotations available in public databases are a sparse representation of biological knowledge as a whole. The rate of biomedical data generation is such that centralized biocuration efforts struggle to keep up. New models for gene annotation need to be explored that increase the pace at which we are able to structure biomedical knowledge. Recently, online games have emerged as an effective way to recruit, engage and organize large numbers of volunteers to help address difficult biological challenges. For example, games have been successfully developed for protein folding (Foldit), multiple sequence alignment (Phylo) and RNA structure design (EteRNA). Here we present Dizeez, a simple online game built with the purpose of structuring knowledge of gene-disease associations. Preliminary results from game play online and at scientific conferences suggest that Dizeez is producing valid gene-disease annotations not yet present in any public database. These early results provide a basic proof of principle that online games can be successfully applied to the challenge of gene annotation. Dizeez is available at http://genegames.org. PMID:23951102

  11. Moss harvest truncates the successional development of epiphytic bryophytes in the Pacific Northwest.

    PubMed

    Peck, Jerilynn E; Frelich, Lee E

    2008-01-01

    We evaluated the impact of commercial moss harvest on the development of an understory epiphyte community in the Pacific Northwest by characterizing natural development stages using data from both a long-term regrowth study and demographic sampling. First, experimentally stripped 1 m long cylindrats on 46 shrub stems in the Oregon Coast Range were monitored for species composition and abundance annually during the first five years of recovery and again in year 10. Second, a pathway of community development was inferred by examining the relative species composition and abundance of epiphytic species present in moss mats in a four-stage chronosequence. We (1) characterized the change in richness and composition from year 1 through 10 of regrowth following experimental disturbance, (2) quantified the proportion of approximately 1-, 10-, 25-, and 50-year-old moss mats of commercially harvestable species that were monodominant, diverse, and late successional, and (3) contrasted these proportions with estimates from a compositional transition matrix derived from long-term monitoring. Roughly half of the observed moss mats demonstrated neutral dynamics and were composed of a mixture of readily dispersed acrocarps and pleurocarps. The remaining half exhibited positive dynamics and were dominated by aggressively growing pleurocarpous species such as Isothecium myosuroides. Following structural developmental pathways well established for vascular plants, moss mats shift with time from high diversity and evenness in the initial colonization and extended establishment phases to increasing Isothecium dominance during a presumed competitive-exclusion phase. Old mats exist in alternate states of either Isothecium dominance or mixed composition, either of which may have late-successional species. Patchy historic commercial moss harvest likely facilitated high diversity by increasing the simultaneous occurrence of all moss mat age classes, while modern strip harvesting methods are

  12. 50 CFR 100.6 - Licenses, permits, harvest tickets, tags, and reports.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 50 Wildlife and Fisheries 8 2011-10-01 2011-10-01 false Licenses, permits, harvest tickets, tags, and reports. 100.6 Section 100.6 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE... provisions as set forth in subpart D of this part. (e) If you take fish and wildlife under a community...

  13. 50 CFR 100.6 - Licenses, permits, harvest tickets, tags, and reports.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 50 Wildlife and Fisheries 6 2010-10-01 2010-10-01 false Licenses, permits, harvest tickets, tags, and reports. 100.6 Section 100.6 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE... provisions as set forth in subpart D of this part. (e) If you take fish and wildlife under a community...

  14. 50 CFR 100.6 - Licenses, permits, harvest tickets, tags, and reports.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 50 Wildlife and Fisheries 9 2013-10-01 2013-10-01 false Licenses, permits, harvest tickets, tags, and reports. 100.6 Section 100.6 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE... provisions as set forth in subpart D of this part. (e) If you take fish and wildlife under a community...

  15. 50 CFR 100.6 - Licenses, permits, harvest tickets, tags, and reports.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 50 Wildlife and Fisheries 9 2014-10-01 2014-10-01 false Licenses, permits, harvest tickets, tags, and reports. 100.6 Section 100.6 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE... provisions as set forth in subpart D of this part. (e) If you take fish and wildlife under a community...

  16. Annotation an effective device for student feedback: a critical review of the literature.

    PubMed

    Ball, Elaine C

    2010-05-01

    The paper examines hand-written annotation, its many features, difficulties and strengths as a feedback tool. It extends and clarifies what modest evidence is in the public domain and offers an evaluation of how to use annotation effectively in the support of student feedback [Marshall, C.M., 1998a. The Future of Annotation in a Digital (paper) World. Presented at the 35th Annual GLSLIS Clinic: Successes and Failures of Digital Libraries, June 20-24, University of Illinois at Urbana-Champaign, March 24, pp. 1-20; Marshall, C.M., 1998b. Toward an ecology of hypertext annotation. Hypertext. In: Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia, June 20-24, Pittsburgh Pennsylvania, US, pp. 40-49; Wolfe, J.L., Nuewirth, C.M., 2001. From the margins to the centre: the future of annotation. Journal of Business and Technical Communication, 15(3), 333-371; Diyanni, R., 2002. One Hundred Great Essays. Addison-Wesley, New York; Wolfe, J.L., 2002. Marginal pedagogy: how annotated texts affect writing-from-source texts. Written Communication, 19(2), 297-333; Liu, K., 2006. Annotation as an index to critical writing. Urban Education, 41, 192-207; Feito, A., Donahue, P., 2008. Minding the gap annotation as preparation for discussion. Arts and Humanities in Higher Education, 7(3), 295-307; Ball, E., 2009. A participatory action research study on handwritten annotation feedback and its impact on staff and students. Systemic Practice and Action Research, 22(2), 111-124; Ball, E., Franks, H., McGrath, M., Leigh, J., 2009. Annotation is a valuable tool to enhance learning and assessment in student essays. Nurse Education Today, 29(3), 284-291]. Although a significant number of studies examine annotation, this is largely related to on-line tools and computer mediated communication and not hand-written annotation as comment, phrase or sign written on the student essay to provide critique. Little systematic research has been conducted to consider how this latter form

  17. INFINITY harvest

    NASA Image and Video Library

    2012-05-07

    Lauren Lombard from Benjamin E. Mays Preparatory School in New Orleans enjoys lettuce she helped to harvest at the INFINITY at NASA Stennis Space Center facility May 7, 2012. The Louisiana students assisted in the first harvest of lettuce from the Controlled Environment Agriculture unit, which uses an aeroponic process that involves no soil and advanced LED lighting techniques.

  18. Fog Harvesting with Harps.

    PubMed

    Shi, Weiwei; Anderson, Mark J; Tulkoff, Joshua B; Kennedy, Brook S; Boreyko, Jonathan B

    2018-04-11

    Fog harvesting is a useful technique for obtaining fresh water in arid climates. The wire meshes currently utilized for fog harvesting suffer from dual constraints: coarse meshes cannot efficiently capture microscopic fog droplets, whereas fine meshes suffer from clogging issues. Here, we design and fabricate fog harvesters comprising an array of vertical wires, which we call "fog harps". Under controlled laboratory conditions, the fog-harvesting rates for fog harps with three different wire diameters were compared to conventional meshes of equivalent dimensions. As expected for the mesh structures, the mid-sized wires exhibited the largest fog collection rate, with a drop-off in performance for the fine or coarse meshes. In contrast, the fog-harvesting rate continually increased with decreasing wire diameter for the fog harps due to efficient droplet shedding that prevented clogging. This resulted in a 3-fold enhancement in the fog-harvesting rate for the harp design compared to an equivalent mesh.

  19. AGORA: Organellar genome annotation from the amino acid and nucleotide references.

    PubMed

    Jung, Jaehee; Kim, Jong Im; Jeong, Young-Sik; Yi, Gangman

    2018-03-29

    Next-generation sequencing (NGS) technologies have led to the accumulation of high-throughput sequence data from various organisms in biology. To annotate the genes of organellar genomes across various organisms, more optimized tools for functional gene annotation are required. Almost all gene annotation tools are mainly focused on the chloroplast genome of land plants or the mitochondrial genome of animals. We have developed a web application, AGORA, for the fast, user-friendly, and improved annotation of organellar genomes. AGORA annotates genes based on a BLAST-based homology search and clustering with selected reference sequences from the NCBI database or user-defined uploaded data. AGORA can annotate the functional genes in almost all mitochondrion and plastid genomes of eukaryotes. The gene annotation of a genome with an exon-intron structure within a gene or inverted repeat region is also available. It provides the start and end positions of each gene, BLAST results compared with the reference sequence, and visualization of the gene map by OGDRAW. Users can freely use the software, and the accessible URL is https://bigdata.dongguk.edu/gene_project/AGORA/. The main module of the tool is implemented in Python and PHP, and the web page is built with HTML and CSS to support all browsers. Contact: gangman@dongguk.edu.

  20. BioSAVE: display of scored annotation within a sequence context.

    PubMed

    Pollock, Richard F; Adryan, Boris

    2008-03-20

    Visualization of sequence annotation is a common feature in many bioinformatics tools. For many applications it is desirable to restrict the display of such annotation according to a score cutoff, as biological interpretation can be difficult in the presence of the entire data. Unfortunately, many visualisation solutions are somewhat static in the way they handle such score cutoffs. We present BioSAVE, a sequence annotation viewer with on-the-fly selection of visualisation thresholds for each feature. BioSAVE is a versatile OS X program for visual display of scored features (annotation) within a sequence context. The program reads sequence and additional supplementary annotation data (e.g., position weight matrix matches, conservation scores, structural domains) from a variety of commonly used file formats and displays them graphically. Onscreen controls then allow for live customisation of these graphics, including on-the-fly selection of visualisation thresholds for each feature. Possible applications of the program include display of transcription factor binding sites in a genomic context or the visualisation of structural domain assignments in protein sequences and many more. The dynamic visualisation of these annotations is useful, e.g., for the determination of cutoff values of predicted features to match experimental data. Program, source code and exemplary files are freely available at the BioSAVE homepage.
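
    The idea of restricting displayed annotation by a per-feature score cutoff, as described in the BioSAVE abstract, can be sketched in a few lines of Python. This is only an illustration of the filtering concept, not BioSAVE's code: the `Feature` class, the `visible_features` function, the feature kinds and all scores below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Feature:
    """One scored annotation on a sequence (hypothetical structure)."""
    kind: str      # e.g. "pwm_match" or "conservation"
    start: int     # first position covered by the feature
    end: int       # last position covered by the feature
    score: float   # score assigned by the upstream prediction tool


def visible_features(features, cutoffs):
    """Return the features whose score meets the cutoff set for their kind.

    Kinds without an entry in `cutoffs` are shown unfiltered.
    """
    return [f for f in features
            if f.score >= cutoffs.get(f.kind, float("-inf"))]


# Made-up example data: two binding-site matches and one conservation block.
features = [
    Feature("pwm_match", 10, 22, 0.91),
    Feature("pwm_match", 40, 52, 0.55),
    Feature("conservation", 0, 100, 0.30),
]

# Changing a cutoff and re-running the filter mimics adjusting a threshold
# control and redrawing the display.
shown = visible_features(features, {"pwm_match": 0.8})
for f in shown:
    print(f.kind, f.start, f.end, f.score)
```

    Keeping the cutoffs in a separate mapping, rather than on the features themselves, lets each feature type be re-thresholded without touching the underlying annotation data.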

  1. 1973 Oregon timber harvest.

    Treesearch

    J.D. Lloyd, Jr.

    1974-01-01

    The 1973 Oregon timber harvest of 9.36 billion board feet was 265 million board feet (2.8 percent) below the 1972 harvest. The greater portion of the decrease occurred in eastern Oregon where timber harvest dropped 9.4 percent compared with 0.9 percent in western Oregon.
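
    The figures quoted in this abstract can be cross-checked with a short calculation. The sketch below assumes that the 2.8 percent decrease is measured against the 1972 harvest, which is implied by the abstract but not stated outright; the variable names are illustrative.

```python
# Figures taken from the abstract, in board feet.
harvest_1973 = 9.36e9   # 1973 Oregon timber harvest
decrease = 265e6        # shortfall relative to the 1972 harvest

# The 1972 harvest follows by adding the shortfall back.
harvest_1972 = harvest_1973 + decrease

# Percentage decrease relative to 1972 (assumed base year).
pct_decrease = decrease / harvest_1972 * 100

print(f"1972 harvest: {harvest_1972 / 1e9:.3f} billion board feet")
print(f"Decrease: {pct_decrease:.1f} percent")
```

    Under that assumption the implied 1972 harvest is 9.625 billion board feet, and the computed decrease rounds to the 2.8 percent reported.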

  2. Forest Management Policy and Community Well-Being in the Pacific Northwest

    Treesearch

    Susan Charnley; Ellen M. Donoghue; Cassandra Moseley

    2008-01-01

    This study uses a multiscale, multimethods approach to examine the effects of declining timber harvests on the well-being of forest communities in the Pacific Northwest as a result of the Northwest Forest Plan (the Plan). We found that the effects of declining timber harvests were variable and depended on the importance of the timber sector in a community in the late...

  3. Forest management policy and community well-being in the Pacific Northwest

    Treesearch

    Susan Charnley; Ellen M. Donoghue; Cassandra Moseley

    2008-01-01

    This study uses a multiscale, multimethods approach to examine the effects of declining timber harvests on the well-being of forest communities in the Pacific Northwest as a result of the Northwest Forest Plan (the Plan). We found that the effects of declining timber harvests were variable and depended on the importance of the timber sector in a community in the late...

  4. A new harvest operation cost model to evaluate forest harvest layout alternatives

    Treesearch

    Mark M. Clark; Russell D. Meller; Timothy P. McDonald; Chao Chi Ting

    1997-01-01

    The authors develop a new model for harvest operation costs that can be used to evaluate stands for potential harvest. The model is based on felling, extraction, and access costs, and is unique in its consideration of the interaction between harvest area shapes and access roads. The scientists illustrate the model and evaluate the impact of stand size, volume, and road...

  5. Recovery and diversity of the forest shrub community 38 years after biomass harvesting in the northern Rocky Mountains

    Treesearch

    Woongsoon Jang; Christopher R. Keyes; Deborah S. Page-Dumroese

    2016-01-01

    We investigated the long-term impact of biomass utilization on shrub recovery, species composition, and biodiversity 38 years after harvesting at Coram Experimental Forest in northwestern Montana. Three levels of biomass removal intensity (high, medium, and low) treatments combined with prescribed burning treatment were nested within three regeneration harvest...

  6. Ghostwriting: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Simmons, Donald B.

    Drawn from communication journals, historical and news magazines, business and industrial magazines, political science and world affairs journals, general interest periodicals, and literary and political review magazines, the approximately 90 entries in this annotated bibliography discuss ghostwriting as practiced through the ages and reveal the…

  7. Annotation of Ehux ESTs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuo, Alan; Grigoriev, Igor

    2009-06-12

    22 percent of ESTs do not align with scaffolds. The EST pipeline assembles 17,126 consensus sequences from the non-aligned ESTs. The annotation pipeline predicts 8,564 ORFs on the consensus sequences. Domain analysis of ORFs reveals missing genes. Cluster analysis reveals missing genes. Expression analysis reveals potential strain-specific genes.

  8. Annotated Bibliography. First Edition.

    ERIC Educational Resources Information Center

    Haring, Norris G.

    An annotated bibliography which presents approximately 300 references from 1951 to 1973 on the education of severely/profoundly handicapped persons. Citations are grouped alphabetically by author's name within the following categories: characteristics and treatment, gross motor development, sensory and motor development, physical therapy for the…

  9. Maize - GO annotation methods, evaluation, and review (Maize-GAMER)

    USDA-ARS's Scientific Manuscript database

    Making a genome sequence accessible and useful involves three basic steps: genome assembly, structural annotation, and functional annotation. The quality of data generated at each step influences the accuracy of inferences that can be made, with high-quality analyses producing better datasets resultin...

  10. Transcriptome sequencing and de novo annotation of the critically endangered Adriatic sturgeon.

    PubMed

    Vidotto, Michele; Grapputo, Alessandro; Boscari, Elisa; Barbisan, Federica; Coppe, Alessandro; Grandi, Gilberto; Kumar, Abhishek; Congiu, Leonardo

    2013-06-18

    Sturgeons are a group of Chondrostean fish of very high evolutionary, economic and conservation interest. The eggs of these living fossils are among the most highly prized foods of animal origin. Intense fishing pressure on wild stocks to harvest caviar has caused a dramatic decline in their distribution and abundance over recent decades, leading the International Union for Conservation of Nature to list them as the most endangered group of species. As a direct consequence, world-wide efforts have been made to develop sturgeon aquaculture programmes for caviar production. In this context, the characterization of the genes involved in sex determination could provide relevant information for the selective farming of the more profitable females. The 454 sequencing of two cDNA libraries from the gonads and brain of one male and one female full-sib A. naccarii yielded 182,066 and 167,776 reads respectively, which, after strict quality control, were iteratively assembled into more than 55,000 high-quality ESTs. The average per-base coverage reached by assembling the two libraries was 4X. The multi-step annotation process resulted in 16% of sequences being successfully annotated with GO terms. We screened the transcriptome for 32 sex-related genes and highlighted 7 genes that are potentially specifically expressed, 5 in males and 2 in females, at the first life stage at which sex is histologically identifiable. In addition, we identified 21,791 putative EST-linked SNPs and 5,295 SSRs. This study represents the first large-scale release of sturgeon transcriptome information, which we organized into the public database AnaccariiBase, freely available at http://compgen.bio.unipd.it/anaccariibase/. These transcriptomic data represent an important source of information for further studies on sturgeon species. The hundreds of putative EST-linked molecular markers discovered in this study will be invaluable for sturgeon reintroduction and breeding programs.

  11. Harvesting wood for energy.

    Treesearch

    Rodger A. Arola; Edwin W. Miyata

    1981-01-01

    Illustrates the potential of harvesting wood for industrial energy, based on the results of five harvesting studies. Presents information on harvesting operations, equipment costs, and productivity. Discusses mechanized thinning of hardwoods, clearcutting of low-value stands and recovery of hardwood tops and limbs. Also includes basic information on the physical and...

  12. Ecological impacts of energy-wood harvests: lessons from whole-tree harvesting and natural disturbance

    USGS Publications Warehouse

    Berger, Alaina L.; Palik, Brian; D'Amato, Anthony W.; Fraver, Shawn; Bradford, John B.; Nislow, Keith H.; King, David; Brooks, Robert T.

    2013-01-01

    Recent interest in using forest residues and small-diameter material for biofuels is generating a renewed focus on harvesting impacts and forest sustainability. The rich legacy of research from whole-tree harvesting studies can be examined in light of this interest. Although this research largely focused on consequences for forest productivity, in particular carbon and nutrient pools, it also has relevance for examining potential consequences for biodiversity and aquatic ecosystems. This review is framed within a context of contrasting ecosystem impacts from whole-tree harvesting because it represents a high level of biomass removal. Although whole-tree harvesting does not fully use the nonmerchantable biomass available, it indicates the likely direction and magnitude of impacts that can occur through energy-wood harvesting compared with less-intensive conventional harvesting and to dynamics associated with various natural disturbances. The intent of this comparison is to gauge the degree of departure of energy-wood harvesting from less intensive conventional harvesting. The review of the literature found a gradient of increasing departure in residual structural conditions that remained in the forest when conventional and whole-tree harvesting was compared with stand-replacing natural disturbance. Important stand- and landscape-level processes were related to these structural conditions. The consequence of this departure may be especially potent because future energy-wood harvests may more completely use a greater range of forest biomass at potentially shortened rotations, creating a great need for research that explores the largely unknown scale of disturbance that may apply to our forest ecosystems.

  13. INFINITY harvest

    NASA Image and Video Library

    2012-05-07

    Shania Etheridge from Benjamin E. Mays Preparatory School in New Orleans shows off the head of lettuce she harvested at the INFINITY at NASA Stennis Space Center facility May 7, 2012. The Louisiana students assisted in the first harvest of lettuce from the Controlled Environment Agriculture unit, which uses an aeroponic process that involves no soil and advanced LED lighting techniques.

  14. Microalgae harvesting techniques: A review.

    PubMed

    Singh, Gulab; Patidar, S K

    2018-07-01

    Microalgae, with their wide range of commercial applications, have attracted considerable attention from researchers in the last few decades. However, microalgae utilization is not economically sustainable due to the high cost of harvesting. A wide range of solid-liquid separation techniques are available for microalgae harvesting. The techniques include coagulation and flocculation, flotation, centrifugation and filtration, or a combination of various techniques. Despite the importance of harvesting to the economics and energy balance, there is no universal harvesting technique for microalgae. Therefore, this review focuses on assessing the technical, economic and application potential of various harvesting techniques so as to allow selection of an appropriate technology for cost-effective harvesting of microalgae from their culture medium. Various harvesting and concentrating techniques of microalgae were reviewed to suggest an order of suitability of the techniques for four main microalgae applications, i.e., biofuel, human and animal food, high-value products, and water quality restoration. To decide the order of suitability, a comparative analysis of various harvesting techniques was done based on six common criteria (i.e., biomass quality, cost, biomass quantity, processing time, species specificity and toxicity). The order of suitability of harvesting techniques for each application was then decided from the ranking of the techniques against each criterion and the preferred order of criteria for that application. Among the various harvesting techniques, coagulation and flocculation, centrifugation and filtration were found to be most suitable for the considered applications. These techniques may be used alone or in combination to increase the harvesting efficiency.

  15. Facet Annotation by Extending CNN with a Matching Strategy.

    PubMed

    Wu, Bei; Wei, Bifan; Liu, Jun; Guo, Zhaotong; Zheng, Yuanhao; Chen, Yihe

    2018-06-01

    Most community question answering (CQA) websites manage plenty of question-answer pairs (QAPs) through topic-based organizations, which may not satisfy users' fine-grained search demands. Facets of topics serve as a powerful tool to navigate, refine, and group the QAPs. In this work, we propose FACM, a model to annotate QAPs with facets by extending convolutional neural networks (CNNs) with a matching strategy. First, phrase information is incorporated into text representation by CNNs with different kernel sizes. Then, through a matching strategy among QAPs and facet label texts (FaLTs) acquired from Wikipedia, we generate similarity matrices to deal with the facet heterogeneity. Finally, a three-channel CNN is trained for facet label assignment of QAPs. Experiments on three real-world data sets show that FACM outperforms the state-of-the-art methods.
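
    The abstract does not say how FACM builds its similarity matrices, so the following is only a generic sketch of the idea of a similarity matrix between a question-answer pair and a facet label text. It assumes each token is already represented by a vector and uses plain cosine similarity; the function names and the toy vectors are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_matrix(qap_vectors: Sequence[Sequence[float]],
                      label_vectors: Sequence[Sequence[float]]) -> List[List[float]]:
    """Entry [i][j] compares token i of the QAP with token j of the label text."""
    return [[cosine(q, l) for l in label_vectors] for q in qap_vectors]


# Toy 2-dimensional "embeddings" purely for illustration.
qap_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
label_tokens = [[1.0, 0.0], [0.0, 2.0]]

sim = similarity_matrix(qap_tokens, label_tokens)
for row in sim:
    print([round(value, 3) for value in row])
```

    A matrix of this kind has one row per QAP token and one column per label token, so it can be treated like a small image and passed to a convolutional model as one input channel.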

  16. Sequencing, Annotation and Analysis of the Syrian Hamster (Mesocricetus auratus) Transcriptome

    PubMed Central

    Tchitchek, Nicolas; Safronetz, David; Rasmussen, Angela L.; Martens, Craig; Virtaneva, Kimmo; Porcella, Stephen F.; Feldmann, Heinz

    2014-01-01

    Background: The Syrian hamster (golden hamster, Mesocricetus auratus) is gaining importance as a new experimental animal model for multiple pathogens, including emerging zoonotic diseases such as Ebola. Nevertheless, there are currently no publicly available transcriptome reference sequences or genome for this species. Results: A cDNA library derived from mRNA and snRNA isolated and pooled from the brains, lungs, spleens, kidneys, livers, and hearts of three adult female Syrian hamsters was sequenced. Sequence reads were assembled into 62,482 contigs and 111,796 reads remained unassembled (singletons). This combined contig/singleton dataset, designated as the Syrian hamster transcriptome, represents a total of 60,117,204 nucleotides. Our Mesocricetus auratus Syrian hamster transcriptome mapped to 11,648 mouse transcripts representing 9,562 distinct genes, and mapped to a similar number of transcripts and genes in the rat. We identified 214 quasi-complete transcripts based on mouse annotations. Canonical pathways involved in a broad spectrum of fundamental biological processes were significantly represented in the library. The Syrian hamster transcriptome was aligned to the current release of the Chinese hamster ovary (CHO) cell transcriptome and genome to improve the genomic annotation of this species. Finally, our Syrian hamster transcriptome was aligned against 14 other rodent, primate and Laurasiatheria species to gain insights about the genetic relatedness and placement of this species. Conclusions: This Syrian hamster transcriptome dataset significantly improves our knowledge of the Syrian hamster's transcriptome, especially towards its future use in infectious disease research. Moreover, this library is an important resource for the wider scientific community to help improve genome annotation of the Syrian hamster and other closely related species. Furthermore, these data provide the basis for development of expression microarrays that can be used in functional

  17. Methods for eliciting, annotating, and analyzing databases for child speech development.

    PubMed

    Beckman, Mary E; Plummer, Andrew R; Munson, Benjamin; Reidy, Patrick F

    2017-09-01

    Methods from automatic speech recognition (ASR), such as segmentation and forced alignment, have facilitated the rapid annotation and analysis of very large adult speech databases and databases of caregiver-infant interaction, enabling advances in speech science that were unimaginable just a few decades ago. This paper centers on two main problems that must be addressed in order to have analogous resources for developing and exploiting databases of young children's speech. The first problem is to understand and appreciate the differences between adult and child speech that cause ASR models developed for adult speech to fail when applied to child speech. These differences include the fact that children's vocal tracts are smaller than those of adult males and also changing rapidly in size and shape over the course of development, leading to between-talker variability across age groups that dwarfs the between-talker differences between adult men and women. Moreover, children do not achieve fully adult-like speech motor control until they are young adults, and their vocabularies and phonological proficiency are developing as well, leading to considerably more within-talker variability as well as more between-talker variability. The second problem then is to determine what annotation schemas and analysis techniques can most usefully capture relevant aspects of this variability. Indeed, standard acoustic characterizations applied to child speech reveal that adult-centered annotation schemas fail to capture phenomena such as the emergence of covert contrasts in children's developing phonological systems, while also revealing children's nonuniform progression toward community speech norms as they acquire the phonological systems of their native languages. 
    Both problems point to the need for more basic research into the growth and development of the articulatory system (as well as of the lexicon and phonological system) that is oriented explicitly toward the construction of

  18. Annotation-Based Learner's Personality Modeling in Distance Learning Context

    ERIC Educational Resources Information Center

    Omheni, Nizar; Kalboussi, Anis; Mazhoud, Omar; Kacem, Ahmed Hadj

    2016-01-01

    Researchers in distance education are interested in observing and modeling learners' personality profiles, and adapting their learning experiences accordingly. When learners read and interact with their reading materials, they engage in unselfconscious activities such as annotation, which may be a key feature of their personalities. Annotation activity…

  19. Annotated Bibliography of Research in the Teaching of English

    ERIC Educational Resources Information Center

    Beach, Richard; Bigelow, Martha; Dillon, Deborah; Dockter, Jessie; Galda, Lee; Helman, Lori; Kalnin, Julie; Ngo, Bic; O'Brien, David; Sato, Mistilina; Scharber, Cassandra; Jorgensen, Karen; Liang, Lauren; Braaksma, Martine; Janssen, Tanja

    2008-01-01

    This article presents an annotated bibliography of research in the teaching of English. This annotated bibliography addresses the following topics: (1) discourse/cultural analysis; (2) literacy; (3) literary response/literature/narrative; (4) professional development/teacher education; (5) reading; (6) second language literacy; (7)…

  20. Enhancing Expressivity of Document-Centered Collaboration with Multimodal Annotations

    ERIC Educational Resources Information Center

    Yoon, Dongwook

    2017-01-01

    As knowledge work moves online, digital documents have become a staple of human collaboration. To communicate beyond the constraints of time and space, remote and asynchronous collaborators create digital annotations over documents, substituting face-to-face meetings with online conversations. However, existing document annotation interfaces…

  1. Una Introduccion a los Jovenes con Discapacidades: Bibliografia Anotada. Revisiones de CYDLINE (An Introduction to Youth with Disabilities: Annotated Bibliography. CYDLINE Reviews).

    ERIC Educational Resources Information Center

    Minnesota Univ., Minneapolis. National Center for Youth with Disabilities.

    This Spanish-language annotated bibliography describes English-language resources covering a wide range of issues related to disabled youth and their families. The 38 bibliographic citations date from 1980 to 1989 and are grouped into the following categories: psychosocial issues, health issues, educational issues, and community living.…

  2. An annotation system for 3D fluid flow visualization

    NASA Technical Reports Server (NTRS)

    Loughlin, Maria M.; Hughes, John F.

    1995-01-01

    Annotation is a key activity of data analysis. However, current systems for data analysis focus almost exclusively on visualization. We propose a system which integrates annotations into a visualization system. Annotations are embedded in 3D data space, using the Post-it metaphor. This embedding allows contextual-based information storage and retrieval, and facilitates information sharing in collaborative environments. We provide a traditional database filter and a Magic Lens filter to create specialized views of the data. The system has been customized for fluid flow applications, with features which allow users to store parameters of visualization tools and sketch 3D volumes.

  3. Elucidating high-dimensional cancer hallmark annotation via enriched ontology.

    PubMed

    Yan, Shankai; Wong, Ka-Chun

    2017-09-01

    Cancer hallmark annotation is a promising technique that could discover novel knowledge about cancer from the biomedical literature. The automated annotation of cancer hallmarks could reveal relevant cancer transformation processes in the literature or extract the articles that correspond to the cancer hallmark of interest. It acts as a complementary approach that can retrieve knowledge from massive text information, advancing numerous focused studies in cancer research. Nonetheless, the high-dimensional nature of cancer hallmark annotation imposes a unique challenge. To address the curse of dimensionality, we compared multiple cancer hallmark annotation methods on 1580 PubMed abstracts. Based on the insights, a novel approach, UDT-RF, which makes use of ontological features is proposed. It expands the feature space via the Medical Subject Headings (MeSH) ontology graph and utilizes novel feature selections for elucidating the high-dimensional cancer hallmark annotation space. To demonstrate its effectiveness, state-of-the-art methods are compared and evaluated by a multitude of performance metrics, revealing the full performance spectrum on the full set of cancer hallmarks. Several case studies are conducted, demonstrating how the proposed approach could reveal novel insights into cancers. https://github.com/cskyan/chmannot. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Harvest and trade of caterpillar mushroom (Ophiocordyceps sinensis) and the implications for sustainable use in the Tibet Region of Southwest China.

    PubMed

    He, Jun

    2018-07-15

    Caterpillar mushroom (Ophiocordyceps sinensis) is a unique medicinal fungus which is only found in alpine grasslands in Himalayan mountain regions and the Tibetan Plateau. Known locally as Yartsa Gunbu, it has been widely used in Tibetan and Chinese Medicine for centuries. It is crucial to understand local commercial harvest and trade practices of caterpillar mushroom to support the sustainable management of this valuable resource. However, data derived from empirically grounded research is currently limited, particularly in China. The research aims to provide the most up-to-date insights into caterpillar mushroom harvest and trade in the main production area of the Tibet Region in Southwest China and to generate policy recommendations for sustainable use. The research was conducted in 2015-2016 in six Tibetan communities located in two counties in Diqing Tibetan Autonomous Prefecture, Southwest China. Quantitative and qualitative data were collected from in-depth interviews with local households engaged in caterpillar mushroom harvesting (n = 157), local caterpillar mushroom traders (n = 14), and from focus group discussions (n = 5) with regional caterpillar mushroom industry stakeholders. The research found large regional- and community-level differences in caterpillar mushroom harvest practices. The harvest practices of communities involved in the co-management of a Nature Reserve were more sustainable than those of communities not involved in such a scheme, and this was due to the external support and training provided via the co-management scheme. Moreover, a customary tenure system was proving effective for avoiding competition over caterpillar mushroom collection. However, in both counties, narrow marketing channels and the lack of a grading system in trade limit the possibility of improving the local benefits generated from the commercial harvest of caterpillar mushroom. Meanwhile, the local traders play an important bridging role in the value chain and

  5. Simulating the biogeochemical and biogeophysical impacts of transient land cover change and wood harvest in the Community Climate System Model (CCSM4) from 1850 to 2100

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lawrence, Peter J.; Feddema, Johannes J.; Bonan, Gordon B.

    To assess the climate impacts of historical and projected land cover change and land use in the Community Climate System Model (CCSM4), we have developed new time series of transient Community Land Model (CLM4) Plant Functional Type (PFT) parameters and wood harvest parameters. The new parameters capture the dynamics of the Coupled Model Inter-comparison Project phase 5 (CMIP5) land cover change and wood harvest trajectories for the historical period from 1850 to 2005, and for the four Representative Concentration Pathways (RCP) periods from 2006 to 2100. Analysis of the biogeochemical impacts of land cover change in CCSM4 with the parameters found the model produced an historical cumulative land use flux of 148.4 PgC from 1850 to 2005, which was in good agreement with other global estimates of around 156 PgC for the same period. The biogeophysical impacts of only applying the transient land cover change parameters in CCSM4 were cooling of the near-surface atmosphere over land by -0.1°C, through increased surface albedo and reduced shortwave radiation absorption. When combined with other transient climate forcings, the higher albedo from land cover change was overwhelmed at global scales by decreases in snow albedo from black carbon deposition and from high latitude warming. At regional scales, however, the land cover change forcing persisted, resulting in reduced warming, with the biggest impacts in eastern North America. The future CCSM4 RCP simulations showed that the CLM4 transient PFT and wood harvest parameters could be used to represent a wide range of human land cover change and land use scenarios. Furthermore, these simulations ranged from the RCP 4.5 reforestation scenario that was able to draw down 82.6 PgC from the atmosphere, to the RCP 8.5 wide scale deforestation scenario that released 171.6 PgC to the atmosphere.

  6. snpGeneSets: An R Package for Genome-Wide Study Annotation

    PubMed Central

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-01-01

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
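
    The abstract mentions gene set enrichment analysis without naming the statistical test. One common choice for this kind of over-representation question is a one-sided hypergeometric test, sketched below. It is an assumption for illustration only and should not be read as what snpGeneSets implements; the function name and the numbers are hypothetical, and the sketch is in Python rather than R.

```python
from math import comb


def hypergeom_enrichment_p(universe: int, set_size: int,
                           hits: int, overlap: int) -> float:
    """One-sided over-representation p-value.

    universe: number of genes considered in total
    set_size: number of those genes that belong to the gene set (pathway)
    hits:     number of genes flagged by the study
    overlap:  number of flagged genes that fall in the gene set

    Returns P(X >= overlap) for X ~ Hypergeometric(universe, set_size, hits).
    """
    upper = min(set_size, hits)
    total = comb(universe, hits)
    tail = sum(comb(set_size, k) * comb(universe - set_size, hits - k)
               for k in range(overlap, upper + 1))
    return tail / total


# Hypothetical example: 20,000 genes, a 100-gene pathway,
# 200 study hits of which 8 are in the pathway.
p_value = hypergeom_enrichment_p(20000, 100, 200, 8)
print(f"enrichment p-value: {p_value:.3g}")
```

    With these made-up numbers only about one pathway gene would be expected among the hits by chance (200 × 100 / 20,000 = 1), so an overlap of 8 yields a small p-value.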

  7. Metagenome phylogenetic profiling of microbial community evolution in a tetrachloroethene-contaminated aquifer responding to enhanced reductive dechlorination protocols.

    PubMed

    Reiss, Rebecca A; Guerra, Peter; Makhnin, Oleg

    2016-01-01

    Chlorinated solvent contamination of potable water supplies is a serious problem worldwide. Biostimulation protocols can successfully remediate chlorinated solvent contamination through enhanced reductive dechlorination pathways; however, the process is poorly understood and sometimes stalls, creating a more serious problem. Whole metagenome techniques have the potential to reveal details of microbial community changes induced by biostimulation. Here we compare the metagenome of a tetrachloroethene-contaminated Environmental Protection Agency Superfund Site before and after the application of biostimulation protocols. Environmental DNA was extracted from uncultured microbes that were harvested by on-site filtration of groundwater one month prior to and five months after the injection of emulsified vegetable oil, nutrients, and hydrogen gas bioamendments. Paired-end libraries were prepared for high-throughput DNA sequencing and 90 basepairs from both ends of randomly fragmented 400 basepair DNA fragments were sequenced. Over 31 million reads were annotated with Metagenome Rapid Annotation using Subsystem Technology, representing 32 prokaryotic phyla, 869 genera, and 3,181 species. A 3.6 log2-fold increase in biomass, as measured by DNA yield per mL of water, was observed, but there was a 9% decrease in the number of genera detected post-remediation. We apply Bayesian statistical methods to assign false discovery rates to fold-change abundance data and use Zipf's power law to filter genera with low read counts. Plotting the log-rank against the log-fold-change facilitates the visualization of the changes in the community in response to the enhanced reductive dechlorination protocol. Members of the Archaea domain increased 4.7 log2-fold, dominated by methanogens. Prior to remediation, classes Alphaproteobacteria and Betaproteobacteria dominated the community but exhibited significant decreases five months after biostimulation. Geobacter and Sulfurospirillum replace
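
    The log-rank versus log-fold-change view described above can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it uses a fixed minimum read count in place of the Zipf's-law filter, adds a pseudocount of 1 to avoid division by zero, omits the Bayesian false discovery rates entirely, and all genus names and read counts are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def log2_fold_change(before: float, after: float, pseudocount: float = 1.0) -> float:
    """log2 of the after/before ratio, with a pseudocount added to both counts."""
    return math.log2((after + pseudocount) / (before + pseudocount))


def log_rank_vs_fold_change(before: Dict[str, int], after: Dict[str, int],
                            min_reads: int = 10) -> List[Tuple[str, float, float]]:
    """Return (genus, log10 abundance rank, log2 fold change) for each kept genus.

    Genera whose combined read count is below min_reads are dropped
    (a crude stand-in for the low-count filter described in the abstract).
    Rank 1 is the genus with the most reads overall.
    """
    genera = set(before) | set(after)
    kept = [g for g in genera
            if before.get(g, 0) + after.get(g, 0) >= min_reads]
    kept.sort(key=lambda g: (-(before.get(g, 0) + after.get(g, 0)), g))
    return [(g, math.log10(rank),
             log2_fold_change(before.get(g, 0), after.get(g, 0)))
            for rank, g in enumerate(kept, start=1)]


# Hypothetical read counts per genus, before and after treatment.
before = {"genus_a": 100, "genus_b": 7, "genus_c": 1}
after = {"genus_a": 25, "genus_b": 63, "genus_c": 2}

points = log_rank_vs_fold_change(before, after)
for genus, log_rank, lfc in points:
    print(f"{genus}: log10(rank)={log_rank:.2f}, log2FC={lfc:+.2f}")
```

    For scale, a 3.6 log2-fold increase corresponds to a ratio of 2^3.6, or roughly a 12-fold change.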

  8. A new approach for annotation of transposable elements using small RNA mapping

    PubMed Central

    El Baidouri, Moaine; Kim, Kyung Do; Abernathy, Brian; Arikit, Siwaret; Maumus, Florian; Panaud, Olivier; Meyers, Blake C.; Jackson, Scott A.

    2015-01-01

    Transposable elements (TEs) are mobile genomic DNA sequences found in most organisms. They so densely populate the genomes of many eukaryotic species that they are often the major constituents. With the rapid generation of many plant genome sequencing projects over the past few decades, there is an urgent need for improved TE annotation as a prerequisite for genome-wide studies. Analogous to the use of RNA-seq for gene annotation, we propose a new method for de novo TE annotation that uses as a guide 24 nt-siRNAs that are a part of TE silencing pathways. We use this new approach, called TASR (for Transposon Annotation using Small RNAs), for de novo annotation of TEs in Arabidopsis, rice and soybean and demonstrate that this strategy can be successfully applied for de novo TE annotation in plants. Executable PERL is available for download from: http://tasr-pipeline.sourceforge.net/ PMID:25813049

  9. INFINITY harvest

    NASA Image and Video Library

    2012-05-07

    Janice Hueschen of Innovative Imaging & Research Corp. at Stennis Space Center helps students from Benjamin E. Mays Preparatory School in New Orleans harvest lettuce at the INFINITY at NASA Stennis Space Center facility May 7, 2012. The Louisiana students assisted in the first harvest of lettuce from the Controlled Environment Agriculture unit, which uses an aeroponic process that involves no soil and advanced LED lighting techniques.

  10. K-Nearest Neighbors Relevance Annotation Model for Distance Education

    ERIC Educational Resources Information Center

    Ke, Xiao; Li, Shaozi; Cao, Donglin

    2011-01-01

    With the rapid development of Internet technologies, distance education has become a popular educational mode. In this paper, the authors propose an online image automatic annotation distance education system, which could effectively help children learn interrelations between image content and corresponding keywords. Image automatic annotation is…

  11. 1969 Washington timber harvest.

    Treesearch

    Brian R. Wall

    1970-01-01

    Washington's timber harvest increased slightly in 1969 to a 40-year high of 7 billion board feet. This is slightly below the record timber harvest of 7.38 billion board feet established in 1929. Private timberland owners in western Washington increased their production 10.9 percent, accounting for most of the increase in the 1969 total harvest. In eastern...

  12. GENCODE: the reference human genome annotation for The ENCODE Project.

    PubMed

    Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J

    2012-09-01

    The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

  13. BioSAVE: Display of scored annotation within a sequence context

    PubMed Central

    Pollock, Richard F; Adryan, Boris

    2008-01-01

    Background: Visualization of sequence annotation is a common feature in many bioinformatics tools. For many applications it is desirable to restrict the display of such annotation according to a score cutoff, as biological interpretation can be difficult in the presence of the entire data. Unfortunately, many visualisation solutions are somewhat static in the way they handle such score cutoffs. Results: We present BioSAVE, a sequence annotation viewer with on-the-fly selection of visualisation thresholds for each feature. BioSAVE is a versatile OS X program for visual display of scored features (annotation) within a sequence context. The program reads sequence and additional supplementary annotation data (e.g., position weight matrix matches, conservation scores, structural domains) from a variety of commonly used file formats and displays them graphically. Onscreen controls then allow for live customisation of these graphics, including on-the-fly selection of visualisation thresholds for each feature. Conclusion: Possible applications of the program include display of transcription factor binding sites in a genomic context or the visualisation of structural domain assignments in protein sequences and many more. The dynamic visualisation of these annotations is useful, e.g., for the determination of cutoff values of predicted features to match experimental data. Program, source code and exemplary files are freely available at the BioSAVE homepage. PMID:18366701
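
    The per-feature score cutoff idea can be illustrated with a short sketch. This is not BioSAVE's code or file format: the Feature record, the visible_features function, the feature type names, and the scores are hypothetical, chosen only to show how a separate threshold per feature type selects what gets displayed.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Feature:
    kind: str      # feature type, e.g. a binding-site match or a conservation block
    start: int     # 1-based start position within the sequence
    end: int       # inclusive end position
    score: float   # score attached to the annotation


def visible_features(features: List[Feature],
                     thresholds: Dict[str, float],
                     default_threshold: float = 0.0) -> List[Feature]:
    """Keep features whose score meets the cutoff set for their type."""
    return [f for f in features
            if f.score >= thresholds.get(f.kind, default_threshold)]


# Hypothetical annotations on one sequence.
features = [
    Feature("pwm_match", 10, 18, 0.92),
    Feature("pwm_match", 40, 48, 0.55),
    Feature("conservation", 5, 60, 0.70),
    Feature("domain", 100, 180, 0.30),
]

# Raising or lowering a cutoff changes the display without re-reading the data.
shown = visible_features(features, {"pwm_match": 0.8, "conservation": 0.6})
for f in shown:
    print(f.kind, f.start, f.end, f.score)
```

    Because filtering is a cheap pass over data already in memory, a viewer can re-run it each time the user moves a threshold control, which is what makes on-the-fly cutoff selection practical.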

  14. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  15. Automated Gene Ontology annotation for anonymous sequence data.

    PubMed

    Hennig, Steffen; Groth, Detlef; Lehrach, Hans

    2003-07-01

    Gene Ontology (GO) is the most widely accepted attempt to construct a unified and structured vocabulary for the description of genes and their products in any organism. Annotation by GO terms is performed in most of the current genome projects, which besides generality has the advantage of being very convenient for computer based classification methods. However, direct use of GO in small sequencing projects is not easy, especially for species not commonly represented in public databases. We present a software package (GOblet), which performs annotation based on GO terms for anonymous cDNA or protein sequences. It uses the species independent GO structure and vocabulary together with a series of protein databases collected from various sites, to perform a detailed GO annotation by sequence similarity searches. The sensitivity and the reference protein sets can be selected by the user. GOblet runs automatically and is available as a public service on our web server. The paper also addresses the reliability of automated GO annotations by using a reference set of more than 6000 human proteins. The GOblet server is accessible at http://goblet.molgen.mpg.de.

  16. Timber harvest as the predominant disturbance regime in northeastern U.S. forests: Effects of harvest intensification

    USGS Publications Warehouse

    Brown, Michelle L.; Canham, Charles D.; Murphy, Lora; Donovan, Therese M.

    2018-01-01

    Harvesting is the leading cause of adult tree mortality in forests of the northeastern United States. While current rates of timber harvest are generally sustainable, there is considerable pressure to increase the contribution of forest biomass to meet renewable energy goals. We estimated current harvest regimes for different forest types and regions across the U.S. states of New York, Vermont, New Hampshire, and Maine using data from the U.S. Forest Inventory and Analysis Program. We implemented the harvest regimes in SORTIE‐ND, an individual‐based model of forest dynamics, and simulated the effects of current harvest regimes and five additional harvest scenarios that varied by harvest frequency and intensity over 150 yr. The best statistical model for the harvest regime described the annual probability of harvest as a function of forest type/region, total plot basal area, and distance to the nearest improved road. Forests were predicted to increase in adult aboveground biomass in all harvest scenarios in all forest type and region combinations. The magnitude of the increase, however, varied dramatically—increasing from 3% to 120% above current landscape averages as harvest frequency and intensity decreased. The variation can be largely explained by the disproportionately high harvest rates estimated for Maine as compared with the rest of the region. Despite steady biomass accumulation across the landscape, stands that exhibited old‐growth characteristics (defined as ≥300 metric tons of biomass/hectare) were rare (8% or less of stands). Intensified harvest regimes had little effect on species composition due to widespread partial harvesting in all scenarios, resulting in dominance by late‐successional species over time. Our analyses indicate that forest biomass can represent a sustainable, if small, component of renewable energy portfolios in the region, although there are tradeoffs between carbon sequestration in forest biomass and sustainable

  17. Improved annotation through genome-scale metabolic modeling of Aspergillus oryzae

    PubMed Central

    Vongsangnak, Wanwipa; Olsen, Peter; Hansen, Kim; Krogsgaard, Steen; Nielsen, Jens

    2008-01-01

    Background: Since ancient times the filamentous fungus Aspergillus oryzae has been used in the fermentation industry for the production of fermented sauces and the production of industrial enzymes. Recently, the genome sequence of A. oryzae with 12,074 annotated genes was released but the number of hypothetical proteins accounted for more than 50% of the annotated genes. Considering the industrial importance of this fungus, it is therefore valuable to improve the annotation and further integrate genomic information with biochemical and physiological information available for this microorganism and other related fungi. Here we performed gene prediction by constructing an A. oryzae Expressed Sequence Tag (EST) library, followed by sequencing and assembly. We enhanced function assignment using our own annotation strategy. The resulting improved annotation was used to reconstruct the metabolic network, leading to a genome-scale metabolic model of A. oryzae. Results: From our assembled EST sequences we identified 1,046 newly predicted genes in the A. oryzae genome. Furthermore, it was possible to assign putative protein functions to 398 of the newly predicted genes. Notably, our annotation strategy resulted in assignment of new putative functions to 1,469 hypothetical proteins already present in the A. oryzae genome database. Using the substantially improved annotated genome we reconstructed the metabolic network of A. oryzae. This network contains 729 enzymes, 1,314 enzyme-encoding genes, 1,073 metabolites and 1,846 (1,053 unique) biochemical reactions. The metabolic reactions are compartmentalized into the cytosol, the mitochondria, the peroxisome and the extracellular space. Transport steps between the compartments and the extracellular space represent 281 reactions, of which 161 are unique. The metabolic model was validated and shown to correctly describe the phenotypic behavior of A. oryzae grown on different carbon sources. Conclusion: A much enhanced annotation of the A

  18. Compound annotation with real time cellular activity profiles to improve drug discovery.

    PubMed

    Fang, Ye

    2016-01-01

    In the past decade, a range of innovative strategies have been developed to improve the productivity of pharmaceutical research and development. In particular, compound annotation, combined with informatics, has provided unprecedented opportunities for drug discovery. In this review, a literature search from 2000 to 2015 was conducted to provide an overview of the compound annotation approaches currently used in drug discovery. Based on this, a framework related to a compound annotation approach using real-time cellular activity profiles for probe, drug, and biology discovery is proposed. Compound annotation with chemical structure, drug-like properties, bioactivities, genome-wide effects, clinical phenotypes, and textual abstracts has received significant attention in early drug discovery. However, these annotations are mostly associated with endpoint results. Advances in assay techniques have made it possible to obtain real-time cellular activity profiles of drug molecules under different phenotypes, so it is possible to generate compound annotation with real-time cellular activity profiles. Combining compound annotation with informatics, such as similarity analysis, presents a good opportunity to improve the rate of discovery of novel drugs and probes, and enhance our understanding of the underlying biology.

  19. Computing of Learner's Personality Traits Based on Digital Annotations

    ERIC Educational Resources Information Center

    Omheni, Nizar; Kalboussi, Anis; Mazhoud, Omar; Kacem, Ahmed Hadj

    2017-01-01

    Researchers in education are interested in modeling learners' profiles and adapting their learning experiences accordingly. When learners read and interact with their reading materials, they engage in unconscious practices such as annotation, which may be a key feature of their personalities. Annotation activity requires readers to be active, to think…

  20. The Eimeria Transcript DB: an integrated resource for annotated transcripts of protozoan parasites of the genus Eimeria

    PubMed Central

    Rangel, Luiz Thibério; Novaes, Jeniffer; Durham, Alan M.; Madeira, Alda Maria B. N.; Gruber, Arthur

    2013-01-01

    Parasites of the genus Eimeria infect a wide range of vertebrate hosts, including chickens. We have recently reported a comparative analysis of the transcriptomes of Eimeria acervulina, Eimeria maxima and Eimeria tenella, integrating ORESTES data produced by our group and publicly available Expressed Sequence Tags (ESTs). All cDNA reads have been assembled, and the reconstructed transcripts have been submitted to a comprehensive functional annotation pipeline. Additional studies included orthology assignment across apicomplexan parasites and clustering analyses of gene expression profiles among different developmental stages of the parasites. To make all this body of information publicly available, we constructed the Eimeria Transcript Database (EimeriaTDB), a web repository that provides access to sequence data, annotation and comparative analyses. Here, we describe the web interface, available sequence data sets and query tools implemented on the site. The main goal of this work is to offer a public repository of sequence and functional annotation data of reconstructed transcripts of parasites of the genus Eimeria. We believe that EimeriaTDB will represent a valuable and complementary resource for the Eimeria scientific community and for those researchers interested in comparative genomics of apicomplexan parasites. Database URL: http://www.coccidia.icb.usp.br/eimeriatdb/ PMID:23411718

  1. The Aspergillus Genome Database: multispecies curation and incorporation of RNA-Seq data to improve structural gene annotations.

    PubMed

    Cerqueira, Gustavo C; Arnaud, Martha B; Inglis, Diane O; Skrzypek, Marek S; Binkley, Gail; Simison, Matt; Miyasato, Stuart R; Binkley, Jonathan; Orvis, Joshua; Shah, Prachi; Wymore, Farrell; Sherlock, Gavin; Wortman, Jennifer R

    2014-01-01

    The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available web-based resource that was designed for Aspergillus researchers and is also a valuable source of information for the entire fungal research community. In addition to being a repository and central point of access to genome, transcriptome and polymorphism data, AspGD hosts a comprehensive comparative genomics toolbox that facilitates the exploration of precomputed orthologs among the 20 currently available Aspergillus genomes. AspGD curators perform gene product annotation based on review of the literature for four key Aspergillus species: Aspergillus nidulans, Aspergillus oryzae, Aspergillus fumigatus and Aspergillus niger. We have iteratively improved the structural annotation of Aspergillus genomes through the analysis of publicly available transcription data, mostly expressed sequence tags, as described in a previous NAR Database article (Arnaud et al. 2012). In this update, we report substantive structural annotation improvements for A. nidulans, A. oryzae and A. fumigatus genomes based on recently available RNA-Seq data. Over 26 000 loci were updated across these species; although those primarily comprise the addition and extension of untranslated regions (UTRs), the new analysis also enabled over 1000 modifications affecting the coding sequence of genes in each target genome.

  2. 3D annotation and manipulation of medical anatomical structures

    NASA Astrophysics Data System (ADS)

    Vitanovski, Dime; Schaller, Christian; Hahn, Dieter; Daum, Volker; Hornegger, Joachim

    2009-02-01

    Although medical scanners are rapidly moving towards a three-dimensional paradigm, the manipulation and annotation/labeling of the acquired data is still performed in a standard 2D environment. Editing and annotation of three-dimensional medical structures is currently a complex and rather time-consuming task, as it is carried out in 2D projections of the original object. A major problem in 2D annotation is depth ambiguity, which requires 3D landmarks to be identified and localized in at least two of the cutting planes. Operating directly in three-dimensional space enables the implicit consideration of the full 3D local context, which significantly increases accuracy and speed. A three-dimensional environment is also more natural, improving the user's comfort and acceptance. The 3D annotation environment requires a three-dimensional manipulation device and display. Using two novel technologies, the Nintendo Wii controller and the Philips 3D WoWvx display, we define an appropriate 3D annotation tool and a suitable 3D visualization monitor. We define a non-coplanar arrangement of four infrared LEDs with known, exact positions, which are tracked by the Wii and from which we compute the pose of the device by applying a standard pose estimation algorithm. The novel 3D renderer developed by Philips either uses the Z-value of a 3D volume or computes depth information from a 2D image to provide a real 3D experience without special glasses. In this paper we present a new framework for manipulation and annotation of medical landmarks directly in a three-dimensional volume.

  3. Comparative analysis of grapevine whole-genome gene predictions, functional annotation, categorization and integration of the predicted gene sequences

    PubMed Central

    2012-01-01

    Background The first draft assembly and gene prediction of the grapevine genome (8X base coverage) was made available to the scientific community in 2007, and functional annotation was developed on this gene prediction. Since then additional Sanger sequences were added to the 8X sequences pool and a new version of the genomic sequence with superior base coverage (12X) was produced. Results In order to more efficiently annotate the function of the genes predicted in the new assembly, it is important to build on as much of the previous work as possible, by transferring 8X annotation of the genome to the 12X version. The 8X and 12X assemblies and gene predictions of the grapevine genome were compared to answer the question, “Can we uniquely map 8X predicted genes to 12X predicted genes?” The results show that while the assemblies and gene structure predictions are too different to make a complete mapping between them, most genes (18,725) showed a one-to-one relationship between 8X predicted genes and the last version of 12X predicted genes. In addition, reshuffled genomic sequence structures appeared. These highlight regions of the genome where the gene predictions need to be taken with caution. Based on the new grapevine gene functional annotation and in-depth functional categorization, twenty eight new molecular networks have been created for VitisNet while the existing networks were updated. Conclusions The outcomes of this study provide a functional annotation of the 12X genes, an update of VitisNet, the system of the grapevine molecular networks, and a new functional categorization of genes. Data are available at the VitisNet website (http://www.sdstate.edu/ps/research/vitis/pathways.cfm). PMID:22554261
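    The question of whether 8X predicted genes map uniquely to 12X predicted genes can be illustrated with a minimal sketch. It assumes that candidate matches between the two gene sets are already available as pairs; the gene identifiers and the function name `one_to_one_pairs` are invented for illustration and do not come from the study, which does not describe its matching procedure here.

```python
from collections import defaultdict


def one_to_one_pairs(matches):
    """Return sorted pairs (a, b) where a matches only b and b matches only a."""
    a_to_b = defaultdict(set)
    b_to_a = defaultdict(set)
    for a, b in matches:
        a_to_b[a].add(b)
        b_to_a[b].add(a)
    pairs = []
    for a, partners in a_to_b.items():
        if len(partners) != 1:
            continue  # a maps to several genes in the other set
        (b,) = partners
        if len(b_to_a[b]) == 1:
            pairs.append((a, b))
    return sorted(pairs)


# Hypothetical identifiers, invented for illustration only.
example_matches = [
    ("g8x_001", "g12x_101"),  # unique in both directions
    ("g8x_002", "g12x_102"),  # one 8X gene matching two 12X genes
    ("g8x_002", "g12x_103"),
    ("g8x_003", "g12x_104"),  # two 8X genes matching one 12X gene
    ("g8x_004", "g12x_104"),
]
unique_pairs = one_to_one_pairs(example_matches)
```

    Genes that fall outside the one-to-one set (split or merged predictions in this toy input) correspond to the cases where, as the abstract notes, a complete mapping between the assemblies is not possible.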

  4. Harvest and dynamics of duck populations

    USGS Publications Warehouse

    Sedinger, James S.; Herzog, Mark P.

    2012-01-01

    The role of harvest in the dynamics of waterfowl populations continues to be debated among scientists and managers. Our perception is that interested members of the public and some managers believe that harvest influences North American duck populations based on calls for more conservative harvest regulations. A recent review of harvest and population dynamics of North American mallard (Anas platyrhynchos) populations (Pöysä et al. 2004) reached similar conclusions. Because of the importance of this issue, we reviewed the evidence for an impact of harvest on duck populations. Our understanding of the effects of harvest is limited because harvest effects are typically confounded with those of population density; regulations are typically most liberal when populations are greatest. This problem also exists in the current Adaptive Harvest Management Program (Conn and Kendall 2004). Consequently, even where harvest appears additive to other mortality, this may be an artifact of ignoring effects of population density. Overall, we found no compelling evidence for strong additive effects of harvest on survival in duck populations that could not be explained by other factors.

  5. 1971 Washington timber harvest.

    Treesearch

    Brian R. Wall

    1972-01-01

    Washington's 1971 timber harvest of 6.45 billion board feet was nearly the same as the 1970 harvest level. The total timber harvest on public lands increased nearly 4 percent with a 30-percent increase in eastern Washington more than offsetting a 5-percent decline in western Washington. Part of the increase in eastern Washington reflects salvage of a large volume...

  6. Image annotation based on positive-negative instances learning

    NASA Astrophysics Data System (ADS)

    Zhang, Kai; Hu, Jiwei; Liu, Quan; Lou, Ping

    2017-07-01

    Automatic image annotation remains a difficult task in computer vision; its main purpose is to help manage the massive number of images on the Internet and to assist intelligent retrieval. This paper designs a new image annotation model based on a visual bag of words, using low-level features such as color and texture information as well as mid-level features such as SIFT, and combining pic2pic, label2pic and label2label correlations to measure the degree of correlation between labels and images. We aim to prune the specific features for each single label and formalize the annotation task as a learning process based on Positive-Negative Instances Learning. Experiments performed on the Corel5K dataset give quite promising results compared with other existing methods.

  7. Behavioral Contributions to Teaching of Psychology: An Annotated Bibliography

    PubMed Central

    Karsten, Amanda M; Carr, James E

    2008-01-01

    An annotated bibliography that summarizes behavioral contributions to the journal Teaching of Psychology from 1974 to 2006 is provided. A total of 116 articles of potential utility to college-level instructors of behavior analysis and related areas were identified, annotated, and organized into nine categories for ease of accessibility. PMID:22478500

  8. Influence of harvesting on understory vegetation along a boreal riparian-upland gradient

    Treesearch

    Rebecca L. MacDonald; Han Y.H. Chen; Brian P. Palik; Ellie E. Prepas

    2014-01-01

    Management of riparian forests, and how they respond to disturbance, continues to be a focus of interest in the literature. Earlier studies on riparian plant community assembly following harvesting in the boreal forest have focused merely on highly contrasting microhabitats within a landscape, for example, streambank riparian habitat or upland habitat. Sustaining...

  9. Evaluation of harvest and information needs for North American sea ducks

    USGS Publications Warehouse

    Koneff, Mark D.; Zimmerman, Guthrie S.; Dwyer, Chris P.; Fleming, Kathleen K.; Padding, Paul I.; Devers, Patrick K.; Johnson, Fred A.; Runge, Michael C.; Roberts, Anthony J.

    2017-01-01

    risk of overharvest (i.e., observed harvest < allowable harvest in 5–7% and 19–26% of simulations, respectively depending on the functional form of density dependence), whereas the other populations appeared to be at moderate risk to low risk (observed harvest < allowable harvest in 22–68% of simulations, again conditional on the form of density dependence). We also evaluated the sensitivity of the difference between allowable and observed harvest estimates to uncertainty in individual demographic parameters to prioritize information needs. We found that uncertainty in overall fecundity had more influence on comparisons of allowable and observed harvest than adult survival or observed harvest for all species except long-tailed duck. Although adult survival was characterized by less uncertainty than individual components of fecundity, it was identified as a high priority information need given the sensitivity of growth rate and allowable harvest to this parameter. Uncertainty about population size was influential in the comparison of observed and allowable harvest for 5 of the 6 populations where it factored into the assessment. While this assessment highlights a high degree of uncertainty in allowable harvest, it provides a framework for integration of improved data from future research and monitoring. It could also serve as the basis for harvest strategy development as management objectives and regulatory alternatives are specified by the management community.

  10. Evaluation of harvest and information needs for North American sea ducks.

    PubMed

    Koneff, Mark D; Zimmerman, Guthrie S; Dwyer, Chris P; Fleming, Kathleen K; Padding, Paul I; Devers, Patrick K; Johnson, Fred A; Runge, Michael C; Roberts, Anthony J

    2017-01-01

    risk of overharvest (i.e., observed harvest < allowable harvest in 5-7% and 19-26% of simulations, respectively depending on the functional form of density dependence), whereas the other populations appeared to be at moderate risk to low risk (observed harvest < allowable harvest in 22-68% of simulations, again conditional on the form of density dependence). We also evaluated the sensitivity of the difference between allowable and observed harvest estimates to uncertainty in individual demographic parameters to prioritize information needs. We found that uncertainty in overall fecundity had more influence on comparisons of allowable and observed harvest than adult survival or observed harvest for all species except long-tailed duck. Although adult survival was characterized by less uncertainty than individual components of fecundity, it was identified as a high priority information need given the sensitivity of growth rate and allowable harvest to this parameter. Uncertainty about population size was influential in the comparison of observed and allowable harvest for 5 of the 6 populations where it factored into the assessment. While this assessment highlights a high degree of uncertainty in allowable harvest, it provides a framework for integration of improved data from future research and monitoring. It could also serve as the basis for harvest strategy development as management objectives and regulatory alternatives are specified by the management community.

  11. Evaluation of harvest and information needs for North American sea ducks

    PubMed Central

    Dwyer, Chris P.; Fleming, Kathleen K.; Padding, Paul I.; Devers, Patrick K.; Johnson, Fred A.; Runge, Michael C.; Roberts, Anthony J.

    2017-01-01

    risk of overharvest (i.e., observed harvest < allowable harvest in 5–7% and 19–26% of simulations, respectively depending on the functional form of density dependence), whereas the other populations appeared to be at moderate risk to low risk (observed harvest < allowable harvest in 22–68% of simulations, again conditional on the form of density dependence). We also evaluated the sensitivity of the difference between allowable and observed harvest estimates to uncertainty in individual demographic parameters to prioritize information needs. We found that uncertainty in overall fecundity had more influence on comparisons of allowable and observed harvest than adult survival or observed harvest for all species except long-tailed duck. Although adult survival was characterized by less uncertainty than individual components of fecundity, it was identified as a high priority information need given the sensitivity of growth rate and allowable harvest to this parameter. Uncertainty about population size was influential in the comparison of observed and allowable harvest for 5 of the 6 populations where it factored into the assessment. While this assessment highlights a high degree of uncertainty in allowable harvest, it provides a framework for integration of improved data from future research and monitoring. It could also serve as the basis for harvest strategy development as management objectives and regulatory alternatives are specified by the management community. PMID:28419113

  12. Using magnetic materials to harvest microalgal biomass: evaluation of harvesting and detachment efficiency.

    PubMed

    Zhu, L-D; Hiltunen, Erkki; Li, Zhaohua

    2017-12-15

    Using naked iron oxide (Fe3O4) and yttrium iron oxide (Y3Fe5O12) nanoparticles as flocculants, the harvesting efficiency of Chlorella vulgaris biomass was investigated. The harvesting process includes two steps: separation of the microalgae from the culture solution with the magnetic nanoparticles, and then separation of the algae from the magnetic nanoparticles. The optimal dosages and pH values for the magnetic harvesting of microalgal biomass were determined. Results showed that Y3Fe5O12 nanoparticles were more efficient in microalgal biomass harvesting than Fe3O4 nanoparticles. To achieve a harvesting efficiency of more than 90%, the optimal dosages of Fe3O4 and Y3Fe5O12 were 10 and 2.5 g/L, while the appropriate pH values were 6.2 and 7.3, respectively. The harvesting efficiency of Fe3O4 and Y3Fe5O12 nanoparticles increased as the pH value decreased. The experimental results also showed that at a higher pH value Fe3O4 nanoparticles were much easier to separate from the flocs than Y3Fe5O12 nanoparticles. When the floc pH value reached 12.3, 62.9% of Fe3O4 nanoparticles could be detached from the aggregates.

  13. Incorporating Feature-Based Annotations into Automatically Generated Knowledge Representations

    NASA Astrophysics Data System (ADS)

    Lumb, L. I.; Lederman, J. I.; Aldridge, K. D.

    2006-12-01

    Earth Science Markup Language (ESML) is efficient and effective in representing scientific data in an XML-based formalism. However, features of the data being represented are not accounted for in ESML. Such features might derive from events (e.g., a gap in data collection due to instrument servicing), identifications (e.g., a scientifically interesting area/volume in an image), or some other source. In order to account for features in an ESML context, we consider them from the perspective of annotation, i.e., the addition of information to existing documents without changing the originals. Although it is possible to extend ESML to incorporate feature-based annotations internally (e.g., by extending the XML schema for ESML), there are a number of complicating factors that we identify. Rather than pursuing the ESML-extension approach, we focus on an external representation for feature-based annotations via XML Pointer Language (XPointer). In previous work (Lumb & Aldridge, HPCS 2006, IEEE, doi:10.1109/HPCS.2006.26), we have shown that it is possible to extract relationships from ESML-based representations, and capture the results in the Resource Description Framework (RDF). Thus we explore and report on this same requirement for XPointer-based annotations of ESML representations. As in our past efforts, the Global Geodynamics Project (GGP) allows us to illustrate with a real-world example this approach for introducing annotations into automatically generated knowledge representations.
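    The idea of external annotation, adding information that points into a document without changing the original, can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: the element and attribute names are invented and are not taken from the ESML schema, and the limited path syntax of Python's standard `xml.etree.ElementTree` module stands in for XPointer, which the standard library does not implement.

```python
import xml.etree.ElementTree as ET

# Hypothetical data document; element and attribute names are invented
# for illustration and are not taken from the ESML schema.
DATA_XML = """<dataset>
  <series id="gravity">
    <sample t="0">9.81</sample>
    <sample t="1">9.82</sample>
    <sample t="3">9.80</sample>
  </series>
</dataset>"""

# External (standoff) annotations: each entry points into the data document
# with a path expression and carries a note. The data document is not edited.
ANNOTATIONS = [
    {
        "target": ".//series[@id='gravity']/sample[@t='3']",
        "note": "first sample after a gap for instrument servicing",
    },
]


def resolve(data_xml, annotations):
    """Pair each annotation note with the text of the element it targets."""
    root = ET.fromstring(data_xml)
    resolved = []
    for annotation in annotations:
        element = root.find(annotation["target"])
        if element is not None:  # skip pointers that match nothing
            resolved.append((element.text, annotation["note"]))
    return resolved


resolved = resolve(DATA_XML, ANNOTATIONS)
```

    Because the annotations live in a separate structure, they can be added, revised or discarded without touching the data document or its schema, which is the property the abstract attributes to the external approach.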

  14. SG-ADVISER CNV: copy-number variant annotation and interpretation.

    PubMed

    Erikson, Galina A; Deshpande, Neha; Kesavan, Balachandar G; Torkamani, Ali

    2015-09-01

    Copy-number variants have been associated with a variety of diseases, especially cancer, autism, schizophrenia, and developmental delay. The majority of clinically relevant events occur de novo, necessitating the interpretation of novel events. In this light, we present the Scripps Genome ADVISER CNV annotation pipeline and Web server, which aims to fill the gap between copy number variant detection and interpretation by performing in-depth annotations and functional predictions for copy number variants. The Scripps Genome ADVISER CNV suite includes a Web server interface to a high-performance computing environment for calculations of annotations and a table-based user interface that allows for the execution of numerous annotation-based variant filtration strategies and statistics. The annotation results include details regarding location, impact on the coding portion of genes, allele frequency information (including allele frequencies from the Scripps Wellderly cohort), and overlap information with other reference data sets (including ClinVar, DGV, DECIPHER). A summary variant classification is produced (ADVISER score) based on the American College of Medical Genetics and Genomics scoring guidelines. We demonstrate >90% sensitivity/specificity for detection of pathogenic events. Scripps Genome ADVISER CNV is designed to allow users with no prior bioinformatics expertise to manipulate large volumes of copy-number variant data. Scripps Genome ADVISER CNV is available at http://genomics.scripps.edu/ADVISER/.

  15. Patient Education: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Simmons, Jeannette

    Topics included in this annotated bibliography on patient education are (1) background on development of patient education programs, (2) patient education interventions, (3) references for health professionals, and (4) research and evaluation in patient education. (TA)

  16. Annotated Bibliography on Religious Development.

    ERIC Educational Resources Information Center

    Bucher, Anton A.; Reich, K. Helmut

    1991-01-01

    Presents an annotated bibliography on religious development that covers the areas of psychology and religion, measurement of religiousness, religious development during the life cycle, religious experiences, conversion, religion and morality, and images of God. (Author/BB)

  17. EST-PAC a web package for EST annotation and protein sequence prediction

    PubMed Central

    Strahm, Yvan; Powell, David; Lefèvre, Christophe

    2006-01-01

    With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequence tags (ESTs) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC, a web-oriented multi-platform software package for expressed sequence tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data, and 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results, and management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782

  18. 1968 Oregon timber harvest.

    Treesearch

    Brian R. Wall

    1969-01-01

    Oregon's 1968 timber harvest of 9.74 billion board feet was the largest since 1952, when a record 9.80 billion board feet was produced. Public agencies' harvests increased 25.0 percent in western Oregon and 4.1 percent in eastern Oregon for a total increase of 19.1 percent, 864.9 million board feet above the public harvest in 1967. National Forests had the...

  19. 1967 Oregon timber harvest.

    Treesearch

    Brian R. Wall

    1968-01-01

    Oregon's timber harvest was 8.4 billion board feet in 1967, 6.3 percent below the 1966 harvest. The total private harvest declined 7 percent in 1967 with a 153-million-board-foot (4.3-percent) decrease in western Oregon and a 138-million-board-foot (22.7-percent) drop in eastern Oregon. Forest industries had the greatest decline in production of all owners; their...

  20. 1967 Washington timber harvest.

    Treesearch

    Brian R. Wall

    1968-01-01

    Washington's 1967 timber harvest declined to 5.9 billion board feet, 2.3 percent below the 1966 harvest. The cut on public lands remained about the same as in 1966 with a 6.7-percent increase in public cut in eastern Washington, offsetting a 2.2-percent decrease in western Washington. The Indian lands had the greatest increase in harvest, up 35 million board feet...

  1. 1970 Washington timber harvest.

    Treesearch

    Brian R. Wall

    1971-01-01

    Washington's 1970 timber harvest of 6.46 billion board feet was 7.8 percent below the near record harvest of 7 billion board feet established in 1969. Timber harvests on all public lands declined 13 percent with a 9.0-percent reduction in western Washington and a 22.9-percent drop in eastern Washington. State lands led the decline in public production with a 142-...

  2. DEVA: An extensible ontology-based annotation model for visual document collections

    NASA Astrophysics Data System (ADS)

    Jelmini, Carlo; Marchand-Maillet, Stephane

    2003-01-01

    The description of visual documents is a fundamental aspect of any efficient information management system, but the process of manually annotating large collections of documents is tedious and far from perfect. The need for a generic and extensible annotation model therefore arises. In this paper, we present DEVA, an open, generic and expressive multimedia annotation framework. DEVA is an extension of the Dublin Core specification. The model can represent the semantic content of any visual document. It is described in the ontology language DAML+OIL and can easily be extended with external specialized ontologies, adapting the vocabulary to the given application domain. In parallel, we present the Magritte annotation tool, which is an early prototype that validates the DEVA features. Magritte supports manual annotation of image collections. It is designed with a modular and extensible architecture, which enables the user to dynamically adapt the user interface to specialized ontologies merged into DEVA.

  3. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

    PubMed

    Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

    2010-07-02

    An Illumina flow cell with all eight lanes occupied produces well over a terabyte of images, with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. One can very easily be flooded with such a great volume of textual, unannotated data, irrespective of read quality or size. CASAVA, an optional analysis tool for Illumina sequencing experiments, supports INDEL detection, SNP information, and allele calling. It is therefore of significant value not only to extract from such analysis a measure of gene expression in the form of tag-counts, but also to annotate such reads. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using the jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result is output containing the homology-based functional annotation and the respective gene expression measure, signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, allowing researchers to delve deep into a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data...
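    The tag-count measure described above, how many sequenced reads fall within the genomic range of each functional annotation, can be sketched in a few lines. This is not the TASE implementation (which the abstract says is written in Java with a SQL Server backend); the record layout, the containment rule and all names below are assumptions made for illustration.

```python
def count_tags(reads, annotations):
    """Count, per annotation, the reads lying entirely inside its range.

    reads:       iterable of (chromosome, start, end)
    annotations: iterable of (name, chromosome, start, end)
    Coordinates are treated as inclusive; this convention is an assumption.
    """
    counts = {name: 0 for name, _chrom, _start, _end in annotations}
    for chrom, start, end in reads:
        for name, a_chrom, a_start, a_end in annotations:
            if chrom == a_chrom and a_start <= start and end <= a_end:
                counts[name] += 1
    return counts


# Hypothetical annotations and reads, invented for illustration only.
annotations = [
    ("geneA", "chr1", 100, 500),
    ("geneB", "chr1", 800, 1200),
]
reads = [
    ("chr1", 120, 155),  # inside geneA
    ("chr1", 480, 515),  # overhangs the end of geneA, not counted
    ("chr1", 900, 935),  # inside geneB
    ("chr2", 120, 155),  # different chromosome
    ("chr1", 450, 485),  # inside geneA
]
tag_counts = count_tags(reads, annotations)
```

    The nested loop is quadratic and only suitable for small inputs; for datasets of the size the abstract describes, an indexed structure such as sorted intervals or a database range query would be the more realistic choice.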

  4. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets

    PubMed Central

    2010-01-01

    Background An Illumina flow cell with all eight lanes occupied produces well over a terabyte of images, with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. One can very easily be flooded with such a great volume of textual, unannotated data, irrespective of read quality or size. CASAVA, an optional analysis tool for Illumina sequencing experiments, supports INDEL detection, SNP information, and allele calling. It is therefore of significant value not only to extract from such analysis a measure of gene expression in the form of tag-counts, but also to annotate such reads. Findings We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using the jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result is output containing the homology-based functional annotation and the respective gene expression measure, signifying how many times sequenced reads were found within the genomic ranges of functional annotations. Conclusions TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, allowing researchers to delve deep into a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially...

  5. Avian response to timber harvesting applied experimentally to manage Cerulean Warbler breeding populations

    USGS Publications Warehouse

    Sheehan, James; Wood, Petra Bohall; Buehler, David A.; Keyser, Patrick D.; Larkin, Jeffrey L.; Rodewald, Amanda D.; Wigley, T. Bently; Boves, Than J.; George, Gregory A.; Bakermans, Marja H.; Beachy, Tiffany A.; Evans, Andrea; McDermott, Molly E.; Newell, Felicity L.; Perkins, Kelly A.; White, Matthew

    2014-01-01

    Timber harvesting has been proposed as a management tool to enhance breeding habitat for the Cerulean Warbler (Setophaga cerulea), a declining Neotropical–Nearctic migratory songbird that nests in the canopy of mature eastern deciduous forests. To evaluate how this single-species management focus might fit within an ecologically based management approach for multiple forest birds, we performed a manipulative experiment using four treatments (three intensities of timber harvests and an unharvested control) at each of seven study areas within the core Cerulean Warbler breeding range. We collected pre-harvest (one year) and post-harvest (four years) data on the territory density of Cerulean Warblers and six additional focal species, avian community relative abundance, and several key habitat variables. We evaluated the avian and habitat responses across the 3–32 m2 ha−1 residual basal area (RBA) range of the treatments. Cerulean Warbler territory density peaked with medium RBA (∼16 m2 ha−1). In contrast, territory densities of the other focal species were negatively related to RBA (e.g., Hooded Warbler [Setophaga citrina]), were positively related to RBA (e.g., Ovenbird [Seiurus aurocapilla]), or were not sensitive to this measure (Scarlet Tanager [Piranga olivacea]). Some species (e.g., Hooded Warbler) increased with time post-treatment and were likely tied to a developing understory, whereas declines (e.g., Ovenbird) were immediate. Relative abundance responses of additional species were consistent with the territory density responses of the focal species. Across the RBA gradient, greatest separation in the avian community was between early successional forest species (e.g., Yellow-breasted Chat [Icteria virens]) and closed-canopy mature forest species (e.g., Ovenbird), with the Cerulean Warbler and other species located intermediate to these two extremes. Overall, our results suggest that harvests within 10–20 m2 ha−1 RBA yield the largest

  6. A web-based video annotation system for crowdsourcing surveillance videos

    NASA Astrophysics Data System (ADS)

    Gadgil, Neeraj J.; Tahboub, Khalid; Kirsh, David; Delp, Edward J.

    2014-03-01

    Video surveillance systems are of great value for preventing threats and for identifying and investigating criminal activities. Manual analysis of a huge amount of video data from several cameras over a long period of time often becomes impracticable. The use of automatic detection methods can be challenging when the video contains many objects with complex motion and occlusions. Crowdsourcing has been proposed as an effective method for utilizing human intelligence to perform several tasks. Our system provides a platform for the annotation of surveillance video in an organized and controlled way. One can monitor a surveillance system using a set of tools such as training modules, roles and labels, and task management. This system can be used in a real-time streaming mode to detect any potential threats or as an investigative tool to analyze past events. Annotators can annotate video contents assigned to them for suspicious activity or criminal acts. First responders are then able to view the collective annotations and receive email alerts about a newly reported incident. They can also keep track of the annotators' training performance, manage their activities and reward their success. By providing this system, the process of video analysis is made more efficient.

  7. Butternut (Juglans cinerea) annotated bibliography.

    Treesearch

    M.E. Ostry; M.J. Moore; S.A.N. Worrall

    2003-01-01

    An annotated bibliography of the major literature related to butternut (Juglans cinerea) from 1890 to 2002. Includes 230 citations and a topical index. Topics include diseases, conservation, genetics, insect pests, silvics, nut production, propagation, silviculture, and utilization.

  8. Intellectuals in China: Annotations.

    ERIC Educational Resources Information Center

    Parker, Franklin

    This annotated bibliography of 72 books, journal articles, government reports, and newspaper feature stories focuses on the changing role of intellectuals in China, primarily since the 1949 Chinese Revolution. Particular attention is given to the Hundred Flowers Movement of 1957 and the Cultural Revolution. Most of the cited works are in English,…

  9. Final Report of the HyPER Harvester Project

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Prasad, Nadipuram R; Ranade, Satishkuma J

    The HyPER Harvester Project resulted in the first full-scale design, fabrication and testing of two vertical-axis harvester prototypes at the Elephant Butte Irrigation District Drop 8 Station in Southern New Mexico. The design, followed by fabrication, and deployment clearly demonstrated the feasibility to manufacture and deploy harvester prototypes at low cost. While several issues common to irrigation canal systems have to be overcome, the electromechanical performance of the integrated turbine-generator system demonstrated proof-of-concept. Proof-of-concept includes 1) feasibility for using additive manufacturing techniques to fabricate Carbon-composite turbine-generator components at low cost, 2) ease of transportation and deployment, and 3) the harvester performance. The benefits of modularity were demonstrated in terms of rapid deployment at the Drop 8 Station. Scalability and adaptability were proven in terms of the custom-fitting characteristics that enabled rapid deployment. While keeping the same shape and form, the harvester can be easily adapted to any drop environment. Self-supporting ability makes the harvester design minimally intrusive on existing structures. There are two technical challenges ahead that have to be addressed. Irregular flow patterns in canal flow induce vertical oscillations due to pressure change across the impeller. Despite the nosecone in conventional Kaplan turbine design that ordinarily dampens oscillations, an effective coupling design is required to eliminate the hydrodynamic effect on the generating system. In arid areas where tumbleweed is present, a robust design to prevent trash entering the drop is required. The compact shape and form have an aesthetic appearance and appear to illustrate an environmentally friendly attribute. The systems-engineered design enables rapid manufacturing and assembly of desired size units that can be deployed at sites along U.S. waterways as small hydropower plants. There

  10. Automatic medical image annotation and keyword-based image retrieval using relevance feedback.

    PubMed

    Ko, Byoung Chul; Lee, JiHyeon; Nam, Jae-Yeal

    2012-08-01

    This paper presents a novel multiple-keyword annotation method for medical images, keyword-based medical image retrieval, and a relevance feedback method for enhancing image retrieval performance. For semantic keyword annotation, this study proposes a novel medical image classification method combining local wavelet-based center symmetric-local binary patterns with random forests. For keyword-based image retrieval, our retrieval system uses the confidence score that is assigned to each annotated keyword by combining probabilities of random forests with a predefined body relation graph. To overcome the limitation of keyword-based image retrieval, we combine our image retrieval system with a relevance feedback mechanism based on visual features and a pattern classifier. Compared with other annotation and relevance feedback algorithms, the proposed method shows both improved annotation performance and accurate retrieval results.

  11. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST)

    PubMed Central

    Overbeek, Ross; Olson, Robert; Pusch, Gordon D.; Olsen, Gary J.; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Parrello, Bruce; Shukla, Maulik; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang; Stevens, Rick

    2014-01-01

    In 2004, the SEED (http://pubseed.theseed.org/) was created to provide consistent and accurate genome annotations across thousands of genomes and as a platform for discovering and developing de novo annotations. The SEED is a constantly updated integration of genomic data with a genome database, web front end, API and server scripts. It is used by many scientists for predicting gene functions and discovering new pathways. In addition to being a powerful database for bioinformatics research, the SEED also houses subsystems (collections of functionally related protein families) and their derived FIGfams (protein families), which represent the core of the RAST annotation engine (http://rast.nmpdr.org/). When a new genome is submitted to RAST, genes are called and their annotations are made by comparison to the FIGfam collection. If the genome is made public, it is then housed within the SEED and its proteins populate the FIGfam collection. This annotation cycle has proven to be a robust and scalable solution to the problem of annotating the exponentially increasing number of genomes. To date, >12 000 users worldwide have annotated >60 000 distinct genomes using RAST. Here we describe the interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources. PMID:24293654

  12. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).

    PubMed

    Overbeek, Ross; Olson, Robert; Pusch, Gordon D; Olsen, Gary J; Davis, James J; Disz, Terry; Edwards, Robert A; Gerdes, Svetlana; Parrello, Bruce; Shukla, Maulik; Vonstein, Veronika; Wattam, Alice R; Xia, Fangfang; Stevens, Rick

    2014-01-01

    In 2004, the SEED (http://pubseed.theseed.org/) was created to provide consistent and accurate genome annotations across thousands of genomes and as a platform for discovering and developing de novo annotations. The SEED is a constantly updated integration of genomic data with a genome database, web front end, API and server scripts. It is used by many scientists for predicting gene functions and discovering new pathways. In addition to being a powerful database for bioinformatics research, the SEED also houses subsystems (collections of functionally related protein families) and their derived FIGfams (protein families), which represent the core of the RAST annotation engine (http://rast.nmpdr.org/). When a new genome is submitted to RAST, genes are called and their annotations are made by comparison to the FIGfam collection. If the genome is made public, it is then housed within the SEED and its proteins populate the FIGfam collection. This annotation cycle has proven to be a robust and scalable solution to the problem of annotating the exponentially increasing number of genomes. To date, >12 000 users worldwide have annotated >60 000 distinct genomes using RAST. Here we describe the interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources.

  13. Rural Transportation: An Annotated Bibliography

    DOT National Transportation Integrated Search

    1999-03-01

    This bibliography is downloadable in MS Word format. The annotated bibliography is intended to provide an overview of different aspects of transportation in rural America. Emphasis is on those studies published within the last 10 years, but som...

  14. Generation of an annotated reference standard for vaccine adverse event reports.

    PubMed

    Foster, Matthew; Pandey, Abhishek; Kreimeyer, Kory; Botsis, Taxiarchis

    2018-07-05

    As part of a collaborative project between the US Food and Drug Administration (FDA) and the Centers for Disease Control and Prevention for the development of a web-based natural language processing (NLP) workbench, we created a corpus of 1000 Vaccine Adverse Event Reporting System (VAERS) reports annotated for 36,726 clinical features, 13,365 temporal features, and 22,395 clinical-temporal links. This paper describes the final corpus, as well as the methodology used to create it, so that clinical NLP researchers outside FDA can evaluate the utility of the corpus to aid their own work. The creation of this standard went through four phases: pre-training, pre-production, production-clinical feature annotation, and production-temporal annotation. The pre-production phase used a double annotation followed by adjudication strategy to refine and finalize the annotation model, while the production phases followed a single annotation strategy to maximize the number of reports in the corpus. An analysis of 30 reports randomly selected as part of a quality control assessment yielded accuracies of 0.97, 0.96, and 0.83 for clinical features, temporal features, and clinical-temporal associations, respectively, which speaks to the quality of the corpus. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. 1969 Oregon timber harvest.

    Treesearch

    Brian R. Wall

    1970-01-01

    The 1969 Oregon timber harvest of 9.15 billion board feet was 6.1 percent below the 1968 16-year peak of 9.74 billion board feet. In western Oregon, the 1969 harvest was down 9.1 percent with public production and private production off 10.8 and 7.2 percent, respectively. By contrast, log harvest in eastern Oregon rose 5 percent, with private production up 13.2 percent...

  16. 1966 Oregon timber harvest.

    Treesearch

    Brian R. Wall

    1967-01-01

    The 1966 Oregon timber harvest totaled 8.9 billion board feet, 5 percent less than the harvest in 1965. During 1966, the total public timber harvest declined 10 percent to 4.8 billion board feet. The uncut volume of public timber under contract at the end of 1966 was 7.6 billion board feet, up 1.3 billion board feet from 1965's year end total. National Forest...

  17. 1966 Washington timber harvest.

    Treesearch

    Brian R. Wall

    1967-01-01

    The 1966 Washington timber harvest of 6.1 billion board feet was 6.8 percent below the 1965 level. This was the first decline since 1961. In part, the lower harvest in 1966 was due to completion of salvage logging of the 1962 blowdown. The volume of dead timber salvaged in 1966 was only 6 percent of the total, compared with 15 percent in 1965. The live timber harvest...

  18. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-04

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Scripps Genome ADVISER: Annotation and Distributed Variant Interpretation SERver

    PubMed Central

    Pham, Phillip H.; Shipman, William J.; Erikson, Galina A.; Schork, Nicholas J.; Torkamani, Ali

    2015-01-01

    Interpretation of human genomes is a major challenge. We present the Scripps Genome ADVISER (SG-ADVISER) suite, which aims to fill the gap between data generation and genome interpretation by performing holistic, in-depth annotations and functional predictions on all variant types and effects. The SG-ADVISER suite includes a de-identification tool, a variant annotation web-server, and a user interface for inheritance and annotation-based filtration. SG-ADVISER allows users with no bioinformatics expertise to manipulate large volumes of variant data with ease, without the need to download large reference databases, install software, or use a command line interface. SG-ADVISER is freely available at genomics.scripps.edu/ADVISER. PMID:25706643

  20. Annotation: The Savant Syndrome

    ERIC Educational Resources Information Center

    Heaton, Pamela; Wallace, Gregory L.

    2004-01-01

    Background: Whilst interest has focused on the origin and nature of the savant syndrome for over a century, it is only within the past two decades that empirical group studies have been carried out. Methods: The following annotation briefly reviews relevant research and also attempts to address outstanding issues in this research area.…