Science.gov

Sample records for rapid virulence annotation

  1. Systematic annotation and analysis of "virmugens"-virulence factors whose mutants can be used as live attenuated vaccines.

    PubMed

    Racz, Rebecca; Chung, Monica; Xiang, Zuoshuang; He, Yongqun

    2013-01-21

    Live attenuated vaccines are usually generated by mutation of genes encoding virulence factors. "Virmugen" is coined here to represent a gene that encodes for a virulent factor of a pathogen and has been proven feasible in animal models to make a live attenuated vaccine by knocking out this gene. Not all virulence factors are virmugens. VirmugenDB is a web-based virmugen database (http://www.violinet.org/virmugendb). Currently, VirmugenDB includes 225 virmugens that have been verified to be valuable for vaccine development against 57 bacterial, viral, and protozoan pathogens. Bioinformatics analysis has revealed significant patterns in virmugens. For example, 10 Gram-negative and 1 Gram-positive bacterial aroA genes are virmugens. A sequence analysis has revealed at least 50% of identities in the protein sequences of the 10 Gram-negative bacterial aroA virmugens. As a pathogen case study, Brucella virmugens were analyzed. Out of 15 verified Brucella virmugens, 6 are related to carbohydrate or nucleotide transport and metabolism, and 2 involving cell membrane biogenesis. In addition, 54 virmugens from 24 viruses and 12 virmugens from 4 parasites are also stored in VirmugenDB. Virmugens tend to involve metabolism of nutrients (e.g., amino acids, carbohydrates, and nucleotides) and cell membrane formation. Host genes whose expressions were regulated by virmugen mutation vaccines or wild type virulent pathogens have also been annotated and systematically compared. The bioinformatics annotation and analysis of virmugens helps to elucidate enriched virmugen profiles and the mechanisms of protective immunity, and further supports rational vaccine design.

  2. Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

    PubMed

    Ramkissoon, Kevin R; Miller, Jennifer K; Ojha, Sunil; Watson, Douglas S; Bomar, Martha G; Galande, Amit K; Shearer, Alexander G

    2013-01-01

    The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.

  3. Screen of Non-annotated Small Secreted Proteins of Pseudomonas syringae Reveals a Virulence Factor That Inhibits Tomato Immune Proteases

    PubMed Central

    Shindo, Takayuki; Kaschani, Farnusch; Kovács, Judit; Tian, Fang; Kourelis, Jiorgos; Hong, Tram Ngoc; Colby, Tom; Shabab, Mohammed; Chawla, Rohini; Kumari, Selva; Ilyas, Muhammad; Hörger, Anja C.; Alfano, James R.; van der Hoorn, Renier A. L.

    2016-01-01

    Pseudomonas syringae pv. tomato DC3000 (PtoDC3000) is an extracellular model plant pathogen, yet its potential to produce secreted effectors that manipulate the apoplast has been under investigated. Here we identified 131 candidate small, secreted, non-annotated proteins from the PtoDC3000 genome, most of which are common to Pseudomonas species and potentially expressed during apoplastic colonization. We produced 43 of these proteins through a custom-made gateway-compatible expression system for extracellular bacterial proteins, and screened them for their ability to inhibit the secreted immune protease C14 of tomato using competitive activity-based protein profiling. This screen revealed C14-inhibiting protein-1 (Cip1), which contains motifs of the chagasin-like protease inhibitors. Cip1 mutants are less virulent on tomato, demonstrating the importance of this effector in apoplastic immunity. Cip1 also inhibits immune protease Pip1, which is known to suppress PtoDC3000 infection, but has a lower affinity for its close homolog Rcr3, explaining why this protein is not recognized in tomato plants carrying the Cf-2 resistance gene, which uses Rcr3 as a co-receptor to detect pathogen-derived protease inhibitors. Thus, this approach uncovered a protease inhibitor of P. syringae, indicating that also P. syringae secretes effectors that selectively target apoplastic host proteases of tomato, similar to tomato pathogenic fungi, oomycetes and nematodes. PMID:27603016

  4. Screen of Non-annotated Small Secreted Proteins of Pseudomonas syringae Reveals a Virulence Factor That Inhibits Tomato Immune Proteases.

    PubMed

    Shindo, Takayuki; Kaschani, Farnusch; Yang, Fan; Kovács, Judit; Tian, Fang; Kourelis, Jiorgos; Hong, Tram Ngoc; Colby, Tom; Shabab, Mohammed; Chawla, Rohini; Kumari, Selva; Ilyas, Muhammad; Hörger, Anja C; Alfano, James R; van der Hoorn, Renier A L

    2016-09-01

    Pseudomonas syringae pv. tomato DC3000 (PtoDC3000) is an extracellular model plant pathogen, yet its potential to produce secreted effectors that manipulate the apoplast has been under investigated. Here we identified 131 candidate small, secreted, non-annotated proteins from the PtoDC3000 genome, most of which are common to Pseudomonas species and potentially expressed during apoplastic colonization. We produced 43 of these proteins through a custom-made gateway-compatible expression system for extracellular bacterial proteins, and screened them for their ability to inhibit the secreted immune protease C14 of tomato using competitive activity-based protein profiling. This screen revealed C14-inhibiting protein-1 (Cip1), which contains motifs of the chagasin-like protease inhibitors. Cip1 mutants are less virulent on tomato, demonstrating the importance of this effector in apoplastic immunity. Cip1 also inhibits immune protease Pip1, which is known to suppress PtoDC3000 infection, but has a lower affinity for its close homolog Rcr3, explaining why this protein is not recognized in tomato plants carrying the Cf-2 resistance gene, which uses Rcr3 as a co-receptor to detect pathogen-derived protease inhibitors. Thus, this approach uncovered a protease inhibitor of P. syringae, indicating that also P. syringae secretes effectors that selectively target apoplastic host proteases of tomato, similar to tomato pathogenic fungi, oomycetes and nematodes. PMID:27603016

  5. The evolutionary analysis of "orphans" from the Drosophila genome identifies rapidly diverging and incorrectly annotated genes.

    PubMed

    Schmid, K J; Aquadro, C F

    2001-10-01

    In genome projects of eukaryotic model organisms, a large number of novel genes of unknown function and evolutionary history ("orphans") are being identified. Since many orphans have no known homologs in distant species, it is unclear whether they are restricted to certain taxa or evolve rapidly, either because of a lack of constraints or positive Darwinian selection. Here we use three criteria for the selection of putatively rapidly evolving genes from a single sequence of Drosophila melanogaster. Thirteen candidate genes were chosen from the Adh region on the second chromosome and 1 from the tip of the X chromosome. We succeeded in obtaining sequence from 6 of these in the closely related species D. simulans and D. yakuba. Only 1 of the 6 genes showed a large number of amino acid replacements and in-frame insertions/deletions. A population survey of this gene suggests that its rapid evolution is due to the fixation of many neutral or nearly neutral mutations. Two other genes showed "normal" levels of divergence between species. Four genes had insertions/deletions that destroy the putative reading frame within exons, suggesting that these exons have been incorrectly annotated. The evolutionary analysis of orphan genes in closely related species is useful for the identification of both rapidly evolving and incorrectly annotated genes.

  6. Rapid Bacterial Identification, Resistance, Virulence and Type Profiling using Selected Reaction Monitoring Mass Spectrometry.

    PubMed

    Charretier, Yannick; Dauwalder, Olivier; Franceschi, Christine; Degout-Charmette, Elodie; Zambardi, Gilles; Cecchini, Tiphaine; Bardet, Chloe; Lacoux, Xavier; Dufour, Philippe; Veron, Laurent; Rostaing, Hervé; Lanet, Veronique; Fortin, Tanguy; Beaulieu, Corinne; Perrot, Nadine; Dechaume, Dominique; Pons, Sylvie; Girard, Victoria; Salvador, Arnaud; Durand, Géraldine; Mallard, Frédéric; Theretz, Alain; Broyer, Patrick; Chatellier, Sonia; Gervasi, Gaspard; Van Nuenen, Marc; Roitsch, Carolyn Ann; Van Belkum, Alex; Lemoine, Jérôme; Vandenesch, François; Charrier, Jean-Philippe

    2015-01-01

    Mass spectrometry (MS) in Selected Reaction Monitoring (SRM) mode is proposed for in-depth characterisation of microorganisms in a multiplexed analysis. Within 60-80 minutes, the SRM method performs microbial identification (I), antibiotic-resistance detection (R), virulence assessment (V) and it provides epidemiological typing information (T). This SRM application is illustrated by the analysis of the human pathogen Staphylococcus aureus, demonstrating its promise for rapid characterisation of bacteria from positive blood cultures of sepsis patients. PMID:26350205

  7. NDER: A novel web application using annotated whole slide images for rapid improvements in human pattern recognition

    PubMed Central

    Reder, Nicholas P.; Glasser, Daniel; Dintzis, Suzanne M.; Rendi, Mara H.; Garcia, Rochelle L.; Henriksen, Jonathan C.; Kilgore, Mark R.

    2016-01-01

    Context: Whole-slide images (WSIs) present a rich source of information for education, training, and quality assurance. However, they are often used in a fashion similar to glass slides rather than in novel ways that leverage the advantages of WSI. We have created a pipeline to transform annotated WSI into pattern recognition training, and quality assurance web application called novel diagnostic electronic resource (NDER). Aims: Create an efficient workflow for extracting annotated WSI for use by NDER, an attractive web application that provides high-throughput training. Materials and Methods: WSI were annotated by a resident and classified into five categories. Two methods of extracting images and creating image databases were compared. Extraction Method 1: Manual extraction of still images and validation of each image by four breast pathologists. Extraction Method 2: Validation of annotated regions on the WSI by a single experienced breast pathologist and automated extraction of still images tagged by diagnosis. The extracted still images were used by NDER. NDER briefly displays an image, requires users to classify the image after time has expired, then gives users immediate feedback. Results: The NDER workflow is efficient: annotation of a WSI requires 5 min and validation by an expert pathologist requires An additional one to 2 min. The pipeline is highly automated, with only annotation and validation requiring human input. NDER effectively displays hundreds of high-quality, high-resolution images and provides immediate feedback to users during a 30 min session. Conclusions: NDER efficiently uses annotated WSI to rapidly increase pattern recognition and evaluate for diagnostic proficiency. PMID:27563490

  8. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST)

    PubMed Central

    Overbeek, Ross; Olson, Robert; Pusch, Gordon D.; Olsen, Gary J.; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Parrello, Bruce; Shukla, Maulik; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang; Stevens, Rick

    2014-01-01

    In 2004, the SEED (http://pubseed.theseed.org/) was created to provide consistent and accurate genome annotations across thousands of genomes and as a platform for discovering and developing de novo annotations. The SEED is a constantly updated integration of genomic data with a genome database, web front end, API and server scripts. It is used by many scientists for predicting gene functions and discovering new pathways. In addition to being a powerful database for bioinformatics research, the SEED also houses subsystems (collections of functionally related protein families) and their derived FIGfams (protein families), which represent the core of the RAST annotation engine (http://rast.nmpdr.org/). When a new genome is submitted to RAST, genes are called and their annotations are made by comparison to the FIGfam collection. If the genome is made public, it is then housed within the SEED and its proteins populate the FIGfam collection. This annotation cycle has proven to be a robust and scalable solution to the problem of annotating the exponentially increasing number of genomes. To date, >12 000 users worldwide have annotated >60 000 distinct genomes using RAST. Here we describe the interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources. PMID:24293654

  9. STRAP PTM: Software Tool for Rapid Annotation and Differential Comparison of Protein Post-Translational Modifications

    PubMed Central

    Spencer, Jean L.; Bhatia, Vivek N.; Whelan, Stephen A.; Costello, Catherine E.

    2014-01-01

    The identification of protein post-translational modifications (PTMs) is an increasingly important component of proteomics and biomarker discovery, but very few tools exist for performing fast and easy characterization of global PTM changes and differential comparison of PTMs across groups of data obtained from liquid chromatography-tandem mass spectrometry experiments. STRAP PTM (Software Tool for Rapid Annotation of Proteins: Post-Translational Modification edition) is a program that was developed to facilitate the characterization of PTMs using spectral counting and a novel scoring algorithm to accelerate the identification of differential PTMs from complex data sets. The software facilitates multi-sample comparison by collating, scoring, and ranking PTMs and by summarizing data visually. The freely available software (beta release) installs on a PC and processes data in protXML format obtained from files parsed through the Trans-Proteomic Pipeline. The easy-to-use interface allows examination of results at protein, peptide, and PTM levels, and the overall design offers tremendous flexibility that provides proteomics insight beyond simple assignment and counting. PMID:25422678

  10. Rapid-Viability PCR Method for Detection of Live, Virulent Bacillus anthracis in Environmental Samples ▿

    PubMed Central

    Létant, Sonia E.; Murphy, Gloria A.; Alfaro, Teneile M.; Avila, Julie R.; Kane, Staci R.; Raber, Ellen; Bunt, Thomas M.; Shah, Sanjiv R.

    2011-01-01

    In the event of a biothreat agent release, hundreds of samples would need to be rapidly processed to characterize the extent of contamination and determine the efficacy of remediation activities. Current biological agent identification and viability determination methods are both labor- and time-intensive such that turnaround time for confirmed results is typically several days. In order to alleviate this issue, automated, high-throughput sample processing methods were developed in which real-time PCR analysis is conducted on samples before and after incubation. The method, referred to as rapid-viability (RV)-PCR, uses the change in cycle threshold after incubation to detect the presence of live organisms. In this article, we report a novel RV-PCR method for detection of live, virulent Bacillus anthracis, in which the incubation time was reduced from 14 h to 9 h, bringing the total turnaround time for results below 15 h. The method incorporates a magnetic bead-based DNA extraction and purification step prior to PCR analysis, as well as specific real-time PCR assays for the B. anthracis chromosome and pXO1 and pXO2 plasmids. A single laboratory verification of the optimized method applied to the detection of virulent B. anthracis in environmental samples was conducted and showed a detection level of 10 to 99 CFU/sample with both manual and automated RV-PCR methods in the presence of various challenges. Experiments exploring the relationship between the incubation time and the limit of detection suggest that the method could be further shortened by an additional 2 to 3 h for relatively clean samples. PMID:21764960

  11. Rapid differentiation of Ralstonia solanacearum avirulent and virulent strains by cell fractioning of an isolate using high performance liquid chromatography.

    PubMed

    Zheng, Xuefang; Zhu, Yujing; Liu, Bo; Yu, Qian; Lin, Naiquan

    2016-01-01

    Ralstonia solanacearum is one of the most destructive plant bacterial pathogens worldwide. The population dynamics and genetic stability are important issues, especially when an avirulent strain is used for biocontrol. In this study, we developed a rapid method to differentiate the virulent and avirulent strains of R. solanacearum and to predict the biocontrol efficiency of an avirulent strain using high performance liquid chromatography (HPLC). Three chromatographic peaks P1, P2 and P3 were observed on the HPLC spectra among 68 avirulent and 28 virulent R. solanacearum strains. Based on the HPLC peaks, 96 strains total were assigned to three categories. For avirulent strains, the intense peak is P1, while for virulent strains, P3 is the majority. Based on the HLPC spectra of R. solanacearum strains, a chromatography titer index (CTI) was established as CTIi = Si/(S1+S2+S3) × 100% (i represents an individual HPLC peak; S1, S2 and S3 represent peak areas of P1, P2 and P3, respectively). The avirulent strains had high values of CTI1 ranging from 63.6 to 100.0%, while the virulent strains displayed high values of CTI3 ranging from 90.2 to 100.0%. Biological inoculation studies of 68 avirulent strains revealed that the biocontrol efficacy was the best when CTI1 = 100%. The purity and genetic stability of R. solanacearum strains were confirmed in the P1 fraction of avirulent strain FJAT-1957 and P3 fraction of virulent strain FJAT-1925 after 30 generations of consecutive subculture. These results confirmed that fractioning by HPLC and their deduced CTI can be used for rapid and efficient evaluation and prediction of an isolate of R. solanacearum. To the best of our knowledge, this is the first report that HPLC fractioning can be used for rapid differentiation of virulent and avirulent strains of R. solanacearum. PMID:26606869

  12. Rapid differentiation of Ralstonia solanacearum avirulent and virulent strains by cell fractioning of an isolate using high performance liquid chromatography.

    PubMed

    Zheng, Xuefang; Zhu, Yujing; Liu, Bo; Yu, Qian; Lin, Naiquan

    2016-01-01

    Ralstonia solanacearum is one of the most destructive plant bacterial pathogens worldwide. The population dynamics and genetic stability are important issues, especially when an avirulent strain is used for biocontrol. In this study, we developed a rapid method to differentiate the virulent and avirulent strains of R. solanacearum and to predict the biocontrol efficiency of an avirulent strain using high performance liquid chromatography (HPLC). Three chromatographic peaks P1, P2 and P3 were observed on the HPLC spectra among 68 avirulent and 28 virulent R. solanacearum strains. Based on the HPLC peaks, 96 strains total were assigned to three categories. For avirulent strains, the intense peak is P1, while for virulent strains, P3 is the majority. Based on the HLPC spectra of R. solanacearum strains, a chromatography titer index (CTI) was established as CTIi = Si/(S1+S2+S3) × 100% (i represents an individual HPLC peak; S1, S2 and S3 represent peak areas of P1, P2 and P3, respectively). The avirulent strains had high values of CTI1 ranging from 63.6 to 100.0%, while the virulent strains displayed high values of CTI3 ranging from 90.2 to 100.0%. Biological inoculation studies of 68 avirulent strains revealed that the biocontrol efficacy was the best when CTI1 = 100%. The purity and genetic stability of R. solanacearum strains were confirmed in the P1 fraction of avirulent strain FJAT-1957 and P3 fraction of virulent strain FJAT-1925 after 30 generations of consecutive subculture. These results confirmed that fractioning by HPLC and their deduced CTI can be used for rapid and efficient evaluation and prediction of an isolate of R. solanacearum. To the best of our knowledge, this is the first report that HPLC fractioning can be used for rapid differentiation of virulent and avirulent strains of R. solanacearum.

  13. Virulence Determination

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This chapter reviews the in vitro and in vivo assays that are available for determination of pathogenic potential of Listeria monocytogenes bacteria, highlighting the value of using multiplex PCR for rapid and accurate assessment of listerial virulence....

  14. Non-thermal Plasma Exposure Rapidly Attenuates Bacterial AHL-Dependent Quorum Sensing and Virulence

    PubMed Central

    Flynn, Padrig B.; Busetti, Alessandro; Wielogorska, Ewa; Chevallier, Olivier P.; Elliott, Christopher T.; Laverty, Garry; Gorman, Sean P.; Graham, William G.; Gilmore, Brendan F.

    2016-01-01

    The antimicrobial activity of atmospheric pressure non-thermal plasma has been exhaustively characterised, however elucidation of the interactions between biomolecules produced and utilised by bacteria and short plasma exposures are required for optimisation and clinical translation of cold plasma technology. This study characterizes the effects of non-thermal plasma exposure on acyl homoserine lactone (AHL)-dependent quorum sensing (QS). Plasma exposure of AHLs reduced the ability of such molecules to elicit a QS response in bacterial reporter strains in a dose-dependent manner. Short exposures (30–60 s) produce of a series of secondary compounds capable of eliciting a QS response, followed by the complete loss of AHL-dependent signalling following longer exposures. UPLC-MS analysis confirmed the time-dependent degradation of AHL molecules and their conversion into a series of by-products. FT-IR analysis of plasma-exposed AHLs highlighted the appearance of an OH group. In vivo assessment of the exposure of AHLs to plasma was examined using a standard in vivo model. Lettuce leaves injected with the rhlI/lasI mutant PAO-MW1 alongside plasma treated N-butyryl-homoserine lactone and n-(3-oxo-dodecanoyl)-homoserine lactone, exhibited marked attenuation of virulence. This study highlights the capacity of atmospheric pressure non-thermal plasma to modify and degrade AHL autoinducers thereby attenuating QS-dependent virulence in P. aeruginosa. PMID:27242335

  15. Non-thermal Plasma Exposure Rapidly Attenuates Bacterial AHL-Dependent Quorum Sensing and Virulence.

    PubMed

    Flynn, Padrig B; Busetti, Alessandro; Wielogorska, Ewa; Chevallier, Olivier P; Elliott, Christopher T; Laverty, Garry; Gorman, Sean P; Graham, William G; Gilmore, Brendan F

    2016-05-31

    The antimicrobial activity of atmospheric pressure non-thermal plasma has been exhaustively characterised, however elucidation of the interactions between biomolecules produced and utilised by bacteria and short plasma exposures are required for optimisation and clinical translation of cold plasma technology. This study characterizes the effects of non-thermal plasma exposure on acyl homoserine lactone (AHL)-dependent quorum sensing (QS). Plasma exposure of AHLs reduced the ability of such molecules to elicit a QS response in bacterial reporter strains in a dose-dependent manner. Short exposures (30-60 s) produce of a series of secondary compounds capable of eliciting a QS response, followed by the complete loss of AHL-dependent signalling following longer exposures. UPLC-MS analysis confirmed the time-dependent degradation of AHL molecules and their conversion into a series of by-products. FT-IR analysis of plasma-exposed AHLs highlighted the appearance of an OH group. In vivo assessment of the exposure of AHLs to plasma was examined using a standard in vivo model. Lettuce leaves injected with the rhlI/lasI mutant PAO-MW1 alongside plasma treated N-butyryl-homoserine lactone and n-(3-oxo-dodecanoyl)-homoserine lactone, exhibited marked attenuation of virulence. This study highlights the capacity of atmospheric pressure non-thermal plasma to modify and degrade AHL autoinducers thereby attenuating QS-dependent virulence in P. aeruginosa.

  16. Non-thermal Plasma Exposure Rapidly Attenuates Bacterial AHL-Dependent Quorum Sensing and Virulence.

    PubMed

    Flynn, Padrig B; Busetti, Alessandro; Wielogorska, Ewa; Chevallier, Olivier P; Elliott, Christopher T; Laverty, Garry; Gorman, Sean P; Graham, William G; Gilmore, Brendan F

    2016-01-01

    The antimicrobial activity of atmospheric pressure non-thermal plasma has been exhaustively characterised, however elucidation of the interactions between biomolecules produced and utilised by bacteria and short plasma exposures are required for optimisation and clinical translation of cold plasma technology. This study characterizes the effects of non-thermal plasma exposure on acyl homoserine lactone (AHL)-dependent quorum sensing (QS). Plasma exposure of AHLs reduced the ability of such molecules to elicit a QS response in bacterial reporter strains in a dose-dependent manner. Short exposures (30-60 s) produce of a series of secondary compounds capable of eliciting a QS response, followed by the complete loss of AHL-dependent signalling following longer exposures. UPLC-MS analysis confirmed the time-dependent degradation of AHL molecules and their conversion into a series of by-products. FT-IR analysis of plasma-exposed AHLs highlighted the appearance of an OH group. In vivo assessment of the exposure of AHLs to plasma was examined using a standard in vivo model. Lettuce leaves injected with the rhlI/lasI mutant PAO-MW1 alongside plasma treated N-butyryl-homoserine lactone and n-(3-oxo-dodecanoyl)-homoserine lactone, exhibited marked attenuation of virulence. This study highlights the capacity of atmospheric pressure non-thermal plasma to modify and degrade AHL autoinducers thereby attenuating QS-dependent virulence in P. aeruginosa. PMID:27242335

  17. Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

    SciTech Connect

    Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana; Purvine, Samuel O.; Sanford, James; Monroe, Matthew E.; Brewer, Heather M.; Payne, Samuel H.; Ansong, Charles; Frank, Bryan C.; Smith, Richard D.; Peterson, Scott; Motin, Vladimir L.; Adkins, Joshua N.

    2012-03-27

    Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus

  18. Rapid and Specific Detection, Molecular Epidemiology, and Experimental Virulence of the O16 Subgroup within Escherichia coli Sequence Type 131

    PubMed Central

    Clermont, Olivier; Johnston, Brian; Clabots, Connie; Tchesnokova, Veronika; Sokurenko, Evgeni; Junka, Adam F.; Maczynska, Beata; Denamur, Erick

    2014-01-01

    Escherichia coli sequence type 131 (ST131), a widely disseminated multidrug-resistant extraintestinal pathogen, typically exhibits serotype O25b:H4. However, certain ST131 isolates exhibit serotype O16:H5 and derive from a phylogenetic clade that is distinct from the classic O25b:H4 ST131 clade. Both clades are assigned to ST131 by the Achtman multilocus sequence typing (MLST) system and a screening PCR assay that targets ST131-specific sequence polymorphisms in the mdh and gyrB genes. However, they are classified as separate STs by the Pasteur Institute MLST system, and an ST131 PCR method that targets the O25b rfb region and an ST131-specific polymorphism in pabB detects only the O25b-associated clade. Here, we describe a novel PCR-based method that allows for rapid and specific detection of the O16-associated ST131 clade. The clade members uniformly contained allele 41 of fimH (type 1 fimbrial adhesin) and a narrow range of alleles of gyrA and parC (fluoroquinolone target genes). The virulence genotypes of the clade members resembled those of classic O25b:H4 ST131 isolates; representative isolates were variably lethal in a mouse subcutaneous sepsis model. Several pulsotypes spanned multiple sources (adults, children, pets, and human fecal samples) and locales. An analysis of recent clinical E. coli collections showed that the O16 ST131 clade is globally distributed, accounts for 1 to 5% of E. coli isolates overall, and, when compared with other ST131 isolates, it is associated with resistance to ampicillin, gentamicin, and trimethoprim-sulfamethoxazole and with susceptibility to fluoroquinolones and extended-spectrum cephalosporins. Attention to this O16-associated ST131 clade, which is facilitated by our novel PCR-based assay, is warranted in future epidemiological studies of ST131 and, conceivably, in clinical applications. PMID:24501035

  19. Rapid detection of virulence-associated genes in avian pathogenic Escherichia coli by multiplex polymerase chain reaction.

    PubMed

    Ewers, Christa; Janssen, Traute; Kiessling, Sabine; Philipp, Hans-C; Wieler, Lothar H

    2005-06-01

    Based on recently published prevalence data of virulence-associated factors in avian pathogenic Escherichia coli (APEC) and their roles in the pathogenesis of colibacillosis, we developed a multiplex polymerase chain reaction (PCR) as a molecular tool supplementing current diagnostic schemes that mainly rely on serological examination of strains isolated from diseased birds. Multiple isolates of E. coli from clinical cases of colibacillosis known to possess different combinations of eight genes were used as sources of template DNA to develop the multiplex PCR protocol, targeting genes for P-fimbriae (papC), aerobactin (iucD), iron-repressible protein (irp2), temperature-sensitive hemagglutinin (tsh), vacuolating autotransporter toxin (vat), enteroaggregative toxin (astA), increased serum survival protein (iss), and colicin V plasmid operon genes (cva/cvi). In order to verify the usefulness of this diagnostic tool, E. coli strains isolated from fecal samples of clinically healthy chickens were also included in this study, as were uropathogenic (UPEC), necrotoxigenic, and diarrhegenic E. coli strains. The application of the multiplex PCR protocol to 14 E. coli strains isolated from septicemic poultry showed that these strains harbored four to eight of the genes mentioned above. In contrast, those isolates that have been shown to be nonpathogenic for 5-wk-old chickens possessed either none or, at most, three of these genes. We found only one enterohemorrhagic (EHEC), one enteropathogenic (EPEC), and two enterotoxic (ETEC) E. coli strains positive for irp2, and another two ETEC strains positive for astA. As expected, UPEC isolates yielded different combinations of the genes iss, papC, iucD, irp2, and a sequence similar to vat. However, neither the colicin V operon genes cva/cvi nor tsh were amplified in UPEC isolates. The multiplex PCR results were compared with those obtained by DNA-DNA-hybridization analyses to validate the specificity of oligonucleotide primers, and

  20. MetaQuery: a web server for rapid annotation and quantitative analysis of specific genes in the human gut microbiome

    PubMed Central

    Nayfach, Stephen; Fischbach, Michael A.; Pollard, Katherine S.

    2015-01-01

    Summary: Microbiome researchers frequently want to know how abundant a particular microbial gene or pathway is across different human hosts, including its association with disease and its co-occurrence with other genes or microbial taxa. With thousands of publicly available metagenomes, these questions should be easy to answer. However, computational barriers prevent most researchers from conducting such analyses. We address this problem with MetaQuery, a web application for rapid and quantitative analysis of specific genes in the human gut microbiome. The user inputs one or more query genes, and our software returns the estimated abundance of these genes across 1267 publicly available fecal metagenomes from American, European and Chinese individuals. In addition, our application performs downstream statistical analyses to identify features that are associated with gene variation, including other query genes (i.e. gene co-variation), taxa, clinical variables (e.g. inflammatory bowel disease and diabetes) and average genome size. The speed and accessibility of MetaQuery are a step toward democratizing metagenomics research, which should allow many researchers to query the abundance and variation of specific genes in the human gut microbiome. Availability and implementation: http://metaquery.docpollard.org. Contact: snayfach@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26104745

  1. Rapidly Evolving Genes Are Key Players in Host Specialization and Virulence of the Fungal Wheat Pathogen Zymoseptoria tritici (Mycosphaerella graminicola)

    PubMed Central

    Poppe, Stephan; Dorsheimer, Lena; Happel, Petra; Stukenbrock, Eva Holtgrewe

    2015-01-01

    The speciation of pathogens can be driven by divergent host specialization. Specialization to a new host is possible via the acquisition of advantageous mutations fixed by positive selection. Comparative genome analyses of closely related species allows for the identification of such key substitutions via inference of genome-wide signatures of positive selection. We previously used a comparative genomics framework to identify genes that have evolved under positive selection during speciation of the prominent wheat pathogen Zymoseptoria tritici (synonym Mycosphaerella graminicola). In this study, we conducted functional analyses of four genes exhibiting strong signatures of positive selection in Z. tritici. We deleted the four genes in Z. tritici and confirm a virulence-related role of three of the four genes ΔZt80707, ΔZt89160 and ΔZt103264. The two mutants ΔZt80707 and ΔZt103264 show a significant reduction in virulence during infection of wheat; the ΔZt89160 mutant causes a hypervirulent phenotype in wheat. Mutant phenotypes of ΔZt80707, ΔZt89160 and ΔZt103264 can be restored by insertion of the wild-type genes. However, the insertion of the Zt80707 and Zt89160 orthologs from Z. pseudotritici and Z. ardabiliae do not restore wild-type levels of virulence, suggesting that positively selected substitutions in Z. tritici may relate to divergent host specialization. Interestingly, the gene Zt80707 encodes also a secretion signal that targets the protein for cell secretion. This secretion signal is however only transcribed in Z. tritici, suggesting that Z. tritici-specific substitutions relate to a new function of the protein in the extracellular space of the wheat-Z. tritici interaction. Together, the results presented here highlight that Zt80707, Zt103264 and Zt89160 represent key genes involved in virulence and host-specific disease development of Z. tritici. Our findings illustrate that evolutionary predictions provide a powerful tool for the

  2. Construction of customized sub-databases from NCBI-nr database for rapid annotation of huge metagenomic datasets using a combined BLAST and MEGAN approach.

    PubMed

    Yu, Ke; Zhang, Tong

    2013-01-01

    We developed a fast method to construct local sub-databases from the NCBI-nr database for the quick similarity search and annotation of huge metagenomic datasets based on BLAST-MEGAN approach. A three-step sub-database annotation pipeline (SAP) was further proposed to conduct the annotation in a much more time-efficient way which required far less computational capacity than the direct NCBI-nr database BLAST-MEGAN approach. The 1(st) BLAST of SAP was conducted using the original metagenomic dataset against the constructed sub-database for a quick screening of candidate target sequences. Then, the candidate target sequences identified in the 1(st) BLAST were subjected to the 2(nd) BLAST against the whole NCBI-nr database. The BLAST results were finally annotated using MEGAN to filter out those mistakenly selected sequences in the 1(st) BLAST to guarantee the accuracy of the results. Based on the tests conducted in this study, SAP achieved a speedup of ~150-385 times at the BLAST e-value of 1e-5, compared to the direct BLAST against NCBI-nr database. The annotation results of SAP are exactly in agreement with those of the direct NCBI-nr database BLAST-MEGAN approach, which is very time-consuming and computationally intensive. Selecting rigorous thresholds (e.g. e-value of 1e-10) would further accelerate SAP process. The SAP pipeline may also be coupled with novel similarity search tools (e.g. RAPsearch) other than BLAST to achieve even faster annotation of huge metagenomic datasets. Above all, this sub-database construction method and SAP pipeline provides a new time-efficient and convenient annotation similarity search strategy for laboratories without access to high performance computing facilities. SAP also offers a solution to high performance computing facilities for the processing of more similarity search tasks. PMID:23573212

  3. An Intradermal Inoculation Model of Scrub Typhus in Swiss CD-1 Mice Demonstrates More Rapid Dissemination of Virulent Strains of Orientia tsutsugamushi

    PubMed Central

    Sunyakumthorn, Piyanate; Paris, Daniel H.; Chan, Teik-Chye; Jones, Margaret; Luce-Fedrow, Alison; Chattopadhyay, Suchismita; Jiang, Ju; Anantatat, Tippawan; Turner, Gareth D. H.; Day, Nicholas P. J.; Richards, Allen L.

    2013-01-01

    Scrub typhus is an important endemic disease of the Asia-Pacific region caused by Orientia tsutsugamushi. To develop an effective vaccine to prevent scrub typhus infection, a better understanding of the initial host-pathogen interaction is needed. The objective of this study was to investigate early bacterial dissemination in a CD-1 Swiss outbred mouse model after intradermal injection of O. tsutsugamushi. Three human pathogenic strains of O. tsutsugamushi (Karp, Gilliam, and Woods) were chosen to investigate the early infection characteristics associated with bacterial virulence. Tissue biopsies of the intradermal injection site and draining lymph nodes were examined using histology and immunohistochemistry to characterize bacterial dissemination, and correlated with quantitative real-time PCR for O. tsutsugamushi in blood and tissue from major organs. Soluble adhesion molecules were measured to examine cellular activation in response to infection. No eschar formation was seen at the inoculation site and no clinical disease developed within the 7 day period of observation. However, O. tsutsugamushi was localized at the injection site and in the draining lymph nodes by day 7 post inoculation. Evidence of leukocyte and endothelial activation was present by day 7 with significantly raised levels of sL-selectin, sICAM-1 and sVCAM-1. Infection with the Karp strain was associated with earlier and higher bacterial loads and more extensive dissemination in various tissues than the less pathogenic Gilliam and Woods strains. The bacterial loads of O. tsutsugamushi were highest in the lungs and spleens of mice inoculated with Karp and Gilliam, but not Woods strains. Strains of higher virulence resulted in more rapid systemic infection and dissemination in this model. The CD-1 mouse intradermal inoculation model demonstrates features relevant to early scrub typhus infection in humans, including the development of regional lymphadenopathy, leukocyte activation and distant

  4. Ranking biomedical annotations with annotator's semantic relevancy.

    PubMed

    Wu, Aihua

    2014-01-01

    Biomedical annotation is a common and affective artifact for researchers to discuss, show opinion, and share discoveries. It becomes increasing popular in many online research communities, and implies much useful information. Ranking biomedical annotations is a critical problem for data user to efficiently get information. As the annotator's knowledge about the annotated entity normally determines quality of the annotations, we evaluate the knowledge, that is, semantic relationship between them, in two ways. The first is extracting relational information from credible websites by mining association rules between an annotator and a biomedical entity. The second way is frequent pattern mining from historical annotations, which reveals common features of biomedical entities that an annotator can annotate with high quality. We propose a weighted and concept-extended RDF model to represent an annotator, a biomedical entity, and their background attributes and merge information from the two ways as the context of an annotator. Based on that, we present a method to rank the annotations by evaluating their correctness according to user's vote and the semantic relevancy between the annotator and the annotated entity. The experimental results show that the approach is applicable and efficient even when data set is large. PMID:24899918

  5. Ranking Biomedical Annotations with Annotator's Semantic Relevancy

    PubMed Central

    2014-01-01

    Biomedical annotation is a common and affective artifact for researchers to discuss, show opinion, and share discoveries. It becomes increasing popular in many online research communities, and implies much useful information. Ranking biomedical annotations is a critical problem for data user to efficiently get information. As the annotator's knowledge about the annotated entity normally determines quality of the annotations, we evaluate the knowledge, that is, semantic relationship between them, in two ways. The first is extracting relational information from credible websites by mining association rules between an annotator and a biomedical entity. The second way is frequent pattern mining from historical annotations, which reveals common features of biomedical entities that an annotator can annotate with high quality. We propose a weighted and concept-extended RDF model to represent an annotator, a biomedical entity, and their background attributes and merge information from the two ways as the context of an annotator. Based on that, we present a method to rank the annotations by evaluating their correctness according to user's vote and the semantic relevancy between the annotator and the annotated entity. The experimental results show that the approach is applicable and efficient even when data set is large. PMID:24899918

  6. Computational algorithms to predict Gene Ontology annotations

    PubMed Central

    2015-01-01

    Background Gene function annotations, which are associations between a gene and a term of a controlled vocabulary describing gene functional features, are of paramount importance in modern biology. Datasets of these annotations, such as the ones provided by the Gene Ontology Consortium, are used to design novel biological experiments and interpret their results. Despite their importance, these sources of information have some known issues. They are incomplete, since biological knowledge is far from being definitive and it rapidly evolves, and some erroneous annotations may be present. Since the curation process of novel annotations is a costly procedure, both in economical and time terms, computational tools that can reliably predict likely annotations, and thus quicken the discovery of new gene annotations, are very useful. Methods We used a set of computational algorithms and weighting schemes to infer novel gene annotations from a set of known ones. We used the latent semantic analysis approach, implementing two popular algorithms (Latent Semantic Indexing and Probabilistic Latent Semantic Analysis) and propose a novel method, the Semantic IMproved Latent Semantic Analysis, which adds a clustering step on the set of considered genes. Furthermore, we propose the improvement of these algorithms by weighting the annotations in the input set. Results We tested our methods and their weighted variants on the Gene Ontology annotation sets of three model organism genes (Bos taurus, Danio rerio and Drosophila melanogaster ). The methods showed their ability in predicting novel gene annotations and the weighting procedures demonstrated to lead to a valuable improvement, although the obtained results vary according to the dimension of the input annotation set and the considered algorithm. Conclusions Out of the three considered methods, the Semantic IMproved Latent Semantic Analysis is the one that provides better results. In particular, when coupled with a proper

  7. Bacillus anthracis-Like Bacteria and Other B. cereus Group Members in a Microbial Community Within the International Space Station: A Challenge for Rapid and Easy Molecular Detection of Virulent B. anthracis

    PubMed Central

    van Tongeren, Sandra P.; Roest, Hendrik I. J.; Degener, John E.; Harmsen, Hermie J. M.

    2014-01-01

    For some microbial species, such as Bacillus anthracis, the etiologic agent of the disease anthrax, correct detection and identification by molecular methods can be problematic. The detection of virulent B. anthracis is challenging due to multiple virulence markers that need to be present in order for B. anthracis to be virulent and its close relationship to Bacillus cereus and other members of the B. cereus group. This is especially the case in environments where build-up of Bacillus spores can occur and several representatives of the B. cereus group may be present, which increases the chance for false-positives. In this study we show the presence of B. anthracis-like bacteria and other members of the B. cereus group in a microbial community within the human environment of the International Space Station and their preliminary identification by using conventional culturing as well as molecular techniques including 16S rDNA sequencing, PCR and real-time PCR. Our study shows that when monitoring the microbial hygiene in a given human environment, health risk assessment is troublesome in the case of virulent B. anthracis, especially if this should be done with rapid, easy to apply and on-site molecular methods. PMID:24945323

  8. Bacillus anthracis-like bacteria and other B. cereus group members in a microbial community within the International Space Station: a challenge for rapid and easy molecular detection of virulent B. anthracis.

    PubMed

    van Tongeren, Sandra P; Roest, Hendrik I J; Degener, John E; Harmsen, Hermie J M

    2014-01-01

    For some microbial species, such as Bacillus anthracis, the etiologic agent of the disease anthrax, correct detection and identification by molecular methods can be problematic. The detection of virulent B. anthracis is challenging due to multiple virulence markers that need to be present in order for B. anthracis to be virulent and its close relationship to Bacillus cereus and other members of the B. cereus group. This is especially the case in environments where build-up of Bacillus spores can occur and several representatives of the B. cereus group may be present, which increases the chance for false-positives. In this study we show the presence of B. anthracis-like bacteria and other members of the B. cereus group in a microbial community within the human environment of the International Space Station and their preliminary identification by using conventional culturing as well as molecular techniques including 16S rDNA sequencing, PCR and real-time PCR. Our study shows that when monitoring the microbial hygiene in a given human environment, health risk assessment is troublesome in the case of virulent B. anthracis, especially if this should be done with rapid, easy to apply and on-site molecular methods.

  9. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    PubMed Central

    Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Thomason, James A.; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang

    2015-01-01

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception. PMID:25666585

  10. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    SciTech Connect

    Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Thomason, III, James A.; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  11. Galileo Reader and Annotator

    NASA Astrophysics Data System (ADS)

    Besomi, O.

    2011-06-01

    In his readings, Galileo made frequent use of annotations. Here, I will offer a general glance at them by discussing the case of the annotations to the Libra astronomica published in 1619 by Orazio Grassi, a Jesuit mathematician of the Collegio Romano. The annotations directly reflect Galileo's reaction to Grassi's book in a heated debate between the two astronomers. Galileo and Grassi had opposite ideas about the nature of the comets, which resulted in different scientific and theological implications. The annotations represent the starting point for Galileo's reply to the Libra, namely Il Saggiatore, which was published four years later and dedicated to the new pope Urban VIII.

  12. Dictionary-driven protein annotation.

    PubMed

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-09-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  13. Annotation extension through protein family annotation coherence metrics

    PubMed Central

    Bastos, Hugo P.; Clarke, Luka A.; Couto, Francisco M.

    2013-01-01

    Protein functional annotation consists in associating proteins with textual descriptors elucidating their biological roles. The bulk of annotation is done via automated procedures that ultimately rely on annotation transfer. Despite a large number of existing protein annotation procedures the ever growing protein space is never completely annotated. One of the facets of annotation incompleteness derives from annotation uncertainty. Often when protein function cannot be predicted with enough specificity it is instead conservatively annotated with more generic terms. In a scenario of protein families or functionally related (or even dissimilar) sets this leads to a more difficult task of using annotations to compare the extent of functional relatedness among all family or set members. However, we postulate that identifying sub-sets of functionally coherent proteins annotated at a very specific level, can help the annotation extension of other incompletely annotated proteins within the same family or functionally related set. As an example we analyse the status of annotation of a set of CAZy families belonging to the Polysaccharide Lyase class. We show that through the use of visualization methods and semantic similarity based metrics it is possible to identify families and respective annotation terms within them that are suitable for possible annotation extension. Based on our analysis we then propose a semi-automatic methodology leading to the extension of single annotation terms within these partially annotated protein sets or families. PMID:24130572

  14. K-Nearest Neighbors Relevance Annotation Model for Distance Education

    ERIC Educational Resources Information Center

    Ke, Xiao; Li, Shaozi; Cao, Donglin

    2011-01-01

    With the rapid development of Internet technologies, distance education has become a popular educational mode. In this paper, the authors propose an online image automatic annotation distance education system, which could effectively help children learn interrelations between image content and corresponding keywords. Image automatic annotation is…

  15. DAS Writeback: A Collaborative Annotation System

    PubMed Central

    2011-01-01

    Background Centralised resources such as GenBank and UniProt are perfect examples of the major international efforts that have been made to integrate and share biological information. However, additional data that adds value to these resources needs a simple and rapid route to public access. The Distributed Annotation System (DAS) provides an adequate environment to integrate genomic and proteomic information from multiple sources, making this information accessible to the community. DAS offers a way to distribute and access information but it does not provide domain experts with the mechanisms to participate in the curation process of the available biological entities and their annotations. Results We designed and developed a Collaborative Annotation System for proteins called DAS Writeback. DAS writeback is a protocol extension of DAS to provide the functionalities of adding, editing and deleting annotations. We implemented this new specification as extensions of both a DAS server and a DAS client. The architecture was designed with the involvement of the DAS community and it was improved after performing usability experiments emulating a real annotation task. Conclusions We demonstrate that DAS Writeback is effective, usable and will provide the appropriate environment for the creation and evolution of community protein annotation. PMID:21569281

  16. An Introduction to Genome Annotation.

    PubMed

    Campbell, Michael S; Yandell, Mark

    2015-12-17

    Genome projects have evolved from large international undertakings to tractable endeavors for a single lab. Accurate genome annotation is critical for successful genomic, genetic, and molecular biology experiments. These annotations can be generated using a number of approaches and available software tools. This unit describes methods for genome annotation and a number of software tools commonly used in gene annotation.

  17. Interference competition and parasite virulence.

    PubMed Central

    Massey, Ruth C.; Buckling, Angus; ffrench-Constant, Richard

    2004-01-01

    Within-host competition between parasites, a consequence of infection by multiple strains, is predicted to favour rapid host exploitation and greater damage to hosts (virulence). However, the inclusion of biological variables can drastically change this relationship. For example, if competing parasite strains produce toxins that kill each other (interference competition), their growth rates and virulence may be reduced relative to single-strain infections. Bacteriocins are antimicrobial toxins produced by bacteria that target closely related strains and species, and to which the producing strain is immune. We investigated competition between bacteriocin-producing, insect-killing bacteria (Photorhabdus and Xenorhabdus) and how this competition affected virulence in caterpillars. Where one strain could kill the other, and not vice versa, the non-killing strain was competitively excluded, and insect mortality was the same as that of the killing strain alone. However, when caterpillars were multiply infected by strains that could kill each other, we did not observe competitive exclusion and their virulence was less than single-strain infections. The ubiquity and diversity of bacteriocins among pathogenic bacteria suggest mixed infections will be, on average, less virulent than single infections. PMID:15255095

  18. Virulence Attributes of Low-Virulence Organisms

    PubMed Central

    1994-01-01

    The vast majority of infections involving female pelvic structures arise from organisms that are members of the normal flora. In addition, exogenous organisms that invade through the lower genital tract must interact with organisms that are part of the host's flora. In contrast to the concept that the normal flora is entirely innocuous, recent research has begun to identify what appear to be virulence attributes among these ordinarily low-virulence organisms. Most of our understanding of virulence has been derived from highly virulent organisms, of which Neisseria gonorrhoeae provides an example of relevance to the female genital tract. A review of the virulence factors of the gonococcus is presented to serve as an example of the variety of virulence properties associated with pathogenic bacteria. Molecular biology has begun to clarify one of the important paradigms of pathogenic bacteriology—that bacteria change their expression of virulence properties in response to their location within a host or to the stage of infection. Thus, infection involves not only the possession of virulence factors, but also the carefully controlled use of those factors. Virulence is often controlled by the coordinate expression of many virulence-associated genes in response to one environmental signal. With regard to low- virulence organisms present in the female lower genital tract, we are beginning to identify some of their virulence attributes. Examples from the work of our laboratory include the hemolysin of Gardnerella vaginalis and an immunosuppressive mycotoxin produced by Candida albicans. Demonstrating the coordinate expression (or other control mechanisms) of virulence factors in these sometimes innocuous and sometimes inimical organisms represents the next frontier in the study of normal vaginal microbiology. PMID:18475373

  19. Annotation and visualization of endogenous retroviral sequences using the Distributed Annotation System (DAS) and eBioX

    PubMed Central

    Martínez Barrio, Álvaro; Lagercrantz, Erik; Sperber, Göran O; Blomberg, Jonas; Bongcam-Rudloff, Erik

    2009-01-01

    Background The Distributed Annotation System (DAS) is a widely used network protocol for sharing biological information. The distributed aspects of the protocol enable the use of various reference and annotation servers for connecting biological sequence data to pertinent annotations in order to depict an integrated view of the data for the final user. Results An annotation server has been devised to provide information about the endogenous retroviruses detected and annotated by a specialized in silico tool called RetroTector. We describe the procedure to implement the DAS 1.5 protocol commands necessary for constructing the DAS annotation server. We use our server to exemplify those steps. Data distribution is kept separated from visualization which is carried out by eBioX, an easy to use open source program incorporating multiple bioinformatics utilities. Some well characterized endogenous retroviruses are shown in two different DAS clients. A rapid analysis of areas free from retroviral insertions could be facilitated by our annotations. Conclusion The DAS protocol has shown to be advantageous in the distribution of endogenous retrovirus data. The distributed nature of the protocol is also found to aid in combining annotation and visualization along a genome in order to enhance the understanding of ERV contribution to its evolution. Reference and annotation servers are conjointly used by eBioX to provide visualization of ERV annotations as well as other data sources. Our DAS data source can be found in the central public DAS service repository, , or at . PMID:19534743

  20. Protein Sequence Annotation Tool (PSAT): A centralized web-based meta-server for high-throughput sequence annotations

    DOE PAGESBeta

    Leung, Elo; Huang, Amy; Cadag, Eithon; Montana, Aldrin; Soliman, Jan Lorenz; Zhou, Carol L. Ecale

    2016-01-20

    In this study, we introduce the Protein Sequence Annotation Tool (PSAT), a web-based, sequence annotation meta-server for performing integrated, high-throughput, genome-wide sequence analyses. Our goals in building PSAT were to (1) create an extensible platform for integration of multiple sequence-based bioinformatics tools, (2) enable functional annotations and enzyme predictions over large input protein fasta data sets, and (3) provide a web interface for convenient execution of the tools. In this paper, we demonstrate the utility of PSAT by annotating the predicted peptide gene products of Herbaspirillum sp. strain RV1423, importing the results of PSAT into EC2KEGG, and using the resultingmore » functional comparisons to identify a putative catabolic pathway, thereby distinguishing RV1423 from a well annotated Herbaspirillum species. This analysis demonstrates that high-throughput enzyme predictions, provided by PSAT processing, can be used to identify metabolic potential in an otherwise poorly annotated genome. Lastly, PSAT is a meta server that combines the results from several sequence-based annotation and function prediction codes, and is available at http://psat.llnl.gov/psat/. PSAT stands apart from other sequencebased genome annotation systems in providing a high-throughput platform for rapid de novo enzyme predictions and sequence annotations over large input protein sequence data sets in FASTA. PSAT is most appropriately applied in annotation of large protein FASTA sets that may or may not be associated with a single genome.« less

  1. Draft Genome Sequence of Brucella abortus Virulent Strain 544

    PubMed Central

    Singh, D. K.; Kumar, Ashok; Tiwari, A. K.; Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S.; Sridhar, Jayavel; Gunasekaran, Paramasamy

    2015-01-01

    Here, we present the draft genome sequence and annotation of Brucella abortus virulent strain 544. The genome of this strain is 3,289,405 bp long, with 57.2% G+C content. A total of 3,259 protein-coding genes and 60 RNA genes were predicted. PMID:25953161

  2. Draft Genome Sequence of Brucella abortus Virulent Strain 544.

    PubMed

    Singh, D K; Kumar, Ashok; Tiwari, A K; Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

    2015-05-07

    Here, we present the draft genome sequence and annotation of Brucella abortus virulent strain 544. The genome of this strain is 3,289,405 bp long, with 57.2% G+C content. A total of 3,259 protein-coding genes and 60 RNA genes were predicted.

  3. Rapid multiplex PCR and Real-Time TaqMan PCR assays for detection of Salmonella enterica and the highly virulent serovars Choleraesuis and Paratyphi C

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Salmonella enterica is a human pathogen with over 2,500 serovars characterized. S. enterica serovars Choleraesuis (Cs) and Paratyphi C (Pc) are two globally distributed serovars. We have developed a rapid molecular typing method to detect Cs and Pc in food samples by using a comparative genomics ap...

  4. O-antigen and virulence profiling of Shiga toxin-producing Escherichia coli by a rapid and cost-effective DNA microarray colorimetric method

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Shiga toxin-producing Escherichia coli (STEC) is a leading cause of foodborne illness worldwide. To evaluate better methods to rapidly detect and genotype Shiga toxin-producing Escherichia coli strains, the present study evaluated the use of the ampliPHOX colorimetric detection technology, based on ...

  5. MannDB: A microbial annotation database for protein characterization

    SciTech Connect

    Zhou, C; Lam, M; Smith, J; Zemla, A; Dyer, M; Kuczmarski, T; Vitalis, E; Slezak, T

    2006-05-19

    MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high

  6. Algal functional annotation tool

    SciTech Connect

    2012-07-12

    Abstract BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG

  7. Algal functional annotation tool

    2012-07-12

    Abstract BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations tomore » interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on

  8. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  9. Algal functional annotation tool

    SciTech Connect

    Lopez, D.; Casero, D.; Cokus, S. J.; Merchant, S. S.; Pellegrini, M.

    2012-07-01

    The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps and batch gene identifier conversion.

  10. Re-Annotator: Annotation Pipeline for Microarray Probe Sequences.

    PubMed

    Arloth, Janine; Bader, Daniel M; Röh, Simone; Altmann, Andre

    2015-01-01

    Microarray technologies are established approaches for high throughput gene expression, methylation and genotyping analysis. An accurate mapping of the array probes is essential to generate reliable biological findings. However, manufacturers of the microarray platforms typically provide incomplete and outdated annotation tables, which often rely on older genome and transcriptome versions that differ substantially from up-to-date sequence databases. Here, we present the Re-Annotator, a re-annotation pipeline for microarray probe sequences. It is primarily designed for gene expression microarrays but can also be adapted to other types of microarrays. The Re-Annotator uses a custom-built mRNA reference database to identify the positions of gene expression array probe sequences. We applied Re-Annotator to the Illumina Human-HT12 v4 microarray platform and found that about one quarter (25%) of the probes differed from the manufacturer's annotation. In further computational experiments on experimental gene expression data, we compared Re-Annotator to another probe re-annotation tool, ReMOAT, and found that Re-Annotator provided an improved re-annotation of microarray probes. A thorough re-annotation of probe information is crucial to any microarray analysis. The Re-Annotator pipeline is freely available at http://sourceforge.net/projects/reannotator along with re-annotated files for Illumina microarrays HumanHT-12 v3/v4 and MouseRef-8 v2.

  11. Correction of the Caulobacter crescentus NA1000 genome annotation.

    PubMed

    Ely, Bert; Scott, LaTia Etheredge

    2014-01-01

    Bacterial genome annotations are accumulating rapidly in the GenBank database and the use of automated annotation technologies to create these annotations has become the norm. However, these automated methods commonly result in a small, but significant percentage of genome annotation errors. To improve accuracy and reliability, we analyzed the Caulobacter crescentus NA1000 genome utilizing computer programs Artemis and MICheck to manually examine the third codon position GC content, alignment to a third codon position GC frame plot peak, and matches in the GenBank database. We identified 11 new genes, modified the start site of 113 genes, and changed the reading frame of 38 genes that had been incorrectly annotated. Furthermore, our manual method of identifying protein-coding genes allowed us to remove 112 non-coding regions that had been designated as coding regions. The improved NA1000 genome annotation resulted in a reduction in the use of rare codons since noncoding regions with atypical codon usage were removed from the annotation and 49 new coding regions were added to the annotation. Thus, a more accurate codon usage table was generated as well. These results demonstrate that a comparison of the location of peaks third codon position GC content to the location of protein coding regions could be used to verify the annotation of any genome that has a GC content that is greater than 60%.

  12. The GATO gene annotation tool for research laboratories.

    PubMed

    Fujita, A; Massirer, K B; Durham, A M; Ferreira, C E; Sogayar, M C

    2005-11-01

    Large-scale genome projects have generated a rapidly increasing number of DNA sequences. Therefore, development of computational methods to rapidly analyze these sequences is essential for progress in genomic research. Here we present an automatic annotation system for preliminary analysis of DNA sequences. The gene annotation tool (GATO) is a Bioinformatics pipeline designed to facilitate routine functional annotation and easy access to annotated genes. It was designed in view of the frequent need of genomic researchers to access data pertaining to a common set of genes. In the GATO system, annotation is generated by querying some of the Web-accessible resources and the information is stored in a local database, which keeps a record of all previous annotation results. GATO may be accessed from everywhere through the internet or may be run locally if a large number of sequences are going to be annotated. It is implemented in PHP and Perl and may be run on any suitable Web server. Usually, installation and application of annotation systems require experience and are time consuming, but GATO is simple and practical, allowing anyone with basic skills in informatics to access it without any special training. GATO can be downloaded at [http://mariwork.iq.usp.br/gato/]. Minimum computer free space required is 2 MB. PMID:16258624

  13. Rapid identification of Salmonella serovars in feces by specific detection of virulence genes, invA and spvC, by an enrichment broth culture-multiplex PCR combination assay.

    PubMed

    Chiu, C H; Ou, J T

    1996-10-01

    In order to make a rapid and definite diagnosis of Salmonella enteritis in children, an enrichment broth culture-multiplex PCR combination assay was devised to identify Salmonella serovars directly from fecal samples. Two pairs of oligonucleotide primers were prepared according to the sequences of the chromosomal invA and plasmid spvC genes. PCR with these two primers would produce either one amplicon (from the invA gene) or two amplicons (from the invA and spvC genes), depending on whether or not the Salmonella bacteria contained a virulence plasmid. The fecal sample was diluted 10- to 20-fold into gram-negative enrichment broth and incubated to eliminate inhibitory compounds and also to allow selective enrichment of the bacteria. One or two amplicons were obtained, the expected result if Salmonella bacteria were present. The detection limit of this PCR was about 200 bacteria per reaction mixture. The primers were specific, as no amplification products were obtained with 18 species and 22 isolates of non-Salmonella bacteria tested which could be present in the feces or cause contamination. In contrast, when 23 commonly seen Salmonella serovars (38 isolates) were tested, all were shown to carry the invA gene and seven concomitantly harbored the spvC gene of the virulence plasmid. This assay was applied to the diagnosis of Salmonella enteritis in 57 children who were suffering from mucoid and/or bloody diarrhea. Of the 57 children, 38 were PCR positive and 22 were culture positive. There were two culture-positive samples that were not detected by PCR. Thus, this PCR assay showed an efficiency of 95% (38 of 40), which is much higher than the 60% (24 of 40) by culture alone. Not only is this method more sensitive, rapid, and efficient but it will cause only an incremental increase in the cost of stool processing, since enrichment cultivation of fecal samples from diarrheal patients using gram-negative enrichment broth is a routine practice for identification in many

  14. Toward an improved laboratory definition of Listeria monocytogenes virulence.

    PubMed

    Liu, Dongyou; Lawrence, Mark L; Ainsworth, A Jerald; Austin, Frank W

    2007-09-15

    Listeria monocytogenes is an opportunistic foodborne pathogen that encompasses a diversity of strains with varied virulence. The ability to rapidly determine the pathogenic potential of L. monocytogenes strains is integral to the control and prevention campaign against listeriosis. Early methods for assessing L. monocytogenes virulence include in vivo bioassays and in vitro cell assays. While in vivo bioassays provide a measurement of all virulence determinants of L. monocytogenes, they are not applied routinely due to their reliance on experimental animals whose costs have become increasingly prohibitive. As a low cost alternative, in vitro cell assays are useful for estimating the virulence of L. monocytogenes strains. However, these assays are often slow, and at times variable. Prior attempts to ascertain L. monocytogenes virulence by targeting virulence-associated proteins and genes have been largely unsuccessful, since many of the assay targets are present in both virulent and avirulent strains. Recent identification of novel virulence-specific genes (particularly internalin gene inlJ) has opened a new avenue for rapid, sensitive, and precise differentiation of virulent L. monocytogenes strains from avirulent strains. The application of DNA sequencing technique also offers an additional tool for assessing L. monocytogenes virulence potential. By providing an update on the laboratory methods that have been reported for the determination of L. monocytogenes pathogenicity, this review discusses future research needs that may help achieve an improved laboratory definition of L. monocytogenes virulence.

  15. Annotation of Ehux ESTs

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2009-06-12

    22 percent ESTs do no align with scaffolds. EST Pipeleine assembles 17126 consensi from the noaligned ESTs. Annotation Pipeline predicts 8564 ORFS on the consensi. Domain analysis of ORFs reveals missing genes. Cluster analysis reveals missing genes. Expression analysis reveals potential strain specific genes.

  16. Annotation: The Savant Syndrome

    ERIC Educational Resources Information Center

    Heaton, Pamela; Wallace, Gregory L.

    2004-01-01

    Background: Whilst interest has focused on the origin and nature of the savant syndrome for over a century, it is only within the past two decades that empirical group studies have been carried out. Methods: The following annotation briefly reviews relevant research and also attempts to address outstanding issues in this research area.…

  17. Intellectuals in China: Annotations.

    ERIC Educational Resources Information Center

    Parker, Franklin

    This annotated bibliography of 72 books, journal articles, government reports, and newspaper feature stories focuses on the changing role of intellectuals in China, primarily since the 1949 Chinese Revolution. Particular attention is given to the Hundred Flowers Movement of 1957 and the Cultural Revolution. Most of the cited works are in English,…

  18. Collaborative Movie Annotation

    NASA Astrophysics Data System (ADS)

    Zad, Damon Daylamani; Agius, Harry

    In this paper, we focus on metadata for self-created movies like those found on YouTube and Google Video, the duration of which are increasing in line with falling upload restrictions. While simple tags may have been sufficient for most purposes for traditionally very short video footage that contains a relatively small amount of semantic content, this is not the case for movies of longer duration which embody more intricate semantics. Creating metadata is a time-consuming process that takes a great deal of individual effort; however, this effort can be greatly reduced by harnessing the power of Web 2.0 communities to create, update and maintain it. Consequently, we consider the annotation of movies within Web 2.0 environments, such that users create and share that metadata collaboratively and propose an architecture for collaborative movie annotation. This architecture arises from the results of an empirical experiment where metadata creation tools, YouTube and an MPEG-7 modelling tool, were used by users to create movie metadata. The next section discusses related work in the areas of collaborative retrieval and tagging. Then, we describe the experiments that were undertaken on a sample of 50 users. Next, the results are presented which provide some insight into how users interact with existing tools and systems for annotating movies. Based on these results, the paper then develops an architecture for collaborative movie annotation.

  19. Annotated Bibliography. First Edition.

    ERIC Educational Resources Information Center

    Haring, Norris G.

    An annotated bibliography which presents approximately 300 references from 1951 to 1973 on the education of severely/profoundly handicapped persons. Citations are grouped alphabetically by author's name within the following categories: characteristics and treatment, gross motor development, sensory and motor development, physical therapy for the…

  20. Ghostwriting: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Simmons, Donald B.

    Drawn from communication journals, historical and news magazines, business and industrial magazines, political science and world affairs journals, general interest periodicals, and literary and political review magazines, the approximately 90 entries in this annotated bibliography discuss ghostwriting as practiced through the ages and reveal the…

  1. Use of Alignment-Free Phylogenetics for Rapid Genome Sequence-Based Typing of Helicobacter pylori Virulence Markers and Antibiotic Susceptibility

    PubMed Central

    Kusters, Johannes G.

    2015-01-01

    Whole-genome sequencing is becoming a leading technology in the typing and epidemiology of microbial pathogens, but the increase in genomic information necessitates significant investment in bioinformatic resources and expertise, and currently used methodologies struggle with genetically heterogeneous bacteria such as the human gastric pathogen Helicobacter pylori. Here we demonstrate that the alignment-free analysis method feature frequency profiling (FFP) can be used to rapidly construct phylogenetic trees of draft bacterial genome sequences on a standard desktop computer and that coupling with in silico genotyping methods gives useful information for comparative and clinical genomic and molecular epidemiology applications. FFP-based phylogenetic trees of seven gastric Helicobacter species matched those obtained by analysis of 16S rRNA genes and ribosomal proteins, and FFP- and core genome single nucleotide polymorphism-based analysis of 63 H. pylori genomes again showed comparable phylogenetic clustering, consistent with genomotypes assigned by using multilocus sequence typing (MLST). Analysis of 377 H. pylori genomes highlighted the conservation of genomotypes and linkage with phylogeographic characteristics and predicted the presence of an incomplete or nonfunctional cag pathogenicity island in 18/276 genomes. In silico analysis of antibiotic susceptibility markers suggests that most H. pylori hspAmerind and hspEAsia isolates are predicted to carry the T2812C mutation potentially conferring low-level clarithromycin resistance, while levels of metronidazole resistance were similar in all multilocus sequence types. In conclusion, the use of FFP phylogenetic clustering and in silico genotyping allows determination of genome evolution and phylogeographic clustering and can contribute to clinical microbiology by genomotyping for outbreak management and the prediction of pathogenic potential and antibiotic susceptibility. PMID:26135867

  2. MitoFish and MitoAnnotator: A Mitochondrial Genome Database of Fish with an Accurate and Automatic Annotation Pipeline

    PubMed Central

    Iwasaki, Wataru; Fukunaga, Tsukasa; Isagozawa, Ryota; Yamada, Koichiro; Maeda, Yasunobu; Satoh, Takashi P.; Sado, Tetsuya; Mabuchi, Kohji; Takeshima, Hirohiko; Miya, Masaki; Nishida, Mutsumi

    2013-01-01

    Mitofish is a database of fish mitochondrial genomes (mitogenomes) that includes powerful and precise de novo annotations for mitogenome sequences. Fish occupy an important position in the evolution of vertebrates and the ecology of the hydrosphere, and mitogenomic sequence data have served as a rich source of information for resolving fish phylogenies and identifying new fish species. The importance of a mitogenomic database continues to grow at a rapid pace as massive amounts of mitogenomic data are generated with the advent of new sequencing technologies. A severe bottleneck seems likely to occur with regard to mitogenome annotation because of the overwhelming pace of data accumulation and the intrinsic difficulties in annotating sequences with degenerating transfer RNA structures, divergent start/stop codons of the coding elements, and the overlapping of adjacent elements. To ease this data backlog, we developed an annotation pipeline named MitoAnnotator. MitoAnnotator automatically annotates a fish mitogenome with a high degree of accuracy in approximately 5 min; thus, it is readily applicable to data sets of dozens of sequences. MitoFish also contains re-annotations of previously sequenced fish mitogenomes, enabling researchers to refer to them when they find annotations that are likely to be erroneous or while conducting comparative mitogenomic analyses. For users who need more information on the taxonomy, habitats, phenotypes, or life cycles of fish, MitoFish provides links to related databases. MitoFish and MitoAnnotator are freely available at http://mitofish.aori.u-tokyo.ac.jp/ (last accessed August 28, 2013); all of the data can be batch downloaded, and the annotation pipeline can be used via a web interface. PMID:23955518

  3. MitoFish and MitoAnnotator: a mitochondrial genome database of fish with an accurate and automatic annotation pipeline.

    PubMed

    Iwasaki, Wataru; Fukunaga, Tsukasa; Isagozawa, Ryota; Yamada, Koichiro; Maeda, Yasunobu; Satoh, Takashi P; Sado, Tetsuya; Mabuchi, Kohji; Takeshima, Hirohiko; Miya, Masaki; Nishida, Mutsumi

    2013-11-01

    Mitofish is a database of fish mitochondrial genomes (mitogenomes) that includes powerful and precise de novo annotations for mitogenome sequences. Fish occupy an important position in the evolution of vertebrates and the ecology of the hydrosphere, and mitogenomic sequence data have served as a rich source of information for resolving fish phylogenies and identifying new fish species. The importance of a mitogenomic database continues to grow at a rapid pace as massive amounts of mitogenomic data are generated with the advent of new sequencing technologies. A severe bottleneck seems likely to occur with regard to mitogenome annotation because of the overwhelming pace of data accumulation and the intrinsic difficulties in annotating sequences with degenerating transfer RNA structures, divergent start/stop codons of the coding elements, and the overlapping of adjacent elements. To ease this data backlog, we developed an annotation pipeline named MitoAnnotator. MitoAnnotator automatically annotates a fish mitogenome with a high degree of accuracy in approximately 5 min; thus, it is readily applicable to data sets of dozens of sequences. MitoFish also contains re-annotations of previously sequenced fish mitogenomes, enabling researchers to refer to them when they find annotations that are likely to be erroneous or while conducting comparative mitogenomic analyses. For users who need more information on the taxonomy, habitats, phenotypes, or life cycles of fish, MitoFish provides links to related databases. MitoFish and MitoAnnotator are freely available at http://mitofish.aori.u-tokyo.ac.jp/ (last accessed August 28, 2013); all of the data can be batch downloaded, and the annotation pipeline can be used via a web interface.

  4. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    DOE PAGESBeta

    Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; et al

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offersmore » a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.« less

  5. Functional Annotation Analytics of Rhodopseudomonas palustris Genomes

    PubMed Central

    Simmons, Shaneka S.; Isokpehi, Raphael D.; Brown, Shyretha D.; McAllister, Donee L.; Hall, Charnia C.; McDuffy, Wanaki M.; Medley, Tamara L.; Udensi, Udensi K.; Rajnarayanan, Rajendram V.; Ayensu, Wellington K.; Cohly, Hari H.P.

    2011-01-01

    Rhodopseudomonas palustris, a nonsulphur purple photosynthetic bacteria, has been extensively investigated for its metabolic versatility including ability to produce hydrogen gas from sunlight and biomass. The availability of the finished genome sequences of six R. palustris strains (BisA53, BisB18, BisB5, CGA009, HaA2 and TIE-1) combined with online bioinformatics software for integrated analysis presents new opportunities to determine the genomic basis of metabolic versatility and ecological lifestyles of the bacteria species. The purpose of this investigation was to compare the functional annotations available for multiple R. palustris genomes to identify annotations that can be further investigated for strain-specific or uniquely shared phenotypic characteristics. A total of 2,355 protein family Pfam domain annotations were clustered based on presence or absence in the six genomes. The clustering process identified groups of functional annotations including those that could be verified as strain-specific or uniquely shared phenotypes. For example, genes encoding water/glycerol transport were present in the genome sequences of strains CGA009 and BisB5, but absent in strains BisA53, BisB18, HaA2 and TIE-1. Protein structural homology modeling predicted that the two orthologous 240 aa R. palustris aquaporins have water-specific transport function. Based on observations in other microbes, the presence of aquaporin in R. palustris strains may improve freeze tolerance in natural conditions of rapid freezing such as nitrogen fixation at low temperatures where access to liquid water is a limiting factor for nitrogenase activation. In the case of adaptive loss of aquaporin genes, strains may be better adapted to survive in conditions of high-sugar content such as fermentation of biomass for biohydrogen production. Finally, web-based resources were developed to allow for interactive, user-defined selection of the relationship between protein family annotations and the R

  6. The Ensembl gene annotation system.

    PubMed

    Aken, Bronwen L; Ayling, Sarah; Barrell, Daniel; Clarke, Laura; Curwen, Valery; Fairley, Susan; Fernandez Banet, Julio; Billis, Konstantinos; García Girón, Carlos; Hourlier, Thibaut; Howe, Kevin; Kähäri, Andreas; Kokocinski, Felix; Martin, Fergal J; Murphy, Daniel N; Nag, Rishi; Ruffier, Magali; Schuster, Michael; Tang, Y Amy; Vogel, Jan-Hinnerk; White, Simon; Zadissa, Amonida; Flicek, Paul; Searle, Stephen M J

    2016-01-01

    The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets. The system is based on the alignment of biological sequences, including cDNAs, proteins and RNA-seq reads, to the target genome in order to construct candidate transcript models. Careful assessment and filtering of these candidate transcripts ultimately leads to the final gene set, which is made available on the Ensembl website. Here, we describe the annotation process in detail.Database URL: http://www.ensembl.org/index.html. PMID:27337980

  7. The Ensembl gene annotation system

    PubMed Central

    Aken, Bronwen L.; Ayling, Sarah; Barrell, Daniel; Clarke, Laura; Curwen, Valery; Fairley, Susan; Fernandez Banet, Julio; Billis, Konstantinos; García Girón, Carlos; Hourlier, Thibaut; Howe, Kevin; Kähäri, Andreas; Kokocinski, Felix; Martin, Fergal J.; Murphy, Daniel N.; Nag, Rishi; Ruffier, Magali; Schuster, Michael; Tang, Y. Amy; Vogel, Jan-Hinnerk; White, Simon; Zadissa, Amonida; Flicek, Paul

    2016-01-01

    The Ensembl gene annotation system has been used to annotate over 70 different vertebrate species across a wide range of genome projects. Furthermore, it generates the automatic alignment-based annotation for the human and mouse GENCODE gene sets. The system is based on the alignment of biological sequences, including cDNAs, proteins and RNA-seq reads, to the target genome in order to construct candidate transcript models. Careful assessment and filtering of these candidate transcripts ultimately leads to the final gene set, which is made available on the Ensembl website. Here, we describe the annotation process in detail. Database URL: http://www.ensembl.org/index.html PMID:27337980

  8. Structural and functional annotation of the porcine immunome

    PubMed Central

    2013-01-01

    evolution as compared to 4.1% across the entire genome. Conclusions This extensive annotation dramatically extends the genome-based knowledge of the molecular genetics and structure of a major portion of the porcine immunome. Our complementary functional approach using co-expression during immune response has provided new putative immune response annotation for over 500 porcine genes. Our phylogenetic analysis of this core immunome cluster confirms rapid evolutionary change in this set of genes, and that, as in other species, such genes are important components of the pig’s adaptation to pathogen challenge over evolutionary time. These comprehensive and integrated analyses increase the value of the porcine genome sequence and provide important tools for global analyses and data-mining of the porcine immune response. PMID:23676093

  9. Drug Education: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Mathieson, Moira B.

    This bibliography consists of a total of 215 entries dealing with drug education, including curriculum guides, and drawn from documents in the ERIC system. There are two sections, the first containing 130 annotated citations of documents and journal articles, and the second containing 85 citations of journal articles without annotations, but with…

  10. Adult Basic Education Annotated Bibliography.

    ERIC Educational Resources Information Center

    Carter, Nancy B.

    This annotated bibliography contains sections divided according to area of study, and within each category materials are listed alphabetically by publisher. Publishers and mailing addresses are listed at the end of the bibliography. Throughout the annotations, whenever specific grade level divisions are not named, the regular Adult Basic Education…

  11. Morphosyntactic Annotation of CHILDES Transcripts

    ERIC Educational Resources Information Center

    Sagae, Kenji; Davis, Eric; Lavie, Alon; MacWhinney, Brian; Wintner, Shuly

    2010-01-01

    Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database…

  12. Towards Automated Annotation of Benthic Survey Images: Variability of Human Experts and Operational Modes of Automation

    PubMed Central

    Beijbom, Oscar; Edmunds, Peter J.; Roelfsema, Chris; Smith, Jennifer; Kline, David I.; Neal, Benjamin P.; Dunlap, Matthew J.; Moriarty, Vincent; Fan, Tung-Yung; Tan, Chih-Jui; Chan, Stephen; Treibitz, Tali; Gamst, Anthony; Mitchell, B. Greg; Kriegman, David

    2015-01-01

    Global climate change and other anthropogenic stressors have heightened the need to rapidly characterize ecological changes in marine benthic communities across large scales. Digital photography enables rapid collection of survey images to meet this need, but the subsequent image annotation is typically a time consuming, manual task. We investigated the feasibility of using automated point-annotation to expedite cover estimation of the 17 dominant benthic categories from survey-images captured at four Pacific coral reefs. Inter- and intra- annotator variability among six human experts was quantified and compared to semi- and fully- automated annotation methods, which are made available at coralnet.ucsd.edu. Our results indicate high expert agreement for identification of coral genera, but lower agreement for algal functional groups, in particular between turf algae and crustose coralline algae. This indicates the need for unequivocal definitions of algal groups, careful training of multiple annotators, and enhanced imaging technology. Semi-automated annotation, where 50% of the annotation decisions were performed automatically, yielded cover estimate errors comparable to those of the human experts. Furthermore, fully-automated annotation yielded rapid, unbiased cover estimates but with increased variance. These results show that automated annotation can increase spatial coverage and decrease time and financial outlay for image-based reef surveys. PMID:26154157

  13. CART—a chemical annotation retrieval toolkit

    PubMed Central

    Deghou, Samy; Zeller, Georg; Iskar, Murat; Driessen, Marja; Castillo, Mercedes; van Noort, Vera; Bork, Peer

    2016-01-01

    Motivation: Data on bioactivities of drug-like chemicals are rapidly accumulating in public repositories, creating new opportunities for research in computational systems pharmacology. However, integrative analysis of these data sets is difficult due to prevailing ambiguity between chemical names and identifiers and a lack of cross-references between databases. Results: To address this challenge, we have developed CART, a Chemical Annotation Retrieval Toolkit. As a key functionality, it matches an input list of chemical names into a comprehensive reference space to assign unambiguous chemical identifiers. In this unified space, bioactivity annotations can be easily retrieved from databases covering a wide variety of chemical effects on biological systems. Subsequently, CART can determine annotations enriched in the input set of chemicals and display these in tabular format and interactive network visualizations, thereby facilitating integrative analysis of chemical bioactivity data. Availability and Implementation: CART is available as a Galaxy web service (cart.embl.de). Source code and an easy-to-install command line tool can also be obtained from the web site. Contact: bork@embl.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27256313

  14. Virulence factors of the Mycobacterium tuberculosis complex

    PubMed Central

    Forrellad, Marina A.; Klepp, Laura I.; Gioffré, Andrea; Sabio y García, Julia; Morbidoni, Hector R.; Santangelo, María de la Paz; Cataldi, Angel A.; Bigi, Fabiana

    2013-01-01

    The Mycobacterium tuberculosis complex (MTBC) consists of closely related species that cause tuberculosis in both humans and animals. This illness, still today, remains to be one of the leading causes of morbidity and mortality throughout the world. The mycobacteria enter the host by air, and, once in the lungs, are phagocytated by macrophages. This may lead to the rapid elimination of the bacillus or to the triggering of an active tuberculosis infection. A large number of different virulence factors have evolved in MTBC members as a response to the host immune reaction. The aim of this review is to describe the bacterial genes/proteins that are essential for the virulence of MTBC species, and that have been demonstrated in an in vivo model of infection. Knowledge of MTBC virulence factors is essential for the development of new vaccines and drugs to help manage the disease toward an increasingly more tuberculosis-free world. PMID:23076359

  15. Gene Ontology annotations and resources.

    PubMed

    Blake, J A; Dolan, M; Drabkin, H; Hill, D P; Li, Ni; Sitnikov, D; Bridges, S; Burgess, S; Buza, T; McCarthy, F; Peddinti, D; Pillai, L; Carbon, S; Dietze, H; Ireland, A; Lewis, S E; Mungall, C J; Gaudet, P; Chrisholm, R L; Fey, P; Kibbe, W A; Basu, S; Siegele, D A; McIntosh, B K; Renfro, D P; Zweifel, A E; Hu, J C; Brown, N H; Tweedie, S; Alam-Faruque, Y; Apweiler, R; Auchinchloss, A; Axelsen, K; Bely, B; Blatter, M -C; Bonilla, C; Bouguerleret, L; Boutet, E; Breuza, L; Bridge, A; Chan, W M; Chavali, G; Coudert, E; Dimmer, E; Estreicher, A; Famiglietti, L; Feuermann, M; Gos, A; Gruaz-Gumowski, N; Hieta, R; Hinz, C; Hulo, C; Huntley, R; James, J; Jungo, F; Keller, G; Laiho, K; Legge, D; Lemercier, P; Lieberherr, D; Magrane, M; Martin, M J; Masson, P; Mutowo-Muellenet, P; O'Donovan, C; Pedruzzi, I; Pichler, K; Poggioli, D; Porras Millán, P; Poux, S; Rivoire, C; Roechert, B; Sawford, T; Schneider, M; Stutz, A; Sundaram, S; Tognolli, M; Xenarios, I; Foulgar, R; Lomax, J; Roncaglia, P; Khodiyar, V K; Lovering, R C; Talmud, P J; Chibucos, M; Giglio, M Gwinn; Chang, H -Y; Hunter, S; McAnulla, C; Mitchell, A; Sangrador, A; Stephan, R; Harris, M A; Oliver, S G; Rutherford, K; Wood, V; Bahler, J; Lock, A; Kersey, P J; McDowall, D M; Staines, D M; Dwinell, M; Shimoyama, M; Laulederkind, S; Hayman, T; Wang, S -J; Petri, V; Lowry, T; D'Eustachio, P; Matthews, L; Balakrishnan, R; Binkley, G; Cherry, J M; Costanzo, M C; Dwight, S S; Engel, S R; Fisk, D G; Hitz, B C; Hong, E L; Karra, K; Miyasato, S R; Nash, R S; Park, J; Skrzypek, M S; Weng, S; Wong, E D; Berardini, T Z; Huala, E; Mi, H; Thomas, P D; Chan, J; Kishore, R; Sternberg, P; Van Auken, K; Howe, D; Westerfield, M

    2013-01-01

    The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources.

  16. Gene Ontology annotations and resources.

    PubMed

    Blake, J A; Dolan, M; Drabkin, H; Hill, D P; Li, Ni; Sitnikov, D; Bridges, S; Burgess, S; Buza, T; McCarthy, F; Peddinti, D; Pillai, L; Carbon, S; Dietze, H; Ireland, A; Lewis, S E; Mungall, C J; Gaudet, P; Chrisholm, R L; Fey, P; Kibbe, W A; Basu, S; Siegele, D A; McIntosh, B K; Renfro, D P; Zweifel, A E; Hu, J C; Brown, N H; Tweedie, S; Alam-Faruque, Y; Apweiler, R; Auchinchloss, A; Axelsen, K; Bely, B; Blatter, M -C; Bonilla, C; Bouguerleret, L; Boutet, E; Breuza, L; Bridge, A; Chan, W M; Chavali, G; Coudert, E; Dimmer, E; Estreicher, A; Famiglietti, L; Feuermann, M; Gos, A; Gruaz-Gumowski, N; Hieta, R; Hinz, C; Hulo, C; Huntley, R; James, J; Jungo, F; Keller, G; Laiho, K; Legge, D; Lemercier, P; Lieberherr, D; Magrane, M; Martin, M J; Masson, P; Mutowo-Muellenet, P; O'Donovan, C; Pedruzzi, I; Pichler, K; Poggioli, D; Porras Millán, P; Poux, S; Rivoire, C; Roechert, B; Sawford, T; Schneider, M; Stutz, A; Sundaram, S; Tognolli, M; Xenarios, I; Foulgar, R; Lomax, J; Roncaglia, P; Khodiyar, V K; Lovering, R C; Talmud, P J; Chibucos, M; Giglio, M Gwinn; Chang, H -Y; Hunter, S; McAnulla, C; Mitchell, A; Sangrador, A; Stephan, R; Harris, M A; Oliver, S G; Rutherford, K; Wood, V; Bahler, J; Lock, A; Kersey, P J; McDowall, D M; Staines, D M; Dwinell, M; Shimoyama, M; Laulederkind, S; Hayman, T; Wang, S -J; Petri, V; Lowry, T; D'Eustachio, P; Matthews, L; Balakrishnan, R; Binkley, G; Cherry, J M; Costanzo, M C; Dwight, S S; Engel, S R; Fisk, D G; Hitz, B C; Hong, E L; Karra, K; Miyasato, S R; Nash, R S; Park, J; Skrzypek, M S; Weng, S; Wong, E D; Berardini, T Z; Huala, E; Mi, H; Thomas, P D; Chan, J; Kishore, R; Sternberg, P; Van Auken, K; Howe, D; Westerfield, M

    2013-01-01

    The Gene Ontology (GO) Consortium (GOC, http://www.geneontology.org) is a community-based bioinformatics resource that classifies gene product function through the use of structured, controlled vocabularies. Over the past year, the GOC has implemented several processes to increase the quantity, quality and specificity of GO annotations. First, the number of manual, literature-based annotations has grown at an increasing rate. Second, as a result of a new 'phylogenetic annotation' process, manually reviewed, homology-based annotations are becoming available for a broad range of species. Third, the quality of GO annotations has been improved through a streamlined process for, and automated quality checks of, GO annotations deposited by different annotation groups. Fourth, the consistency and correctness of the ontology itself has increased by using automated reasoning tools. Finally, the GO has been expanded not only to cover new areas of biology through focused interaction with experts, but also to capture greater specificity in all areas of the ontology using tools for adding new combinatorial terms. The GOC works closely with other ontology developers to support integrated use of terminologies. The GOC supports its user community through the use of e-mail lists, social media and web-based resources. PMID:23161678

  17. Metagenomic gene annotation by a homology-independent approach

    SciTech Connect

    Froula, Jeff; Zhang, Tao; Salmeen, Annette; Hess, Matthias; Kerfeld, Cheryl A.; Wang, Zhong; Du, Changbin

    2011-06-02

    Fully understanding the genetic potential of a microbial community requires functional annotation of all the genes it encodes. The recently developed deep metagenome sequencing approach has enabled rapid identification of millions of genes from a complex microbial community without cultivation. Current homology-based gene annotation fails to detect distantly-related or structural homologs. Furthermore, homology searches with millions of genes are very computational intensive. To overcome these limitations, we developed rhModeller, a homology-independent software pipeline to efficiently annotate genes from metagenomic sequencing projects. Using cellulases and carbonic anhydrases as two independent test cases, we demonstrated that rhModeller is much faster than HMMER but with comparable accuracy, at 94.5percent and 99.9percent accuracy, respectively. More importantly, rhModeller has the ability to detect novel proteins that do not share significant homology to any known protein families. As {approx}50percent of the 2 million genes derived from the cow rumen metagenome failed to be annotated based on sequence homology, we tested whether rhModeller could be used to annotate these genes. Preliminary results suggest that rhModeller is robust in the presence of missense and frameshift mutations, two common errors in metagenomic genes. Applying the pipeline to the cow rumen genes identified 4,990 novel cellulases candidates and 8,196 novel carbonic anhydrase candidates.In summary, we expect rhModeller to dramatically increase the speed and quality of metagnomic gene annotation.

  18. Cold Shock Exoribonuclease R(VacB) is involved in Aeromonas hydrophila Virulence

    EPA Science Inventory

    In this study, we cloned and sequenced a virulence-associated gene (vacB) from a clinical isolate SSU of Aeromonas hydrophila. We identified this gene based on our recently annotated genome sequence of the environmental isolate ATCC 7966T of A. hydrophila and the vacB gene of Shi...

  19. Microcomputers and the Media Specialist: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Miller, Inabeth

    An overview of the literature reflecting the rapid development of interest in microcomputer use in education since 1978 is followed by an annotated bibliography which lists books, articles, and ERIC documents in nine categories. The first section includes materials of general interest--historical background, guides to using computers in the…

  20. Annotated Bibliography on Religious Development.

    ERIC Educational Resources Information Center

    Bucher, Anton A.; Reich, K. Helmut

    1991-01-01

    Presents an annotated bibliography on religious development that covers the areas of psychology and religion, measurement of religiousness, religious development during the life cycle, religious experiences, conversion, religion and morality, and images of God. (Author/BB)

  1. Patient Education: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Simmons, Jeannette

    Topics included in this annotated bibliography on patient education are (1) background on development of patient education programs, (2) patient education interventions, (3) references for health professionals, and (4) research and evaluation in patient education. (TA)

  2. Combined evidence annotation of transposable elements in genome sequences.

    PubMed

    Quesneville, Hadi; Bergman, Casey M; Andrieu, Olivier; Autard, Delphine; Nouaud, Danielle; Ashburner, Michael; Anxolabehere, Dominique

    2005-07-01

    Transposable elements (TEs) are mobile, repetitive sequences that make up significant fractions of metazoan genomes. Despite their near ubiquity and importance in genome and chromosome biology, most efforts to annotate TEs in genome sequences rely on the results of a single computational program, RepeatMasker. In contrast, recent advances in gene annotation indicate that high-quality gene models can be produced from combining multiple independent sources of computational evidence. To elevate the quality of TE annotations to a level comparable to that of gene models, we have developed a combined evidence-model TE annotation pipeline, analogous to systems used for gene annotation, by integrating results from multiple homology-based and de novo TE identification methods. As proof of principle, we have annotated "TE models" in Drosophila melanogaster Release 4 genomic sequences using the combined computational evidence derived from RepeatMasker, BLASTER, TBLASTX, all-by-all BLASTN, RECON, TE-HMM and the previous Release 3.1 annotation. Our system is designed for use with the Apollo genome annotation tool, allowing automatic results to be curated manually to produce reliable annotations. The euchromatic TE fraction of D. melanogaster is now estimated at 5.3% (cf. 3.86% in Release 3.1), and we found a substantially higher number of TEs (n = 6,013) than previously identified (n = 1,572). Most of the new TEs derive from small fragments of a few hundred nucleotides long and highly abundant families not previously annotated (e.g., INE-1). We also estimated that 518 TE copies (8.6%) are inserted into at least one other TE, forming a nest of elements. The pipeline allows rapid and thorough annotation of even the most complex TE models, including highly deleted and/or nested elements such as those often found in heterochromatic sequences. Our pipeline can be easily adapted to other genome sequences, such as those of the D. melanogaster heterochromatin or other species in the

  3. NCBI prokaryotic genome annotation pipeline.

    PubMed

    Tatusova, Tatiana; DiCuccio, Michael; Badretdin, Azat; Chetvernin, Vyacheslav; Nawrocki, Eric P; Zaslavsky, Leonid; Lomsadze, Alexandre; Pruitt, Kim D; Borodovsky, Mark; Ostell, James

    2016-08-19

    Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/. PMID:27342282

  4. NCBI prokaryotic genome annotation pipeline.

    PubMed

    Tatusova, Tatiana; DiCuccio, Michael; Badretdin, Azat; Chetvernin, Vyacheslav; Nawrocki, Eric P; Zaslavsky, Leonid; Lomsadze, Alexandre; Pruitt, Kim D; Borodovsky, Mark; Ostell, James

    2016-08-19

    Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.

  5. Cryptosporidium Pathogenicity and Virulence

    PubMed Central

    Bouzid, Maha; Chalmers, Rachel M.; Tyler, Kevin M.

    2013-01-01

    Cryptosporidium is a protozoan parasite of medical and veterinary importance that causes gastroenteritis in a variety of vertebrate hosts. Several studies have reported different degrees of pathogenicity and virulence among Cryptosporidium species and isolates of the same species as well as evidence of variation in host susceptibility to infection. The identification and validation of Cryptosporidium virulence factors have been hindered by the renowned difficulties pertaining to the in vitro culture and genetic manipulation of this parasite. Nevertheless, substantial progress has been made in identifying putative virulence factors for Cryptosporidium. This progress has been accelerated since the publication of the Cryptosporidium parvum and C. hominis genomes, with the characterization of over 25 putative virulence factors identified by using a variety of immunological and molecular techniques and which are proposed to be involved in aspects of host-pathogen interactions from adhesion and locomotion to invasion and proliferation. Progress has also been made in the contribution of host factors that are associated with variations in both the severity and risk of infection. Here we provide a review comprised of the current state of knowledge on Cryptosporidium infectivity, pathogenesis, and transmissibility in light of our contemporary understanding of microbial virulence. PMID:23297262

  6. Parasitoid wasp virulence

    PubMed Central

    Mortimer, Nathan T

    2013-01-01

    In nature, larvae of the fruit fly Drosophila melanogaster are commonly infected by parasitoid wasps. Following infection, flies mount an immune response termed cellular encapsulation in which fly immune cells form a multilayered capsule that covers and kills the wasp egg. Parasitoids have thus evolved virulence factors to suppress cellular encapsulation. To uncover the molecular mechanisms underlying the antiwasp response, we and others have begun identifying and functionally characterizing these virulence factors. Our recent work on the Drosophila parasitoid Ganaspis sp.1 has demonstrated that a virulence factor encoding a SERCA-type calcium pump plays an important role in Ganaspis sp.1 virulence. This venom SERCA antagonizes fly immune cell calcium signaling and thereby prevents the activation of the encapsulation response. In this way, the study of wasp virulence factors has revealed a novel aspect of fly immunity, namely a role for calcium signaling in fly immune cell activation, which is conserved with human immunity, again illustrating the marked conservation between fly and mammalian immune responses. Our findings demonstrate that the cellular encapsulation response can serve as a model of immune cell function and can also provide valuable insight into basic cell biological processes. PMID:24088661

  7. Beyond Mortality: Sterility As a Neglected Component of Parasite Virulence.

    PubMed

    Abbate, Jessica L; Kada, Sarah; Lion, Sébastien

    2015-12-01

    Virulence is generally defined as the reduction in host fitness following infection by a parasite (see Box 1 for glossary) [1]. In general, parasite exploitation of host resources may reduce host survival (mortality virulence), decrease host fecundity (sterility virulence), or even have sub-lethal effects that disturb the way individuals interact within a community (morbidity) [2,3]. In fact, the virulence of many parasites involves a combination of these various effects (Box 2). In practice, however, virulence is most often defined as disease-induced mortality [1, 4-6]. This is especially true in the theoretical literature, where the evolution of sterility virulence, morbidity, and mixed strategies of host exploitation have received relatively little attention. While the focus on mortality effects has allowed for easy comparison between models and, thus, rapid advancement of the field, we ask whether these theoretical simplifications have led us to inadvertently minimize the evolutionary importance of host sterilization and secondary virulence effects. As explicit theoretical work on morbidity is currently lacking (but see [7]), our aim in this Opinion piece is to discuss what is understood about sterility virulence evolution, its adaptive potential, and the implications for parasites that utilize a combination of host survival and reproductive resources.

  8. Beyond Mortality: Sterility As a Neglected Component of Parasite Virulence

    PubMed Central

    Abbate, Jessica L.; Kada, Sarah; Lion, Sébastien

    2015-01-01

    Virulence is generally defined as the reduction in host fitness following infection by a parasite (see Box 1 for glossary) [1]. In general, parasite exploitation of host resources may reduce host survival (mortality virulence), decrease host fecundity (sterility virulence), or even have sub-lethal effects that disturb the way individuals interact within a community (morbidity) [2,3]. In fact, the virulence of many parasites involves a combination of these various effects (Box 2). In practice, however, virulence is most often defined as disease-induced mortality [1, 4–6]. This is especially true in the theoretical literature, where the evolution of sterility virulence, morbidity, and mixed strategies of host exploitation have received relatively little attention. While the focus on mortality effects has allowed for easy comparison between models and, thus, rapid advancement of the field, we ask whether these theoretical simplifications have led us to inadvertently minimize the evolutionary importance of host sterilization and secondary virulence effects. As explicit theoretical work on morbidity is currently lacking (but see [7]), our aim in this Opinion piece is to discuss what is understood about sterility virulence evolution, its adaptive potential, and the implications for parasites that utilize a combination of host survival and reproductive resources. PMID:26632822

  9. AGeS: A Software System for Microbial Genome Sequence Annotation

    PubMed Central

    Kumar, Kamal; Desai, Valmik; Cheng, Li; Khitrov, Maxim; Grover, Deepak; Satya, Ravi Vijaya; Yu, Chenggang; Zavaljevski, Nela; Reifman, Jaques

    2011-01-01

    Background The annotation of genomes from next-generation sequencing platforms needs to be rapid, high-throughput, and fully integrated and automated. Although a few Web-based annotation services have recently become available, they may not be the best solution for researchers that need to annotate a large number of genomes, possibly including proprietary data, and store them locally for further analysis. To address this need, we developed a standalone software application, the Annotation of microbial Genome Sequences (AGeS) system, which incorporates publicly available and in-house-developed bioinformatics tools and databases, many of which are parallelized for high-throughput performance. Methodology The AGeS system supports three main capabilities. The first is the storage of input contig sequences and the resulting annotation data in a central, customized database. The second is the annotation of microbial genomes using an integrated software pipeline, which first analyzes contigs from high-throughput sequencing by locating genomic regions that code for proteins, RNA, and other genomic elements through the Do-It-Yourself Annotation (DIYA) framework. The identified protein-coding regions are then functionally annotated using the in-house-developed Pipeline for Protein Annotation (PIPA). The third capability is the visualization of annotated sequences using GBrowse. To date, we have implemented these capabilities for bacterial genomes. AGeS was evaluated by comparing its genome annotations with those provided by three other methods. Our results indicate that the software tools integrated into AGeS provide annotations that are in general agreement with those provided by the compared methods. This is demonstrated by a >94% overlap in the number of identified genes, a significant number of identical annotated features, and a >90% agreement in enzyme function predictions. PMID:21408217

  10. Objective-guided image annotation.

    PubMed

    Mao, Qi; Tsang, Ivor Wai-Hung; Gao, Shenghua

    2013-04-01

    Automatic image annotation, which is usually formulated as a multi-label classification problem, is one of the major tools used to enhance the semantic understanding of web images. Many multimedia applications (e.g., tag-based image retrieval) can greatly benefit from image annotation. However, the insufficient performance of image annotation methods prevents these applications from being practical. On the other hand, specific measures are usually designed to evaluate how well one annotation method performs for a specific objective or application, but most image annotation methods do not consider optimization of these measures, so that they are inevitably trapped into suboptimal performance of these objective-specific measures. To address this issue, we first summarize a variety of objective-guided performance measures under a unified representation. Our analysis reveals that macro-averaging measures are very sensitive to infrequent keywords, and hamming measure is easily affected by skewed distributions. We then propose a unified multi-label learning framework, which directly optimizes a variety of objective-specific measures of multi-label learning tasks. Specifically, we first present a multilayer hierarchical structure of learning hypotheses for multi-label problems based on which a variety of loss functions with respect to objective-guided measures are defined. And then, we formulate these loss functions as relaxed surrogate functions and optimize them by structural SVMs. According to the analysis of various measures and the high time complexity of optimizing micro-averaging measures, in this paper, we focus on example-based measures that are tailor-made for image annotation tasks but are seldom explored in the literature. Experiments show consistency with the formal analysis on two widely used multi-label datasets, and demonstrate the superior performance of our proposed method over state-of-the-art baseline methods in terms of example-based measures on four

  11. Collective dynamics of social annotation.

    PubMed

    Cattuto, Ciro; Barrat, Alain; Baldassarri, Andrea; Schehr, Gregory; Loreto, Vittorio

    2009-06-30

    The enormous increase of popularity and use of the worldwide web has led in the recent years to important changes in the ways people communicate. An interesting example of this fact is provided by the now very popular social annotation systems, through which users annotate resources (such as web pages or digital photographs) with keywords known as "tags." Understanding the rich emergent structures resulting from the uncoordinated actions of users calls for an interdisciplinary effort. In particular concepts borrowed from statistical physics, such as random walks (RWs), and complex networks theory, can effectively contribute to the mathematical modeling of social annotation systems. Here, we show that the process of social annotation can be seen as a collective but uncoordinated exploration of an underlying semantic space, pictured as a graph, through a series of RWs. This modeling framework reproduces several aspects, thus far unexplained, of social annotation, among which are the peculiar growth of the size of the vocabulary used by the community and its complex network structure that represents an externalization of semantic structures grounded in cognition and that are typically hard to access. PMID:19506244

  12. Genomic Data and Annotation from the SEED

    DOE Data Explorer

    Fonstein, Michael; Kogan, Yakov; Osterman, Andrei; Overbeek, Ross; Vonstein, Veronika The Fellowship for Interpretation of Genomes (FIG)

    The SEED Project is a cooperative effort to annotate ever-expanding genomic data so researchers can conduct effective comparative analyses of genomes. Launched in 2003 by the Fellowship for Interpretation of Genomes (FIG), the project is one of several initiatives in ongoing development of data curation systems. SEED is designed to be used by scientists from numerous centers and with varied research objectives. As such, several institutions have since joined FIG in a consortium, including the University of Chicago, DOE’s Argonne National Laboratory (ANL), the University of Illinois at Urbana-Champaign, and others. As one example, ANL has used SEED to develop the National Microbial Pathogen Data Resource. Other agencies and institutions have used the project to discover genome components and clarify gene functions such as metabolism. SEED also has enabled researchers to conduct comparative analyses of closely related genomes and has supported derivation of stoichiometric models to understand metabolic processes. The SEED Project has been extended to support metagenomic samples and concomitant analytical tools. Moreover, the number of genomes being introduced into SEED is growing very rapidly. Building a framework to support this growth while providing highly accurate annotations is centrally important to SEED. The project’s subsystem-based annotation strategy has become the technological foundation for addressing these challenges.(copied from Appendix 7 of Systems Biology Knowledgebase for a New Era in Biology, A Genomics:GTL Report from the May 2008 Workshop, DOE/SC-0113, Grequrick, S; Fredrickson, J.K.; Stevens, R., Pub March 1, 2009.)

  13. Mouse genome annotation by the RefSeq project.

    PubMed

    McGarvey, Kelly M; Goldfarb, Tamara; Cox, Eric; Farrell, Catherine M; Gupta, Tripti; Joardar, Vinita S; Kodali, Vamsi K; Murphy, Michael R; O'Leary, Nuala A; Pujar, Shashikant; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Webb, David; Wright, Mathew W; Murphy, Terence D; Pruitt, Kim D

    2015-10-01

    Complete and accurate annotation of the mouse genome is critical to the advancement of research conducted on this important model organism. The National Center for Biotechnology Information (NCBI) develops and maintains many useful resources to assist the mouse research community. In particular, the reference sequence (RefSeq) database provides high-quality annotation of multiple mouse genome assemblies using a combinatorial approach that leverages computation, manual curation, and collaboration. Implementation of this conservative and rigorous approach, which focuses on representation of only full-length and non-redundant data, produces high-quality annotation products. RefSeq records explicitly link sequences to current knowledge in a timely manner, updating public records regularly and rapidly in response to nomenclature updates, addition of new relevant publications, collaborator discussion, and user feedback. Whole genome re-annotation is also conducted at least every 12-18 months, and often more frequently in response to assembly updates or availability of informative data. This article highlights key features and advantages of RefSeq genome annotation products and presents an overview of NCBI processes to generate these data. Further discussion of NCBI's resources highlights useful features and the best methods for accessing our data.

  14. Virulence factors of medically important fungi.

    PubMed Central

    Hogan, L H; Klein, B S; Levitz, S M

    1996-01-01

    Human fungal pathogens have become an increasingly important medical problem with the explosion in the number of immunocompromised patients as a result of cancer, steroid therapy, chemotherapy, and AIDS. Additionally, the globalization of travel and expansion of humankind into previously undisturbed habitats have led to the reemergence of old fungi and new exposure to previously undescribed fungi. Until recently, relatively little was known about virulence factors for the medically important fungi. With the advent of molecular genetics, rapid progress has now been made in understanding the basis of pathogenicity for organisms such as Aspergillus species and Cryptococcus neoformans. The twin technologies of genetic transformation and "knockout" deletion construction allowed for genetic tests of virulence factors in these organisms. Such knowledge will prove invaluable for the rational design of antifungal therapies. Putative virulence factors and attributes are reviewed for Aspergillus species, C. neoformans, the dimorphic fungal pathogens, and others, with a focus upon a molecular genetic approach. Candida species are excluded from coverage, having been the subject of numerous recent reviews. This growing body of knowledge about fungal pathogens and their virulence factors will significantly aid efforts to treat the serious diseases they cause. PMID:8894347

  15. Novel Strategies to Combat Bacterial Virulence

    PubMed Central

    Lynch, S.V.; Wiener-Kronish, J.P.

    2010-01-01

    Purpose of review Incidences of antimicrobial resistant infections have increased dramatically over the past several decades and are associated with adverse patient outcomes. Alternative approaches to combat infection are critical, and have led to the development of more specific drugs targeted at particular bacterial virulence systems or essential regulatory pathways. The purpose of this review is to highlight the recent developments in anti-bacterial therapy and the novel approaches toward increasing our therapeutic armory against bacterial infection. Recent findings Although classic antibiotic development is not occurring rapidly, alternative therapeutics that target specific bacterial virulence systems are progressing from the discovery stage through the FDA approval process. Here we review novel antibodies that target specific virulence systems as well as a variety of newly discovered small molecules that block bacterial attachment, communication systems (quorum sensing) or important regulatory processes associated with virulence gene expression. Summary The success of novel therapeutics could significantly change clinical practice. Furthermore, the complications of collateral damage due to antibiotic administration e.g. suprainfections or decreased host immunity due to loss of synergistic bacterial communities, may be minimized using therapeutics that specifically target pathogenic behavior. PMID:18787455

  16. IMG ER: A System for Microbial Genome Annotation Expert Review and Curation

    SciTech Connect

    Markowitz, Victor M.; Mavromatis, Konstantinos; Ivanova, Natalia N.; Chen, I-Min A.; Chu, Ken; Kyrpides, Nikos C.

    2009-05-25

    A rapidly increasing number of microbial genomes are sequenced by organizations worldwide and are eventually included into various public genome data resources. The quality of the annotations depends largely on the original dataset providers, with erroneous or incomplete annotations often carried over into the public resources and difficult to correct. We have developed an Expert Review (ER) version of the Integrated Microbial Genomes (IMG) system, with the goal of supporting systematic and efficient revision of microbial genome annotations. IMG ER provides tools for the review and curation of annotations of both new and publicly available microbial genomes within IMG's rich integrated genome framework. New genome datasets are included into IMG ER prior to their public release either with their native annotations or with annotations generated by IMG ER's annotation pipeline. IMG ER tools allow addressing annotation problems detected with IMG's comparative analysis tools, such as genes missed by gene prediction pipelines or genes without an associated function. Over the past year, IMG ER was used for improving the annotations of about 150 microbial genomes.

  17. Sonoran Pronghorn Literature: An Annotated Bibliography

    USGS Publications Warehouse

    Krausman, Paul R.; Morgart, John R.; Harris, Lisa K.; O'Brien, Chantal S.; Cain, James W.; Rosenstock, Steve S.

    2005-01-01

    EXECUTIVE SUMMARY The Sonoran pronghorn (Antilocapra americana sonoriensis) is 1 of 5 subspecies of pronghorn in North America. Sonoran pronghorn historically ranged from eastern California into southeastern Arizona and south to Sonora, Mexico. Sonoran pronghorn currently inhabit the Sonoran Desert in Southwestern Arizona and northern Sonora, Mexico. Unfortunately, their future in North America is uncertain. In the United States, as of December 2004, there were <51 freeranging individual Sonoran pronghorn. This subspecies has been listed as endangered by the United States Fish and Wildlife Service since 1967. Because of the rapid decline in population size, biologists and managers increased management efforts to reverse the downward spiral to extinction. To assist with enhanced management we have compiled an annotated bibliography of most of the works published on Sonoran pronghorn including peer-reviewed papers (n = 31, including submitted manuscripts), books (n = 26), theses and dissertations (n = 5), conferences, proceedings and symposiums (n = 31), reports (n = 84), abstracts (n = 14), popular articles (n = 41), and others (n = 4). These are the same categories under which we list annotations. Most of the articles involve A. a. sonoriensis. We present the scientific name of other pronghorn when clarification is needed.

  18. Interactive Display of Scenes with Annotations

    NASA Technical Reports Server (NTRS)

    Vona, Marsette; Powell, Mark; Backes, Paul; Norris, Jeffrey; Steinke, Robert

    2005-01-01

    ThreeDView is a computer program that enables high-performance interactive display of real-world scenes with annotations. ThreeDView was developed primarily as a component of the Science Activity Planner (SAP) software, wherein it is to be used to display annotated images of terrain acquired by exploratory robots on Mars and possibly other remote planets. The images can be generated from sets of multiple-texture image data in the Visible Scalable Terrain (ViSTa) format, which was described in "Format for Interchange and Display of 3D Terrain Data" (NPO-30600) NASA Tech Briefs, Vol. 28, No. 12 (December 2004), page 25. In ThreeDView, terrain data can be loaded rapidly, the geometric level of detail and texture resolution can be selected, false colors can be used to represent scientific data mapped onto terrain, and the user can select among navigation modes. ThreeDView consists largely of modular Java software components that can easily be reused and extended to produce new high-performance, application-specific software systems for displaying images of three-dimensional real-world scenes.

  19. Preserving sequence annotations across reference sequences

    PubMed Central

    2014-01-01

    Background Matching and comparing sequence annotations of different reference sequences is vital to genomics research, yet many annotation formats do not specify the reference sequence types or versions used. This makes the integration of annotations from different sources difficult and error prone. Results As part of our effort to create linked data for interoperable sequence annotations, we present an RDF data model for sequence annotation using the ontological framework established by the OBO Foundry ontologies and the Basic Formal Ontology (BFO). We defined reference sequences as the common domain of integration for sequence annotations, and identified three semantic relationships between sequence annotations. In doing so, we created the Reference Sequence Annotation to compensate for gaps in the SO and in its mapping to BFO, particularly for annotations that refer to versions of consensus reference sequences. Moreover, we present three integration models for sequence annotations using different reference assemblies. Conclusions We demonstrated a working example of a sequence annotation instance, and how this instance can be linked to other annotations on different reference sequences. Sequence annotations in this format are semantically rich and can be integrated easily with different assemblies. We also identify other challenges of modeling reference sequences with the BFO. PMID:25093075

  20. Optimizing high performance computing workflow for protein functional annotation.

    PubMed

    Stanberry, Larissa; Rekepalli, Bhanu; Liu, Yuan; Giblock, Paul; Higdon, Roger; Montague, Elizabeth; Broomall, William; Kolker, Natali; Kolker, Eugene

    2014-09-10

    Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation infeasible, whereas a high compute cost limits the utility of existing automated approaches. In this work, we present an improved and optmized automated workflow to enable large-scale protein annotation. The workflow uses high performance computing architectures and a low complexity classification algorithm to assign proteins into existing clusters of orthologous groups of proteins. On the basis of the Position-Specific Iterative Basic Local Alignment Search Tool the algorithm ensures at least 80% specificity and sensitivity of the resulting classifications. The workflow utilizes highly scalable parallel applications for classification and sequence alignment. Using Extreme Science and Engineering Discovery Environment supercomputers, the workflow processed 1,200,000 newly sequenced bacterial proteins. With the rapid expansion of the protein sequence universe, the proposed workflow will enable scientists to annotate big genome data. PMID:25313296

  1. Optimizing high performance computing workflow for protein functional annotation.

    PubMed

    Stanberry, Larissa; Rekepalli, Bhanu; Liu, Yuan; Giblock, Paul; Higdon, Roger; Montague, Elizabeth; Broomall, William; Kolker, Natali; Kolker, Eugene

    2014-09-10

    Functional annotation of newly sequenced genomes is one of the major challenges in modern biology. With modern sequencing technologies, the protein sequence universe is rapidly expanding. Newly sequenced bacterial genomes alone contain over 7.5 million proteins. The rate of data generation has far surpassed that of protein annotation. The volume of protein data makes manual curation infeasible, whereas a high compute cost limits the utility of existing automated approaches. In this work, we present an improved and optmized automated workflow to enable large-scale protein annotation. The workflow uses high performance computing architectures and a low complexity classification algorithm to assign proteins into existing clusters of orthologous groups of proteins. On the basis of the Position-Specific Iterative Basic Local Alignment Search Tool the algorithm ensures at least 80% specificity and sensitivity of the resulting classifications. The workflow utilizes highly scalable parallel applications for classification and sequence alignment. Using Extreme Science and Engineering Discovery Environment supercomputers, the workflow processed 1,200,000 newly sequenced bacterial proteins. With the rapid expansion of the protein sequence universe, the proposed workflow will enable scientists to annotate big genome data.

  2. Virulence evolution at the front line of spreading epidemics.

    PubMed

    Griette, Quentin; Raoul, Gaël; Gandon, Sylvain

    2015-11-01

    Understanding and predicting the spatial spread of emerging pathogens is a major challenge for the public health management of infectious diseases. Theoretical epidemiology shows that the speed of an epidemic is governed by the life-history characteristics of the pathogen and its ability to disperse. Rapid evolution of these traits during the invasion may thus affect the speed of epidemics. Here we study the influence of virulence evolution on the spatial spread of an epidemic. At the edge of the invasion front, we show that more virulent and transmissible genotypes are expected to win the competition with other pathogens. Behind the front line, however, more prudent exploitation strategies outcompete virulent pathogens. Crucially, even when the presence of the virulent mutant is limited to the edge of the front, the invasion speed can be dramatically altered by pathogen evolution. We support our analysis with individual-based simulations and we discuss the additional effects of demographic stochasticity taking place at the front line on virulence evolution. We confirm that an increase of virulence can occur at the front, but only if the carrying capacity of the invading pathogen is large enough. These results are discussed in the light of recent empirical studies examining virulence evolution at the edge of spreading epidemics. PMID:26416254

  3. Virulence evolution at the front line of spreading epidemics.

    PubMed

    Griette, Quentin; Raoul, Gaël; Gandon, Sylvain

    2015-11-01

    Understanding and predicting the spatial spread of emerging pathogens is a major challenge for the public health management of infectious diseases. Theoretical epidemiology shows that the speed of an epidemic is governed by the life-history characteristics of the pathogen and its ability to disperse. Rapid evolution of these traits during the invasion may thus affect the speed of epidemics. Here we study the influence of virulence evolution on the spatial spread of an epidemic. At the edge of the invasion front, we show that more virulent and transmissible genotypes are expected to win the competition with other pathogens. Behind the front line, however, more prudent exploitation strategies outcompete virulent pathogens. Crucially, even when the presence of the virulent mutant is limited to the edge of the front, the invasion speed can be dramatically altered by pathogen evolution. We support our analysis with individual-based simulations and we discuss the additional effects of demographic stochasticity taking place at the front line on virulence evolution. We confirm that an increase of virulence can occur at the front, but only if the carrying capacity of the invading pathogen is large enough. These results are discussed in the light of recent empirical studies examining virulence evolution at the edge of spreading epidemics.

  4. Infant Feeding: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Crowhurst, Christine Marie, Comp.; Kumer, Bonnie Lee, Comp.

    Intended for parents, health professionals and allied health workers, and others involved in caring for infants and young children, this annotated bibliography brings together in one selective listing a review of over 700 current publications related to infant feeding. Reflecting current knowledge in infant feeding, the bibliography has as its…

  5. English Language Learners: Annotated Bibliography

    ERIC Educational Resources Information Center

    Hector-Mason, Anestine; Bardack, Sarah

    2010-01-01

    This annotated bibliography represents a first step toward compiling a comprehensive overview of current research on issues related to English language learners (ELLs). It is intended to be a resource for researchers, policymakers, administrators, and educators who are engaged in efforts to bridge the divide between research, policy, and practice…

  6. Appalachian Women. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Hamm, Mary Margo

    This bibliography compiles annotations of 178 books, journal articles, ERIC documents, and dissertations on Appalachian women and their social, cultural, and economic environment. Entries were published 1966-93 and are listed in the following categories: (1) authors and literary criticism; (2) bibliographies and resource guides; (3) economics,…

  7. Radiocarbon Dating: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Fortine, Suellen

    This selective annotated bibliography covers various sources of information on the radiocarbon dating method, including journal articles, conference proceedings, and reports, reflecting the most important and useful sources of the last 25 years. The bibliography is divided into five parts--general background on radiocarbon, radiocarbon dating,…

  8. Hispanic Heritage. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Denver Univ., CO. School of Education.

    This annotated bibliography of a wide range of materials for the social studies teacher is concerned with the Hispano heritage. The sections are introduced by a brief description. The sections are: 1) general materials, 2) the land and the people, 3) the European background, 4) Spain's colonial system, 5) the Spanish borderlands, 6) the Anglo…

  9. Rural Education: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Massey, Sara

    The 120-item annotated bibliography was compiled to facilitate the development of a recently approved course entitled "Topics in Rural Education" at the University of Maine at Machias. Although the dates range from 1964 to 1982, most of the materials were prepared in the 1970s and 1980s. The interrelatedness of the issues makes categorization…

  10. Workforce Reductions. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Hickok, Thomas A.; Hickok, Thomas A.

    This report, which is based on a review of practitioner-oriented sources and scholarly journals, uses a three-part framework to organize annotated bibliographies that, together, list a total of 104 sources that provide the following three perspectives on work force reduction issues: organizational, organizational-individual relationship, and…

  11. Vietnamese Amerasians: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Johnson, Mark C.; And Others

    This annotated bibliography on Vietnamese Amerasians includes primary and secondary sources as well as reviews of three documentary films. Sources were selected in order to provide an overview of the historical and political context of Amerasian resettlement and a review of the scant available research on coping and adaptation with this…

  12. Instructional Materials Centers; Annotated Bibliography.

    ERIC Educational Resources Information Center

    Poli, Rosario, Comp.

    An annotated bibliography lists 74 articles and reports on instructional materials centers (IMC) which appeared from 1967-70. The articles deal with such topics as the purposes of an IMC, guidelines for setting up an IMC, and the relationship of an IMC to technology. Most articles deal with use of an IMC on an elementary or secondary level, but…

  13. Nikos Kazantzakis: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Qiu, Kui

    This research paper consists of an annotated bibliography about Nikos Kazantzakis, one of the major modern Greek writers and author of "The Last Temptation of Christ,""Zorba the Greek," and many other works. Because of Kazantzakis' position in world literature there are many critical works about him; however, bibliographical control of these works…

  14. An Annotated Bibliography for Art.

    ERIC Educational Resources Information Center

    Minnesota State Dept. of Education, St. Paul. Div. of Instruction.

    The annotated bibliography presents approximately 450 references about art for elementary, secondary, and professional levels. It is presented in three sections. Section one identifies 19 resources about art from a professional or teaching perspective. Included are books explaining how to teach various techniques to students of beginning or…

  15. Annotated Bibliography on Humanistic Education

    ERIC Educational Resources Information Center

    Ganung, Cynthia

    1975-01-01

    Part I of this annotated bibliography deals with books and articles on such topics as achievement motivation, process education, transactional analysis, discipline without punishment, role-playing, interpersonal skills, self-acceptance, moral education, self-awareness, values clarification, and non-verbal communication. Part II focuses on…

  16. MSDAC Resource Library Annotated Bibliography.

    ERIC Educational Resources Information Center

    Watson, Cristel; And Others

    This annotated bibliography lists books, films, filmstrips, recordings, and booklets on sex equity. Entries are arranged according to the following topics: career resources, curriculum resources, management, sex equity, sex roles, women's studies, student activities, and sex-fair fiction. Included in each entry are name of author, editor or…

  17. Multicultural Education. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Narang, H. L.

    This annotated bibliography contains references to books, journal articles, ERIC documents, doctoral dissertations, and audio-visual materials on the subject of multicultural education. Topics include integrating multiculturalism in school subjects, prejudice and discrimination, intercultural communication, ethnic identity and ethnic bias.…

  18. Systems Theory and Communication. Annotated Bibliography.

    ERIC Educational Resources Information Center

    Covington, William G., Jr.

    This annotated bibliography presents annotations of 31 books and journal articles dealing with systems theory and its relation to organizational communication, marketing, information theory, and cybernetics. Materials were published between 1963 and 1992 and are listed alphabetically by author. (RS)

  19. Agrobacterium virulence gene induction.

    PubMed

    Gelvin, Stanton B

    2006-01-01

    The ability of Agrobacterium to transform plants and other organisms is under highly regulated genetic control. Two Virulence (Vir) proteins, VirA and VirG, function as a two-component regulatory system to sense particular phenolic compounds synthesized by wounded plant tissues. Induction by these phenolic compounds, in the presence of certain neutral or acid sugars, results in activation of other vir genes, leading to the processing of T-DNA from the Ti-plasmid and transfer of T-DNA to recipient host cells. Many plant, and most nonplant, species do not provide sufficient quantities of the correct phenolic compounds to permit efficient Agrobacterium-mediated genetic transformation to occur. In order to transform these species, phenolic inducing compounds must be added to agrobacteria before and/or during cocultivation of recipient cells with the bacteria. This chapter discusses conditions for efficient induction of Agrobacterium virulence genes by phenolic compounds. PMID:16988335

  20. Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow

    PubMed Central

    Wright, James C.; Mudge, Jonathan; Weisser, Hendrik; Barzine, Mitra P.; Gonzalez, Jose M.; Brazma, Alvis; Choudhary, Jyoti S.; Harrow, Jennifer

    2016-01-01

    Complete annotation of the human genome is indispensable for medical research. The GENCODE consortium strives to provide this, augmenting computational and experimental evidence with manual annotation. The rapidly developing field of proteogenomics provides evidence for the translation of genes into proteins and can be used to discover and refine gene models. However, for both the proteomics and annotation groups, there is a lack of guidelines for integrating this data. Here we report a stringent workflow for the interpretation of proteogenomic data that could be used by the annotation community to interpret novel proteogenomic evidence. Based on reprocessing of three large-scale publicly available human data sets, we show that a conservative approach, using stringent filtering is required to generate valid identifications. Evidence has been found supporting 16 novel protein-coding genes being added to GENCODE. Despite this many peptide identifications in pseudogenes cannot be annotated due to the absence of orthogonal supporting evidence. PMID:27250503

  1. Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow.

    PubMed

    Wright, James C; Mudge, Jonathan; Weisser, Hendrik; Barzine, Mitra P; Gonzalez, Jose M; Brazma, Alvis; Choudhary, Jyoti S; Harrow, Jennifer

    2016-01-01

    Complete annotation of the human genome is indispensable for medical research. The GENCODE consortium strives to provide this, augmenting computational and experimental evidence with manual annotation. The rapidly developing field of proteogenomics provides evidence for the translation of genes into proteins and can be used to discover and refine gene models. However, for both the proteomics and annotation groups, there is a lack of guidelines for integrating this data. Here we report a stringent workflow for the interpretation of proteogenomic data that could be used by the annotation community to interpret novel proteogenomic evidence. Based on reprocessing of three large-scale publicly available human data sets, we show that a conservative approach, using stringent filtering is required to generate valid identifications. Evidence has been found supporting 16 novel protein-coding genes being added to GENCODE. Despite this many peptide identifications in pseudogenes cannot be annotated due to the absence of orthogonal supporting evidence. PMID:27250503

  2. Genomic Correlates of Virulence Attenuation in the Deadly Amphibian Chytrid Fungus, Batrachochytrium dendrobatidis.

    PubMed

    Refsnider, Jeanine M; Poorten, Thomas J; Langhammer, Penny F; Burrowes, Patricia A; Rosenblum, Erica Bree

    2015-11-01

    Emerging infectious diseasespose a significant threat to global health, but predicting disease outcomes for particular species can be complicated when pathogen virulence varies across space, time, or hosts. The pathogenic chytrid fungus Batrachochytrium dendrobatidis (Bd) has caused worldwide declines in frog populations. Not only do Bd isolates from wild populations vary in virulence, but virulence shifts can occur over short timescales when Bd is maintained in the laboratory. We leveraged changes in Bd virulence over multiple generations of passage to better understand mechanisms of pathogen virulence. We conducted whole-genome resequencing of two samples of the same Bd isolate, differing only in passage history, to identify genomic processes associated with virulence attenuation. The isolate with shorter passage history (and greater virulence) had greater chromosome copy numbers than the isolate maintained in culture for longer, suggesting that virulence attenuation may be associated with loss of chromosome copies. Our results suggest that genomic processes proposed as mechanisms for rapid evolution in Bd are correlated with virulence attenuation in laboratory culture within a single lineage of Bd. Moreover, these genomic processes can occur over extremely short timescales. On a practical level, our results underscore the importance of immediately cryo-archiving new Bd isolates and using fresh isolates, rather than samples cultured in the laboratory for long periods, for laboratory infection experiments. Finally, when attempting to predict disease outcomes for this ecologically important pathogen, it is critical to consider existing variation in virulence among isolates and the potential for shifts in virulence over short timescales. PMID:26333840

  3. Genomic Correlates of Virulence Attenuation in the Deadly Amphibian Chytrid Fungus, Batrachochytrium dendrobatidis

    PubMed Central

    Refsnider, Jeanine M.; Poorten, Thomas J.; Langhammer, Penny F.; Burrowes, Patricia A.; Rosenblum, Erica Bree

    2015-01-01

    Emerging infectious diseasespose a significant threat to global health, but predicting disease outcomes for particular species can be complicated when pathogen virulence varies across space, time, or hosts. The pathogenic chytrid fungus Batrachochytrium dendrobatidis (Bd) has caused worldwide declines in frog populations. Not only do Bd isolates from wild populations vary in virulence, but virulence shifts can occur over short timescales when Bd is maintained in the laboratory. We leveraged changes in Bd virulence over multiple generations of passage to better understand mechanisms of pathogen virulence. We conducted whole-genome resequencing of two samples of the same Bd isolate, differing only in passage history, to identify genomic processes associated with virulence attenuation. The isolate with shorter passage history (and greater virulence) had greater chromosome copy numbers than the isolate maintained in culture for longer, suggesting that virulence attenuation may be associated with loss of chromosome copies. Our results suggest that genomic processes proposed as mechanisms for rapid evolution in Bd are correlated with virulence attenuation in laboratory culture within a single lineage of Bd. Moreover, these genomic processes can occur over extremely short timescales. On a practical level, our results underscore the importance of immediately cryo-archiving new Bd isolates and using fresh isolates, rather than samples cultured in the laboratory for long periods, for laboratory infection experiments. Finally, when attempting to predict disease outcomes for this ecologically important pathogen, it is critical to consider existing variation in virulence among isolates and the potential for shifts in virulence over short timescales. PMID:26333840

  4. ANNOTATED BIBLIOGRAPHY ON CREATIVITY AND GIFTEDNESS.

    ERIC Educational Resources Information Center

    GOWAN, JOHN CURTIS

    THIS ANNOTATED BIBLIOGRAPHY REPRESENTS A SAMPLING OF PUBLISHED WRITING ON CREATIVITY AND GIFTED CHILDREN SINCE 1960. THE LIST WAS COMPILED FOR EDUCATIONAL RESEARCHERS. IN A FEW INSTANCES THE ANNOTATIONS HAVE BEEN MODIFIED OR ABRIDGED FROM THOSE FOUND IN "PSYCHOLOGICAL ABSTRACTS" OR OTHER JOURNAL ABSTRACTS. SOME OF THE ANNOTATIONS HAVE PREVIOUSLY…

  5. Alcohol Education Materials; An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Milgram, Gail Gleason

    This 873-item annotated bibliography cites books, pamphlets, leaflets, and other materials produced for education about alcohol from 1950 to May 1973. The major part of each annotation is a brief summary of the contents. The annotation also contains a statement of orientation or type of presentation and evaluative comments. Each item is classified…

  6. Annotation and Classification of Argumentative Writing Revisions

    ERIC Educational Resources Information Center

    Zhang, Fan; Litman, Diane

    2015-01-01

    This paper explores the annotation and classification of students' revision behaviors in argumentative writing. A sentence-level revision schema is proposed to capture why and how students make revisions. Based on the proposed schema, a small corpus of student essays and revisions was annotated. Studies show that manual annotation is reliable with…

  7. Riboregulators: Fine-Tuning Virulence in Shigella

    PubMed Central

    Fris, Megan E.; Murphy, Erin R.

    2016-01-01

    Within the past several years, RNA-mediated regulation (ribo-regulation) has become increasingly recognized for its importance in controlling critical bacterial processes. Regulatory RNA molecules, or riboregulators, are perpetually responsive to changes within the micro-environment of a bacterium. Notably, several characterized riboregulators control virulence in pathogenic bacteria, as is the case for each riboregulator characterized to date in Shigella. The timing of virulence gene expression and the ability of the pathogen to adapt to rapidly changing environmental conditions is critical to the establishment and progression of infection by Shigella species; ribo-regulators mediate each of these important processes. This mini review will present the current state of knowledge regarding RNA-mediated regulation in Shigella by detailing the characterization and function of each identified riboregulator in these pathogens. PMID:26858941

  8. 3D annotation and manipulation of medical anatomical structures

    NASA Astrophysics Data System (ADS)

    Vitanovski, Dime; Schaller, Christian; Hahn, Dieter; Daum, Volker; Hornegger, Joachim

    2009-02-01

    Although the medical scanners are rapidly moving towards a three-dimensional paradigm, the manipulation and annotation/labeling of the acquired data is still performed in a standard 2D environment. Editing and annotation of three-dimensional medical structures is currently a complex task and rather time-consuming, as it is carried out in 2D projections of the original object. A major problem in 2D annotation is the depth ambiguity, which requires 3D landmarks to be identified and localized in at least two of the cutting planes. Operating directly in a three-dimensional space enables the implicit consideration of the full 3D local context, which significantly increases accuracy and speed. A three-dimensional environment is as well more natural optimizing the user's comfort and acceptance. The 3D annotation environment requires the three-dimensional manipulation device and display. By means of two novel and advanced technologies, Wii Nintendo Controller and Philips 3D WoWvx display, we define an appropriate 3D annotation tool and a suitable 3D visualization monitor. We define non-coplanar setting of four Infrared LEDs with a known and exact position, which are tracked by the Wii and from which we compute the pose of the device by applying a standard pose estimation algorithm. The novel 3D renderer developed by Philips uses either the Z-value of a 3D volume, or it computes the depth information out of a 2D image, to provide a real 3D experience without having some special glasses. Within this paper we present a new framework for manipulation and annotation of medical landmarks directly in three-dimensional volume.

  9. Automatic annotation of organellar genomes with DOGMA

    SciTech Connect

    Wyman, Stacia; Jansen, Robert K.; Boore, Jeffrey L.

    2004-06-01

    Dual Organellar GenoMe Annotator (DOGMA) automates the annotation of extra-nuclear organellar (chloroplast and animal mitochondrial) genomes. It is a web-based package that allows the use of comparative BLAST searches to identify and annotate genes in a genome. DOGMA presents a list of putative genes to the user in a graphical format for viewing and editing. Annotations are stored on our password-protected server. Complete annotations can be extracted for direct submission to GenBank. Furthermore, intergenic regions of specified length can be extracted, as well the nucleotide sequences and amino acid sequences of the genes.

  10. PCR Detection of Virulence Genes in Yersinia enterocolitica and Yersinia pseudotuberculosis and Investigation of Virulence Gene Distribution

    PubMed Central

    Thoerner, P.; Bin Kingombe, C. I.; Bögli-Stuber, K.; Bissig-Choisat, B.; Wassenaar, T. M.; Frey, J.; Jemmi, T.

    2003-01-01

    PCR-based assays were developed for the detection of plasmid- and chromosome-borne virulence genes in Yersinia enterocolitica and Yersinia pseudotuberculosis, to investigate the distribution of these genes in isolates from various sources. The results of PCR genotyping, based on 5 virulence-associated genes of 140 strains of Y. enterocolitica, were compared to phenotypic tests, such as biotyping and serotyping, and to virulence plasmid-associated properties such as calcium-dependent growth at 37°C and Congo red uptake. The specificity of the PCR results was validated by hybridization. Genotyping data correlated well with biotype data, and most biotypes resulted in (nearly) homogeneous genotypes for the chromosomal virulence genes (ystA, ystB, and ail); however, plasmid-borne genes (yadA and virF) were detected with variable efficiency, due to heterogeneity within the bacterial population for the presence of the virulence plasmid. Of the virulence genes, only ystB was present in biotype 1A; however, within this biotype, pathogenic and apathogenic isolates could not be distinguished based on the detection of virulence genes. Forty Y. pseudotuberculosis isolates were tested by PCR for the presence of inv, yadA, and lcrF. All isolates were inv positive, and 88% of the isolates contained the virulence plasmid genes yadA and lcrF. In conclusion, this study shows that genotyping of Yersinia spp., based on both chromosome- and plasmid-borne virulence genes, is feasible and informative and can provide a rapid and reliable genotypic characterization of field isolates. PMID:12620874

  11. Virulence of two strains of Mycobacterium bovis in cattle following aerosol infection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Over the past two decades, highly virulent strains of Mycobacterium tuberculosis have emerged and spread rapidly in humans, suggesting a selective advantage based upon virulence. A similar scenario has not been described for Mycobacterium bovis infection in cattle (i.e., Bovine Tuberculos...

  12. Oncotator: cancer variant annotation tool.

    PubMed

    Ramos, Alex H; Lichtenstein, Lee; Gupta, Manaswi; Lawrence, Michael S; Pugh, Trevor J; Saksena, Gordon; Meyerson, Matthew; Getz, Gad

    2015-04-01

    Oncotator is a tool for annotating genomic point mutations and short nucleotide insertions/deletions (indels) with variant- and gene-centric information relevant to cancer researchers. This information is drawn from 14 different publicly available resources that have been pooled and indexed, and we provide an extensible framework to add additional data sources. Annotations linked to variants range from basic information, such as gene names and functional classification (e.g. missense), to cancer-specific data from resources such as the Catalogue of Somatic Mutations in Cancer (COSMIC), the Cancer Gene Census, and The Cancer Genome Atlas (TCGA). For local use, Oncotator is freely available as a python module hosted on Github (https://github.com/broadinstitute/oncotator). Furthermore, Oncotator is also available as a web service and web application at http://www.broadinstitute.org/oncotator/.

  13. Virulence of enterococci.

    PubMed Central

    Jett, B D; Huycke, M M; Gilmore, M S

    1994-01-01

    Enterococci are commensal organisms well suited to survival in intestinal and vaginal tracts and the oral cavity. However, as for most bacteria described as causing human disease, enterococci also possess properties that can be ascribed roles in pathogenesis. The natural ability of enterococci to readily acquire, accumulate, and share extrachromosomal elements encoding virulence traits or antibiotic resistance genes lends advantages to their survival under unusual environmental stresses and in part explains their increasing importance as nosocomial pathogens. This review discusses the current understanding of enterococcal virulence relating to (i) adherence to host tissues, (ii) invasion and abscess formation, (iii) factors potentially relevant to modulation of host inflammatory responses, and (iv) potentially toxic secreted products. Aggregation substance, surface carbohydrates, or fibronectin-binding moieties may facilitate adherence to host tissues. Enterococcus faecalis appears to have the capacity to translocate across intact intestinal mucosa in models of antibiotic-induced superinfection. Extracellular toxins such as cytolysin can induce tissue damage as shown in an endophthalmitis model, increase mortality in combination with aggregation substance in an endocarditis model, and cause systemic toxicity in a murine peritonitis model. Finally, lipoteichoic acid, superoxide production, or pheromones and corresponding peptide inhibitors each may modulate local inflammatory reactions. Images PMID:7834601

  14. Cancer Survivorship for Primary Care Annotated Bibliography

    PubMed Central

    Westfall, Matthew Y.; Overholser, Linda; Zittleman, Linda; Westfall, John M.

    2015-01-01

    Long-term cancer survivorship care is a relatively new and rapidly advancing field of research. Increasing cancer survivorship rates have created a huge population of long-term cancer survivors whose cancer-specific needs challenge healthcare infrastructure and highlight a significant deficit of knowledge and guidelines in transitional care from treatment to normalcy/prolonged survivorship. As the paradigm of cancer care has changed from a fixation on the curative to the maintenance on long-term overall quality of life, so to, has the delineation of responsibility between oncologists and primary care physicians (PCPs). As more patients enjoy long-term survival, PCPs play a more comprehensive role in cancer care following acute treatment. To this end, this annotated bibliography was written to provide PCPs and other readers with an up-to-date and robust base of knowledge on long-term cancer survivorship, including definitions and epidemiological information as well as specific considerations and recommendations on physical, psychosocial, sexual, and comorbidity needs of survivors. Additionally, significant information is included on survivorship care, specifically Survivorship Care Plans (SPCs) and their evolution, utilization by oncologists and PCPs, and current gaps, as well as an introduction to patient navigation programs. Given rapid advancements in cancer research, this bibliography is meant to serve as current baseline reference outlining the state of the science. PMID:26114091

  15. Investigating the ?Trojan Horse? Mechanism of Yersinia pestis Virulence

    SciTech Connect

    McCutchen-Maloney, S L; Fitch, J P

    2005-02-08

    Yersinia pestis, the etiological agent of plague, is a Gram-negative, highly communicable, enteric bacterium that has been responsible for three historic plague pandemics. Currently, several thousand cases of plague are reported worldwide annually, and Y. pestis remains a considerable threat from a biodefense perspective. Y. pestis infection can manifest in three forms: bubonic, septicemic, and pneumonic plague. Of these three forms, pneumonic plague has the highest fatality rate ({approx}100% if left untreated), the shortest intervention time ({approx}24 hours), and is highly contagious. Currently, there are no rapid, widely available vaccines for plague and though plague may be treated with antibiotics, the emergence of both naturally occurring and potentially engineered antibiotic resistant strains makes the search for more effective therapies and vaccines for plague of pressing concern. The virulence mechanism of this deadly bacterium involves induction of a Type III secretion system, a syringe-like apparatus that facilitates the injection of virulence factors, termed Yersinia outer membrane proteins (Yops), into the host cell. These virulence factors inhibit phagocytosis and cytokine secretion, and trigger apoptosis of the host cell. Y. pestis virulence factors and the Type III secretion system are induced thermally, when the bacterium enters the mammalian host from the flea vector, and through host cell contact (or conditions of low Ca{sup 2+} in vitro). Apart from the temperature increase from 26 C to 37 C and host cell contact (or low Ca{sup 2+} conditions), other molecular mechanisms that influence virulence induction in Y. pestis are largely uncharacterized. This project focused on characterizing two novel mechanisms that regulate virulence factor induction in Y. pestis, immunoglobulin G (IgG) binding and quorum sensing, using a real-time reporter system to monitor induction of virulence. Incorporating a better understanding of the mechanisms of virulence

  16. Toward an Improved Laboratory Definition of Listeria monocytogenes Virulence

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Listeria monocytogenes is an opportunistic foodborne pathogen that encompasses a diversity of strains with varied virulence. The ability to rapidly determine the pathogenic potential of L. monocytogenes strains is integral to the control and prevention campaign against listeriosis. Early methods for...

  17. Targeting virulence not viability in the search for future antibacterials

    PubMed Central

    Heras, Begoña; Scanlon, Martin J; Martin, Jennifer L

    2015-01-01

    New antibacterials need new approaches to overcome the problem of rapid antibiotic resistance. Here we review the development of potential new antibacterial drugs that do not kill bacteria or inhibit their growth, but combat disease instead by targeting bacterial virulence. PMID:24552512

  18. Genome Sequences of Salmonella enterica Serovar Typhimurium, Choleraesuis, Dublin, and Gallinarum Strains of Well- Defined Virulence in Food-Producing Animals ▿

    PubMed Central

    Richardson, Emily J.; Limaye, Bhakti; Inamdar, Harshal; Datta, Avik; Manjari, K. Sunitha; Pullinger, Gillian D.; Thomson, Nicholas R.; Joshi, Rajendra R.; Watson, Michael; Stevens, Mark P.

    2011-01-01

    Salmonella enterica is an animal and zoonotic pathogen of worldwide importance and may be classified into serovars differing in virulence and host range. We sequenced and annotated the genomes of serovar Typhimurium, Choleraesuis, Dublin, and Gallinarum strains of defined virulence in each of three food-producing animal hosts. This provides valuable measures of intraserovar diversity and opportunities to formally link genotypes to phenotypes in target animals. PMID:21478351

  19. Virulence Factors of Erwinia amylovora: A Review.

    PubMed

    Piqué, Núria; Miñana-Galbis, David; Merino, Susana; Tomás, Juan M

    2015-06-05

    Erwinia amylovora, a Gram negative bacteria of the Enterobacteriaceae family, is the causal agent of fire blight, a devastating plant disease affecting a wide range of host species within Rosaceae and a major global threat to commercial apple and pear production. Among the limited number of control options currently available, prophylactic application of antibiotics during the bloom period appears the most effective. Pathogen cells enter plants through the nectarthodes of flowers and other natural openings, such as wounds, and are capable of rapid movement within plants and the establishment of systemic infections. Many virulence determinants of E. amylovora have been characterized, including the Type III secretion system (T3SS), the exopolysaccharide (EPS) amylovoran, biofilm formation, and motility. To successfully establish an infection, E. amylovora uses a complex regulatory network to sense the relevant environmental signals and coordinate the expression of early and late stage virulence factors involving two component signal transduction systems, bis-(3'-5')-cyclic di-GMP (c-di-GMP) and quorum sensing. The LPS biosynthetic gene cluster is one of the relatively few genetic differences observed between Rubus- and Spiraeoideae-infecting genotypes of E. amylovora. Other differential factors, such as the presence and composition of an integrative conjugative element associated with the Hrp T3SS (hrp genes encoding the T3SS apparatus), have been recently described. In the present review, we present the recent findings on virulence factors research, focusing on their role in bacterial pathogenesis and indicating other virulence factors that deserve future research to characterize them.

  20. Virulence Factors of Erwinia amylovora: A Review

    PubMed Central

    Piqué, Núria; Miñana-Galbis, David; Merino, Susana; Tomás, Juan M.

    2015-01-01

    Erwinia amylovora, a Gram negative bacteria of the Enterobacteriaceae family, is the causal agent of fire blight, a devastating plant disease affecting a wide range of host species within Rosaceae and a major global threat to commercial apple and pear production. Among the limited number of control options currently available, prophylactic application of antibiotics during the bloom period appears the most effective. Pathogen cells enter plants through the nectarthodes of flowers and other natural openings, such as wounds, and are capable of rapid movement within plants and the establishment of systemic infections. Many virulence determinants of E. amylovora have been characterized, including the Type III secretion system (T3SS), the exopolysaccharide (EPS) amylovoran, biofilm formation, and motility. To successfully establish an infection, E. amylovora uses a complex regulatory network to sense the relevant environmental signals and coordinate the expression of early and late stage virulence factors involving two component signal transduction systems, bis-(3′-5′)-cyclic di-GMP (c-di-GMP) and quorum sensing. The LPS biosynthetic gene cluster is one of the relatively few genetic differences observed between Rubus- and Spiraeoideae-infecting genotypes of E. amylovora. Other differential factors, such as the presence and composition of an integrative conjugative element associated with the Hrp T3SS (hrp genes encoding the T3SS apparatus), have been recently described. In the present review, we present the recent findings on virulence factors research, focusing on their role in bacterial pathogenesis and indicating other virulence factors that deserve future research to characterize them. PMID:26057748

  1. TriAnnot: A Versatile and High Performance Pipeline for the Automated Annotation of Plant Genomes

    PubMed Central

    Leroy, Philippe; Guilhot, Nicolas; Sakai, Hiroaki; Bernard, Aurélien; Choulet, Frédéric; Theil, Sébastien; Reboux, Sébastien; Amano, Naoki; Flutre, Timothée; Pelegrin, Céline; Ohyanagi, Hajime; Seidel, Michael; Giacomoni, Franck; Reichstadt, Mathieu; Alaux, Michael; Gicquello, Emmanuelle; Legeai, Fabrice; Cerutti, Lorenzo; Numa, Hisataka; Tanaka, Tsuyoshi; Mayer, Klaus; Itoh, Takeshi; Quesneville, Hadi; Feuillet, Catherine

    2012-01-01

    In support of the international effort to obtain a reference sequence of the bread wheat genome and to provide plant communities dealing with large and complex genomes with a versatile, easy-to-use online automated tool for annotation, we have developed the TriAnnot pipeline. Its modular architecture allows for the annotation and masking of transposable elements, the structural, and functional annotation of protein-coding genes with an evidence-based quality indexing, and the identification of conserved non-coding sequences and molecular markers. The TriAnnot pipeline is parallelized on a 712 CPU computing cluster that can run a 1-Gb sequence annotation in less than 5 days. It is accessible through a web interface for small scale analyses or through a server for large scale annotations. The performance of TriAnnot was evaluated in terms of sensitivity, specificity, and general fitness using curated reference sequence sets from rice and wheat. In less than 8 h, TriAnnot was able to predict more than 83% of the 3,748 CDS from rice chromosome 1 with a fitness of 67.4%. On a set of 12 reference Mb-sized contigs from wheat chromosome 3B, TriAnnot predicted and annotated 93.3% of the genes among which 54% were perfectly identified in accordance with the reference annotation. It also allowed the curation of 12 genes based on new biological evidences, increasing the percentage of perfect gene prediction to 63%. TriAnnot systematically showed a higher fitness than other annotation pipelines that are not improved for wheat. As it is easily adaptable to the annotation of other plant genomes, TriAnnot should become a useful resource for the annotation of large and complex genomes in the future. PMID:22645565

  2. Curation, integration and visualization of bacterial virulence factors in PATRIC

    PubMed Central

    Mao, Chunhong; Abraham, David; Wattam, Alice R.; Wilson, Meredith J.C.; Shukla, Maulik; Yoo, Hyun Seung; Sobral, Bruno W.

    2015-01-01

    Motivation: We’ve developed a highly curated bacterial virulence factor (VF) library in PATRIC (Pathosystems Resource Integration Center, www.patricbrc.org) to support infectious disease research. Although several VF databases are available, there is still a need to incorporate new knowledge found in published experimental evidence and integrate these data with other information known for these specific VF genes, including genomic and other omics data. This integration supports the identification of VFs, comparative studies and hypothesis generation, which facilitates the understanding of virulence and pathogenicity. Results: We have manually curated VFs from six prioritized NIAID (National Institute of Allergy and Infectious Diseases) category A–C bacterial pathogen genera, Mycobacterium, Salmonella, Escherichia, Shigella, Listeria and Bartonella, using published literature. This curated information on virulence has been integrated with data from genomic functional annotations, trancriptomic experiments, protein–protein interactions and disease information already present in PATRIC. Such integration gives researchers access to a broad array of information about these individual genes, and also to a suite of tools to perform comparative genomic and transcriptomics analysis that are available at PATRIC. Availability and implementation: All tools and data are freely available at PATRIC (http://patricbrc.org). Contact: cmao@vbi.vt.edu. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25273106

  3. Prevalence and significance of plasmid maintenance functions in the virulence plasmids of pathogenic bacteria.

    PubMed

    Sengupta, Manjistha; Austin, Stuart

    2011-07-01

    Virulence functions of pathogenic bacteria are often encoded on large extrachromosomal plasmids. These plasmids are maintained at low copy number to reduce the metabolic burden on their host. Low-copy-number plasmids risk loss during cell division. This is countered by plasmid-encoded systems that ensure that each cell receives at least one plasmid copy. Plasmid replication and recombination can produce plasmid multimers that hinder plasmid segregation. These are removed by multimer resolution systems. Equitable distribution of the resulting monomers to daughter cells is ensured by plasmid partition systems that actively segregate plasmid copies to daughter cells in a process akin to mitosis in higher organisms. Any plasmid-free cells that still arise due to occasional failures of replication, multimer resolution, or partition are eliminated by plasmid-encoded postsegregational killing systems. Here we argue that all of these three systems are essential for the stable maintenance of large low-copy-number plasmids. Thus, they should be found on all large virulence plasmids. Where available, well-annotated sequences of virulence plasmids confirm this. Indeed, virulence plasmids often appear to contain more than one example conforming to each of the three system classes. Since these systems are essential for virulence, they can be regarded as ubiquitous virulence factors. As such, they should be informative in the search for new antibacterial agents and drug targets.

  4. The Staphylococcus aureus RNome and Its Commitment to Virulence

    PubMed Central

    Felden, Brice; Vandenesch, François; Bouloc, Philippe; Romby, Pascale

    2011-01-01

    Staphylococcus aureus is a major human pathogen causing a wide spectrum of nosocomial and community-associated infections with high morbidity and mortality. S. aureus generates a large number of virulence factors whose timing and expression levels are precisely tuned by regulatory proteins and RNAs. The aptitude of bacteria to use RNAs to rapidly modify gene expression, including virulence factors in response to stress or environmental changes, and to survive in a host is an evolving concept. Here, we focus on the recently inventoried S. aureus regulatory RNAs, with emphasis on those with identified functions, two of which are directly involved in pathogenicity. PMID:21423670

  5. MAGPIE/EGRET Annotation of the 2.9-Mb Drosophila melanogaster Adh Region

    PubMed Central

    Gaasterland, Terry; Sczyrba, Alexander; Thomas, Elizabeth; Aytekin-Kurban, Gulriz; Gordon, Paul; Sensen, Christoph W.

    2000-01-01

    Our challenge in annotating the 2.91-Mb Adh region of the Drosophila melanogaster genome was to identify genetic and genomic features automatically, completely, and precisely within a 6-week period. To do so, we augmented the MAGPIE microbial genome annotation system to handle eukaryotic genomic sequence data. The new configuration required the integration of eukaryotic gene-finding tools and DNA repeat tools into the automatic data collection module. It also required us to define in MAGPIE new strategies to combine data about eukaryotic exon predictions with functional data to refine the exon predictions. At the heart of the resulting new eukaryotic genome annotation system is a reverse comparison of public protein and complementary DNA sequences against the input genome to identify missing exons and to refine exon boundaries. The software modules that add eukaryotic genome annotation capability to MAGPIE are available as EGRET (Eukaryotic Genome Rapid Evaluation Tool). PMID:10779489

  6. The quick and the deadly: growth vs virulence in a seed bank pathogen.

    PubMed

    Meyer, Susan E; Stewart, Thomas E; Clement, Suzette

    2010-07-01

    *We studied the relationship between virulence (ability to kill nondormant Bromus tectorum seeds) and mycelial growth index in the necrotrophic seed pathogen Pyrenophora semeniperda. Seed pathosystems involving necrotrophs differ from those commonly treated in traditional evolution-of-virulence models in that host death increases pathogen fitness by preventing germination, thereby increasing available resources. Because fast-germinating, nondormant B. tectorum seeds commonly escape mortality, we expected virulence to be positively correlated with mycelial growth index. *We performed seed inoculations using conidia from 78 pathogen isolates and scored subsequent mortality. For a subset of 40 of these isolates, representing a range of virulence phenotypes, we measured mycelial growth index. *Virulence varied over a wide range (3-43% seed mortality) and was significantly negatively correlated with mycelial growth index (R(2) = 0.632). More virulent isolates grew more slowly than less virulent isolates. *We concluded that there is an apparent tradeoff between virulence and growth in this pathogen, probably because the production of toxins necessary for necrotrophic pathogenesis competes with metabolic processes associated with growth. Variation in both virulence and growth rate in this pathosystem may be maintained in part by seasonal variation in the relative abundance of rapidly germinating vs dormant host seeds available to the pathogen.

  7. New in protein structure and function annotation: hotspots, single nucleotide polymorphisms and the 'Deep Web'.

    PubMed

    Bromberg, Yana; Yachdav, Guy; Ofran, Yanay; Schneider, Reinhard; Rost, Burkhard

    2009-05-01

    The rapidly increasing quantity of protein sequence data continues to widen the gap between available sequences and annotations. Comparative modeling suggests some aspects of the 3D structures of approximately half of all known proteins; homology- and network-based inferences annotate some aspect of function for a similar fraction of the proteome. For most known protein sequences, however, there is detailed knowledge about neither their function nor their structure. Comprehensive efforts towards the expert curation of sequence annotations have failed to meet the demand of the rapidly increasing number of available sequences. Only the automated prediction of protein function in the absence of homology can close the gap between available sequences and annotations in the foreseeable future. This review focuses on two novel methods for automated annotation, and briefly presents an outlook on how modern web software may revolutionize the field of protein sequence annotation. First, predictions of protein binding sites and functional hotspots, and the evolution of these into the most successful type of prediction of protein function from sequence will be discussed. Second, a new tool, comprehensive in silico mutagenesis, which contributes important novel predictions of function and at the same time prepares for the onset of the next sequencing revolution, will be described. While these two new sub-fields of protein prediction represent the breakthroughs that have been achieved methodologically, it will then be argued that a different development might further change the way biomedical researchers benefit from annotations: modern web software can connect the worldwide web in any browser with the 'Deep Web' (ie, proprietary data resources). The availability of this direct connection, and the resulting access to a wealth of data, may impact drug discovery and development more than any existing method that contributes to protein annotation.

  8. Annotated Catalog of Bilingual Vocational Training Materials.

    ERIC Educational Resources Information Center

    Miranda (L.) and Associates, Bethesda, MD.

    This catalog contains annotations for 170 bilingual vocational training materials. Most of the materials are written in English, but materials written in 13 source languages and directed toward speakers of 17 target languages are provided. Annotations are provided for the following different types of documents: administrative, assessment and…

  9. Re-Entry Women: Annotated Bibliography.

    ERIC Educational Resources Information Center

    Porterfield, Patricia Lamb

    This is an annotated bibliography on topics related to reentry women. Topic categories include general and administrative issues, programs and services, needs assessment and evaluation, counseling and personal development, career planning and job placement, curriculum and instruction, admissions and recruiting, and financial aid. Annotations cite…

  10. Citizen Participation in Education: Annotated Bibliography.

    ERIC Educational Resources Information Center

    Davies, Don

    The emphasis in this annotated bibliography is citizen participation in education in the areas of decision making, policy development, and school governance. The focus is on the public school and school system rather than on private and parochial schools. One hundred fifty books, parts of books, and published reports are annotated, together with…

  11. Public School Choice: A Selected Annotated Bibliography.

    ERIC Educational Resources Information Center

    Crohn, Leslie; Hansen, Kenneth H.

    This annotated bibliography offers a sampling of a wide variety of viewpoints on the topic of school choice. Fourteen references selected for annotation, ranging from a 3-page journal article to a 266-page book, are listed at the beginning of the bibliography. Among the viewpoints that different authors represent are the following: (1) unlimited…

  12. Virulence Potential of Fusogenic Orthoreoviruses

    PubMed Central

    Cheng, Peter K.C.; Lai, Mary Y.Y.; Leung, Peter C.K.; Wong, Kitty K.Y.; Lee, W.Y.; Lim, Wilina W.L.

    2012-01-01

    Several severe respiratory virus infections that have emerged during the past decade originated in animals, including bats. In Indonesia, exposure to bats has been associated with increased risk of acquiring orthoreovirus infection. Although orthoreovirus infections are mild and self-limiting, we explored their potential for evolution into a more virulent form. We used conventional virus culture, electron microscopy, and molecular sequencing to isolate and identify orthoreoviruses from 3 patients in whom respiratory tract infection developed after travel to Indonesia. Virus characterization by plaque-reduction neutralization testing showed antigenic similarity, but sequencing of the small segment genes suggested virus reassortment, which could lead to increased virulence. Bats as a reservoir might contribute to virus evolution and genetic diversity, giving orthoreoviruses the potential to become more virulent. Evolution of this virus should be closely monitored so that prevention and control measures can be taken should it become more virulent. PMID:22608100

  13. Sophia: A Expedient UMLS Concept Extraction Annotator

    PubMed Central

    Divita, Guy; Zeng, Qing T; Gundlapalli, Adi V.; Duvall, Scott; Nebeker, Jonathan; Samore, Matthew H.

    2014-01-01

    An opportunity exists for meaningful concept extraction and indexing from large corpora of clinical notes in the Veterans Affairs (VA) electronic medical record. Currently available tools such as MetaMap, cTAKES and HITex do not scale up to address this big data need. Sophia, a rapid UMLS concept extraction annotator was developed to fulfill a mandate and address extraction where high throughput is needed while preserving performance. We report on the development, testing and benchmarking of Sophia against MetaMap and cTAKEs. Sophia demonstrated improved performance on recall as compared to cTAKES and MetaMap (0.71 vs 0.66 and 0.38). The overall f-score was similar to cTAKES and an improvement over MetaMap (0.53 vs 0.57 and 0.43). With regard to speed of processing records, we noted Sophia to be several fold faster than cTAKES and the scaled-out MetaMap service. Sophia offers a viable alternative for high-throughput information extraction tasks. PMID:25954351

  14. Assisted annotation of medical free text using RapTAT

    PubMed Central

    Gobbel, Glenn T; Garvin, Jennifer; Reeves, Ruth; Cronin, Robert M; Heavirland, Julia; Williams, Jenifer; Weaver, Allison; Jayaramaraja, Shrimalini; Giuse, Dario; Speroff, Theodore; Brown, Steven H; Xu, Hua; Matheny, Michael E

    2014-01-01

    Objective To determine whether assisted annotation using interactive training can reduce the time required to annotate a clinical document corpus without introducing bias. Materials and methods A tool, RapTAT, was designed to assist annotation by iteratively pre-annotating probable phrases of interest within a document, presenting the annotations to a reviewer for correction, and then using the corrected annotations for further machine learning-based training before pre-annotating subsequent documents. Annotators reviewed 404 clinical notes either manually or using RapTAT assistance for concepts related to quality of care during heart failure treatment. Notes were divided into 20 batches of 19–21 documents for iterative annotation and training. Results The number of correct RapTAT pre-annotations increased significantly and annotation time per batch decreased by ∼50% over the course of annotation. Annotation rate increased from batch to batch for assisted but not manual reviewers. Pre-annotation F-measure increased from 0.5 to 0.6 to >0.80 (relative to both assisted reviewer and reference annotations) over the first three batches and more slowly thereafter. Overall inter-annotator agreement was significantly higher between RapTAT-assisted reviewers (0.89) than between manual reviewers (0.85). Discussion The tool reduced workload by decreasing the number of annotations needing to be added and helping reviewers to annotate at an increased rate. Agreement between the pre-annotations and reference standard, and agreement between the pre-annotations and assisted annotations, were similar throughout the annotation process, which suggests that pre-annotation did not introduce bias. Conclusions Pre-annotations generated by a tool capable of interactive training can reduce the time required to create an annotated document corpus by up to 50%. PMID:24431336

  15. Identification and functional annotation of mycobacterial septum formation genes using cell division mutants of Escherichia coli.

    PubMed

    Gaiwala Sharma, Sujata S; Kishore, Vimal; Raghunand, Tirumalai R

    2016-01-01

    The major virulence trait of Mycobacterium tuberculosis is its ability to enter a latent state in the face of robust host immunity. Clues to the molecular basis of latency can emerge from understanding the mechanism of cell division, beginning with identification of proteins involved in this process. Using complementation of Escherichia coli mutants, we functionally annotated M. tuberculosis and Mycobacterium smegmatis homologs of divisome proteins FtsW and AmiC. Our results demonstrate that E. coli can be used as a surrogate model to discover mycobacterial cell division genes, and should prove invaluable in delineating the mechanisms of this fundamental process in mycobacteria.

  16. Francisella tularensis novicida proteomic and transcriptomic data integration and annotation based on semantic web technologies

    PubMed Central

    Anwar, Nadia; Hunt, Ela

    2009-01-01

    Background This paper summarises the lessons and experiences gained from a case study of the application of semantic web technologies to the integration of data from the bacterial species Francisella tularensis novicida (Fn). Fn data sources are disparate and heterogeneous, as multiple laboratories across the world, using multiple technologies, perform experiments to understand the mechanism of virulence. It is hard to integrate these data sources in a flexible manner that allows new experimental data to be added and compared when required. Results Public domain data sources were combined in RDF. Using this connected graph of database cross references, we extended the annotations of an experimental data set by superimposing onto it the annotation graph. Identifiers used in the experimental data automatically resolved and the data acquired annotations in the rest of the RDF graph. This happened without the expensive manual annotation that would normally be required to produce these links. This graph of resolved identifiers was then used to combine two experimental data sets, a proteomics experiment and a transcriptomic experiment studying the mechanism of virulence through the comparison of wildtype Fn with an avirulent mutant strain. Conclusion We produced a graph of Fn cross references which enabled the combination of two experimental datasets. Through combination of these data we are able to perform queries that compare the results of the two experiments. We found that data are easily combined in RDF and that experimental results are easily compared when the data are integrated. We conclude that semantic data integration offers a convenient, simple and flexible solution to the integration of published and unpublished experimental data. PMID:19796400

  17. Genes involved in virulence of the entomopathogenic fungus Beauveria bassiana.

    PubMed

    Valero-Jiménez, Claudio A; Wiegers, Harm; Zwaan, Bas J; Koenraadt, Constantianus J M; van Kan, Jan A L

    2016-01-01

    Pest insects cause severe damage to global crop production and pose a threat to human health by transmitting diseases. Traditionally, chemical pesticides (insecticides) have been used to control such pests and have proven to be effective only for a limited amount of time because of the rapid spread of genetic insecticide resistance. The basis of this resistance is mostly caused by (co)dominant mutations in single genes, which explains why insecticide use alone is an unsustainable solution. Therefore, robust solutions for insect pest control need to be sought in alternative methods such as biological control agents for which single-gene resistance is less likely to evolve. The entomopathogenic fungus Beauveria bassiana has shown potential as a biological control agent of insects, and insight into the mechanisms of virulence is essential to show the robustness of its use. With the recent availability of the whole genome sequence of B. bassiana, progress in understanding the genetics that constitute virulence toward insects can be made more quickly. In this review we divide the infection process into distinct steps and provide an overview of what is currently known about genes and mechanisms influencing virulence in B. bassiana. We also discuss the need for novel strategies and experimental methods to better understand the infection mechanisms deployed by entomopathogenic fungi. Such knowledge can help improve biocontrol agents, not only by selecting the most virulent genotypes, but also by selecting the genotypes that use combinations of virulence mechanisms for which resistance in the insect host is least likely to develop.

  18. Concept annotation in the CRAFT corpus

    PubMed Central

    2012-01-01

    Background Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. Results This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. Conclusions As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http

  19. Facilitating functional annotation of chicken microarray data

    PubMed Central

    2009-01-01

    Background Modeling results from chicken microarray studies is challenging for researchers due to little functional annotation associated with these arrays. The Affymetrix GenChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO). However the GO annotation data presented by Affymetrix is incomplete, for example, they do not show references linked to manually annotated functions. In addition, there is no tool that facilitates microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers amount of time in searching multiple GO databases for functional information. Results We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets we developed an Array GO Mapper (AGOM) tool to help researchers to quickly retrieve corresponding functional information for their dataset. Conclusion Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using AGOM tool. The disease, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and will be updated on regular

  20. Comparative Genome Analysis of Two Isolates of the Fish Pathogen Piscirickettsia salmonis from Different Hosts Reveals Major Differences in Virulence-Associated Secretion Systems

    PubMed Central

    Bohle, Harry; Henríquez, Patricio; Grothusen, Horst; Navas, Esteban; Sandoval, Alvaro; Bustamante, Fernando; Bustos, Patricio

    2014-01-01

    Outbreaks caused by Piscirickettsia salmonis are one of the major threats to the sustainability of the Chilean salmon industry. We report here the annotated draft genomes of two P. salmonis isolates recovered from different salmonid species. A comparative analysis showed that the number of virulence-associated secretion systems constitutes a main genomic difference. PMID:25523762

  1. The Causes and Consequences of Changes in Virulence following Pathogen Host Shifts

    PubMed Central

    Longdon, Ben; Hadfield, Jarrod D.; Day, Jonathan P.; Smith, Sophia C. L.; McGonigle, John E.; Cogni, Rodrigo; Cao, Chuan; Jiggins, Francis M.

    2015-01-01

    Emerging infectious diseases are often the result of a host shift, where the pathogen originates from a different host species. Virulence—the harm a pathogen does to its host—can be extremely high following a host shift (for example Ebola, HIV, and SARs), while other host shifts may go undetected as they cause few symptoms in the new host. Here we examine how virulence varies across host species by carrying out a large cross infection experiment using 48 species of Drosophilidae and an RNA virus. Host shifts resulted in dramatic variation in virulence, with benign infections in some species and rapid death in others. The change in virulence was highly predictable from the host phylogeny, with hosts clustering together in distinct clades displaying high or low virulence. High levels of virulence are associated with high viral loads, and this may determine the transmission rate of the virus. PMID:25774803

  2. Evaluation of mycobacterial virulence using rabbit skin liquefaction model.

    PubMed

    Zhang, Guoping; Zhu, Bingdong; Shi, Wanliang; Wang, Mingzhu; Da, Zejiao; Zhang, Ying

    2010-01-01

    Liquefaction is an important pathological process that can subsequently lead to cavitation where large numbers of bacilli can be coughed up which in turn causes spread of tuberculosis in humans. Current animal models to study the liquefaction process and to evaluate virulence of mycobacteria are tedious. In this study, we evaluated a rabbit skin model as a rapid model for liquefaction and virulence assessment using M. bovis BCG, M. tuberculosis avirulent strain H37Ra, M. smegmatis, and the H37Ra strains complemented with selected genes from virulent M. tuberculosis strain H37Rv. We found that with prime and/or boosting immunization, all of these live bacteria at enough high number could induce liquefaction, and the boosting induced stronger liquefaction and more severe lesions in shorter time compared with the prime injection. The skin lesions caused by high dose live BCG (5×10 (6) ) were the most severe followed by live M. tuberculosis H37Ra with M. smegmatis being the least pathogenic. It is of interest to note that none of the above heat-killed mycobacteria induced liquefaction. When H37Ra was complemented with certain wild type genes of H37Rv, some of the complemented H37Ra strains produced more severe skin lesions than H37Ra. These results suggest that the rabbit skin liquefaction model can be a more visual, convenient, rapid and useful model to evaluate virulence of different mycobacteria and to study the mechanisms of liquefaction.

  3. Making web annotations persistent over time

    SciTech Connect

    Sanderson, Robert; Van De Sompel, Herbert

    2010-01-01

    As Digital Libraries (DL) become more aligned with the web architecture, their functional components need to be fundamentally rethought in terms of URIs and HTTP. Annotation, a core scholarly activity enabled by many DL solutions, exhibits a clearly unacceptable characteristic when existing models are applied to the web: due to the representations of web resources changing over time, an annotation made about a web resource today may no longer be relevant to the representation that is served from that same resource tomorrow. We assume the existence of archived versions of resources, and combine the temporal features of the emerging Open Annotation data model with the capability offered by the Memento framework that allows seamless navigation from the URI of a resource to archived versions of that resource, and arrive at a solution that provides guarantees regarding the persistence of web annotations over time. More specifically, we provide theoretical solutions and proof-of-concept experimental evaluations for two problems: reconstructing an existing annotation so that the correct archived version is displayed for all resources involved in the annotation, and retrieving all annotations that involve a given archived version of a web resource.

  4. Annotating user-defined abstractions for optimization

    SciTech Connect

    Quinlan, D; Schordan, M; Vuduc, R; Yi, Q

    2005-12-05

    This paper discusses the features of an annotation language that we believe to be essential for optimizing user-defined abstractions. These features should capture semantics of function, data, and object-oriented abstractions, express abstraction equivalence (e.g., a class represents an array abstraction), and permit extension of traditional compiler optimizations to user-defined abstractions. Our future work will include developing a comprehensive annotation language for describing the semantics of general object-oriented abstractions, as well as automatically verifying and inferring the annotated semantics.

  5. Automated Knowledge Annotation for Dynamic Collaborative Environments

    SciTech Connect

    Cowell, Andrew J.; Gregory, Michelle L.; Marshall, Eric J.; McGrath, Liam R.

    2009-05-19

    This paper describes the Knowledge Encapsulation Framework (KEF), a suite of tools to enable automated knowledge annotation for modeling and simulation projects. This framework can be used to capture evidence (e.g., facts extracted from journal articles and government reports), discover new evidence (from similar peer-reviewed material as well as social media), enable discussions surrounding domain-specific topics and provide automatically generated semantic annotations for improved corpus investigation. The current KEF implementation is presented within a wiki environment, providing a simple but powerful collaborative space for team members to review, annotate, discuss and align evidence with their modeling frameworks.

  6. BioBuilder as a database development and functional annotation platform for proteins

    PubMed Central

    Navarro, J Daniel; Talreja, Naveen; Peri, Suraj; Vrushabendra, BM; Rashmi, BP; Padma, N; Surendranath, Vineeth; Jonnalagadda, Chandra Kiran; Kousthub, PS; Deshpande, Nandan; Shanker, K; Pandey, Akhilesh

    2004-01-01

    Background The explosion in biological information creates the need for databases that are easy to develop, easy to maintain and can be easily manipulated by annotators who are most likely to be biologists. However, deployment of scalable and extensible databases is not an easy task and generally requires substantial expertise in database development. Results BioBuilder is a Zope-based software tool that was developed to facilitate intuitive creation of protein databases. Protein data can be entered and annotated through web forms along with the flexibility to add customized annotation features to protein entries. A built-in review system permits a global team of scientists to coordinate their annotation efforts. We have already used BioBuilder to develop Human Protein Reference Database , a comprehensive annotated repository of the human proteome. The data can be exported in the extensible markup language (XML) format, which is rapidly becoming as the standard format for data exchange. Conclusions As the proteomic data for several organisms begins to accumulate, BioBuilder will prove to be an invaluable platform for functional annotation and development of customizable protein centric databases. BioBuilder is open source and is available under the terms of LGPL. PMID:15099404

  7. Computer-assisted annotation of murine Sertoli cell small RNA transcriptome.

    PubMed

    Ortogero, Nicole; Hennig, Grant W; Langille, Chad; Ro, Seungil; McCarrey, John R; Yan, Wei

    2013-01-01

    Mammalian genomes encode a large number of small noncoding RNAs (sncRNAs) that play regulatory roles during development and adulthood by affecting gene expression. Several sncRNA species, including microRNAs (miRNAs), piwi-interacting RNAs (piRNAs), endogenous small interfering RNAs (endo-siRNAs), and small nucleolar RNAs (snoRNAs), are abundantly expressed in the testis and required for normal testicular development and spermatogenesis. To evaluate global changes in sncRNA expression, the next-generation sequencing (NGS)-based sncRNA transcriptomic analysis has become routine, because it allows rapid determination of the small RNA transcriptome of a particular testicular cell type. However, annotation of small RNA NGS reads can be challenging due to the volume of reads obtained, which is usually in the millions. Therefore, we developed a computer-assisted sncRNA annotation protocol that could identify not only known sncRNAs but also previously uncharacterized ones. Using this protocol, we annotated NGS reads of a Sertoli cell sncRNA library, and we report to our knowledge the first comprehensive annotation of the sncRNA transcriptome of immature murine Sertoli cells. Moreover, the computer-assisted sncRNA annotation pipeline that we report is applicable for annotating NGS reads derived from other cell types and/or sequencing platforms.

  8. Dengue virus virulence and transmission determinants.

    PubMed

    Rico-Hesse, R

    2010-01-01

    The mechanisms of dengue virus (DENV) pathogenesis are little understood because we have no models of disease; only humans develop symptoms (dengue fever, DF, or dengue hemorrhagic fever, DHF) and research has been limited to studies involving patients. DENV is very diverse: there are four antigenic groups (serotypes) and three to five genetic groups (genotypes) within each serotype. Thus, it has been difficult to evaluate the relative virulence or transmissibility of each DENV genotype; both of these factors are important determinants of epidemiology and their measurement is complex because the natural cycle of this disease involves human-mosquito-human transmission. Although epidemiological and evolutionary studies have pointed to viral factors in determining disease outcome, only recently developed models could prove the importance of specific viral genotypes in causing severe epidemics and their potential to spread to other continents. These new models involve infection of primary human cell cultures, "humanized" mice and field-collected mosquitoes; also, new mathematical models can estimate the impact of viral replication, human immunity and mosquito transmission on epidemic behavior. DENV evolution does not seem to be rapid and the transmission and dispersal of stable, replication-fit genotypes has been more important in the causation of more severe epidemics. Controversy regarding viral determinants of DENV pathogenesis and epidemiology will continue until virulence and transmissibility can be measured under various conditions.

  9. ISsaga is an ensemble of web-based methods for high throughput identification and semi-automatic annotation of insertion sequences in prokaryotic genomes.

    PubMed

    Varani, Alessandro M; Siguier, Patricia; Gourbeyre, Edith; Charneau, Vincent; Chandler, Mick

    2011-01-01

    Insertion sequences (ISs) play a key role in prokaryotic genome evolution but are seldom well annotated. We describe a web application pipeline, ISsaga (http://issaga.biotoul.fr/ISsaga/issaga_index.php), that provides computational tools and methods for high-quality IS annotation. It uses established ISfinder annotation standards and permits rapid processing of single or multiple prokaryote genomes. ISsaga provides general prediction and annotation tools, information on genome context of individual ISs and a graphical overview of IS distribution around the genome of interest.

  10. Do pathogens become more virulent as they spread? Evidence from the amphibian declines in Central America.

    PubMed

    Phillips, Ben L; Puschendorf, Robert

    2013-09-01

    The virulence of a pathogen can vary strongly through time. While cyclical variation in virulence is regularly observed, directional shifts in virulence are less commonly observed and are typically associated with decreasing virulence of biological control agents through coevolution. It is increasingly appreciated, however, that spatial effects can lead to evolutionary trajectories that differ from standard expectations. One such possibility is that, as a pathogen spreads through a naive host population, its virulence increases on the invasion front. In Central America, there is compelling evidence for the recent spread of pathogenic Batrachochytrium dendrobatidis (Bd) and for its strong impact on amphibian populations. Here, we re-examine data on Bd prevalence and amphibian population decline across 13 sites from southern Mexico through Central America, and show that, in the initial phases of the Bd invasion, amphibian population decline lagged approximately 9 years behind the arrival of the pathogen, but that this lag diminished markedly over time. In total, our analysis suggests an increase in Bd virulence as it spread southwards, a pattern consistent with rapid evolution of increased virulence on Bd's invading front. The impact of Bd on amphibians might therefore be driven by rapid evolution in addition to more proximate environmental drivers.

  11. Do pathogens become more virulent as they spread? Evidence from the amphibian declines in Central America

    PubMed Central

    Phillips, Ben L.; Puschendorf, Robert

    2013-01-01

    The virulence of a pathogen can vary strongly through time. While cyclical variation in virulence is regularly observed, directional shifts in virulence are less commonly observed and are typically associated with decreasing virulence of biological control agents through coevolution. It is increasingly appreciated, however, that spatial effects can lead to evolutionary trajectories that differ from standard expectations. One such possibility is that, as a pathogen spreads through a naive host population, its virulence increases on the invasion front. In Central America, there is compelling evidence for the recent spread of pathogenic Batrachochytrium dendrobatidis (Bd) and for its strong impact on amphibian populations. Here, we re-examine data on Bd prevalence and amphibian population decline across 13 sites from southern Mexico through Central America, and show that, in the initial phases of the Bd invasion, amphibian population decline lagged approximately 9 years behind the arrival of the pathogen, but that this lag diminished markedly over time. In total, our analysis suggests an increase in Bd virulence as it spread southwards, a pattern consistent with rapid evolution of increased virulence on Bd's invading front. The impact of Bd on amphibians might therefore be driven by rapid evolution in addition to more proximate environmental drivers. PMID:23843393

  12. An automated protein annotation filter for integrating web-based annotation tools

    PubMed Central

    Saravanan, Vijayakumar; Shanmughavel, Primanayagam

    2007-01-01

    A wide range of web based prediction and annotation tools are frequently used for determining protein function from sequence. However, parallel processing of sequences for annotation through web tools is not possible due to several constraints in functional programming for multiple queries. Here, we propose the development of APAF as an automated protein annotation filter to overcome some of these difficulties through an integrated approach. PMID:18188426

  13. An automated protein annotation filter for integrating web-based annotation tools.

    PubMed

    Saravanan, Vijayakumar; Shanmughavel, Primanayagam

    2007-12-15

    A wide range of web based prediction and annotation tools are frequently used for determining protein function from sequence. However, parallel processing of sequences for annotation through web tools is not possible due to several constraints in functional programming for multiple queries. Here, we propose the development of APAF as an automated protein annotation filter to overcome some of these difficulties through an integrated approach.

  14. Virtual annotation: Verbal communication in virtual reality

    NASA Astrophysics Data System (ADS)

    Verlinden, Jouke C.; Bolter, Jay David; Vandermast, Charles

    A system that was developed to explore communication in virtual reality and which offers a simple and powerful method to embed verbal communication in simulations and visualizers by means of voice annotation is described. The prototype demonstrates that the addition of verbal communication opens up a range of new uses for virtual environments. A similar voice annotation facility is easily added to existing visualizers and simulations, and it enables reading, writing and communicating.

  15. Genepi: a blackboard framework for genome annotation

    PubMed Central

    Descorps-Declère, Stéphane; Ziébelin, Danielle; Rechenmann, François; Viari, Alain

    2006-01-01

    Background Genome annotation can be viewed as an incremental, cooperative, data-driven, knowledge-based process that involves multiple methods to predict gene locations and structures. This process might have to be executed more than once and might be subjected to several revisions as the biological (new data) or methodological (new methods) knowledge evolves. In this context, although a lot of annotation platforms already exist, there is still a strong need for computer systems which take in charge, not only the primary annotation, but also the update and advance of the associated knowledge. In this paper, we propose to adopt a blackboard architecture for designing such a system Results We have implemented a blackboard framework (called Genepi) for developing automatic annotation systems. The system is not bound to any specific annotation strategy. Instead, the user will specify a blackboard structure in a configuration file and the system will instantiate and run this particular annotation strategy. The characteristics of this framework are presented and discussed. Specific adaptations to the classical blackboard architecture have been required, such as the description of the activation patterns of the knowledge sources by using an extended set of Allen's temporal relations. Although the system is robust enough to be used on real-size applications, it is of primary use to bioinformatics researchers who want to experiment with blackboard architectures. Conclusion In the context of genome annotation, blackboards have several interesting features related to the way methodological and biological knowledge can be updated. They can readily handle the cooperative (several methods are implied) and opportunistic (the flow of execution depends on the state of our knowledge) aspects of the annotation process. PMID:17038181

  16. Quantitative Trait Locus (QTL) Mapping Reveals a Role for Unstudied Genes in Aspergillus Virulence

    PubMed Central

    Christians, Julian K.; Cheema, Manjinder S.; Vergara, Ismael A.; Watt, Cortney A.; Pinto, Linda J.; Chen, Nansheng; Moore, Margo M.

    2011-01-01

    Infections caused by the fungus Aspergillus are a major cause of morbidity and mortality in immunocompromised populations. To identify genes required for virulence that could be used as targets for novel treatments, we mapped quantitative trait loci (QTL) affecting virulence in the progeny of a cross between two strains of A. nidulans (FGSC strains A4 and A91). We genotyped 61 progeny at 739 single nucleotide polymorphisms (SNP) spread throughout the genome, and constructed a linkage map that was largely consistent with the genomic sequence, with the exception of one potential inversion of ∼527 kb on Chromosome V. The estimated genome size was 3705 cM and the average intermarker spacing was 5.0 cM. The average ratio of physical distance to genetic distance was 8.1 kb/cM, which is similar to previous estimates, and variation in recombination rate was significantly positively correlated with GC content, a pattern seen in other taxa. To map QTL affecting virulence, we measured the ability of each progeny strain to kill model hosts, larvae of the wax moth Galleria mellonella. We detected three QTL affecting in vivo virulence that were distinct from QTL affecting in vitro growth, and mapped the virulence QTL to regions containing 7–24 genes, excluding genes with no sequence variation between the parental strains and genes with only synonymous SNPs. None of the genes in our QTL target regions have been previously associated with virulence in Aspergillus, and almost half of these genes are currently annotated as “hypothetical”. This study is the first to map QTL affecting the virulence of a fungal pathogen in an animal host, and our results illustrate the power of this approach to identify a short list of unknown genes for further investigation. PMID:21559404

  17. Functional genomic characterization of virulence factors from necrotizing fasciitis-causing strains of Aeromonas hydrophila.

    PubMed

    Grim, Christopher J; Kozlova, Elena V; Ponnusamy, Duraisamy; Fitts, Eric C; Sha, Jian; Kirtley, Michelle L; van Lier, Christina J; Tiner, Bethany L; Erova, Tatiana E; Joseph, Sandeep J; Read, Timothy D; Shak, Joshua R; Joseph, Sam W; Singletary, Ed; Felland, Tracy; Baze, Wallace B; Horneman, Amy J; Chopra, Ashok K

    2014-07-01

    The genomes of 10 Aeromonas isolates identified and designated Aeromonas hydrophila WI, Riv3, and NF1 to NF4; A. dhakensis SSU; A. jandaei Riv2; and A. caviae NM22 and NM33 were sequenced and annotated. Isolates NF1 to NF4 were from a patient with necrotizing fasciitis (NF). Two environmental isolates (Riv2 and -3) were from the river water from which the NF patient acquired the infection. While isolates NF2 to NF4 were clonal, NF1 was genetically distinct. Outside the conserved core genomes of these 10 isolates, several unique genomic features were identified. The most virulent strains possessed one of the following four virulence factors or a combination of them: cytotoxic enterotoxin, exotoxin A, and type 3 and 6 secretion system effectors AexU and Hcp. In a septicemic-mouse model, SSU, NF1, and Riv2 were the most virulent, while NF2 was moderately virulent. These data correlated with high motility and biofilm formation by the former three isolates. Conversely, in a mouse model of intramuscular infection, NF2 was much more virulent than NF1. Isolates NF2, SSU, and Riv2 disseminated in high numbers from the muscular tissue to the visceral organs of mice, while NF1 reached the liver and spleen in relatively lower numbers on the basis of colony counting and tracking of bioluminescent strains in real time by in vivo imaging. Histopathologically, degeneration of myofibers with significant infiltration of polymorphonuclear cells due to the highly virulent strains was noted. Functional genomic analysis provided data that allowed us to correlate the highly infectious nature of Aeromonas pathotypes belonging to several different species with virulence signatures and their potential ability to cause NF.

  18. Functional Genomic Characterization of Virulence Factors from Necrotizing Fasciitis-Causing Strains of Aeromonas hydrophila

    PubMed Central

    Grim, Christopher J.; Kozlova, Elena V.; Ponnusamy, Duraisamy; Fitts, Eric C.; Sha, Jian; Kirtley, Michelle L.; van Lier, Christina J.; Tiner, Bethany L.; Erova, Tatiana E.; Joseph, Sandeep J.; Read, Timothy D.; Shak, Joshua R.; Joseph, Sam W.; Singletary, Ed; Felland, Tracy; Baze, Wallace B.; Horneman, Amy J.

    2014-01-01

    The genomes of 10 Aeromonas isolates identified and designated Aeromonas hydrophila WI, Riv3, and NF1 to NF4; A. dhakensis SSU; A. jandaei Riv2; and A. caviae NM22 and NM33 were sequenced and annotated. Isolates NF1 to NF4 were from a patient with necrotizing fasciitis (NF). Two environmental isolates (Riv2 and -3) were from the river water from which the NF patient acquired the infection. While isolates NF2 to NF4 were clonal, NF1 was genetically distinct. Outside the conserved core genomes of these 10 isolates, several unique genomic features were identified. The most virulent strains possessed one of the following four virulence factors or a combination of them: cytotoxic enterotoxin, exotoxin A, and type 3 and 6 secretion system effectors AexU and Hcp. In a septicemic-mouse model, SSU, NF1, and Riv2 were the most virulent, while NF2 was moderately virulent. These data correlated with high motility and biofilm formation by the former three isolates. Conversely, in a mouse model of intramuscular infection, NF2 was much more virulent than NF1. Isolates NF2, SSU, and Riv2 disseminated in high numbers from the muscular tissue to the visceral organs of mice, while NF1 reached the liver and spleen in relatively lower numbers on the basis of colony counting and tracking of bioluminescent strains in real time by in vivo imaging. Histopathologically, degeneration of myofibers with significant infiltration of polymorphonuclear cells due to the highly virulent strains was noted. Functional genomic analysis provided data that allowed us to correlate the highly infectious nature of Aeromonas pathotypes belonging to several different species with virulence signatures and their potential ability to cause NF. PMID:24795370

  19. Campylobacter virulence and survival factors.

    PubMed

    Bolton, Declan J

    2015-06-01

    Despite over 30 years of research, campylobacteriosis is the most prevalent foodborne bacterial infection in many countries including in the European Union and the United States of America. However, relatively little is known about the virulence factors in Campylobacter or how an apparently fragile organism can survive in the food chain, often with enhanced pathogenicity. This review collates information on the virulence and survival determinants including motility, chemotaxis, adhesion, invasion, multidrug resistance, bile resistance and stress response factors. It discusses their function in transition through the food processing environment and human infection. In doing so it provides a fundamental understanding of Campylobacter, critical for improved diagnosis, surveillance and control.

  20. JGI Plant Genomics Gene Annotation Pipeline

    SciTech Connect

    Shu, Shengqiang; Rokhsar, Dan; Goodstein, David; Hayes, David; Mitros, Therese

    2014-07-14

    Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.

  1. Collaborative annotation of 3D crystallographic models.

    PubMed

    Hunter, J; Henderson, M; Khan, I

    2007-01-01

    This paper describes the AnnoCryst system-a tool that was designed to enable authenticated collaborators to share online discussions about 3D crystallographic structures through the asynchronous attachment, storage, and retrieval of annotations. Annotations are personal comments, interpretations, questions, assessments, or references that can be attached to files, data, digital objects, or Web pages. The AnnoCryst system enables annotations to be attached to 3D crystallographic models retrieved from either private local repositories (e.g., Fedora) or public online databases (e.g., Protein Data Bank or Inorganic Crystal Structure Database) via a Web browser. The system uses the Jmol plugin for viewing and manipulating the 3D crystal structures but extends Jmol by providing an additional interface through which annotations can be created, attached, stored, searched, browsed, and retrieved. The annotations are stored on a standardized Web annotation server (Annotea), which has been extended to support 3D macromolecular structures. Finally, the system is embedded within a security framework that is capable of authenticating users and restricting access only to trusted colleagues.

  2. Metabolic pathfinding using RPAIR annotation.

    PubMed

    Faust, Karoline; Croes, Didier; van Helden, Jacques

    2009-05-01

    Metabolic databases contain information about thousands of small molecules and reactions, which can be represented as networks. In the context of metabolic reconstruction, pathways can be inferred by searching optimal paths in such networks. A recurrent problem is the presence of pool metabolites (e.g., water, energy carriers, and cofactors), which are connected to hundreds of reactions, thus establishing irrelevant shortcuts between nodes of the network. One solution to this problem relies on weighted networks to penalize highly connected compounds. A more refined solution takes the chemical structure of reactants into account in order to differentiate between side and main compounds of a reaction. Thanks to an intensive annotation effort at KEGG, decompositions of reactions into reactant pairs (RPAIR) categorized by their role (main, trans, cofac, ligase, and leave) are now available. The goal of this article is to evaluate the impact of RPAIR data on pathfinding in metabolic networks. To this end, we measure the impact of different parameters concerning the construction of the metabolic network: mapping of reactions and reactant pairs onto a graph, use of selected categories of reactant pairs, weighting schemes for compounds and reactions, removal of highly connected metabolites, and reaction directionality. In total, we tested 104 combinations of parameters and identified their optimal values for pathfinding on the basis of 55 reference pathways from three organisms. The best-performing metabolic network combines the biochemical knowledge encoded by KEGG RPAIR with a weighting scheme penalizing highly connected compounds. With this network, we could recover reference pathways from Escherichia coli with an average accuracy of 93% (32 pathways), from Saccharomyces cerevisiae with an average accuracy of 66% (11 pathways), and from humans with an average accuracy of 70% (12 pathways). Our pathfinding approach is available as part of the Network Analysis Tools.

  3. Annotated checklist of Georgia birds

    USGS Publications Warehouse

    Beaton, G.; Sykes, P.W.; Parrish, J.W.

    2003-01-01

    This edition of the checklist includes 446 species, of which 407 are on the Regular Species List, 8 on the Provisional, and 31 on the Hypothetical. This new publication has been greatly expanded and much revised over the previous checklist (GOS Occasional Publ. No. 10, 1986, 48 pp., 6x9 inches) to a 7x10-inch format with an extensive Literature Cited section added, 22 species added to the Regular List, 2 to the Provisional List, and 9 to the Hypothetical List. Each species account is much more comprehensive over all previous editions of the checklist. Among some of the new features are citations for sources of most information used, high counts of individuals for each species on the Regular List, extreme dates of occurrence within physiographic regions, a list of abbreviations and acronyms, and for each species the highest form of verifiable documentation given with its repository institution with a catalog number. This checklist is helpful for anyone working with birds in the Southeastern United States or birding in that region. Sykes' contribution to this fifth edition of the Annotated Checklist of Georgia Birds includes: suggestion of the large format and spiral binding, use of Richard A. Parks' painting of the Barn Owl on the front cover, use of literature citations throughout, and inclusion of high counts for each species. Sykes helped plan all phases of the publication, wrote about 90% of the Introduction and 84 species accounts (Osprey through Red Phalarope), designed the four maps in the introduction section and format for the Literature Cited, and with Giff Beaton designed the layout of the title page.

  4. Differences in virulence of Naegleria fowleri.

    PubMed

    De Jonckheere, J

    1979-10-01

    All pathogenic Naegleria fowleri isolated from the environment were highly virulent to mice when instilled intranasally. Axenic cultivation gradually decreased virulence of highly virulent strains. This decrease was most pronounced in environmental isolates and of minor importance in N. fowleri isolated from human cerebrospinal fluid. The low virulent strains obtained by continuous axenic cultivation appeared after clonation to consist of individuals with different virulence. Virulence could be enhanced in low virulent strains by brain passage and passages in Vero cell cultures, but could not be induced by these methods in nonvirulent strains isolated from the environment. Different mice strains showed different sensitivities to infection with pathogenic Naegleria. In addition, older mice were less sensitive than younger animals to low virulent strains. PMID:392414

  5. Differences in virulence of Naegleria fowleri.

    PubMed

    De Jonckheere, J

    1979-10-01

    All pathogenic Naegleria fowleri isolated from the environment were highly virulent to mice when instilled intranasally. Axenic cultivation gradually decreased virulence of highly virulent strains. This decrease was most pronounced in environmental isolates and of minor importance in N. fowleri isolated from human cerebrospinal fluid. The low virulent strains obtained by continuous axenic cultivation appeared after clonation to consist of individuals with different virulence. Virulence could be enhanced in low virulent strains by brain passage and passages in Vero cell cultures, but could not be induced by these methods in nonvirulent strains isolated from the environment. Different mice strains showed different sensitivities to infection with pathogenic Naegleria. In addition, older mice were less sensitive than younger animals to low virulent strains.

  6. Bioinformatics for Diagnostics, Forensics, and Virulence Characterization and Detection

    SciTech Connect

    Gardner, S; Slezak, T

    2005-04-05

    We summarize four of our group's high-risk/high-payoff research projects funded by the Intelligence Technology Innovation Center (ITIC) in conjunction with our DHS-funded pathogen informatics activities. These are (1) quantitative assessment of genomic sequencing needs to predict high quality DNA and protein signatures for detection, and comparison of draft versus finished sequences for diagnostic signature prediction; (2) development of forensic software to identify SNP and PCR-RFLP variations from a large number of viral pathogen sequences and optimization of the selection of markers for maximum discrimination of those sequences; (3) prediction of signatures for the detection of virulence, antibiotic resistance, and toxin genes and genetic engineering markers in bacteria; (4) bioinformatic characterization of virulence factors to rapidly screen genomic data for potential genes with similar functions and to elucidate potential health threats in novel organisms. The results of (1) are being used by policy makers to set national sequencing priorities. Analyses from (2) are being used in collaborations with the CDC to genotype and characterize many variola strains, and reports from these collaborations have been made to the President. We also determined SNPs for serotype and strain discrimination of 126 foot and mouth disease virus (FMDV) genomes. For (3), currently >1000 probes have been predicted for the specific detection of >4000 virulence, antibiotic resistance, and genetic engineering vector sequences, and we expect to complete the bioinformatic design of a comprehensive ''virulence detection chip'' by August 2005. Results of (4) will be a system to rapidly predict potential virulence pathways and phenotypes in organisms based on their genomic sequences.

  7. Crowdsourcing image annotation for nucleus detection and segmentation in computational pathology: evaluating experts, automated methods, and the crowd.

    PubMed

    Irshad, H; Montaser-Kouhsari, L; Waltz, G; Bucur, O; Nowak, J A; Dong, F; Knoblauch, N W; Beck, A H

    2015-01-01

    The development of tools in computational pathology to assist physicians and biomedical scientists in the diagnosis of disease requires access to high-quality annotated images for algorithm learning and evaluation. Generating high-quality expert-derived annotations is time-consuming and expensive. We explore the use of crowdsourcing for rapidly obtaining annotations for two core tasks in com- putational pathology: nucleus detection and nucleus segmentation. We designed and implemented crowdsourcing experiments using the CrowdFlower platform, which provides access to a large set of labor channel partners that accesses and manages millions of contributors worldwide. We obtained annotations from four types of annotators and compared concordance across these groups. We obtained: crowdsourced annotations for nucleus detection and segmentation on a total of 810 images; annotations using automated methods on 810 images; annotations from research fellows for detection and segmentation on 477 and 455 images, respectively; and expert pathologist-derived annotations for detection and segmentation on 80 and 63 images, respectively. For the crowdsourced annotations, we evaluated performance across a range of contributor skill levels (1, 2, or 3). The crowdsourced annotations (4,860 images in total) were completed in only a fraction of the time and cost required for obtaining annotations using traditional methods. For the nucleus detection task, the research fellow-derived annotations showed the strongest concordance with the expert pathologist- derived annotations (F-M =93.68%), followed by the crowd-sourced contributor levels 1,2, and 3 and the automated method, which showed relatively similar performance (F-M = 87.84%, 88.49%, 87.26%, and 86.99%, respectively). For the nucleus segmentation task, the crowdsourced contributor level 3-derived annotations, research fellow-derived annotations, and automated method showed the strongest concordance with the expert pathologist

  8. Insights into Entamoeba histolytica virulence modulation.

    PubMed

    Padilla-Vaca, F; Anaya-Velázquez, F

    2010-08-01

    Entamoeba histolytica is able to invade human tissues by means of several molecules and biological properties related to the virulence. Pathogenic amebas use three major virulence factors, Gal/GalNAc lectin, amebapore and proteases, for lyse, phagocytose, kill and destroy a variety of cells and tissues in the host. Responses of the parasite to host components such as mucins and bacterial flora influence the behavior of pathogenic amebas altering their expression of virulence factors. The relative virulence of different strains of E. histolytica has been shown to vary as a consequence of changes in conditions of in vitro cultivation which implies substantial changes in basic metabolic aspects and factors directly and indirectly related to amebic virulence. Comparison of E. histolytica strains with different virulence phenotypes and under different conditions of growth will help to identify new virulence factor candidates and define the interplay between virulence factors and invasive phenotype. Virulence attenuate mutants of E. histolytica are useful also to uncover novel virulence determinants. The comparison of biological properties and virulence factors between E. histolytica and E. dispar, a non-pathogenic species, has been a useful approach to investigate the key factors involved in the experimental presentation of amebiasis and its complex regulation. The molecular mechanisms that regulate these variations in virulence are not yet known. Their elucidation will help us to better understand the gene expression plasticity that enables the effective adaptation of the ameba to changes in growth culture conditions and host factors.

  9. Citrate uptake into Pectobacterium atrosepticum is critical for bacterial virulence.

    PubMed

    Urbany, Claude; Neuhaus, H Ekkehard

    2008-05-01

    To analyze whether metabolite import into Pectobacterium atrosepticum cells affects bacterial virulence, we investigated the function of a carrier which exhibits significant structural homology to characterized carboxylic-acid transport proteins. The corresponding gene, ECA3984, previously annotated as coding for a Na(+)/sulphate carrier, in fact encodes a highly specific citrate transporter (Cit1) which is energized by the proton-motive force. Expression of the cit1 gene is stimulated by the presence of citrate in the growth medium and is substantial during growth of P. atrosepticum on potato tuber tissue. Infection of tuber tissue with P. atrosepticum leads to reduced citrate levels. P. atrosepticum insertion mutants, lacking the functional Cit1 protein, did not grow in medium containing citrate as the sole carbon source, showed a substantially reduced ability to macerate potato tuber tissue, and did not provoke reduced citrate levels in the plant tissue upon infection. We propose that citrate uptake into P. atrosepticum is critical for full bacterial virulence.

  10. GO annotation in InterPro: why stability does not indicate accuracy in a sea of changing annotations.

    PubMed

    Sangrador-Vegas, Amaia; Mitchell, Alex L; Chang, Hsin-Yu; Yong, Siew-Yit; Finn, Robert D

    2016-01-01

    The removal of annotation from biological databases is often perceived as an indicator of erroneous annotation. As a corollary, annotation stability is considered to be a measure of reliability. However, diverse data-driven events can affect the stability of annotations in both primary protein sequence databases and the protein family databases that are built upon the sequence databases and used to help annotate them. Here, we describe some of these events and their consequences for the InterPro database, and demonstrate that annotation removal or reassignment is not always linked to incorrect annotation by the curator. Database URL: http://www.ebi.ac.uk/interpro.

  11. Annotated Chemical Patent Corpus: A Gold Standard for Text Mining

    PubMed Central

    Akhondi, Saber A.; Klenner, Alexander G.; Tyrchan, Christian; Manchala, Anil K.; Boppana, Kiran; Lowe, Daniel; Zimmermann, Marc; Jagarlapudi, Sarma A. R. P.; Sayle, Roger; Kors, Jan A.; Muresan, Sorel

    2014-01-01

    Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org. PMID:25268232

  12. Annotated chemical patent corpus: a gold standard for text mining.

    PubMed

    Akhondi, Saber A; Klenner, Alexander G; Tyrchan, Christian; Manchala, Anil K; Boppana, Kiran; Lowe, Daniel; Zimmermann, Marc; Jagarlapudi, Sarma A R P; Sayle, Roger; Kors, Jan A; Muresan, Sorel

    2014-01-01

    Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

  13. Annotated chemical patent corpus: a gold standard for text mining.

    PubMed

    Akhondi, Saber A; Klenner, Alexander G; Tyrchan, Christian; Manchala, Anil K; Boppana, Kiran; Lowe, Daniel; Zimmermann, Marc; Jagarlapudi, Sarma A R P; Sayle, Roger; Kors, Jan A; Muresan, Sorel

    2014-01-01

    Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org. PMID:25268232

  14. Different food sources elicit fast changes to bacterial virulence.

    PubMed

    Ketola, T; Mikonranta, L; Laakso, J; Mappes, J

    2016-01-01

    Environmentally transmitted, opportunistic bacterial pathogens have a life cycle that alternates between hosts and environmental reservoirs. Resources are often scarce and fluctuating in the outside-host environment, whereas overcoming the host immune system could allow pathogens to establish a new, resource abundant and stable niche within the host. We tested if short-term exposure to different outside-host resource types and concentrations affect Serratia marcescens-(bacterium)'s virulence in Galleria mellonella (moth). As expected, virulence was mostly dictated by the bacterial dose, but we also found a clear increase in virulence when the bacterium had inhabited a low (versus high) resource concentration, or animal-based (versus plant-based) resources for 48 h prior to injection. The results suggest that temporal changes in pathogen's resource environment can induce very rapid changes in virulence and affect infection severity. Such changes could also play an important role in shifts from environmental lifestyle to pathogenicity or switches in host range and have implications for the management of opportunistic pathogens and disease outbreaks.

  15. Carbohydrate Availability Regulates Virulence Gene Expression in Streptococcus suis

    PubMed Central

    Ferrando, M. Laura; van Baarlen, Peter; Orrù, Germano; Piga, Rosaria; Bongers, Roger S.; Wels, Michiel; De Greeff, Astrid; Smith, Hilde E.; Wells, Jerry M.

    2014-01-01

    Streptococcus suis is a major bacterial pathogen of young pigs causing worldwide economic problems for the pig industry. S. suis is also an emerging pathogen of humans. Colonization of porcine oropharynx by S. suis is considered to be a high risk factor for invasive disease. In the oropharyngeal cavity, where glucose is rapidly absorbed but dietary α-glucans persist, there is a profound effect of carbohydrate availability on the expression of virulence genes. Nineteen predicted or confirmed S. suis virulence genes that promote adhesion to and invasion of epithelial cells were expressed at higher levels when S. suis was supplied with the α-glucan starch/pullulan compared to glucose as the single carbon source. Additionally the production of suilysin, a toxin that damages epithelial cells, was increased more than ten-fold when glucose levels were low and S. suis was growing on pullulan. Based on biochemical, bioinformatics and in vitro and in vivo gene expression studies, we developed a biological model that postulates the effect of carbon catabolite repression on expression of virulence genes in the mucosa, organs and blood. This research increases our understanding of S. suis virulence mechanisms and has important implications for the design of future control strategies including the development of anti-infective strategies by modulating animal feed composition. PMID:24642967

  16. Non-Formal Education and Radio: A Selected, Annotated Bibliography. Annotated Bibliography #14.

    ERIC Educational Resources Information Center

    Vergeldt, Vicki; And Others

    Materials concerning the use of radio and mass communications for non-formal education and development are listed in a selected annotated bibliography, intended for those actively involved in non-formal education and development. Three sections contain annotated entries (which range from 1972-1983), each of which includes source information and…

  17. MixtureTree annotator: a program for automatic colorization and visual annotation of MixtureTree.

    PubMed

    Chen, Shu-Chuan; Ogata, Aaron

    2015-01-01

    The MixtureTree Annotator, written in JAVA, allows the user to automatically color any phylogenetic tree in Newick format generated from any phylogeny reconstruction program and output the Nexus file. By providing the ability to automatically color the tree by sequence name, the MixtureTree Annotator provides a unique advantage over any other programs which perform a similar function. In addition, the MixtureTree Annotator is the only package that can efficiently annotate the output produced by MixtureTree with mutation information and coalescent time information. In order to visualize the resulting output file, a modified version of FigTree is used. Certain popular methods, which lack good built-in visualization tools, for example, MEGA, Mesquite, PHY-FI, TreeView, treeGraph and Geneious, may give results with human errors due to either manually adding colors to each node or with other limitations, for example only using color based on a number, such as branch length, or by taxonomy. In addition to allowing the user to automatically color any given Newick tree by sequence name, the MixtureTree Annotator is the only method that allows the user to automatically annotate the resulting tree created by the MixtureTree program. The MixtureTree Annotator is fast and easy-to-use, while still allowing the user full control over the coloring and annotating process. PMID:25826378

  18. MixtureTree annotator: a program for automatic colorization and visual annotation of MixtureTree.

    PubMed

    Chen, Shu-Chuan; Ogata, Aaron

    2015-01-01

    The MixtureTree Annotator, written in JAVA, allows the user to automatically color any phylogenetic tree in Newick format generated from any phylogeny reconstruction program and output the Nexus file. By providing the ability to automatically color the tree by sequence name, the MixtureTree Annotator provides a unique advantage over any other programs which perform a similar function. In addition, the MixtureTree Annotator is the only package that can efficiently annotate the output produced by MixtureTree with mutation information and coalescent time information. In order to visualize the resulting output file, a modified version of FigTree is used. Certain popular methods, which lack good built-in visualization tools, for example, MEGA, Mesquite, PHY-FI, TreeView, treeGraph and Geneious, may give results with human errors due to either manually adding colors to each node or with other limitations, for example only using color based on a number, such as branch length, or by taxonomy. In addition to allowing the user to automatically color any given Newick tree by sequence name, the MixtureTree Annotator is the only method that allows the user to automatically annotate the resulting tree created by the MixtureTree program. The MixtureTree Annotator is fast and easy-to-use, while still allowing the user full control over the coloring and annotating process.

  19. MvirDB: Microbial Database of Protein Toxins, Virulence Factors and Antibiotic Resistance Genes for Bio-Defense Applications

    DOE Data Explorer

    Zhou, C. E.; Smith, J.; Lam, M.; Zemla, M. D.; Slezak, T.

    MvirDB is a cenntralized resource (data warehouse) comprising all publicly accessible, organized sequence data for protein toxins, virulence factors, and antibiotic resistance genes. Protein entries in MvirDB are annotated using a high-throughput, fully automated computational annotation system; annotations are updated periodically to ensure that results are derived using current public database and open-source tool releases. Tools provided for using MvirDB include a web-based browser tool and BLAST interfaces. MvirDB serves researchers in the bio-defense and medical fields. (taken from page 3 of PI's paper of same title published in Nucleic Acids Research, 2007, Vol.35, Database Issue (Open Source)

  20. ACID: annotation of cassette and integron data

    PubMed Central

    Joss, Michael J; Koenig, Jeremy E; Labbate, Maurizio; Polz, Martin F; Gillings, Michael R; Stokes, Harold W; Doolittle, W Ford; Boucher, Yan

    2009-01-01

    Background Although integrons and their associated gene cassettes are present in ~10% of bacteria and can represent up to 3% of the genome in which they are found, very few have been properly identified and annotated in public databases. These genetic elements have been overlooked in comparison to other vectors that facilitate lateral gene transfer between microorganisms. Description By automating the identification of integron integrase genes and of the non-coding cassette-associated attC recombination sites, we were able to assemble a database containing all publicly available sequence information regarding these genetic elements. Specialists manually curated the database and this information was used to improve the automated detection and annotation of integrons and their encoded gene cassettes. ACID (annotation of cassette and integron data) can be searched using a range of queries and the data can be downloaded in a number of formats. Users can readily annotate their own data and integrate it into ACID using the tools provided. Conclusion ACID is a community resource providing easy access to annotations of integrons and making tools available to detect them in novel sequence data. ACID also hosts a forum to prompt integron-related discussion, which can hopefully lead to a more universal definition of this genetic element. PMID:19383137

  1. Pathway Analysis Software: Annotation Errors and Solutions

    PubMed Central

    Henderson-MacLennan, Nicole K.; Papp, Jeanette C.; Talbot, C. Conover; McCabe, Edward R.B.; Presson, Angela P.

    2010-01-01

    Genetic databases contain a variety of annotation errors that often go unnoticed due to the large size of modern genetic data sets. Interpretation of these data sets requires bioinformatics tools that may contribute to this problem. While providing gene symbol annotations for identifiers (IDs) such as microarray probeset, RefSeq, GenBank and Entrez Gene is seemingly trivial, the accuracy is fundamental to any subsequent conclusions. We examine gene symbol annotations and results from three commercial pathway analysis software (PAS) packages: Ingenuity Pathways Analysis, GeneGO and Pathway Studio. We compare gene symbol annotations and canonical pathway results over time and among different input ID types. We find that PAS results can be affected by variation in gene symbol annotations across software releases and the input ID type analyzed. As a result, we offer suggestions for using commercial PAS and reporting microarray results to improve research quality. We propose a wiki type website to facilitate communication of bioinformatics software problems within the scientific community. PMID:20663702

  2. Automated analysis and annotation of basketball video

    NASA Astrophysics Data System (ADS)

    Saur, Drew D.; Tan, Yap-Peng; Kulkarni, Sanjeev R.; Ramadge, Peter J.

    1997-01-01

    Automated analysis and annotation of video sequences are important for digital video libraries, content-based video browsing and data mining projects. A successful video annotation system should provide users with useful video content summary in a reasonable processing time. Given the wide variety of video genres available today, automatically extracting meaningful video content for annotation still remains hard by using current available techniques. However, a wide range video has inherent structure such that some prior knowledge about the video content can be exploited to improve our understanding of the high-level video semantic content. In this paper, we develop tools and techniques for analyzing structured video by using the low-level information available directly from MPEG compressed video. Being able to work directly in the video compressed domain can greatly reduce the processing time and enhance storage efficiency. As a testbed, we have developed a basketball annotation system which combines the low-level information extracted from MPEG stream with the prior knowledge of basketball video structure to provide high level content analysis, annotation and browsing for events such as wide- angle and close-up views, fast breaks, steals, potential shots, number of possessions and possession times. We expect our approach can also be extended to structured video in other domains.

  3. Quantifying Variability of Manual Annotation in Cryo-Electron Tomograms.

    PubMed

    Hecksel, Corey W; Darrow, Michele C; Dai, Wei; Galaz-Montoya, Jesús G; Chin, Jessica A; Mitchell, Patrick G; Chen, Shurui; Jakana, Jemba; Schmid, Michael F; Chiu, Wah

    2016-06-01

    Although acknowledged to be variable and subjective, manual annotation of cryo-electron tomography data is commonly used to answer structural questions and to create a "ground truth" for evaluation of automated segmentation algorithms. Validation of such annotation is lacking, but is critical for understanding the reproducibility of manual annotations. Here, we used voxel-based similarity scores for a variety of specimens, ranging in complexity and segmented by several annotators, to quantify the variation among their annotations. In addition, we have identified procedures for merging annotations to reduce variability, thereby increasing the reliability of manual annotation. Based on our analyses, we find that it is necessary to combine multiple manual annotations to increase the confidence level for answering structural questions. We also make recommendations to guide algorithm development for automated annotation of features of interest. PMID:27225525

  4. Antimicrobial Resistance and Virulence: a Successful or Deleterious Association in the Bacterial World?

    PubMed Central

    Beceiro, Alejandro; Tomás, María

    2013-01-01

    SUMMARY Hosts and bacteria have coevolved over millions of years, during which pathogenic bacteria have modified their virulence mechanisms to adapt to host defense systems. Although the spread of pathogens has been hindered by the discovery and widespread use of antimicrobial agents, antimicrobial resistance has increased globally. The emergence of resistant bacteria has accelerated in recent years, mainly as a result of increased selective pressure. However, although antimicrobial resistance and bacterial virulence have developed on different timescales, they share some common characteristics. This review considers how bacterial virulence and fitness are affected by antibiotic resistance and also how the relationship between virulence and resistance is affected by different genetic mechanisms (e.g., coselection and compensatory mutations) and by the most prevalent global responses. The interplay between these factors and the associated biological costs depend on four main factors: the bacterial species involved, virulence and resistance mechanisms, the ecological niche, and the host. The development of new strategies involving new antimicrobials or nonantimicrobial compounds and of novel diagnostic methods that focus on high-risk clones and rapid tests to detect virulence markers may help to resolve the increasing problem of the association between virulence and resistance, which is becoming more beneficial for pathogenic bacteria. PMID:23554414

  5. A Zebrafish Larval Model to Assess Virulence of Porcine Streptococcus suis Strains

    PubMed Central

    Zaccaria, Edoardo; Cao, Rui; Wells, Jerry M.; van Baarlen, Peter

    2016-01-01

    Streptococcus suis is an encapsulated Gram-positive bacterium, and the leading cause of sepsis and meningitis in young pigs resulting in considerable economic losses in the porcine industry. It is also considered an emerging zoonotic agent. In the environment, both avirulent and virulent strains occur in pigs, and virulent strains appear to cause disease in both humans and pigs. There is a need for a convenient, reliable and standardized animal model to assess S. suis virulence. A zebrafish (Danio rerio) larvae infection model has several advantages, including transparency of larvae, low cost, ease of use and exemption from ethical legislation up to 6 days post fertilization, but has not been previously established as a model for S. suis. Microinjection of different porcine strains of S. suis in zebrafish larvae resulted in highly reproducible dose- and strain-dependent larval death, strongly correlating with presence of the S. suis capsule and to the original virulence of the strain in pigs. Additionally we compared the virulence of the two-component system mutant of ciaRH, which is attenuated for virulence in both mice and pigs in vivo. Infection of larvae with the ΔciaRH strain resulted in significantly higher survival rate compared to infection with the S10 wild-type strain. Our data demonstrate that zebrafish larvae are a rapid and reliable model to assess the virulence of clinical porcine S. suis isolates. PMID:26999052

  6. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR

    PubMed Central

    Yang, Hui; Wang, Kai

    2016-01-01

    Recent developments in sequencing techniques have enabled rapid and high-throughput generation of sequence data, democratizing the ability to compile information on large amounts of genetic variations in individual laboratories. However, there is a growing gap between the generation of raw sequencing data and the extraction of meaningful biological information. Here, we describe a protocol to use the ANNOVAR (ANNOtate VARiation) software to facilitate fast and easy variant annotations, including gene-based, region-based and filter-based annotations on a variant call format (VCF) file generated from human genomes. We further describe a protocol for gene-based annotation of a newly sequenced nonhuman species. Finally, we describe how to use a user-friendly and easily accessible web server called wANNOVAR to prioritize candidate genes for a Mendelian disease. The variant annotation protocols take 5–30 min of computer time, depending on the size of the variant file, and 5–10 min of hands-on time. In summary, through the command-line tool and the web server, these protocols provide a convenient means to analyze genetic variants generated in humans and other species. PMID:26379229

  7. Functional annotation of hypothetical proteins - A review.

    PubMed

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2006-12-29

    The complete human genome sequences in the public database provide ways to understand the blue print of life. As of June 29, 2006, 27 archaeal, 326 bacterial and 21 eukaryotes is complete genomes are available and the sequencing for 316 bacterial, 24 archaeal, 126 eukaryotic genomes are in progress. The traditional biochemical/molecular experiments can assign accurate functions for genes in these genomes. However, the process is time-consuming and costly. Despite several efforts, only 50-60 % of genes have been annotated in most completely sequenced genomes. Automated genome sequence analysis and annotation may provide ways to understand genomes. Thus, determination of protein function is one of the challenging problems of the post-genome era. This demands bioinformatics to predict functions of un-annotated protein sequences by developing efficient tools. Here, we discuss some of the recent and popular approaches developed in Bioinformatics to predict functions for hypothetical proteins.

  8. I2Cnet medical image annotation service.

    PubMed

    Chronaki, C E; Zabulis, X; Orphanoudakis, S C

    1997-01-01

    I2Cnet (Image Indexing by Content network) aims to provide services related to the content-based management of images in healthcare over the World-Wide Web. Each I2Cnet server maintains an autonomous repository of medical images and related information. The annotation service of I2Cnet allows specialists to interact with the contents of the repository, adding comments or illustrations to medical images of interest. I2Cnet annotations may be communicated to other users via e-mail or posted to I2Cnet for inclusion in its local repositories. This paper discusses the annotation service of I2Cnet and argues that such services pave the way towards the evolution of active digital medical image libraries.

  9. Annotating images by mining image search results.

    PubMed

    Wang, Xin-Jing; Zhang, Lei; Li, Xirong; Ma, Wei-Ying

    2008-11-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

  10. Listeria Pathogenesis and Molecular Virulence Determinants

    PubMed Central

    Vázquez-Boland, José A.; Kuhn, Michael; Berche, Patrick; Chakraborty, Trinad; Domínguez-Bernal, Gustavo; Goebel, Werner; González-Zorn, Bruno; Wehland, Jürgen; Kreft, Jürgen

    2001-01-01

    , rapid intracytoplasmic multiplication, bacterially induced actin-based motility, and direct spread to neighboring cells, in which they reinitiate the cycle. In this way, listeriae disseminate in host tissues sheltered from the humoral arm of the immune system. Over the last 15 years, a number of virulence factors involved in key steps of this intracellular life cycle have been identified. This review describes in detail the molecular determinants of Listeria virulence and their mechanism of action and summarizes the current knowledge on the pathophysiology of listeriosis and the cell biology and host cell responses to Listeria infection. This article provides an updated perspective of the development of our understanding of Listeria pathogenesis from the first molecular genetic analyses of virulence mechanisms reported in 1985 until the start of the genomic era of Listeria research. PMID:11432815

  11. Uropathogenic Escherichia coli virulence genes: invaluable approaches for designing DNA microarray probes

    PubMed Central

    Jahandeh, Nadia; Ranjbar, Reza; Behzadi, Elham

    2015-01-01

    Introduction The pathotypes of uropathogenic Escherichia coli (UPEC) cause different types of urinary tract infections (UTIs). The presence of a wide range of virulence genes in UPEC enables us to design appropriate DNA microarray probes. These probes, which are used in DNA microarray technology, provide us with an accurate and rapid diagnosis and definitive treatment in association with UTIs caused by UPEC pathotypes. The main goal of this article is to introduce the UPEC virulence genes as invaluable approaches for designing DNA microarray probes. Material and methods Main search engines such as Google Scholar and databases like NCBI were searched to find and study several original pieces of literature, review articles, and DNA gene sequences. In parallel with in silico studies, the experiences of the authors were helpful for selecting appropriate sources and writing this review article. Results There is a significant variety of virulence genes among UPEC strains. The DNA sequences of virulence genes are fabulous patterns for designing microarray probes. The location of virulence genes and their sequence lengths influence the quality of probes. Conclusions The use of selected virulence genes for designing microarray probes gives us a wide range of choices from which the best probe candidates can be chosen. DNA microarray technology provides us with an accurate, rapid, cost-effective, sensitive, and specific molecular diagnostic method which is facilitated by designing microarray probes. Via these tools, we are able to have an accurate diagnosis and a definitive treatment regarding UTIs caused by UPEC pathotypes. PMID:26855801

  12. Rag Virulence Among Soybean Aphids (Hemiptera: Aphididae) in Wisconsin.

    PubMed

    Crossley, Michael S; Hogg, David B

    2015-02-01

    Soybean aphid, Aphis glycines Matsumura, a pest of soybean, Glycine max (L.) Merr., and native of Asia, invaded North America sometime before 2000 and rapidly became the most significant insect pest of soybean in the upper Midwest. Plant resistance, a key component of integrated pest management, has received significant attention in the past decade, and several resistance (Rag) genes have been identified. However, the efficacy of Rag (Resistance to Aphis glycines) genes in suppressing aphid abundance has been challenged by the occurrence of soybean aphids capable of overcoming Rag gene-mediated resistance. Although the occurrence of these Rag virulent biotypes poses a serious threat to effective and sustainable management of soybean aphid, little is known about the current abundance of biotypes in North America. The objective of this research was to determine the distribution of Rag virulent soybean aphids in Wisconsin. Soybean aphids were collected from Wisconsin during the summers of 2012 and 2013, and assayed for Rag1, Rag2, and Rag1+2 virulence using no-choice tests in a greenhouse. One clone from Monroe County in 2012 reacted like biotype 4, three clones in different counties in 2013 responded like biotype 2, and eight others expressed varying degrees of Rag virulence. Rag virulence in 2013 was observed in aphids from 33% of the sampled sites and was accounted for by just 4.5% of sampled clones, although this is likely a conservative estimate. No-choice test results are discussed in light of current questions on the biology, ecology, and population genetics of soybean aphid.

  13. Annotation for information extraction from mammography reports.

    PubMed

    Bozkurt, Selen; Gulkesen, Kemal Hakan; Rubin, Daniel

    2013-01-01

    Inter and intra-observer variability in mammographic interpretation is a challenging problem, and decision support systems (DSS) may be helpful to reduce variation in practice. Since radiology reports are created as unstructured text reports, Natural language processing (NLP) techniques are needed to extract structured information from reports in order to provide the inputs to DSS. Before creating NLP systems, producing high quality annotated data set is essential. The goal of this project is to develop an annotation schema to guide the information extraction tasks needed from free-text mammography reports. PMID:23823416

  14. An annotated bibliography of psychiatric medical ethics.

    PubMed

    Anzia, D J; La Puma, J

    1991-03-01

    We offer an annotated bibliography of psychiatric medical ethics that we hope will be useful for psychiatrists and other mental health professionals who are interested in the moral dimensions of psychiatric care. We present the educational and clinical rationale for the bibliography, ways to use the bibliography, and the bibliography itself. Using the American Psychiatric Association's Principles of Medical Ethics With Annotations Especially Applicable to Psychiatry as a principled framework, we selected references based primarily on educational and clinical relevance for physicians. We include both empirical and conceptual analyses of the ethical issues seen daily in the office, clinic, hospital, nursing home, and in society at large.

  15. Real-Time Biological Annotation of Synthetic Compounds.

    PubMed

    Gerry, Christopher J; Hua, Bruce K; Wawer, Mathias J; Knowles, Jonathan P; Nelson, Shawn D; Verho, Oscar; Dandapani, Sivaraman; Wagner, Bridget K; Clemons, Paul A; Booker-Milburn, Kevin I; Boskovic, Zarko V; Schreiber, Stuart L

    2016-07-20

    Organic chemists are able to synthesize molecules in greater number and chemical complexity than ever before. Yet, a majority of these compounds go untested in biological systems, and those that do are often tested long after the chemist can incorporate the results into synthetic planning. We propose the use of high-dimensional "multiplex" assays, which are capable of measuring thousands of cellular features in one experiment, to annotate rapidly and inexpensively the biological activities of newly synthesized compounds. This readily accessible and inexpensive "real-time" profiling method can be used in a prospective manner to facilitate, for example, the efficient construction of performance-diverse small-molecule libraries that are enriched in bioactives. Here, we demonstrate this concept by synthesizing ten triads of constitutionally isomeric compounds via complexity-generating photochemical and thermal rearrangements and measuring compound-induced changes in cellular morphology via an imaging-based "cell painting" assay. Our results indicate that real-time biological annotation can inform optimization efforts and library syntheses by illuminating trends relating to biological activity that would be difficult to predict if only chemical structure were considered. We anticipate that probe and drug discovery will benefit from the use of optimization efforts and libraries that implement this approach. PMID:27398798

  16. The Gene Wiki: community intelligence applied to human gene annotation.

    PubMed

    Huss, Jon W; Lindenbaum, Pierre; Martone, Michael; Roberts, Donabel; Pizarro, Angel; Valafar, Faramarz; Hogenesch, John B; Su, Andrew I

    2010-01-01

    Annotating the function of all human genes is a critical, yet formidable, challenge. Current gene annotation efforts focus on centralized curation resources, but it is increasingly clear that this approach does not scale with the rapid growth of the biomedical literature. The Gene Wiki utilizes an alternative and complementary model based on the principle of community intelligence. Directly integrated within the online encyclopedia, Wikipedia, the goal of this effort is to build a gene-specific review article for every gene in the human genome, where each article is collaboratively written, continuously updated and community reviewed. Previously, we described the creation of Gene Wiki 'stubs' for approximately 9000 human genes. Here, we describe ongoing systematic improvements to these articles to increase their utility. Moreover, we retrospectively examine the community usage and improvement of the Gene Wiki, providing evidence of a critical mass of users and editors. Gene Wiki articles are freely accessible within the Wikipedia web site, and additional links and information are available at http://en.wikipedia.org/wiki/Portal:Gene_Wiki.

  17. GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations

    PubMed Central

    Paila, Umadevi; Chapman, Brad A.; Kirchner, Rory; Quinlan, Aaron R.

    2013-01-01

    Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics. PMID:23874191

  18. Salmonella-secreted Virulence Factors

    SciTech Connect

    Heffron, Fred; Niemann, George; Yoon, Hyunjin; Kidwai, Afshan S.; Brown, Roslyn N.; McDermott, Jason E.; Smith, Richard D.; Adkins, Joshua N.

    2011-05-01

    In this short review we discuss secreted virulence factors of Salmonella, which directly affect Salmonella interaction with its host. Salmonella secretes protein to subvert host defenses but also, as discussed, to reduce virulence thereby permitting the bacteria to persist longer and more successfully disperse. The type III secretion system (TTSS) is the best known and well studied of the mechanisms that enable secretion from the bacterial cytoplasm to the host cell cytoplasm. Other secretion systems include outer membrane vesicles, which are present in all Gram-negative bacteria examined to date, two-partner secretion, and type VI secretion will also be addressed. Excellent reviews of Salmonella secreted effectors have focused on themes such as actin rearrangements, vesicular trafficking, ubiquitination, and the activities of the virulence factors themselves. This short review is based on S. Typhimurium infection of mice because it is a model of typhoid like disease in humans. We have organized effectors in terms of events that happen during the infection cycle and how secreted effectors may be involved.

  19. Solar Tutorial and Annotation Resource (STAR)

    NASA Astrophysics Data System (ADS)

    Showalter, C.; Rex, R.; Hurlburt, N. E.; Zita, E. J.

    2009-12-01

    We have written a software suite designed to facilitate solar data analysis by scientists, students, and the public, anticipating enormous datasets from future instruments. Our “STAR" suite includes an interactive learning section explaining 15 classes of solar events. Users learn software tools that exploit humans’ superior ability (over computers) to identify many events. Annotation tools include time slice generation to quantify loop oscillations, the interpolation of event shapes using natural cubic splines (for loops, sigmoids, and filaments) and closed cubic splines (for coronal holes). Learning these tools in an environment where examples are provided prepares new users to comfortably utilize annotation software with new data. Upon completion of our tutorial, users are presented with media of various solar events and asked to identify and annotate the images, to test their mastery of the system. Goals of the project include public input into the data analysis of very large datasets from future solar satellites, and increased public interest and knowledge about the Sun. In 2010, the Solar Dynamics Observatory (SDO) will be launched into orbit. SDO’s advancements in solar telescope technology will generate a terabyte per day of high-quality data, requiring innovation in data management. While major projects develop automated feature recognition software, so that computers can complete much of the initial event tagging and analysis, still, that software cannot annotate features such as sigmoids, coronal magnetic loops, coronal dimming, etc., due to large amounts of data concentrated in relatively small areas. Previously, solar physicists manually annotated these features, but with the imminent influx of data it is unrealistic to expect specialized researchers to examine every image that computers cannot fully process. A new approach is needed to efficiently process these data. Providing analysis tools and data access to students and the public have proven

  20. [A diagnostic serum with antibodies to virulent Yersinia].

    PubMed

    Smirnov, I V; Gorokhov, V I

    1990-01-01

    A diagnostic agglutinating adsorbed rabbit serum with antibodies to Yersinia strains containing virulence plasmid with molecular mass of 40-50 mD was prepared. Trials of this serum in agglutination test on the glass with 69 Yersinia strains (Y. enterocolitica, Y. frederiksenii, Y. intermedia, Y. kristensenii, Y. pseudotuberculosis) and with 42 other Enterobacteriaceae strains have confirmed the specificity and sufficiently high activity of the agent in respect of virulent Yersinia strains. Experiments have demonstrated the possibility of using this serum in tests for the detection of yersiniasis agents and for rapid assessment of individual Yersinia clones in respect of the presence of plasmid with a molecular mass of 40-50 mD.

  1. High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource.

    PubMed

    Seaver, Samuel M D; Gerdes, Svetlana; Frelin, Océane; Lerma-Ortiz, Claudia; Bradbury, Louis M T; Zallot, Rémi; Hasnain, Ghulam; Niehaus, Thomas D; El Yacoubi, Basma; Pasternak, Shiran; Olson, Robert; Pusch, Gordon; Overbeek, Ross; Stevens, Rick; de Crécy-Lagard, Valérie; Ware, Doreen; Hanson, Andrew D; Henry, Christopher S

    2014-07-01

    The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed. PMID:24927599

  2. Online Annotation--Research and Practices

    ERIC Educational Resources Information Center

    Glover, Ian; Xu, Zhijie; Hardaker, Glenn

    2007-01-01

    Annotation can be a valuable exercise when trying to understand new information. The technique can be used to create a "condensed" version of the original information for later review and to add additional information into the existing document. The growth in web-based learning materials and information sources has created requirement for systems…

  3. Effective function annotation through catalytic residue conservation.

    PubMed

    George, Richard A; Spriggs, Ruth V; Bartlett, Gail J; Gutteridge, Alex; MacArthur, Malcolm W; Porter, Craig T; Al-Lazikani, Bissan; Thornton, Janet M; Swindells, Mark B

    2005-08-30

    Because of the extreme impact of genome sequencing projects, protein sequences without accompanying experimental data now dominate public databases. Homology searches, by providing an opportunity to transfer functional information between related proteins, have become the de facto way to address this. Although a single, well annotated, close relationship will often facilitate sufficient annotation, this situation is not always the case, particularly if mutations are present in important functional residues. When only distant relationships are available, the transfer of function information is more tenuous, and the likelihood of encountering several well annotated proteins with different functions is increased. The consequence for a researcher is a range of candidate functions with little way of knowing which, if any, are correct. Here, we address the problem directly by introducing a computational approach to accurately identify and segregate related proteins into those with a functional similarity and those where function differs. This approach should find a wide range of applications, including the interpretation of genomics/proteomics data and the prioritization of targets for high-throughput structure determination. The method is generic, but here we concentrate on enzymes and apply high-quality catalytic site data. In addition to providing a series of comprehensive benchmarks to show the overall performance of our approach, we illustrate its utility with specific examples that include the correct identification of haptoglobin as a nonenzymatic relative of trypsin, discrimination of acid-d-amino acid ligases from a much larger ligase pool, and the successful annotation of BioH, a structural genomics target.

  4. Studies of Scientific Disciplines. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Weisz, Diane; Kruytbosch, Carlos

    Provided in this bibliography are annotated lists of social studies of science literature, arranged alphabetically by author in 13 disciplinary areas. These areas include astronomy; general biology; biochemistry and molecular biology; biomedicine; chemistry; earth and space sciences; economics; engineering; mathematics; physics; political science;…

  5. Counseling American Indians: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Tisdale, Elizabeth; Thomason, Timothy C.

    This bibliography presents 75 annotated entries on counseling and psychotherapy with American Indians. Entries include journal articles, books, book chapters, newspaper and newsletter articles, and conference papers, published 1964-96. Topics covered include counseling approaches and techniques, mental health services for Native Americans,…

  6. Core French: A Selected Annotated Resource List.

    ERIC Educational Resources Information Center

    Boyd, J. A.; Mollica, Anthony

    1985-01-01

    This is an annotated bibliography of: readers, workbooks, conversation books, cultural sources and readings, flash cards, duplicating or line masters, and media kits submitted by publishers as applicable to French second language instruction from kindergarten through senior high school levels. (MSE)

  7. Learning To Lead: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Stehno, Joe

    This annotated bibliography reviews some of the leadership development training programs currently being offered to business, industry, and educational personnel. Section 1 focuses on programs for corporate personnel. Section 2 reviews both preparatory and continuing professional education programs for top college and university administrators.…

  8. Reflective Annotations: On Becoming a Scholar

    ERIC Educational Resources Information Center

    Alexander, Mark; Taylor, Caroline; Greenberger, Scott; Watts, Margie; Balch, Riann

    2012-01-01

    This article presents the authors' reflective annotations on becoming a scholar. This paper begins with a discussion on socialization for teaching, followed by a discussion on socialization for service and sense of belonging. Then, it describes how the doctoral process evolves. Finally, it talks about adult learners who pursue doctoral education.

  9. Skin Cancer Education Materials: Selected Annotations.

    ERIC Educational Resources Information Center

    National Cancer Inst. (NIH), Bethesda, MD.

    This annotated bibliography presents 85 entries on a variety of approaches to cancer education. The entries are grouped under three broad headings, two of which contain smaller sub-divisions. The first heading, Public Education, contains prevention and general information, and non-print materials. The second heading, Professional Education,…

  10. Annotated bibliography of psychomotor testing. Technical report

    SciTech Connect

    Ervin, C.

    1987-03-01

    An annotated bibliography of 67 publications in the field of psychomotor testing has been prepared. The collection includes technical reports, journal articles, presented at scientific meetings, books and conference proceedings. The publications were assembled as preliminary work in the development of a dexterity test battery designed to measure the effects of chemical-defense-treatment drugs.

  11. Intellectual Freedom and Censorship: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Hoffmann, Frank

    Intended to act as a general introduction for high school and college students, this book presents an annotated bibliography of books, periodical articles, legal materials, and other documents dealing with the subject of intellectual freedom and censorship. The book is divided into five parts: (1) "The Theoretical Foundations of Censorship and…

  12. An Annotated Bibliography on Early Childhood.

    ERIC Educational Resources Information Center

    Michigan Univ., Ann Arbor. Architectural Research Lab.

    This annotated bibliography of more than 150 books and articles covers a wide range of topical areas concerned with the relationship of the young child to his environment. Among the 18 topics included are: child development; health, educational, staff, and community programs; infants and toddlers, handicapped children; Project Head Start; day…

  13. Environment and the Community: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Department of Housing and Urban Development, Washington, DC.

    Three hundred and nine citations of books, reports, and articles dating from 1964 to 1971 are included in this annotated bibliography, intended as a selection tool for concerned citizens, architects, builders, and city planners emphasizing the environment of American cities and communities. It is topically arranged into sixteen broad sections with…

  14. Nutrition & Adolescent Pregnancy: A Selected Annotated Bibliography.

    ERIC Educational Resources Information Center

    National Agricultural Library (USDA), Washington, DC.

    This annotated bibliography on nutrition and adolescent pregnancy is intended to be a source of technical assistance for nurses, nutritionists, physicians, educators, social workers, and other personnel concerned with improving the health of teenage mothers and their babies. It is divided into two major sections. The first section lists selected…

  15. Adolescent Reproductive Behaviour: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    United Nations, New York, NY. Population Div.

    A general overview of the literature on adolescent fertility and closely related issues is provided in this annotated bibliography. Material on the following topics is included: (1) programs related to adolescent pregnancy, contraception, abortion, and births; (2) studies relating socioeconomic characteristics of pregnant adolescents to their…

  16. College Students in Transition: An Annotated Bibliography

    ERIC Educational Resources Information Center

    Foote, Stephanie M., Ed.; Hinkle, Sara M., Ed.; Kranzow, Jeannine, Ed.; Pistilli, Matthew D., Ed.; Miles, LaTonya Rease, Ed.; Simmons, Jannell G., Ed.

    2013-01-01

    The transition from high school to college is an important milestone, but it is only one of many steps in the journey through higher education. This volume is an annotated bibliography of the emerging literature examining the many other transitions students make beyond the first year, including the sophomore year, the transfer experience, and the…

  17. Suggested Books for Children: An Annotated Bibliography

    ERIC Educational Resources Information Center

    NHSA Dialog, 2008

    2008-01-01

    This article provides an annotated bibliography of various children's books. It includes listings of books that illustrate the dynamic relationships within the natural environment, economic context, racial and cultural identities, cross-group similarities and differences, gender, different abilities and stories of injustice and resistance.

  18. Participative Decision Making: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Henson, Ramon; Camp, Richaurd

    An annotated bibliography of 40 articles on participative decision making (PDM) published from 1968 through 1975 is presented. The following categories were used in summarizing each article: description, sample, type of study, variables, PDM variables, results and discussion. An introduction to the bibliography discusses some issues related to…

  19. Sex and Proxemics: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Nelson, Audrey A.

    This annotated bibliography focuses on the sex differences and similarities in two proxemic variables, physical distance and orientation of the body. The majority of the more than 90 titles, dating from 1965 to the present, are selected from the following sources: dissertation abstracts, social-psychology journals, communication journals, and…

  20. Small Group Communication: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Gouran, Dennis S.; Guadagnino, Christopher S.

    This annotated bibliography includes sources of information that are primarily concerned with problem solving, decision making, and processes of social influence in small groups, and secondarily deal with other aspects of communication and interaction in groups, such as conflict management and negotiation. The 57 entries, all dating from 1980…

  1. A Partially Annotated Political Communication Bibliography.

    ERIC Educational Resources Information Center

    Thornton, Barbara C.

    This 63-page annotated bibliography contains available materials in the area of political communication, a relatively new field of political science. Political communication includes facets of the election process and interaction between political parties and the voter. A variety of materials dating from 1960 to 1972 include books, pamphlets,…

  2. Rates of Comprehension: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Berger, Allen, Comp.; Peebles, James D., Comp.

    This booklet is a revision of an earlier annotated bibliography, "Speed Reading," compiled by Allen Berger in 1967 and revised in 1970. The 82 entries are arranged alphabetically by author in the following ten categories: tachistoscope and controlled pacing, paperback scanning, flexible rates of comprehension, retention of gains, perception,…

  3. Ontological Annotation with WordNet

    SciTech Connect

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob; Hohimer, Ryan E.; White, Amanda M.

    2006-06-06

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  4. Automating Ontological Annotation with WordNet

    SciTech Connect

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob L.; Hohimer, Ryan E.; White, Amanda M.

    2006-01-22

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  5. Document Delivery: An Annotated Selective Bibliography.

    ERIC Educational Resources Information Center

    Khalil, Mounir A.; Katz, Suzanne R.

    1992-01-01

    Presents a selective annotated bibliography of 61 items that deal with topics related to document delivery, including networks; hypertext; interlibrary loan; computer security; electronic publishing; copyright; online catalogs; resource sharing; electronic mail; electronic libraries; optical character recognition; microcomputers; liability issues;…

  6. An Annotated Journalism Bibliography; 1958-1968.

    ERIC Educational Resources Information Center

    Price, Warren C.; Pickett, Calder M.

    Annotated entries of 2172 books in journalism which have appeared between 1958 and 1968 comprise this volume. Materials are listed alphabetically, by author, and an index of names and subject headings is provided. General categories of entries are biographies, narratives of journalists at work, anthologies of journalistic writing, ethical and…

  7. Health Communication and Literacy: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Beveridge, Jennifer

    This annotated bibliography lists publications and World Wide Web sites dealing with health communication and literacy. The 51 publications, which were all published between 1982 and 1998, contain information about and/or for use in the following areas: assessment, assessment tools, elderly adults, empowerment, maternal and child health, patient…

  8. Revenue Producing Athletes: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Ervin, Leroy; And Others

    An annotated bibliography on revenue producing sports is presented, with attention to: Proposition 48, exploitation of athletes, legal proceedings, research related to athletes and academic performance, psychological characteristics of athletes, and counseling programs for athletes. Introductions to each of the six topics are included. The section…

  9. Annotated Psychodynamic Bibliography for Residents in Psychiatry

    PubMed Central

    CALIGOR, EVE

    1996-01-01

    The author provides an annotated bibliography to introduce psychodynamic psychotherapy and psychoanalysis to residents in psychiatry. The emphasis of the selection is on relevance to practice. The entries are grouped by topic, levels of difficulty are noted, and readings are identified as being of either current or historic relevance. PMID:22700303

  10. The Mentally Retarded Offender: Annotated Bibliography.

    ERIC Educational Resources Information Center

    Schilit, Jeffrey; And Others

    An annotated bibliography of approximately 150 books and articles on the mentally retarded offender as well as 30 nonannotated entries are provided. Topics covered include such areas as characteristics of mentally retarded delinquents, rehabilitation of the retarded offender, community services for retarded persons, rights of the mentally…

  11. The Community; A Classified, Annotated Bibliography.

    ERIC Educational Resources Information Center

    Payne, Raymond, Comp.; Bailey, Wilfrid C., Comp.

    This is a classified retrospective bibliography of 839 items on the community (about 140 are annotated) from rural sociology and agricultural economics departments and sections, agricultural experiment stations, extension services, and related agencies. Items are categorized as follows: bibliography and reference lists; location and delineation of…

  12. Chemical Principles Revisited: Annotating Reaction Equations.

    ERIC Educational Resources Information Center

    Tykodi, R. J.

    1987-01-01

    Urges chemistry teachers to have students annotate the chemical reactions in aqueous-solutions that they see in their textbooks and witness in the laboratory. Suggests this will help students recognize the reaction type more readily. Examples are given for gas formation, precipitate formation, redox interaction, acid-base interaction, and…

  13. Teleconferencing, an annotated bibliography, volume 3

    NASA Technical Reports Server (NTRS)

    Shervis, K.

    1971-01-01

    In this annotated and indexed listing of works on teleconferencing, emphasis has been placed upon teleconferencing as real-time, two way audio communication with or without visual aids. However, works on the use of television in two-way or multiway nets, data transmission, regional communications networks and on telecommunications in general are also included.

  14. The Alaska Eskimos. A Selected, Annotated Bibliography.

    ERIC Educational Resources Information Center

    Hippler, Arthur E.; Wood, John R.

    This annotated bibliography, containing approximately 732 entries, provides a general overview of English literature concerning Alaska Eskimos and cities. Although the earliest date of publication is 1843, the majority of the works have been done since 1900; there are no entries published later than 1975. Section I lists the works alphabetically…

  15. An Annotated Bibliography of Migrant Related Materials.

    ERIC Educational Resources Information Center

    Florida Atlantic Univ., Boca Raton.

    Over 1,000 annotated entries in this bibliography present a wide variety of materials related to the teaching and understanding of the migrant and culturally deprived student. Materials are divided into 6 major content areas: (1) health, (2) information on migrants and culturally disadvantaged, (3) curriculum materials, (4) guidance, (5)…

  16. Communication and Sexuality: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Buley, Jerry, Comp.; And Others

    The entries in this annotated bibliography represent books, educational journals, dissertations, popular magazines, and research studies that deal with the topic of communication and sexuality. Arranged alphabetically by author and also indexed according to subject matter, the titles span a variety of topics, including the following: sex and…

  17. Research: Annotated Bibliography of New Canadian Studies.

    ERIC Educational Resources Information Center

    Toronto Board of Education (Ontario). Research Dept.

    This annotated bibliography of twenty-one research reports that provide knowledge about various cultures and educational experiences of the major ethnic groups in the Toronto schools is designed to present information for not only special English teachers, but other school personnel as well. The bibliography consists of reports that aim to: 1)…

  18. Annotated Bibliography of Literature on Narcotic Addiction.

    ERIC Educational Resources Information Center

    Bowden, R. Renee

    Nearly 150 abstracts have been included in this annotated bibliography; its purpose has been to scan the voluminous number of documents on the problem of drug addiction in order to summarize the present state of knowledge on narcotic addiction and on methods for its treatment and control. The literature reviewed has been divided into the following…

  19. Ludwig von Mises: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Gordon, David

    A 117-item annotated bibliography of books, articles, essays, lectures, and reviews by economist Ludwig von Mises is presented. The bibliography is arranged chronologicaly, and is followed by an alphabetical listing of the citations, excluding books. An index and information on the Ludwig von Mises Institute at Auburn University (Alabama) are…

  20. Greeks in Canada (an Annotated Bibliography).

    ERIC Educational Resources Information Center

    Bombas, Leonidas C.

    This bibliography on Greeks in Canada includes annotated references to both published and (mostly) unpublished works. Among the 70 entries (arranged in alphabetical order by author) are articles, reports, papers, and theses that deal either exclusively with or include a separate section on Greeks in the various Canadian provinces. (GC)

  1. Educational Quality Indicators: Annotated Bibliography. Second Edition.

    ERIC Educational Resources Information Center

    Alberta Dept. of Education, Edmonton.

    This annotated bibliography of journal articles and documents on educational quality indicators contains approximately 230 entries arranged by the following topics: (1) indicator systems, including international, local/provincial/state, models, and national/federal systems; (2) interpretive framework (context, inputs, processes), including…

  2. Visitor Reports about Chinese Schools: Annotated Bibliography.

    ERIC Educational Resources Information Center

    Parker, Franklin

    An annotated bibliography of 77 books, journal articles, congressional reports, and conference papers all based on visits to Chinese schools by U.S. and British visitors including professional educators, teachers, government officials, historians, and lay citizens is presented. A wide range of entries includes specialized, scholarly journals and…

  3. Health Economics Research: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Dillard, Carole D.; And Others

    This annotated bibliography lists books and journal articles published since 1976 which deal with health economics and which are based on health services research supported by the National Center for Health Services Research (NCHSR). Articles prepared by NCHSR staff are listed as intramural. All other articles cite the NCHSR grant or contract…

  4. Human object annotation for surveillance video forensics

    NASA Astrophysics Data System (ADS)

    Fraz, Muhammad; Zafar, Iffat; Tzanidou, Giounona; Edirisinghe, Eran A.; Sarfraz, Muhammad Saquib

    2013-10-01

    A system that can automatically annotate surveillance video in a manner useful for locating a person with a given description of clothing is presented. Each human is annotated based on two appearance features: primary colors of clothes and the presence of text/logos on clothes. The annotation occurs after a robust foreground extraction stage employing a modified Gaussian mixture model-based approach. The proposed pipeline consists of a preprocessing stage where color appearance of an image is improved using a color constancy algorithm. In order to annotate color information for human clothes, we use the color histogram feature in HSV space and find local maxima to extract dominant colors for different parts of a segmented human object. To detect text/logos on clothes, we begin with the extraction of connected components of enhanced horizontal, vertical, and diagonal edges in the frames. These candidate regions are classified as text or nontext on the basis of their local energy-based shape histogram features. Further, to detect humans, a novel technique has been proposed that uses contourlet transform-based local binary pattern (CLBP) features. In the proposed method, we extract the uniform direction invariant LBP feature descriptor for contourlet transformed high-pass subimages from vertical and diagonal directional bands. In the final stage, extracted CLBP descriptors are classified by a trained support vector machine. Experimental results illustrate the superiority of our method on large-scale surveillance video data.

  5. Women and World Development: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Buvinic, Mayra; And Others

    This annotated bibliography focuses on the effects of socioeconomic development and cultural change on women and on women's reactions to these changes. It is an expanded version of one which was prepared for the American Association of Science Seminar on Women in Development held in Mexico City in June 1975. The objectives were to disseminate this…

  6. Food for Thought: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Bennett, Susan G., Ed.

    Most of the 24 books reviewed in this annotated bibliography concern writing and are recent publications (1980-1985). Titles and authors are as follows: "Teacher" (Sylvia Ashton-Warner); "What Did I Write? Beginning Writing Behavior" (Marie M. Clay); "Composing: Writing as a Self-Creating Process" (William E. Coles); "Right Brain...Write On!…

  7. People: Annotated Multiethnic Bibliography K-12.

    ERIC Educational Resources Information Center

    Gilmore, Dolores D., Comp.; Petrie, Kenneth, Comp.

    This annotated bibliography has been compiled to assist personnel in the selection of multiethnic media for schools. The bibliography includes sections entitled "Asian Americans,""Jewish Americans,""Mexican Americans,""Native Americans,""Puerto Rican Americans,""Other Hyphenated Americans," and "All Americans (Multiethnic)." The entries for the…

  8. Statistical mechanics of ontology based annotations

    NASA Astrophysics Data System (ADS)

    Hoyle, David C.; Brass, Andrew

    2016-01-01

    We present a statistical mechanical theory of the process of annotating an object with terms selected from an ontology. The term selection process is formulated as an ideal lattice gas model, but in a highly structured inhomogeneous field. The model enables us to explain patterns recently observed in real-world annotation data sets, in terms of the underlying graph structure of the ontology. By relating the external field strengths to the information content of each node in the ontology graph, the statistical mechanical model also allows us to propose a number of practical metrics for assessing the quality of both the ontology, and the annotations that arise from its use. Using the statistical mechanical formalism we also study an ensemble of ontologies of differing size and complexity; an analysis not readily performed using real data alone. Focusing on regular tree ontology graphs we uncover a rich set of scaling laws describing the growth in the optimal ontology size as the number of objects being annotated increases. In doing so we provide a further possible measure for assessment of ontologies.

  9. Annotated Bibliography of Products/Materials.

    ERIC Educational Resources Information Center

    Lee, Carolyn S., Comp.; Jennings, Mark B., Comp.; Mayo, Linda P., Comp.; Young, Debra A., Comp.

    This document, which is intended for teachers, program directors, researchers, businesspeople, and students, is an annotated bibliography of more than 600 programs and resources that were developed with funds from the Office of Vocational and Adult Education in fiscal years 1987-1998. The document is divided into two parts. Part 1 is a summary of…

  10. Anthropology and Education: An Annotated Bibliographic Guide.

    ERIC Educational Resources Information Center

    Burnett, Jaquetta H.; And Others

    References in this annotated bibliography concentrate on anthropological research concerning formal and informal education. The bibliography is selective, and the criteria are guided primarily by four questions: What basic concepts oriented the writer? What was done? How was it done? What was the disciplinary, or cultural, identity of the person…

  11. La Mujer Chicana: An Annotated Bibliography, 1976.

    ERIC Educational Resources Information Center

    Chapa, Evey, Ed.; And Others

    Intended to provide interested persons, researchers, and educators with information about "la mujer Chicana", this annotated bibliography cites 320 materials published between 1916 and 1975, with the majority being between 1960 and 1975. The 12 sections cover the following subject areas: Chicana publications; Chicana feminism and "el movimiento";…

  12. An Annotated Bibliography of Small Town Research.

    ERIC Educational Resources Information Center

    Smith, Suzanne M.

    The purpose of this annotated bibliography is to list books, articles, and bulletins (written from 1900 to 1968) related to small towns in the United States. The work contributes to the project "Population Changes in Small Towns," sponsored by the Division of Social Sciences of the National Science Foundation and by the University of Wisconsin…

  13. Annotated Bibliography of Special Education Instructional Materials.

    ERIC Educational Resources Information Center

    Cook, Iva Dean, Comp.

    The annotated bibliography lists approximately 900 commercially prepared materials available for statewide distribution from the West Virginia College of Graduate Studies Special Education Instructional Materials Center (WEIMC) for use in teaching educable (EMR) and trainable mentally retarded (TMR) students. Materials are grouped under subject…

  14. Project for Global Education: Annotated Bibliography.

    ERIC Educational Resources Information Center

    Institute for World Order, New York, NY.

    Over 260 books, textbooks, articles, pamphlets, periodicals, films, and multi-media packages appropriate for the analysis of global issues at the college level are briefly annotated. Entries include classic books and articles as well as a number of recent (1976-1981) publications. The purpose is to assist students and educators in developing a…

  15. Children and Poetry: A Selective, Annotated Bibliography.

    ERIC Educational Resources Information Center

    Haviland, Virginia; Smith, William Jay

    This annotated bibliography of over 120 books was compiled to call attention to poetry for children that is both pleasing and rewarding. Omitted are traditional materials such as Mother Goose rhymes, textbooks, and collections designed especially for the classroom. Sample illustrations from the books noted and lines from poems are reproduced…

  16. Bibliografia de Aztlan: An Annotated Chicano Bibliography.

    ERIC Educational Resources Information Center

    Barrios, Ernie, Ed.

    More than 300 books and articles published from 1920 to 1971 are reviewed in this annotated bibliography of literature on the Chicano. The citations and reviews are categorized by subject area and deal with contemporary Chicano history, education, health, history of Mexico, literature, native Americans, philosophy, political science, pre-Columbian…

  17. Sexually Transmitted Diseases: A Selective, Annotated Bibliography.

    ERIC Educational Resources Information Center

    Planned Parenthood Federation of America, Inc., New York, NY. Education Dept.

    This document contains a reference sheet and an annotated bibliography concerned with sexually transmitted diseases (STD). The reference sheet provides a brief, accurate overview of STDs which includes both statistical and background information. The bibliography contains 83 entries, listed alphabetically, that deal with STDs. Books and articles…

  18. Postsecondary Peer Cooperative Learning Programs: Annotated Bibliography

    ERIC Educational Resources Information Center

    Arendale, David R., Comp.

    2005-01-01

    Purpose: This annotated bibliography is focused intentionally on postsecondary peer cooperative learning programs that increasing student achievement. Peer learning has been popular in education for decades. As both a pedagogy and learning strategy, it has been frequently adapted for a wide range of academic content areas at the elementary,…

  19. An Annotated Bibliography of Latino Educational Research

    ERIC Educational Resources Information Center

    Baumann, Paul; Cabrera, Alberto; Swail, Watson Scott

    2007-01-01

    This bibliography lists and provides annotations for 59 recent research studies on a variety of Latino educational issues. Descriptions of the focus of each item, as well as implications for policy and practice are provided. Items range in publication date from 1993 to 2007. [This document was compiled by the Educational Policy Institute in…

  20. MEETING: Chlamydomonas Annotation Jamboree - October 2003

    SciTech Connect

    Grossman, Arthur R

    2007-04-13

    Shotgun sequencing of the nuclear genome of Chlamydomonas reinhardtii (Chlamydomonas throughout) was performed at an approximate 10X coverage by JGI. Roughly half of the genome is now contained on 26 scaffolds, all of which are at least 1.6 Mb, and the coverage of the genome is ~95%. There are now over 200,000 cDNA sequence reads that we have generated as part of the Chlamydomonas genome project (Grossman, 2003; Shrager et al., 2003; Grossman et al. 2007; Merchant et al., 2007); other sequences have also been generated by the Kasuza sequence group (Asamizu et al., 1999; Asamizu et al., 2000) or individual laboratories that have focused on specific genes. Shrager et al. (2003) placed the reads into distinct contigs (an assemblage of reads with overlapping nucleotide sequences), and contigs that group together as part of the same genes have been designated ACEs (assembly of contigs generated from EST information). All of the reads have also been mapped to the Chlamydomonas nuclear genome and the cDNAs and their corresponding genomic sequences have been reassembled, and the resulting assemblage is called an ACEG (an Assembly of contiguous EST sequences supported by genomic sequence) (Jain et al., 2007). Most of the unique genes or ACEGs are also represented by gene models that have been generated by the Joint Genome Institute (JGI, Walnut Creek, CA). These gene models have been placed onto the DNA scaffolds and are presented as a track on the Chlamydomonas genome browser associated with the genome portal (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). Ultimately, the meeting grant awarded by DOE has helped enormously in the development of an annotation pipeline (a set of guidelines used in the annotation of genes) and resulted in high quality annotation of over 4,000 genes; the annotators were from both Europe and the USA. Some of the people who led the annotation initiative were Arthur Grossman, Olivier Vallon, and Sabeeha Merchant (with many individual

  1. Computer systems for annotation of single molecule fragments

    DOEpatents

    Schwartz, David Charles; Severin, Jessica

    2016-07-19

    There are provided computer systems for visualizing and annotating single molecule images. Annotation systems in accordance with this disclosure allow a user to mark and annotate single molecules of interest and their restriction enzyme cut sites thereby determining the restriction fragments of single nucleic acid molecules. The markings and annotations may be automatically generated by the system in certain embodiments and they may be overlaid translucently onto the single molecule images. An image caching system may be implemented in the computer annotation systems to reduce image processing time. The annotation systems include one or more connectors connecting to one or more databases capable of storing single molecule data as well as other biomedical data. Such diverse array of data can be retrieved and used to validate the markings and annotations. The annotation systems may be implemented and deployed over a computer network. They may be ergonomically optimized to facilitate user interactions.

  2. Data annotation of aerial reconnaissance imagery and exploitation

    NASA Astrophysics Data System (ADS)

    Wareberg, P. Gunnar; Prunes, V.; Scholes, Richard W.

    1995-09-01

    This paper reviews the use of LED recording head assemblies (RHAs) for film annotation in aerial reconnaissance cameras and discusses code matrix block readers (CMBRs). Annotation of video imagery is also covered.

  3. Representing annotation compositionality and provenance for the Semantic Web

    PubMed Central

    2013-01-01

    Background Though the annotation of digital artifacts with metadata has a long history, the bulk of that work focuses on the association of single terms or concepts to single targets. As annotation efforts expand to capture more complex information, annotations will need to be able to refer to knowledge structures formally defined in terms of more atomic knowledge structures. Existing provenance efforts in the Semantic Web domain primarily focus on tracking provenance at the level of whole triples and do not provide enough detail to track how individual triple elements of annotations were derived from triple elements of other annotations. Results We present a task- and domain-independent ontological model for capturing annotations and their linkage to their denoted knowledge representations, which can be singular concepts or more complex sets of assertions. We have implemented this model as an extension of the Information Artifact Ontology in OWL and made it freely available, and we show how it can be integrated with several prominent annotation and provenance models. We present several application areas for the model, ranging from linguistic annotation of text to the annotation of disease-associations in genome sequences. Conclusions With this model, progressively more complex annotations can be composed from other annotations, and the provenance of compositional annotations can be represented at the annotation level or at the level of individual elements of the RDF triples composing the annotations. This in turn allows for progressively richer annotations to be constructed from previous annotation efforts, the precise provenance recording of which facilitates evidence-based inference and error tracking. PMID:24268021

  4. VideoANT: Extending Online Video Annotation beyond Content Delivery

    ERIC Educational Resources Information Center

    Hosack, Bradford

    2010-01-01

    This paper expands the boundaries of video annotation in education by outlining the need for extended interaction in online video use, identifying the challenges faced by existing video annotation tools, and introducing Video-ANT, a tool designed to create text-based annotations integrated within the time line of a video hosted online. Several…

  5. Large-scale annotation of small-molecule libraries using public databases.

    PubMed

    Zhou, Yingyao; Zhou, Bin; Chen, Kaisheng; Yan, S Frank; King, Frederick J; Jiang, Shumei; Winzeler, Elizabeth A

    2007-01-01

    While many large publicly accessible databases provide excellent annotation for biological macromolecules, the same is not true for small chemical compounds. Commercial data sources also fail to encompass an annotation interface for large numbers of compounds and tend to be cost prohibitive to be widely available to biomedical researchers. Therefore, using annotation information for the selection of lead compounds from a modern day high-throughput screening (HTS) campaign presently occurs only under a very limited scale. The recent rapid expansion of the NIH PubChem database provides an opportunity to link existing biological databases with compound catalogs and provides relevant information that potentially could improve the information garnered from large-scale screening efforts. Using the 2.5 million compound collection at the Genomics Institute of the Novartis Research Foundation (GNF) as a model, we determined that approximately 4% of the library contained compounds with potential annotation in such databases as PubChem and the World Drug Index (WDI) as well as related databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) and ChemIDplus. Furthermore, the exact structure match analysis showed 32% of GNF compounds can be linked to third party databases via PubChem. We also showed annotations such as MeSH (medical subject headings) terms can be applied to in-house HTS databases in identifying signature biological inhibition profiles of interest as well as expediting the assay validation process. The automated annotation of thousands of screening hits in batch is becoming feasible and has the potential to play an essential role in the hit-to-lead decision making process.

  6. Model and Interoperability using Meta Data Annotations

    NASA Astrophysics Data System (ADS)

    David, O.

    2011-12-01

    Software frameworks and architectures are in need for meta data to efficiently support model integration. Modelers have to know the context of a model, often stepping into modeling semantics and auxiliary information usually not provided in a concise structure and universal format, consumable by a range of (modeling) tools. XML often seems the obvious solution for capturing meta data, but its wide adoption to facilitate model interoperability is limited by XML schema fragmentation, complexity, and verbosity outside of a data-automation process. Ontologies seem to overcome those shortcomings, however the practical significance of their use remains to be demonstrated. OMS version 3 took a different approach for meta data representation. The fundamental building block of a modular model in OMS is a software component representing a single physical process, calibration method, or data access approach. Here, programing language features known as Annotations or Attributes were adopted. Within other (non-modeling) frameworks it has been observed that annotations lead to cleaner and leaner application code. Framework-supported model integration, traditionally accomplished using Application Programming Interfaces (API) calls is now achieved using descriptive code annotations. Fully annotated components for various hydrological and Ag-system models now provide information directly for (i) model assembly and building, (ii) data flow analysis for implicit multi-threading or visualization, (iii) automated and comprehensive model documentation of component dependencies, physical data properties, (iv) automated model and component testing, calibration, and optimization, and (v) automated audit-traceability to account for all model resources leading to a particular simulation result. Such a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework but a strong reference to its originating code. Since models and

  7. Annotations and the Collaborative Digital Library: Effects of an Aligned Annotation Interface on Student Argumentation and Reading Strategies

    ERIC Educational Resources Information Center

    Wolfe, Joanna

    2008-01-01

    Recent research on annotation interfaces provides provocative evidence that anchored, annotation-based discussion environments may lead to better conversations about a text. However, annotation interfaces raise complicated tradeoffs regarding screen real estate and positioning. It is argued that solving this screen real estate problem requires…

  8. AnnoTALE: bioinformatics tools for identification, annotation, and nomenclature of TALEs from Xanthomonas genomic sequences.

    PubMed

    Grau, Jan; Reschke, Maik; Erkes, Annett; Streubel, Jana; Morgan, Richard D; Wilson, Geoffrey G; Koebnik, Ralf; Boch, Jens

    2016-01-01

    Transcription activator-like effectors (TALEs) are virulence factors, produced by the bacterial plant-pathogen Xanthomonas, that function as gene activators inside plant cells. Although the contribution of individual TALEs to infectivity has been shown, the specific roles of most TALEs, and the overall TALE diversity in Xanthomonas spp. is not known. TALEs possess a highly repetitive DNA-binding domain, which is notoriously difficult to sequence. Here, we describe an improved method for characterizing TALE genes by the use of PacBio sequencing. We present 'AnnoTALE', a suite of applications for the analysis and annotation of TALE genes from Xanthomonas genomes, and for grouping similar TALEs into classes. Based on these classes, we propose a unified nomenclature for Xanthomonas TALEs that reveals similarities pointing to related functionalities. This new classification enables us to compare related TALEs and to identify base substitutions responsible for the evolution of TALE specificities. PMID:26876161

  9. How well are protein structures annotated in secondary databases?

    PubMed

    Rother, Kristian; Michalsky, Elke; Leser, Ulf

    2005-09-01

    We investigated to what extent Protein Data Bank (PDB) entries are annotated with second-party information based on existing cross-references between PDB and 15 other databases. We report 2 interesting findings. First, there is a clear "annotation gap" for structures less than 7 years old for secondary databases that are manually curated. Second, the examined databases overlap with each other quite well, dividing the PDB into 2 well-annotated thirds and one poorly annotated third. Both observations should be taken into account in any study depending on the selection of protein structures by their annotation.

  10. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  11. Cognition inspired framework for indoor scene annotation

    NASA Astrophysics Data System (ADS)

    Ye, Zhipeng; Liu, Peng; Zhao, Wei; Tang, Xianglong

    2015-09-01

    We present a simple yet effective scene annotation framework based on a combination of bag-of-visual words (BoVW), three-dimensional scene structure estimation, scene context, and cognitive theory. From a macroperspective, the proposed cognition-based hybrid motivation framework divides the annotation problem into empirical inference and real-time classification. Inspired by the inference ability of human beings, common objects of indoor scenes are defined for experience-based inference, while in the real-time classification stage, an improved BoVW-based multilayer abstract semantics labeling method is proposed by introducing abstract semantic hierarchies to narrow the semantic gap and improve the performance of object categorization. The proposed framework was evaluated on a variety of common data sets and experimental results proved its effectiveness.

  12. Web-based Video Annotation and its Applications

    NASA Astrophysics Data System (ADS)

    Yamamoto, Daisuke; Nagao, Katashi

    In this paper, we developed a Web-based video annotation system, named iVAS (intelligent Video Annotation Server). Audiences can associate any video content on the Internet with annotations. The system analyzes video content in order to acquire cut/shot information and color histograms. And it also automatically generates a Web page for editing annotations. Then, audiences can create annotation data by two methods. The first one helps the users to create text data such as person/object names, scene descriptions, and comments interactively. The second method facilitates the users associating any video fragments with their subjective impression by just clicking a mouse button. The generated annotation data are accumulated and managed by an XML database connected with iVAS. We also developed some application systems based on annotations such as video retrieval, video simplification, and video-content-based community support. One of the major advantages of our approach is easy integration of hand-coded and automatically-generated (such as color histograms and cut/shot information) annotations. Additionally, since our annotation system is open for public, we must consider some reliability or correctness of annotation data. We also developed an automatic evaluation method of annotation reliability using the users' feedback. In the future, these fundamental technologies will contribute to the formation of new communities centered around video content.

  13. Deburring: an annotated bibliography. Volume VI

    SciTech Connect

    Gillespie, L.K.

    1980-07-01

    An annotated summary of 138 articles and publications on burrs, burr prevention and deburring is presented. Thirty-seven deburring processes are listed. Entries cited include English, Russian, French, Japanese, and German language articles. Entries are indexed by deburring processes, author, and language. Indexes also indicate which references discuss equipment and tooling, how to use a proces economics, burr properties, and how to design to minimize burr problems. Research studies are identified as are the materials deburred.

  14. Evolution of viral virulence: empirical studies

    USGS Publications Warehouse

    Kurath, Gael; Wargo, Andrew R.

    2016-01-01

    The concept of virulence as a pathogen trait that can evolve in response to selection has led to a large body of virulence evolution theory developed in the 1980-1990s. Various aspects of this theory predict increased or decreased virulence in response to a complex array of selection pressures including mode of transmission, changes in host, mixed infection, vector-borne transmission, environmental changes, host vaccination, host resistance, and co-evolution of virus and host. A fundamental concept is prediction of trade-offs between the costs and benefits associated with higher virulence, leading to selection of optimal virulence levels. Through a combination of observational and experimental studies, including experimental evolution of viruses during serial passage, many of these predictions have now been explored in systems ranging from bacteriophage to viruses of plants, invertebrates, and vertebrate hosts. This chapter summarizes empirical studies of viral virulence evolution in numerous diverse systems, including the classic models myxomavirus in rabbits, Marek's disease virus in chickens, and HIV in humans. Collectively these studies support some aspects of virulence evolution theory, suggest modifications for other aspects, and show that predictions may apply in some virus:host interactions but not in others. Finally, we consider how virulence evolution theory applies to disease management in the field.

  15. Virulence Evolution Within the Ug99 Lineage

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Race TTKSK (syn. Ug99) of Puccinia graminis f. sp. tritici, recognized for possessing virulence to the stem rust resistance gene Sr31, was first identified in Uganda in 1998. Since then, TTKSK has been identified in Kenya in 2005 and Yemen in 2006. In addition to virulence to Sr31, race TTKSK was ...

  16. Pathogenicity islands and virulence evolution in Listeria.

    PubMed

    Vázquez-Boland, J A; Domínguez-Bernal, G; González-Zorn, B; Kreft, J; Goebel, W

    2001-06-01

    As in other bacterial pathogens, the virulence determinants of Listeria species are clustered in genomic islands scattered along the chromosome. This review summarizes current knowledge about the structure, distribution and role in pathogenesis of Listeria virulence loci. Hypotheses about the mode of acquisition and evolution of these loci in this group of Gram-positive bacteria are presented and discussed.

  17. Identification of new sub-genotypes of virulent Newcastle disease virus with potential panzootic features

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Strains of virulent Newcastle disease virus (NDV) with epizootic characteristics are rapidly spreading through Asia and the Middle East causing outbreaks of Newcastle disease (ND). Significant illness and mortality in vaccinated poultry caused by highly related viruses of new sub-genotypes within ge...

  18. Enzyme reaction annotation using cloud techniques.

    PubMed

    Huang, Chuan-Ching; Lin, Chun-Yuan; Chang, Cheng-Wen; Tang, Chuan Yi

    2013-01-01

    An understanding of the activities of enzymes could help to elucidate the metabolic pathways of thousands of chemical reactions that are catalyzed by enzymes in living systems. Sophisticated applications such as drug design and metabolic reconstruction could be developed using accurate enzyme reaction annotation. Because accurate enzyme reaction annotation methods create potential for enhanced production capacity in these applications, they have received greater attention in the global market. We propose the enzyme reaction prediction (ERP) method as a novel tool to deduce enzyme reactions from domain architecture. We used several frequency relationships between architectures and reactions to enhance the annotation rates for single and multiple catalyzed reactions. The deluge of information which arose from high-throughput techniques in the postgenomic era has improved our understanding of biological data, although it presents obstacles in the data-processing stage. The high computational capacity provided by cloud computing has resulted in an exponential growth in the volume of incoming data. Cloud services also relieve the requirement for large-scale memory space required by this approach to analyze enzyme kinetic data. Our tool is designed as a single execution file; thus, it could be applied to any cloud platform in which multiple queries are supported.

  19. UCSC Data Integrator and Variant Annotation Integrator

    PubMed Central

    Hinrichs, Angie S.; Raney, Brian J.; Speir, Matthew L.; Rhead, Brooke; Casper, Jonathan; Karolchik, Donna; Kuhn, Robert M.; Rosenbloom, Kate R.; Zweig, Ann S.; Haussler, David; Kent, W. James

    2016-01-01

    Summary: Two new tools on the UCSC Genome Browser web site provide improved ways of combining information from multiple datasets, optionally including the user's own custom track data and/or data from track hubs. The Data Integrator combines columns from multiple data tracks, showing all items from the first track along with overlapping items from the other tracks. The Variant Annotation Integrator is tailored to adding functional annotations to variant calls; it offers a more restricted set of underlying data tracks but adds predictions of each variant's consequences for any overlapping or nearby gene transcript. When available, it optionally adds additional annotations including effect prediction scores from dbNSFP for missense mutations, ENCODE regulatory summary tracks and conservation scores. Availability and implementation: The web tools are freely available at http://genome.ucsc.edu/ and the underlying database is available for download at http://hgdownload.cse.ucsc.edu/. The software (written in C and Javascript) is available from https://genome-store.ucsc.edu/ and is freely available for academic and non-profit usage; commercial users must obtain a license. Contact: angie@soe.ucsc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26740527

  20. Jannovar: a java library for exome annotation.

    PubMed

    Jäger, Marten; Wang, Kai; Bauer, Sebastian; Smedley, Damian; Krawitz, Peter; Robinson, Peter N

    2014-05-01

    Transcript-based annotation and pedigree analysis are two basic steps in the computational analysis of whole-exome sequencing experiments in genetic diagnostics and disease-gene discovery projects. Here, we present Jannovar, a stand-alone Java application as well as a Java library designed to be used in larger software frameworks for exome and genome analysis. Jannovar uses an interval tree to identify all transcripts affected by a given variant, and provides Human Genome Variation Society-compliant annotations both for variants affecting coding sequences and splice junctions as well as untranslated regions and noncoding RNA transcripts. Jannovar can also perform family-based pedigree analysis with Variant Call Format (VCF) files with data from members of a family segregating a Mendelian disorder. Using a desktop computer, Jannovar requires a few seconds to annotate a typical VCF file with exome data. Jannovar is freely available under the BSD2 license. Source code as well as the Java application and library file can be downloaded from http://compbio.charite.de (with tutorial) and https://github.com/charite/jannovar. PMID:24677618

  1. GAMOLA: a new local solution for sequence annotation and analyzing draft and finished prokaryotic genomes.

    PubMed

    Altermann, Eric; Klaenhammer, Todd R

    2003-01-01

    Laboratories working with draft phase genomes have specific software needs, such as the unattended processing of hundreds of single scaffolds and subsequent sequence annotation. In addition, it is critical to follow the "movement" and the manual annotation of single open reading frames (ORFs) within the successive sequence updates. Even with finished genomes, regular database updates can lead to significant changes in the annotation of single ORFs. In functional genomics it is important to mine data and identify new genetic targets rapidly and easily. Often there is no need for sophisticated relational databases (RDB) that greatly reduce the system-independent access of the results. Another aspect is the internet dependency of most software packages. If users are working with confidential data, this dependency poses a security issue. GAMOLA was designed to handle the numerous scaffolds and changing contents of draft phase genomes in an automated process and stores the results for each predicted ORF in flatfile databases. In addition, annotation transfers, ORF designation tracking, Blast comparisons, and primer design for whole genome microarrays have been implemented. The software is available under the license of North Carolina State University. A website and a downloadable example are accessible under (http://fsweb2.schaub. ncsu.edu/TRKwebsite/index.htm). PMID:14506845

  2. Global profiling of Shewanella oneidensis MR-1: expression of hypothetical genes and improved functional annotations.

    PubMed

    Kolker, Eugene; Picone, Alex F; Galperin, Michael Y; Romine, Margaret F; Higdon, Roger; Makarova, Kira S; Kolker, Natali; Anderson, Gordon A; Qiu, Xiaoyun; Auberry, Kenneth J; Babnigg, Gyorgy; Beliaev, Alex S; Edlefsen, Paul; Elias, Dwayne A; Gorby, Yuri A; Holzman, Ted; Klappenbach, Joel A; Konstantinidis, Konstantinos T; Land, Miriam L; Lipton, Mary S; McCue, Lee-Ann; Monroe, Matthew; Pasa-Tolic, Ljiljana; Pinchuk, Grigoriy; Purvine, Samuel; Serres, Margrethe H; Tsapin, Sasha; Zakrajsek, Brian A; Zhu, Wenhong; Zhou, Jizhong; Larimer, Frank W; Lawrence, Charles E; Riley, Monica; Collart, Frank R; Yates, John R; Smith, Richard D; Giometti, Carol S; Nealson, Kenneth H; Fredrickson, James K; Tiedje, James M

    2005-02-01

    The gamma-proteobacterium Shewanella oneidensis strain MR-1 is a metabolically versatile organism that can reduce a wide range of organic compounds, metal ions, and radionuclides. Similar to most other sequenced organisms, approximately 40% of the predicted ORFs in the S. oneidensis genome were annotated as uncharacterized "hypothetical" genes. We implemented an integrative approach by using experimental and computational analyses to provide more detailed insight into gene function. Global expression profiles were determined for cells after UV irradiation and under aerobic and suboxic growth conditions. Transcriptomic and proteomic analyses confidently identified 538 hypothetical genes as expressed in S. oneidensis cells both as mRNAs and proteins (33% of all predicted hypothetical proteins). Publicly available analysis tools and databases and the expression data were applied to improve the annotation of these genes. The annotation results were scored by using a seven-category schema that ranked both confidence and precision of the functional assignment. We were able to identify homologs for nearly all of these hypothetical proteins (97%), but could confidently assign exact biochemical functions for only 16 proteins (category 1; 3%). Altogether, computational and experimental evidence provided functional assignments or insights for 240 more genes (categories 2-5; 45%). These functional annotations advance our understanding of genes involved in vital cellular processes, including energy conversion, ion transport, secondary metabolism, and signal transduction. We propose that this integrative approach offers a valuable means to undertake the enormous challenge of characterizing the rapidly growing number of hypothetical proteins with each newly sequenced genome. PMID:15684069

  3. dictyBase 2015: Expanding data and annotations in a new software environment

    PubMed Central

    Jimenez-Morales, David; Dodson, Robert J.; Chisholm, Rex L.

    2015-01-01

    dictyBase is the model organism database for the social amoeba Dictyostelium discoideum and related species. The primary mission of dictyBase is to provide the biomedical research community with well-integrated high quality data, and tools that enable original research. Data presented at dictyBase is obtained from sequencing centers, groups performing high throughput experiments such as large-scale mutagenesis studies, and RNAseq data, as well as a growing number of manually added functional gene annotations from the published literature, including Gene Ontology, strain, and phenotype annotations. Through the Dicty Stock Center we provide the community with an impressive amount of annotated strains and plasmids. Recently dictyBase accomplished a major overhaul to adapt an outdated infrastructure to the current technological advances, thus facilitating the implementation of innovative tools and comparative genomics. It also provides new strategies for high quality annotations that enable bench researchers to benefit from the rapidly increasing volume of available data. dictyBase is highly responsive to its users needs, building a successful relationship that capitalizes on the vast efforts of the Dictyostelium research community. dictyBase has become the trusted data resource for Dictyostelium investigators, other investigators or organizations seeking information about Dictyostelium, as well as educators who use this model system. PMID:26088819

  4. Global profiling of Shewanella oneidensis MR-1: Expression of hypothetical genes and improved functional annotations

    SciTech Connect

    Picone, Alex F.; Galperin, Michael Y.; Romine, Margaret; Higdon, Roger; Makarova, Kira S.; Kolker, Natali; Anderson, Gordon A; Qiu, Xiaoyun; Babnigg, Gyorgy; Beliaev, Alexander S; Edlefsen, Paul; Elias, Dwayne A.; Gorby, Dr. Yuri A.; Holzman, Ted; Klappenbach, Joel; Konstantinidis, Konstantinos T; Land, Miriam L; Lipton, Mary S.; McCue, Lee Ann; Monroe, Matthew; Pasa-Tolic, Ljiljana; Pinchuk, Grigoriy; Purvine, Samuel; Serres, Margrethe H.; Tsapin, Sasha; Zakrajsek, Brian A.; Zhu, Wenguang; Zhou, Jizhong; Larimer, Frank W; Lawrence, Charles E.; Riley, Monica; Collart, Frank; YatesIII, John R.; Smith, Richard D.; Nealson, Kenneth H.; Fredrickson, James K; Tiedje, James M.

    2005-01-01

    The gamma-proteobacterium Shewanella oneidensis strain MR-1 is a metabolically versatile organism that can reduce a wide range of organic compounds, metal ions, and radionuclides. Similar to most other sequenced organisms, approximate to40% of the predicted ORFs in the S. oneidensis genome were annotated as uncharacterized "hypothetical" genes. We implemented an integrative approach by using experimental and computational analyses to provide more detailed insight into gene function. Global expression profiles were determined for cells after UV irradiation and under aerobic and suboxic growth conditions. Transcriptomic and proteomic analyses confidently identified 538 hypothetical genes as expressed in S. oneidensis cells both as mRNAs and proteins (33% of all predicted hypothetical proteins). Publicly available analysis tools and databases and the expression data were applied to improve the annotation of these genes. The annotation results were scored by using a seven-category schema that ranked both confidence and precision of the functional assignment. We were able to identify homologs for nearly all of these hypothetical proteins (97%), but could confidently assign exact biochemical functions for only 16 proteins (category 1; 3%). Altogether, computational and experimental evidence provided functional assignments or insights for 240 more genes (categories 2-5; 45%). These functional annotations advance our understanding of genes involved in vital cellular processes, including energy conversion, ion transport, secondary metabolism, and signal transduction. We propose that this integrative approach offers a valuable means to undertake the enormous challenge of characterizing the rapidly growing number of hypothetical proteins with each newly sequenced genome.

  5. eggNOG: automated construction and annotation of orthologous groups of genes.

    PubMed

    Jensen, Lars Juhl; Julien, Philippe; Kuhn, Michael; von Mering, Christian; Muller, Jean; Doerks, Tobias; Bork, Peer

    2008-01-01

    The identification of orthologous genes forms the basis for most comparative genomics studies. Existing approaches either lack functional annotation of the identified orthologous groups, hampering the interpretation of subsequent results, or are manually annotated and thus lag behind the rapid sequencing of new genomes. Here we present the eggNOG database ('evolutionary genealogy of genes: Non-supervised Orthologous Groups'), which contains orthologous groups constructed from Smith-Waterman alignments through identification of reciprocal best matches and triangular linkage clustering. Applying this procedure to 312 bacterial, 26 archaeal and 35 eukaryotic genomes yielded 43 582 course-grained orthologous groups of which 9724 are extended versions of those from the original COG/KOG database. We also constructed more fine-grained groups for selected subsets of organisms, such as the 19 914 mammalian orthologous groups. We automatically annotated our non-supervised orthologous groups with functional descriptions, which were derived by identifying common denominators for the genes based on their individual textual descriptions, annotated functional categories, and predicted protein domains. The orthologous groups in eggNOG contain 1 241 751 genes and provide at least a broad functional description for 77% of them. Users can query the resource for individual genes via a web interface or download the complete set of orthologous groups at http://eggnog.embl.de.

  6. dictyBase 2015: Expanding data and annotations in a new software environment.

    PubMed

    Basu, Siddhartha; Fey, Petra; Jimenez-Morales, David; Dodson, Robert J; Chisholm, Rex L

    2015-08-01

    dictyBase is the model organism database for the social amoeba Dictyostelium discoideum and related species. The primary mission of dictyBase is to provide the biomedical research community with well-integrated high quality data, and tools that enable original research. Data presented at dictyBase is obtained from sequencing centers, groups performing high throughput experiments such as large-scale mutagenesis studies, and RNAseq data, as well as a growing number of manually added functional gene annotations from the published literature, including Gene Ontology, strain, and phenotype annotations. Through the Dicty Stock Center we provide the community with an impressive amount of annotated strains and plasmids. Recently, dictyBase accomplished a major overhaul to adapt an outdated infrastructure to the current technological advances, thus facilitating the implementation of innovative tools and comparative genomics. It also provides new strategies for high quality annotations that enable bench researchers to benefit from the rapidly increasing volume of available data. dictyBase is highly responsive to its users needs, building a successful relationship that capitalizes on the vast efforts of the Dictyostelium research community. dictyBase has become the trusted data resource for Dictyostelium investigators, other investigators or organizations seeking information about Dictyostelium, as well as educators who use this model system.

  7. Proteomic Characterization of Yersinia pestis Virulence

    SciTech Connect

    Chromy, B; Murphy, G; Gonzales, A; Fitch, J P; McCutchen-Maloney, S L

    2005-01-05

    Yersinia pestis, the etiological agent of plague, functions via the Type III secretion mechanism whereby virulence factors are induced upon interactions with a mammalian host. Here, the Y. pestis proteome was studied by two-dimensional differential gel electrophoresis (2-D DIGE) under physiologically relevant growth conditions mimicking the calcium concentrations and temperatures that the pathogen would encounter in the flea vector and upon interaction with the mammalian host. Over 4100 individual protein spots were detected of which hundreds were differentially expressed in the entire comparative experiment. A total of 43 proteins that were differentially expressed between the vector and host growth conditions were identified by mass spectrometry. Expected differences in expression were observed for several known virulence factors including catalase-peroxidase (KatY), murine toxin (Ymt), plasminogen activator (Pla), and F1 capsule antigen (Caf1), as well as putative virulence factors. Chaperone proteins and signaling molecules hypothesized to be involved in virulence due to their role in Type III secretion were also identified. Other differentially expressed proteins not previously reported to contribute to virulence are candidates for more detailed mechanistic studies, representing potential new virulence determinants. For example, several sugar metabolism proteins were differentially regulated in response to lower calcium and higher temperature, suggesting these proteins, while not directly connected to virulence, either represent a metabolic switch for survival in the host environment or may facilitate production of virulence factors. Results presented here contribute to a more thorough understanding of the virulence mechanism of Y. pestis through proteomic characterization of the pathogen under induced virulence.

  8. Gene Characterization Index: Assessing the Depth of Gene Annotation

    PubMed Central

    Yusuf, Dimas; Brumm, Jochen; Cheung, Warren; Wahlestedt, Claes; Lenhard, Boris; Wasserman, Wyeth W.

    2008-01-01

    Background We introduce the Gene Characterization Index, a bioinformatics method for scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets. Methodology/Principal Findings The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation. Conclusions/Significance The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from http://cisreg.ca/gci/. PMID:18213364

  9. CMAS: a rich media annotation system for medical imaging

    NASA Astrophysics Data System (ADS)

    Lin, I.-Jong; Chao, Hui

    2006-03-01

    We have developed the CMAS system (Collaborative Medical Annotation System) so that medical professionals will be able to easily annotate digital medical records that contain medical imaging or procedure videos. The CMAS system enables a non-technical person to annotate a medical image or video with their recorded presence. The CMAS system displays medical images via a projector onto a screen; when a doctor (or patient) physically walks in front of this screen with the medical image and gives his/her opinion while gesturing at the image, the CMAS system intuitively captures this interaction by creating a video annotation with HP's Active Shadows technology. The CMAS system automatically transforms physical interactions, ranging from a laser pointer spot to a doctor's physical presence, into video annotation that then can be overlaid on top of the medical image or seamlessly inserted into the procedure video. Annotated in such a manner, the medical record retains the historical development of the diagnostic medical opinion, explained through presence of doctors and their respective annotations. The CMAS system structures the annotation of digital medical records such that image/video annotations from multiple sources, at different times, and from different locations can be maintained within a historical context and be consistently referenced among multiple annotations.

  10. Identification of novel virulence-associated genes via genome analysis of hypothetical genes.

    PubMed

    Garbom, Sara; Forsberg, Ake; Wolf-Watz, Hans; Kihlberg, Britt-Marie

    2004-03-01

    The sequencing of bacterial genomes has opened new perspectives for identification of targets for treatment of infectious diseases. We have identified a set of novel virulence-associated genes (vag genes) by comparing the genome sequences of six human pathogens that are known to cause persistent or chronic infections in humans: Yersinia pestis, Neisseria gonorrhoeae, Helicobacter pylori, Borrelia burgdorferi, Streptococcus pneumoniae, and Treponema pallidum. This comparison was limited to genes annotated as hypothetical in the T. pallidum genome project. Seventeen genes with unknown functions were found to be conserved among these pathogens. Insertional inactivation of 14 of these genes generated nine mutants that were attenuated for virulence in a mouse infection model. Out of these nine genes, five were found to be specifically associated with virulence in mice as demonstrated by infection with Yersinia pseudotuberculosis in-frame deletion mutants. In addition, these five vag genes were essential only in vivo, since all the mutants were able to grow in vitro. These genes are broadly conserved among bacteria. Therefore, we propose that the corresponding vag gene products may constitute novel targets for antimicrobial therapy and that some vag mutants could serve as carrier strains for live vaccines. PMID:14977936

  11. Virulence in malaria: an evolutionary viewpoint.

    PubMed Central

    Mackinnon, Margaret J; Read, Andrew F

    2004-01-01

    Malaria parasites cause much morbidity and mortality to their human hosts. From our evolutionary perspective, this is because virulence is positively associated with parasite transmission rate. Natural selection therefore drives virulence upwards, but only to the point where the cost to transmission caused by host death begins to outweigh the transmission benefits. In this review, we summarize data from the laboratory rodent malaria model, Plasmodium chabaudi, and field data on the human malaria parasite, P. falciparum, in relation to this virulence trade-off hypothesis. The data from both species show strong positive correlations between asexual multiplication, transmission rate, infection length, morbidity and mortality, and therefore support the underlying assumptions of the hypothesis. Moreover, the P. falciparum data show that expected total lifetime transmission of the parasite is maximized in young children in whom the fitness cost of host mortality balances the fitness benefits of higher transmission rates and slower clearance rates, thus exhibiting the hypothesized virulence trade-off. This evolutionary explanation of virulence appears to accord well with the clinical and molecular explanations of pathogenesis that involve cytoadherence, red cell invasion and immune evasion, although direct evidence of the fitness advantages of these mechanisms is scarce. One implication of this evolutionary view of virulence is that parasite populations are expected to evolve new levels of virulence in response to medical interventions such as vaccines and drugs. PMID:15306410

  12. Structure of the catalytic domain of the Salmonella virulence factor SseI.

    PubMed

    Bhaskaran, Shyam S; Stebbins, C Erec

    2012-12-01

    SseI is secreted into host cells by Salmonella and contributes to the establishment of systemic infections. The crystal structure of the C-terminal domain of SseI has been solved to 1.70 Å resolution, revealing it to be a member of the cysteine protease superfamily with a catalytic triad consisting of Cys178, His216 and Asp231 that is critical to its virulence activities. Structure-based analysis revealed that SseI is likely to possess either acyl hydrolase or acyltransferase activity, placing this virulence factor in the rapidly growing class of enzymes of this family utilized by bacterial pathogens inside eukaryotic cells.

  13. The extinction differential induced virulence macroevolution

    NASA Astrophysics Data System (ADS)

    Zhang, Feng; Xu, Liufang; Wang, Jin

    2014-04-01

    We apply the potential-flux landscape theory to deal with the large fluctuation induced extinction phenomena. We quantify the most probable extinction pathway on the landscape and measure the extinction risk by the landscape topography. In this Letter, we investigate the disease extinction through an epidemic model described by a set of chemical reaction. We found the virulence-differential-dependent symbioses between mother and daughter pathogen species: mutualism and parasitism. The symbioses, whether mutualism or parasitism, benefit the higher virulence species. This implies that speciation towards lower virulence is an effective strategy for a pathogen species to reduce its extinction risk.

  14. Evaluating virulence of waterborne and clinical Aeromonas isolates using gene expression and mortality in neonatal mice followed by assessing cell culture’s ability to predict virulence based on transcriptional response

    SciTech Connect

    Hayes, S L; Rodgers, M R; Lye, D J; Stelma, G N; McKinstry, Craig A.; Malard, Joel M.; Vesper, Sephen J.

    2007-10-01

    Aims: To assess the virulence of Aeromonas spp. using two models, a neonatal mouse assay and a mouse intestinal cell culture. Methods and Results: After artificial infection with a variety of Aeromonas spp., mRNA extracts from the two models were processed and hydridized to murine microarrays to determine host gene response. Definition of virulence was determined based on host mRNA production in murine neonatal intestinal tissue and mortality of infected animals. Infections of mouse intestinal cell cultures were then performed to determine whether this simpler model system’s mRNA responses correlated to neonatal results and therefore be predictive of virulence of Aeromonas spp. Virulent aeromonads up-regulated transcripts in both models including multiple host defense gene products (chemokines, regulation of transcription and apoptosis and cell signalling). Avirulent species exhibited little or no host response in neonates. Mortality results correlated well with both bacterial dose and average fold change of up-regulated transcripts in the neonatal mice. Conclusions: Cell culture results were less discriminating but showed promise as potentially being able to be predictive of virulence. Jun oncogene up-regulation in murine cell culture is potentially predictive of Aeromonas virulence. Significance and Impact of the Study: Having the ability to determine virulence of waterborne pathogens quickly would potentially assist public health officials to rapidly assess exposure risks.

  15. Polymorphism Identification and Improved Genome Annotation of Brassica rapa Through Deep RNA Sequencing

    PubMed Central

    Devisetty, Upendra Kumar; Covington, Michael F.; Tat, An V.; Lekkala, Saradadevi; Maloof, Julin N.

    2014-01-01

    The mapping and functional analysis of quantitative traits in Brassica rapa can be greatly improved with the availability of physically positioned, gene-based genetic markers and accurate genome annotation. In this study, deep transcriptome RNA sequencing (RNA-Seq) of Brassica rapa was undertaken with two objectives: SNP detection and improved transcriptome annotation. We performed SNP detection on two varieties that are parents of a mapping population to aid in development of a marker system for this population and subsequent development of high-resolution genetic map. An improved Brassica rapa transcriptome was constructed to detect novel transcripts and to improve the current genome annotation. This is useful for accurate mRNA abundance and detection of expression QTL (eQTLs) in mapping populations. Deep RNA-Seq of two Brassica rapa genotypes—R500 (var. trilocularis, Yellow Sarson) and IMB211 (a rapid cycling variety)—using eight different tissues (root, internode, leaf, petiole, apical meristem, floral meristem, silique, and seedling) grown across three different environments (growth chamber, greenhouse and field) and under two different treatments (simulated sun and simulated shade) generated 2.3 billion high-quality Illumina reads. A total of 330,995 SNPs were identified in transcribed regions between the two genotypes with an average frequency of one SNP in every 200 bases. The deep RNA-Seq reassembled Brassica rapa transcriptome identified 44,239 protein-coding genes. Compared with current gene models of B. rapa, we detected 3537 novel transcripts, 23,754 gene models had structural modifications, and 3655 annotated proteins changed. Gaps in the current genome assembly of B. rapa are highlighted by our identification of 780 unmapped transcripts. All the SNPs, annotations, and predicted transcripts can be viewed at http://phytonetworks.ucdavis.edu/. PMID:25122667

  16. Polymorphism identification and improved genome annotation of Brassica rapa through Deep RNA sequencing.

    PubMed

    Devisetty, Upendra Kumar; Covington, Michael F; Tat, An V; Lekkala, Saradadevi; Maloof, Julin N

    2014-08-12

    The mapping and functional analysis of quantitative traits in Brassica rapa can be greatly improved with the availability of physically positioned, gene-based genetic markers and accurate genome annotation. In this study, deep transcriptome RNA sequencing (RNA-Seq) of Brassica rapa was undertaken with two objectives: SNP detection and improved transcriptome annotation. We performed SNP detection on two varieties that are parents of a mapping population to aid in development of a marker system for this population and subsequent development of high-resolution genetic map. An improved Brassica rapa transcriptome was constructed to detect novel transcripts and to improve the current genome annotation. This is useful for accurate mRNA abundance and detection of expression QTL (eQTLs) in mapping populations. Deep RNA-Seq of two Brassica rapa genotypes-R500 (var. trilocularis, Yellow Sarson) and IMB211 (a rapid cycling variety)-using eight different tissues (root, internode, leaf, petiole, apical meristem, floral meristem, silique, and seedling) grown across three different environments (growth chamber, greenhouse and field) and under two different treatments (simulated sun and simulated shade) generated 2.3 billion high-quality Illumina reads. A total of 330,995 SNPs were identified in transcribed regions between the two genotypes with an average frequency of one SNP in every 200 bases. The deep RNA-Seq reassembled Brassica rapa transcriptome identified 44,239 protein-coding genes. Compared with current gene models of B. rapa, we detected 3537 novel transcripts, 23,754 gene models had structural modifications, and 3655 annotated proteins changed. Gaps in the current genome assembly of B. rapa are highlighted by our identification of 780 unmapped transcripts. All the SNPs, annotations, and predicted transcripts can be viewed at http://phytonetworks.ucdavis.edu/.

  17. Annotated bibliography of software engineering laboratory literature

    NASA Technical Reports Server (NTRS)

    Buhler, Melanie; Valett, Jon

    1989-01-01

    An annotated bibliography is presented of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory. The bibliography was updated and reorganized substantially since the original version (SEL-82-006, November 1982). All materials were grouped into eight general subject areas for easy reference: (1) The Software Engineering Laboratory; (2) The Software Engineering Laboratory: Software Development Documents; (3) Software Tools; (4) Software Models; (5) Software Measurement; (6) Technology Evaluations; (7) Ada Technology; and (8) Data Collection. Subject and author indexes further classify these documents by specific topic and individual author.

  18. Annotated bibliography of Software Engineering Laboratory literature

    NASA Technical Reports Server (NTRS)

    Morusiewicz, Linda; Valett, Jon

    1993-01-01

    This document is an annotated bibliography of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory. Nearly 200 publications are summarized. These publications cover many areas of software engineering and range from research reports to software documentation. This document has been updated and reorganized substantially since the original version (SEL-82-006, November 1982). All materials have been grouped into eight general subject areas for easy reference: the Software Engineering Laboratory; the Software Engineering Laboratory: software development documents; software tools; software models; software measurement; technology evaluations; Ada technology; and data collection. This document contains an index of these publications classified by individual author.

  19. Annotated bibliography of software engineering laboratory literature

    NASA Technical Reports Server (NTRS)

    Groves, Paula; Valett, Jon

    1990-01-01

    An annotated bibliography of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory is given. More than 100 publications are summarized. These publications cover many areas of software engineering and range from research reports to software documentation. This document has been updated and reorganized substantially since the original version (SEL-82-006, November 1982). All materials have been grouped into eight general subject areas for easy reference: the Software Engineering Laboratory; the Software Engineering Laboratory-software development documents; software tools; software models; software measurement; technology evaluations; Ada technology; and data collection. Subject and author indexes further classify these documents by specific topic and individual author.

  20. Annotated bibliography of Software Engineering Laboratory literature

    NASA Technical Reports Server (NTRS)

    Morusiewicz, Linda; Valett, Jon D.

    1991-01-01

    An annotated bibliography of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory is given. More than 100 publications are summarized. These publications cover many areas of software engineering and range from research reports to software documentation. All materials have been grouped into eight general subject areas for easy reference: The Software Engineering Laboratory; The Software Engineering Laboratory: Software Development Documents; Software Tools; Software Models; Software Measurement; Technology Evaluations; Ada Technology; and Data Collection. Subject and author indexes further classify these documents by specific topic and individual author.

  1. Annotated bibliography of Software Engineering Laboratory literature

    NASA Technical Reports Server (NTRS)

    1985-01-01

    An annotated bibliography of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory is presented. More than 100 publications are summarized. These publications are summarized. These publications cover many areas of software engineering and range from research reports to software documentation. This document has been updated and reorganized substantially since the original version (SEL-82-006, November 1982). All materials are grouped into five general subject areas for easy reference: (1) the software engineering laboratory; (2) software tools; (3) models and measures; (4) technology evaluations; and (5) data collection. An index further classifies these documents by specific topic.

  2. Film annotation system for a space experiment

    NASA Technical Reports Server (NTRS)

    Browne, W. R.; Johnson, S. S.

    1989-01-01

    This microprocessor system was designed to control and annotate a Nikon 35 mm camera for the purpose of obtaining photographs and data at predefined time intervals. The single STD BUSS interface card was designed in such a way as to allow it to be used in either a stand alone application with minimum features or installed in a STD BUSS computer allowing for maximum features. This control system also allows the exposure of twenty eight alpha/numeric characters across the bottom of each photograph. The data contains such information as camera identification, frame count, user defined text, and time to .01 second.

  3. Annotations in Refseq (GSC8 Meeting)

    SciTech Connect

    Tatusova, Tatiana

    2009-09-10

    The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Tatiana Tatusova of NCBI discusses "Annotations in Refseq" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 10, 2009.

  4. Annotations in Refseq (GSC8 Meeting)

    ScienceCinema

    Tatusova, Tatiana

    2016-07-12

    The Genomic Standards Consortium was formed in September 2005. It is an international, open-membership working body which promotes standardization in the description of genomes and the exchange and integration of genomic data. The 2009 meeting was an activity of a five-year funding "Research Coordination Network" from the National Science Foundation and was organized held at the DOE Joint Genome Institute with organizational support provided by the JGI and by the University of California - San Diego. Tatiana Tatusova of NCBI discusses "Annotations in Refseq" at the Genomic Standards Consortium's 8th meeting at the DOE JGI in Walnut Creek, Calif. on Sept. 10, 2009.

  5. Human disturbances of waterfowl: An annotated bibliography

    USGS Publications Warehouse

    Dahlgren, R.B.; Korschgen, C.E.

    1992-01-01

    The expansion of outdoor recreation greatly increased the interaction between the public, waterfowl, and waterfowl habitat. The effects of these interactions on waterfowl habitats are visible and obvious, whereas the effects of interactions that disrupt the normal behavior of waterfowl are subtle and often overlooked, but perhaps no less harmful than destruction of habitat. Resource managers and administrators require information on the types, magnitude, and effect of disturbances from human contact with wildlife. This bibliography contains annotations for 211 articles with information about effects of human disturbances on waterfowl. Indexes are provided by subject or key words, geographic locations, species of waterfowl, and authors.

  6. Film annotation system for a space experiment

    NASA Astrophysics Data System (ADS)

    Browne, W. R.; Johnson, S. S.

    1989-07-01

    This microprocessor system was designed to control and annotate a Nikon 35 mm camera for the purpose of obtaining photographs and data at predefined time intervals. The single STD BUSS interface card was designed in such a way as to allow it to be used in either a stand alone application with minimum features or installed in a STD BUSS computer allowing for maximum features. This control system also allows the exposure of twenty eight alpha/numeric characters across the bottom of each photograph. The data contains such information as camera identification, frame count, user defined text, and time to .01 second.

  7. Human disturbances of waterfowl: an annotated bibliography

    USGS Publications Warehouse

    Dahlgren, R.B.; Korschgen, C.E.

    1992-01-01

    The expansion of outdoor recreation greatly increased the interaction between the public, waterfowl, and waterfowl habitat. The effects of these interactions on waterfowl habitats are visible and obvious, whereas the effects of interactions that disrupt the normal behavior of waterfowl are subtle and often overlooked, but perhaps no less harmful than destruction of habitat. Resource managers and administrators require information on the types, magnitude, and effects of disturbances from human contact with wildlife. This bibliography contains annotations for 211 articles with information about effects of human disturbances on waterfowl. Indexes are provided by subject or key words, geographic locations, species of waterfowl, and authors.

  8. Biological database of images and genomes: tools for community annotations linking image and genomic information.

    PubMed

    Oberlin, Andrew T; Jurkovic, Dominika A; Balish, Mitchell F; Friedberg, Iddo

    2013-01-01

    Genomic data and biomedical imaging data are undergoing exponential growth. However, our understanding of the phenotype-genotype connection linking the two types of data is lagging behind. While there are many types of software that enable the manipulation and analysis of image data and genomic data as separate entities, there is no framework established for linking the two. We present a generic set of software tools, BioDIG, that allows linking of image data to genomic data. BioDIG tools can be applied to a wide range of research problems that require linking images to genomes. BioDIG features the following: rapid construction of web-based workbenches, community-based annotation, user management and web services. By using BioDIG to create websites, researchers and curators can rapidly annotate a large number of images with genomic information. Here we present the BioDIG software tools that include an image module, a genome module and a user management module. We also introduce a BioDIG-based website, MyDIG, which is being used to annotate images of mycoplasmas. PMID:23550062

  9. Biological Database of Images and Genomes: tools for community annotations linking image and genomic information

    PubMed Central

    Oberlin, Andrew T; Jurkovic, Dominika A; Balish, Mitchell F; Friedberg, Iddo

    2013-01-01

    Genomic data and biomedical imaging data are undergoing exponential growth. However, our understanding of the phenotype–genotype connection linking the two types of data is lagging behind. While there are many types of software that enable the manipulation and analysis of image data and genomic data as separate entities, there is no framework established for linking the two. We present a generic set of software tools, BioDIG, that allows linking of image data to genomic data. BioDIG tools can be applied to a wide range of research problems that require linking images to genomes. BioDIG features the following: rapid construction of web-based workbenches, community-based annotation, user management and web services. By using BioDIG to create websites, researchers and curators can rapidly annotate a large number of images with genomic information. Here we present the BioDIG software tools that include an image module, a genome module and a user management module. We also introduce a BioDIG-based website, MyDIG, which is being used to annotate images of mycoplasmas. Database URL: BioDIG website: http://biodig.org BioDIG source code repository: http://github.com/FriedbergLab/BioDIG The MyDIG database: http://mydig.biodig.org/ PMID:23550062

  10. Women in Development: A Selected Annotated Bibliography and Resource Guide. Annotated Bibliography #1.

    ERIC Educational Resources Information Center

    Vavrus, Linda Gire; Cadieux, Ron

    This annotated bibliography on the subject of women in development is compiled from the resource collection of the Non-Formal Education Information Center of Michigan State University. Planned development efforts are beginning to reflect a greater appreciation of nontraditional, as well as traditional, role options for women. Moreover, constraints…

  11. Projects, Training, and Strategies for Generating Income: A Selected Annotated Bibliography. Annotated Bibliography #4.

    ERIC Educational Resources Information Center

    Michigan State Univ., East Lansing. Non-Formal Education Information Center.

    A selected annotated bibliography on projects, training, and strategies for generating income, intended for persons actively engaged in non-formal education for development, reflects a growing number of projects on income generation by and for women's groups, and a reliance upon indigenous associations and group action. Documents dating from 1969…

  12. Literacy and Basic Education: A Selected, Annotated Bibliography. Annotated Bibliography #3.

    ERIC Educational Resources Information Center

    Michigan State Univ., East Lansing. Non-Formal Education Information Center.

    A selected annotated bibliography on literacy and basic education, including contributions from practitioners in the worldwide non-formal education network and compiled for them, has three interrelated themes: integration of literacy programs with broader development efforts; the learner-centered or "psycho-social" approach to literacy, often with…

  13. Streptolysin S-like virulence factors: the continuing sagA

    PubMed Central

    Molloy, Evelyn M.; Cotter, Paul D.; Hill, Colin; Mitchell, Douglas A.; Ross, R. Paul

    2014-01-01

    Streptolysin S (SLS) is a potent cytolytic toxin and virulence factor produced by nearly all Streptococcus pyogenes strains. Despite a 100-year history of research on this toxin, it has only recently been established that SLS represents the archetypal example of an extended family of post-translationally modified virulence factors also produced by some other streptococci and Gram-positive pathogens, such as Listeria monocytogenes and Clostridium botulinum. In this Review we describe the identification, genetics, biochemistry and various functions of SLS. We also discuss the shared features of the virulence-associated SLS-like peptides, as well as their place within the rapidly expanding family of thiazole/oxazole-modified microcins (TOMMs). PMID:21822292

  14. Small RNA functions in carbon metabolism and virulence of enteric pathogens

    PubMed Central

    Papenfort, Kai; Vogel, Jörg

    2014-01-01

    Enteric pathogens often cycle between virulent and saprophytic lifestyles. To endure these frequent changes in nutrient availability and composition bacteria possess an arsenal of regulatory and metabolic genes allowing rapid adaptation and high flexibility. While numerous proteins have been characterized with regard to metabolic control in pathogenic bacteria, small non-coding RNAs have emerged as additional regulators of metabolism. Recent advances in sequencing technology have vastly increased the number of candidate regulatory RNAs and several of them have been found to act at the interface of bacterial metabolism and virulence factor expression. Importantly, studying these riboregulators has not only provided insight into their metabolic control functions but also revealed new mechanisms of post-transcriptional gene control. This review will focus on the recent advances in this area of host-microbe interaction and discuss how regulatory small RNAs may help coordinate metabolism and virulence of enteric pathogens. PMID:25077072

  15. A Novel Approach to Semantic and Coreference Annotation at LLNL

    SciTech Connect

    Firpo, M

    2005-02-04

    A case is made for the importance of high quality semantic and coreference annotation. The challenges of providing such annotation are described. Asperger's Syndrome is introduced, and the connections are drawn between the needs of text annotation and the abilities of persons with Asperger's Syndrome to meet those needs. Finally, a pilot program is recommended wherein semantic annotation is performed by people with Asperger's Syndrome. The primary points embodied in this paper are as follows: (1) Document annotation is essential to the Natural Language Processing (NLP) projects at Lawrence Livermore National Laboratory (LLNL); (2) LLNL does not currently have a system in place to meet its need for text annotation; (3) Text annotation is challenging for a variety of reasons, many related to its very rote nature; (4) Persons with Asperger's Syndrome are particularly skilled at rote verbal tasks, and behavioral experts agree that they would excel at text annotation; and (6) A pilot study is recommend in which two to three people with Asperger's Syndrome annotate documents and then the quality and throughput of their work is evaluated relative to that of their neuro-typical peers.

  16. Interactive annotation of textures in thoracic CT scans

    NASA Astrophysics Data System (ADS)

    Kockelkorn, Thessa T. J. P.; de Jong, Pim A.; Gietema, Hester A.; Grutters, Jan C.; Prokop, Mathias; van Ginneken, Bram

    2010-03-01

    This study describes a system for interactive annotation of thoracic CT scans. Lung volumes in these scans are segmented and subdivided into roughly spherical volumes of interest (VOIs) with homogeneous texture using a clustering procedure. For each 3D VOI, 72 features are calculated. The observer inspects the scan to determine which textures are present and annotates, with mouse clicks, several VOIs of each texture. Based on these annotations, a k-nearest-neighbor classifier is trained, which classifies all remaining VOIs in the scan. The algorithm then presents a slice with suggested annotations to the user, in which the user can correct mistakes. The classifier is retrained, taking into account these new annotations, and the user is presented another slice for correction. This process continues until at least 50% of all lung voxels in the scan have been classified. The remaining VOIs are classified automatically. In this way, the entire lung volume is annotated. The system has been applied to scans of patients with usual and non-specific interstitial pneumonia. The results of interactive annotation are compared to a setup in which the user annotates all predefined VOIs manually. The interactive system is 3.7 times as fast as complete manual annotation of VOIs and differences between the methods are similar to interobserver variability. This is a first step towards precise volumetric quantitation of texture patterns in thoracic CT in clinical research and in clinical practice.

  17. Semantator: semantic annotator for converting biomedical text to linked data.

    PubMed

    Tao, Cui; Song, Dezhao; Sharma, Deepak; Chute, Christopher G

    2013-10-01

    More than 80% of biomedical data is embedded in plain text. The unstructured nature of these text-based documents makes it challenging to easily browse and query the data of interest in them. One approach to facilitate browsing and querying biomedical text is to convert the plain text to a linked web of data, i.e., converting data originally in free text to structured formats with defined meta-level semantics. In this paper, we introduce Semantator (Semantic Annotator), a semantic-web-based environment for annotating data of interest in biomedical documents, browsing and querying the annotated data, and interactively refining annotation results if needed. Through Semantator, information of interest can be either annotated manually or semi-automatically using plug-in information extraction tools. The annotated results will be stored in RDF and can be queried using the SPARQL query language. In addition, semantic reasoners can be directly applied to the annotated data for consistency checking and knowledge inference. Semantator has been released online and was used by the biomedical ontology community who provided positive feedbacks. Our evaluation results indicated that (1) Semantator can perform the annotation functionalities as designed; (2) Semantator can be adopted in real applications in clinical and transactional research; and (3) the annotated results using Semantator can be easily used in Semantic-web-based reasoning tools for further inference.

  18. African American Literature, 1989-94: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Miller, R. Baxter; Butts, Tracy; Jones, Sharon

    1997-01-01

    Contains an annotated bibliography of African American literature (published between 1989 and 1994), including anthologies, fiction, poetry, drama, criticism, cultural studies, biography, interviews, and letters. (TB)

  19. Re-sequencing of a virulent strain of Campylobacter jejuni NCTC11168 reveals potential virulence factors.

    PubMed

    Cooper, Kerry K; Cooper, Margarethe A; Zuccolo, Andrea; Joens, Lynn A

    2013-01-01

    In vitro passage of Campylobacter jejuni strains results in phenotypic changes and a general loss of virulence, as is the case with the genome-sequenced strain C. jejuni NCTC11168. Re-sequencing of a virulent strain of NCTC11168 identified 41 SNPs or indels involving 20 genes, four intergenic regions and three pseudogenes. The genes include six motility genes, two chemotaxis genes, three hypothetical genes and a capsule biosynthesis gene, which might have a critical role in C. jejuni virulence. Additionally, we found an insertion in both Cj0676 and Cj1470c, pseudogenes in avirulent NCTC11168, but functional proteins in virulent NCTC11168.

  20. Neoparamoeba perurans loses virulence during clonal culture.

    PubMed

    Bridle, Andrew R; Davenport, Danielle L; Crosbie, Philip B B; Polinski, Mark; Nowak, Barbara F

    2015-08-01

    Amoebic Gill Disease affects farmed salmonids and is caused by Neoparamoeba perurans. Clonal cultures of this amoeba have been used for challenge experiments, however the effect of long-term culture on virulence has not been investigated. Here we show, using in vitro and in vivo methods, that a clone of N. perurans which was virulent 70 days after clonal culture lost virulence after 3 years in clonal culture. We propose that this is related either to the lack of attachment to the gills or the absence of an extracellular product, as shown by the lack of cytopathic effect on Chinook salmon embryo cells. The avirulent clonal culture of N. perurans allowed us to propose two potential virulence mechanisms/factors involved in Amoebic Gill Disease and is an invaluable tool for host-pathogen studies of Amoebic Gill Disease. PMID:26008963

  1. Phenotypic Plasticity Regulates Candida albicans Interactions and Virulence in the Vertebrate Host

    PubMed Central

    Mallick, Emily M.; Bergeron, Audrey C.; Jones, Stephen K.; Newman, Zachary R.; Brothers, Kimberly M.; Creton, Robbert; Wheeler, Robert T.; Bennett, Richard J.

    2016-01-01

    Phenotypic diversity is critical to the lifestyles of many microbial species, enabling rapid responses to changes in environmental conditions. In the human fungal pathogen Candida albicans, cells exhibit heritable switching between two phenotypic states, white and opaque, which yield differences in mating, filamentous growth, and interactions with immune cells in vitro. Here, we address the in vivo virulence properties of the two cell states in a zebrafish model of infection. Multiple attributes were compared including the stability of phenotypic states, filamentation, virulence, dissemination, and phagocytosis by immune cells, and phenotypes equated across three different host temperatures. Importantly, we found that both white and opaque cells could establish a lethal systemic infection. The relative virulence of the two cell types was temperature dependent; virulence was similar at 25°C, but at higher temperatures (30 and 33°C) white cells were significantly more virulent than opaque cells. Despite the difference in virulence, fungal burden, and dissemination were similar between cells in the two states. Additionally, both white and opaque cells exhibited robust filamentation during infection and blocking filamentation resulted in decreased virulence, establishing that this program is critical for pathogenesis in both cell states. Interactions between C. albicans cells and immune cells differed between white and opaque states. Macrophages and neutrophils preferentially phagocytosed white cells over opaque cells in vitro, and neutrophils showed preferential phagocytosis of white cells in vivo. Together, these studies distinguish the properties of white and opaque cells in a vertebrate host, and establish that the two cell types demonstrate both important similarities and key differences during infection. PMID:27303374

  2. Virulence of viral haemorrhagic septicaemia virus (VHSV) genotype III in rainbow trout.

    PubMed

    Ito, Takafumi; Kurita, Jun; Mori, Koh-ichiro; Olesen, Niels J

    2016-01-08

    In general, viral haemorrhagic septicaemia virus (VHSV) isolates from marine fish species in European waters (genotypes GIb, GII and GIII) are non- to low virulent in rainbow trout. However, a VHSV isolation was made in 2007 from a disease outbreak in sea farmed rainbow trout in Norway. The isolate, named NO-2007-50-385, was demonstrated to belong to GIII. This isolate has attracted attention to assess which of the viral genome/proteins might be associated with the virulence in rainbow trout. In this study, we describe the difference of virulence in rainbow trout between the NO-2007-50-385 and 4p168 isolates as representatives of virulent and non-virulent GIII isolates, respectively. Rainbow trout were bath challenged with VHSV NO-2007-50-385 for 1 and 6 h, resulting in cumulative mortalities of 5 and 35%, respectively. No mortality was observed in the rainbow trout groups immersed with the genotype III VHSV isolate 4p168 for 1 and 6 h. The viral titre in organs from fish challenged with NO-2007-50-385 for 6 h increased more rapidly than those exposed for 1 h. By in vitro studies it was demonstrated that the final titres of VHSV DK-3592B (GI), NO-2007-50-385 and 4p168 inoculated on EPC cells were very similar, whereas when inoculated on the rainbow trout cell line RTG-2 the titre of the non-virulent 4p168 isolate was 3-4 logs below the two other VHSV isolates. Based on a comparative analysis of the entire genome of the genotype III isolates, we suggest that substitutions of amino acids in positions 118-123 of the nucleo-protein are candidates for being related to virulence of VHSV GIII in rainbow trout.

  3. Campylobacter Polysaccharide Capsules: Virulence and Vaccines

    PubMed Central

    Guerry, Patricia; Poly, Frédéric; Riddle, Mark; Maue, Alexander C.; Chen, Yu-Han; Monteiro, Mario A.

    2012-01-01

    Campylobacter jejuni remains a major cause of bacterial diarrhea worldwide and is associated with numerous sequelae, including Guillain Barré Syndrome, inflammatory bowel disease, reactive arthritis, and irritable bowel syndrome. C. jejuni is unusual for an intestinal pathogen in its ability to coat its surface with a polysaccharide capsule (CPS). These capsular polysaccharides vary in sugar composition and linkage, especially those involving heptoses of unusual configuration and O-methyl phosphoramidate linkages. This structural diversity is consistent with CPS being the major serodeterminant of the Penner scheme, of which there are 47 C. jejuni serotypes. Both CPS expression and expression of modifications are subject to phase variation by slip strand mismatch repair. Although capsules are virulence factors for other pathogens, the role of CPS in C. jejuni disease has not been well defined beyond descriptive studies demonstrating a role in serum resistance and for diarrhea in a ferret model of disease. However, perhaps the most compelling evidence for a role in pathogenesis are data that CPS conjugate vaccines protect against diarrheal disease in non-human primates. A CPS conjugate vaccine approach against this pathogen is intriguing, but several questions need to be addressed, including the valency of CPS types required for an effective vaccine. There have been numerous studies of prevalence of CPS serotypes in the developed world, but few studies from developing countries where the disease incidence is higher. The complexity and cost of Penner serotyping has limited its usefulness, and a recently developed multiplex PCR method for determination of capsule type offers the potential of a more rapid and affordable method. Comparative studies have shown a strong correlation of the two methods and studies are beginning to ascertain CPS-type distribution worldwide, as well as examination of correlation of severity of illness with specific CPS types. PMID:22919599

  4. [Proteus bacilli: features and virulence factors].

    PubMed

    Rózalski, Antoni; Kwil, Iwona; Torzewska, Agnieszka; Baranowska, Magdalena; Staczek, Paweł

    2007-01-01

    In this article, different aspects of virulence factors of Proteus bacilii (P. mirabilis, P. vulgaris, P. penneri i P. hauseri) are presented. These are opportunistic pathogens that cause different kinds of infections, most frequently of the urinary tract. These bacteria have developed several virulence factors, such as adherence due to the presence of fimbriae or afimbrial adhesins, invasiveness, swarming phenomenon, hemolytic activity, urea hydrolysis, proteolysis, and endotoxicity. Below we focus on data concerning the molecular basis of the pathogenicity of Proteus bacilli.

  5. Identifying Virulence-Associated Genes Using Transcriptomic and Proteomic Association Analyses of the Plant Parasitic Nematode Bursaphelenchus mucronatus

    PubMed Central

    Zhou, Lifeng; Chen, Fengmao; Pan, Hongyang; Ye, Jianren; Dong, Xuejiao; Li, Chunyan; Lin, Fengling

    2016-01-01

    Bursaphelenchus mucronatus (B. mucronatus) isolates that originate from different regions may vary in their virulence, but their virulence-associated genes and proteins are poorly understood. Thus, we conducted an integrated study coupling RNA-Seq and isobaric tags for relative and absolute quantitation (iTRAQ) to analyse transcriptomic and proteomic data of highly and weakly virulent B. mucronatus isolates during the pathogenic processes. Approximately 40,000 annotated unigenes and 5000 proteins were gained from the isolates. When we matched all of the proteins with their detected transcripts, a low correlation coefficient of r = 0.138 was found, indicating probable post-transcriptional gene regulation involved in the pathogenic processes. A functional analysis showed that five differentially expressed proteins which were all highly expressed in the highly virulent isolate were involved in the pathogenic processes of nematodes. Peroxiredoxin, fatty acid- and retinol-binding protein, and glutathione peroxidase relate to resistance against plant defence responses, while β-1,4-endoglucanase and expansin are associated with the breakdown of plant cell walls. Thus, the pathogenesis of B. mucronatus depends on its successful survival in host plants. Our work adds to the understanding of B. mucronatus’ pathogenesis, and will aid in controlling B. mucronatus and other pinewood nematode species complexes in the future. PMID:27618012

  6. Identifying Virulence-Associated Genes Using Transcriptomic and Proteomic Association Analyses of the Plant Parasitic Nematode Bursaphelenchus mucronatus.

    PubMed

    Zhou, Lifeng; Chen, Fengmao; Pan, Hongyang; Ye, Jianren; Dong, Xuejiao; Li, Chunyan; Lin, Fengling

    2016-01-01

    Bursaphelenchus mucronatus (B. mucronatus) isolates that originate from different regions may vary in their virulence, but their virulence-associated genes and proteins are poorly understood. Thus, we conducted an integrated study coupling RNA-Seq and isobaric tags for relative and absolute quantitation (iTRAQ) to analyse transcriptomic and proteomic data of highly and weakly virulent B. mucronatus isolates during the pathogenic processes. Approximately 40,000 annotated unigenes and 5000 proteins were gained from the isolates. When we matched all of the proteins with their detected transcripts, a low correlation coefficient of r = 0.138 was found, indicating probable post-transcriptional gene regulation involved in the pathogenic processes. A functional analysis showed that five differentially expressed proteins which were all highly expressed in the highly virulent isolate were involved in the pathogenic processes of nematodes. Peroxiredoxin, fatty acid- and retinol-binding protein, and glutathione peroxidase relate to resistance against plant defence responses, while β-1,4-endoglucanase and expansin are associated with the breakdown of plant cell walls. Thus, the pathogenesis of B. mucronatus depends on its successful survival in host plants. Our work adds to the understanding of B. mucronatus' pathogenesis, and will aid in controlling B. mucronatus and other pinewood nematode species complexes in the future. PMID:27618012

  7. Siderophore biosynthesis coordinately modulated the virulence-associated interactive metabolome of uropathogenic Escherichia coli and human urine

    PubMed Central

    Su, Qiao; Guan, Tianbing; Lv, Haitao

    2016-01-01

    Uropathogenic Escherichia coli (UPEC) growth in women’s bladders during urinary tract infection (UTI) incurs substantial chemical exchange, termed the “interactive metabolome”, which primarily accounts for the metabolic costs (utilized metabolome) and metabolic donations (excreted metabolome) between UPEC and human urine. Here, we attempted to identify the individualized interactive metabolome between UPEC and human urine. We were able to distinguish UPEC from non-UPEC by employing a combination of metabolomics and genetics. Our results revealed that the interactive metabolome between UPEC and human urine was markedly different from that between non-UPEC and human urine, and that UPEC triggered much stronger perturbations in the interactive metabolome in human urine. Furthermore, siderophore biosynthesis coordinately modulated the individualized interactive metabolome, which we found to be a critical component of UPEC virulence. The individualized virulence-associated interactive metabolome contained 31 different metabolites and 17 central metabolic pathways that were annotated to host these different metabolites, including energetic metabolism, amino acid metabolism, and gut microbe metabolism. Changes in the activities of these pathways mechanistically pinpointed the virulent capability of siderophore biosynthesis. Together, our findings provide novel insights into UPEC virulence, and we propose that siderophores are potential targets for further discovery of drugs to treat UPEC-induced UTI. PMID:27076285

  8. Siderophore biosynthesis coordinately modulated the virulence-associated interactive metabolome of uropathogenic Escherichia coli and human urine.

    PubMed

    Su, Qiao; Guan, Tianbing; Lv, Haitao

    2016-04-14

    Uropathogenic Escherichia coli (UPEC) growth in women's bladders during urinary tract infection (UTI) incurs substantial chemical exchange, termed the "interactive metabolome", which primarily accounts for the metabolic costs (utilized metabolome) and metabolic donations (excreted metabolome) between UPEC and human urine. Here, we attempted to identify the individualized interactive metabolome between UPEC and human urine. We were able to distinguish UPEC from non-UPEC by employing a combination of metabolomics and genetics. Our results revealed that the interactive metabolome between UPEC and human urine was markedly different from that between non-UPEC and human urine, and that UPEC triggered much stronger perturbations in the interactive metabolome in human urine. Furthermore, siderophore biosynthesis coordinately modulated the individualized interactive metabolome, which we found to be a critical component of UPEC virulence. The individualized virulence-associated interactive metabolome contained 31 different metabolites and 17 central metabolic pathways that were annotated to host these different metabolites, including energetic metabolism, amino acid metabolism, and gut microbe metabolism. Changes in the activities of these pathways mechanistically pinpointed the virulent capability of siderophore biosynthesis. Together, our findings provide novel insights into UPEC virulence, and we propose that siderophores are potential targets for further discovery of drugs to treat UPEC-induced UTI.

  9. Increased virulence of Rabbit Haemorrhagic Disease Virus associated with genetic resistance in wild Australian rabbits (Oryctolagus cuniculus)

    PubMed Central

    Elsworth, Peter; Cooke, Brian D.; Kovaliski, John; Sinclair, Ronald; Holmes, Edward C.; Strive, Tanja

    2015-01-01

    The release of myxoma virus (MYXV) and Rabbit Haemorrhagic Disease Virus (RHDV) in Australia with the aim of controlling overabundant rabbits has provided a unique opportunity to study the initial spread and establishment of emerging pathogens, as well as their co-evolution with their mammalian hosts. In contrast to MYXV, which attenuated shortly after its introduction, rapid attenuation of RHDV has not been observed. By studying the change in virulence of recent field isolates at a single field site we show, for the first time, that RHDV virulence has increased through time, likely because of selection to overcome developing genetic resistance in Australian wild rabbits. High virulence also appears to be favoured as rabbit carcasses, rather than diseased animals, are the likely source of mechanical insect transmission. These findings not only help elucidate the co-evolutionary interaction between rabbits and RHDV, but reveal some of the key factors shaping virulence evolution. PMID:25146599

  10. Spaceflight Effects on Virulence of Pseudomonas Aeruginosa

    NASA Astrophysics Data System (ADS)

    Broadway, S.; Goins, T.; Crandell, C.; Richards, C.; Patel, M.; Pyle, B.

    2008-06-01

    Pseudomonas aeruginosa is an opportunistic pathogen found in the environment. It is known to infect the immunocompromised. The organism has about 25 virulence genes that play different roles in disease processes. Several exotoxin proteins may be produced, including ExoA, ExoS, ExoT and ExoY, and other virulence factors. In spaceflight, possible increased expression of P. aeruginosa virulence proteins could increase health risks for spaceflight crews who experience decreased immunity. Cultures of P. aeruginosa strains PA01 and PA103 grown on orbit on Shuttle Endeavour flight STS-123 vs. static ground controls were used for analysis. The production of ETA was quantitated using an ELISA procedure. Results showed that while flight cultures of PA103 produced slightly more ETA than corresponding ground controls, the opposite was found for PA01. While it appears that spaceflight has little effect on ETA, stimulation of other virulence factors could cause increased virulence of this organism in space flight. Similar increased virulence in spaceflight has been observed for other bacteria. This is important because astronauts may be more susceptible to opportunistic pathogens including P. aeruginosa.

  11. Towards a Library of Standard Operating Procedures (SOPs) for (meta)genomic annotation

    SciTech Connect

    Kyrpides, Nikos; Angiuoli, Samuel V.; Cochrane, Guy; Field, Dawn; Garrity, George; Gussman, Aaron; Kodira, Chinnappa D.; Klimke, William; Kyrpides, Nikos; Madupu, Ramana; Markowitz, Victor; Tatusova, Tatiana; Thomson, Nick; White, Owen

    2008-04-01

    Genome annotations describe the features of genomes and accompany sequences in genome databases. The methodologies used to generate genome annotation are diverse and typically vary amongst groups. Descriptions of the annotation procedure are helpful in interpreting genome annotation data. Standard Operating Procedures (SOPs) for genome annotation describe the processes that generate genome annotations. Some groups are currently documenting procedures but standards are lacking for structure and content of annotation SOPs. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse a central online repository of SOPs.

  12. Virulent Newcastle disease virus elicits a strong innate immune response in chickens.

    PubMed

    Rue, Cary A; Susta, Leonardo; Cornax, Ingrid; Brown, Corrie C; Kapczynski, Darrell R; Suarez, David L; King, Daniel J; Miller, Patti J; Afonso, Claudio L

    2011-04-01

    Newcastle disease virus (NDV) is an avian paramyxovirus that causes significant economic losses to the poultry industry worldwide. There is limited knowledge about the avian immune response to infection with virulent NDVs, and how this response may contribute to disease. In this study, pathogenesis and the transcriptional host response of chickens to a virulent NDV strain that rapidly causes 100% mortality was characterized. Using microarrays, a strong transcriptional host response was observed in spleens at early times after infection with the induction of groups of genes involved in innate antiviral and pro-inflammatory responses. There were multiple genes induced at 48 h post-infection including: type I and II interferons (IFNs), several cytokines and chemokines, IFN effectors and inducible nitric oxide synthase (iNOS). The increased transcription of nitric oxide synthase was confirmed by immunohistochemistry for iNOS in spleens and measured levels of nitric oxide in serum. In vitro experiments showed strong induction of the key host response genes, alpha IFN, beta interferon, and interleukin 1β and interleukin 6, in splenic leukocytes at 6 h post-infection in comparison to a non-virulent NDV. The robust host response to virulent NDV, in conjunction with severe pathological damage observed, is somewhat surprising considering that all NDV encode a gene, V, which functions as a suppressor of class I IFNs. Taken together, these results suggest that the host response itself may contribute to the pathogenesis of this highly virulent strain in chickens.

  13. Host-Pathogen Coevolution: The Selective Advantage of Bacillus thuringiensis Virulence and Its Cry Toxin Genes.

    PubMed

    Masri, Leila; Branca, Antoine; Sheppard, Anna E; Papkou, Andrei; Laehnemann, David; Guenther, Patrick S; Prahl, Swantje; Saebelfeld, Manja; Hollensteiner, Jacqueline; Liesegang, Heiko; Brzuszkiewicz, Elzbieta; Daniel, Rolf; Michiels, Nicolaas K; Schulte, Rebecca D; Kurtz, Joachim; Rosenstiel, Philip; Telschow, Arndt; Bornberg-Bauer, Erich; Schulenburg, Hinrich

    2015-06-01

    Reciprocal coevolution between host and pathogen is widely seen as a major driver of evolution and biological innovation. Yet, to date, the underlying genetic mechanisms and associated trait functions that are unique to rapid coevolutionary change are generally unknown. We here combined experimental evolution of the bacterial biocontrol agent Bacillus thuringiensis and its nematode host Caenorhabditis elegans with large-scale phenotyping, whole genome analysis, and functional genetics to demonstrate the selective benefit of pathogen virulence and the underlying toxin genes during the adaptation process. We show that: (i) high virulence was specifically favoured during pathogen-host coevolution rather than pathogen one-sided adaptation to a nonchanging host or to an environment without host; (ii) the pathogen genotype BT-679 with known nematocidal toxin genes and high virulence specifically swept to fixation in all of the independent replicate populations under coevolution but only some under one-sided adaptation; (iii) high virulence in the BT-679-dominated populations correlated with elevated copy numbers of the plasmid containing the nematocidal toxin genes; (iv) loss of virulence in a toxin-plasmid lacking BT-679 isolate was reconstituted by genetic reintroduction or external addition of the toxins. We conclude that sustained coevolution is distinct from unidirectional selection in shaping the pathogen's genome and life history characteristics. To our knowledge, this study is the first to characterize the pathogen genes involved in coevolutionary adaptation in an animal host-pathogen interaction system. PMID:26042786

  14. Pathogenicity and virulence: another view.

    PubMed Central

    Isenberg, H D

    1988-01-01

    The concepts of pathogenicity and virulence have governed our perception of microbial harmfulness since the time of Pasteur and Koch. These concepts resulted in the recognition and identification of numerous etiological agents and provided natural and synthetic agents effective in therapy and prevention of diseases. However, Koch's postulates--the premier product of this view--place the onus of harmfulness solely on the microbial world. Our recent experiences with polymicrobic and nosocomial infections, legionellosis, and acquired immunodeficiency syndrome point to the host as the major determinant of disease. The principles of parasitism, enunciated by Theobold Smith, approximate more accurately the disturbances of the host-parasite equilibrium we designate as infection. Many complex attributes of microbial anatomy and physiology have been obscured by our dependency on the pure-culture technique. For example, bacterial attachment organelles and the production of exopolysaccharides enable microorganisms to interact with mammalian glycocalyces and specific receptors. In addition, selection, through the use of therapeutic agents, aids in the progression of environmental organisms to members of the intimate human biosphere, with the potential to complicate the recovery of patients. These factors emphasize further the pivotal significance of host reactions in infections. Parasitism, in its negative aspects, explains the emergence of "new" infections that involve harm to more than host organs and cells: we may encounter subtler infections that reveal parasitic and host cell nucleic acid interactions in a form of genomic parasitism. PMID:3060244

  15. Genomic sequence and virulence of clonal isolates of vaccinia virus Tiantan, the Chinese smallpox vaccine strain.

    PubMed

    Zhang, Qicheng; Tian, Meijuan; Feng, Yi; Zhao, Kai; Xu, Jing; Liu, Ying; Shao, Yiming

    2013-01-01

    Despite the worldwide eradication of smallpox in 1979, the potential bioterrorism threat from variola virus and the ongoing use of vaccinia virus (VACV) as a vector for vaccine development argue for continued research on VACV. In China, the VACV Tiantan strain (TT) was used in the smallpox eradication campaign. Its progeny strain is currently being used to develop a human immunodeficiency virus (HIV) vaccine. Here we sequenced the full genomes of five TT clones isolated by plaque purification from the TT (752-1) viral stock. Phylogenetic analysis with other commonly used VACV strains showed that TT (752-1) and its clones clustered and exhibited higher sequence diversity than that found in Dryvax clones. The ∼190 kbp genomes of TT appeared to encode 273 open reading frames (ORFs). ORFs located in the middle of the genome were more conserved than those located at the two termini, where many virulence and immunomodulation associated genes reside. Several patterns of nucleotide changes including point mutations, insertions and deletions were identified. The polymorphisms in seven virulence-associated proteins and six immunomodulation-related proteins were analyzed. We also investigated the neuro- and skin- virulence of TT clones in mice and rabbits, respectively. The TT clones exhibited significantly less virulence than the New York City Board of Health (NYCBH) strain, as evidenced by less extensive weight loss and morbidity in mice as well as produced smaller skin lesions and lower incidence of putrescence in rabbits. The complete genome sequences, ORF annotations, and phenotypic diversity yielded from this study aid our understanding of the Chinese historic TT strain and are useful for HIV vaccine projects employing TT as a vector.

  16. The type III secreted protein BspR regulates the virulence genes in Bordetella bronchiseptica.

    PubMed

    Kurushima, Jun; Kuwae, Asaomi; Abe, Akio

    2012-01-01

    Bordetella bronchiseptica is closely related with B. pertussis and B. parapertussis, the causative agents of whooping cough. These pathogenic species share a number of virulence genes, including the gene locus for the type III secretion system (T3SS) that delivers effector proteins. To identify unknown type III effectors in Bordetella, secreted proteins in the bacterial culture supernatants of wild-type B. bronchiseptica and an isogenic T3SS-deficient mutant were compared with iTRAQ-based, quantitative proteomic analysis method. BB1639, annotated as a hypothetical protein, was identified as a novel type III secreted protein and was designated BspR (Bordetella secreted protein regulator). The virulence of a BspR mutant (ΔbspR) in B. bronchiseptica was significantly attenuated in a mouse infection model. BspR was also highly conserved in B. pertussis and B. parapertussis, suggesting that BspR is an essential virulence factor in these three Bordetella species. Interestingly, the BspR-deficient strain showed hyper-secretion of T3SS-related proteins. Furthermore, T3SS-dependent host cell cytotoxicity and hemolytic activity were also enhanced in the absence of BspR. By contrast, the expression of filamentous hemagglutinin, pertactin, and adenylate cyclase toxin was completely abolished in the BspR-deficient strain. Finally, we demonstrated that BspR is involved in the iron-responsive regulation of T3SS. Thus, Bordetella virulence factors are coordinately but inversely controlled by BspR, which functions as a regulator in response to iron starvation.

  17. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease.

    PubMed

    Sifrim, Alejandro; Van Houdt, Jeroen Kj; Tranchevent, Leon-Charles; Nowakowska, Beata; Sakai, Ryo; Pavlopoulos, Georgios A; Devriendt, Koen; Vermeesch, Joris R; Moreau, Yves; Aerts, Jan

    2012-01-01

    The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.

  18. Identification of Secreted Exoproteome Fingerprints of Highly-Virulent and Non-Virulent Staphylococcus aureus Strains

    PubMed Central

    Bonar, Emilia; Wojcik, Iwona; Jankowska, Urszula; Kedracka-Krok, Sylwia; Bukowski, Michal; Polakowska, Klaudia; Lis, Marcin W.; Kosecka-Strojek, Maja; Sabat, Artur J.; Dubin, Grzegorz; Friedrich, Alexander W.; Miedzobrodzki, Jacek; Dubin, Adam; Wladyka, Benedykt

    2016-01-01

    Staphylococcus aureus is a commensal inhabitant of skin and mucous membranes in nose vestibule but also an important opportunistic pathogen of humans and livestock. The extracellular proteome as a whole constitutes its major virulence determinant; however, the involvement of particular proteins is still relatively poorly understood. In this study, we compared the extracellular proteomes of poultry-derived S. aureus strains exhibiting a virulent (VIR) and non-virulent (NVIR) phenotype in a chicken embryo experimental infection model with the aim to identify proteomic signatures associated with the particular phenotypes. Despite significant heterogeneity within the analyzed proteomes, we identified alpha-haemolysin and bifunctional autolysin as indicators of virulence, whereas glutamylendopeptidase production was characteristic for non-virulent strains. Staphopain C (StpC) was identified in both the VIR and NVIR proteomes and the latter fact contradicted previous findings suggesting its involvement in virulence. By supplementing NVIR, StpC-negative strains with StpC, and comparing the virulence of parental and supplemented strains, we demonstrated that staphopain C alone does not affect staphylococcal virulence in a chicken embryo model. PMID:27242969

  19. A User-Driven Annotation Framework for Scientific Data

    ERIC Educational Resources Information Center

    Li, Qinglan

    2013-01-01

    Annotations play an increasingly crucial role in scientific exploration and discovery, as the amount of data and the level of collaboration among scientists increases. There are many systems today focusing on annotation management, querying, and propagation. Although all such systems are implemented to take user input (i.e., the annotations…

  20. A Factor Graph Approach to Automated GO Annotation.

    PubMed

    Spetale, Flavio E; Tapia, Elizabeth; Krsticevic, Flavia; Roda, Fernando; Bulacio, Pilar

    2016-01-01

    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum. PMID:26771463

  1. Evaluating techniques for metagenome annotation using simulated sequence data.

    PubMed

    Randle-Boggis, Richard J; Helgason, Thorunn; Sapp, Melanie; Ashton, Peter D

    2016-07-01

    The advent of next-generation sequencing has allowed huge amounts of DNA sequence data to be produced, advancing the capabilities of microbial ecosystem studies. The current challenge is to identify from which microorganisms and genes the DNA originated. Several tools and databases are available for annotating DNA sequences. The tools, databases and parameters used can have a significant impact on the results: naïve choice of these factors can result in a false representation of community composition and function. We use a simulated metagenome to show how different parameters affect annotation accuracy by evaluating the sequence annotation performances of MEGAN, MG-RAST, One Codex and Megablast. This simulated metagenome allowed the recovery of known organism and function abundances to be quantitatively evaluated, which is not possible for environmental metagenomes. The performance of each program and database varied, e.g. One Codex correctly annotated many sequences at the genus level, whereas MG-RAST RefSeq produced many false positive annotations. This effect decreased as the taxonomic level investigated increased. Selecting more stringent parameters decreases the annotation sensitivity, but increases precision. Ultimately, there is a trade-off between taxonomic resolution and annotation accuracy. These results should be considered when annotating metagenomes and interpreting results from previous studies. PMID:27162180

  2. Annotated Bibliography of Research in the Teaching of English

    ERIC Educational Resources Information Center

    Beach, Richard; Bigelow, Martha; Dillon, Deborah; Dockter, Jessie; Galda, Lee; Helman, Lori; Kalnin, Julie; Ngo, Bic; O'Brien, David; Sato, Mistilina; Scharber, Cassandra; Jorgensen, Karen; Liang, Lauren; Braaksma, Martine; Janssen, Tanja

    2008-01-01

    This article presents an annotated bibliography of research in the teaching of English. This annotated bibliography addresses the following topics: (1) discourse/cultural analysis; (2) literacy; (3) literary response/literature/narrative; (4) professional development/teacher education; (5) reading; (6) second language literacy; (7)…

  3. Behavioral contributions to teaching of psychology: an annotated bibliography.

    PubMed

    Karsten, Amanda M; Carr, James E

    2008-01-01

    An annotated bibliography that summarizes behavioral contributions to the journal Teaching of Psychology from 1974 to 2006 is provided. A total of 116 articles of potential utility to college-level instructors of behavior analysis and related areas were identified, annotated, and organized into nine categories for ease of accessibility.

  4. Protein Annotators' Assistant: A Novel Application of Information Retrieval Techniques.

    ERIC Educational Resources Information Center

    Wise, Michael J.

    2000-01-01

    Protein Annotators' Assistant (PAA) is a software system which assists protein annotators in assigning functions to newly sequenced proteins. PAA employs a number of information retrieval techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the…

  5. Maize - GO annotation methods, evaluation, and review (Maize-GAMER)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Making a genome sequence accessible and useful involves three basic steps: genome assembly, structural annotation, and functional annotation. The quality of data generated at each step influences the accuracy of inferences that can be made, with high-quality analyses produce better datasets resultin...

  6. Evaluating techniques for metagenome annotation using simulated sequence data

    PubMed Central

    Randle-Boggis, Richard J.; Helgason, Thorunn; Sapp, Melanie; Ashton, Peter D.

    2016-01-01

    The advent of next-generation sequencing has allowed huge amounts of DNA sequence data to be produced, advancing the capabilities of microbial ecosystem studies. The current challenge is to identify from which microorganisms and genes the DNA originated. Several tools and databases are available for annotating DNA sequences. The tools, databases and parameters used can have a significant impact on the results: naïve choice of these factors can result in a false representation of community composition and function. We use a simulated metagenome to show how different parameters affect annotation accuracy by evaluating the sequence annotation performances of MEGAN, MG-RAST, One Codex and Megablast. This simulated metagenome allowed the recovery of known organism and function abundances to be quantitatively evaluated, which is not possible for environmental metagenomes. The performance of each program and database varied, e.g. One Codex correctly annotated many sequences at the genus level, whereas MG-RAST RefSeq produced many false positive annotations. This effect decreased as the taxonomic level investigated increased. Selecting more stringent parameters decreases the annotation sensitivity, but increases precision. Ultimately, there is a trade-off between taxonomic resolution and annotation accuracy. These results should be considered when annotating metagenomes and interpreting results from previous studies. PMID:27162180

  7. Behavioral Contributions to Teaching of Psychology: An Annotated Bibliography

    PubMed Central

    Karsten, Amanda M; Carr, James E

    2008-01-01

    An annotated bibliography that summarizes behavioral contributions to the journal Teaching of Psychology from 1974 to 2006 is provided. A total of 116 articles of potential utility to college-level instructors of behavior analysis and related areas were identified, annotated, and organized into nine categories for ease of accessibility. PMID:22478500

  8. A Factor Graph Approach to Automated GO Annotation

    PubMed Central

    Spetale, Flavio E.; Tapia, Elizabeth; Krsticevic, Flavia; Roda, Fernando; Bulacio, Pilar

    2016-01-01

    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum. PMID:26771463

  9. Attitudes & Disability: An Annotated Bibliography, 1975-1981.

    ERIC Educational Resources Information Center

    Makas, Elaine, Comp.

    The annotated bibliography contains approximately 1,000 citations (1975-1981) dealing with attitudes related to disability. In addition to a brief annotation, entries include information on author, title, source, date, and pagination. Citations are classified according to the following topics: specific disabilities (cardiovascular impairment,…

  10. A Factor Graph Approach to Automated GO Annotation.

    PubMed

    Spetale, Flavio E; Tapia, Elizabeth; Krsticevic, Flavia; Roda, Fernando; Bulacio, Pilar

    2016-01-01

    As volume of genomic data grows, computational methods become essential for providing a first glimpse onto gene annotations. Automated Gene Ontology (GO) annotation methods based on hierarchical ensemble classification techniques are particularly interesting when interpretability of annotation results is a main concern. In these methods, raw GO-term predictions computed by base binary classifiers are leveraged by checking the consistency of predefined GO relationships. Both formal leveraging strategies, with main focus on annotation precision, and heuristic alternatives, with main focus on scalability issues, have been described in literature. In this contribution, a factor graph approach to the hierarchical ensemble formulation of the automated GO annotation problem is presented. In this formal framework, a core factor graph is first built based on the GO structure and then enriched to take into account the noisy nature of GO-term predictions. Hence, starting from raw GO-term predictions, an iterative message passing algorithm between nodes of the factor graph is used to compute marginal probabilities of target GO-terms. Evaluations on Saccharomyces cerevisiae, Arabidopsis thaliana and Drosophila melanogaster protein sequences from the GO Molecular Function domain showed significant improvements over competing approaches, even when protein sequences were naively characterized by their physicochemical and secondary structure properties or when loose noisy annotation datasets were considered. Based on these promising results and using Arabidopsis thaliana annotation data, we extend our approach to the identification of most promising molecular function annotations for a set of proteins of unknown function in Solanum lycopersicum.

  11. Annotated selected references on natural resources investigations, Collier County, Florida

    USGS Publications Warehouse

    Swayze, L.J.

    1981-01-01

    A data base for future natural resources investigations in Collier County, Fla., was initiated by compiling a selected annotated bibliography. This report provides references and annotations for selected reports released between 1950 and 1978. The references are presented by subject material as follows: biologic, ecologic, geologic, geochemical, and hydrologic. (USGS)

  12. Online Metacognitive Strategies, Hypermedia Annotations, and Motivation on Hypertext Comprehension

    ERIC Educational Resources Information Center

    Shang, Hui-Fang

    2016-01-01

    This study examined the effect of online metacognitive strategies, hypermedia annotations, and motivation on reading comprehension in a Taiwanese hypertext environment. A path analysis model was proposed based on the assumption that if English as a foreign language learners frequently use online metacognitive strategies and hypermedia annotations,…

  13. An Annotated Bibliography of Experimental Research concerning Competitive Swimming.

    ERIC Educational Resources Information Center

    Bachman, John C.

    This annotated bibliography has been compiled as a guide for the researcher of swimming in referring to experimental studies in the physiological, mechanical, psychological, and medical aspects of swimming. The studies have been briefly annotated to enable the reader to quickly determine the salient points the authors made in their studies. The…

  14. International Development and the Human Environment. An Annotated Bibliography.

    ERIC Educational Resources Information Center

    1974

    Most of the material in this annotated bibliography has been selected from the literature published between 1968 and 1972. Each annotation and citation is indexed by author, subject, and publisher. Entries are organized into 11 chapters: Environment, Development, and Conservation of Natural Resources; The Third World: Development and Economic…

  15. Math for Learning, Math for Life: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Elliott, Claire

    This document presents a total of 109 references and annotations of works that are in some way related to the topic of math for learning and life. Section 1 presents 68 annotated references with keywords drawn from the Canadian Literacy Thesaurus. Selected topics covered in the listed publications are as follows: numeracy as social practice; the…

  16. Short-Term Memory: An Annotated Bibliography. Supplement II.

    ERIC Educational Resources Information Center

    Fisher, Dennis F.

    This bibliography is an annotated compilation of 198 references dealing with short-term memory. It is added as a second supplement to Short-Term Memory: An Annotated Bibliography, August, 1968. The time period covered is predominantly June, 1969 to December, 1970. References included are arranged alphabetically by author. An alphabetical index of…

  17. From the Margins to the Center: The Future of Annotation.

    ERIC Educational Resources Information Center

    Wolfe, Joanna L.; Neuwirth, Christine M.

    2001-01-01

    Describes the importance of annotation to reading and writing practices and reviews new technologies that complicate the ways annotation can be used to support and enhance traditional reading, writing, and collaboration processes. Emphasizes issues and methods that will be productive for enhancing theories of workplace and classroom communication…

  18. Materials for Teaching about the Bicentiennial: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Wiley, Karen B.; Pestello, Roxy

    This annotated bibliography is intended for elementary through secondary social studies teachers who are looking for curriculum materials and resources for teaching about the Bicentennial. Over 100 annotated entries of selected curriculum and teacher materials are included in this bibliography, along with a selective list of organizations and…

  19. Orienteering: An Annotated Bibliography = Orientierungslauf: Eine kommentierte Bibliographie.

    ERIC Educational Resources Information Center

    Seiler, Roland, Ed.; Hartmann, Wolfgang, Ed.

    1994-01-01

    Annotated bibliography of 220 books, monographs, and journal articles on orienteering published 1984-94, from SPOLIT database of the Federal Institute of Sport Science (Cologne, Germany). Annotations in English or German. Ten sections including psychological, physiological, health, sociological, and environmental aspects; training and coaching;…

  20. PROVIDING CLINICAL SERVICES IN READING--AN ANNOTATED BIBLIOGRAPHY.

    ERIC Educational Resources Information Center

    JOHNSON, MARJORIE S.; KRESS, ROY A.

    THIS ANNOTATED BIBLIOGRAPHY IS COMPOSED OF 55 CITATIONS RANGING IN DATE FROM 1932 TO 1965. IT IS DESIGNED TO AID THOSE INTERESTED IN SETTING UP A READING CLINIC AND IS DIVIDED INTO TWO SECTIONS--THE ANNOTATED REVIEWS OF THE ARTICLES SELECTED AND A LIST OF PUBLICATIONS DEALING PRIMARILY WITH A CLINICAL APPROACH TO THE DIAGNOSIS AND TREATMENT OF…

  1. Automatic Annotation Method on Learners' Opinions in Case Method Discussion

    ERIC Educational Resources Information Center

    Samejima, Masaki; Hisakane, Daichi; Komoda, Norihisa

    2015-01-01

    Purpose: The purpose of this paper is to annotate an attribute of a problem, a solution or no annotation on learners' opinions automatically for supporting the learners' discussion without a facilitator. The case method aims at discussing problems and solutions in a target case. However, the learners miss discussing some of problems and solutions.…

  2. Using Annotation Services in a Ubiquitous Jigsaw Cooperative Learning Environment

    ERIC Educational Resources Information Center

    Huang, Yueh-Min; Huang, Tien-Chi; Hsieh, Meng-Yeh

    2008-01-01

    This study describes the development of a ubiquitous cooperative learning environment using proposed annotation services, wireless communication devices, and the Jigsaw method of cooperative learning. The purpose of the study is to investigate the potential benefits of studying digital course materials with embedded annotations. The SQ3R study…

  3. Molecular epidemiologic evaluation of transmissibility and virulence of Mycobacterium tuberculosis.

    PubMed

    Rhee, J T; Piatek, A S; Small, P M; Harris, L M; Chaparro, S V; Kramer, F R; Alland, D

    1999-06-01

    Discovery of genotypic markers associated with increased transmissibility in Mycobacterium tuberculosis would represent an important step in advancing mycobacterial virulence studies. M. tuberculosis strains may be classified into one of three genotypes on the basis of the presence of specific nucleotide substitutions in codon 463 of the katG gene (katG-463) and codon 95 of the gyrA gene (gyrA-95). It has previously been reported that two of these three genotypes are associated with increased IS6110-based clustering, a potential proxy of virulence. We designed a case-control analysis of U.S.-born patients with tuberculosis in San Francisco, Calif., between 1991 and 1997 to investigate associations between katG-463 and gyrA-95 genotypes and epidemiologically determined measures of strain-specific infectivity and pathogenicity and IS6110-based clustering status. We used a new class of molecular probes called molecular beacons to genotype the isolates rapidly. Infectivity was defined as the propensity of isolates to cause tuberculin skin test conversions among named contacts, and pathogenicity was defined as their propensity to cause active disease among named contacts. The molecular beacon assay was a simple and reproducible method for the detection of known single nucleotide polymorphisms in large numbers of clinical M. tuberculosis isolates. The results showed that no genotype of the katG-463- and gyrA-95-based classification system was associated with increased infectivity and pathogenicity or with increased IS6110-based clustering in San Francisco during the study period. We speculate that molecular epidemiologic studies investigating clinically relevant outcomes may contribute to the knowledge of the significance of laboratory-derived virulence factors in the propagation of tuberculosis in human communities.

  4. Annotated bibliography of software engineering laboratory literature

    NASA Technical Reports Server (NTRS)

    Kistler, David; Bristow, John; Smith, Don

    1994-01-01

    This document is an annotated bibliography of technical papers, documents, and memorandums produced by or related to the Software Engineering Laboratory. Nearly 200 publications are summarized. These publications cover many areas of software engineering and range from research reports to software documentation. This document has been updated and reorganized substantially since the original version (SEL-82-006, November 1982). All materials have been grouped into eight general subject areas for easy reference: (1) The Software Engineering Laboratory; (2) The Software Engineering Laboratory: Software Development Documents; (3) Software Tools; (4) Software Models; (5) Software Measurement; (6) Technology Evaluations; (7) Ada Technology; and (8) Data Collection. This document contains an index of these publications classified by individual author.

  5. Desert tortoise annotated bibliography, 1991-2015

    USGS Publications Warehouse

    Berry, Kristin H.; Lyren, Lisa M.; Mack, Jeremy S.; Brand, L. Arriana; Wood, Dustin A.

    2016-01-01

    Agassiz’s Desert Tortoise (hereinafter called desert tortoise) is a state- and federally-listed threatened species (U.S. Fish and Wildlife Service, 1990; California Department of Fish and Game, 2015). The first population federally listed as threatened occurred on the Beaver Dam Slope, Utah (U.S. Fish and Wildlife Service, 1980). In 1990, the entire geographic range north and west of the Colorado River was federally listed as threatened (U.S. Fish and Wildlife Service, 1990), with the exception being a small population in northwestern Arizona. The purpose of this annotated bibliography is to support recovery efforts for the species, because populations have continued to decline in spite of designation of critical habitat and publication of a recovery plan (U.S. Fish and Wildlife Service, 1994). For example, between 2005 and 2014, populations in critical habitats declined about 50% (U.S. Fish and Wildlife Service, 2015).

  6. Desert tortoise annotated bibliography, 1991-2015

    USGS Publications Warehouse

    Berry, Kristin H.; Lyren, Lisa M.; Mack, Jeremy S.; Brand, L. Arriana; Wood, Dustin A.

    2016-03-01

    Agassiz’s Desert Tortoise (hereinafter called desert tortoise) is a state- and federally-listed threatened species (U.S. Fish and Wildlife Service, 1990; California Department of Fish and Game, 2015). The first population federally listed as threatened occurred on the Beaver Dam Slope, Utah (U.S. Fish and Wildlife Service, 1980). In 1990, the entire geographic range north and west of the Colorado River was federally listed as threatened (U.S. Fish and Wildlife Service, 1990), with the exception being a small population in northwestern Arizona. The purpose of this annotated bibliography is to support recovery efforts for the species, because populations have continued to decline in spite of designation of critical habitat and publication of a recovery plan (U.S. Fish and Wildlife Service, 1994). For example, between 2005 and 2014, populations in critical habitats declined about 50% (U.S. Fish and Wildlife Service, 2015).

  7. Interpretation Errors related to the GO Annotation File Format

    PubMed Central

    Moreira, Dilvan A.; Shah, Nigam H.; Musen, Mark A.

    2007-01-01

    The Gene Ontology (GO) is the most widely used ontology for creating biomedical annotations. GO annotations are statements associating a biological entity with a GO term. These statements comprise a large dataset of biological knowledge that is used widely in biomedical research. GO Annotations are available as “gene association files” from the GO website in a tab-delimited file format (GO Annotation File Format) composed of rows of 15 tab-delimited fields. This simple format lacks the knowledge representation (KR) capabilities to represent unambiguously semantic relationships between each field. This paper demonstrates that this KR shortcoming leads users to interpret the files in ways that can be erroneous. We propose a complementary format to represent GO annotation files as knowledge bases using the W3C recommended Web Ontology Language (OWL). PMID:18693894

  8. The Vertebrate Genome Annotation browser 10 years on

    PubMed Central

    Harrow, Jennifer L.; Steward, Charles A.; Frankish, Adam; Gilbert, James G.; Gonzalez, Jose M.; Loveland, Jane E.; Mudge, Jonathan; Sheppard, Dan; Thomas, Mark; Trevanion, Stephen; Wilming, Laurens G.

    2014-01-01

    The Vertebrate Genome Annotation (VEGA) database (http://vega.sanger.ac.uk), initially designed as a community resource for browsing manual annotation of the human genome project, now contains five reference genomes (human, mouse, zebrafish, pig and rat). Its introduction pages have been redesigned to enable the user to easily navigate between whole genomes and smaller multi-species haplotypic regions of interest such as the major histocompatibility complex. The VEGA browser is unique in that annotation is updated via the Human And Vertebrate Analysis aNd Annotation (HAVANA) update track every 2 weeks, allowing single gene updates to be made publicly available to the research community quickly. The user can now access different haplotypic subregions more easily, such as those from the non-obese diabetic mouse, and display them in a more intuitive way using the comparative tools. We also highlight how the user can browse manually annotated updated patches from the Genome Reference Consortium (GRC). PMID:24316575

  9. Semantator: annotating clinical narratives with semantic web ontologies.

    PubMed

    Song, Dezhao; Chute, Christopher G; Tao, Cui

    2012-01-01

    To facilitate clinical research, clinical data needs to be stored in a machine processable and understandable way. Manual annotating clinical data is time consuming. Automatic approaches (e.g., Natural Language Processing systems) have been adopted to convert such data into structured formats; however, the quality of such automatically extracted data may not always be satisfying. In this paper, we propose Semantator, a semi-automatic tool for document annotation with Semantic Web ontologies. With a loaded free text document and an ontology, Semantator supports the creation/deletion of ontology instances for any document fragment, linking/disconnecting instances with the properties in the ontology, and also enables automatic annotation by connecting to the NCBO annotator and cTAKES. By representing annotations in Semantic Web standards, Semantator supports reasoning based upon the underlying semantics of the owl:disjointWith and owl:equivalentClass predicates. We present discussions based on user experiences of using Semantator.

  10. Interpretation errors related to the GO annotation file format.

    PubMed

    Moreira, Dilvan A; Shah, Nigam H; Musen, Mark A

    2007-01-01

    The Gene Ontology (GO) is the most widely used ontology for creating biomedical annotations. GO annotations are statements associating a biological entity with a GO term. These statements comprise a large dataset of biological knowledge that is used widely in biomedical research. GO Annotations are available as "gene association files" from the GO website in a tab-delimited file format (GO Annotation File Format) composed of rows of 15 tab-delimited fields. This simple format lacks the knowledge representation (KR) capabilities to represent unambiguously semantic relationships between each field. This paper demonstrates that this KR shortcoming leads users to interpret the files in ways that can be erroneous. We propose a complementary format to represent GO annotation files as knowledge bases using the W3C recommended Web Ontology Language (OWL).

  11. Inter-Annotator Reliability of Medical Events, Coreferences and Temporal Relations in Clinical Narratives by Annotators with Varying Levels of Clinical Expertise

    PubMed Central

    Raghavan, Preethi; Fosler-Lussier, Eric; Lai, Albert M.

    2012-01-01

    The manual annotation of clinical narratives is an important step for training and validating the performance of automated systems that utilize these clinical narratives. We build an annotation specification to capture medical events, and coreferences and temporal relations between medical events in clinical text. Unfortunately, the process of clinical data annotation is both time consuming and costly. Many annotation efforts have used physicians to annotate the data. We investigate using annotators that are current students or graduates from diverse clinical backgrounds with varying levels of clinical experience. In spite of this diversity, the annotation agreement across our team of annotators is high; the average inter-annotator kappa statistic for medical events, coreferences, temporal relations, and medical event concept unique identifiers was 0.843, 0.859, 0.833, and 0.806, respectively. We describe methods towards leveraging the annotations to support temporal reasoning with medical events. PMID:23304416

  12. The Disease and Gene Annotations (DGA): an annotation resource for human disease.

    PubMed

    Peng, Kai; Xu, Wei; Zheng, Jianyong; Huang, Kegui; Wang, Huisong; Tong, Jiansong; Lin, Zhifeng; Liu, Jun; Cheng, Wenqing; Fu, Dong; Du, Pan; Kibbe, Warren A; Lin, Simon M; Xia, Tian

    2013-01-01

    Disease and Gene Annotations database (DGA, http://dga.nubic.northwestern.edu) is a collaborative effort aiming to provide a comprehensive and integrative annotation of the human genes in disease network context by integrating computable controlled vocabulary of the Disease Ontology (DO version 3 revision 2510, which has 8043 inherited, developmental and acquired human diseases), NCBI Gene Reference Into Function (GeneRIF) and molecular interaction network (MIN). DGA integrates these resources together using semantic mappings to build an integrative set of disease-to-gene and gene-to-gene relationships with excellent coverage based on current knowledge. DGA is kept current by periodically reparsing DO, GeneRIF, and MINs. DGA provides a user-friendly and interactive web interface system enabling users to efficiently query, download and visualize the DO tree structure and annotations as a tree, a network graph or a tabular list. To facilitate integrative analysis, DGA provides a web service Application Programming Interface for integration with external analytic tools.

  13. Learning virulent proteins from integrated query networks

    PubMed Central

    2012-01-01

    Background Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. Results A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. Conclusions This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories. PMID:23198735

  14. Genetically manipulated virulence of Yersinia enterocolitica.

    PubMed Central

    Heesemann, J; Algermissen, B; Laufs, R

    1984-01-01

    Mobilizable virulence plasmids of Yersinia enterocolitica of serotypes O:3 and O:9 were constructed by cointegration of a mobilizable vector into the virulence plasmids. The obtained cointegrates were mobilized into plasmidless Y. enterocolitica strains of serotypes O:3, O:5, O:8, and O:9. The transfer experiments revealed the existence of two different subgroups of plasmid-associated traits. (i) Animal virulence functions (mouse lethality and conjuctivitis provocation) were only transferable to plasmid-cured derivatives of virulent parent strains (serotypes O:3, O:8, and O:9), but they were not transferable to Y. enterocolitica antigen reference strains (serotypes O:3 and O:8) or to a plasmidless clinical isolate of serotype O:5. A further striking result was that a serotype O:8 strain regained the mouse lethality trait after receipt of a plasmid from a strain not lethal to mice. These results demonstrate that plasmid-mediated animal virulence functions are not uniformly expressed within Y. enterocolitica. (ii) The second subgroup of plasmid-mediated traits (calcium dependency, surface agglutinogens, HEp-2 cell adherence, and protein release) were transferable to all Y. enterocolitica recipient strains tested (serotypes O:3, O:5, O:8, and O:9 of different origin). For the first time HEp-2 cell adherence and temperature-induced release of five major protein species are described as transferable traits. Images PMID:6480101

  15. Salmonella promotes virulence by repressing cellulose production.

    PubMed

    Pontes, Mauricio H; Lee, Eun-Jin; Choi, Jeongjoon; Groisman, Eduardo A

    2015-04-21

    Cellulose is the most abundant organic polymer on Earth. In bacteria, cellulose confers protection against environmental insults and is a constituent of biofilms typically formed on abiotic surfaces. We report that, surprisingly, Salmonella enterica serovar Typhimurium makes cellulose when inside macrophages. We determine that preventing cellulose synthesis increases virulence, whereas stimulation of cellulose synthesis inside macrophages decreases virulence. An attenuated mutant lacking the mgtC gene exhibited increased cellulose levels due to increased expression of the cellulose synthase gene bcsA and of cyclic diguanylate, the allosteric activator of the BcsA protein. Inactivation of bcsA restored wild-type virulence to the Salmonella mgtC mutant, but not to other attenuated mutants displaying a wild-type phenotype regarding cellulose. Our findings indicate that a virulence determinant can promote pathogenicity by repressing a pathogen's antivirulence trait. Moreover, they suggest that controlling antivirulence traits increases long-term pathogen fitness by mediating a trade-off between acute virulence and transmission.

  16. Plant Natural Products Targeting Bacterial Virulence Factors.

    PubMed

    Silva, Laura Nunes; Zimmer, Karine Rigon; Macedo, Alexandre José; Trentin, Danielle Silva

    2016-08-24

    Decreased antimicrobial efficiency has become a global public health issue. The paucity of new antibacterial drugs is evident, and the arsenal against infectious diseases needs to be improved urgently. The selection of plants as a source of prototype compounds is appropriate, since plant species naturally produce a wide range of secondary metabolites that act as a chemical line of defense against microorganisms in the environment. Although traditional approaches to combat microbial infections remain effective, targeting microbial virulence rather than survival seems to be an exciting strategy, since the modulation of virulence factors might lead to a milder evolutionary pressure for the development of resistance. Additionally, anti-infective chemotherapies may be successfully achieved by combining antivirulence and conventional antimicrobials, extending the lifespan of these drugs. This review presents an updated discussion of natural compounds isolated from plants with chemically characterized structures and activity against the major bacterial virulence factors: quorum sensing, bacterial biofilms, bacterial motility, bacterial toxins, bacterial pigments, bacterial enzymes, and bacterial surfactants. Moreover, a critical analysis of the most promising virulence factors is presented, highlighting their potential as targets to attenuate bacterial virulence. The ongoing progress in the field of antivirulence therapy may therefore help to translate this promising concept into real intervention strategies in clinical areas. PMID:27437994

  17. Salmonella promotes virulence by repressing cellulose production

    PubMed Central

    Pontes, Mauricio H.; Lee, Eun-Jin; Choi, Jeongjoon; Groisman, Eduardo A.

    2015-01-01

    Cellulose is the most abundant organic polymer on Earth. In bacteria, cellulose confers protection against environmental insults and is a constituent of biofilms typically formed on abiotic surfaces. We report that, surprisingly, Salmonella enterica serovar Typhimurium makes cellulose when inside macrophages. We determine that preventing cellulose synthesis increases virulence, whereas stimulation of cellulose synthesis inside macrophages decreases virulence. An attenuated mutant lacking the mgtC gene exhibited increased cellulose levels due to increased expression of the cellulose synthase gene bcsA and of cyclic diguanylate, the allosteric activator of the BcsA protein. Inactivation of bcsA restored wild-type virulence to the Salmonella mgtC mutant, but not to other attenuated mutants displaying a wild-type phenotype regarding cellulose. Our findings indicate that a virulence determinant can promote pathogenicity by repressing a pathogen's antivirulence trait. Moreover, they suggest that controlling antivirulence traits increases long-term pathogen fitness by mediating a trade-off between acute virulence and transmission. PMID:25848006

  18. Salmonella promotes virulence by repressing cellulose production.

    PubMed

    Pontes, Mauricio H; Lee, Eun-Jin; Choi, Jeongjoon; Groisman, Eduardo A

    2015-04-21

    Cellulose is the most abundant organic polymer on Earth. In bacteria, cellulose confers protection against environmental insults and is a constituent of biofilms typically formed on abiotic surfaces. We report that, surprisingly, Salmonella enterica serovar Typhimurium makes cellulose when inside macrophages. We determine that preventing cellulose synthesis increases virulence, whereas stimulation of cellulose synthesis inside macrophages decreases virulence. An attenuated mutant lacking the mgtC gene exhibited increased cellulose levels due to increased expression of the cellulose synthase gene bcsA and of cyclic diguanylate, the allosteric activator of the BcsA protein. Inactivation of bcsA restored wild-type virulence to the Salmonella mgtC mutant, but not to other attenuated mutants displaying a wild-type phenotype regarding cellulose. Our findings indicate that a virulence determinant can promote pathogenicity by repressing a pathogen's antivirulence trait. Moreover, they suggest that controlling antivirulence traits increases long-term pathogen fitness by mediating a trade-off between acute virulence and transmission. PMID:25848006

  19. Indole and 7‐hydroxyindole diminish Pseudomonas aeruginosa virulence

    PubMed Central

    Lee, Jintae; Attila, Can; Cirillo, Suat L. G.; Cirillo, Jeffrey D.; Wood, Thomas K.

    2009-01-01

    Summary Indole is an extracellular biofilm signal for Escherichia coli, and many bacterial oxygenases readily convert indole to various oxidized compounds including 7‐hydroxyindole (7HI). Here we investigate the impact of indole and 7HI on Pseudomonas aeruginosa PAO1 virulence and quorum sensing (QS)‐regulated phenotypes; this strain does not synthesize these compounds but degrades them rapidly. Indole and 7HI both altered extensively gene expression in a manner opposite that of acylhomoserine lactones; the most repressed genes encode the mexGHI‐opmD multidrug efflux pump and genes involved in the synthesis of QS‐regulated virulence factors including pyocyanin (phz operon), 2‐heptyl‐3‐hydroxy‐4(1H)‐quinolone (PQS) signal (pqs operon), pyochelin (pch operon) and pyoverdine (pvd operon). Corroborating these microarray results, indole and 7HI decreased production of pyocyanin, rhamnolipid, PQS and pyoverdine and enhanced antibiotic resistance. In addition, indole affected the utilization of carbon, nitrogen and phosphorus, and 7HI abolished swarming motility. Furthermore, 7HI reduced pulmonary colonization of P. aeruginosa in guinea pigs and increased clearance in lungs. Hence, indole‐related compounds have potential as a novel antivirulence approach for the recalcitrant pathogen P. aeruginosa. PMID:21261883

  20. Stress response signaling and virulence: insights from entomopathogenic fungi.

    PubMed

    Ortiz-Urquiza, Almudena; Keyhani, Nemat O

    2015-08-01

    The Ascomycete fungal insect pathogens, Beauveria and Metarhizium spp. have emerged as model systems with which to probe diverse aspects of fungal growth, stress response, and pathogenesis. Due to the availability of genomic resources and the development of robust methods for genetic manipulation, the last 5 years have witnessed a rapid increase in the molecular characterization of genes and their pathways involved in stress response and signal transduction in these fungi. These studies have been performed mainly via characterization of gene deletion/knockout mutants and have included the targeting of general proteins involved in stress response and/or virulence, e.g. catalases, superoxide dismutases, and osmolyte balance maintenance enzymes, membrane proteins and signaling pathways including GPI anchored proteins and G-protein coupled membrane receptors, MAPK pathways, e.g. (i) the pheromone/nutrient sensing, Fus3/Kss1, (ii) the cell wall integrity, Mpk1, and (iii) the high osmolarity, Hog1, the PKA/adenyl cyclase pathway, and various downstream transcription factors, e.g. Msn2, CreA and Pac1. Here, we will discuss current research that strongly suggests extensive underlying contributions of these biochemical and signaling pathways to both abiotic stress response and virulence. PMID:25113413

  1. Klebsiella pneumoniae FimK Promotes Virulence in Murine Pneumonia.

    PubMed

    Rosen, David A; Hilliard, Julia K; Tiemann, Kristin M; Todd, Elizabeth M; Morley, S Celeste; Hunstad, David A

    2016-02-15

    Klebsiella pneumoniae, a chief cause of nosocomial pneumonia, is a versatile and commonly multidrug-resistant human pathogen for which further insight into pathogenesis is needed. We show that the pilus regulatory gene fimK promotes the virulence of K. pneumoniae strain TOP52 in murine pneumonia. This contrasts with the attenuating effect of fimK on urinary tract virulence, illustrating that a single factor may exert opposing effects on pathogenesis in distinct host niches. Loss of fimK in TOP52 pneumonia was associated with diminished lung bacterial burden, limited innate responses within the lung, and improved host survival. FimK expression was shown to promote serum resistance, capsule production, and protection from phagocytosis by host immune cells. Finally, while the widely used K. pneumoniae model strain 43816 produces rapid dissemination and death in mice, TOP52 caused largely localized pneumonia with limited lethality, thereby providing an alternative tool for studying K. pneumoniae pathogenesis and control within the lung.

  2. Indole and 7-hydroxyindole diminish Pseudomonas aeruginosa virulence.

    PubMed

    Lee, Jintae; Attila, Can; Cirillo, Suat L G; Cirillo, Jeffrey D; Wood, Thomas K

    2009-01-01

    Indole is an extracellular biofilm signal for Escherichia coli, and many bacterial oxygenases readily convert indole to various oxidized compounds including 7-hydroxyindole (7HI). Here we investigate the impact of indole and 7HI on Pseudomonas aeruginosa PAO1 virulence and quorum sensing (QS)-regulated phenotypes; this strain does not synthesize these compounds but degrades them rapidly. Indole and 7HI both altered extensively gene expression in a manner opposite that of acylhomoserine lactones; the most repressed genes encode the mexGHI-opmD multidrug efflux pump and genes involved in the synthesis of QS-regulated virulence factors including pyocyanin (phz operon), 2-heptyl-3-hydroxy-4(1H)-quinolone (PQS) signal (pqs operon), pyochelin (pch operon) and pyoverdine (pvd operon). Corroborating these microarray results, indole and 7HI decreased production of pyocyanin, rhamnolipid, PQS and pyoverdine and enhanced antibiotic resistance. In addition, indole affected the utilization of carbon, nitrogen and phosphorus, and 7HI abolished swarming motility. Furthermore, 7HI reduced pulmonary colonization of P. aeruginosa in guinea pigs and increased clearance in lungs. Hence, indole-related compounds have potential as a novel antivirulence approach for the recalcitrant pathogen P. aeruginosa. PMID:21261883

  3. Drosophila Gene Expression Pattern Annotation Using Sparse Features and Term-Term Interactions

    PubMed Central

    Ji, Shuiwang; Yuan, Lei; Li, Ying-Xin; Zhou, Zhi-Hua; Kumar, Sudhir; Ye, Jieping

    2010-01-01

    The Drosophila gene expression pattern images document the spatial and temporal dynamics of gene expression and they are valuable tools for explicating the gene functions, interaction, and networks during Drosophila embryogenesis. To provide text-based pattern searching, the images in the Berkeley Drosophila Genome Project (BDGP) study are annotated with ontology terms manually by human curators. We present a systematic approach for automating this task, because the number of images needing text descriptions is now rapidly increasing. We consider both improved feature representation and novel learning formulation to boost the annotation performance. For feature representation, we adapt the bag-of-words scheme commonly used in visual recognition problems so that the image group information in the BDGP study is retained. Moreover, images from multiple views can be integrated naturally in this representation. To reduce the quantization error caused by the bag-of-words representation, we propose an improved feature representation scheme based on the sparse learning technique. In the design of learning formulation, we propose a local regularization framework that can incorporate the correlations among terms explicitly. We further show that the resulting optimization problem admits an analytical solution. Experimental results show that the representation based on sparse learning outperforms the bag-of-words representation significantly. Results also show that incorporation of the term-term correlations improves the annotation performance consistently. PMID:21614142

  4. Functional annotation of the human chromosome 7 "missing" proteins: a bioinformatics approach.

    PubMed

    Ranganathan, Shoba; Khan, Javed M; Garg, Gagan; Baker, Mark S

    2013-06-01

    The chromosome-centric human proteome project aims to systematically map all human proteins, chromosome by chromosome, in a gene-centric manner through dedicated efforts from national and international teams. This mapping will lead to a knowledge-based resource defining the full set of proteins encoded in each chromosome and laying the foundation for the development of a standardized approach to analyze the massive proteomic data sets currently being generated. The neXtProt database lists 946 proteins as the human proteome of chromosome 7. However, 170 (18%) proteins of human chromosome 7 have no evidence at the proteomic, antibody, or structural levels and are considered "missing" in this study as they lack experimental support. We have developed a protocol for the functional annotation of these "missing" proteins by integrating several bioinformatics analysis and annotation tools, sequential BLAST homology searches, protein domain/motif and gene ontology (GO) mapping, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Using the BLAST search strategy, homologues for reviewed non-human mammalian proteins with protein evidence were identified for 90 "missing" proteins while another 38 had reviewed non-human mammalian homologues. Putative functional annotations were assigned to 27 of the remaining 43 novel proteins. Proteotypic peptides have been computationally generated to facilitate rapid identification of these proteins. Four of the "missing" chromosome 7 proteins have been substantiated by the ENCODE proteogenomic peptide data.

  5. Two enzymes with redundant fructose bisphosphatase activity sustain gluconeogenesis and virulence in Mycobacterium tuberculosis

    PubMed Central

    Ganapathy, Uday; Marrero, Joeli; Calhoun, Susannah; Eoh, Hyungjin; de Carvalho, Luiz Pedro Sorio; Rhee, Kyu; Ehrt, Sabine

    2015-01-01

    The human pathogen Mycobacterium tuberculosis (Mtb) likely utilizes host fatty acids as a carbon source during infection. Gluconeogenesis is essential for the conversion of fatty acids into biomass. A rate-limiting step in gluconeogenesis is the conversion of fructose 1,6-bisphosphate to fructose 6-phosphate by a fructose bisphosphatase (FBPase). The Mtb genome contains only one annotated FBPase gene, glpX. Here we show that, unexpectedly, an Mtb mutant lacking GLPX grows on gluconeogenic carbon sources and has detectable FBPase activity. We demonstrate that the Mtb genome encodes an alternative FBPase (GPM2, Rv3214) that can maintain gluconeogenesis in the absence of GLPX. Consequently, deletion of both GLPX and GPM2 is required for disruption of gluconeogenesis and attenuation of Mtb in a mouse model of infection. Our work affirms a role for gluconeogenesis in Mtb virulence and reveals previously unidentified metabolic redundancy at the FBPase-catalysed reaction step of the pathway. PMID:26258286

  6. Viral population dynamics and virulence thresholds.

    PubMed

    Lancaster, Karen Z; Pfeiffer, Julie K

    2012-08-01

    Viral factors and host barriers influence virally induced disease, and asymptomatic versus symptomatic infection is governed by a 'virulence threshold'. Understanding modulation of virulence thresholds could lend insight into disease outcome and aid in rational therapeutic and vaccine design. RNA viruses are an excellent system to study virulence thresholds in the context of quasispecies population dynamics. RNA viruses have high error frequencies and our understanding of viral population dynamics has been shaped by quasispecies evolutionary theory. In turn, research using RNA viruses as replicons with short generation times and high mutation rates has been an invaluable tool to test models of quasispecies theory. The challenge and new frontier of RNA virus population dynamics research is to combine multiple theoretical models and experimental data to describe viral population behavior as it changes, moving within and between hosts, to predict disease and pathogen emergence. Several excellent studies have begun to undertake this challenge using novel approaches.

  7. Potential drivers of virulence evolution in aquaculture.

    PubMed

    Kennedy, David A; Kurath, Gael; Brito, Ilana L; Purcell, Maureen K; Read, Andrew F; Winton, James R; Wargo, Andrew R

    2016-02-01

    Infectious diseases are economically detrimental to aquaculture, and with continued expansion and intensification of aquaculture, the importance of managing infectious diseases will likely increase in the future. Here, we use evolution of virulence theory, along with examples, to identify aquaculture practices that might lead to the evolution of increased pathogen virulence. We identify eight practices common in aquaculture that theory predicts may favor evolution toward higher pathogen virulence. Four are related to intensive aquaculture operations, and four others are related specifically to infectious disease control. Our intention is to make aquaculture managers aware of these risks, such that with increased vigilance, they might be able to detect and prevent the emergence and spread of increasingly troublesome pathogen strains in the future.

  8. Genomic Recombination Leading to Decreased Virulence of Group B Streptococcus in a Mouse Model of Adult Invasive Disease

    PubMed Central

    Teatero, Sarah; Lemire, Paul; Dewar, Ken; Wasserscheid, Jessica; Calzas, Cynthia; Mallo, Gustavo V.; Li, Aimin; Athey, Taryn B.T.; Segura, Mariela; Fittipaldi, Nahuel

    2016-01-01

    Adult invasive disease caused by Group B Streptococcus (GBS) is increasing worldwide. Whole-genome sequencing (WGS) now permits rapid identification of recombination events, a phenomenon that occurs frequently in GBS. Using WGS, we described that strain NGBS375, a capsular serotype V GBS isolate of sequence type (ST)297, has an ST1 genomic background but has acquired approximately 300 kbp of genetic material likely from an ST17 strain. Here, we examined the virulence of this strain in an in vivo model of GBS adult invasive infection. The mosaic ST297 strain showed intermediate virulence, causing significantly less systemic infection and reduced mortality than a more virulent, serotype V ST1 isolate. Bacteremia induced by the ST297 strain was similar to that induced by a serotype III ST17 strain, which was the least virulent under the conditions tested. Yet, under normalized bacteremia levels, the in vivo intrinsic capacity to induce the production of pro-inflammatory cytokines was similar between the ST297 strain and the virulent ST1 strain. Thus, the diminished virulence of the mosaic strain may be due to reduced capacity to disseminate or multiply in blood during a systemic infection which could be mediated by regulatory factors contained in the recombined region. PMID:27527222

  9. Considerations to improve functional annotations in biological databases.

    PubMed

    Benítez-Páez, Alfonso

    2009-12-01

    Despite the great effort to design efficient systems allowing the electronic indexation of information concerning genes, proteins, structures, and interactions published daily in scientific journals, some problems are still observed in specific tasks such as functional annotation. The annotation of function is a critical issue for bioinformatic routines, such as for instance, in functional genomics and the further prediction of unknown protein function, which are highly dependent of the quality of existing annotations. Some information management systems evolve to efficiently incorporate information from large-scale projects, but often, annotation of single records from the literature is difficult and slow. In this short report, functional characterizations of a representative sample of the entire set of uncharacterized proteins from Escherichia coli K12 was compiled from Swiss-Prot, PubMed, and EcoCyc and demonstrate a functional annotation deficit in biological databases. Some issues are postulated as causes of the lack of annotation, and different solutions are evaluated and proposed to avoid them. The hope is that as a consequence of these observations, there will be new impetus to improve the speed and quality of functional annotation and ultimately provide updated, reliable information to the scientific community. PMID:20050264

  10. Open semantic annotation of scientific publications using DOMEO

    PubMed Central

    2012-01-01

    Background Our group has developed a useful shared software framework for performing, versioning, sharing and viewing Web annotations of a number of kinds, using an open representation model. Methods The Domeo Annotation Tool was developed in tandem with this open model, the Annotation Ontology (AO). Development of both the Annotation Framework and the open model was driven by requirements of several different types of alpha users, including bench scientists and biomedical curators from university research labs, online scientific communities, publishing and pharmaceutical companies. Several use cases were incrementally implemented by the toolkit. These use cases in biomedical communications include personal note-taking, group document annotation, semantic tagging, claim-evidence-context extraction, reagent tagging, and curation of textmining results from entity extraction algorithms. Results We report on the Domeo user interface here. Domeo has been deployed in beta release as part of the NIH Neuroscience Information Framework (NIF, http://www.neuinfo.org) and is scheduled for production deployment in the NIF’s next full release. Future papers will describe other aspects of this work in detail, including Annotation Framework Services and components for integrating with external textmining services, such as the NCBO Annotator web service, and with other textmining applications using the Apache UIMA framework. PMID:22541592

  11. Ontology modularization to improve semantic medical image annotation.

    PubMed

    Wennerberg, Pinar; Schulz, Klaus; Buitelaar, Paul

    2011-02-01

    Searching for medical images and patient reports is a significant challenge in a clinical setting. The contents of such documents are often not described in sufficient detail thus making it difficult to utilize the inherent wealth of information contained within them. Semantic image annotation addresses this problem by describing the contents of images and reports using medical ontologies. Medical images and patient reports are then linked to each other through common annotations. Subsequently, search algorithms can more effectively find related sets of documents on the basis of these semantic descriptions. A prerequisite to realizing such a semantic search engine is that the data contained within should have been previously annotated with concepts from medical ontologies. One major challenge in this regard is the size and complexity of medical ontologies as annotation sources. Manual annotation is particularly time consuming labor intensive in a clinical environment. In this article we propose an approach to reducing the size of clinical ontologies for more efficient manual image and text annotation. More precisely, our goal is to identify smaller fragments of a large anatomy ontology that are relevant for annotating medical images from patients suffering from lymphoma. Our work is in the area of ontology modularization, which is a recent and active field of research. We describe our approach, methods and data set in detail and we discuss our results.

  12. Validating Annotations for Uncharacterized Proteins in Shewanella oneidensis

    PubMed Central

    Louie, Brenton; Tarczy-Hornoch, Peter; Higdon, Roger

    2008-01-01

    Abstract Proteins of unknown function are a barrier to our understanding of molecular biology. Assigning function to these “uncharacterized” proteins is imperative, but challenging. The usual approach is similarity searches using annotation databases, which are useful for predicting function. However, since the performance of these databases on uncharacterized proteins is basically unknown, the accuracy of their predictions is suspect, making annotation difficult. To address this challenge, we developed a benchmark annotation dataset of 30 proteins in Shewanella oneidensis. The proteins in the dataset were originally uncharacterized after the initial annotation of the S. oneidensis proteome in 2002. In the intervening 5 years, the accumulation of new experimental evidence has enabled specific functions to be predicted. We utilized this benchmark dataset to evaluate several commonly utilized annotation databases. According to our criteria, six annotation databases accurately predicted functions for at least 60% of proteins in our dataset. Two of these six even had a “conditional accuracy” of 90%. Conditional accuracy is another evaluation metric we developed which excludes results from databases where no function was predicted. Also, 27 of the 30 proteins' functions were correctly predicted by at least one database. These represent one of the first performance evaluations of annotation databases on uncharacterized proteins. Our evaluation indicates that these databases readily incorporate new information and are accurate in predicting functions for uncharacterized proteins, provided that experimental function evidence exists. PMID:18687039

  13. Biofilm formation by virulent and non-virulent strains of Haemophilus parasuis.

    PubMed

    Bello-Ortí, Bernardo; Deslandes, Vincent; Tremblay, Yannick D N; Labrie, Josée; Howell, Kate J; Tucker, Alexander W; Maskell, Duncan J; Aragon, Virginia; Jacques, Mario

    2014-01-01

    Haemophilus parasuis is a commensal bacterium of the upper respiratory tract of healthy pigs. It is also the etiological agent of Glässer's disease, a systemic disease characterized by polyarthritis, fibrinous polyserositis and meningitis, which causes high morbidity and mortality in piglets. The aim of this study was to evaluate biofilm formation by well-characterized virulent and non-virulent strains of H. parasuis. We observed that non-virulent strains isolated from the nasal cavities of healthy pigs formed significantly (p < 0.05) more biofilms than virulent strains isolated from lesions of pigs with Glässer's disease. These differences were observed when biofilms were formed in microtiter plates under static conditions or formed in the presence of shear force in a drip-flow apparatus or a microfluidic system. Confocal laser scanning microscopy using different fluorescent probes on a representative subset of strains indicated that the biofilm matrix contains poly-N-acetylglucosamine, proteins and eDNA. The biofilm matrix was highly sensitive to degradation by proteinase K. Comparison of transcriptional profiles of biofilm and planktonic cells of the non-virulent H. parasuis F9 strain revealed a significant number of up-regulated membrane-related genes in biofilms, and genes previously identified in Actinobacillus pleuropneumoniae biofilms. Our data indicate that non-virulent strains of H. parasuis have the ability to form robust biofilms in contrast to virulent, systemic strains. Biofilm formation might therefore allow the non-virulent strains to colonize and persist in the upper respiratory tract of pigs. Conversely, the planktonic state of the virulent strains might allow them to disseminate within the host. PMID:25428823

  14. A Fatal Case of Necrotizing Fasciitis Caused by a Highly Virulent Escherichia coli Strain

    PubMed Central

    Vincent, André; Lin, Alex; Harel, Josée; Côté, Jean-Charles; Tremblay, Cécile

    2016-01-01

    Necrotizing fasciitis is a serious disease characterized by the necrosis of the subcutaneous tissues and fascia. E. coli as the etiologic agent of necrotizing fasciitis is a rare occurrence. A 66-year-old woman underwent total abdominal hysterectomy with bilateral salpingo-oophorectomy. She rapidly developed necrotizing fasciitis which led to her death 68 hours following surgery. An E. coli strain was isolated from blood and fascia cultures. DNA microarray revealed the presence of 20 virulence genes. PMID:27366162

  15. Gene ontology annotation by density and gravitation models.

    PubMed

    Hou, Wen-Juan; Lin, Kevin Hsin-Yih; Chen, Hsin-Hsi

    2006-01-01

    Gene Ontology (GO) is developed to provide standard vocabularies of gene products in different databases. The process of annotating GO terms to genes requires curators to read through lengthy articles. Methods for speeding up or automating the annotation process are thus of great importance. We propose a GO annotation approach using full-text biomedical documents for directing more relevant papers to curators. This system explores word density and gravitation relationships between genes and GO terms. Different density and gravitation models are built and several evaluation criteria are employed to assess the effects of the proposed methods. PMID:17503384

  16. Quantification of false positives within Moon Zoo crater annotations

    NASA Astrophysics Data System (ADS)

    Tar, P.; Thacker, N.

    2014-04-01

    The Moon Zoo citizen science project [1] allows members of the public to annotate lunar images, providing researchers with a wealth of location and size information regarding the population of small craters on the Moon. To date, approximately 4 million images have been inspected. Here, we show how a quantitative pattern recognition system can be used to estimate the quantity of contamination in Moon Zoo data from erroneous annotations. The proposed method produces not only estimates of true verses false crater annotations, but also a full error covariance, with additional conformity checks, which is essential for the meaningful interpretation of measurements, e.g. for plotting error bars.

  17. Coalescence and refinement of Moon Zoo crater annotations

    NASA Astrophysics Data System (ADS)

    Tar, P.; Thacker, N.

    2014-04-01

    The Moon Zoo citizen science project [1] allows members of the public to annotate lunar images, providing researchers with a wealth of location and size information regarding the population of small craters on the Moon. To date, approximately 4 million images have been inspected. Here, we show how data from multiple users can be combined to give a consensus as to the parameters of annotated craters. The process uses annotations and image data to provide Likelihood solutions, revealing the most probable crater parameters, from which crater Size-Frequency Distributions (SFDs) might be produced.

  18. An annotation system for 3D fluid flow visualization

    NASA Technical Reports Server (NTRS)

    Loughlin, Maria M.; Hughes, John F.

    1995-01-01

    Annotation is a key activity of data analysis. However, current systems for data analysis focus almost exclusively on visualization. We propose a system which integrates annotations into a visualization system. Annotations are embedded in 3D data space, using the Post-it metaphor. This embedding allows contextual-based information storage and retrieval, and facilitates information sharing in collaborative environments. We provide a traditional database filter and a Magic Lens filter to create specialized views of the data. The system has been customized for fluid flow applications, with features which allow users to store parameters of visualization tools and sketch 3D volumes.

  19. Gene ontology annotation by density and gravitation models.

    PubMed

    Hou, Wen-Juan; Lin, Kevin Hsin-Yih; Chen, Hsin-Hsi

    2006-01-01

    Gene Ontology (GO) is developed to provide standard vocabularies of gene products in different databases. The process of annotating GO terms to genes requires curators to read through lengthy articles. Methods for speeding up or automating the annotation process are thus of great importance. We propose a GO annotation approach using full-text biomedical documents for directing more relevant papers to curators. This system explores word density and gravitation relationships between genes and GO terms. Different density and gravitation models are built and several evaluation criteria are employed to assess the effects of the proposed methods.

  20. Using Comparative Genomics for Inquiry-Based Learning to Dissect Virulence of Escherichia coli O157:H7 and Yersinia pestis

    PubMed Central

    Baumler, David J.; Banta, Lois M.; Hung, Kai F.; Schwarz, Jodi A.; Cabot, Eric L.; Glasner, Jeremy D.; Perna, Nicole T.

    2012-01-01

    Genomics and bioinformatics are topics of increasing interest in undergraduate biological science curricula. Many existing exercises focus on gene annotation and analysis of a single genome. In this paper, we present two educational modules designed to enable students to learn and apply fundamental concepts in comparative genomics using examples related to bacterial pathogenesis. Students first examine alignments of genomes of Escherichia coli O157:H7 strains isolated from three food-poisoning outbreaks using the multiple-genome alignment tool Mauve. Students investigate conservation of virulence factors using the Mauve viewer and by browsing annotations available at the A Systematic Annotation Package for Community Analysis of Genomes database. In the second module, students use an alignment of five Yersinia pestis genomes to analyze single-nucleotide polymorphisms of three genes to classify strains into biovar groups. Students are then given sequences of bacterial DNA amplified from the teeth of corpses from the first and second pandemics of the bubonic plague and asked to classify these new samples. Learning-assessment results reveal student improvement in self-efficacy and content knowledge, as well as students' ability to use BLAST to identify genomic islands and conduct analyses of virulence factors from E. coli O157:H7 or Y. pestis. Each of these educational modules offers educators new ready-to-implement resources for integrating comparative genomic topics into their curricula. PMID:22383620

  1. T3SEdb: data warehousing of virulence effectors secreted by the bacterial Type III Secretion System

    PubMed Central

    2010-01-01

    Background Effectors of Type III Secretion System (T3SS) play a pivotal role in establishing and maintaining pathogenicity in the host and therefore the identification of these effectors is important in understanding virulence. However, the effectors display high level of sequence diversity, therefore making the identification a difficult process. There is a need to collate and annotate existing effector sequences in public databases to enable systematic analyses of these sequences for development of models for screening and selection of putative novel effectors from bacterial genomes that can be validated by a smaller number of key experiments. Results Herein, we present T3SEdb http://effectors.bic.nus.edu.sg/T3SEdb, a specialized database of annotated T3SS effector (T3SE) sequences containing 1089 records from 46 bacterial species compiled from the literature and public protein databases. Procedures have been defined for i) comprehensive annotation of experimental status of effectors, ii) submission and curation review of records by users of the database, and iii) the regular update of T3SEdb existing and new records. Keyword fielded and sequence searches (BLAST, regular expression) are supported for both experimentally verified and hypothetical T3SEs. More than 171 clusters of T3SEs were detected based on sequence identity comparisons (intra-cluster difference up to ~60%). Owing to this high level of sequence diversity of T3SEs, the T3SEdb provides a large number of experimentally known effector sequences with wide species representation for creation of effector predictors. We created a reliable effector prediction tool, integrated into the database, to demonstrate the application of the database for such endeavours. Conclusions T3SEdb is the first specialised database reported for T3SS effectors, enriched with manual annotations that facilitated systematic construction of a reliable prediction model for identification of novel effectors. The T3SEdb represents a

  2. Semantic Annotation for Biological Information Retrieval System

    PubMed Central

    Oshaiba, Mohamed Marouf Z.; El Houby, Enas M. F.; Salah, Akram

    2015-01-01

    Online literatures are increasing in a tremendous rate. Biological domain is one of the fast growing domains. Biological researchers face a problem finding what they are searching for effectively and efficiently. The aim of this research is to find documents that contain any combination of biological process and/or molecular function and/or cellular component. This research proposes a framework that helps researchers to retrieve meaningful documents related to their asserted terms based on gene ontology (GO). The system utilizes GO by semantically decomposing it into three subontologies (cellular component, biological process, and molecular function). Researcher has the flexibility to choose searching terms from any combination of the three subontologies. Document annotation is taking a place in this research to create an index of biological terms in documents to speed the searching process. Query expansion is used to infer semantically related terms to asserted terms. It increases the search meaningful results using the term synonyms and term relationships. The system uses a ranking method to order the retrieved documents based on the ranking weights. The proposed system achieves researchers' needs to find documents that fit the asserted terms semantically. PMID:25737720

  3. Semantic annotation for biological information retrieval system.

    PubMed

    Oshaiba, Mohamed Marouf Z; El Houby, Enas M F; Salah, Akram

    2015-01-01

    Online literatures are increasing in a tremendous rate. Biological domain is one of the fast growing domains. Biological researchers face a problem finding what they are searching for effectively and efficiently. The aim of this research is to find documents that contain any combination of biological process and/or molecular function and/or cellular component. This research proposes a framework that helps researchers to retrieve meaningful documents related to their asserted terms based on gene ontology (GO). The system utilizes GO by semantically decomposing it into three subontologies (cellular component, biological process, and molecular function). Researcher has the flexibility to choose searching terms from any combination of the three subontologies. Document annotation is taking a place in this research to create an index of biological terms in documents to speed the searching process. Query expansion is used to infer semantically related terms to asserted terms. It increases the search meaningful results using the term synonyms and term relationships. The system uses a ranking method to order the retrieved documents based on the ranking weights. The proposed system achieves researchers' needs to find documents that fit the asserted terms semantically.

  4. Arctic stream processes--an annotated bibliography

    USGS Publications Warehouse

    Scott, Kevin M.

    1979-01-01

    This bibliography selectively summarizes investigations to date (1978) dealing with the physical processes of streams in the Arctic. The specialized annotations include aspects of stream processes described in subordinate parts of general papers on the arctic environment and therefore not evident in author-abstract bibliographies. Foreign contributions--Canadian, Scandinavian, and Russian--are summarized, in the case of Russian literature primarily by means of papers in translation journals. Until 1970 the role of streams in development of the arctic landscape was commonly considered subordinate to that of glacial and frost-related processes. This conclusion changed, however, with the findings of the many new studies begun in response to oil and gas discoveries in the late 1960's. The conclusions of these studies, made to provide both the engineering data for resource development and the information to assess the impacts of that development, were in general agreement that stream processes throughout most of the Arctic were significantly more important than previously had been thought.

  5. Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists.

    PubMed

    Wiley, Laura K; Sivley, R Michael; Bush, William S

    2013-01-01

    Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file libraries. In this work, we introduce MyNCList, an implementation of the NCList data structure within a MySQL database. MyNCList enables the storage, update and rapid retrieval of genomic annotations from the convenience of a relational database system. Range-based annotations of 1 million variants are retrieved in under a minute, making this approach feasible for whole-genome annotation tasks. Database URL: https://github.com/bushlab/mynclist. PMID:23894185

  6. Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists.

    PubMed

    Wiley, Laura K; Sivley, R Michael; Bush, William S

    2013-01-01

    Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file libraries. In this work, we introduce MyNCList, an implementation of the NCList data structure within a MySQL database. MyNCList enables the storage, update and rapid retrieval of genomic annotations from the convenience of a relational database system. Range-based annotations of 1 million variants are retrieved in under a minute, making this approach feasible for whole-genome annotation tasks. Database URL: https://github.com/bushlab/mynclist.

  7. Enhanced Acylcarnitine Annotation in High-Resolution Mass Spectrometry Data: Fragmentation Analysis for the Classification and Annotation of Acylcarnitines

    PubMed Central

    van der Hooft, Justin J. J.; Ridder, Lars; Barrett, Michael P.; Burgess, Karl E. V.

    2015-01-01

    Metabolite annotation and identification are primary challenges in untargeted metabolomics experiments. Rigorous workflows for reliable annotation of mass features with chemical structures or compound classes are needed to enhance the power of untargeted mass spectrometry. High-resolution mass spectrometry considerably improves the confidence in assigning elemental formulas to mass features in comparison to nominal mass spectrometry, and embedding of fragmentation methods enables more reliable metabolite annotations and facilitates metabolite classification. However, the analysis of mass fragmentation spectra can be a time-consuming step and requires expert knowledge. This study demonstrates how characteristic fragmentations, specific to compound classes, can be used to systematically analyze their presence in complex biological extracts like urine that have undergone untargeted mass spectrometry combined with data dependent or targeted fragmentation. Human urine extracts were analyzed using normal phase liquid chromatography (hydrophilic interaction chromatography) coupled to an Ion Trap-Orbitrap hybrid instrument. Subsequently, mass chromatograms and collision-induced dissociation and higher-energy collisional dissociation (HCD) fragments were annotated using the freely available MAGMa software1. Acylcarnitines play a central role in energy metabolism by transporting fatty acids into the mitochondrial matrix. By filtering on a combination of a mass fragment and neutral loss designed based on the MAGMa fragment annotations, we were able to classify and annotate 50 acylcarnitines in human urine extracts, based on high-resolution mass spectrometry HCD fragmentation spectra at different energies for all of them. Of these annotated acylcarnitines, 31 are not described in HMDB yet and for only 4 annotated acylcarnitines the fragmentation spectra could be matched to reference spectra. Therefore, we conclude that the use of mass fragmentation filters within the context

  8. Leptospira interrogans Catalase Is Required for Resistance to H2O2 and for Virulence

    PubMed Central

    Eshghi, Azad; Lourdault, Kristel; Murray, Gerald L.; Bartpho, Thanatchaporn; Sermswan, Rasana W.; Picardeau, Mathieu; Adler, Ben; Snarr, Brendan; Zuerner, Richard L.

    2012-01-01

    Pathogenic Leptospira spp. are likely to encounter higher concentrations of reactive oxygen species induced by the host innate immune response. In this study, we characterized Leptospira interrogans catalase (KatE), the only annotated catalase found within pathogenic Leptospira species, by assessing its role in resistance to H2O2-induced oxidative stress and during infection in hamsters. Pathogenic L. interrogans bacteria had a 50-fold-higher survival rate under H2O2-induced oxidative stress than did saprophytic L. biflexa bacteria, and this was predominantly catalase dependent. We also characterized KatE, the only annotated catalase found within pathogenic Leptospira species. Catalase assays performed with recombinant KatE confirmed specific catalase activity, while protein fractionation experiments localized KatE to the bacterial periplasmic space. The insertional inactivation of katE in pathogenic Leptospira bacteria drastically diminished leptospiral viability in the presence of extracellular H2O2 and reduced virulence in an acute-infection model. Combined, these results suggest that L. interrogans KatE confers in vivo resistance to reactive oxygen species induced by the host innate immune response. PMID:22927050

  9. Mycobacterial virulence. Virulent strains of Mycobacteria tuberculosis have faster in vivo doubling times and are better equipped to resist growth-inhibiting functions of macrophages in the presence and absence of specific immunity.

    PubMed

    North, R J; Izzo, A A

    1993-06-01

    The kinetics of growth of two virulent strains of mycobacteria (M. tuberculosis Erdman and M. tuberculosis H37Rv) and two attenuated strains (M. tuberculosis H37Ra and M. bovis Bacillus Calmette-Guerin [BCG]) were studied in the lungs, livers, spleens, and kidneys of severe combined immunodeficient (SCID) mice and of their coisogenic CB-17 immunocompetent counterparts. It was found, in keeping with the findings of earlier investigators (Pierce, C. H., R. J. Dubos, and W. B. Schaefer. 1953. J. Exp. Med. 97:189.), that in immunocompetent mice, virulent organisms grew progressively only in the lungs, whereas the growth of attenuated organisms was controlled in all organs. In SCID mice, in contrast, virulent mycobacteria grew rapidly and progressively in all organs, as did BCG, although at a slower rate. However, H37Ra failed to grow progressively in any organs of SCID mice, unless the mice were treated with hydrocortisone. In fact, hydrocortisone treatment enabled virulent, as well as attenuated, organisms to grow strikingly more rapidly in all organs of SCID mice and in all organs of CB-17 mice. A histological study showed that in SCID mice, multiplication of mycobacteria in the liver occurs in the cytoplasm of macrophages in granulomas and presumably in macrophages in other organs. It is suggested, therefore, that the macrophages of SCID mice possess a glucocorticoid-sensitive mycobacterial mechanism that prevents virulent and avirulent mycobacteria from expressing their true minimal doubling times. In the absence of this mechanism in the lungs of hydrocortisone-treated SCID mice, the doubling times of Erdman, H37Rv, BCG, and H37Ra were 17.7, 17.4, 44.6, and 98.6 h, respectively. The possible importance of a rapid multiplication rate for mycobacterial virulence is discussed. PMID:8496688

  10. MitoBamAnnotator: A web-based tool for detecting and annotating heteroplasmy in human mitochondrial DNA sequences.

    PubMed

    Zhidkov, Ilia; Nagar, Tal; Mishmar, Dan; Rubin, Eitan

    2011-11-01

    The use of Next-Generation Sequencing of mitochondrial DNA is becoming widespread in biological and clinical research. This, in turn, creates a need for a convenient tool that detects and analyzes heteroplasmy. Here we present MitoBamAnnotator, a user friendly web-based tool that allows maximum flexibility and control in heteroplasmy research. MitoBamAnnotator provides the user with a comprehensively annotated overview of mitochondrial genetic variation, allowing for an in-depth analysis with no prior knowledge in programming.

  11. Entamoeba histolytica. Phagocytosis as a virulence factor

    PubMed Central

    1983-01-01

    In this paper, we attempted to define the role of phagocytosis in the virulence of Entamoeba histolytica. We have isolated, from a highly phagocytic and virulent strain, a clone deficient in phagocytosis. Trophozoites of wild-type strain HM1:IMSS were fed with Escherichia coli strain CR34-Thy- grown on 5-bromo,2'-deoxyuridine. The trophozoites that had incorporated the base analog through phagocytosis of the bacteria were killed by irradiation with 310 nm light. The survivors, presumably trophozoites defective in phagocytosis, were grown until log phase and submitted two more times to the selection procedure. Clone L-6, isolated from a subpopulation resulting from this selection procedure, showed 75-85% less erythrophagocytic activity than the wild-type strain. The virulence of clone L-6 and strain HM1:IMSS was measured. The inoculum required to induce liver abscesses in 50% of the newborn hamsters inoculated (AD50) of HM1:IMSS was 1.5 X 10(4) trophozoites. Clone L-6 trophozoites failed to induce liver abscesses in newborn hamsters even with inocula of 5 X 10(5) trophozoites. Virulence revertants were obtained by successive passage of L-6 trophozoites through the liver of young hamsters. The trophozoites that recovered the ability to produce liver abscesses simultaneously recuperate high erythrophagocytic rates. These results show that phagocytosis is involved in the aggressive mechanisms of E. histolytica. PMID:6313842

  12. Entamoeba histolytica: oxygen resistance and virulence.

    PubMed

    Ramos-Martínez, Espiridión; Olivos-García, Alfonso; Saavedra, Emma; Nequiz, Mario; Sánchez, Ernesto C; Tello, Eusebio; El-Hafidi, Mohamed; Saralegui, Andrés; Pineda, Erika; Delgado, José; Montfort, Irmgard; Pérez-Tamayo, Ruy

    2009-05-01

    Entamoeba histolytica virulence has been attributed to several amoebic molecules such as adhesins, amoebapores and cysteine proteinases, but supporting evidence is either partial or indirect. In this work we compared several in vitro and in vivo features of both virulent E. histolytica (vEh) and non-virulent E. histolytica (nvEh) axenic HM-1 IMSS strains, such as complement resistance, proteinase activity, haemolytic, phagocytic and cytotoxic capacities, survival in mice caecum, and susceptibility to O(2). The only difference observed was a higher in vitro susceptibility of nvEh to O(2). The molecular mechanism of that difference was analyzed in both groups of amoebae after high O(2) exposure. vEh O(2) resistance correlated with: (i) higher O(2) reduction (O(2)(-) and H(2)O(2) production); (ii) increased H(2)O(2) resistance and thiol peroxidase activity, and (iii) reversible pyruvate: ferredoxin oxidoreductase (PFOR) inhibition. Despite the high level of carbonylated proteins in nvEh after O(2) exposure, membrane oxidation by reactive oxygen species was not observed. These results suggest that the virulent phenotype of E. histolytica is related to the greater ability to reduce O(2) and H(2)O(2) as well as PFOR reactivation, whereas nvEh undergoes irreversible PFOR inhibition resulting in metabolic failure and amoebic death.

  13. Molecular nature of virulence in Entamoeba histolytica.

    PubMed

    Olivos-García, Alfonso; Saavedra, Emma; Ramos-Martínez, Espiridión; Nequiz, Mario; Pérez-Tamayo, Ruy

    2009-12-01

    For many years virulence of pathogenic Entamoeba histolytica has been attributed to the capacity of the parasite to destroy tissues through the expression and/or secretion of various molecules. Such view is supported mainly by in vitro experimentation, whereas data obtained by using animal models of the disease have clearly demonstrated that the host's inflammatory response is primarily responsible for tissue damage. This review analyzes the content and/or activity of some of the presumed toxic amebic molecules present in amebic strains with different degrees of virulence compared to various parasite in vitro functions that are supposed to correlate with in vivo virulence. The analysis suggests that amebic virulence is primarily determined by the parasite's capacity to adapt and survive the aerobic conditions present in animal tissues. This initial episode in the host-parasite relationship is an absolute requirement for the further development of tissue lesions, which result from the concerted action of many molecules derived from both, the host and the parasite.

  14. Rare Helicobacter pylori Virulence Genotypes in Bhutan

    PubMed Central

    Matsunari, Osamu; Miftahussurur, Muhammad; Shiota, Seiji; Suzuki, Rumiko; Vilaichone, Ratha-korn; Uchida, Tomohisa; Ratanachu-ek, Thawee; Tshering, Lotay; Mahachai, Varocha; Yamaoka, Yoshio

    2016-01-01

    Both the prevalence of Helicobacter pylori infection and the incidence of gastric cancer are high in Bhutan. The high incidence of atrophic gastritis and gastric cancer suggest the phylogeographic origin of an infection with a more virulent strain of H. pylori. More than 90% of Bhutanese strains possessed the highly virulent East Asian-type CagA and all strains had the most virulent type of vacA (s1 type). More than half also had multiple repeats in East Asian-type CagA, which are rare in other countries and are reported characteristictly found in assciation with atrophic gastritis and gastric cancer consistent with Bhutanese strains having multiple H. pylori virulence factors associated with an increase in gastric cancer risk. Phylogeographic analyses showed that most Bhutanese strains belonged to the East Asian population type with some strains (17.5%) sharing East Asian and Amerindian components. Only 9.5% belonged to the European type consistant with H. pylori in Bhutan representing an intermediate evolutionary stage between H. pylori from European and East Asian countries. PMID:26931643

  15. Entamoeba histolytica: oxygen resistance and virulence.

    PubMed

    Ramos-Martínez, Espiridión; Olivos-García, Alfonso; Saavedra, Emma; Nequiz, Mario; Sánchez, Ernesto C; Tello, Eusebio; El-Hafidi, Mohamed; Saralegui, Andrés; Pineda, Erika; Delgado, José; Montfort, Irmgard; Pérez-Tamayo, Ruy

    2009-05-01

    Entamoeba histolytica virulence has been attributed to several amoebic molecules such as adhesins, amoebapores and cysteine proteinases, but supporting evidence is either partial or indirect. In this work we compared several in vitro and in vivo features of both virulent E. histolytica (vEh) and non-virulent E. histolytica (nvEh) axenic HM-1 IMSS strains, such as complement resistance, proteinase activity, haemolytic, phagocytic and cytotoxic capacities, survival in mice caecum, and susceptibility to O(2). The only difference observed was a higher in vitro susceptibility of nvEh to O(2). The molecular mechanism of that difference was analyzed in both groups of amoebae after high O(2) exposure. vEh O(2) resistance correlated with: (i) higher O(2) reduction (O(2)(-) and H(2)O(2) production); (ii) increased H(2)O(2) resistance and thiol peroxidase activity, and (iii) reversible pyruvate: ferredoxin oxidoreductase (PFOR) inhibition. Despite the high level of carbonylated proteins in nvEh after O(2) exposure, membrane oxidation by reactive oxygen species was not observed. These results suggest that the virulent phenotype of E. histolytica is related to the greater ability to reduce O(2) and H(2)O(2) as well as PFOR reactivation, whereas nvEh undergoes irreversible PFOR inhibition resulting in metabolic failure and amoebic death. PMID:19073188

  16. Bacterial Reductionism: Host Thiols Enhance Virulence

    PubMed Central

    Sperandio, Vanessa

    2016-01-01

    Intracellular bacteria exploit host cytosolic signals to upregulate virulence genes. In this issue of Cell Host & Microbe, Wong et al. (2015) show that Burkholderia pseudomallei senses host cytosolic glutathione, a low-molecular-weight thiol, through the membrane-bound histidine sensor kinase VirA, highlighting the importance of inter-kingdom signaling in bacterial pathogenesis. PMID:26159714

  17. Virulence Factor-activity Relationships: Workshop Summary

    EPA Science Inventory

    The concept or notion of virulence factor–activity relationships (VFAR) is an approach for identifying an analogous process to the use of qualitative structure–activity relationships (QSAR) for identifying new microbial contaminants. In QSAR, it is hypothesized that, for new chem...

  18. Virulent Aeromonas hydrophila in channel catfish

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this study, we investigated factors that predisposed catfish to motile aeromonas septicemia (MAS) caused by virulent Aeromonas hydrophila (vAh). Our results revealed that wounding on fish body surface was a prerequisite for vAh infection and disease development. A reproducible waterborne challeng...

  19. Are secondary metabolites dispensable for virulence?

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The production of toxins by conidial fungal pathogens and their association with virulence has been assumed to occur in vivo and is widely accepted as dogma, but this association has yet to be definitively proven by either genetic or chemical means. Several studies from our labs have used targeted g...

  20. Geothermal wetlands: an annotated bibliography of pertinent literature

    SciTech Connect

    Stanley, N.E.; Thurow, T.L.; Russell, B.F.; Sullivan, J.F.

    1980-05-01

    This annotated bibliography covers the following topics: algae, wetland ecosystems; institutional aspects; macrophytes - general, production rates, and mineral absorption; trace metal absorption; wetland soils; water quality; and other aspects of marsh ecosystems. (MHR)

  1. The Integration of Baseball: An Annotated Bibliography of Nonfiction Books.

    ERIC Educational Resources Information Center

    Kaplan, Ron

    2002-01-01

    This annotated bibliography of nonfiction books on the integration of baseball focuses on the Negro leagues, books for young readers, individual teams, autobiographies and biographies of the pioneers, and autobiographies and biographies of the African American major leaguers. (SM)

  2. Annotated Bibliography of Recent Research Related to Academic Advising

    ERIC Educational Resources Information Center

    Mottarella, Karen, Comp.

    2011-01-01

    This article presents an annotated bibliography of recent research related to academic advising. It includes research papers that focus on advising and a special section of the "Journal of Career Development" that is devoted to multicultural graduate advising relationships.

  3. Ethics in oncology: an annotated bibliography of important literature.

    PubMed

    Tenner, Laura L; Helft, Paul R

    2013-07-01

    The aim of this annotated bibliography about important articles in the field of ethics and oncology is to provide the practicing hematologist/oncologist with a brief overview of some of the important literature in this crucial area.

  4. Effects of dehydration on performance in man: Annotated bibliography

    NASA Technical Reports Server (NTRS)

    Greenleaf, J. E.

    1973-01-01

    A compilation of studies on the effect of dehydration on human performance and related physiological mechanisms. The annotations are listed in alphabetical order by first author and cover material through June 1973.

  5. Civil War Resources on the Internet: An Annotated Bibliography.

    ERIC Educational Resources Information Center

    Vincenti, William; Sielaff, McKinley

    1997-01-01

    Presents an annotated bibliography of online resources on the Civil War from the Abolition movement through Reconstruction, drawn from Rutgers University's (New Jersey) Web page. Lists primary resources, secondary sources, online bibliographies and indexes, general sites, and listservs. (AEF)

  6. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  7. Analysis of Annotation on Documents for Recycling Information

    NASA Astrophysics Data System (ADS)

    Nakai, Tomohiro; Kondo, Nobuyuki; Kise, Koichi; Matsumoto, Keinosuke

    In order to make collaborative business activities fruitful, it is essential to know characteristics of organizations and persons in more details and to gather information relevant to the activities. In this paper, we describe a notion of “information recycle" that actualizes these requirements by analyzing documents. The key of recycling information is to utilize annotations on documents as clues for generating users' profiles and for weighting contents in the context of the activities. We also propose a method of extracting annotations on paper documents just by pressing one button with the help of techniques of camera-based document image analysis. Experimental results demonstrate that it is fundamentally capable of acquiring annotations on paper documents on condition that their electronic versions without annotations are available for the processing.

  8. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  9. Ethics in Oncology: An Annotated Bibliography of Important Literature

    PubMed Central

    Tenner, Laura L.; Helft, Paul R.

    2013-01-01

    The aim of this annotated bibliography about important articles in the field of ethics and oncology is to provide the practicing hematologist/oncologist with a brief overview of some of the important literature in this crucial area. PMID:23942932

  10. Towards Experimental Annotation of Genes by High Throughput Sequencing

    SciTech Connect

    Bradbury, Andrew

    2010-06-03

    Andrew Bradbury of Los Alamos National Laboratory discusses turning annotation into a sequencing pipeline on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  11. Annotated Bibliography of Research in the Teaching of English.

    ERIC Educational Resources Information Center

    Brown, Deborah; Kalman, Judith; Gomez, Macrina; Rijlaarsdam, Gert; Stinson, Anne D'Antonio; Whiting, Melissa E.

    2001-01-01

    Presents 36 annotations of journal articles (published between January and June, 2001) dealing with assessment, bilingual/foreign language education, literacy, professional development, reading, teaching and learning of literature, teaching and learning of writing, and technology and literacy. (SG)

  12. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search.

  13. Descriptive Cataloging: A Selected, Annotated Bibliography, 1984-1985.

    ERIC Educational Resources Information Center

    Cook, C. Donald; Jones, Ellen

    1986-01-01

    This annotated bibliography of materials published during 1984-1985 on descriptive cataloging covers bibliographic control, Anglo American Cataloging Rules, 2nd edition (AACR2), specific types of materials, authority control, retrospective conversion, management issues, expert systems, and manuals. (EM)

  14. From the Outside-In: The Francisella tularensis Envelope and Virulence

    PubMed Central

    Rowe, Hannah M.; Huntley, Jason F.

    2015-01-01

    Francisella tularensis is a highly-infectious bacterium that causes the rapid, and often lethal disease, tularemia. Many studies have been performed to identify and characterize the virulence factors that F. tularensis uses to infect a wide variety of hosts and host cell types, evade immune defenses, and induce severe disease and death. This review focuses on the virulence factors that are present in the F. tularensis envelope, including capsule, LPS, outer membrane, periplasm, inner membrane, secretion systems, and various molecules in each of aforementioned sub-compartments. Whereas, no single bacterial molecule or molecular complex single-handedly controls F. tularensis virulence, we review here how diverse bacterial systems work in conjunction to subvert the immune system, attach to and invade host cells, alter phagosome/lysosome maturation pathways, replicate in host cells without being detected, inhibit apoptosis, and induce host cell death for bacterial release and infection of adjacent cells. Given that the F. tularensis envelope is the outermost layer of the bacterium, we highlight herein how many of these molecules directly interact with the host to promote infection and disease. These and future envelope studies are important to advance our collective understanding of F. tularensis virulence mechanisms and offer targets for future vaccine development efforts. PMID:26779445

  15. Selection, Recombination, and Virulence Gene Diversity among Group B Streptococcal Genotypes▿ †

    PubMed Central

    Springman, A. Cody; Lacher, David W.; Wu, Guangxi; Milton, Nicole; Whittam, Thomas S.; Davies, H. Dele; Manning, Shannon D.

    2009-01-01

    Transmission of group B Streptococcus (GBS) from mothers to neonates during childbirth is a leading cause of neonatal sepsis and meningitis. Although subtyping tools have identified specific GBS phylogenetic lineages that are important in neonatal disease, little is known about the genetic diversity of these lineages or the roles that recombination and selection play in the generation of emergent genotypes. Here, we examined genetic variation, selection, and recombination in seven multilocus sequence typing (MLST) loci from 94 invasive, colonizing, and bovine strains representing 38 GBS sequence types and performed DNA sequencing and PCR-based restriction fragment length polymorphism analysis of several putative virulence genes to identify gene content differences between genotypes. Despite the low level of diversity in the MLST loci, a neighbor net analysis revealed a variable range of genetic exchange among the seven clonal complexes (CCs) identified, suggesting that recombination is partly responsible for the diversity observed between genotypes. Recombination is also important for several virulence genes, as some gene alleles had evidence for lateral gene exchange across divergent genotypes. The CC-17 lineage, which is associated with neonatal disease, is relatively homogeneous and therefore appears to have diverged independently with an exclusive set of virulence characteristics. These data suggest that different GBS genetic backgrounds have distinct virulence gene profiles that may be important for disease pathogenesis. Such profiles could be used as markers for the rapid detection of strains with an increased propensity to cause neonatal disease and may be considered useful vaccine targets. PMID:19581371

  16. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    SciTech Connect

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  17. AmiGO: online access to ontology and annotation data

    SciTech Connect

    Carbon, Seth; Ireland, Amelia; Mungall, Christopher J.; Shu, ShengQiang; Marshall, Brad; Lewis, Suzanna

    2009-01-15

    AmiGO is a web application that allows users to query, browse, and visualize ontologies and related gene product annotation (association) data. AmiGO can be used online at the Gene Ontology (GO) website to access the data provided by the GO Consortium; it can also be downloaded and installed to browse local ontologies and annotations. AmiGO is free open source software developed and maintained by the GO Consortium.

  18. Semantic annotation of biological concepts interplaying microbial cellular responses

    PubMed Central

    2011-01-01

    Background Automated extraction systems have become a time saving necessity in Systems Biology. Considerable human effort is needed to model, analyse and simulate biological networks. Thus, one of the challenges posed to Biomedical Text Mining tools is that of learning to recognise a wide variety of biological concepts with different functional roles to assist in these processes. Results Here, we present a novel corpus concerning the integrated cellular responses to nutrient starvation in the model-organism Escherichia coli. Our corpus is a unique resource in that it annotates biomedical concepts that play a functional role in expression, regulation and metabolism. Namely, it includes annotations for genetic information carriers (genes and DNA, RNA molecules), proteins (transcription factors, enzymes and transporters), small metabolites, physiological states and laboratory techniques. The corpus consists of 130 full-text papers with a total of 59043 annotations for 3649 different biomedical concepts; the two dominant classes are genes (highest number of unique concepts) and compounds (most frequently annotated concepts), whereas other important cellular concepts such as proteins account for no more than 10% of the annotated concepts. Conclusions To the best of our knowledge, a corpus that details such a wide range of biological concepts has never been presented to the text mining community. The inter-annotator agreement statistics provide evidence of the importance of a consolidated background when dealing with such complex descriptions, the ambiguities naturally arising from the terminology and their impact for modelling purposes. Availability is granted for the full-text corpora of 130 freely accessible documents, the annotation scheme and the annotation guidelines. Also, we include a corpus of 340 abstracts. PMID:22122862

  19. An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry.

    PubMed

    Dhanyalakshmi, K H; Naika, Mahantesha B N; Sajeevan, R S; Mathew, Oommen K; Shafi, K Mohamed; Sowdhamini, Ramanathan; N Nataraja, Karaba

    2016-01-01

    The modern sequencing technologies are generating large volumes of information at the transcriptome and genome level. Translation of this information into a biological meaning is far behind the race due to which a significant portion of proteins discovered remain as proteins of unknown function (PUFs). Attempts to uncover the functional significance of PUFs are limited due to lack of easy and high throughput functional annotation tools. Here, we report an approach to assign putative functions to PUFs, identified in the transcriptome of mulberry, a perennial tree commonly cultivated as host of silkworm. We utilized the mulberry PUFs generated from leaf tissues exposed to drought stress at whole plant level. A sequence and structure based computational analysis predicted the probable function of the PUFs. For rapid and easy annotation of PUFs, we developed an automated pipeline by integrating diverse bioinformatics tools, designated as PUFs Annotation Server (PUFAS), which also provides a web service API (Application Programming Interface) for a large-scale analysis up to a genome. The expression analysis of three selected PUFs annotated by the pipeline revealed abiotic stress responsiveness of the genes, and hence their potential role in stress acclimation pathways. The automated pipeline developed here could be extended to assign functions to PUFs from any organism in general. PUFAS web server is available at http://caps.ncbs.res.in/pufas/ and the web service is accessible at http://capservices.ncbs.res.in/help/pufas. PMID:26982336

  20. An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry

    PubMed Central

    Dhanyalakshmi, K. H.; Naika, Mahantesha B. N.; Sajeevan, R. S.; Mathew, Oommen K.; Shafi, K. Mohamed; Sowdhamini, Ramanathan; N. Nataraja, Karaba

    2016-01-01

    The modern sequencing technologies are generating large volumes of information at the transcriptome and genome level. Translation of this information into a biological meaning is far behind the race due to which a significant portion of proteins discovered remain as proteins of unknown function (PUFs). Attempts to uncover the functional significance of PUFs are limited due to lack of easy and high throughput functional annotation tools. Here, we report an approach to assign putative functions to PUFs, identified in the transcriptome of mulberry, a perennial tree commonly cultivated as host of silkworm. We utilized the mulberry PUFs generated from leaf tissues exposed to drought stress at whole plant level. A sequence and structure based computational analysis predicted the probable function of the PUFs. For rapid and easy annotation of PUFs, we developed an automated pipeline by integrating diverse bioinformatics tools, designated as PUFs Annotation Server (PUFAS), which also provides a web service API (Application Programming Interface) for a large-scale analysis up to a genome. The expression analysis of three selected PUFs annotated by the pipeline revealed abiotic stress responsiveness of the genes, and hence their potential role in stress acclimation pathways. The automated pipeline developed here could be extended to assign functions to PUFs from any organism in general. PUFAS web server is available at http://caps.ncbs.res.in/pufas/ and the web service is accessible at http://capservices.ncbs.res.in/help/pufas. PMID:26982336

  1. An Approach to Function Annotation for Proteins of Unknown Function (PUFs) in the Transcriptome of Indian Mulberry.

    PubMed

    Dhanyalakshmi, K H; Naika, Mahantesha B N; Sajeevan, R S; Mathew, Oommen K; Shafi, K Mohamed; Sowdhamini, Ramanathan; N Nataraja, Karaba

    2016-01-01

    The modern sequencing technologies are generating large volumes of information at the transcriptome and genome level. Translation of this information into a biological meaning is far behind the race due to which a significant portion of proteins discovered remain as proteins of unknown function (PUFs). Attempts to uncover the functional significance of PUFs are limited due to lack of easy and high throughput functional annotation tools. Here, we report an approach to assign putative functions to PUFs, identified in the transcriptome of mulberry, a perennial tree commonly cultivated as host of silkworm. We utilized the mulberry PUFs generated from leaf tissues exposed to drought stress at whole plant level. A sequence and structure based computational analysis predicted the probable function of the PUFs. For rapid and easy annotation of PUFs, we developed an automated pipeline by integrating diverse bioinformatics tools, designated as PUFs Annotation Server (PUFAS), which also provides a web service API (Application Programming Interface) for a large-scale analysis up to a genome. The expression analysis of three selected PUFs annotated by the pipeline revealed abiotic stress responsiveness of the genes, and hence their potential role in stress acclimation pathways. The automated pipeline developed here could be extended to assign functions to PUFs from any organism in general. PUFAS web server is available at http://caps.ncbs.res.in/pufas/ and the web service is accessible at http://capservices.ncbs.res.in/help/pufas.

  2. Sample collection of virulent and non-virulent B. anthracis and Y. pestis for bioforensics analysis

    SciTech Connect

    Hong-geller, Elizabeth; Valdez, Yolanda E; Shou, Yulin; Yoshida, Thomas M; Marrone, Babetta L; Dunbar, John

    2009-01-01

    Validated sample collection methods are needed for recovery of microbial evidence in the event of accidental or intentional release of biological agents into the environment. To address this need, we evaluated the sample recovery efficiencies of two collection methods -- swabs and wipes -- for both non-virulent and virulent strains of B. anthracis and Y. pestis from four types of non-porous surfaces: two hydrophilic surfaces, stainless steel and glass, and two hydrophobic surfaces, vinyl and plastic. Sample recovery was quantified using Real-time qPCR to assay for intact DNA signatures. We found no consistent difference in collection efficiency between swabs or wipes. Furthermore, collection efficiency was more surface-dependent for virulent strains than non-virulent strains. For the two non-virulent strains, B. anthracis Sterne and Y. pestis A1122, collection efficiency was approximately 100% and 1 %, respectively, from all four surfaces. In contrast, recovery of B. anthracis Ames spores and Y. pestis C092 from vinyl and plastic was generally lower compared to collection from glass or stainless steel, suggesting that surface hydrophobicity may playa role in the strength of pathogen adhesion. The surface-dependent collection efficiencies observed with the virulent strains may arise from strain-specific expression of capsular material or other cell surface receptors that alter cell adhesion to specific surfaces. These findings contribute to validation of standard bioforensics procedures and emphasize the importance of specific strain and surface interactions in pathogen detection.

  3. Microevolution and virulence of dengue viruses.

    PubMed

    Rico-Hesse, Rebeca

    2003-01-01

    The evolution of dengue viruses has had a major impact on their virulence for humans and on the epidemiology of dengue disease around the world. Although antigenic and genetic differences in virus strains had become evident, it is mainly due to the lack of animal models of disease that has made it difficult to detect differences in virulence of dengue viruses. However, phylogenetic studies of many different dengue virus samples have led to the association between specific genotypes (within serotypes) and the presentation of more or less severe disease. Currently, dengue viruses can be classified as being of epidemiologically low, medium, or high impact; i.e., some viruses may remain in sylvatic cycles of little or low transmissibility to humans, others produce dengue fever (DF) only, and some genotypes have been associated with the potential to cause the more severe dengue hemorrhagic fever (DHF) and dengue shock syndrome (DSS) in addition to DF. Although the factors that contribute to dengue virus epidemiology are complex, studies have suggested that specific viral structures may contribute to increased replication in human target cells and to increased transmission by the mosquito vector; however, the immune status and possibly the genetic background of the host are also determinants of virulence or disease presentation. As to the question of whether dengue viruses are evolving toward virulence as they continue to spread throughout the world, phylogenetic and epidemiological analyses suggest that the more virulent genotypes are now displacing those that have lower epidemiological impact; there is no evidence for the transmission of antigenically aberrant, new strains.

  4. GENCODE: the reference human genome annotation for The ENCODE Project.

    PubMed

    Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J

    2012-09-01

    The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

  5. GFam: a platform for automatic annotation of gene families

    PubMed Central

    Sasidharan, Rajkumar; Nepusz, Tamás; Swarbreck, David; Huala, Eva; Paccanaro, Alberto

    2012-01-01

    We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families, derive meaningful functional labels and offers a seamless approach to propagate functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources followed by a sequence-based connected component analysis of un-annotated sequence regions to derive consensus domain architecture for each sequence and subsequently generate families based on common architectures. Our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points higher than the coverage relative to the best single-constituent database within InterPro for the proteome of Arabidopsis. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam’s capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is a general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/. PMID:22790981

  6. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    PubMed

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  7. MITOS: improved de novo metazoan mitochondrial genome annotation.

    PubMed

    Bernt, Matthias; Donath, Alexander; Jühling, Frank; Externbrink, Fabian; Florentz, Catherine; Fritzsch, Guido; Pütz, Joern; Middendorf, Martin; Stadler, Peter F

    2013-11-01

    About 2000 completely sequenced mitochondrial genomes are available from the NCBI RefSeq data base together with manually curated annotations of their protein-coding genes, rRNAs, and tRNAs. This annotation information, which has accumulated over two decades, has been obtained with a diverse set of computational tools and annotation strategies. Despite all efforts of manual curation it is still plagued by misassignments of reading directions, erroneous gene names, and missing as well as false positive annotations in particular for the RNA genes. Taken together, this causes substantial problems for fully automatic pipelines that aim to use these data comprehensively for studies of animal phylogenetics and the molecular evolution of mitogenomes. The MITOS pipeline is designed to compute a consistent de novo annotation of the mitogenomic sequences. We show that the results of MITOS match RefSeq and MitoZoa in terms of annotation coverage and quality. At the same time we avoid biases, inconsistencies of nomenclature, and typos originating from manual curation strategies. The MITOS pipeline is accessible online at http://mitos.bioinf.uni-leipzig.de.

  8. MetaStorm: A Public Resource for Customizable Metagenomics Annotation

    PubMed Central

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S.; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  9. Automatic annotation of eukaryotic genes, pseudogenes and promoters

    PubMed Central

    Solovyev, Victor; Kosarev, Peter; Seledsov, Igor; Vorobyev, Denis

    2006-01-01

    Background The ENCODE gene prediction workshop (EGASP) has been organized to evaluate how well state-of-the-art automatic gene finding methods are able to reproduce the manual and experimental gene annotation of the human genome. We have used Softberry gene finding software to predict genes, pseudogenes and promoters in 44 selected ENCODE sequences representing approximately 1% (30 Mb) of the human genome. Predictions of gene finding programs were evaluated in terms of their ability to reproduce the ENCODE-HAVANA annotation. Results The Fgenesh++ gene prediction pipeline can identify 91% of coding nucleotides with a specificity of 90%. Our automatic pseudogene finder (PSF program) found 90% of the manually annotated pseudogenes and some new ones. The Fprom promoter prediction program identifies 80% of TATA promoters sequences with one false positive prediction per 2,000 base-pairs (bp) and 50% of TATA-less promoters with one false positive prediction per 650 bp. It can be used to identify transcription start sites upstream of annotated coding parts of genes found by gene prediction software. Conclusion We review our software and underlying methods for identifying these three important structural and functional genome components and discuss the accuracy of predictions, recent advances and open problems in annotating genomic sequences. We have demonstrated that our methods can be effectively used for initial automatic annotation of the eukaryotic genome. PMID:16925832

  10. A semi-automatic annotation tool for cooking video

    NASA Astrophysics Data System (ADS)

    Bianco, Simone; Ciocca, Gianluigi; Napoletano, Paolo; Schettini, Raimondo; Margherita, Roberto; Marini, Gianluca; Gianforme, Giorgio; Pantaleo, Giuseppe

    2013-03-01

    In order to create a cooking assistant application to guide the users in the preparation of the dishes relevant to their profile diets and food preferences, it is necessary to accurately annotate the video recipes, identifying and tracking the foods of the cook. These videos present particular annotation challenges such as frequent occlusions, food appearance changes, etc. Manually annotate the videos is a time-consuming, tedious and error-prone task. Fully automatic tools that integrate computer vision algorithms to extract and identify the elements of interest are not error free, and false positive and false negative detections need to be corrected in a post-processing stage. We present an interactive, semi-automatic tool for the annotation of cooking videos that integrates computer vision techniques under the supervision of the user. The annotation accuracy is increased with respect to completely automatic tools and the human effort is reduced with respect to completely manual ones. The performance and usability of the proposed tool are evaluated on the basis of the time and effort required to annotate the same video sequences.

  11. A survey on annotation tools for the biomedical literature.

    PubMed

    Neves, Mariana; Leser, Ulf

    2014-03-01

    New approaches to biomedical text mining crucially depend on the existence of comprehensive annotated corpora. Such corpora, commonly called gold standards, are important for learning patterns or models during the training phase, for evaluating and comparing the performance of algorithms and also for better understanding the information sought for by means of examples. Gold standards depend on human understanding and manual annotation of natural language text. This process is very time-consuming and expensive because it requires high intellectual effort from domain experts. Accordingly, the lack of gold standards is considered as one of the main bottlenecks for developing novel text mining methods. This situation led the development of tools that support humans in annotating texts. Such tools should be intuitive to use, should support a range of different input formats, should include visualization of annotated texts and should generate an easy-to-parse output format. Today, a range of tools which implement some of these functionalities are available. In this survey, we present a comprehensive survey of tools for supporting annotation of biomedical texts. Altogether, we considered almost 30 tools, 13 of which were selected for an in-depth comparison. The comparison was performed using predefined criteria and was accompanied by hands-on experiences whenever possible. Our survey shows that current tools can support many of the tasks in biomedical text annotation in a satisfying manner, but also that no tool can be considered as a true comprehensive solution.

  12. Beyond the Chromosome: The Prevalence of Unique Extra-Chromosomal Bacteriophages with Integrated Virulence Genes in Pathogenic Staphylococcus aureus

    PubMed Central

    Utter, Bryan; Deutsch, Douglas R.; Schuch, Raymond; Winer, Benjamin Y.; Verratti, Kathleen; Bishop-Lilly, Kim; Sozhamannan, Shanmuga; Fischetti, Vincent A.

    2014-01-01

    In Staphylococcus aureus, the disease impact of chromosomally integrated prophages on virulence is well described. However, the existence of extra-chromosomal prophages, both plasmidial and episomal, remains obscure. Despite the recent explosion in bacterial and bacteriophage genomic sequencing, studies have failed to specifically focus on extra-chromosomal elements. We selectively enriched and sequenced extra-chromosomal DNA from S. aureus isolates using Roche-454 technology and uncovered evidence for the widespread distribution of multiple extra-chromosomal prophages (ExPΦs) throughout both antibiotic-sensitive and -resistant strains. We completely sequenced one such element comprised of a 43.8 kbp, circular ExPΦ (designated ФBU01) from a vancomycin-intermediate S. aureus (VISA) strain. Assembly and annotation of ФBU01 revealed a number of putative virulence determinants encoded within a bacteriophage immune evasion cluster (IEC). Our identification of several potential ExPΦs and mobile genetic elements (MGEs) also revealed numerous putative virulence factors and antibiotic resistance genes. We describe here a previously unidentified level of genetic diversity of stealth extra-chromosomal elements in S. aureus, including phages with a larger presence outside the chromosome that likely play a prominent role in pathogenesis and strain diversity driven by horizontal gene transfer (HGT). PMID:24963913

  13. Beyond the chromosome: the prevalence of unique extra-chromosomal bacteriophages with integrated virulence genes in pathogenic Staphylococcus aureus.

    PubMed

    Utter, Bryan; Deutsch, Douglas R; Schuch, Raymond; Winer, Benjamin Y; Verratti, Kathleen; Bishop-Lilly, Kim; Sozhamannan, Shanmuga; Fischetti, Vincent A

    2014-01-01

    In Staphylococcus aureus, the disease impact of chromosomally integrated prophages on virulence is well described. However, the existence of extra-chromosomal prophages, both plasmidial and episomal, remains obscure. Despite the recent explosion in bacterial and bacteriophage genomic sequencing, studies have failed to specifically focus on extra-chromosomal elements. We selectively enriched and sequenced extra-chromosomal DNA from S. aureus isolates using Roche-454 technology and uncovered evidence for the widespread distribution of multiple extra-chromosomal prophages (ExPΦs) throughout both antibiotic-sensitive and -resistant strains. We completely sequenced one such element comprised of a 43.8 kbp, circular ExPΦ (designated ФBU01) from a vancomycin-intermediate S. aureus (VISA) strain. Assembly and annotation of ФBU01 revealed a number of putative virulence determinants encoded within a bacteriophage immune evasion cluster (IEC). Our identification of several potential ExPΦs and mobile genetic elements (MGEs) also revealed numerous putative virulence factors and antibiotic resistance genes. We describe here a previously unidentified level of genetic diversity of stealth extra-chromosomal elements in S. aureus, including phages with a larger presence outside the chromosome that likely play a prominent role in pathogenesis and strain diversity driven by horizontal gene transfer (HGT). PMID:24963913

  14. The Use of Annotations in Examination Marking: Opening a Window into Markers' Minds

    ERIC Educational Resources Information Center

    Crisp, Victoria; Johnson, Martin

    2007-01-01

    This study investigated the functions of annotations, the role of annotations in markers' decision-making processes, whether annotations conform to conventions, and whether these vary according to subject area. Across subjects a number of scripts were analysed to survey which annotations are subject specific and which are more general. Twelve…

  15. Gastrointestinal hormone research - with a Scandinavian annotation.

    PubMed

    Rehfeld, Jens F

    2015-06-01

    Gastrointestinal hormones are peptides released from neuroendocrine cells in the digestive tract. More than 30 hormone genes are currently known to be expressed in the gut, which makes it the largest hormone-producing organ in the body. Modern biology makes it feasible to conceive the hormones under five headings: The structural homology groups a majority of the hormones into nine families, each of which is assumed to originate from one ancestral gene. The individual hormone gene often has multiple phenotypes due to alternative splicing, tandem organization or differentiated posttranslational maturation of the prohormone. By a combination of these mechanisms, more than 100 different hormonally active peptides are released from the gut. Gut hormone genes are also widely expressed outside the gut, some only in extraintestinal endocrine cells and cerebral or peripheral neurons but others also in other cell types. The extraintestinal cells may release different bioactive fragments of the same prohormone due to cell-specific processing pathways. Moreover, endocrine cells, neurons, cancer cells and, for instance, spermatozoa secrete gut peptides in different ways, so the same peptide may act as a blood-borne hormone, a neurotransmitter, a local growth factor or a fertility factor. The targets of gastrointestinal hormones are specific G-protein-coupled receptors that are expressed in the cell membranes also outside the digestive tract. Thus, gut hormones not only regulate digestive functions, but also constitute regulatory systems operating in the whole organism. This overview of gut hormone biology is supplemented with an annotation on some Scandinavian contributions to gastrointestinal hormone research.

  16. Deep Question Answering for protein annotation.

    PubMed

    Gobeill, Julien; Gaudinat, Arnaud; Pasche, Emilie; Vishnyakova, Dina; Gaudet, Pascale; Bairoch, Amos; Ruch, Patrick

    2015-01-01

    Biomedical professionals have access to a huge amount of literature, but when they use a search engine, they often have to deal with too many documents to efficiently find the appropriate information in a reasonable time. In this perspective, question-answering (QA) engines are designed to display answers, which were automatically extracted from the retrieved documents. Standard QA engines in literature process a user question, then retrieve relevant documents and finally extract some possible answers out of these documents using various named-entity recognition processes. In our study, we try to answer complex genomics questions, which can be adequately answered only using Gene Ontology (GO) concepts. Such complex answers cannot be found using state-of-the-art dictionary- and redundancy-based QA engines. We compare the effectiveness of two dictionary-based classifiers for extracting correct GO answers from a large set of 100 retrieved abstracts per question. In the same way, we also investigate the power of GOCat, a GO supervised classifier. GOCat exploits the GOA database to propose GO concepts that were annotated by curators for similar abstracts. This approach is called deep QA, as it adds an original classification step, and exploits curated biological data to infer answers, which are not explicitly mentioned in the retrieved documents. We show that for complex answers such as protein functional descriptions, the redundancy phenomenon has a limited effect. Similarly usual dictionary-based approaches are relatively ineffective. In contrast, we demonstrate how existing curated data, beyond information extraction, can be exploited by a supervised classifier, such as GOCat, to massively improve both the quantity and the quality of the answers with a +100% improvement for both recall and precision. Database URL: http://eagl.unige.ch/DeepQA4PA/. PMID:26384372

  17. Deep Question Answering for protein annotation.

    PubMed

    Gobeill, Julien; Gaudinat, Arnaud; Pasche, Emilie; Vishnyakova, Dina; Gaudet, Pascale; Bairoch, Amos; Ruch, Patrick

    2015-01-01

    Biomedical professionals have access to a huge amount of literature, but when they use a search engine, they often have to deal with too many documents to efficiently find the appropriate information in a reasonable time. In this perspective, question-answering (QA) engines are designed to display answers, which were automatically extracted from the retrieved documents. Standard QA engines in literature process a user question, then retrieve relevant documents and finally extract some possible answers out of these documents using various named-entity recognition processes. In our study, we try to answer complex genomics questions, which can be adequately answered only using Gene Ontology (GO) concepts. Such complex answers cannot be found using state-of-the-art dictionary- and redundancy-based QA engines. We compare the effectiveness of two dictionary-based classifiers for extracting correct GO answers from a large set of 100 retrieved abstracts per question. In the same way, we also investigate the power of GOCat, a GO supervised classifier. GOCat exploits the GOA database to propose GO concepts that were annotated by curators for similar abstracts. This approach is called deep QA, as it adds an original classification step, and exploits curated biological data to infer answers, which are not explicitly mentioned in the retrieved documents. We show that for complex answers such as protein functional descriptions, the redundancy phenomenon has a limited effect. Similarly usual dictionary-based approaches are relatively ineffective. In contrast, we demonstrate how existing curated data, beyond information extraction, can be exploited by a supervised classifier, such as GOCat, to massively improve both the quantity and the quality of the answers with a +100% improvement for both recall and precision. Database URL: http://eagl.unige.ch/DeepQA4PA/.

  18. Candida Virulence Properties and Adverse Clinical Outcomes in Neonatal Candidiasis

    PubMed Central

    Bliss, Joseph M.; Wong, Angela Y.; Bhak, Grace; Laforce-Nesbitt, Sonia S.; Taylor, Sarah; Tan, Sylvia; Stoll, Barbara J.; Higgins, Rosemary D.; Shankaran, Seetha; Benjamin, Daniel K.

    2012-01-01

    Objective To determine if premature infants with invasive Candida infection caused by strains with increased virulence properties have worse clinical outcomes than those infected with less virulent strains. Study design Clinical isolates were studied from 2 populations; premature infants colonized with Candida (commensal, n=27), and those with invasive candidiasis (n=81). Individual isolates of C. albicans and C. parapsilosis were tested for virulence in each of 3 assays: phenotypic switching, adhesion, and cytotoxicity. Invasive isolates were considered to have enhanced virulence if they measured more than 1 SD above the mean for the commensal isolates in at least 1 assay. Outcomes of patients with invasive isolates with enhanced virulence were compared with those with invasive isolates lacking enhanced virulence characteristics. Results 61% of invasive isolates of C. albicans and 42% of invasive isolates of C. parapsilosis had enhanced virulence. All C. albicans cerebrospinal fluid (CSF) isolates (n=6) and 90% of urine isolates (n=10) had enhanced virulence, compared with 48% of blood isolates (n=40). Infants with more virulent isolates were younger at the time of positive culture and had higher serum creatinine. Conclusions Individual isolates of Candida species vary in their virulence properties. Strains with higher virulence are associated with certain clinical outcomes. PMID:22504098

  19. Enriching the annotation of Mycobacterium tuberculosis H37Rv proteome using remote homology detection approaches: insights into structure and function.

    PubMed

    Ramakrishnan, Gayatri; Ochoa-Montaño, Bernardo; Raghavender, Upadhyayula S; Mudgal, Richa; Joshi, Adwait G; Chandra, Nagasuma R; Sowdhamini, Ramanathan; Blundell, Tom L; Srinivasan, Narayanaswamy

    2015-01-01

    The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better

  20. Automatic annotation of protein function based on family identification.

    PubMed

    Abascal, Federico; Valencia, Alfonso

    2003-11-15

    Although genomes are being sequenced at an impressive rate, the information generated tells us little about protein function, which is slow to characterize by traditional methods. Automatic protein function annotation based on computational methods has alleviated this imbalance. The most powerful current approach for inferring the function of new proteins is by studying the annotations of their homologues, since their common origin is assumed to be reflected in their structure and function. Unfortunately, as proteins evolve they acquire new functions, so annotation based on homology must be carried out in the context of orthologues or subfamilies. Evolution adds new complications through domain shuffling: homology (or orthology) frequently corresponds to domains rather than complete proteins. Moreover, the function of a protein may be seen as the result of combining the functions of its domains. Additionally, automatic annotation has to deal with problems related to the annotations in the databases: errors (which are likely to be propagated), inconsistencies, or different degrees of function specification. We describe a method that addresses these difficulties for the annotation of protein function. Sequence relationships are detected and measured to obtain a map of the sequence space, which is searched for differentiated groups of proteins (similar to islands on the map), which are expected to have a common function and correspond to groups of orthologues or subfamilies. This mapmaking is done by applying a clustering algorithm based on Normalized cuts in graphs. The domain problem is addressed in a simple way: pairwise local alignments are analyzed to determine the extent to which they cover the entire sequence lengths of the two proteins. This analysis determines both what homologues are preferred for functional inheritance and the level of confidence of the annotation. To alleviate the problems associated with database annotations, the information on all the

  1. 'Ready made' virulence and 'dual use' virulence factors in pathogenic environmental fungi--the Cryptococcus neoformans paradigm.

    PubMed

    Casadevall, Arturo; Steenbergen, Judith N; Nosanchuk, Joshua D

    2003-08-01

    Environmental pathogenic fungi present a paradox in that they are virulent in animals without requiring animal hosts for replication or survival, a phenomenon we call 'ready-made' virulence. In the human pathogenic fungus Cryptococcus neoformans, the capacity for virulence in animals may originate from environmental selective pressures imposed by such organisms as amoeboid and nematode predators. Many C. neoformans virulence factors appear to have 'dual use' capabilities that confer survival advantages in both animal hosts and in the environment. The findings with C. neoformans may provide a paradigm for understanding the origin and maintenance of virulence in other pathogenic environmental fungi.

  2. Comprehensive comparative homeobox gene annotation in human and mouse

    PubMed Central

    Wilming, Laurens G.; Boychenko, Veronika; Harrow, Jennifer L.

    2015-01-01

    Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes, are known to have antisense or opposite strand long non-coding RNA (lncRNA) genes that play a regulatory role. Because automated annotation of both gene clusters and non-coding genes is fraught with difficulty (over-prediction, under-prediction, inaccurate transcript structures), we set out to manually annotate all homeobox genes in the mouse and human genomes. This includes all supported splice variants, pseudogenes and both antisense and flanking lncRNAs. One of the areas where manual annotation has a significant advantage is the annotation of duplicated gene clusters. After comprehensive annotation of all homeobox genes and their antisense genes in human and in mouse, we found some discrepancies with the current gene set in RefSeq regarding exact gene structures and coding versus pseudogene locus biotype. We also identified previously un-annotated pseudogenes in the DUX, Rhox and Obox gene clusters, which helped us re-evaluate and update the gene nomenclature in these regions. We found that human homeobox genes are enriched in antisense lncRNA loci, some of which are known to play a role in gene or gene cluster regulation, compared to their mouse orthologues. Of the annotated set of 241 human protein-coding homeobox genes, 98 have an antisense locus (41%) while of the 277 orthologous mouse genes, only 62 protein coding gene have an antisense locus (22%), based on publicly available transcriptional evidence. PMID:26412852

  3. Impact of HLA-driven HIV adaptation on virulence in populations of high HIV seroprevalence.

    PubMed

    Payne, Rebecca; Muenchhoff, Maximilian; Mann, Jaclyn; Roberts, Hannah E; Matthews, Philippa; Adland, Emily; Hempenstall, Allison; Huang, Kuan-Hsiang; Brockman, Mark; Brumme, Zabrina; Sinclair, Marc; Miura, Toshiyuki; Frater, John; Essex, Myron; Shapiro, Roger; Walker, Bruce D; Ndung'u, Thumbi; McLean, Angela R; Carlson, Jonathan M; Goulder, Philip J R

    2014-12-16

    It is widely believed that epidemics in new hosts diminish in virulence over time, with natural selection favoring pathogens that cause minimal disease. However, a tradeoff frequently exists between high virulence shortening host survival on the one hand but allowing faster transmission on the other. This is the case in HIV infection, where high viral loads increase transmission risk per coital act but reduce host longevity. We here investigate the impact on HIV virulence of HIV adaptation to HLA molecules that protect against disease progression, such as HLA-B*57 and HLA-B*58:01. We analyzed cohorts in Botswana and South Africa, two countries severely affected by the HIV epidemic. In Botswana, where the epidemic started earlier and adult seroprevalence has been higher, HIV adaptation to HLA including HLA-B*57/58:01 is greater compared with South Africa (P = 7 × 10(-82)), the protective effect of HLA-B*57/58:01 is absent (P = 0.0002), and population viral replicative capacity is lower (P = 0.03). These data suggest that viral evolution is occurring relatively rapidly, and that adaptation of HIV to the most protective HLA alleles may contribute to a lowering of viral replication capacity at the population level, and a consequent reduction in HIV virulence over time. The potential role in this process played by increasing antiretroviral therapy (ART) access is also explored. Models developed here suggest distinct benefits of ART, in addition to reducing HIV disease and transmission, in driving declines in HIV virulence over the course of the epidemic, thereby accelerating the effects of HLA-mediated viral adaptation. PMID:25453107

  4. Impact of HLA-driven HIV adaptation on virulence in populations of high HIV seroprevalence

    PubMed Central

    Payne, Rebecca; Muenchhoff, Maximilian; Mann, Jaclyn; Roberts, Hannah E.; Matthews, Philippa; Adland, Emily; Hempenstall, Allison; Huang, Kuan-Hsiang; Brockman, Mark; Brumme, Zabrina; Sinclair, Marc; Miura, Toshiyuki; Frater, John; Essex, Myron; Shapiro, Roger; Walker, Bruce D.; Ndung’u, Thumbi; McLean, Angela R.; Carlson, Jonathan M.; Goulder, Philip J. R.

    2014-01-01

    It is widely believed that epidemics in new hosts diminish in virulence over time, with natural selection favoring pathogens that cause minimal disease. However, a tradeoff frequently exists between high virulence shortening host survival on the one hand but allowing faster transmission on the other. This is the case in HIV infection, where high viral loads increase transmission risk per coital act but reduce host longevity. We here investigate the impact on HIV virulence of HIV adaptation to HLA molecules that protect against disease progression, such as HLA-B*57 and HLA-B*58:01. We analyzed cohorts in Botswana and South Africa, two countries severely affected by the HIV epidemic. In Botswana, where the epidemic started earlier and adult seroprevalence has been higher, HIV adaptation to HLA including HLA-B*57/58:01 is greater compared with South Africa (P = 7 × 10−82), the protective effect of HLA-B*57/58:01 is absent (P = 0.0002), and population viral replicative capacity is lower (P = 0.03). These data suggest that viral evolution is occurring relatively rapidly, and that adaptation of HIV to the most protective HLA alleles may contribute to a lowering of viral replication capacity at the population level, and a consequent reduction in HIV virulence over time. The potential role in this process played by increasing antiretroviral therapy (ART) access is also explored. Models developed here suggest distinct benefits of ART, in addition to reducing HIV disease and transmission, in driving declines in HIV virulence over the course of the epidemic, thereby accelerating the effects of HLA-mediated viral adaptation. PMID:25453107

  5. Impact of HLA-driven HIV adaptation on virulence in populations of high HIV seroprevalence.

    PubMed

    Payne, Rebecca; Muenchhoff, Maximilian; Mann, Jaclyn; Roberts, Hannah E; Matthews, Philippa; Adland, Emily; Hempenstall, Allison; Huang, Kuan-Hsiang; Brockman, Mark; Brumme, Zabrina; Sinclair, Marc; Miura, Toshiyuki; Frater, John; Essex, Myron; Shapiro, Roger; Walker, Bruce D; Ndung'u, Thumbi; McLean, Angela R; Carlson, Jonathan M; Goulder, Philip J R

    2014-12-16

    It is widely believed that epidemics in new hosts diminish in virulence over time, with natural selection favoring pathogens that cause minimal disease. However, a tradeoff frequently exists between high virulence shortening host survival on the one hand but allowing faster transmission on the other. This is the case in HIV infection, where high viral loads increase transmission risk per coital act but reduce host longevity. We here investigate the impact on HIV virulence of HIV adaptation to HLA molecules that protect against disease progression, such as HLA-B*57 and HLA-B*58:01. We analyzed cohorts in Botswana and South Africa, two countries severely affected by the HIV epidemic. In Botswana, where the epidemic started earlier and adult seroprevalence has been higher, HIV adaptation to HLA including HLA-B*57/58:01 is greater compared with South Africa (P = 7 × 10(-82)), the protective effect of HLA-B*57/58:01 is absent (P = 0.0002), and population viral replicative capacity is lower (P = 0.03). These data suggest that viral evolution is occurring relatively rapidly, and that adaptation of HIV to the most protective HLA alleles may contribute to a lowering of viral replication capacity at the population level, and a consequent reduction in HIV virulence over time. The potential role in this process played by increasing antiretroviral therapy (ART) access is also explored. Models developed here suggest distinct benefits of ART, in addition to reducing HIV disease and transmission, in driving declines in HIV virulence over the course of the epidemic, thereby accelerating the effects of HLA-mediated viral adaptation.

  6. Copper tolerance and virulence in bacteria.

    PubMed

    Ladomersky, Erik; Petris, Michael J

    2015-06-01

    Copper (Cu) is an essential trace element for all aerobic organisms. It functions as a cofactor in enzymes that catalyze a wide variety of redox reactions due to its ability to cycle between two oxidation states, Cu(I) and Cu(II). This same redox property of copper has the potential to cause toxicity if copper homeostasis is not maintained. Studies suggest that the toxic properties of copper are harnessed by the innate immune system of the host to kill bacteria. To counter such defenses, bacteria rely on copper tolerance genes for virulence within the host. These discoveries suggest bacterial copper intoxication is a component of host nutritional immunity, thus expanding our knowledge of the roles of copper in biology. This review summarizes our current understanding of copper tolerance in bacteria, and the extent to which these pathways contribute to bacterial virulence within the host.

  7. Regulation of virulence of Entamoeba histolytica.

    PubMed

    Marie, Chelsea; Petri, William A

    2014-01-01

    Entamoeba histolytica is the third-leading cause of parasitic mortality globally. E. histolytica infection generally does not cause symptoms, but the parasite has potent pathogenic potential. The origins, benefits, and triggers of amoebic virulence are complex. Amoebic pathogenesis entails depletion of the host mucosal barrier, adherence to the colonic lumen, cytotoxicity, and invasion of the colonic epithelium. Parasite damage results in colitis and, in some cases, disseminated disease. Both host and parasite genotypes influence the development of disease, as do the regulatory responses they govern at the host-pathogen interface. Host environmental factors determine parasite transmission and shape the colonic microenvironment E. histolytica infects. Here we highlight research that illuminates novel links between host, parasite, and environmental factors in the regulation of E. histolytica virulence.

  8. Copper tolerance and virulence in bacteria

    PubMed Central

    Ladomersky, Erik; Petris, Michael J.

    2015-01-01

    Copper (Cu) is an essential trace element for all aerobic organisms. It functions as a cofactor in enzymes that catalyze a wide variety of redox reactions due to its ability to cycle between two oxidation states, Cu(I) and Cu(II). This same redox property of copper has the potential to cause toxicity if copper homeostasis is not maintained. Studies suggest that the toxic properties of copper are harnessed by the innate immune system of the host to kill bacteria. To counter such defenses, bacteria rely on copper tolerance genes for virulence within the host. These discoveries suggest bacterial copper intoxication is a component of host nutritional immunity, thus expanding our knowledge of the roles of copper in biology. This review summarizes our current understanding of copper tolerance in bacteria, and the extent to which these pathways contribute to bacterial virulence within the host. PMID:25652326

  9. [Genetic virulence markers of opportunistic bacteria].

    PubMed

    Bondarenko, V M

    2011-01-01

    The analysis of opportunistic bacteria phenotypic and genetic virulence markers indicates that pathogenicity formation is based on a structural modification of bacterial DNA which is linked with migration of interbacterial pathogenicity "islands" genetic determinants. Structural organization features of these mobile genetic elements determine high expression probability, and PCR detection of pathogenicity "islands" determinants that control adhesins, invasins, cytotoxic and cytolitic toxines synthesis may indicate etiopathogenetic significance of clinical isolates.

  10. Hantavirus interferon regulation and virulence determinants.

    PubMed

    Mackow, Erich R; Dalrymple, Nadine A; Cimica, Velasco; Matthys, Valery; Gorbunova, Elena; Gavrilovskaya, Irina

    2014-07-17

    Hantaviruses predominantly replicate in primary human endothelial cells and cause 2 diseases characterized by altered barrier functions of vascular endothelium. Most hantaviruses restrict the early induction of interferon-β (IFNβ) and interferon stimulated genes (ISGs) within human endothelial cells to permit their successful replication. PHV fails to regulate IFN induction within human endothelial cells which self-limits PHV replication and its potential as a human pathogen. These findings, and the altered regulation of endothelial cell barrier functions by pathogenic hantaviruses, suggest that virulence is determined by the ability of hantaviruses to alter key signaling pathways within human endothelial cells. Our findings indicate that the Gn protein from ANDV, but not PHV, inhibits TBK1 directed ISRE, kB and IFNβ induction through virulence determinants in the Gn cytoplasmic tail (GnT) that inhibit TBK1 directed IRF3 phosphorylation. Further studies indicate that in response to hypoxia induced VEGF, ANDV infection enhances the permeability and adherens junction internalization of microvascular and lymphatic endothelial cells. These hypoxia/VEGF directed responses are rapamycin sensitive and directed by mTOR signaling pathways. These results demonstrate the presence of at least two hantavirus virulence determinants that act on endothelial cell signaling pathways: one that regulates antiviral IFN signaling responses, and a second that enhances normal hypoxia-VEGF-mTOR signaling pathways to facilitate endothelial cell permeability. These findings suggest signaling pathways as potential targets for therapeutic regulation of vascular deficits that contribute to hantavirus diseases and viral protein targets for attenuating pathogenic hantaviruses.

  11. Tracking bacterial virulence: global modulators as indicators

    PubMed Central

    Prieto, Alejandro; Urcola, Imanol; Blanco, Jorge; Dahbi, Ghizlane; Muniesa, Maite; Quirós, Pablo; Falgenhauer, Linda; Chakraborty, Trinad; Hüttener, Mário; Juárez, Antonio

    2016-01-01

    The genomes of Gram-negative bacteria encode paralogues and/or orthologues of global modulators. The nucleoid-associated H-NS and Hha proteins are an example: several enterobacteria such as Escherichia coli or Salmonella harbor H-NS, Hha and their corresponding paralogues, StpA and YdgT proteins, respectively. Remarkably, the genome of the pathogenic enteroaggregative E. coli strain 042 encodes, in addition to the hha and ydgT genes, two additional hha paralogues, hha2 and hha3. We show in this report that there exists a strong correlation between the presence of these paralogues and the virulence phenotype of several E. coli strains. hha2 and hha3 predominate in some groups of intestinal pathogenic E. coli strains (enteroaggregative and shiga toxin-producing isolates), as well as in the widely distributed extraintestinal ST131 isolates. Because of the relationship between the presence of hha2/hha3 and some virulence factors, we have been able to provide evidence for Hha2/Hha3 modulating the expression of the antigen 43 pathogenic determinants. We show that tracking global modulators or their paralogues/orthologues can be a new strategy to identify bacterial pathogenic clones and propose PCR amplification of hha2 and hha3 as a virulence indicator in environmental and clinical E. coli isolates. PMID:27169404

  12. Differing virulence of Aphanomyces astaci isolates and elevated resistance of noble crayfish Astacus astacus against crayfish plague.

    PubMed

    Makkonen, J; Jussila, J; Kortet, R; Vainikka, A; Kokko, H

    2012-12-27

    Crayfish plague epidemics (caused by Aphanomyces astaci) have been causing population collapses among native European crayfish stocks since the late 1800s. Recent indirect and direct evidence has shown that its virulence has been variable, with native European crayfish even acting as carriers. We tested the differences in A. astaci virulence under experimental conditions using both PsI- and As-genotypes with 3 Finnish noble crayfish Astacus astacus populations. We infected crayfish with adjusted quantities of A. astaci zoospores and monitored the symptoms and mortality of the crayfish. The PsI-genotype isolate caused rapid and total mortality among the tested populations, while the As-genotype isolates expressed more variable virulence. In some cases, mortality among the As-genotype-infected crayfish did not exceed the mortality level of the control group. All of the tested noble crayfish stocks showed lower mortality towards the As-genotype of A. astaci isolated from the River Kemijoki epidemic. We conclude that there are clear differences in virulence between different A. astaci genotypes and also differences in virulence within As-genotypes. Furthermore, we observed clear signs of increased resistance in different populations of noble crayfish towards some of the tested strains belonging to the As-genotype of A. astaci.

  13. Deciphering the role of coumarin as a novel quorum sensing inhibitor suppressing virulence phenotypes in bacterial pathogens.

    PubMed

    Gutiérrez-Barranquero, José A; Reen, F Jerry; McCarthy, Ronan R; O'Gara, Fergal

    2015-04-01

    The rapid unchecked rise in antibiotic resistance over the last few decades has led to an increased focus on the need for alternative therapeutic strategies for the treatment and clinical management of microbial infections. In particular, small molecules that can suppress microbial virulence systems independent of any impact on growth are receiving increased attention. Quorum sensing (QS) is a cell-to-cell signalling communication system that controls the virulence behaviour of a broad spectrum of bacterial pathogens. QS systems have been proposed as an effective target, particularly as they control biofilm formation in pathogens, a key driver of antibiotic ineffectiveness. In this study, we identified coumarin, a natural plant phenolic compound, as a novel QS inhibitor, with potent anti-virulence activity in a broad spectrum of pathogens. Using a range of biosensor systems, coumarin was active against short, medium and long chain N-acyl-homoserine lactones, independent of any effect on growth. To determine if this suppression was linked to anti-virulence activity, key virulence systems were studied in the nosocomial pathogen Pseudomonas aeruginosa. Consistent with suppression of QS, coumarin inhibited biofilm, the production of phenazines and swarming motility in this organism potentially linked to reduced expression of the rhlI and pqsA quorum sensing genes. Furthermore, coumarin significantly inhibited biofilm formation and protease activity in other bacterial pathogens and inhibited bioluminescence in Aliivibrio fischeri. In light of these findings, coumarin would appear to have potential as a novel quorum sensing inhibitor with a broad spectrum of action.

  14. Development of a DNA Microarray for Enterococcal Species, Virulence, and Antibiotic Resistance Gene Determinations among Isolates from Poultry▿

    PubMed Central

    Champagne, J.; Diarra, M. S.; Rempel, H.; Topp, E.; Greer, C. W.; Harel, J.; Masson, L.

    2011-01-01

    A DNA microarray (Enteroarray) was designed with probes targeting four species-specific taxonomic identifiers to discriminate among 18 different enterococcal species, while other probes were designed to identify 18 virulence factors and 174 antibiotic resistance genes. In total, 262 genes were utilized for rapid species identification of enterococcal isolates, while characterizing their virulence potential through the simultaneous identification of endogenous antibiotic resistance and virulence genes. Enterococcal isolates from broiler chicken farms were initially identified by using the API 20 Strep system, and the results were compared to those obtained with the taxonomic genes atpA, recA, pheS, and ddl represented on our microarray. Among the 171 isolates studied, five different enterococcal species were identified by using the API 20 Strep system: Enterococcus faecium, E. faecalis, E. durans, E. gallinarum, and E. avium. The Enteroarray detected the same species as API 20 Strep, as well as two more: E. casseliflavus and E. hirae. Species comparisons resulted in 15% (27 isolates) disagreement between the two methods among the five API 20 Strep identifiable species and 24% (42 isolates) disagreement when considering the seven Enteroarray identified species. The species specificity of key antibiotic and virulence genes identified by the Enteroarray were consistent with the literature adding further robustness to the redundant taxonomic probe data. Sequencing of the cpn60 gene further confirmed the complete accuracy of the microarray results. The new Enteroarray should prove to be a useful tool to accurately genotype strains of enterococci and assess their virulence potential. PMID:21335389

  15. Integrative annotation of chromatin elements from ENCODE data

    PubMed Central

    Hoffman, Michael M.; Ernst, Jason; Wilder, Steven P.; Kundaje, Anshul; Harris, Robert S.; Libbrecht, Max; Giardine, Belinda; Ellenbogen, Paul M.; Bilmes, Jeffrey A.; Birney, Ewan; Hardison, Ross C.; Dunham, Ian; Kellis, Manolis; Noble, William Stafford

    2013-01-01

    The ENCODE Project has generated a wealth of experimental information mapping diverse chromatin properties in several human cell lines. Although each such data track is independently informative toward the annotation of regulatory elements, their interrelations contain much richer information for the systematic annotation of regulatory elements. To uncover these interrelations and to generate an interpretable summary of the massive datasets of the ENCODE Project, we apply unsupervised learning methodologies, converting dozens of chromatin datasets into discrete annotation maps of regulatory regions and other chromatin elements across the human genome. These methods rediscover and summarize diverse aspects of chromatin architecture, elucidate the interplay between chromatin activity and RNA transcription, and reveal that a large proportion of the genome lies in a quiescent state, even across multiple cell types. The resulting annotation of non-coding regulatory elements correlate strongly with mammalian evolutionary constraint, and provide an unbiased approach for evaluating metrics of evolutionary constraint in human. Lastly, we use the regulatory annotations to revisit previously uncharacterized disease-associated loci, resulting in focused, testable hypotheses through the lens of the chromatin landscape. PMID:23221638

  16. Annotations of Mexican bullfighting videos for semantic index

    NASA Astrophysics Data System (ADS)

    Montoya Obeso, Abraham; Oropesa Morales, Lester Arturo; Fernando Vázquez, Luis; Cocolán Almeda, Sara Ivonne; Stoian, Andrei; García Vázquez, Mireya Saraí; Zamudio Fuentes, Luis Miguel; Montiel Perez, Jesús Yalja; de la O Torres, Saul; Ramírez Acosta, Alejandro Alvaro

    2015-09-01

    The video annotation is important for web indexing and browsing systems. Indeed, in order to evaluate the performance of video query and mining techniques, databases with concept annotations are required. Therefore, it is necessary generate a database with a semantic indexing that represents the digital content of the Mexican bullfighting atmosphere. This paper proposes a scheme to make complex annotations in a video in the frame of multimedia search engine project. Each video is partitioned using our segmentation algorithm that creates shots of different length and different number of frames. In order to make complex annotations about the video, we use ELAN software. The annotations are done in two steps: First, we take note about the whole content in each shot. Second, we describe the actions as parameters of the camera like direction, position and deepness. As a consequence, we obtain a more complete descriptor of every action. In both cases we use the concepts of the TRECVid 2014 dataset. We also propose new concepts. This methodology allows to generate a database with the necessary information to create descriptors and algorithms capable to detect actions to automatically index and classify new bullfighting multimedia content.

  17. Automated semantic annotation of rare disease cases: a case study

    PubMed Central

    Taboada, Maria; Rodríguez, Hadriana; Martínez, Diego; Pardo, María; Sobrido, María Jesús

    2014-01-01

    Motivation: As the number of clinical reports in the peer-reviewed medical literature keeps growing, there is an increasing need for online search tools to find and analyze publications on patients with similar clinical characteristics. This problem is especially critical and challenging for rare diseases, where publications of large series are scarce. Through an applied example, we illustrate how to automatically identify new relevant cases and semantically annotate the relevant literature about patient case reports to capture the phenotype of a rare disease named cerebrotendinous xanthomatosis. Results: Our results confirm that it is possible to automatically identify new relevant case reports with a high precision and to annotate them with a satisfactory quality (74% F-measure). Automated annotation with an emphasis to entirely describe all phenotypic abnormalities found in a disease may facilitate curation efforts by supplying phenotype retrieval and assessment of their frequency. Availability and Supplementary information: http://www.usc.es/keam/Phenotype Annotation/. Database URL: http://www.usc.es/keam/PhenotypeAnnotation/ PMID:24903515

  18. Variation Ontology for annotation of variation effects and mechanisms

    PubMed Central

    Vihinen, Mauno

    2014-01-01

    Ontology organizes and formally conceptualizes information in a knowledge domain with a controlled vocabulary having defined terms and relationships between them. Several ontologies have been used to annotate numerous databases in biology and medicine. Due to their unambiguous nature, ontological annotations facilitate systematic description and data organization, data integration and mining, and pattern recognition and statistics, as well as development of analysis and prediction tools. The Variation Ontology (VariO) was developed to allow the annotation of effects, consequences, and mechanisms of DNA, RNA, and protein variations. Variation types are systematically organized, and a detailed description of effects and mechanisms is possible. VariO is for annotating the variant, not the normal-state features or properties, and requires a reference (e.g., reference sequence, reference-state property, activity, etc.) compared to which the changes are indicated. VariO is versatile and can be used for variations ranging from genomic multiplications to single nucleotide or amino acid changes, whether of genetic or nongenetic origin. VariO annotations are position-specific and can be used for variations in any organism. PMID:24162187

  19. WEGO: a web tool for plotting GO annotations.

    PubMed

    Ye, Jia; Fang, Lin; Zheng, Hongkun; Zhang, Yong; Chen, Jie; Zhang, Zengjin; Wang, Jing; Li, Shengting; Li, Ruiqiang; Bolund, Lars; Wang, Jun

    2006-07-01

    Unified, structured vocabularies and classifications freely provided by the Gene Ontology (GO) Consortium are widely accepted in most of the large scale gene annotation projects. Consequently, many tools have been created for use with the GO ontologies. WEGO (Web Gene Ontology Annotation Plot) is a simple but useful tool for visualizing, comparing and plotting GO annotation results. Different from other commercial software for creating chart, WEGO is designed to deal with the directed acyclic graph structure of GO to facilitate histogram creation of GO annotation results. WEGO has been used widely in many important biological research projects, such as the rice genome project and the silkworm genome project. It has become one of the daily tools for downstream gene annotation analysis, especially when performing comparative genomics tasks. WEGO, along with the two other tools, namely External to GO Query and GO Archive Query, are freely available for all users at http://wego.genomics.org.cn. There are two available mirror sites at http://wego2.genomics.org.cn and http://wego.genomics.com.cn. Any suggestions are welcome at wego@genomics.org.cn. PMID:16845012

  20. Statistical algorithms for ontology-based annotation of scientific literature

    PubMed Central

    2014-01-01

    Background Ontologies encode relationships within a domain in robust data structures that can be used to annotate data objects, including scientific papers, in ways that ease tasks such as search and meta-analysis. However, the annotation process requires significant time and effort when performed by humans. Text mining algorithms can facilitate this process, but they render an analysis mainly based upon keyword, synonym and semantic matching. They do not leverage information embedded in an ontology's structure. Methods We present a probabilistic framework that facilitates the automatic annotation of literature by indirectly modeling the restrictions among the different classes in the ontology. Our research focuses on annotating human functional neuroimaging literature within the Cognitive Paradigm Ontology (CogPO). We use an approach that combines the stochastic simplicity of naïve Bayes with the formal transparency of decision trees. Our data structure is easily modifiable to reflect changing domain knowledge. Results We compare our results across naïve Bayes, Bayesian Decision Trees, and Constrained Decision Tree classifiers that keep a human expert in the loop, in terms of the quality measure of the F1-mirco score. Conclusions Unlike traditional text mining algorithms, our framework can model the knowledge encoded by the dependencies in an ontology, albeit indirectly. We successfully exploit the fact that CogPO has explicitly stated restrictions, and implicit dependencies in the form of patterns in the expert curated annotations. PMID:25093071